Robust methods to detect disease-genotype association in genetic association studies: calculate p-values using exact conditional enumeration instead of simulated permutations or asymptotic approximations
From MaRDI portal
(Redirected from Publication:482819)
Robust methods to detect disease-genotype association in genetic association studies: calculate \(p\)-values using exact conditional enumeration instead of simulated permutations or asymptotic approximations
Robust methods to detect disease-genotype association in genetic association studies: calculate \(p\)-values using exact conditional enumeration instead of simulated permutations or asymptotic approximations
Abstract: In genetic association studies, detecting disease-genotype associations is a primary goal. For most diseases, the underlying genetic model is unknown, and we study seven robust test statistics for monotone association. For a given test statistic, there are many ways to calculate a p-value, but in genetic association studies, calculations have predominantly been based on asymptotic approximations or on simulated permutations. We show that when the number of permutations tends to infinity, the permutation p-value approaches the exact conditional enumeration p-value, and further that calculating the latter p-value is much more efficient than performing simulated permutations. We then answer two research questions. (i) Which of the test statistics under study are the most powerful for monotone genetic models? (ii) Based on test size, power, and computational considerations, should asymptotic approximations or exact conditional enumeration be used for calculating p-values? We have studied case-control sample sizes with 500-5000 cases and 500-15000 controls, and significance levels from 5e-8 to 0.05, thus our results are applicable to genetic association studies with only one genetic marker under study, intermediate follow-up studies, and genome wide association studies. We find that if all monotone genetic models are of interest, the best performance is achieved for a test statistics based on the maximum over a range of Cochrane-Armitage trend tests with different scores and for a constrained likelihood ratio test. For significance levels below 0.05, asymptotic approximations may give a test size up to 20 times the nominal level, and should therefore be used with caution. Further, calculating p-values based on exact conditional enumeration is a powerful, valid and computationally feasible approach, and we advocate its use in genetic association studies.
Recommendations
- Accurate and Efficient P-value Calculation Via Gaussian Approximation: A Novel Monte-Carlo Method
- Fast and accurate approximation to significance tests in genome-wide association studies
- Evaluating the accuracy of small \(p\)-values in genetic association studies using Edgeworth expansions
- Permutation \(p\)-values should never be zero: calculating exact \(p\)-values when permutations are randomly drawn
- How to analyze many contingency tables simultaneously in genetic association studies
Cites work
- scientific article; zbMATH DE number 46578 (Why is no real title available?)
- scientific article; zbMATH DE number 838305 (Why is no real title available?)
- scientific article; zbMATH DE number 6193518 (Why is no real title available?)
- scientific article; zbMATH DE number 3211340 (Why is no real title available?)
- A Network Algorithm for Performing Fisher's Exact Test in r × c Contingency Tables
- A Note on Comparing the Power of Test Statistics at Low Significance Levels
- A cautionary note on exact unconditional inference for a difference between two independent binomial proportions
- A robust genome-wide scan statistic of the Wellcome Trust Case-Control Consortium
- Choice of scores in trend tests for case-control studies of candidate-gene associations
- From Genotypes to Genes: Doubling the Sample Size
- Genomic Control for Association Studies
- On the Robustness of Combined Tests for Trends in Proportions
- Permutation \(p\)-values should never be zero: calculating exact \(p\)-values when permutations are randomly drawn
- Some Methods for Strengthening the Common χ 2 Tests
- Some Surprising Results about Covariate Adjustment in Logistic Regression Models
- The Fisher, Neyman-Pearson Theories of Testing Hypotheses: One Theory or Two?
- The Use of Maximin Efficiency Robust Tests in Combining Contingency Tables and Survival Analysis
Cited in
(3)
This page was built for publication: Robust methods to detect disease-genotype association in genetic association studies: calculate \(p\)-values using exact conditional enumeration instead of simulated permutations or asymptotic approximations
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q482819)