92
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Robust Methods for Disease-Genotype Association in Genetic Association Studies: Calculate P-values Using Exact Conditional Enumeration instead of Asymptotic Approximations

      Preprint
      ,

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          In genetic association studies, detecting disease-genotype associations is a primary goal. For most diseases, the underlying genetic model is unknown, and we study seven robust test statistics for monotone association. For a given test statistic, there are many ways to calculate a p-value, but in genetic association studies, calculations have predominantly been based on asymptotic approximations or on simulated permutations. We show that when the number of permutations tends to infinity, the permutation p-value approaches the exact conditional enumeration p-value, and further that calculating the latter p-value is much more efficient than performing simulated permutations. We then answer two research questions. (i) Which of the test statistics under study are the most powerful for monotone genetic models? (ii) Based on test size, power, and computational considerations, should asymptotic approximations or exact conditional enumeration be used for calculating p-values? We have studied case-control sample sizes with 500-5000 cases and 500-15000 controls, and significance levels from 5e-8 to 0.05, thus our results are applicable to genetic association studies with only one genetic marker under study, intermediate follow-up studies, and genome wide association studies. We find that if all monotone genetic models are of interest, the best performance is achieved for a test statistics based on the maximum over a range of Cochrane-Armitage trend tests with different scores and for a constrained likelihood ratio test. For significance levels below 0.05, asymptotic approximations may give a test size up to 20 times the nominal level, and should therefore be used with caution. Further, calculating p-values based on exact conditional enumeration is a powerful, valid and computationally feasible approach, and we advocate its use in genetic association studies.

          Related collections

          Most cited references18

          • Record: found
          • Abstract: found
          • Article: not found

          A genome-wide association study identifies novel risk loci for type 2 diabetes.

          Type 2 diabetes mellitus results from the interaction of environmental factors with a combination of genetic variants, most of which were hitherto unknown. A systematic search for these variants was recently made possible by the development of high-density arrays that permit the genotyping of hundreds of thousands of polymorphisms. We tested 392,935 single-nucleotide polymorphisms in a French case-control cohort. Markers with the most significant difference in genotype frequencies between cases of type 2 diabetes and controls were fast-tracked for testing in a second cohort. This identified four loci containing variants that confer type 2 diabetes risk, in addition to confirming the known association with the TCF7L2 gene. These loci include a non-synonymous polymorphism in the zinc transporter SLC30A8, which is expressed exclusively in insulin-producing beta-cells, and two linkage disequilibrium blocks that contain genes potentially involved in beta-cell development or function (IDE-KIF11-HHEX and EXT2-ALX4). These associations explain a substantial portion of disease risk and constitute proof of principle for the genome-wide approach to the elucidation of complex genetic traits.
            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Tests for Linear Trends in Proportions and Frequencies

              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              From genotypes to genes: doubling the sample size.

              This paper considers the analysis of genetic case-control data. One approach considers the allele frequency in cases and controls. Because each individual has two alleles at any autosomal locus, there will be twice as many alleles as people. Another approach considers the risk of the disease in those who do not have the allele of interest (A), those who have a single copy (heterozygous), and those who are homozygous for A. A third approach does not differentiate between individuals with one or two copies of A. This was common when alleles were determined serologically and one could not distinguish between homozygotes and those with one copy of A and one of an unknown allele. All three approaches have been used in the literature, but this is the first systematic comparison of them. The different interpretations of the odds ratios from such analyses are explored and conditions are given under which the first two approaches are asymptotically equivalent. The chi-squared statistics from the three approaches are discussed. Both the odds ratio and the chi-squared statistic from the analysis that treats alleles rather than genotypes as individual entities are appropriate only when the Hardy-Weinberg equilibrium holds. When the equilibrium holds, the allele-based test statistic is asymptotically equivalent to the test for trend using the genotype data. Thus, analyses that treat alleles rather than people as observations should not be used. Instead, we recommend that such data should be analyzed by genotype.
                Bookmark

                Author and article information

                Journal
                2013-07-29
                Article
                10.1515/sagmb-2013-0084
                1307.7536
                5ef64803-f731-435e-9a48-03f8ee73099b

                http://arxiv.org/licenses/nonexclusive-distrib/1.0/

                History
                Custom metadata
                stat.ME stat.AP

                Applications,Methodology
                Applications, Methodology

                Comments

                Comment on this article