3
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Comparing the Utility of Mitochondrial and Nuclear DNA to Adjust for Genetic Ancestry in Association Studies

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Mitochondrial genome-wide association studies identify mitochondrial single nucleotide polymorphisms (mtSNPs) that associate with disease or disease-related phenotypes. Most mitochondrial and nuclear genome-wide association studies adjust for genetic ancestry by including principal components derived from nuclear DNA, but not from mitochondrial DNA, as covariates in statistical regression analyses. Furthermore, there is no standard when controlling for genetic ancestry during mitochondrial and nuclear genetic interaction association scans, especially across ethnicities with substantial mitochondrial genetic heterogeneity. The purpose of this study is to (1) compare the degree of ethnic variation captured by principal components calculated from microarray-defined nuclear and mitochondrial DNA and (2) assess the utility of mitochondrial principal components for association studies. Analytic techniques used in this study include a principal component analysis for genetic ancestry, decision-tree classification for self-reported ethnicity, and linear regression for association tests. Data from the Health and Retirement Study, which includes self-reported White, Black, and Hispanic Americans, was used for all analyses. We report that (1) mitochondrial principal component analysis (PCA) captures ethnic variation to a similar or slightly greater degree than nuclear PCA in Blacks and Hispanics, (2) nuclear and mitochondrial DNA classify self-reported ethnicity to a high degree but with a similar level of error, and 3) mitochondrial principal components can be used as covariates to adjust for population stratification in association studies with complex traits, as demonstrated by our analysis of height—a phenotype with a high heritability. Overall, genetic association studies might reveal true and robust mtSNP associations when including mitochondrial principal components as regression covariates.

          Related collections

          Most cited references20

          • Record: found
          • Abstract: found
          • Article: not found

          The mitochondrial-derived peptide MOTS-c promotes metabolic homeostasis and reduces obesity and insulin resistance.

          Mitochondria are known to be functional organelles, but their role as a signaling unit is increasingly being appreciated. The identification of a short open reading frame (sORF) in the mitochondrial DNA (mtDNA) that encodes a signaling peptide, humanin, suggests the possible existence of additional sORFs in the mtDNA. Here we report a sORF within the mitochondrial 12S rRNA encoding a 16-amino-acid peptide named MOTS-c (mitochondrial open reading frame of the 12S rRNA-c) that regulates insulin sensitivity and metabolic homeostasis. Its primary target organ appears to be the skeletal muscle, and its cellular actions inhibit the folate cycle and its tethered de novo purine biosynthesis, leading to AMPK activation. MOTS-c treatment in mice prevented age-dependent and high-fat-diet-induced insulin resistance, as well as diet-induced obesity. These results suggest that mitochondria may actively regulate metabolic homeostasis at the cellular and organismal level via peptides encoded within their genome.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Quality control and quality assurance in genotypic data for genome-wide association studies.

            Genome-wide scans of nucleotide variation in human subjects are providing an increasing number of replicated associations with complex disease traits. Most of the variants detected have small effects and, collectively, they account for a small fraction of the total genetic variance. Very large sample sizes are required to identify and validate findings. In this situation, even small sources of systematic or random error can cause spurious results or obscure real effects. The need for careful attention to data quality has been appreciated for some time in this field, and a number of strategies for quality control and quality assurance (QC/QA) have been developed. Here we extend these methods and describe a system of QC/QA for genotypic data in genome-wide association studies (GWAS). This system includes some new approaches that (1) combine analysis of allelic probe intensities and called genotypes to distinguish gender misidentification from sex chromosome aberrations, (2) detect autosomal chromosome aberrations that may affect genotype calling accuracy, (3) infer DNA sample quality from relatedness and allelic intensities, (4) use duplicate concordance to infer SNP quality, (5) detect genotyping artifacts from dependence of Hardy-Weinberg equilibrium test P-values on allelic frequency, and (6) demonstrate sensitivity of principal components analysis to SNP selection. The methods are illustrated with examples from the "Gene Environment Association Studies" (GENEVA) program. The results suggest several recommendations for QC/QA in the design and execution of GWAS. (c) 2010 Wiley-Liss, Inc.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              Fast Principal Component Analysis of Large-Scale Genome-Wide Data

              Principal component analysis (PCA) is routinely used to analyze genome-wide single-nucleotide polymorphism (SNP) data, for detecting population structure and potential outliers. However, the size of SNP datasets has increased immensely in recent years and PCA of large datasets has become a time consuming task. We have developed flashpca, a highly efficient PCA implementation based on randomized algorithms, which delivers identical accuracy in extracting the top principal components compared with existing tools, in substantially less time. We demonstrate the utility of flashpca on both HapMap3 and on a large Immunochip dataset. For the latter, flashpca performed PCA of 15,000 individuals up to 125 times faster than existing tools, with identical results, and PCA of 150,000 individuals using flashpca completed in 4 hours. The increasing size of SNP datasets will make tools such as flashpca essential as traditional approaches will not adequately scale. This approach will also help to scale other applications that leverage PCA or eigen-decomposition to substantially larger datasets.
                Bookmark

                Author and article information

                Journal
                Cells
                Cells
                cells
                Cells
                MDPI
                2073-4409
                03 April 2019
                April 2019
                : 8
                : 4
                : 306
                Affiliations
                [1 ]Leonard Davis School of Gerontology, University of Southern California, Los Angeles, CA 90089, USA; brendajm@ 123456usc.edu (B.M.); arpawong@ 123456usc.edu (T.E.A.); henrygji@ 123456usc.edu (H.J.); sujkim@ 123456usc.edu (S.-J.K.); kelviny@ 123456usc.edu (K.Y.); hemalmeh@ 123456usc.edu (H.H.M.); junxianw@ 123456usc.edu (J.W.)
                [2 ]Department of Translational Genomics and Institute for Translational Genomics, Keck School of Medicine of the University of Southern California, Los Angeles, CA 90033, USA; carpten@ 123456usc.edu
                Author notes
                [* ]Correspondence: hassy@ 123456usc.edu
                Article
                cells-08-00306
                10.3390/cells8040306
                6523867
                30987182
                69761e0b-1ddf-4ec3-b479-83a8e4df659c
                © 2019 by the authors.

                Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license ( http://creativecommons.org/licenses/by/4.0/).

                History
                : 01 March 2019
                : 29 March 2019
                Categories
                Article

                genome-wide association study,dna,mitochondrial,polymorphism,single nucleotide,principal component analysis,genetics,population

                Comments

                Comment on this article