1
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Bayesian multivariate reanalysis of large genetic studies identifies many new associations

      research-article
      1 , 1 , 2 , *
      PLoS Genetics
      Public Library of Science

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Genome-wide association studies (GWAS) have now been conducted for hundreds of phenotypes of relevance to human health. Many such GWAS involve multiple closely-related phenotypes collected on the same samples. However, the vast majority of these GWAS have been analyzed using simple univariate analyses, which consider one phenotype at a time. This is despite the fact that, at least in simulation experiments, multivariate analyses have been shown to be more powerful at detecting associations. Here, we conduct multivariate association analyses on 13 different publicly-available GWAS datasets that involve multiple closely-related phenotypes. These data include large studies of anthropometric traits (GIANT), plasma lipid traits (GlobalLipids), and red blood cell traits (HaemgenRBC). Our analyses identify many new associations (433 in total across the 13 studies), many of which replicate when follow-up samples are available. Overall, our results demonstrate that multivariate analyses can help make more effective use of data from both existing and future GWAS.

          Author summary

          Genome-wide association studies (GWAS) have become a common and powerful tool for identifying significant correlations between markers of genetic variation and physical traits of interest. Often these studies are conducted by comparing genetic variation against single traits one at a time (‘univariate’); however, it has previously been shown that it is possible to increase your power to detect significant associations by comparing genetic variation against multiple traits simultaneously (‘multivariate’). Despite this apparent increase in power though, researchers still rarely conduct multivariate GWAS, even when studies have multiple traits readily available. Here, we reanalyze 13 previously published GWAS using a multivariate method and find >400 additional associations. Our method makes use of univariate GWAS summary statistics and is available as a software package, thus making it accessible to other researchers interested in conducting the same analyses. We also show, using studies that have multiple releases, that our new associations have high rates of replication. Overall, we argue multivariate approaches in GWAS should no longer be overlooked and how, often, there is low-hanging fruit in the form of new associations by running these methods on data already collected.

          Related collections

          Most cited references18

          • Record: found
          • Abstract: found
          • Article: not found

          Moving toward System Genetics through Multiple Trait Analysis in Genome-Wide Association Studies

          Association studies are a staple of genotype–phenotype mapping studies, whether they are based on single markers, haplotypes, candidate genes, genome-wide genotypes, or whole genome sequences. Although genetic epidemiological studies typically contain data collected on multiple traits which themselves are often correlated, most analyses have been performed on single traits. Here, I review several methods that have been developed to perform multiple trait analysis. These methods range from traditional multivariate models for systems of equations to recently developed graphical approaches based on network theory. The application of network theory to genetics is termed systems genetics and has the potential to address long-standing questions in genetics about complex processes such as coordinate regulation, homeostasis, and pleiotropy.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Flexible statistical methods for estimating and testing effects in genomic studies with multiple conditions

            We introduce new statistical methods for analyzing genomic datasets that measure many effects in many conditions (e.g., gene expression changes under many treatments). These new methods improve on existing methods by allowing for arbitrary correlations in effect sizes among conditions. This flexible approach increases power, improves effect estimates, and allows for more quantitative assessments of effect-size heterogeneity compared to simple “shared/condition-specific” assessments. We illustrate these features through an analysis of locally-acting variants associated with gene expression (“cis eQTLs”) in 44 human tissues. Our analysis identifies more eQTLs than existing approaches, consistent with improved power. We show that while genetic effects on expression are extensively shared among tissues, effect sizes can still vary greatly among tissues. Some shared eQTLs show stronger effects in subsets of biologically related tissues (e.g., brain-related tissues), or in only one tissue (e.g., testis). Our methods are widely applicable, computationally tractable for many conditions, and available online.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Multiple trait analysis of genetic mapping for quantitative trait loci.

              We present in this paper models and statistical methods for performing multiple trait analysis on mapping quantitative trait loci (QTL) based on the composite interval mapping method. By taking into account the correlated structure of multiple traits, this joint analysis has several advantages, compared with separate analyses, for mapping QTL, including the expected improvement on the statistical power of the test for QTL and on the precision of parameter estimation. Also this joint analysis provides formal procedures to test a number of biologically interesting hypotheses concerning the nature of genetic correlations between different traits. Among the testing procedures considered are those for joint mapping, pleiotropy, QTL by environment interaction, and pleiotropy vs. close linkage. The test of pleiotropy (one pleiotropic QTL at a genome position) vs. close linkage (multiple nearby nonpleiotropic QTL) can have important implications for our understanding of the nature of genetic correlations between different traits in certain regions of a genome and also for practical applications in animal and plant breeding because one of the major goals in breeding is to break unfavorable linkage. Results of extensive simulation studies are presented to illustrate various properties of the analyses.
                Bookmark

                Author and article information

                Contributors
                Role: ConceptualizationRole: Data curationRole: Formal analysisRole: Funding acquisitionRole: InvestigationRole: ResourcesRole: SoftwareRole: ValidationRole: VisualizationRole: Writing – Original DraftRole: Writing – review & editing
                Role: ConceptualizationRole: Funding acquisitionRole: MethodologyRole: Project administrationRole: ResourcesRole: SoftwareRole: SupervisionRole: Writing – review & editing
                Role: Editor
                Journal
                PLoS Genet
                PLoS Genet
                plos
                plosgen
                PLoS Genetics
                Public Library of Science (San Francisco, CA USA )
                1553-7390
                1553-7404
                October 2019
                9 October 2019
                : 15
                : 10
                : e1008431
                Affiliations
                [1 ] Department of Human Genetics, The University of Chicago, Chicago, Illinois, United States of America
                [2 ] Department of Statistics, The University of Chicago, Chicago, Illinois, United States of America
                Emory University, UNITED STATES
                Author notes

                The authors have declared that no competing interests exist.

                [¤]

                Current address: Center for Computational Molecular Biology, Department of Ecology and Evolutionary Biology, Brown University, Providence, Rhode Island, United States of America

                Author information
                http://orcid.org/0000-0003-3569-1529
                http://orcid.org/0000-0001-5397-9257
                Article
                PGENETICS-D-19-00888
                10.1371/journal.pgen.1008431
                6802844
                31596850
                b4534d03-d2de-4024-a707-ebf8d0ade077
                © 2019 Turchin, Stephens

                This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

                History
                : 24 May 2019
                : 17 September 2019
                Page count
                Figures: 4, Tables: 2, Pages: 18
                Funding
                Funded by: funder-id http://dx.doi.org/10.13039/100000002, National Institutes of Health;
                Award ID: R01 HG002585
                Award Recipient :
                Funded by: funder-id http://dx.doi.org/10.13039/100000002, National Institutes of Health;
                Award ID: T32 GM007197
                Award Recipient :
                Funded by: funder-id http://dx.doi.org/10.13039/100000002, National Institutes of Health;
                Award ID: TL1 TR000432
                Award Recipient :
                Funded by: funder-id http://dx.doi.org/10.13039/100000002, National Institutes of Health;
                Award ID: F31 AI118375
                Award Recipient :
                Funded by: funder-id http://dx.doi.org/10.13039/100000002, National Institutes of Health;
                Award ID: R01 GM118652
                This work was supported by National Institutes of Health (NIH) Grant R01 HG002585 to MS, NIH Grants T32 GM007197, TL1 TR000432, and F31 AI118375 to MCT, and NIH Grant R01 GM118652. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
                Categories
                Research Article
                Biology and Life Sciences
                Computational Biology
                Genome Analysis
                Genome-Wide Association Studies
                Biology and Life Sciences
                Genetics
                Genomics
                Genome Analysis
                Genome-Wide Association Studies
                Biology and Life Sciences
                Genetics
                Human Genetics
                Genome-Wide Association Studies
                Research and Analysis Methods
                Mathematical and Statistical Techniques
                Statistical Methods
                Multivariate Analysis
                Physical Sciences
                Mathematics
                Statistics
                Statistical Methods
                Multivariate Analysis
                Biology and Life Sciences
                Genetics
                Phenotypes
                Biology and Life Sciences
                Genetics
                Genetic Loci
                Alleles
                Biology and Life Sciences
                Anatomy
                Bone
                Bone Density
                Medicine and Health Sciences
                Anatomy
                Bone
                Bone Density
                Biology and Life Sciences
                Anatomy
                Biological Tissue
                Connective Tissue
                Bone
                Bone Density
                Medicine and Health Sciences
                Anatomy
                Biological Tissue
                Connective Tissue
                Bone
                Bone Density
                Science Policy
                Open Science
                Open Data
                Medicine and Health Sciences
                Vascular Medicine
                Blood Pressure
                Biology and Life Sciences
                Evolutionary Biology
                Population Genetics
                Genetic Polymorphism
                Biology and Life Sciences
                Genetics
                Population Genetics
                Genetic Polymorphism
                Biology and Life Sciences
                Population Biology
                Population Genetics
                Genetic Polymorphism
                Custom metadata
                vor-update-to-uncorrected-proof
                2019-10-21
                All relevant data are within the manuscript and its Supporting Information files.

                Genetics
                Genetics

                Comments

                Comment on this article