17
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      An integrative functional genomics framework for effective identification of novel regulatory variants in genome–phenome studies

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Background

          Genome–phenome studies have identified thousands of variants that are statistically associated with disease or traits; however, their functional roles are largely unclear. A comprehensive investigation of regulatory mechanisms and the gene regulatory networks between phenome-wide association study (PheWAS) and genome-wide association study (GWAS) is needed to identify novel regulatory variants contributing to risk for human diseases.

          Methods

          In this study, we developed an integrative functional genomics framework that maps 215,107 significant single nucleotide polymorphism (SNP) traits generated from the PheWAS Catalog and 28,870 genome-wide significant SNP traits collected from the GWAS Catalog into a global human genome regulatory map via incorporating various functional annotation data, including transcription factor (TF)-based motifs, promoters, enhancers, and expression quantitative trait loci (eQTLs) generated from four major functional genomics databases: FANTOM5, ENCODE, NIH Roadmap, and Genotype-Tissue Expression (GTEx). In addition, we performed a tissue-specific regulatory circuit analysis through the integration of the identified regulatory variants and tissue-specific gene expression profiles in 7051 samples across 32 tissues from GTEx.

          Results

          We found that the disease-associated loci in both the PheWAS and GWAS Catalogs were significantly enriched with functional SNPs. The integration of functional annotations significantly improved the power of detecting novel associations in PheWAS, through which we found a number of functional associations with strong regulatory evidence in the PheWAS Catalog. Finally, we constructed tissue-specific regulatory circuits for several complex traits: mental diseases, autoimmune diseases, and cancer, via exploring tissue-specific TF-promoter/enhancer-target gene interaction networks. We uncovered several promising tissue-specific regulatory TFs or genes for Alzheimer’s disease (e.g. ZIC1 and STX1B) and asthma (e.g. CSF3 and IL1RL1).

          Conclusions

          This study offers powerful tools for exploring the functional consequences of variants generated from genome–phenome association studies in terms of their mechanisms on affecting multiple complex diseases and traits.

          Electronic supplementary material

          The online version of this article (10.1186/s13073-018-0513-x) contains supplementary material, which is available to authorized users.

          Related collections

          Most cited references44

          • Record: found
          • Abstract: not found
          • Article: not found

          Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing

            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            DisGeNET: a comprehensive platform integrating information on human disease-associated genes and variants

            The information about the genetic basis of human diseases lies at the heart of precision medicine and drug discovery. However, to realize its full potential to support these goals, several problems, such as fragmentation, heterogeneity, availability and different conceptualization of the data must be overcome. To provide the community with a resource free of these hurdles, we have developed DisGeNET (http://www.disgenet.org), one of the largest available collections of genes and variants involved in human diseases. DisGeNET integrates data from expert curated repositories, GWAS catalogues, animal models and the scientific literature. DisGeNET data are homogeneously annotated with controlled vocabularies and community-driven ontologies. Additionally, several original metrics are provided to assist the prioritization of genotype–phenotype relationships. The information is accessible through a web interface, a Cytoscape App, an RDF SPARQL endpoint, scripts in several programming languages and an R package. DisGeNET is a versatile platform that can be used for different research purposes including the investigation of the molecular underpinnings of specific human diseases and their comorbidities, the analysis of the properties of disease genes, the generation of hypothesis on drug therapeutic action and drug adverse effects, the validation of computationally predicted disease genes and the evaluation of text-mining methods performance.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              PhosphoSitePlus: a comprehensive resource for investigating the structure and function of experimentally determined post-translational modifications in man and mouse

              PhosphoSitePlus (http://www.phosphosite.org) is an open, comprehensive, manually curated and interactive resource for studying experimentally observed post-translational modifications, primarily of human and mouse proteins. It encompasses 1 30 000 non-redundant modification sites, primarily phosphorylation, ubiquitinylation and acetylation. The interface is designed for clarity and ease of navigation. From the home page, users can launch simple or complex searches and browse high-throughput data sets by disease, tissue or cell line. Searches can be restricted by specific treatments, protein types, domains, cellular components, disease, cell types, cell lines, tissue and sequences or motifs. A few clicks of the mouse will take users to substrate pages or protein pages with sites, sequences, domain diagrams and molecular visualization of side-chains known to be modified; to site pages with information about how the modified site relates to the functions of specific proteins and cellular processes and to curated information pages summarizing the details from one record. PyMOL and Chimera scripts that colorize reactive groups on residues that are modified can be downloaded. Features designed to facilitate proteomic analyses include downloads of modification sites, kinase–substrate data sets, sequence logo generators, a Cytoscape plugin and BioPAX download to enable pathway visualization of the kinase–substrate interactions in PhosphoSitePlus®.
                Bookmark

                Author and article information

                Contributors
                713-500-3631 , zhongming.zhao@uth.tmc.edu
                Journal
                Genome Med
                Genome Med
                Genome Medicine
                BioMed Central (London )
                1756-994X
                29 January 2018
                29 January 2018
                2018
                : 10
                : 7
                Affiliations
                [1 ]ISNI 0000 0000 9206 2401, GRID grid.267308.8, Center for Precision Health, School of Biomedical Informatics, , The University of Texas Health Science Center at Houston, ; 7000 Fannin St. Suite 820, Houston, TX 77030 USA
                [2 ]ISNI 000000041936754X, GRID grid.38142.3c, Center for Cancer Systems Biology and Department of Cancer Biology, Dana-Farber Cancer Institute, , Harvard Medical School, ; Boston, MA 02215 USA
                [3 ]ISNI 0000 0001 2173 3359, GRID grid.261112.7, Center for Complex Networks Research, , Northeastern University, ; Boston, MA 02215 USA
                [4 ]ISNI 0000 0001 2264 7217, GRID grid.152326.1, Vanderbilt Genetics Institute, , Vanderbilt University School of Medicine, ; Nashville, TN 37232 USA
                [5 ]ISNI 0000 0004 1936 9916, GRID grid.412807.8, Department of Medicine, , Vanderbilt University Medical Center, ; Nashville, TN 37232 USA
                [6 ]ISNI 0000 0004 1936 9916, GRID grid.412807.8, Department of Biomedical Informatics, , Vanderbilt University Medical Center, ; Nashville, TN 37232 USA
                [7 ]ISNI 0000 0000 9206 2401, GRID grid.267308.8, Human Genetics Center, School of Public Health, , The University of Texas Health Science Center at Houston, ; Houston, TX 77030 USA
                Author information
                http://orcid.org/0000-0002-3477-0914
                Article
                513
                10.1186/s13073-018-0513-x
                5789733
                29378629
                af3b18b2-0263-40ef-a9fd-fdef757a55ec
                © The Author(s). 2018

                Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License ( http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

                History
                : 19 June 2017
                : 4 January 2018
                Funding
                Funded by: FundRef http://dx.doi.org/10.13039/100000009, Foundation for the National Institutes of Health;
                Award ID: R01LM010685
                Award ID: R01LM012806
                Award Recipient :
                Categories
                Research
                Custom metadata
                © The Author(s) 2018

                Molecular medicine
                phenome-wide association study (phewas),genome-wide association study (gwas),regulatory variants,enhancer,promoter,human disease

                Comments

                Comment on this article