14
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Unsupervised gene expression analyses identify IPF-severity correlated signatures, associated genes and biomarkers

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Background

          Idiopathic Pulmonary Fibrosis (IPF) is a fatal fibrotic lung disease occurring predominantly in middle-aged and older adults. The traditional diagnostic classification of IPF is based on clinical, radiological, and histopathological features. However, the considerable heterogeneity in IPF presentation suggests that differences in gene expression profiles can help to characterize and distinguish disease severity.

          Methods

          We used data-driven unsupervised clustering analysis, combined with a knowledge-based approach to identify and characterize IPF subgroups.

          Results

          Using transcriptional profiles on lung tissue from 131 patients with IPF/UIP and 12 non-diseased controls, we identified six subgroups of IPF that generally correlated with the disease severity and lung function decline. Network-informed clustering identified the most severe subgroup of IPF that was enriched with genes regulating inflammatory processes, blood pressure and branching morphogenesis of the lung. The differentially expressed genes in six subgroups of IPF compared to healthy control include transcripts of extracellular matrix, epithelial-mesenchymal cell cross-talk, calcium ion homeostasis, and oxygen transport. Further, we compiled differentially expressed gene signatures to identify unique gene clusters that can segregate IPF from normal, and severe from mild IPF. Additional validations of these signatures were carried out in three independent cohorts of IPF/UIP. Finally, using knowledge-based approaches, we identified several novel candidate genes which may also serve as potential biomarkers of IPF.

          Conclusions

          Discovery of unique and redundant gene signatures for subgroups in IPF can be greatly facilitated through unsupervised clustering. Findings derived from such gene signatures may provide insights into pathogenesis of IPF and facilitate the development of clinically useful biomarkers.

          Electronic supplementary material

          The online version of this article (10.1186/s12890-017-0472-9) contains supplementary material, which is available to authorized users.

          Related collections

          Most cited references19

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          DisGeNET: a comprehensive platform integrating information on human disease-associated genes and variants

          The information about the genetic basis of human diseases lies at the heart of precision medicine and drug discovery. However, to realize its full potential to support these goals, several problems, such as fragmentation, heterogeneity, availability and different conceptualization of the data must be overcome. To provide the community with a resource free of these hurdles, we have developed DisGeNET (http://www.disgenet.org), one of the largest available collections of genes and variants involved in human diseases. DisGeNET integrates data from expert curated repositories, GWAS catalogues, animal models and the scientific literature. DisGeNET data are homogeneously annotated with controlled vocabularies and community-driven ontologies. Additionally, several original metrics are provided to assist the prioritization of genotype–phenotype relationships. The information is accessible through a web interface, a Cytoscape App, an RDF SPARQL endpoint, scripts in several programming languages and an R package. DisGeNET is a versatile platform that can be used for different research purposes including the investigation of the molecular underpinnings of specific human diseases and their comorbidities, the analysis of the properties of disease genes, the generation of hypothesis on drug therapeutic action and drug adverse effects, the validation of computationally predicted disease genes and the evaluation of text-mining methods performance.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Rank products: a simple, yet powerful, new method to detect differentially regulated genes in replicated microarray experiments.

            One of the main objectives in the analysis of microarray experiments is the identification of genes that are differentially expressed under two experimental conditions. This task is complicated by the noisiness of the data and the large number of genes that are examined simultaneously. Here, we present a novel technique for identifying differentially expressed genes that does not originate from a sophisticated statistical model but rather from an analysis of biological reasoning. The new technique, which is based on calculating rank products (RP) from replicate experiments, is fast and simple. At the same time, it provides a straightforward and statistically stringent way to determine the significance level for each gene and allows for the flexible control of the false-detection rate and familywise error rate in the multiple testing situation of a microarray experiment. We use the RP technique on three biological data sets and show that in each case it performs more reliably and consistently than the non-parametric t-test variant implemented in Tusher et al.'s significance analysis of microarrays (SAM). We also show that the RP results are reliable in highly noisy data. An analysis of the physiological function of the identified genes indicates that the RP approach is powerful for identifying biologically relevant expression changes. In addition, using RP can lead to a sharp reduction in the number of replicate experiments needed to obtain reproducible results.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Three-dimensional reconstruction of protein networks provides insight into human genetic disease.

              To better understand the molecular mechanisms and genetic basis of human disease, we systematically examine relationships between 3,949 genes, 62,663 mutations and 3,453 associated disorders by generating a three-dimensional, structurally resolved human interactome. This network consists of 4,222 high-quality binary protein-protein interactions with their atomic-resolution interfaces. We find that in-frame mutations (missense point mutations and in-frame insertions and deletions) are enriched on the interaction interfaces of proteins associated with the corresponding disorders, and that the disease specificity for different mutations of the same gene can be explained by their location within an interface. We also predict 292 candidate genes for 694 unknown disease-to-gene associations with proposed molecular mechanism hypotheses. This work indicates that knowledge of how in-frame disease mutations alter specific interactions is critical to understanding pathogenesis. Structurally resolved interaction networks should be valuable tools for interpreting the wealth of data being generated by large-scale structural genomics and disease association studies.
                Bookmark

                Author and article information

                Contributors
                yunguan.wang@cchmc.org
                jaswanth.yella@cchmc.org
                jing.chen2@cchmc.org
                mccormfx@ucmail.uc.edu
                satish.madala@cchmc.org
                anil.jegga@cchmc.org
                Journal
                BMC Pulm Med
                BMC Pulm Med
                BMC Pulmonary Medicine
                BioMed Central (London )
                1471-2466
                20 October 2017
                20 October 2017
                2017
                : 17
                : 133
                Affiliations
                [1 ]ISNI 0000 0000 9025 8099, GRID grid.239573.9, Division of Biomedical Informatics, , Cincinnati Children’s Hospital Medical Center, ; Cincinnati, OH USA
                [2 ]ISNI 0000 0001 2179 9593, GRID grid.24827.3b, Division of Pulmonary, Critical Care and Sleep Medicine, , University of Cincinnati, ; Cincinnati, OH USA
                [3 ]ISNI 0000 0000 9025 8099, GRID grid.239573.9, Division of Pulmonary Medicine, , Cincinnati Children’s Hospital Medical Center, ; Cincinnati, OH USA
                [4 ]ISNI 0000 0001 2179 9593, GRID grid.24827.3b, Department of Pediatrics, , University of Cincinnati College of Medicine, ; Cincinnati, OH USA
                [5 ]ISNI 0000 0001 2179 9593, GRID grid.24827.3b, Department of Computer Science, , University of Cincinnati College of Engineering, ; Cincinnati, OH USA
                Author information
                http://orcid.org/0000-0002-4881-7752
                Article
                472
                10.1186/s12890-017-0472-9
                5649521
                29058630
                2a703168-adf4-4e97-b94d-def26ea8c538
                © The Author(s). 2017

                Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License ( http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

                History
                : 8 August 2017
                : 1 October 2017
                Funding
                Funded by: FundRef http://dx.doi.org/10.13039/100000050, National Heart, Lung, and Blood Institute;
                Award ID: 1R01 HL134801
                Award ID: 1R21HL133539
                Award ID: 1R21HL133539
                Award ID: 1R21 HL135368
                Categories
                Research Article
                Custom metadata
                © The Author(s) 2017

                Respiratory medicine
                idiopathic pulmonary fibrosis,ipf,gene expression analysis,gene signature,ipf subtyping

                Comments

                Comment on this article