2
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Gene expression microarray public dataset reanalysis in chronic obstructive pulmonary disease

      research-article
      1 , 2 , 2 , 2 , 3 , *
      PLoS ONE
      Public Library of Science

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Chronic obstructive pulmonary disease (COPD) was classified by the Centers for Disease Control and Prevention in 2014 as the 3 rd leading cause of death in the United States (US). The main cause of COPD is exposure to tobacco smoke and air pollutants. Problems associated with COPD include under-diagnosis of the disease and an increase in the number of smokers worldwide. The goal of our study is to identify disease variability in the gene expression profiles of COPD subjects compared to controls, by reanalyzing pre-existing, publicly available microarray expression datasets. Our inclusion criteria for microarray datasets selected for smoking status, age and sex of blood donors reported. Our datasets used Affymetrix, Agilent microarray platforms (7 datasets, 1,262 samples). We re-analyzed the curated raw microarray expression data using R packages, and used Box-Cox power transformations to normalize datasets. To identify significant differentially expressed genes we used generalized least squares models with disease state, age, sex, smoking status and study as effects that also included binary interactions, followed by likelihood ratio tests (LRT). We found 3,315 statistically significant (Storey-adjusted q-value <0.05) differentially expressed genes with respect to disease state (COPD or control). We further filtered these genes for biological effect using results from LRT q-value <0.05 and model estimates’ 10% two-tailed quantiles of mean differences between COPD and control), to identify 679 genes. Through analysis of disease, sex, age, and also smoking status and disease interactions we identified differentially expressed genes involved in a variety of immune responses and cell processes in COPD. We also trained a logistic regression model using the common array genes as features, which enabled prediction of disease status with 81.7% accuracy. Our results give potential for improving the diagnosis of COPD through blood and highlight novel gene expression disease signatures.

          Related collections

          Most cited references26

          • Record: found
          • Abstract: not found
          • Article: not found

          Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing

            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            ArrayExpress--a public repository for microarray gene expression data at the EBI.

            A Brazma (2003)
            ArrayExpress is a new public database of microarray gene expression data at the EBI, which is a generic gene expression database designed to hold data from all microarray platforms. ArrayExpress uses the annotation standard Minimum Information About a Microarray Experiment (MIAME) and the associated XML data exchange format Microarray Gene Expression Markup Language (MAGE-ML) and it is designed to store well annotated data in a structured way. The ArrayExpress infrastructure consists of the database itself, data submissions in MAGE-ML format or via an online submission tool MIAMExpress, online database query interface, and the Expression Profiler online analysis tool. ArrayExpress accepts three types of submission, arrays, experiments and protocols, each of these is assigned an accession number. Help on data submission and annotation is provided by the curation team. The database can be queried on parameters such as author, laboratory, organism, experiment or array types. With an increasing number of organisations adopting MAGE-ML standard, the volume of submissions to ArrayExpress is increasing rapidly. The database can be accessed at http://www.ebi.ac.uk/arrayexpress.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Chronic obstructive pulmonary disease.

              Chronic obstructive pulmonary disease (COPD) is a common disease with high global morbidity and mortality. COPD is characterized by poorly reversible airway obstruction, which is confirmed by spirometry, and includes obstruction of the small airways (chronic obstructive bronchiolitis) and emphysema, which lead to air trapping and shortness of breath in response to physical exertion. The most common risk factor for the development of COPD is cigarette smoking, but other environmental factors, such as exposure to indoor air pollutants - especially in developing countries - might influence COPD risk. Not all smokers develop COPD and the reasons for disease susceptibility in these individuals have not been fully elucidated. Although the mechanisms underlying COPD remain poorly understood, the disease is associated with chronic inflammation that is usually corticosteroid resistant. In addition, COPD involves accelerated ageing of the lungs and an abnormal repair mechanism that might be driven by oxidative stress. Acute exacerbations, which are mainly triggered by viral or bacterial infections, are important as they are linked to a poor prognosis. The mainstay of the management of stable disease is the use of inhaled long-acting bronchodilators, whereas corticosteroids are beneficial primarily in patients who have coexisting features of asthma, such as eosinophilic inflammation and more reversibility of airway obstruction. Apart from smoking cessation, no treatments reduce disease progression. More research is needed to better understand disease mechanisms and to develop new treatments that reduce disease activity and progression.
                Bookmark

                Author and article information

                Contributors
                Role: Data curationRole: Formal analysisRole: InvestigationRole: MethodologyRole: SoftwareRole: VisualizationRole: Writing – original draftRole: Writing – review & editing
                Role: Data curationRole: InvestigationRole: Writing – review & editing
                Role: ConceptualizationRole: Data curationRole: Formal analysisRole: Funding acquisitionRole: InvestigationRole: MethodologyRole: Project administrationRole: ResourcesRole: SoftwareRole: SupervisionRole: VisualizationRole: Writing – original draftRole: Writing – review & editing
                Role: Editor
                Journal
                PLoS One
                PLoS ONE
                plos
                plosone
                PLoS ONE
                Public Library of Science (San Francisco, CA USA )
                1932-6203
                2019
                15 November 2019
                : 14
                : 11
                : e0224750
                Affiliations
                [1 ] Microbiology and Molecular Genetics, Michigan State University, East Lansing, MI, United States of America
                [2 ] Institute for Quantitative Health Science and Engineering, Michigan State University, East Lansing, MI, United States of America
                [3 ] Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI, United States of America
                University of Alabama-Birmingham, UNITED STATES
                Author notes

                Competing Interests: I have read the journal’s policy and the authors of this manuscript have the following competing interests: GIM has previously consulted for Colgate-Palmolive. LRKR and MV have declared that no competing interests exist. This does not alter our adherence to PLOS ONE policies on sharing data and materials.

                Author information
                http://orcid.org/0000-0002-9083-1216
                Article
                PONE-D-19-16838
                10.1371/journal.pone.0224750
                6857915
                31730674
                d6329f08-e500-4ae8-96b9-7be856ec53ee
                © 2019 Rogers et al

                This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

                History
                : 17 June 2019
                : 21 October 2019
                Page count
                Figures: 11, Tables: 4, Pages: 23
                Funding
                LRKR is funded through the University Enrichment Fellowship at Michigan State University. GIM is funded by Jean P. Schultz Endowed Biomedical Research Fund.
                Categories
                Research Article
                Medicine and Health Sciences
                Pulmonology
                Chronic Obstructive Pulmonary Disease
                Biology and Life Sciences
                Genetics
                Gene Expression
                Research and Analysis Methods
                Bioassays and Physiological Analysis
                Microarrays
                Biology and Life Sciences
                Psychology
                Behavior
                Habits
                Smoking Habits
                Social Sciences
                Psychology
                Behavior
                Habits
                Smoking Habits
                Research and Analysis Methods
                Mathematical and Statistical Techniques
                Statistical Methods
                Multivariate Analysis
                Principal Component Analysis
                Physical Sciences
                Mathematics
                Statistics
                Statistical Methods
                Multivariate Analysis
                Principal Component Analysis
                Biology and Life Sciences
                Genetics
                Genomics
                Human Genomics
                Biology and Life Sciences
                Anatomy
                Body Fluids
                Blood
                Medicine and Health Sciences
                Anatomy
                Body Fluids
                Blood
                Biology and Life Sciences
                Physiology
                Body Fluids
                Blood
                Medicine and Health Sciences
                Physiology
                Body Fluids
                Blood
                People and Places
                Population Groupings
                Age Groups
                Custom metadata
                All of our datasets, data files and results from our COPD meta-analysis have been deposited to FigShare. The file names begin with the prefix “DF” and are referred to throughout the manuscript. To access our supplemental data files access the FigShare online repository at: https://doi.org/10.6084/m9.figshare.8233175. Datasets used in the meta-analysis are from Gene Expression Omnibus and Array Express. The data were originally deposited under the following accessions (also listed in Table 1 of main text): GSE42057, GSE47415, GSE54837, GSE71220, GSE87072, E-MTAB-5278, E-MTAB-5279.

                Uncategorized
                Uncategorized

                Comments

                Comment on this article