6
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Comparison and development of machine learning tools for the prediction of chronic obstructive pulmonary disease in the Chinese population

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Background

          Chronic obstructive pulmonary disease (COPD) is a major public health problem and cause of mortality worldwide. However, COPD in the early stage is usually not recognized and diagnosed. It is necessary to establish a risk model to predict COPD development.

          Methods

          A total of 441 COPD patients and 192 control subjects were recruited, and 101 single-nucleotide polymorphisms (SNPs) were determined using the MassArray assay. With 5 clinical features as well as SNPs, 6 predictive models were established and evaluated in the training set and test set by the confusion matrix AU-ROC, AU-PRC, sensitivity (recall), specificity, accuracy, F1 score, MCC, PPV (precision) and NPV. The selected features were ranked.

          Results

          Nine SNPs were significantly associated with COPD. Among them, 6 SNPs (rs1007052, OR = 1.671, P = 0.010; rs2910164, OR = 1.416, P < 0.037; rs473892, OR = 1.473, P < 0.044; rs161976, OR = 1.594, P < 0.044; rs159497, OR = 1.445, P < 0.045; and rs9296092, OR = 1.832, P < 0.045) were risk factors for COPD, while 3 SNPs (rs8192288, OR = 0.593, P < 0.015; rs20541, OR = 0.669, P < 0.018; and rs12922394, OR = 0.651, P < 0.022) were protective factors for COPD development. In the training set, KNN, LR, SVM, DT and XGboost obtained AU-ROC values above 0.82 and AU-PRC values above 0.92. Among these models, XGboost obtained the highest AU-ROC (0.94), AU-PRC (0.97), accuracy (0.91), precision (0.95), F1 score (0.94), MCC (0.77) and specificity (0.85), while MLP obtained the highest sensitivity (recall) (0.99) and NPV (0.87). In the validation set, KNN, LR and XGboost obtained AU-ROC and AU-PRC values above 0.80 and 0.85, respectively. KNN had the highest precision (0.82), both KNN and LR obtained the same highest accuracy (0.81), and KNN and LR had the same highest F1 score (0.86). Both DT and MLP obtained sensitivity (recall) and NPV values above 0.94 and 0.84, respectively. In the feature importance analyses, we identified that AQCI, age, and BMI had the greatest impact on the predictive abilities of the models, while SNPs, sex and smoking were less important.

          Conclusions

          The KNN, LR and XGboost models showed excellent overall predictive power, and the use of machine learning tools combining both clinical and SNP features was suitable for predicting the risk of COPD development.

          Related collections

          Most cited references58

          • Record: found
          • Abstract: found
          • Article: not found

          An official American Thoracic Society public policy statement: Novel risk factors and the global burden of chronic obstructive pulmonary disease.

          Although cigarette smoking is the most important cause of chronic obstructive pulmonary disease (COPD), a substantial proportion of COPD cases cannot be explained by smoking alone. To evaluate the risk factors for COPD besides personal cigarette smoking. We constituted an ad hoc subcommittee of the American Thoracic Society Environmental and Occupational Health Assembly. An international group of members was invited, based on their scientific expertise in a specific risk factor for COPD. For each risk factor area, the committee reviewed the literature, summarized the evidence, and developed conclusions about the likelihood of it causing COPD. All conclusions were based on unanimous consensus. The population-attributable fraction for smoking as a cause of COPD ranged from 9.7 to 97.9%, but was less than 80% in most studies, indicating a substantial burden of disease attributable to nonsmoking risk factors. On the basis of our review, we concluded that specific genetic syndromes and occupational exposures were causally related to the development of COPD. Traffic and other outdoor pollution, secondhand smoke, biomass smoke, and dietary factors are associated with COPD, but sufficient criteria for causation were not met. Chronic asthma and tuberculosis are associated with irreversible loss of lung function, but there remains uncertainty about whether there are important phenotypic differences compared with COPD as it is typically encountered in clinical settings. In public health terms, a substantive burden of COPD is attributable to risk factors other than smoking. To prevent COPD-related disability and mortality, efforts must focus on prevention and cessation of exposure to smoking and these other, less well-recognized risk factors.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Inducible targeting of IL-13 to the adult lung causes matrix metalloproteinase- and cathepsin-dependent emphysema.

            Cigarette smoke exposure is the major cause of chronic obstructive pulmonary disease (COPD). However, only a minority of smokers develop significant COPD, and patients with asthma or asthma-like airway hyperresponsiveness or eosinophilia experience accelerated loss of lung function after cigarette smoke exposure. Pulmonary inflammation is a characteristic feature of lungs from patients with COPD. Surprisingly, the mediators of this inflammation and their contributions to the pathogenesis and varied natural history of COPD are not well defined. Here we show that IL-13, a critical cytokine in asthma, causes emphysema with enhanced lung volumes and compliance, mucus metaplasia, and inflammation, when inducibly overexpressed in the adult murine lung. MMP-2, -9, -12, -13, and -14 and cathepsins B, S, L, H, and K were induced by IL-13 in this setting. In addition, treatment with MMP or cysteine proteinase antagonists significantly decreased the emphysema and inflammation, but not the mucus in these animals. These studies demonstrate that IL-13 is a potent stimulator of MMP and cathepsin-based proteolytic pathways in the lung. They also demonstrate that IL-13 causes emphysema via a MMP- and cathepsin-dependent mechanism(s) and highlight common mechanisms that may underlie COPD and asthma.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Pulmonary function is a long-term predictor of mortality in the general population: 29-year follow-up of the Buffalo Health Study.

              Results from several studies have described a relationship between pulmonary function and both all-cause and cause-specific mortality. The purpose of this study was to investigate the predictive value of pulmonary function by gender after 29 years of follow-up. Prospective study with 29-year follow-up of the Buffalo Health Study cohort. Randomly selected sample of 554 men and 641 women, aged 20 to 89 years, from all listed households of the city of Buffalo, NY. Baseline measurements were performed in 1960 to 1961. Pulmonary function was assessed based on FEV(1) expressed as the normal percent predicted (FEV(1)%pred). FEV(1)%pred adjusted by age, body mass index, systolic BP, education, and smoking status was inversely related to all-cause mortality in both men and women (p 25 years, we observed a statistically significant negative association between FEV(1)%pred and all-cause mortality. FEV(1)%pred was also inversely related to ischemic heart disease (IHD) mortality. When participants were divided into quintiles of FEV(1)%pred, participants in the lowest quintile of FEV(1)%pred experienced significantly higher all-cause mortality compared with participants in the highest quintile of FEV(1)%pred. For the entire follow-up period, the adjusted hazard ratios for all-cause mortality were 2.24 (95% confidence interval [CI], 1.60 to 3.13) for men and 1. 81 (95% CI, 1.24 to 2.63) for women, respectively. Hazard ratios for death from IHD in the lowest quintile of FEV(1)%pred were 2.11 (95% CI, 1.20 to 3.71) and 1.96 (95% CI, 0.99 to 3.88) for men and women, respectively. These results suggest that pulmonary function is a long-term predictor for overall survival rates in both genders and could be used as a tool in general health assessment.
                Bookmark

                Author and article information

                Contributors
                ping209@163.com
                tang11_23@126.com
                1051569807@qq.com
                Journal
                J Transl Med
                J Transl Med
                Journal of Translational Medicine
                BioMed Central (London )
                1479-5876
                31 March 2020
                31 March 2020
                2020
                : 18
                : 146
                Affiliations
                [1 ]Department of Pulmonary and Critical Care Medicine, General Hospital of Datong Coal Mine Group Co., Ltd., Datong, 037000 China
                [2 ]GRID grid.263452.4, ISNI 0000 0004 1798 4018, Department of Respiratory, , General Hospital of Tisco (Sixth Hospital of Shanxi Medical University), ; 2 Yingxin Street, Jiancaoping District, Taiyuan, 030008 Shanxi Province China
                [3 ]Department of Respiratory, Linfen People’s Hospital, Linfen, 041000 China
                [4 ]Shanghai Biotecan Pharmaceuticals Co., Ltd., 180 Zhangheng Road, Shanghai, 201204 China
                [5 ]Shanghai Zhangjiang Institute of Medical Innovation, Shanghai, 201204 China
                [6 ]GRID grid.440208.a, Department of Respiratory Medicine, , Hebei General Hospital, ; Shijiazhuang, 050000 China
                [7 ]GRID grid.254020.1, Department of Respiratory Medicine, , Heji Hospital Affiliated with Changzhi Medical College, ; Changzhi, 046011 China
                [8 ]Department of Clinical Laboratory, JinCheng People’s Hospital, Jincheng, 048000 China
                [9 ]Department of Clinical Laboratory, Linfen People’s Hospital, West of Rainbow Bridge, West Binhe Road, Yaodu District, Linfen, 041000 Shanxi Province China
                [10 ]GRID grid.452461.0, ISNI 0000 0004 1762 8478, Department of Pulmonary and Critical Care Medicine, , The First Hospital of Shanxi Medical University, ; Taiyuan, 030001 China
                Article
                2312
                10.1186/s12967-020-02312-0
                7110698
                32234053
                22dd2ffe-f7b3-4c9f-b9d0-5facb7efea8f
                © The Author(s) 2020

                Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver ( http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

                History
                : 16 July 2019
                : 17 March 2020
                Funding
                Funded by: Major Projects of Special Development Funds in Zhangjiang National Independent Innovation Demonstration Zone, Shanghai
                Award ID: ZJ2017-ZD-012
                Categories
                Research
                Custom metadata
                © The Author(s) 2020

                Medicine
                copd,snp,aqci,allele frequencies,machine learning tools
                Medicine
                copd, snp, aqci, allele frequencies, machine learning tools

                Comments

                Comment on this article