6
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      A long non‐coding RNA signature for diagnostic prediction of sepsis upon ICU admission

      letter

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Dear Editor, Sepsis, the highest mortality disease in critically ill patients, is clinically diagnosed through the dysregulated systemic inflammatory response of patients to infection in the presence of organ dysfunction. 1 , 2 , 3 No effective biomarkers and approved molecular therapies have been developed for sepsis to diagnose and treat the immune response state of the patients, leading to the management of these critically ill patients only relies on early recognition by experience and supportive care. 4 , 5 Long noncoding RNAs (lncRNAs) are implicated in a wide variety of biological processes and accumulative studies have demonstrated that several dysregulated lncRNAs play important roles in tumorigenesis and tumor progression. 6 , 7 , 8 However, the lncRNA signature has not been studied for the rapid diagnosis of sepsis, due to the limitation of data sources and lack of RNA‐seq datasets. 3 Hence, we analyzed three whole blood transcriptome cohorts of critically ill adult patients and identified a 28‐lncRNA signature for sepsis diagnosis, which imputes a score to assess the risk of sepsis. The expression profiling of 3745 lncRNAs in three cohorts, GSE95233, GSE28750, and GSE57065, were normalized and reannotated for the investigation 6 , 9 (Table S1). The largest cohort GSE95233 was set as the discovery dataset, while the other two independent cohorts were set as the validation datasets. To select lncRNAs for the predictive signature, we first determined 84 differentially expressed (DE) lncRNAs between sepsis patients and healthy individuals based on the discovery dataset. Then we took advantage of a regression algorithm least absolute shrinkage and selection operator (LASSO) to further identify 28 predictive lncRNAs, named SepSig28, which serves as a molecular diagnostic signature to calculate the risk score to predict whether individuals were suffering from sepsis or not. After that, we validated the diagnostic signature in two independent datasets and demonstrated the high performance of the 28 lncRNAs in the risk prediction of sepsis (Figure 1A). FIGURE 1 Model construction and internal validation. A, Workflow to identify the lncRNA signature of sepsis. B, ROC curves for the 28‐lncRNA signature and other 28‐minus‐one lncRNA signatures. C, AUC, accuracy, sensitivity, and specificity for the 28‐lncRNA signature and other 28‐minus‐one lncRNA signatures. D, Distribution of AUCs for the simulated models in which the lncRNAs were randomly picked up. ROC curve, receiver operating characteristic curve; AUC, area under curve Risk score = (BOLA3.AS1 × 0.254) + (LINC00354 × 0.1996) + (C5orf27 × 0.1537) + (RP1.187B23.1 × ‐0.1427) + (MBNL1.AS1 × ‐0.1419) + (LINC01420 × ‐0.1140) + (RP13.436F16.1 × 0.1060) + (CTB.31O20.2 × 0.1023) + (LINC01425 × 0.0949) + (C10orf25 × ‐0.0763) + (RP11.111M22.3 × 0.0743) + (LAMTOR5.AS1 × 0.0739) + (FLJ37453 × 0.0713) + (AX746755 × ‐0.0690) + (TTTY12 × 0.0678) + (ASMTL.AS1 × ‐0.0535) + (LOC101928491 × 0.0461) + (RBM26.AS1 × ‐0.0438) + (ANP32A.IT1 × 0.0437) + (LOC101060691 × 0.0319) + (MSH5 × ‐0.0311) + (LOC100507221 × 0.0289) + (RP11.1137G4.3 × ‐0.0245) + (LOC100506457 × 0.0237) + (MIR612 × ‐0.0189) + (AC114730.11 × 0.0079) + (LOC101927526 × 0.0026) + (LINC01019 × ‐0.0020). The values following the symbols are the importance weights of the expression abundance of each lncRNA. These lncRNAs are listed in order of decreasing importance. When tuned in the discovery dataset using fivefold cross‐validation, the SepSig28 can perfectly classify the sepsis patient samples and healthy control samples, with all the measures equal 1, including the area under curve (AUC), accuracy, sensitivity, and specificity (Figure 1B,C). To test the randomness of the model, we randomly picked up an equivalent number of lncRNAs 1000 times and evaluated their performance using the same procedure as SepSig28. Our result shows that no random combinations can achieve the score of AUC as high as 1 (Figure 1D). Besides, we constructed all possible 27‐lncRNA signatures (28 minus 1) by excluding one lncRNA once a time to evaluate the predictive capability of each lncRNA in the SepSig28 model. For the discovery dataset, two lncRNA members are not necessary for the model, as the model can perform equally well without either of them (Figure 1C). We added these two as supplementary features to make the model more robust. In the independent cohorts GSE28750 and GSE57065, the hierarchical clustering shows altered expression pattern of the SepSig28 lncRNAs cannot well distinguish sepsis patient samples from the normal ones (Figure 2A,B). Using the computed risk scores by weighted sum, however, SepSig28 can achieve the AUC scores as high as 0.9712 for GSE57065 and 0.95 for GSE28750, respectively (Figure 2C,D), which outperforms almost all the other combinations of 27 (28 minus 1) lncRNAs. Overall, SepSig28 has the best classification performance for all three cohorts according to the measures of AUC, accuracy, sensitivity, and specificity (Figure 2E). FIGURE 2 External validation of SepSig28. Hierarchical clustering of the expression samples based on the 28‐lncRNA signature in dataset GSE57065 (A) and 28750 (B), respectively. ROC curves for the 28‐lncRNA signature and other 28‐minus‐one lncRNA signatures in dataset GSE57065 (C) and 28750 (D). E, AUC, accuracy, sensitivity, and specificity for the 28‐lncRNA signature and other 28‐minus‐one lncRNA signatures in the two validation cohorts To investigate the biological functions the SepSig28 involved, we associated them with their co‐expressed genes across the sepsis samples of each cohort. Genes co‐expressed with the lncRNAs in all the cohorts (Pearson correlation coefficient > 0.7) were considered to be co‐expressed. Gene Ontology (GO) and KEGG pathway enrichment analysis were separately performed for the set of co‐expressed genes. 10 GO enrichment analysis showed that the lncRNAs of SepSig28 are mainly involved in three biological processes, including hormone mediated signaling pathway, RNA splicing, and histone modification (Figure S1A). KEGG analysis showed the SepSig28 associated genes are significantly implicated in pathways that are known to be related to sepsis pathogenesis, including Wnt signaling pathway, Th17 cell differentiation, Notch signaling pathway, etc. (Figure S1B). Interestingly, both GO and KEGG enrichment revealed that lncRNAs in SepSig28 tend to participate in hormone signaling related pathways, indicating an underlying association between hormone signaling and sepsis. In conclusion, we identified and validated the first non‐coding signature consisting of 28 lncRNAs that can well distinguish sepsis patients from healthy controls for adults. Despite limitations such as the limited number of lncRNA features and the small sample size, we provided evidence that lncRNAs could be adopted as markers for the diagnosis of critical diseases. The proposed model could be used as an alternative or complementary diagnostic metric for sepsis. AUTHOR CONTRIBUTIONS LC conceived the idea and drafted the manuscript. LC performed data analysis. XL, XZ, JW, NZ, and RW performed data management and analysis. XL, KL, and XY helped interpret the results and give suggestions. All authors read and approved the final manuscript. CONFLICT OF INTEREST The authors declare no conflict of interest. Supporting information Figure S1. Functional analysis of the protein‐coding genes co‐expressed with the 28 lncRNAs in SepSig28. (A) Functional network of the enriched GO terms. Nodes represent enriched GO terms while edges represent Kappa scores among the nodes. Only the edges with Kappa scores over 0.5 are shown. Node size represents the number of coexpressed genes in GO terms, while color indicates the statistical significance of term enrichment. (B) The enriched KEGG pathways. Node size represents the number of coexpressed genes in the pathways, while the color represents the enrichment significance. Click here for additional data file. Table S1. Discovery and validation cohorts used in this study. Click here for additional data file.

          Related collections

          Most cited references6

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          Development and validation of a novel molecular biomarker diagnostic test for the early detection of sepsis

          Introduction Sepsis is a complex immunological response to infection characterized by early hyper-inflammation followed by severe and protracted immunosuppression, suggesting that a multi-marker approach has the greatest clinical utility for early detection, within a clinical environment focused on Systemic Inflammatory Response Syndrome (SIRS) differentiation. Pre-clinical research using an equine sepsis model identified a panel of gene expression biomarkers that define the early aberrant immune activation. Thus, the primary objective was to apply these gene expression biomarkers to distinguish patients with sepsis from those who had undergone major open surgery and had clinical outcomes consistent with systemic inflammation due to physical trauma and wound healing. Methods This was a multi-centre, prospective clinical trial conducted across four tertiary critical care settings in Australia. Sepsis patients were recruited if they met the 1992 Consensus Statement criteria and had clinical evidence of systemic infection based on microbiology diagnoses (n = 27). Participants in the post-surgical (PS) group were recruited pre-operatively and blood samples collected within 24 hours following surgery (n = 38). Healthy controls (HC) included hospital staff with no known concurrent illnesses (n = 20). Each participant had minimally 5 ml of PAXgene blood collected for leucocyte RNA isolation and gene expression analyses. Affymetrix array and multiplex tandem (MT)-PCR studies were conducted to evaluate transcriptional profiles in circulating white blood cells applying a set of 42 molecular markers that had been identified a priori. A LogitBoost algorithm was used to create a machine learning diagnostic rule to predict sepsis outcomes. Results Based on preliminary microarray analyses comparing HC and sepsis groups, a panel of 42-gene expression markers were identified that represented key innate and adaptive immune function, cell cycling, WBC differentiation, extracellular remodelling and immune modulation pathways. Comparisons against GEO data confirmed the definitive separation of the sepsis cohort. Quantitative PCR results suggest the capacity for this test to differentiate severe systemic inflammation from HC is 92%. The area under the curve (AUC) receiver operator characteristics (ROC) curve findings demonstrated sepsis prediction within a mixed inflammatory population, was between 86 and 92%. Conclusions This novel molecular biomarker test has a clinically relevant sensitivity and specificity profile, and has the capacity for early detection of sepsis via the monitoring of critical care patients.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Analysis of long noncoding RNAs highlights region-specific altered expression patterns and diagnostic roles in Alzheimer’s disease

            Increasing evidence has revealed the multiple roles of long noncoding RNAs (lncRNAs) in neurodevelopment, brain function and aging, and their dysregulation was implicated in many types of neurological diseases. However, expression pattern and diagnostic role of lncRNAs in Alzheimer's disease (AD) remain largely unknown and has gained significant attention. In this study, we performed a comparative analysis for lncRNA expression profiles in four brain regions in brain aging and AD. Our analysis revealed age- and disease-dependent region-specific lncRNA expression patterns in aging and AD. Moreover, we identified a panel of nine lncRNAs (termed LncSigAD9) in a discovery cohort of 114 samples using supervised machine learning and stepwise selection method. The LncSigAD9 was able to differentiate between AD and healthy controls with high diagnostic sensitivity and specificity both in the discovery cohort (86.3 and 89.5%) and the additional independent AD cohort (90.8 and 83.8%). The receiver operating characteristic curves for the LncSigAD9 were 0.863 and 0.939 for discovery and independent cohorts, respectively. Furthermore, the LncSigAD9 demonstrated higher diagnostic performance than nine-minus-one lncRNA signature and mRNA-based signature with a similar number of genes. In silico functional analysis indicated the involvement of lncRNA expression variation in brain development- and metabolism-related biological processes. Taken together, our study highlights the importance of lncRNAs in brain aging and AD, and demonstrated the utility of lncRNAs as a promising biomarker for early AD diagnosis and treatment.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              CrossNorm: a novel normalization strategy for microarray data in cancers

              Normalization is essential to get rid of biases in microarray data for their accurate analysis. Existing normalization methods for microarray gene expression data commonly assume a similar global expression pattern among samples being studied. However, scenarios of global shifts in gene expressions are dominant in cancers, making the assumption invalid. To alleviate the problem, here we propose and develop a novel normalization strategy, Cross Normalization (CrossNorm), for microarray data with unbalanced transcript levels among samples. Conventional procedures, such as RMA and LOESS, arbitrarily flatten the difference between case and control groups leading to biased gene expression estimates. Noticeably, applying these methods under the strategy of CrossNorm, which makes use of the overall statistics of the original signals, the results showed significantly improved robustness and accuracy in estimating transcript level dynamics for a series of publicly available datasets, including titration experiment, simulated data, spike-in data and several real-life microarray datasets across various types of cancers. The results have important implications for the past and the future cancer studies based on microarray samples with non-negligible difference. Moreover, the strategy can also be applied to other sorts of high-throughput data as long as the experiments have global expression variations between conditions.
                Bookmark

                Author and article information

                Contributors
                szrosexiu@126.com
                easonlcheng@gmail.com
                Journal
                Clin Transl Med
                Clin Transl Med
                10.1002/(ISSN)2001-1326
                CTM2
                Clinical and Translational Medicine
                John Wiley and Sons Inc. (Hoboken )
                2001-1326
                02 July 2020
                July 2020
                : 10
                : 3 ( doiID: 10.1002/ctm2.v10.3 )
                : e123
                Affiliations
                [ 1 ] Department of Critical Care Medicine Shenzhen People's Hospital First Affiliated Hospital of Southern University of Science and Technology Shenzhen China
                [ 2 ] Shenzhen People's Hospital First Affiliated Hospital of Southern University of Science and Technology Shenzhen China
                [ 3 ] Department of Computer Science and Engineering The Chinese University of Hong Kong Shatin New Territories Hong Kong
                Author notes
                [*] [* ] Correspondence

                Lixin Cheng, Department of Critical Care Medicine, Shenzhen People's Hospital, First Affiliated Hospital of Southern University of Science and Technology, Shenzhen, China.

                Xiufeng Ye, Shenzhen People's Hospital, First Affiliated Hospital of Southern University of Science and Technology, Shenzhen, China.

                Email: easonlcheng@ 123456gmail.com ; szrosexiu@ 123456126.com

                [†]

                Both authors contributed equally to this work.

                Author information
                https://orcid.org/0000-0002-9427-383X
                Article
                CTM2123
                10.1002/ctm2.123
                7418814
                32614495
                3ceb8ca5-f876-4fb7-b8f5-8d62d39c97b3
                © 2020 The Authors. Clinical and Translational Medicine published by John Wiley & Sons Australia, Ltd on behalf of Shanghai Institute of Clinical Bioinformatics

                This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.

                History
                : 22 June 2020
                : 23 June 2020
                Page count
                Figures: 2, Tables: 0, Pages: 4, Words: 1649
                Categories
                Letter to Editor
                Letter to Editor
                Custom metadata
                2.0
                July 2020
                Converter:WILEY_ML3GV2_TO_JATSPMC version:5.8.6 mode:remove_FC converted:11.08.2020

                Medicine
                Medicine

                Comments

                Comment on this article