7
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      RNA-sequencing and mass-spectrometry proteomic time-series analysis of T-cell differentiation identified multiple splice variants models that predicted validated protein biomarkers in inflammatory diseases

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Profiling of mRNA expression is an important method to identify biomarkers but complicated by limited correlations between mRNA expression and protein abundance. We hypothesised that these correlations could be improved by mathematical models based on measuring splice variants and time delay in protein translation. We characterised time-series of primary human naïve CD4 + T cells during early T helper type 1 differentiation with RNA-sequencing and mass-spectrometry proteomics. We performed computational time-series analysis in this system and in two other key human and murine immune cell types. Linear mathematical mixed time delayed splice variant models were used to predict protein abundances, and the models were validated using out-of-sample predictions. Lastly, we re-analysed RNA-seq datasets to evaluate biomarker discovery in five T-cell associated diseases, further validating the findings for multiple sclerosis (MS) and asthma. The new models significantly out-performing models not including the usage of multiple splice variants and time delays, as shown in cross-validation tests. Our mathematical models provided more differentially expressed proteins between patients and controls in all five diseases. Moreover, analysis of these proteins in asthma and MS supported their relevance. One marker, sCD27, was validated in MS using two independent cohorts for evaluating response to treatment and disease prognosis. In summary, our splice variant and time delay models substantially improved the prediction of protein abundance from mRNA expression in three different immune cell types. The models provided valuable biomarker candidates, which were further validated in MS and asthma.

          Related collections

          Most cited references56

          • Record: found
          • Abstract: found
          • Article: not found

          STAR: ultrafast universal RNA-seq aligner.

          Accurate alignment of high-throughput RNA-seq data is a challenging and yet unsolved problem because of the non-contiguous transcript structure, relatively short read lengths and constantly increasing throughput of the sequencing technologies. Currently available RNA-seq aligners suffer from high mapping error rates, low mapping speed, read length limitation and mapping biases. To align our large (>80 billon reads) ENCODE Transcriptome RNA-seq dataset, we developed the Spliced Transcripts Alignment to a Reference (STAR) software based on a previously undescribed RNA-seq alignment algorithm that uses sequential maximum mappable seed search in uncompressed suffix arrays followed by seed clustering and stitching procedure. STAR outperforms other aligners by a factor of >50 in mapping speed, aligning to the human genome 550 million 2 × 76 bp paired-end reads per hour on a modest 12-core server, while at the same time improving alignment sensitivity and precision. In addition to unbiased de novo detection of canonical junctions, STAR can discover non-canonical splices and chimeric (fusion) transcripts, and is also capable of mapping full-length RNA sequences. Using Roche 454 sequencing of reverse transcription polymerase chain reaction amplicons, we experimentally validated 1960 novel intergenic splice junctions with an 80-90% success rate, corroborating the high precision of the STAR mapping strategy. STAR is implemented as a standalone C++ code. STAR is free open source software distributed under GPLv3 license and can be downloaded from http://code.google.com/p/rna-star/.
            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Regression Shrinkage and Selection Via the Lasso

              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              StringTie enables improved reconstruction of a transcriptome from RNA-seq reads.

              Methods used to sequence the transcriptome often produce more than 200 million short sequences. We introduce StringTie, a computational method that applies a network flow algorithm originally developed in optimization theory, together with optional de novo assembly, to assemble these complex data sets into transcripts. When used to analyze both simulated and real data sets, StringTie produces more complete and accurate reconstructions of genes and better estimates of expression levels, compared with other leading transcript assembly programs including Cufflinks, IsoLasso, Scripture and Traph. For example, on 90 million reads from human blood, StringTie correctly assembled 10,990 transcripts, whereas the next best assembly was of 7,187 transcripts by Cufflinks, which is a 53% increase in transcripts assembled. On a simulated data set, StringTie correctly assembled 7,559 transcripts, which is 20% more than the 6,310 assembled by Cufflinks. As well as producing a more complete transcriptome assembly, StringTie runs faster on all data sets tested to date compared with other assembly software, including Cufflinks.
                Bookmark

                Author and article information

                Contributors
                Journal
                Front Mol Biosci
                Front Mol Biosci
                Front. Mol. Biosci.
                Frontiers in Molecular Biosciences
                Frontiers Media S.A.
                2296-889X
                29 August 2022
                2022
                : 9
                : 916128
                Affiliations
                [1] 1 Bioinformatics , Department of Physics, Chemistry and Biology , Linköping University , Linköping, Sweden
                [2] 2 Department of Applied Chemistry , College of Applied Sciences , Kyung Hee University , Yong-in, South Korea
                [3] 3 Department of Biomedical and Clinical Sciences , Linköping University , Linköping, Sweden
                [4] 4 Department of Neurology , Institute for Cell Engineering , Johns Hopkins University School of Medicine , Baltimore, MD, United States
                [5] 5 Centre for Personalised Medicine , Linköping University , Linköping, Sweden
                [6] 6 Navarrabiomed , Complejo Hospitalario de Navarra , Universidad Pública de Navarra , IdiSNA , Pamplona, Spain
                [7] 7 Department of Clinical Neuroscience , Center for Molecular Medicine , Karolinska Institute , Stockholm, Sweden
                [8] 8 Biological and Environmental Sciences and Engineering Division , Computer, Electrical and Mathematical Sciences and Engineering Division , King Abdullah University of Science and Technology (KAUST) , Thuwal, Saudi Arabia
                [9] 9 Unit of Computational Medicine , Department of Medicine, Solna , Center for Molecular Medicine , Karolinska Institutet , Solna, Sweden
                [10] 10 Science for Life Laboratory , Solna, Sweden
                [11] 11 Department of Neurology , Linköping University , Linköping, Sweden
                [12] 12 Department of Biomedical and Clinical Sciences , Linköping University , Linköping, Sweden
                [13] 13 Department of Automatic Control , Linköping University , Linköping, Sweden
                [14] 14 Department of Clinical Immunology and Transfusion Medicine , Linköping University , Linköping, Sweden
                [15] 15 Department of New Biology , Daegu Gyeongbuk Institute of Science and Technology , Daegu, South Korea
                Author notes

                Edited by: Sergio Oller Moreno, University Medical Center Hamburg-Eppendorf, Germany

                Reviewed by: Andre Kahles, ETH Zürich, Switzerland

                Timuçin Avşar, Bahçeşehir University, Turkey

                *Correspondence: Maria C. Jenmalm, maria.jenmalm@ 123456liu.se ; Mika Gustafsson, mika.gustafsson@ 123456liu.se
                [ † ]

                These authors have contributed equally to this work and share first authorship

                This article was submitted to Metabolomics, a section of the journal Frontiers in Molecular Biosciences

                Article
                916128
                10.3389/fmolb.2022.916128
                9465313
                36106020
                aa96e5aa-7f17-4df7-8f0c-41fafcfa07df
                Copyright © 2022 Magnusson, Rundquist, Kim, Hellberg, Na, Benson, Gomez-Cabrero, Kockum, Tegnér, Piehl, Jagodic, Mellergård, Altafini, Ernerudh, Jenmalm, Nestor, Kim and Gustafsson.

                This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

                History
                : 08 April 2022
                : 25 July 2022
                Categories
                Molecular Biosciences
                Original Research

                proteomics,rna-seq,t-cell differentiation,biomarkers,multiple sclerosis

                Comments

                Comment on this article