7
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Detecting differential alternative splicing events in scRNA-seq with or without Unique Molecular Identifiers

      research-article
      1 , 2 , 3 , 1 , *
      PLoS Computational Biology
      Public Library of Science

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          The emergence of single-cell RNA-seq (scRNA-seq) technology has made it possible to measure gene expression variations at cellular level. This breakthrough enables the investigation of a wider range of problems including analysis of splicing heterogeneity among individual cells. However, compared to bulk RNA-seq, scRNA-seq data are much noisier due to high technical variability and low sequencing depth. Here we propose SCATS (Single-Cell Analysis of Transcript Splicing) for differential splicing analysis in scRNA-seq, which achieves high sensitivity at low coverage by accounting for technical noise. SCATS models scRNA-seq data either with or without Unique Molecular Identifiers (UMIs). For non-UMI data, SCATS explicitly models technical noise by accounting for capture efficiency and amplification bias through the use of external spike-ins; for UMI data, SCATS models capture efficiency and further accounts for transcriptional burstiness. A key aspect of SCATS lies in its ability to group “exons” that originate from the same isoform(s). Grouping exons is essential in splicing analysis of scRNA-seq data as it naturally aggregates spliced reads across different exons, making it possible to detect splicing events even when sequencing depth is low. To evaluate the performance of SCATS, we analyzed both simulated and real scRNA-seq datasets and compared with existing methods including Census and DEXSeq. We show that SCATS has well controlled type I error rate, and is more powerful than existing methods, especially when splicing difference is small. In contrast, Census suffers from severe type I error inflation, whereas DEXSeq is more conservative. When applied to mouse brain scRNA-seq datasets, SCATS identified more differential splicing events with subtle difference across cell types compared to Census and DEXSeq. With the increasing adoption of scRNA-seq, we believe SCATS will be well-suited for various splicing studies. The implementation of SCATS can be downloaded from https://github.com/huyustats/SCATS.

          Author summary

          Alternative splicing is a major mechanism for generating transcriptome diversity. However, few published scRNA-seq studies have investigated alternative splicing, and even when studied, methods developed for bulk RNA-seq were utilized. Compared to bulk RNA-seq, scRNA-seq data are much noisier due to high technical variability and low sequencing depth. Methods developed for bulk RNA-seq may not be optimal when analyzing data generated from scRNA-seq experiments. To fill in this gap, we developed SCATS, an open-source software package, which allows analysis of scRNA-seq data with or without Unique Molecular Identifiers (UMIs). SCATS is able to detect splicing events even when sequencing depth is low. When applied to mouse brain scRNA-seq datasets, SCATS identified more differential splicing events with subtle differences across cortical cell types than Census and DEXSeq. Additionally, SCATS accurately characterized splicing heterogeneity across cortical cell types, which was further confirmed by qRT-PCR measurements. Our study highlights the benefit of SCATS for elucidating splicing heterogeneity across cells in scRNA-seq data.

          Related collections

          Most cited references9

          • Record: found
          • Abstract: found
          • Article: not found

          Single-cell transcriptomics reveals bimodality in expression and splicing in immune cells

          Recent molecular studies have revealed that, even when derived from a seemingly homogenous population, individual cells can exhibit substantial differences in gene expression, protein levels, and phenotypic output 1–5 , with important functional consequences 4,5 . Existing studies of cellular heterogeneity, however, have typically measured only a few pre-selected RNAs 1,2 or proteins 5,6 simultaneously because genomic profiling methods 3 could not be applied to single cells until very recently 7–10 . Here, we use single-cell RNA-Seq to investigate heterogeneity in the response of bone marrow derived dendritic cells (BMDCs) to lipopolysaccharide (LPS). We find extensive, and previously unobserved, bimodal variation in mRNA abundance and splicing patterns, which we validate by RNA-fluorescence in situ hybridization (RNA-FISH) for select transcripts. In particular, hundreds of key immune genes are bimodally expressed across cells, surprisingly even for genes that are very highly expressed at the population average. Moreover, splicing patterns demonstrate previously unobserved levels of heterogeneity between cells. Some of the observed bimodality can be attributed to closely related, yet distinct, known maturity states of BMDCs; other portions reflect differences in the usage of key regulatory circuits. For example, we identify a module of 137 highly variable, yet co-regulated, antiviral response genes. Using cells from knockout mice, we show that variability in this module may be propagated through an interferon feedback circuit involving the transcriptional regulators Stat2 and Irf7. Our study demonstrates the power and promise of single-cell genomics in uncovering functional diversity between cells and in deciphering cell states and circuits.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Single-Cell Alternative Splicing Analysis with Expedition Reveals Splicing Dynamics during Neuron Differentiation

            Alternative splicing (AS) generates isoform diversity for cellular identity and homeostasis in multicellular life. Although AS variation has been observed among single cells, little is known about the biological or evolutionary significance of such variation. We developed Expedition , a computational framework consisting of outrigger , a de novo splice graph transversal algorithm to detect AS; anchor , a Bayesian approach to assign modalities and bonvoyage , a visualization tool using non-negative matrix factorization to display modality changes. Applying Expedition to single pluripotent stem cells undergoing neuronal differentiation, we discover that up to 20% of AS exons exhibit bimodality. Bimodal exons are flanked by more conserved intronic sequences harboring distinct cis -regulatory motifs, constitute much of cell-type specific splicing, are highly dynamic during cellular transitions, preserve reading frame and reveal intricacy of cell states invisible to conventional gene expression analysis. Systematic AS characterization in single cells redefines our understanding of AS complexity in cell biology.
              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              A discriminative learning approach to differential expression analysis for single-cell RNA-seq

                Bookmark

                Author and article information

                Contributors
                Role: Formal analysisRole: InvestigationRole: MethodologyRole: SoftwareRole: ValidationRole: VisualizationRole: Writing – original draftRole: Writing – review & editing
                Role: Writing – review & editing
                Role: ConceptualizationRole: Funding acquisitionRole: InvestigationRole: MethodologyRole: Project administrationRole: ResourcesRole: SupervisionRole: Writing – original draftRole: Writing – review & editing
                Role: Editor
                Journal
                PLoS Comput Biol
                PLoS Comput. Biol
                plos
                ploscomp
                PLoS Computational Biology
                Public Library of Science (San Francisco, CA USA )
                1553-734X
                1553-7358
                5 June 2020
                June 2020
                : 16
                : 6
                : e1007925
                Affiliations
                [1 ] Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania Perelman School of Medicine, Philadelphia, Pennsylvania, United States of America
                [2 ] Center for Cellular and Molecular Therapeutics, Children’s Hospital of Philadelphia, Pennsylvania, United States of America
                [3 ] Department of Pathology and Laboratory Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
                Carnegie Mellon University, UNITED STATES
                Author notes

                The authors have declared that no competing interests exist.

                Author information
                http://orcid.org/0000-0003-4163-7166
                http://orcid.org/0000-0002-5585-982X
                http://orcid.org/0000-0001-8180-0621
                Article
                PCOMPBIOL-D-19-02090
                10.1371/journal.pcbi.1007925
                7299405
                32502143
                c3d094d1-a087-4866-a112-267ec9152ea8
                © 2020 Hu et al

                This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

                History
                : 2 December 2019
                : 4 May 2020
                Page count
                Figures: 4, Tables: 0, Pages: 19
                Funding
                Funded by: funder-id http://dx.doi.org/10.13039/100000002, National Institutes of Health;
                Award ID: NIH R01GM108600
                Award Recipient :
                Funded by: funder-id http://dx.doi.org/10.13039/100000002, National Institutes of Health;
                Award ID: R01GM125301
                Award Recipient :
                Funded by: funder-id http://dx.doi.org/10.13039/100000002, National Institutes of Health;
                Award ID: R01HL113147
                Award Recipient :
                Funded by: funder-id http://dx.doi.org/10.13039/100000002, National Institutes of Health;
                Award ID: R01EY030192
                Award Recipient :
                This work was supported by the following grants: NIH R01GM108600, R01GM125301, R01HL113147, R01HL150359, and R01EY030192 (to M.L.). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
                Categories
                Research Article
                Research and Analysis Methods
                Research Design
                Survey Research
                Census
                Biology and life sciences
                Genetics
                Gene expression
                RNA processing
                Alternative Splicing
                Biology and life sciences
                Biochemistry
                Nucleic acids
                RNA
                RNA processing
                Alternative Splicing
                Biology and Life Sciences
                Molecular Biology
                Molecular Biology Techniques
                Gene Mapping
                Exon Mapping
                Research and Analysis Methods
                Molecular Biology Techniques
                Gene Mapping
                Exon Mapping
                Biology and Life Sciences
                Genetics
                Gene Expression
                Research and Analysis Methods
                Research Assessment
                Research Errors
                Research and Analysis Methods
                Simulation and Modeling
                Biology and life sciences
                Molecular biology
                Molecular biology techniques
                Sequencing techniques
                RNA sequencing
                Research and analysis methods
                Molecular biology techniques
                Sequencing techniques
                RNA sequencing
                Biology and Life Sciences
                Cell Biology
                Cellular Types
                Animal Cells
                Neurons
                Ganglion Cells
                Pyramidal Cells
                Biology and Life Sciences
                Neuroscience
                Cellular Neuroscience
                Neurons
                Ganglion Cells
                Pyramidal Cells
                Custom metadata
                vor-update-to-uncorrected-proof
                2020-06-17
                All data are available from the GEO database under the following accession numbers: GSE71585: Adult mouse cortical cell taxonomy by single cell transcriptomics (Tasic et al. 2016 Nature Neuroscience) Link: https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE71585 Description: The adult mouse brain scRNA-seq data were acquired from Tasic et al. This data set includes 1,679 cells from 49 cell types (23 GABAergic, 19 glutamatergic and 7 non-neuronal types). GSE60361: Single-cell RNA-seq of mouse cerebral cortex Link: https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE60361 Description: The adult mouse brain scRNA-seq data were generated from two regions of the mouse cerebral cortex: the somatosensory cortex and hippocampus CA1 by Zeisel et al. This data set includes 3,005 cells from 9 cell types.

                Quantitative & Systems biology
                Quantitative & Systems biology

                Comments

                Comment on this article