57
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Scater: pre-processing, quality control, normalization and visualization of single-cell RNA-seq data in R

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Motivation: Single-cell RNA sequencing (scRNA-seq) is increasingly used to study gene expression at the level of individual cells. However, preparing raw sequence data for further analysis is not a straightforward process. Biases, artifacts and other sources of unwanted variation are present in the data, requiring substantial time and effort to be spent on pre-processing, quality control (QC) and normalization.

          Results: We have developed the R/Bioconductor package scater to facilitate rigorous pre-processing, quality control, normalization and visualization of scRNA-seq data. The package provides a convenient, flexible workflow to process raw sequencing reads into a high-quality expression dataset ready for downstream analysis. scater provides a rich suite of plotting tools for single-cell data and a flexible data structure that is compatible with existing tools and can be used as infrastructure for future software development.

          Availability and Implementation: The open-source code, along with installation instructions, vignettes and case studies, is available through Bioconductor at http://bioconductor.org/packages/scater.

          Contact: davis@ 123456ebi.ac.uk

          Supplementary information: Supplementary data are available at Bioinformatics online.

          Related collections

          Most cited references25

          • Record: found
          • Abstract: found
          • Article: not found

          Single-cell transcriptomics reveals bimodality in expression and splicing in immune cells

          Recent molecular studies have revealed that, even when derived from a seemingly homogenous population, individual cells can exhibit substantial differences in gene expression, protein levels, and phenotypic output 1–5 , with important functional consequences 4,5 . Existing studies of cellular heterogeneity, however, have typically measured only a few pre-selected RNAs 1,2 or proteins 5,6 simultaneously because genomic profiling methods 3 could not be applied to single cells until very recently 7–10 . Here, we use single-cell RNA-Seq to investigate heterogeneity in the response of bone marrow derived dendritic cells (BMDCs) to lipopolysaccharide (LPS). We find extensive, and previously unobserved, bimodal variation in mRNA abundance and splicing patterns, which we validate by RNA-fluorescence in situ hybridization (RNA-FISH) for select transcripts. In particular, hundreds of key immune genes are bimodally expressed across cells, surprisingly even for genes that are very highly expressed at the population average. Moreover, splicing patterns demonstrate previously unobserved levels of heterogeneity between cells. Some of the observed bimodality can be attributed to closely related, yet distinct, known maturity states of BMDCs; other portions reflect differences in the usage of key regulatory circuits. For example, we identify a module of 137 highly variable, yet co-regulated, antiviral response genes. Using cells from knockout mice, we show that variability in this module may be propagated through an interferon feedback circuit involving the transcriptional regulators Stat2 and Irf7. Our study demonstrates the power and promise of single-cell genomics in uncovering functional diversity between cells and in deciphering cell states and circuits.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Accounting for technical noise in single-cell RNA-seq experiments.

            Single-cell RNA-seq can yield valuable insights about the variability within a population of seemingly homogeneous cells. We developed a quantitative statistical method to distinguish true biological variability from the high levels of technical noise in single-cell experiments. Our approach quantifies the statistical significance of observed cell-to-cell variability in expression strength on a gene-by-gene basis. We validate our approach using two independent data sets from Arabidopsis thaliana and Mus musculus.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              Classification of low quality cells from single-cell RNA-seq data

              Single-cell RNA sequencing (scRNA-seq) has broad applications across biomedical research. One of the key challenges is to ensure that only single, live cells are included in downstream analysis, as the inclusion of compromised cells inevitably affects data interpretation. Here, we present a generic approach for processing scRNA-seq data and detecting low quality cells, using a curated set of over 20 biological and technical features. Our approach improves classification accuracy by over 30 % compared to traditional methods when tested on over 5,000 cells, including CD4+ T cells, bone marrow dendritic cells, and mouse embryonic stem cells. Electronic supplementary material The online version of this article (doi:10.1186/s13059-016-0888-1) contains supplementary material, which is available to authorized users.
                Bookmark

                Author and article information

                Journal
                Bioinformatics
                Bioinformatics
                bioinformatics
                Bioinformatics
                Oxford University Press
                1367-4803
                1367-4811
                15 April 2017
                14 January 2017
                14 January 2017
                : 33
                : 8
                : 1179-1186
                Affiliations
                [1 ]European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, CB10 1SD Hinxton, Cambridge, UK
                [2 ]Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford OX3 7BN, UK
                [3 ]St Vincent’s Institute of Medical Research, Fitzroy, Victoria 3065, Australia
                [4 ]Department of Physiology, Anatomy and Genetics, University of Oxford, Oxford OX1 3QX, UK
                [5 ]CRUK Cambridge Institute, University of Cambridge, Cambridge CB2 0RE, UK
                [6 ]Weatherall Institute for Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford OX3 9DS, UK
                Author notes
                [* ]To whom correspondence should be addressed.

                Associate Editor: Ivo Hofacker

                Article
                btw777
                10.1093/bioinformatics/btw777
                5408845
                28088763
                f26c247d-d6eb-4285-a4ea-336530c112e4
                © The Author 2017. Published by Oxford University Press.

                This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.

                History
                : 15 August 2016
                : 7 November 2016
                : 7 December 2016
                Page count
                Pages: 8
                Funding
                Funded by: National Health and Medical Research Council of Australia
                Award ID: APP1112
                Award ID: 681 to D.J.M.
                Funded by: European Molecular Biology Laboratory
                Award ID: D.J.M.
                Funded by: Cancer Research UK
                Award ID: A1719
                Award ID: 7 to A.T.L.L.
                Funded by: United Kingdom Medical Research Council
                Award ID: K.R.C.
                Funded by: Oxford Single Cell Biology Consortium
                Award ID: Q.F.W.
                Categories
                Original Papers
                Gene Expression

                Bioinformatics & Computational biology
                Bioinformatics & Computational biology

                Comments

                Comment on this article