12
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Comparison of clustering tools in R for medium-sized 10x Genomics single-cell RNA-sequencing data

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Background: The commercially available 10x Genomics protocol to generate droplet-based single-cell RNA-seq (scRNA-seq) data is enjoying growing popularity among researchers. Fundamental to the analysis of such scRNA-seq data is the ability to cluster similar or same cells into non-overlapping groups. Many competing methods have been proposed for this task, but there is currently little guidance with regards to which method to use.

          Methods: Here we use one gold standard 10x Genomics dataset, generated from the mixture of three cell lines, as well as three silver standard 10x Genomics datasets generated from peripheral blood mononuclear cells to examine not only the accuracy but also robustness of a dozen methods.

          Results: We found that some methods, including Seurat and Cell Ranger, outperform other methods, although performance seems to be dependent on the complexity of the studied system. Furthermore, we found that solutions produced by different methods have little in common with each other.

          Conclusions: In light of this, we conclude that the choice of clustering tool crucially determines interpretation of scRNA-seq data generated by 10x Genomics. Hence practitioners and consumers should remain vigilant about the outcome of 10x Genomics scRNA-seq analysis.

          Related collections

          Most cited references14

          • Record: found
          • Abstract: found
          • Article: not found

          Comparative Analysis of Single-Cell RNA Sequencing Methods.

          Single-cell RNA sequencing (scRNA-seq) offers new possibilities to address biological and medical questions. However, systematic comparisons of the performance of diverse scRNA-seq protocols are lacking. We generated data from 583 mouse embryonic stem cells to evaluate six prominent scRNA-seq methods: CEL-seq2, Drop-seq, MARS-seq, SCRB-seq, Smart-seq, and Smart-seq2. While Smart-seq2 detected the most genes per cell and across cells, CEL-seq2, Drop-seq, MARS-seq, and SCRB-seq quantified mRNA levels with less amplification noise due to the use of unique molecular identifiers (UMIs). Power simulations at different sequencing depths showed that Drop-seq is more cost-efficient for transcriptome quantification of large numbers of cells, while MARS-seq, SCRB-seq, and Smart-seq2 are more efficient when analyzing fewer cells. Our quantitative comparison offers the basis for an informed choice among six prominent scRNA-seq methods, and it provides a framework for benchmarking further improvements of scRNA-seq protocols.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            A practical guide to single-cell RNA-sequencing for biomedical research and clinical applications

            RNA sequencing (RNA-seq) is a genomic approach for the detection and quantitative analysis of messenger RNA molecules in a biological sample and is useful for studying cellular responses. RNA-seq has fueled much discovery and innovation in medicine over recent years. For practical reasons, the technique is usually conducted on samples comprising thousands to millions of cells. However, this has hindered direct assessment of the fundamental unit of biology—the cell. Since the first single-cell RNA-sequencing (scRNA-seq) study was published in 2009, many more have been conducted, mostly by specialist laboratories with unique skills in wet-lab single-cell genomics, bioinformatics, and computation. However, with the increasing commercial availability of scRNA-seq platforms, and the rapid ongoing maturation of bioinformatics approaches, a point has been reached where any biomedical researcher or clinician can use scRNA-seq to make exciting discoveries. In this review, we present a practical guide to help researchers design their first scRNA-seq studies, including introductory information on experimental hardware, protocol choice, quality control, data analysis and biological interpretation.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              CIDR: Ultrafast and accurate clustering through imputation for single-cell RNA-seq data

              Most existing dimensionality reduction and clustering packages for single-cell RNA-seq (scRNA-seq) data deal with dropouts by heavy modeling and computational machinery. Here, we introduce CIDR (Clustering through Imputation and Dimensionality Reduction), an ultrafast algorithm that uses a novel yet very simple implicit imputation approach to alleviate the impact of dropouts in scRNA-seq data in a principled manner. Using a range of simulated and real data, we show that CIDR improves the standard principal component analysis and outperforms the state-of-the-art methods, namely t-SNE, ZIFA, and RaceID, in terms of clustering accuracy. CIDR typically completes within seconds when processing a data set of hundreds of cells and minutes for a data set of thousands of cells. CIDR can be downloaded at https://github.com/VCCRI/CIDR. Electronic supplementary material The online version of this article (doi:10.1186/s13059-017-1188-0) contains supplementary material, which is available to authorized users.
                Bookmark

                Author and article information

                Contributors
                Role: ConceptualizationRole: Data CurationRole: Formal AnalysisRole: Funding AcquisitionRole: InvestigationRole: MethodologyRole: SoftwareRole: VisualizationRole: Writing – Original Draft PreparationRole: Writing – Review & Editing
                Role: InvestigationRole: Writing – Review & Editing
                Role: ConceptualizationRole: InvestigationRole: MethodologyRole: Writing – Review & Editing
                Role: ConceptualizationRole: Funding AcquisitionRole: InvestigationRole: MethodologyRole: Writing – Review & Editing
                Role: ConceptualizationRole: InvestigationRole: MethodologyRole: Project AdministrationRole: SupervisionRole: Writing – Review & Editing
                Journal
                F1000Res
                F1000Res
                F1000Research
                F1000Research
                F1000 Research Limited (London, UK )
                2046-1402
                15 August 2018
                2018
                : 7
                : 1297
                Affiliations
                [1 ]Population Health and Immunity, Walter and Eliza Hall Institute of Medical Research, Parkville, Australia
                [2 ]Department of Medical Biology, University of Melbourne, Parkville, Australia
                [3 ]Molecular Medicine Division, Walter and Eliza Hall Institute of Medical Research, Parkville, Australia
                [4 ]Bio21 Insititute, CSL Limited, Parkville, Australia
                [1 ]Johns Hopkins Bloomberg School of Public Health (JHSPH), Baltimore, MD, USA
                [1 ]School of Mathematics and Statistics, University of Sydney, Sydney, NSW, Australia
                [1 ]Victor Chang Cardiac Research Institute (VCCRI), Darlinghurst, NSW, Australia
                Author notes

                Freytag S: Conceptualization, Data Curation, Funding Acquisition, Formal Analysis, Investigation, Methodology, Software, Visualization, Writing – Original Draft Preparation, Writing – Review & Editing

                Tian L: Investigation, Writing – Review & Editing

                Lönnstedt I: Conceptualization, Methodology, Writing – Review & Editing

                NG M: Conceptualization, Investigation, Funding Acquisition, Methodology, Writing – Review & Editing

                Bahlo M: Supervision Conceptualization, Investigation, Funding Acquisition, Methodology, Writing – Review & Editing

                No competing interests were disclosed.

                Competing interests: No competing interests were disclosed.

                Competing interests: No competing interests were disclosed.

                Competing interests: No competing interests were disclosed.

                Author information
                https://orcid.org/0000-0002-2185-7068
                https://orcid.org/0000-0001-5132-0774
                Article
                10.12688/f1000research.15809.1
                6124389
                30228881
                0044cf71-b03a-472f-98bc-c922d46ebe92
                Copyright: © 2018 Freytag S et al.

                This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

                History
                : 7 August 2018
                Funding
                Funded by: National Health and Medical Research Council
                Award ID: IRIIS
                Award ID: ProgramGrant1054618
                Award ID: SeniorResearchFellowship110297
                We would like to thank the Australian Genome Research Facility and the Genomics Innovation Hub for their generous support of this project, including funding. This work was also supported by the Victorian Government’s Operational Infrastructure Support Program and Australian Government NHMRC IRIIS. MB is funded by NHMRC Senior Research Fellowship 110297 and NHMRC Program Grant 1054618.
                The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
                Categories
                Research Article
                Articles

                Comments

                Comment on this article