21
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      CaSTLe – Classification of single cells by transfer learning: Harnessing the power of publicly available single cell RNA sequencing experiments to annotate new experiments

      research-article
      1 , 2 , * , 2 , 1 , 2 , *
      PLoS ONE
      Public Library of Science

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Single-cell RNA sequencing (scRNA-seq) is an emerging technology for profiling the gene expression of thousands of cells at the single cell resolution. Currently, the labeling of cells in an scRNA-seq dataset is performed by manually characterizing clusters of cells or by fluorescence-activated cell sorting (FACS). Both methods have inherent drawbacks: The first depends on the clustering algorithm used and the knowledge and arbitrary decisions of the annotator, and the second involves an experimental step in addition to the sequencing and cannot be incorporated into the higher throughput scRNA-seq methods. We therefore suggest a different approach for cell labeling, namely, classifying cells from scRNA-seq datasets by using a model transferred from different (previously labeled) datasets. This approach can complement existing methods, and–in some cases–even replace them. Such a transfer-learning framework requires selecting informative features and training a classifier. The specific implementation for the framework that we propose, designated ''CaSTLe–classification of single cells by transfer learning,'' is based on a robust feature engineering workflow and an XGBoost classification model built on these features. Evaluation of CaSTLe against two benchmark feature-selection and classification methods showed that it outperformed the benchmark methods in most cases and yielded satisfactory classification accuracy in a consistent manner. CaSTLe has the additional advantage of being parallelizable and well suited to large datasets. We showed that it was possible to classify cell types using transfer learning, even when the databases contained a very small number of genes, and our study thus indicates the potential applicability of this approach for analysis of scRNA-seq datasets.

          Related collections

          Most cited references12

          • Record: found
          • Abstract: found
          • Article: not found

          A single-cell resolution map of mouse hematopoietic stem and progenitor cell differentiation.

          Maintenance of the blood system requires balanced cell fate decisions by hematopoietic stem and progenitor cells (HSPCs). Because cell fate choices are executed at the individual cell level, new single-cell profiling technologies offer exciting possibilities for mapping the dynamic molecular changes underlying HSPC differentiation. Here, we have used single-cell RNA sequencing to profile more than 1600 single HSPCs, and deep sequencing has enabled detection of an average of 6558 protein-coding genes per cell. Index sorting, in combination with broad sorting gates, allowed us to retrospectively assign cells to 12 commonly sorted HSPC phenotypes while also capturing intermediate cells typically excluded by conventional gating. We further show that independently generated single-cell data sets can be projected onto the single-cell resolution expression map to directly compare data from multiple groups and to build and refine new hypotheses. Reconstruction of differentiation trajectories reveals dynamic expression changes associated with early lymphoid, erythroid, and granulocyte-macrophage differentiation. The latter two trajectories were characterized by common upregulation of cell cycle and oxidative phosphorylation transcriptional programs. By using external spike-in controls, we estimate absolute messenger RNA (mRNA) levels per cell, showing for the first time that despite a general reduction in total mRNA, a subset of genes shows higher expression levels in immature stem cells consistent with active maintenance of the stem-cell state. Finally, we report the development of an intuitive Web interface as a new community resource to permit visualization of gene expression in HSPCs at single-cell resolution for any gene of choice.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            Defining cell types and states with single-cell genomics

            A revolution in cellular measurement technology is under way: For the first time, we have the ability to monitor global gene regulation in thousands of individual cells in a single experiment. Such experiments will allow us to discover new cell types and states and trace their developmental origins. They overcome fundamental limitations inherent in measurements of bulk cell population that have frustrated efforts to resolve cellular states. Single-cell genomics and proteomics enable not only precise characterization of cell state, but also provide a stunningly high-resolution view of transitions between states. These measurements may finally make explicit the metaphor that C.H. Waddington posed nearly 60 years ago to explain cellular plasticity: Cells are residents of a vast “landscape” of possible states, over which they travel during development and in disease. Single-cell technology helps not only locate cells on this landscape, but illuminates the molecular mechanisms that shape the landscape itself. However, single-cell genomics is a field in its infancy, with many experimental and computational advances needed to fully realize its full potential.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              RNA Sequencing of Single Human Islet Cells Reveals Type 2 Diabetes Genes.

              Pancreatic islet cells are critical for maintaining normal blood glucose levels, and their malfunction underlies diabetes development and progression. We used single-cell RNA sequencing to determine the transcriptomes of 1,492 human pancreatic α, β, δ, and PP cells from non-diabetic and type 2 diabetes organ donors. We identified cell-type-specific genes and pathways as well as 245 genes with disturbed expression in type 2 diabetes. Importantly, 92% of the genes have not previously been associated with islet cell function or growth. Comparison of gene profiles in mouse and human α and β cells revealed species-specific expression. All data are available for online browsing and download and will hopefully serve as a resource for the islet research community.
                Bookmark

                Author and article information

                Contributors
                Role: ConceptualizationRole: Data curationRole: Formal analysisRole: InvestigationRole: MethodologyRole: SoftwareRole: VisualizationRole: Writing – original draftRole: Writing – review & editing
                Role: ConceptualizationRole: Formal analysisRole: MethodologyRole: SupervisionRole: ValidationRole: VisualizationRole: Writing – review & editing
                Role: ConceptualizationRole: Formal analysisRole: Funding acquisitionRole: MethodologyRole: Project administrationRole: ResourcesRole: SupervisionRole: Writing – review & editing
                Role: Editor
                Journal
                PLoS One
                PLoS ONE
                plos
                plosone
                PLoS ONE
                Public Library of Science (San Francisco, CA USA )
                1932-6203
                10 October 2018
                2018
                : 13
                : 10
                : e0205499
                Affiliations
                [1 ] Department of Life Sciences, Ben-Gurion University of the Negev, Beer-Sheva, Israel
                [2 ] Department of Software and Information Systems Engineering, Ben-Gurion University of the Negev, Beer-Sheva, Israel
                Universitatsmedizin Greifswald, GERMANY
                Author notes

                Competing Interests: The authors have declared that no competing interests exist.

                Author information
                http://orcid.org/0000-0001-8346-3542
                Article
                PONE-D-18-03179
                10.1371/journal.pone.0205499
                6179251
                30304022
                17c4404e-6893-44f4-aca3-b85fd9adc356
                © 2018 Lieberman et al

                This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

                History
                : 30 January 2018
                : 26 September 2018
                Page count
                Figures: 3, Tables: 4, Pages: 16
                Funding
                Funded by: funder-id http://dx.doi.org/10.13039/501100003977, Israel Science Foundation;
                Award ID: 500/15
                Award Recipient :
                Funded by: Broad-Israel Science Foundation
                Award ID: 1644/15
                Award Recipient :
                Funded by: funder-id http://dx.doi.org/10.13039/100000002, National Institutes of Health;
                Award ID: Grants R24 AI072073
                Award Recipient :
                This work was supported by Israel Science Foundation Grant 500/15, Broad-Israel Science Foundation Grant 1644/15 and the National Institutes of Health Grants R24 AI072073 to TS and YL. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
                Categories
                Research Article
                Biology and Life Sciences
                Anatomy
                Ocular System
                Ocular Anatomy
                Retina
                Medicine and Health Sciences
                Anatomy
                Ocular System
                Ocular Anatomy
                Retina
                Biology and Life Sciences
                Developmental Biology
                Embryology
                Embryos
                Biology and Life Sciences
                Genetics
                Gene Expression
                Biology and Life Sciences
                Anatomy
                Endocrine System
                Pancreas
                Medicine and Health Sciences
                Anatomy
                Endocrine System
                Pancreas
                Biology and Life Sciences
                Anatomy
                Exocrine Glands
                Pancreas
                Medicine and Health Sciences
                Anatomy
                Exocrine Glands
                Pancreas
                Computer and Information Sciences
                Software Engineering
                Preprocessing
                Engineering and Technology
                Software Engineering
                Preprocessing
                Biology and life sciences
                Molecular biology
                Molecular biology techniques
                Sequencing techniques
                RNA sequencing
                Research and analysis methods
                Molecular biology techniques
                Sequencing techniques
                RNA sequencing
                Biology and Life Sciences
                Cell Biology
                Cell Processes
                Cell Cycle and Cell Division
                Computer and Information Sciences
                Artificial Intelligence
                Machine Learning
                Custom metadata
                Source code of the method is available from GitHub: https://github.com/yuvallb/CaSTLe. Validation of the method was done on data available from NCBI GEO and ArrayExpress - full references included in the article.

                Uncategorized
                Uncategorized

                Comments

                Comment on this article