Inviting an author to review:
Find an author and click ‘Invite to review selected article’ near their name.
Search for authorsSearch for similar articles
1
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: not found

      ACTIVA: realistic single-cell RNA-seq generation with automatic cell-type identification using introspective variational autoencoders

      1 , 2 , 2 , 3 , 1 , 2 , 4 , 1 , 2
      Bioinformatics
      Oxford University Press (OUP)

      Read this article at

      ScienceOpenPublisherPMC
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Motivation

          Single-cell RNA sequencing (scRNAseq) technologies allow for measurements of gene expression at a single-cell resolution. This provides researchers with a tremendous advantage for detecting heterogeneity, delineating cellular maps or identifying rare subpopulations. However, a critical complication remains: the low number of single-cell observations due to limitations by rarity of subpopulation, tissue degradation or cost. This absence of sufficient data may cause inaccuracy or irreproducibility of downstream analysis. In this work, we present Automated Cell-Type-informed Introspective Variational Autoencoder (ACTIVA): a novel framework for generating realistic synthetic data using a single-stream adversarial variational autoencoder conditioned with cell-type information. Within a single framework, ACTIVA can enlarge existing datasets and generate specific subpopulations on demand, as opposed to two separate models [such as single-cell GAN (scGAN) and conditional scGAN (cscGAN)]. Data generation and augmentation with ACTIVA can enhance scRNAseq pipelines and analysis, such as benchmarking new algorithms, studying the accuracy of classifiers and detecting marker genes. ACTIVA will facilitate analysis of smaller datasets, potentially reducing the number of patients and animals necessary in initial studies.

          Results

          We train and evaluate models on multiple public scRNAseq datasets. In comparison to GAN-based models (scGAN and cscGAN), we demonstrate that ACTIVA generates cells that are more realistic and harder for classifiers to identify as synthetic which also have better pair-wise correlation between genes. Data augmentation with ACTIVA significantly improves classification of rare subtypes (more than 45% improvement compared with not augmenting and 4% better than cscGAN) all while reducing run-time by an order of magnitude in comparison to both models.

          Availability and implementation

          The codes and datasets are hosted on Zenodo (https://doi.org/10.5281/zenodo.5879639). Tutorials are available at https://github.com/SindiLab/ACTIVA.

          Supplementary information

          Supplementary data are available at Bioinformatics online.

          Related collections

          Author and article information

          Contributors
          Journal
          Bioinformatics
          Oxford University Press (OUP)
          1367-4803
          1460-2059
          April 15 2022
          April 12 2022
          February 18 2022
          April 15 2022
          April 12 2022
          February 18 2022
          : 38
          : 8
          : 2194-2201
          Affiliations
          [1 ]Department of Applied Mathematics, University of California, Merced, CA 95343, USA
          [2 ]Health Sciences Research Institute, University of California, Merced, CA 95343, USA
          [3 ]Quantitative and Systems Biology Graduate Program, University of California, Merced, CA 95343, USA
          [4 ]Department of Molecular and Cell Biology, University of California, Merced, CA 95343, USA
          Article
          10.1093/bioinformatics/btac095
          9004654
          35179571
          c941a530-d148-468a-a225-a7e9730fec8d
          © 2022

          https://academic.oup.com/journals/pages/open_access/funder_policies/chorus/standard_publication_model

          History

          Comments

          Comment on this article