5
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      End Sequence Analysis Toolkit (ESAT) expands the extractable information from single-cell RNA-seq data

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          RNA-seq protocols that focus on transcript termini are well suited for applications in which template quantity is limiting. Here we show that, when applied to end-sequencing data, analytical methods designed for global RNA-seq produce computational artifacts. To remedy this, we created the End Sequence Analysis Toolkit (ESAT). As a test, we first compared end-sequencing and bulk RNA-seq using RNA from dendritic cells stimulated with lipopolysaccharide (LPS). As predicted by the telescripting model for transcriptional bursts, ESAT detected an LPS-stimulated shift to shorter 3′-isoforms that was not evident by conventional computational methods. Then, droplet-based microfluidics was used to generate 1000 cDNA libraries, each from an individual pancreatic islet cell. ESAT identified nine distinct cell types, three distinct β-cell types, and a complex interplay between hormone secretion and vascularization. ESAT, then, offers a much-needed and generally applicable computational pipeline for either bulk or single-cell RNA end-sequencing.

          Related collections

          Most cited references61

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2

          In comparative high-throughput sequencing assays, a fundamental task is the analysis of count data, such as read counts per gene in RNA-seq, for evidence of systematic changes across experimental conditions. Small replicate numbers, discreteness, large dynamic range and the presence of outliers require a suitable statistical approach. We present DESeq2, a method for differential analysis of count data, using shrinkage estimation for dispersions and fold changes to improve stability and interpretability of estimates. This enables a more quantitative analysis focused on the strength rather than the mere presence of differential expression. The DESeq2 package is available at http://www.bioconductor.org/packages/release/bioc/html/DESeq2.html. Electronic supplementary material The online version of this article (doi:10.1186/s13059-014-0550-8) contains supplementary material, which is available to authorized users.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources.

            DAVID bioinformatics resources consists of an integrated biological knowledgebase and analytic tools aimed at systematically extracting biological meaning from large gene/protein lists. This protocol explains how to use DAVID, a high-throughput and integrated data-mining environment, to analyze gene lists derived from high-throughput genomic experiments. The procedure first requires uploading a gene list containing any number of common gene identifiers followed by analysis using one or more text and pathway-mining tools such as gene functional classification, functional annotation chart or clustering and functional annotation table. By following this protocol, investigators are able to gain an in-depth understanding of the biological themes in lists of genes that are enriched in genome-scale studies.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome

              Background RNA-Seq is revolutionizing the way transcript abundances are measured. A key challenge in transcript quantification from RNA-Seq data is the handling of reads that map to multiple genes or isoforms. This issue is particularly important for quantification with de novo transcriptome assemblies in the absence of sequenced genomes, as it is difficult to determine which transcripts are isoforms of the same gene. A second significant issue is the design of RNA-Seq experiments, in terms of the number of reads, read length, and whether reads come from one or both ends of cDNA fragments. Results We present RSEM, an user-friendly software package for quantifying gene and isoform abundances from single-end or paired-end RNA-Seq data. RSEM outputs abundance estimates, 95% credibility intervals, and visualization files and can also simulate RNA-Seq data. In contrast to other existing tools, the software does not require a reference genome. Thus, in combination with a de novo transcriptome assembler, RSEM enables accurate transcript quantification for species without sequenced genomes. On simulated and real data sets, RSEM has superior or comparable performance to quantification methods that rely on a reference genome. Taking advantage of RSEM's ability to effectively use ambiguously-mapping reads, we show that accurate gene-level abundance estimates are best obtained with large numbers of short single-end reads. On the other hand, estimates of the relative frequencies of isoforms within single genes may be improved through the use of paired-end reads, depending on the number of possible splice forms for each gene. Conclusions RSEM is an accurate and user-friendly software tool for quantifying transcript abundances from RNA-Seq data. As it does not rely on the existence of a reference genome, it is particularly useful for quantification with de novo transcriptome assemblies. In addition, RSEM has enabled valuable guidance for cost-efficient design of quantification experiments with RNA-Seq, which is currently relatively expensive.
                Bookmark

                Author and article information

                Journal
                Genome Res
                Genome Res
                genome
                genome
                GENOME
                Genome Research
                Cold Spring Harbor Laboratory Press
                1088-9051
                1549-5469
                October 2016
                October 2016
                : 26
                : 10
                : 1397-1410
                Affiliations
                [1 ]Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, Massachusetts 01655, USA;
                [2 ]Program in Molecular Medicine, Diabetes Center of Excellence, University of Massachusetts Medical School, Worcester, Massachusetts 01655, USA;
                [3 ]Department of System Biology, Harvard Medical School, Boston, Massachusetts 02115, USA;
                [4 ]Institute of Biotechnology, Vilnius University, LT 02241 Vilnius, Lithuania;
                [5 ]Computer Technologies Department, ITMO University, Saint Petersburg, 197101, Russia;
                [6 ]Department of Pathology and Immunology, Washington University in St. Louis, St. Louis, Missouri 63110, USA;
                [7 ]Department of Medicine, Diabetes Center of Excellence, University of Massachusetts Medical School, Worcester, Massachusetts 01655, USA;
                [8 ]Program in Molecular Medicine, University of Massachusetts Medical School, Worcester, Massachusetts 01655, USA;
                [9 ]Biological Chemistry Department, Silberman Institute of Life Sciences, The Hebrew University of Jerusalem, Jerusalem, 91904, Israel
                Author notes
                [10]

                These authors contributed equally to this work.

                Author information
                http://orcid.org/0000-0001-8732-1293
                Article
                9509184
                10.1101/gr.207902.116
                5052061
                27470110
                c69f5dde-3d61-46f1-bc6d-849b9ca90d0e
                © 2016 Derr et al.; Published by Cold Spring Harbor Laboratory Press

                This article, published in Genome Research, is available under a Creative Commons License (Attribution 4.0 International), as described at http://creativecommons.org/licenses/by/4.0/.

                History
                : 4 April 2016
                : 27 July 2016
                Page count
                Pages: 14
                Funding
                Funded by: National Institutes of Health (NIH) http://dx.doi.org/10.13039/100000002
                Award ID: DK105837
                Award ID: AI119885
                Funded by: NIH http://dx.doi.org/10.13039/100000002
                Funded by: National Human Genome Research Institute http://dx.doi.org/10.13039/100000051
                Award ID: U01 HG007910
                Funded by: NIH http://dx.doi.org/10.13039/100000002
                Funded by: National Institute on Drug Abuse http://dx.doi.org/10.13039/100000026
                Award ID: DP1DA034990
                Funded by: NIH http://dx.doi.org/10.13039/100000002
                Funded by: Human Islet Research Network
                Award ID: UC4DK104218
                Categories
                Method

                Comments

                Comment on this article