175
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      A comparison of methods for differential expression analysis of RNA-seq data

      research-article
      1 , , 1 , 2
      BMC Bioinformatics
      BioMed Central
      Differential expression, Gene expression, RNA-seq

      Read this article at

          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Background

          Finding genes that are differentially expressed between conditions is an integral part of understanding the molecular basis of phenotypic variation. In the past decades, DNA microarrays have been used extensively to quantify the abundance of mRNA corresponding to different genes, and more recently high-throughput sequencing of cDNA (RNA-seq) has emerged as a powerful competitor. As the cost of sequencing decreases, it is conceivable that the use of RNA-seq for differential expression analysis will increase rapidly. To exploit the possibilities and address the challenges posed by this relatively new type of data, a number of software packages have been developed especially for differential expression analysis of RNA-seq data.

          Results

          We conducted an extensive comparison of eleven methods for differential expression analysis of RNA-seq data. All methods are freely available within the R framework and take as input a matrix of counts, i.e. the number of reads mapping to each genomic feature of interest in each of a number of samples. We evaluate the methods based on both simulated data and real RNA-seq data.

          Conclusions

          Very small sample sizes, which are still common in RNA-seq experiments, impose problems for all evaluated methods and any results obtained under such conditions should be interpreted with caution. For larger sample sizes, the methods combining a variance-stabilizing transformation with the ‘limma’ method for differential expression analysis perform well under many different conditions, as does the nonparametric SAMseq method.

          Related collections

          Most cited references14

          • Record: found
          • Abstract: found
          • Article: not found

          Small-sample estimation of negative binomial dispersion, with applications to SAGE data.

          We derive a quantile-adjusted conditional maximum likelihood estimator for the dispersion parameter of the negative binomial distribution and compare its performance, in terms of bias, to various other methods. Our estimation scheme outperforms all other methods in very small samples, typical of those from serial analysis of gene expression studies, the motivating data for this study. The impact of dispersion estimation on hypothesis testing is studied. We derive an "exact" test that outperforms the standard approximate asymptotic tests.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Moderated statistical tests for assessing differences in tag abundance.

            Digital gene expression (DGE) technologies measure gene expression by counting sequence tags. They are sensitive technologies for measuring gene expression on a genomic scale, without the need for prior knowledge of the genome sequence. As the cost of sequencing DNA decreases, the number of DGE datasets is expected to grow dramatically. Various tests of differential expression have been proposed for replicated DGE data using binomial, Poisson, negative binomial or pseudo-likelihood (PL) models for the counts, but none of the these are usable when the number of replicates is very small. We develop tests using the negative binomial distribution to model overdispersion relative to the Poisson, and use conditional weighted likelihood to moderate the level of overdispersion across genes. Not only is our strategy applicable even with the smallest number of libraries, but it also proves to be more powerful than previous strategies when more libraries are available. The methodology is equally applicable to other counting technologies, such as proteomic spectral counts. An R package can be accessed from http://bioinf.wehi.edu.au/resources/
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              From RNA-seq reads to differential expression results

              Many methods and tools are available for preprocessing high-throughput RNA sequencing data and detecting differential expression.
                Bookmark

                Author and article information

                Journal
                BMC Bioinformatics
                BMC Bioinformatics
                BMC Bioinformatics
                BioMed Central
                1471-2105
                2013
                9 March 2013
                : 14
                : 91
                Affiliations
                [1 ]Bioinformatics Core Facility, SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland
                [2 ]Département de formation et recherche, Centre Hospitalier Universitaire Vaudois and University of Lausanne, Lausanne, Switzerland
                Article
                1471-2105-14-91
                10.1186/1471-2105-14-91
                3608160
                23497356
                ca92b1d5-6556-4a80-89b8-9b62d818d5ab
                Copyright ©2013 Soneson and Delorenzi; licensee BioMed Central Ltd.

                This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

                History
                : 2 October 2012
                : 1 March 2013
                Categories
                Research Article

                Bioinformatics & Computational biology
                differential expression,gene expression,rna-seq
                Bioinformatics & Computational biology
                differential expression, gene expression, rna-seq

                Comments

                Comment on this article