Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

In comparative high-throughput sequencing assays, a fundamental task is the analysis of count data, such as read counts per gene in RNA-seq, for evidence of systematic changes across experimental conditions. Small replicate numbers, discreteness, large dynamic range and the presence of outliers require a suitable statistical approach. We present DESeq2, a method for differential analysis of count data, using shrinkage estimation for dispersions and fold changes to improve stability and interpretability of estimates. This enables a more quantitative analysis focused on the strength rather than the mere presence of differential expression. The DESeq2 package is available at http://www.bioconductor.org/packages/release/bioc/html/DESeq2.html.

Electronic supplementary material

The online version of this article (doi:10.1186/s13059-014-0550-8) contains supplementary material, which is available to authorized users.

Related collections

Most cited references 35

Record: found
Abstract: not found
Article: not found

Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing

Yoav Benjamini, Yosef Hochberg (1995)

0 comments Cited 24609 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: found

Is Open Access

featureCounts: An efficient general-purpose program for assigning sequence reads to genomic features

, , (2013)

Next-generation sequencing technologies generate millions of short sequence reads, which are usually aligned to a reference genome. In many applications, the key information required for downstream analysis is the number of reads mapping to each genomic feature, for example to each exon or each gene. The process of counting reads is called read summarization. Read summarization is required for a great variety of genomic analyses but has so far received relatively little attention in the literature. We present featureCounts, a read summarization program suitable for counting reads generated from either RNA or genomic DNA sequencing experiments. featureCounts implements highly efficient chromosome hashing and feature blocking techniques. It is considerably faster than existing methods (by an order of magnitude for gene-level summarization) and requires far less computer memory. It works with either single or paired-end reads and provides a wide range of options appropriate for different sequencing applications. featureCounts is available under GNU General Public License as part of the Subread (http://subread.sourceforge.net) or Rsubread (http://www.bioconductor.org) software packages.

0 comments Cited 770 times – based on 0 reviews

Preprint

     Review now

Bookmark

Record: found
Abstract: found
Article: not found

Small-sample estimation of negative binomial dispersion, with applications to SAGE data.

Mark Robinson, Gordon K. Smyth (2008)

We derive a quantile-adjusted conditional maximum likelihood estimator for the dispersion parameter of the negative binomial distribution and compare its performance, in terms of bias, to various other methods. Our estimation scheme outperforms all other methods in very small samples, typical of those from serial analysis of gene expression studies, the motivating data for this study. The impact of dispersion estimation on hypothesis testing is studied. We derive an "exact" test that outperforms the standard approximate asymptotic tests.

0 comments Cited 463 times – based on 0 reviews      Review now

Bookmark

All references

Author and article information

Contributors

Michael I Love: mlove@jimmy.harvard.edu

Wolfgang Huber: whuber@embl.de

Simon Anders: sanders@fs.tum.de

Journal

Journal ID (nlm-ta): Genome Biol

Title: Genome Biology

Publisher: BioMed Central (London )

ISSN (Print): 1465-6906

ISSN (Electronic): 1465-6914

Publication date (Electronic): 5 December 2014

Publication date (Print): 2014

Volume: 15

Issue: 12

Electronic Location Identifier: 550

Affiliations

[ ]Department of Biostatistics and Computational Biology, Dana Farber Cancer Institute and Department of Biostatistics, Harvard School of Public Health, 450 Brookline Avenue, Boston, 02215 MA USA

[ ]Genome Biology Unit, European Molecular Biology Laboratory, Meyerhofstrasse 1, Heidelberg, 69117 Germany

[ ]Department of Computational Molecular Biology, Max Planck Institute for Molecular Genetics, Ihnestrasse 63-7314195, Berlin, Germany

Article

Publisher ID: 550

DOI: 10.1186/s13059-014-0550-8

PMC ID: 4302049

PubMed ID: 25516281

SO-VID: 67a05331-ede4-4dc2-88c1-1d5f685902c9

License:

This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver ( http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

History

Date received : 27 May 2014

Date accepted : 19 November 2014

Custom metadata

ScienceOpen disciplines: Genetics

Data availability:

ScienceOpen disciplines: Genetics

Comments

Comment on this article

scite_

Cited by 32,459

See all cited by

Most referenced authors 1,053

See all reference authors

- Version 1

Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2

Read this article at

Abstract

Electronic supplementary material

Related collections

RNA drug delivery

Most cited references 35

Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing

featureCounts: An efficient general-purpose program for assigning sequence reads to genomic features

Small-sample estimation of negative binomial dispersion, with applications to SAGE data.

Author and article information

Contributors

Journal

Affiliations

Article

History

Categories

Custom metadata

Comments

Comment on this article

Similar content 134

Cited by 32,459

Most referenced authors 1,053