A quantitative atlas of polyadenylation in five mammals

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

We developed PolyA-seq, a strand-specific and quantitative method for high-throughput sequencing of 3′ ends of polyadenylated transcripts, and used it to globally map polyadenylation (polyA) sites in 24 matched tissues in human, rhesus, dog, mouse, and rat. We show that PolyA-seq is as accurate as existing RNA sequencing (RNA-seq) approaches for digital gene expression (DGE), enabling simultaneous mapping of polyA sites and quantitative measurement of their usage. In human, we confirmed 158,533 known sites and discovered 280,857 novel sites (FDR < 2.5%). On average 10% of novel human sites were also detected in matched tissues in other species. Most novel sites represent uncharacterized alternative polyA events and extensions of known transcripts in human and mouse, but primarily delineate novel transcripts in the other three species. A total of 69.1% of known human genes that we detected have multiple polyA sites in their 3′UTRs, with 49.3% having three or more. We also detected polyadenylation of noncoding and antisense transcripts, including constitutive and tissue-specific primary microRNAs. The canonical polyA signal was strongly enriched and positionally conserved in all species. In general, usage of polyA sites is more similar within the same tissues across different species than within a species. These quantitative maps of polyA usage in evolutionarily and functionally related samples constitute a resource for understanding the regulatory mechanisms underlying alternative polyadenylation.

Related collections

Most cited references 50

Record: found
Abstract: found
Article: not found

Mapping and quantifying mammalian transcriptomes by RNA-Seq.

Ali Mortazavi, Brian Williams, Kenneth McCue … (2008)

We have mapped and quantified mouse transcriptomes by deeply sequencing them and recording how frequently each gene is represented in the sequence sample (RNA-Seq). This provides a digital measure of the presence and prevalence of transcripts from known and previously unknown genes. We report reference measurements composed of 41-52 million mapped 25-base-pair reads for poly(A)-selected RNA from adult mouse brain, liver and skeletal muscle tissues. We used RNA standards to quantify transcript prevalence and to test the linear range of transcript detection, which spanned five orders of magnitude. Although >90% of uniquely mapped reads fell within known exons, the remaining data suggest new and revised gene models, including changed or additional promoters, exons and 3' untranscribed regions, as well as new candidate microRNA precursors. RNA splice events, which are not readily measured by standard gene expression microarray or serial analysis of gene expression methods, were detected directly by mapping splice-crossing sequence reads. We observed 1.45 x 10(5) distinct splices, and alternative splices were prominent, with 3,500 different genes expressing one or more alternate internal splices.

0 comments Cited 1332 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

RNA-Seq: a revolutionary tool for transcriptomics.

Zhong Wang, Mark Gerstein, Michael Snyder (2009)

RNA-Seq is a recently developed approach to transcriptome profiling that uses deep-sequencing technologies. Studies using this method have already altered our view of the extent and complexity of eukaryotic transcriptomes. RNA-Seq also provides a far more precise measurement of levels of transcripts and their isoforms than other methods. This article describes the RNA-Seq approach, the challenges associated with its application, and the advances made so far in characterizing several eukaryote transcriptomes.

0 comments Cited 1324 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

Alternative Isoform Regulation in Human Tissue Transcriptomes

Eric T Wang, Rickard Sandberg, Shujun Luo … (2008)

Through alternative processing of pre-mRNAs, individual mammalian genes often produce multiple mRNA and protein isoforms that may have related, distinct or even opposing functions. Here we report an in-depth analysis of 15 diverse human tissue and cell line transcriptomes based on deep sequencing of cDNA fragments, yielding a digital inventory of gene and mRNA isoform expression. Analysis of mappings of sequence reads to exon-exon junctions indicated that 92-94% of human genes undergo alternative splicing (AS), ∼86% with a minor isoform frequency of 15% or more. Differences in isoform-specific read densities indicated that a majority of AS and of alternative cleavage and polyadenylation (APA) events vary between tissues, while variation between individuals was ∼2- to 3-fold less common. Extreme or ‘switch-like’ regulation of splicing between tissues was associated with increased sequence conservation in regulatory regions and with generation of full-length open reading frames. Patterns of AS and APA were strongly correlated across tissues, suggesting coordinated regulation of these processes, and sequence conservation of a subset of known regulatory motifs in both alternative introns and 3′ UTRs suggested common involvement of specific factors in tissue-level regulation of both splicing and polyadenylation.

0 comments Cited 715 times – based on 0 reviews      Review now

Bookmark

All references

Author and article information

Journal

Journal ID (nlm-ta): Genome Res

Journal ID (iso-abbrev): Genome Res

Journal ID (publisher-id): GENOME

Title: Genome Research

Publisher: Cold Spring Harbor Laboratory Press

ISSN (Print): 1088-9051

ISSN (Electronic): 1549-5469

Publication date (Print): June 2012

Publication date PMC-release: June 2012

Volume: 22

Issue: 6

Pages: 1173-1183

Affiliations

[1 ]Department of Informatics IT,

[2 ]Department of Molecular Biomarkers,

[3 ]Department of Informatics and Analysis, Merck and Co., Inc., Boston, Massachusetts 02115, USA

Author notes

Present addresses: ⁴Novartis Institutes for BioMedical Research, Cambridge, MA 02139, USA;

[5]

Department of Biology, Stanford University, Stanford, CA 93105, USA.

[6 ]Corresponding author. E-mail tbabak@ 123456stanford.edu .

Article

Medline ID: 9518021

DOI: 10.1101/gr.132563.111

PMC ID: 3371698

PubMed ID: 22454233

SO-VID: 6e497798-a4b6-4091-86e0-abd35011f8a0

License:

This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 3.0 Unported License), as described at http://creativecommons.org/licenses/by-nc/3.0/.

History

Date received : 27 September 2011

Date accepted : 19 March 2012

Comments

Comment on this article

scite_

Cited by 288

See all cited by

Most referenced authors 1,495

See all reference authors

- Version 1
- Version 1

A quantitative atlas of polyadenylation in five mammals

Read this article at

Abstract

Related collections

Smart Contracts Programming Languages

Most cited references 50

Mapping and quantifying mammalian transcriptomes by RNA-Seq.

RNA-Seq: a revolutionary tool for transcriptomics.

Alternative Isoform Regulation in Human Tissue Transcriptomes

Author and article information

Journal

Affiliations

Author notes

Article

History

Categories

Comments

Comment on this article

Similar content 133

Cited by 288

Most referenced authors 1,495