Towards a complete map of the human long non-coding RNA transcriptome

Abstract

Gene maps, or annotations, enable us to navigate the functional landscape of our genome. They are a resource upon which virtually all studies depend, from single-gene to genome-wide scales and from basic molecular biology to medical genetics. Yet present-day annotations suffer from trade-offs between quality and size, with serious but often unappreciated consequences for downstream studies. This is particularly true for long non-coding RNAs (lncRNAs), which are poorly characterized compared to protein-coding genes. Long-read sequencing technologies promise to improve current annotations, paving the way towards a complete annotation of lncRNAs expressed throughout a human lifetime.

Author and article information

Journal

Title: Nature Reviews Genetics

Abbreviated Title: Nat Rev Genet

Publisher: Springer Nature America, Inc

ISSN (Print): 1471-0056

ISSN (Electronic): 1471-0064

Publication date Created: September 2018

Publication date (Electronic): May 23 2018

Publication date (Print): September 2018

Volume: 19

Issue: 9

Pages: 535-548

Article

DOI: 10.1038/s41576-018-0017-y

PMC ID: 6451964

PubMed ID: 29795125

License:

http://www.springer.com/tdm
