SOAPsplice: Genome-Wide ab initio Detection of Splice Junctions from RNA-Seq Data

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

RNA-Seq, a method using next generation sequencing technologies to sequence the transcriptome, facilitates genome-wide analysis of splice junction sites. In this paper, we introduce SOAPsplice, a robust tool to detect splice junctions using RNA-Seq data without using any information of known splice junctions. SOAPsplice uses a novel two-step approach consisting of first identifying as many reasonable splice junction candidates as possible, and then, filtering the false positives with two effective filtering strategies. In both simulated and real datasets, SOAPsplice is able to detect many reliable splice junctions with low false positive rate. The improvement gained by SOAPsplice, when compared to other existing tools, becomes more obvious when the depth of sequencing is low. SOAPsplice is freely available at http://soap.genomics.org.cn/soapsplice.html.

Related collections

Most cited references 13

Record: found
Abstract: found
Article: not found

SOAP: short oligonucleotide alignment program.

Ruiqiang Li, Yingrui Li, Karsten Kristiansen … (2008)

We have developed a program SOAP for efficient gapped and ungapped alignment of short oligonucleotides onto reference sequences. The program is designed to handle the huge amounts of short reads generated by parallel sequencing using the new generation Illumina-Solexa sequencing technology. SOAP is compatible with numerous applications, including single-read or pair-end resequencing, small RNA discovery and mRNA tag sequence mapping. SOAP is a command-driven program, which supports multi-threaded parallel computing, and has a batch module for multiple query sets. http://soap.genomics.org.cn.

0 comments Cited 389 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

SOAP2: an improved ultrafast tool for short read alignment.

R Li, C. Yu, Y. Li … (2009)

SOAP2 is a significantly improved version of the short oligonucleotide alignment program that both reduces computer memory usage and increases alignment speed at an unprecedented rate. We used a Burrows Wheeler Transformation (BWT) compression index to substitute the seed strategy for indexing the reference sequence in the main memory. We tested it on the whole human genome and found that this new algorithm reduced memory usage from 14.7 to 5.4 GB and improved alignment speed by 20-30 times. SOAP2 is compatible with both single- and paired-end reads. Additionally, this tool now supports multiple text and compressed file formats. A consensus builder has also been developed for consensus assembly and SNP detection from alignment of short reads on a reference genome. http://soap.genomics.org.cn.

0 comments Cited 364 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

Function of alternative splicing.

Stefan Stamm, Shani Ben-Ari, Ilona Rafalska … (2005)

Alternative splicing is one of the most important mechanisms to generate a large number of mRNA and protein isoforms from the surprisingly low number of human genes. Unlike promoter activity, which primarily regulates the amount of transcripts, alternative splicing changes the structure of transcripts and their encoded proteins. Together with nonsense-mediated decay (NMD), at least 25% of all alternative exons are predicted to regulate transcript abundance. Molecular analyses during the last decade demonstrate that alternative splicing determines the binding properties, intracellular localization, enzymatic activity, protein stability and posttranslational modifications of a large number of proteins. The magnitude of the effects range from a complete loss of function or acquisition of a new function to very subtle modulations, which are observed in the majority of cases reported. Alternative splicing factors regulate multiple pre-mRNAs and recent identification of physiological targets shows that a specific splicing factor regulates pre-mRNAs with coherent biological functions. Therefore, evidence is now accumulating that alternative splicing coordinates physiologically meaningful changes in protein isoform expression and is a key mechanism to generate the complex proteome of multicellular organisms.

0 comments Cited 239 times – based on 0 reviews      Review now

Bookmark

All references

Author and article information

Journal

Journal ID (nlm-ta): Front Genet

Journal ID (publisher-id): Front. Gene.

Title: Frontiers in Genetics

Publisher: Frontiers Research Foundation

ISSN (Electronic): 1664-8021

Publication date (Electronic): 07 July 2011

Publication date Collection: 2011

Volume: 2

Electronic Location Identifier: 46

Affiliations

[1] ¹simpleBioinformatics Center, Beijing Genomics Institute at Shenzhen Shenzhen, China

[2] ²simpleDepartment of Computer Science, The University of Hong Kong Hong Kong, China

Author notes

Edited by: Paul T. Spellman, Oregon Health and Sciences University, USA

Reviewed by: Bertrand Tan, Chang Gung University, Taiwan; Xiyin Wang, Hebei United University, China; Obi Lee Griffith, Lawrence Berkeley National Laboratory, USA

*Correspondence: Zhiyu Peng, Beijing Genomics Institute at Shenzhen, Shenzhen 518083, China. e-mail: pengbgi@ 123456gmail.com ;Siu-Ming Yiu, Department of Computer Science, The University of Hong Kong, Hong Kong, China. e-mail: smyiu@ 123456cs.hku.hk

^†Songbo Huang and Jinbo Zhang have contributed equally to this work.

This article was submitted to Frontiers in Genomic Assay Technology, a specialty of Frontiers in Genetics.

Article

DOI: 10.3389/fgene.2011.00046

PMC ID: 3268599

PubMed ID: 22303342

SO-VID: 7feec351-d4be-4b07-a14a-b9b10799c315

License:

This is an open-access article subject to a non-exclusive license between the authors and Frontiers Media SA, which permits use, distribution and reproduction in other forums, provided the original authors and source are credited and other Frontiers conditions are complied with.

History

Date received : 18 March 2011

Date accepted : 26 June 2011

Page count

Figures: 5, Tables: 6, Equations: 0, References: 28, Pages: 12, Words: 6461

Comments

Comment on this article

scite_

Cited by 31

See all cited by

Most referenced authors 857

See all reference authors

SOAPsplice: Genome-Wide ab initio Detection of Splice Junctions from RNA-Seq Data

Read this article at

Abstract

Related collections

RNA drug delivery

Most cited references 13

SOAP: short oligonucleotide alignment program.

SOAP2: an improved ultrafast tool for short read alignment.

Function of alternative splicing.

Author and article information

Journal

Affiliations

Author notes

Article

History

Page count

Categories

Comments

Comment on this article

Similar content 174

Cited by 31

Most referenced authors 857