49
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: not found

      SOAPsplice: Genome-Wide ab initio Detection of Splice Junctions from RNA-Seq Data

      research-article

      Read this article at

      ScienceOpenPublisherPMC
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          RNA-Seq, a method using next generation sequencing technologies to sequence the transcriptome, facilitates genome-wide analysis of splice junction sites. In this paper, we introduce SOAPsplice, a robust tool to detect splice junctions using RNA-Seq data without using any information of known splice junctions. SOAPsplice uses a novel two-step approach consisting of first identifying as many reasonable splice junction candidates as possible, and then, filtering the false positives with two effective filtering strategies. In both simulated and real datasets, SOAPsplice is able to detect many reliable splice junctions with low false positive rate. The improvement gained by SOAPsplice, when compared to other existing tools, becomes more obvious when the depth of sequencing is low. SOAPsplice is freely available at http://soap.genomics.org.cn/soapsplice.html.

          Related collections

          Most cited references13

          • Record: found
          • Abstract: found
          • Article: not found

          SOAP: short oligonucleotide alignment program.

          We have developed a program SOAP for efficient gapped and ungapped alignment of short oligonucleotides onto reference sequences. The program is designed to handle the huge amounts of short reads generated by parallel sequencing using the new generation Illumina-Solexa sequencing technology. SOAP is compatible with numerous applications, including single-read or pair-end resequencing, small RNA discovery and mRNA tag sequence mapping. SOAP is a command-driven program, which supports multi-threaded parallel computing, and has a batch module for multiple query sets. http://soap.genomics.org.cn.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            SOAP2: an improved ultrafast tool for short read alignment.

            R Li, C. Yu, Y. Li (2009)
            SOAP2 is a significantly improved version of the short oligonucleotide alignment program that both reduces computer memory usage and increases alignment speed at an unprecedented rate. We used a Burrows Wheeler Transformation (BWT) compression index to substitute the seed strategy for indexing the reference sequence in the main memory. We tested it on the whole human genome and found that this new algorithm reduced memory usage from 14.7 to 5.4 GB and improved alignment speed by 20-30 times. SOAP2 is compatible with both single- and paired-end reads. Additionally, this tool now supports multiple text and compressed file formats. A consensus builder has also been developed for consensus assembly and SNP detection from alignment of short reads on a reference genome. http://soap.genomics.org.cn.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Function of alternative splicing.

              Alternative splicing is one of the most important mechanisms to generate a large number of mRNA and protein isoforms from the surprisingly low number of human genes. Unlike promoter activity, which primarily regulates the amount of transcripts, alternative splicing changes the structure of transcripts and their encoded proteins. Together with nonsense-mediated decay (NMD), at least 25% of all alternative exons are predicted to regulate transcript abundance. Molecular analyses during the last decade demonstrate that alternative splicing determines the binding properties, intracellular localization, enzymatic activity, protein stability and posttranslational modifications of a large number of proteins. The magnitude of the effects range from a complete loss of function or acquisition of a new function to very subtle modulations, which are observed in the majority of cases reported. Alternative splicing factors regulate multiple pre-mRNAs and recent identification of physiological targets shows that a specific splicing factor regulates pre-mRNAs with coherent biological functions. Therefore, evidence is now accumulating that alternative splicing coordinates physiologically meaningful changes in protein isoform expression and is a key mechanism to generate the complex proteome of multicellular organisms.
                Bookmark

                Author and article information

                Journal
                Front Genet
                Front. Gene.
                Frontiers in Genetics
                Frontiers Research Foundation
                1664-8021
                07 July 2011
                2011
                : 2
                : 46
                Affiliations
                [1] 1simpleBioinformatics Center, Beijing Genomics Institute at Shenzhen Shenzhen, China
                [2] 2simpleDepartment of Computer Science, The University of Hong Kong Hong Kong, China
                Author notes

                Edited by: Paul T. Spellman, Oregon Health and Sciences University, USA

                Reviewed by: Bertrand Tan, Chang Gung University, Taiwan; Xiyin Wang, Hebei United University, China; Obi Lee Griffith, Lawrence Berkeley National Laboratory, USA

                *Correspondence: Zhiyu Peng, Beijing Genomics Institute at Shenzhen, Shenzhen 518083, China. e-mail: pengbgi@ 123456gmail.com ;Siu-Ming Yiu, Department of Computer Science, The University of Hong Kong, Hong Kong, China. e-mail: smyiu@ 123456cs.hku.hk

                Songbo Huang and Jinbo Zhang have contributed equally to this work.

                This article was submitted to Frontiers in Genomic Assay Technology, a specialty of Frontiers in Genetics.

                Article
                10.3389/fgene.2011.00046
                3268599
                22303342
                7feec351-d4be-4b07-a14a-b9b10799c315
                Copyright © 2011 Huang, Zhang, Li, Zhang, He, Lam, Peng and Yiu.

                This is an open-access article subject to a non-exclusive license between the authors and Frontiers Media SA, which permits use, distribution and reproduction in other forums, provided the original authors and source are credited and other Frontiers conditions are complied with.

                History
                : 18 March 2011
                : 26 June 2011
                Page count
                Figures: 5, Tables: 6, Equations: 0, References: 28, Pages: 12, Words: 6461
                Categories
                Genetics
                Methods Article

                Genetics
                splice junction,rna-seq,spliced alignment
                Genetics
                splice junction, rna-seq, spliced alignment

                Comments

                Comment on this article