209
views
0
recommends
+1 Recommend
1 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      A Systematically Improved High Quality Genome and Transcriptome of the Human Blood Fluke Schistosoma mansoni

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Schistosomiasis is one of the most prevalent parasitic diseases, affecting millions of people in developing countries. Amongst the human-infective species, Schistosoma mansoni is also the most commonly used in the laboratory and here we present the systematic improvement of its draft genome. We used Sanger capillary and deep-coverage Illumina sequencing from clonal worms to upgrade the highly fragmented draft 380 Mb genome to one with only 885 scaffolds and more than 81% of the bases organised into chromosomes. We have also used transcriptome sequencing (RNA-seq) from four time points in the parasite's life cycle to refine gene predictions and profile their expression. More than 45% of predicted genes have been extensively modified and the total number has been reduced from 11,807 to 10,852. Using the new version of the genome, we identified trans-splicing events occurring in at least 11% of genes and identified clear cases where it is used to resolve polycistronic transcripts. We have produced a high-resolution map of temporal changes in expression for 9,535 genes, covering an unprecedented dynamic range for this organism. All of these data have been consolidated into a searchable format within the GeneDB ( www.genedb.org) and SchistoDB ( www.schistodb.net) databases. With further transcriptional profiling and genome sequencing increasingly accessible, the upgraded genome will form a fundamental dataset to underpin further advances in schistosome research.

          Author Summary

          Schistosomiasis is a disease caused by parasitic blood flukes of the genus Schistosoma. Human-infective species are prevalent in developing countries, where they represent a major disease burden as well as an impediment to socioeconomic development. In addition to its clinical relevance, Schistosoma mansoni is the species most widely used for laboratory experimentation. In 2009, the first draft of the S. mansoni and S. japonicum genomes were published. Both genome sequences represented a great step forward for schistosome research, but their highly fragmented nature compromised the quality of potential downstream analyses. In this study, we have substantially improved both the genome and the transcriptome resources for S. mansoni. We collated existing data and added deep DNA sequence data from clonal worms and RNA sequence data from four key time points in the life cycle of the parasite. We were able to identify transcribed regions to single-base resolution and have profiled gene expression from the free-living larvae to the early human parasitic stage. We uncovered extensive use of single transcripts from multiple genes, which the organism subsequently resolves by trans-splicing. All data from this study comprise a major new release of the genome, which is publicly and easily accessible.

          Related collections

          Most cited references44

          • Record: found
          • Abstract: found
          • Article: not found

          Schistosomiasis and water resources development: systematic review, meta-analysis, and estimates of people at risk.

          An estimated 779 million people are at risk of schistosomiasis, of whom 106 million (13.6%) live in irrigation schemes or in close proximity to large dam reservoirs. We identified 58 studies that examined the relation between water resources development projects and schistosomiasis, primarily in African settings. We present a systematic literature review and meta-analysis with the following objectives: (1) to update at-risk populations of schistosomiasis and number of people infected in endemic countries, and (2) to quantify the risk of water resources development and management on schistosomiasis. Using 35 datasets from 24 African studies, our meta-analysis showed pooled random risk ratios of 2.4 and 2.6 for urinary and intestinal schistosomiasis, respectively, among people living adjacent to dam reservoirs. The risk ratio estimate for studies evaluating the effect of irrigation on urinary schistosomiasis was in the range 0.02-7.3 (summary estimate 1.1) and that on intestinal schistosomiasis in the range 0.49-23.0 (summary estimate 4.7). Geographic stratification showed important spatial differences, idiosyncratic to the type of water resources development. We conclude that the development and management of water resources is an important risk factor for schistosomiasis, and hence strategies to mitigate negative effects should become integral parts in the planning, implementation, and operation of future water projects.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            SSAHA: a fast search method for large DNA databases.

            We describe an algorithm, SSAHA (Sequence Search and Alignment by Hashing Algorithm), for performing fast searches on databases containing multiple gigabases of DNA. Sequences in the database are preprocessed by breaking them into consecutive k-tuples of k contiguous bases and then using a hash table to store the position of each occurrence of each k-tuple. Searching for a query sequence in the database is done by obtaining from the hash table the "hits" for each k-tuple in the query sequence and then performing a sort on the results. We discuss the effect of the tuple length k on the search speed, memory usage, and sensitivity of the algorithm and present the results of computational experiments which show that SSAHA can be three to four orders of magnitude faster than BLAST or FASTA, while requiring less memory than suffix tree methods. The SSAHA algorithm is used for high-throughput single nucleotide polymorphism (SNP) detection and very large scale sequence assembly. Also, it provides Web-based sequence search facilities for Ensembl projects.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              From RNA-seq reads to differential expression results

              Many methods and tools are available for preprocessing high-throughput RNA sequencing data and detecting differential expression.
                Bookmark

                Author and article information

                Contributors
                Role: Editor
                Journal
                PLoS Negl Trop Dis
                plos
                plosntds
                PLoS Neglected Tropical Diseases
                Public Library of Science (San Francisco, USA )
                1935-2727
                1935-2735
                January 2012
                10 January 2012
                : 6
                : 1
                : e1455
                Affiliations
                [1 ]Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, United Kingdom
                [2 ]Texas Biomedical Research Institute, San Antonio, Texas, United States of America
                [3 ]Departments of Biochemistry and Pathology, University of Texas Health Science Center, San Antonio, Texas, United States of America
                [4 ]Center for Excellence in Bioinformatics, Centro de Pesquisas René Rachou, Fundação Oswaldo Cruz, Belo Horizonte, Minas Gerais, Brazil
                [5 ]Genomics and Computational Biology Group, Centro de Pesquisas René Rachou, Fundação Oswaldo Cruz, Belo Horizonte, Minas Gerais, Brazil
                [6 ]National Institute for Science and Technology in Tropical Diseases, Centro de Pesquisas René Rachou, Fundação Oswaldo Cruz, Belo Horizonte, Minas Gerais, Brazil
                [7 ]Department of Biology, University of York, Heslington, York, United Kingdom
                [8 ]Department of Pathology, University of Cambridge, Cambridge, United Kingdom
                IBERS, United Kingdom
                Author notes

                Conceived and designed the experiments: MB. Performed the experiments: AVP AB. Analyzed the data: AVP IJT MH TDO. Contributed reagents/materials/analysis tools: CL TJCA SJPM RAW DWD PTL. Wrote the paper: AVP IJT MB. Data processing and loading GeneDB and SchistoDB: IJT NDS JM GO AZ MAA GSV. Manual genome improvement and curation of gene annotations: AVP IJT SN RCC CD GPD. Project management: NEH. Construction of sequencing libraries: MAQ.

                Article
                PNTD-D-11-00834
                10.1371/journal.pntd.0001455
                3254664
                22253936
                b3ff87fa-c5db-4e3a-a462-13e3b65d5a7c
                Protasio et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
                History
                : 17 August 2011
                : 13 November 2011
                Page count
                Pages: 13
                Categories
                Research Article
                Biology
                Genomics

                Infectious disease & Microbiology
                Infectious disease & Microbiology

                Comments

                Comment on this article