85
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Genome-wide analysis of mRNA abundance in two life-cycle stages of Trypanosoma brucei and identification of splicing and polyadenylation sites

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Transcription of protein-coding genes in trypanosomes is polycistronic and gene expression is primarily regulated by post-transcriptional mechanisms. Sequence motifs in the untranslated regions regulate mRNA trans-splicing and RNA stability, yet where UTRs begin and end is known for very few genes. We used high-throughput RNA-sequencing to determine the genome-wide steady-state mRNA levels (‘transcriptomes’) for ∼90% of the genome in two stages of the Trypanosoma brucei life cycle cultured in vitro. Almost 6% of genes were differentially expressed between the two life-cycle stages. We identified 5′ splice-acceptor sites (SAS) and polyadenylation sites (PAS) for 6959 and 5948 genes, respectively. Most genes have between one and three alternative SAS, but PAS are more dispersed. For 488 genes, SAS were identified downstream of the originally assigned initiator ATG, so a subsequent in-frame ATG presumably designates the start of the true coding sequence. In some cases, alternative SAS would give rise to mRNAs encoding proteins with different N-terminal sequences. We could identify the introns in two genes known to contain them, but found no additional genes with introns. Our study demonstrates the usefulness of the RNA-seq technology to study the transcriptional landscape of an organism whose genome has not been fully annotated.

          Related collections

          Most cited references47

          • Record: found
          • Abstract: found
          • Article: not found

          The transcriptional landscape of the yeast genome defined by RNA sequencing.

          The identification of untranslated regions, introns, and coding regions within an organism remains challenging. We developed a quantitative sequencing-based method called RNA-Seq for mapping transcribed regions, in which complementary DNA fragments are subjected to high-throughput sequencing and mapped to the genome. We applied RNA-Seq to generate a high-resolution transcriptome map of the yeast genome and demonstrated that most (74.5%) of the nonrepetitive sequence of the yeast genome is transcribed. We confirmed many known and predicted introns and demonstrated that others are not actively used. Alternative initiation codons and upstream open reading frames also were identified for many yeast genes. We also found unexpected 3'-end heterogeneity and the presence of many overlapping genes. These results indicate that the yeast transcriptome is more complex than previously appreciated.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            A tightly regulated inducible expression system for conditional gene knock-outs and dominant-negative genetics in Trypanosoma brucei.

            First-generation inducible expression vectors for Trypanosoma brucei utilized a single tetracycline-responsive promoter to drive expression of an experimental gene, in tandem with a drug-resistance marker gene to select for integration (Wirtz E, Clayton CE. Science 1995; 268:1179-1183). Because drug resistance and experimental gene expression both depended upon the activity of the regulated promoter, this approach could not be used for inducible expression of toxic products. We have now developed a dual-promoter approach, for expressing highly toxic products and generating conditional gene knock-outs, using back-to-back constitutive T7 and tetracycline-responsive PARP promoters to drive expression of the selectable marker and test gene, respectively. Transformants are readily obtained with these vectors in the absence of tetracycline, in bloodstream or procyclic T. brucei cell lines co-expressing T7 RNA polymerase and Tet repressor, and consistently show tetracycline-responsive expression through a 10(3)-10(4)-fold range. Uninduced background expression of a luciferase reporter averages no more than one molecule per cell, enabling dominant-negative approaches relying upon inducible expression of toxic products. This tight regulation also permits the production of functional gene knock-outs through regulated expression of an experimental gene in a null-mutant background.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Dynamic repertoire of a eukaryotic transcriptome surveyed at single-nucleotide resolution.

              Recent data from several organisms indicate that the transcribed portions of genomes are larger and more complex than expected, and that many functional properties of transcripts are based not on coding sequences but on regulatory sequences in untranslated regions or non-coding RNAs. Alternative start and polyadenylation sites and regulation of intron splicing add additional dimensions to the rich transcriptional output. This transcriptional complexity has been sampled mainly using hybridization-based methods under one or few experimental conditions. Here we applied direct high-throughput sequencing of complementary DNAs (RNA-Seq), supplemented with data from high-density tiling arrays, to globally sample transcripts of the fission yeast Schizosaccharomyces pombe, independently from available gene annotations. We interrogated transcriptomes under multiple conditions, including rapid proliferation, meiotic differentiation and environmental stress, as well as in RNA processing mutants to reveal the dynamic plasticity of the transcriptional landscape as a function of environmental, developmental and genetic factors. High-throughput sequencing proved to be a powerful and quantitative method to sample transcriptomes deeply at maximal resolution. In contrast to hybridization, sequencing showed little, if any, background noise and was sensitive enough to detect widespread transcription in >90% of the genome, including traces of RNAs that were not robustly transcribed or rapidly degraded. The combined sequencing and strand-specific array data provide rich condition-specific information on novel, mostly non-coding transcripts, untranslated regions and gene structures, thus improving the existing genome annotation. Sequence reads spanning exon-exon or exon-intron junctions give unique insight into a surprising variability in splicing efficiency across introns, genes and conditions. Splicing efficiency was largely coordinated with transcript levels, and increased transcription led to increased splicing in test genes. Hundreds of introns showed such regulated splicing during cellular proliferation or differentiation.
                Bookmark

                Author and article information

                Journal
                Nucleic Acids Res
                nar
                nar
                Nucleic Acids Research
                Oxford University Press
                0305-1048
                1362-4962
                August 2010
                August 2010
                12 April 2010
                12 April 2010
                : 38
                : 15
                : 4946-4957
                Affiliations
                1Laboratory of Molecular Parasitology, 2Laboratory of Living Matter and 3Genomics Resource Center, The Rockefeller University, 1230 York Avenue, New York, NY 10065, USA
                Author notes
                *To whom correspondence should be addressed. Tel: +1 212 327 7571; Fax: +1 212 327 7572; Email: george.cross@ 123456rockefeller.edu

                Present address: T. Nicolai Siegel, Unité de Biologie des Interactions Hôte-Parasite, CNRS URA2581, Institut Pasteur, 25 Rue du Dr Roux, 75724 Paris, France.

                Article
                gkq237
                10.1093/nar/gkq237
                2926603
                20385579
                3770caf8-b414-4749-a1ca-0d7d7157dddc
                © The Author(s) 2010. Published by Oxford University Press.

                This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License ( http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

                History
                : 1 September 2009
                : 23 February 2010
                : 19 March 2010
                Categories
                Gene Regulation, Chromatin and Epigenetics

                Genetics
                Genetics

                Comments

                Comment on this article