89
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Rapid Turnover of Long Noncoding RNAs and the Evolution of Gene Expression

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          A large proportion of functional sequence within mammalian genomes falls outside protein-coding exons and can be transcribed into long RNAs. However, the roles in mammalian biology of long noncoding RNA (lncRNA) are not well understood. Few lncRNAs have experimentally determined roles, with some of these being lineage-specific. Determining the extent by which transcription of lncRNA loci is retained or lost across multiple evolutionary lineages is essential if we are to understand their contribution to mammalian biology and to lineage-specific traits. Here, we experimentally investigated the conservation of lncRNA expression among closely related rodent species, allowing the evolution of DNA sequence to be uncoupled from evolution of transcript expression. We generated total RNA (RNAseq) and H3K4me3-bound (ChIPseq) DNA data, and combined both to construct catalogues of transcripts expressed in the adult liver of Mus musculus domesticus (C57BL/6J), Mus musculus castaneus, and Rattus norvegicus. We estimated the rate of transcriptional turnover of lncRNAs and investigated the effects of their lineage-specific birth or death. LncRNA transcription showed considerably greater gain and loss during rodent evolution, compared with protein-coding genes. Nucleotide substitution rates were found to mirror the in vivo transcriptional conservation of intergenic lncRNAs between rodents: only the sequences of noncoding loci with conserved transcription were constrained. Finally, we found that lineage-specific intergenic lncRNAs appear to be associated with modestly elevated expression of genomically neighbouring protein-coding genes. Our findings show that nearly half of intergenic lncRNA loci have been gained or lost since the last common ancestor of mouse and rat, and they predict that such rapid transcriptional turnover contributes to the evolution of tissue- and lineage-specific gene expression.

          Author Summary

          The best-understood portion of mammalian genomes contains genes transcribed into RNAs, which are subsequently translated into proteins. These genes are generally under high selective pressure and deeply conserved between species. Recent publications have revealed novel classes of genes, which are also transcribed into RNA but are not subsequently translated into proteins. One such novel class are long noncoding RNA (lncRNA). LncRNA loci are controlled in a similar manner to protein-coding genes, yet are more often expressed tissue-specifically, and their conservation and function(s) are mostly unknown. Previous reports suggest that lncRNAs can affect the expression of nearby protein-coding genes or act at a distance to control broader biological processes. Also, lncRNA sequence is poorly conserved between mammals compared with protein-coding genes, but how rapidly their transcription evolves, particularly between closely related species, remains unknown. By comparing lncRNA expression between homologous tissues in two species of mouse and in rat, we discovered that lncRNA genes are “born” or “die” more rapidly than protein-coding genes and that this rapid evolution impacts the expression levels of nearby coding genes. This local regulation of gene expression reveals a functional role for the rapid evolution of lncRNAs, which may contribute to biological differences between species.

          Related collections

          Most cited references34

          • Record: found
          • Abstract: found
          • Article: not found

          The transcriptional landscape of the mammalian genome.

          This study describes comprehensive polling of transcription start and termination sites and analysis of previously unidentified full-length complementary DNAs derived from the mouse genome. We identify the 5' and 3' boundaries of 181,047 transcripts with extensive variation in transcripts arising from alternative promoter usage, splicing, and polyadenylation. There are 16,247 new mouse protein-coding transcripts, including 5154 encoding previously unidentified proteins. Genomic mapping of the transcriptome reveals transcriptional forests, with overlapping transcription on both strands, separated by deserts in which few transcripts are observed. The data provide a comprehensive platform for the comparative analysis of mammalian transcriptional regulation in differentiation and development.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            The evolution of gene expression levels in mammalian organs.

            Changes in gene expression are thought to underlie many of the phenotypic differences between species. However, large-scale analyses of gene expression evolution were until recently prevented by technological limitations. Here we report the sequencing of polyadenylated RNA from six organs across ten species that represent all major mammalian lineages (placentals, marsupials and monotremes) and birds (the evolutionary outgroup), with the goal of understanding the dynamics of mammalian transcriptome evolution. We show that the rate of gene expression evolution varies among organs, lineages and chromosomes, owing to differences in selective pressures: transcriptome change was slow in nervous tissues and rapid in testes, slower in rodents than in apes and monotremes, and rapid for the X chromosome right after its formation. Although gene expression evolution in mammals was strongly shaped by purifying selection, we identify numerous potentially selectively driven expression switches, which occurred at different rates across lineages and tissues and which probably contributed to the specific organ biology of various mammals.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Analysis of the mouse transcriptome based on functional annotation of 60,770 full-length cDNAs.

              Only a small proportion of the mouse genome is transcribed into mature messenger RNA transcripts. There is an international collaborative effort to identify all full-length mRNA transcripts from the mouse, and to ensure that each is represented in a physical collection of clones. Here we report the manual annotation of 60,770 full-length mouse complementary DNA sequences. These are clustered into 33,409 'transcriptional units', contributing 90.1% of a newly established mouse transcriptome database. Of these transcriptional units, 4,258 are new protein-coding and 11,665 are new non-coding messages, indicating that non-coding RNA is a major component of the transcriptome. 41% of all transcriptional units showed evidence of alternative splicing. In protein-coding transcripts, 79% of splice variations altered the protein product. Whole-transcriptome analyses resulted in the identification of 2,431 sense-antisense pairs. The present work, completely supported by physical clones, provides the most comprehensive survey of a mammalian transcriptome so far, and is a valuable resource for functional genomics.
                Bookmark

                Author and article information

                Contributors
                Role: Editor
                Journal
                PLoS Genet
                PLoS Genet
                plos
                plosgen
                PLoS Genetics
                Public Library of Science (San Francisco, USA )
                1553-7390
                1553-7404
                July 2012
                July 2012
                26 July 2012
                : 8
                : 7
                : e1002841
                Affiliations
                [1 ]Cancer Research UK, Cambridge Research Institute, Li Ka Shing Centre, Cambridge, United Kingdom
                [2 ]University of Cambridge, Cambridge, United Kingdom
                [3 ]European Bioinformatics Institute, European Molecular Biology Laboratory, Hinxton, United Kingdom
                [4 ]Wellcome Trust Sanger Institute, Hinxton, United Kingdom
                [5 ]MRC Functional Genomics Unit, Department of Physiology, Anatomy and Genetics, University of Oxford, Oxford, United Kingdom
                [6 ]Department of Physiology, Anatomy, and Genetics, University of Oxford, Oxford, United Kingdom
                Massachusetts Institute of Technology, United States of America
                Author notes

                ¤: Current address: SickKids Research Institute and Department of Molecular Genetics, University of Toronto, Toronto, Canada

                Conceived and designed the experiments: CK SW CPP DTO ACM. Performed the experiments: CK SW KS MDW AG ACM. Analyzed the data: CK SW ACM. Wrote the paper: CK SW CPP DTO ACM. Designed the experiments: CK SW DTO. Performed RNAseq and ChIPseq experiments: CK SW KS MDW. Processed and mapped the reads: SW AG ACM. Assembled and annotated transcripts: SW ACM. Performed qPCR and RT–PCR validations: CK SW. Estimated the transcriptional turnover of lncRNAs: CK SW ACM. Performed genome enrichments and sequence constraint analysis of intergenic lncRNAs: ACM. Analysed the effect of lineage specific gene expression on neighbouring genes: ACM. Supervised the study: DTO CPP.

                ¶ These authors also contributed equally to this work.

                Article
                PGENETICS-D-12-00087
                10.1371/journal.pgen.1002841
                3406015
                22844254
                77ba5e67-d6c8-45a4-ab2a-bcc3e28dde00
                Kutter et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
                History
                : 11 January 2012
                : 30 May 2012
                Page count
                Pages: 15
                Categories
                Research Article
                Biology
                Computational Biology
                Evolutionary Biology
                Genetics
                Genomics
                Model Organisms

                Genetics
                Genetics

                Comments

                Comment on this article