23
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Conserved structure and inferred evolutionary history of long terminal repeats (LTRs)

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Background

          Long terminal repeats (LTRs, consisting of U3-R-U5 portions) are important elements of retroviruses and related retrotransposons. They are difficult to analyse due to their variability.

          The aim was to obtain a more comprehensive view of structure, diversity and phylogeny of LTRs than hitherto possible.

          Results

          Hidden Markov models (HMM) were created for 11 clades of LTRs belonging to Retroviridae (class III retroviruses), animal Metaviridae (Gypsy/Ty3) elements and plant Pseudoviridae (Copia/Ty1) elements, complementing our work with Orthoretrovirus HMMs. The great variation in LTR length of plant Metaviridae and the few divergent animal Pseudoviridae prevented building HMMs from both of these groups.

          Animal Metaviridae LTRs had the same conserved motifs as retroviral LTRs, confirming that the two groups are closely related. The conserved motifs were the short inverted repeats (SIRs), integrase recognition signals (5´TGTTRNR…YNYAACA 3´); the polyadenylation signal or AATAAA motif; a GT-rich stretch downstream of the polyadenylation signal; and a less conserved AT-rich stretch corresponding to the core promoter element, the TATA box. Plant Pseudoviridae LTRs differed slightly in having a conserved TATA-box, TATATA, but no conserved polyadenylation signal, plus a much shorter R region.

          The sensitivity of the HMMs for detection in genomic sequences was around 50% for most models, at a relatively high specificity, suitable for genome screening.

          The HMMs yielded consensus sequences, which were aligned by creating an HMM model (a ‘Superviterbi’ alignment). This yielded a phylogenetic tree that was compared with a Pol-based tree. Both LTR and Pol trees supported monophyly of retroviruses. In both, Pseudoviridae was ancestral to all other LTR retrotransposons. However, the LTR trees showed the chromovirus portion of Metaviridae clustering together with Pseudoviridae, dividing Metaviridae into two portions with distinct phylogeny.

          Conclusion

          The HMMs clearly demonstrated a unitary conserved structure of LTRs, supporting that they arose once during evolution. We attempted to follow the evolution of LTRs by tracing their functional foundations, that is, acquisition of RNAse H, a combined promoter/ polyadenylation site, integrase, hairpin priming and the primer binding site (PBS). Available information did not support a simple evolutionary chain of events.

          Related collections

          Most cited references49

          • Record: found
          • Abstract: not found
          • Article: not found

          MEGA3: Integrated software for Molecular Evolutionary Genetics Analysis and sequence alignment.

          S. KUMAR (2004)
          With its theoretical basis firmly established in molecular evolutionary and population genetics, the comparative DNA and protein sequence analysis plays a central role in reconstructing the evolutionary histories of species and multigene families, estimating rates of molecular evolution, and inferring the nature and extent of selective forces shaping the evolution of genes and genomes. The scope of these investigations has now expanded greatly owing to the development of high-throughput sequencing techniques and novel statistical and computational methods. These methods require easy-to-use computer programs. One such effort has been to produce Molecular Evolutionary Genetics Analysis (MEGA) software, with its focus on facilitating the exploration and analysis of the DNA and protein sequence variation from an evolutionary perspective. Currently in its third major release, MEGA3 contains facilities for automatic and manual sequence alignment, web-based mining of databases, inference of the phylogenetic trees, estimation of evolutionary distances and testing evolutionary hypotheses. This paper provides an overview of the statistical methods, computational tools, and visual exploration modules for data input and the results obtainable in MEGA.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Architecture and Secondary Structure of an Entire HIV-1 RNA Genome

            Single-stranded RNA viruses encompass broad classes of infectious agents and cause the common cold, cancer, AIDS, and other serious health threats. Viral replication is regulated at many levels, including using conserved genomic RNA structures. Most potential regulatory elements within viral RNA genomes are uncharacterized. Here we report the structure of an entire HIV-1 genome at single nucleotide resolution using SHAPE, a high-throughput RNA analysis technology. The genome encodes protein structure at two levels. In addition to the correspondence between RNA and protein primary sequences, a correlation exists between high levels of RNA structure and sequences that encode inter-domain loops in HIV proteins. This correlation suggests RNA structure modulates ribosome elongation to promote native protein folding. Some simple genome elements previously shown to be important, including the ribosomal gag-pol frameshift stem-loop, are components of larger RNA motifs. We also identify organizational principles for unstructured RNA regions. Highly used splice acceptors lie in unstructured motifs and hypervariable regions are sequestered from flanking genome regions by stable insulator helices. These results emphasize that the HIV-1 genome and, potentially, many coding RNAs are punctuated by numerous previously unrecognized regulatory motifs and that extensive RNA structure may constitute an additional level of the genetic code.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Origin and evolution of retroelements based upon their reverse transcriptase sequences.

              To study the evolutionary relationship of reverse transcriptase (RT) containing genetic elements, a phylogenetic tree of 82 retroelements from animals, plants, protozoans and bacteria was constructed. The tree was based on seven amino acid domains totalling 178 residues identified in all RTs. We have also identified these seven domains in the RNA-directed RNA polymerases from various plus-strand RNA viruses. The sequence similarity of these RNA polymerases to RT suggests that these two enzymes evolved from a common ancestor, and thus RNA polymerase can be used as an outgroup to root the RT tree. A comparison of the genetic organization of the various RT containing elements and their position on the tree allows several inferences concerning the origin and evolution of these elements. The most probable ancestor of current retroelements was a retrotransposable element with both gag-like and pol-like genes. On one major branch of the tree, organelle and bacterial sequences (e.g. group II introns and bacterial msDNA) appear to have captured the RT sequences from retrotransposons which lack long terminal repeats (LTRs). On the other major branch, acquisition of LTRs gave rise to two distinct groups of LTR retrotransposons and three groups of viruses: retroviruses, hepadnaviruses and caulimoviruses.
                Bookmark

                Author and article information

                Contributors
                Journal
                Mob DNA
                Mob DNA
                Mobile DNA
                BioMed Central
                1759-8753
                2013
                1 February 2013
                : 4
                : 5
                Affiliations
                [1 ]Section of Virology, Department of Medical Sciences, Uppsala University, Uppsala, Sweden
                [2 ]Unit of Physiology, Department of Neuroscience, Uppsala University, Uppsala, Sweden
                [3 ]Department of Animal Breeding and Genetics, Swedish University of Agricultural Sciences, Uppsala, Sweden
                [4 ]High Throughput Biology Center, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA
                [5 ]Section of Virology, Department of Medical Sciences, Academic Hospital, Uppsala 751 85, Sweden
                Article
                1759-8753-4-5
                10.1186/1759-8753-4-5
                3601003
                23369192
                1a84eb61-9ad3-4f8c-91db-1815da99fd42
                Copyright ©2013 Benachenhou et al; licensee BioMed Central Ltd.

                This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

                History
                : 17 July 2012
                : 14 December 2012
                Categories
                Research

                Genetics
                ltr,long terminal repeat,retrotransposon,retrovirus,phylogeny,genome evolution
                Genetics
                ltr, long terminal repeat, retrotransposon, retrovirus, phylogeny, genome evolution

                Comments

                Comment on this article