Group II intron and repeat-rich red algal mitochondrial genomes demonstrate the dynamic recent history of autocatalytic RNAs

Abstract

Background

Group II introns are mobile genetic elements that can insert at specific target sequences; however, their origins are often challenging to reconstruct because of rapid sequence decay following invasion and spread into different sites. To advance understanding of group II intron spread, we studied the intron-rich mitochondrial genome (mitogenome) of the unicellular red alga Porphyridium.

Results

Analysis of mitogenomes from three closely related species in this genus revealed that they are 3–6-fold larger (56–132 kbp) than those of other red algae, which range from 21 to 43 kbp. This discrepancy is explained by two factors: group II intron invasion and expansion of repeated sequences in large intergenic regions. Phylogenetic analysis demonstrates that many mitogenome group II intron families are specific to Porphyridium, whereas others are closely related to sequences in fungi and in the red alga-derived plastids of stramenopiles. Network analysis of intron-encoded proteins (IEPs) shows a clear link between plastid and mitochondrial IEPs in distantly related species, with both groups associated with prokaryotic sequences.

Conclusion

Our analysis of group II introns in Porphyridium mitogenomes demonstrates the dynamic nature of group II intron evolution, strongly supports the lateral movement of group II introns among diverse eukaryotes, and reveals their ability to proliferate once integrated into mitochondrial DNA.

Supplementary Information

The online version contains supplementary material available at 10.1186/s12915-021-01200-3.

Author and article information

Contributors

Hwan Su Yoon:

ORCID: http://orcid.org/0000-0001-9507-0105

hsyoon2011@skku.edu

Journal

Journal ID (nlm-ta): BMC Biol

Journal ID (iso-abbrev): BMC Biol

Title: BMC Biology

Publisher: BioMed Central (London)

ISSN (Electronic): 1741-7007

Publication date (Electronic): 7 January 2022

Publication date PMC-release: 7 January 2022

Publication date Collection: 2022

Volume: 20

Electronic Location Identifier: 2

Affiliations

[1] GRID grid.264381.a, ISNI 0000 0001 2181 989X, Department of Biological Sciences, Sungkyunkwan University, Suwon, 16419 South Korea

[2] GRID grid.258803.4, ISNI 0000 0001 0661 1556, Department of Oceanography, Kyungpook National University, Daegu, 41566 South Korea

[3] GRID grid.430387.b, ISNI 0000 0004 1936 8796, Department of Biochemistry and Microbiology, Rutgers University, New Brunswick, NJ 08901 USA

Article

Publisher ID: 1200

DOI: 10.1186/s12915-021-01200-3

PMC ID: 8742464

PubMed ID: 34996446

SO-VID: e4eb1c69-a074-4fbd-bf3b-7e2a75fe21b9

License:

Open Access. This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

History

Date received : 27 June 2021

Date accepted : 29 November 2021

Funding

Funded by: FundRef http://dx.doi.org/10.13039/501100003566, Ministry of Oceans and Fisheries

Award ID: Collaborative Genome Program of the KIMST (20180430)

Award Recipient : Hwan Su Yoon

Funded by: National Research Foundation (KR)

Award ID: NRF-2017R1A2B3001923

Award Recipient : Hwan Su Yoon

Funded by: FundRef http://dx.doi.org/10.13039/100000104, National Aeronautics and Space Administration

Award ID: 80NSSC19K0462

Award Recipient : Debashish Bhattacharya

Funded by: NIFA-USDA Hatch grant

Award ID: NJ01180

Award Recipient : Debashish Bhattacharya

Custom metadata

ScienceOpen disciplines: Life sciences

Keywords: genome expansion, group II introns, repeated sequences, horizontal gene transfer, red algae
