Bootstrapping phylogenies inferred from rearrangement data

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

Background

Large-scale sequencing of genomes has enabled the inference of phylogenies based on the evolution of genomic architecture, under such events as rearrangements, duplications, and losses. Many evolutionary models and associated algorithms have been designed over the last few years and have found use in comparative genomics and phylogenetic inference. However, the assessment of phylogenies built from such data has not been properly addressed to date. The standard method used in sequence-based phylogenetic inference is the bootstrap, but it relies on a large number of homologous characters that can be resampled; yet in the case of rearrangements, the entire genome is a single character. Alternatives such as the jackknife suffer from the same problem, while likelihood tests cannot be applied in the absence of well established probabilistic models.

Results

We present a new approach to the assessment of distance-based phylogenetic inference from whole-genome data; our approach combines features of the jackknife and the bootstrap and remains nonparametric. For each feature of our method, we give an equivalent feature in the sequence-based framework; we also present the results of extensive experimental testing, in both sequence-based and genome-based frameworks. Through the feature-by-feature comparison and the experimental results, we show that our bootstrapping approach is on par with the classic phylogenetic bootstrap used in sequence-based reconstruction, and we establish the clear superiority of the classic bootstrap for sequence data and of our corresponding new approach for rearrangement data over proposed variants. Finally, we test our approach on a small dataset of mammalian genomes, verifying that the support values match current thinking about the respective branches.

Conclusions

Our method is the first to provide a standard of assessment to match that of the classic phylogenetic bootstrap for aligned sequences. Its support values follow a similar scale and its receiver-operating characteristics are nearly identical, indicating that it provides similar levels of sensitivity and specificity. Thus our assessment method makes it possible to conduct phylogenetic analyses on whole genomes with the same degree of confidence as for analyses on aligned sequences. Extensions to search-based inference methods such as maximum parsimony and maximum likelihood are possible, but remain to be thoroughly tested.

Related collections

Most cited references 22

Record: found
Abstract: not found
Article: not found

Nonparametric estimates of standard error: The jackknife, the bootstrap and other methods

Bradley Efron, Marcelo Saad (1981)

0 comments Cited 233 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

Efficient sorting of genomic permutations by translocation, inversion and block interchange.

Richard Friedberg, Oliver Attie, Sophia Yancopoulos (2005)

Finding genomic distance based on gene order is a classic problem in genome rearrangements. Efficient exact algorithms for genomic distances based on inversions and/or translocations have been found but are complicated by special cases, rare in simulations and empirical data. We seek a universal operation underlying a more inclusive set of evolutionary operations and yielding a tractable genomic distance with simple mathematical form. We study a universal double-cut-and-join operation that accounts for inversions, translocations, fissions and fusions, but also produces circular intermediates which can be reabsorbed. The genomic distance, computable in linear time, is given by the number of breakpoints minus the number of cycles (b-c) in the comparison graph of the two genomes; the number of hurdles does not enter into it. Without changing the formula, we can replace generation and re-absorption of a circular intermediate by a generalized transposition, equivalent to a block interchange, with weight two. Our simple algorithm converts one multi-linear chromosome genome to another in the minimum distance.

0 comments Cited 140 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

Rare genomic changes as a tool for phylogenetics.

Dave Holland, Rokas Pečiulis (2000)

DNA sequence data have offered valuable insights into the relationships between living organisms. However, most phylogenetic analyses of DNA sequences rely primarily on single nucleotide substitutions, which might not be perfect phylogenetic markers. Rare genomic changes (RGCs), such as intron indels, retroposon integrations, signature sequences, mitochondrial and chloroplast gene order changes, gene duplications and genetic code changes, provide a suite of complementary markers with enormous potential for molecular systematics. Recent exploitation of RGCs has already started to yield exciting phylogenetic information.

0 comments Cited 80 times – based on 0 reviews      Review now

Bookmark

All references

Author and article information

Journal

Journal ID (nlm-ta): Algorithms Mol Biol

Journal ID (iso-abbrev): Algorithms Mol Biol

Title: Algorithms for Molecular Biology : AMB

Publisher: BioMed Central

ISSN (Electronic): 1748-7188

Publication date Collection: 2012

Publication date (Electronic): 29 August 2012

Volume: 7

Page: 21

Affiliations

[1 ]Laboratory for Computational Biology and Bioinformatics, EPFL, EPFL-IC-LCBB INJ230, Station 14, CH-1015 Lausanne, Switzerland

Article

Publisher ID: 1748-7188-7-21

DOI: 10.1186/1748-7188-7-21

PMC ID: 3487984

PubMed ID: 22931958

SO-VID: 7b897c2a-7c5f-4b07-82f0-f93cac6c3c76

License:

This is an Open Access article distributed under the terms of the Creative Commons Attribution License( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Bootstrapping phylogenies inferred from rearrangement data

Read this article at

Abstract

Background

Results

Conclusions

Related collections

Microbial Genomics

Most cited references 22

Nonparametric estimates of standard error: The jackknife, the bootstrap and other methods

Efficient sorting of genomic permutations by translocation, inversion and block interchange.

Rare genomic changes as a tool for phylogenetics.

Author and article information

Journal

Affiliations

Article

History

Categories

Comments

Comment on this article

Similar content 216

Cited by 4

Most referenced authors 365