24
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      L.U.St: a tool for approximated maximum likelihood supertree reconstruction

      product-review

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Background

          Supertrees combine disparate, partially overlapping trees to generate a synthesis that provides a high level perspective that cannot be attained from the inspection of individual phylogenies. Supertrees can be seen as meta-analytical tools that can be used to make inferences based on results of previous scientific studies. Their meta-analytical application has increased in popularity since it was realised that the power of statistical tests for the study of evolutionary trends critically depends on the use of taxon-dense phylogenies. Further to that, supertrees have found applications in phylogenomics where they are used to combine gene trees and recover species phylogenies based on genome-scale data sets.

          Results

          Here, we present the L.U.St package, a python tool for approximate maximum likelihood supertree inference and illustrate its application using a genomic data set for the placental mammals. L.U.St allows the calculation of the approximate likelihood of a supertree, given a set of input trees, performs heuristic searches to look for the supertree of highest likelihood, and performs statistical tests of two or more supertrees. To this end, L.U.St implements a winning sites test allowing ranking of a collection of a-priori selected hypotheses, given as a collection of input supertree topologies. It also outputs a file of input-tree-wise likelihood scores that can be used as input to CONSEL for calculation of standard tests of two trees (e.g. Kishino-Hasegawa, Shimidoara-Hasegawa and Approximately Unbiased tests).

          Conclusion

          This is the first fully parametric implementation of a supertree method, it has clearly understood properties, and provides several advantages over currently available supertree approaches. It is easy to implement and works on any platform that has python installed.

          Availability: bitBucket page - https://afro-juju@bitbucket.org/afro-juju/l.u.st.git.

          Contact: Davide.Pisani@bristol.ac.uk.

          Related collections

          Most cited references22

          • Record: found
          • Abstract: found
          • Article: not found

          Evaluation of the maximum likelihood estimate of the evolutionary tree topologies from DNA sequence data, and the branching order in hominoidea.

          A maximum likelihood method for inferring evolutionary trees from DNA sequence data was developed by Felsenstein (1981). In evaluating the extent to which the maximum likelihood tree is a significantly better representation of the true tree, it is important to estimate the variance of the difference between log likelihood of different tree topologies. Bootstrap resampling can be used for this purpose (Hasegawa et al. 1988; Hasegawa and Kishino 1989), but it imposes a great computation burden. To overcome this difficulty, we developed a new method for estimating the variance by expressing it explicitly. The method was applied to DNA sequence data from primates in order to evaluate the maximum likelihood branching order among Hominoidea. It was shown that, although the orangutan is convincingly placed as an outgroup of a human and African apes clade, the branching order among human, chimpanzee, and gorilla cannot be determined confidently from the DNA sequence data presently available when the evolutionary rate constancy is not assumed.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            The placental mammal ancestor and the post-K-Pg radiation of placentals.

            To discover interordinal relationships of living and fossil placental mammals and the time of origin of placentals relative to the Cretaceous-Paleogene (K-Pg) boundary, we scored 4541 phenomic characters de novo for 86 fossil and living species. Combining these data with molecular sequences, we obtained a phylogenetic tree that, when calibrated with fossils, shows that crown clade Placentalia and placental orders originated after the K-Pg boundary. Many nodes discovered using molecular data are upheld, but phenomic signals overturn molecular signals to show Sundatheria (Dermoptera + Scandentia) as the sister taxon of Primates, a close link between Proboscidea (elephants) and Sirenia (sea cows), and the monophyly of echolocating Chiroptera (bats). Our tree suggests that Placentalia first split into Xenarthra and Epitheria; extinct New World species are the oldest members of Afrotheria.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Resolving conflict in eutherian mammal phylogeny using phylogenomics and the multispecies coalescent model.

              The reconstruction of the Tree of Life has relied almost entirely on concatenation methods, which do not accommodate gene tree heterogeneity, a property that simulations and theory have identified as a likely cause of incongruent phylogenies. However, this incongruence has not yet been demonstrated in empirical studies. Several key relationships among eutherian mammals remain controversial and conflicting among previous studies, including the root of eutherian tree and the relationships within Euarchontoglires and Laurasiatheria. Both bayesian and maximum-likelihood analysis of genome-wide data of 447 nuclear genes from 37 species show that concatenation methods indeed yield strong incongruence in the phylogeny of eutherian mammals, as revealed by subsampling analyses of loci and taxa, which produced strongly conflicting topologies. In contrast, the coalescent methods, which accommodate gene tree heterogeneity, yield a phylogeny that is robust to variable gene and taxon sampling and is congruent with geographic data. The data also demonstrate that incomplete lineage sorting, a major source of gene tree heterogeneity, is relevant to deep-level phylogenies, such as those among eutherian mammals. Our results firmly place the eutherian root between Atlantogenata and Boreoeutheria and support ungulate polyphyly and a sister-group relationship between Scandentia and Primates. This study demonstrates that the incongruence introduced by concatenation methods is a major cause of long-standing uncertainty in the phylogeny of eutherian mammals, and the same may apply to other clades. Our analyses suggest that such incongruence can be resolved using phylogenomic data and coalescent methods that deal explicitly with gene tree heterogeneity.
                Bookmark

                Author and article information

                Contributors
                Journal
                BMC Bioinformatics
                BMC Bioinformatics
                BMC Bioinformatics
                BioMed Central
                1471-2105
                2014
                12 June 2014
                : 15
                : 183
                Affiliations
                [1 ]Department of Biology, The National University of Ireland, Maynooth, Maynooth, Kildare, Ireland
                [2 ]Department of Life Sciences, The Natural History Museum, London SW7 5BD, UK
                [3 ]Institute of Biological, Environmental and Rural Sciences (IBERS), Aberystwyth University, Aberystwyth, Ceredigion SY23 3FG, UK
                [4 ]School of Biological Sciences and School of Earth Sciences, The University of Bristol, Woodland Road, BS8 1UG Bristol, UK
                Article
                1471-2105-15-183
                10.1186/1471-2105-15-183
                4073192
                24925766
                5d192a85-8436-4059-ae43-4aa0d15c5ace
                Copyright © 2014 Akanni et al.; licensee BioMed Central Ltd.

                This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver ( http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

                History
                : 30 January 2014
                : 2 June 2014
                Categories
                Software

                Bioinformatics & Computational biology
                supertrees,maximum likelihood,phylogenomics,tests of two trees

                Comments

                Comment on this article