4
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Allele Phasing Greatly Improves the Phylogenetic Utility of Ultraconserved Elements

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Advances in high-throughput sequencing techniques now allow relatively easy and affordable sequencing of large portions of the genome, even for nonmodel organisms. Many phylogenetic studies reduce costs by focusing their sequencing efforts on a selected set of targeted loci, commonly enriched using sequence capture. The advantage of this approach is that it recovers a consistent set of loci, each with high sequencing depth, which leads to more confidence in the assembly of target sequences. High sequencing depth can also be used to identify phylogenetically informative allelic variation within sequenced individuals, but allele sequences are infrequently assembled in phylogenetic studies. Instead, many scientists perform their phylogenetic analyses using contig sequences which result from the de novo assembly of sequencing reads into contigs containing only canonical nucleobases, and this may reduce both statistical power and phylogenetic accuracy. Here, we develop an easy-to-use pipeline to recover allele sequences from sequence capture data, and we use simulated and empirical data to demonstrate the utility of integrating these allele sequences to analyses performed under the multispecies coalescent model. Our empirical analyses of ultraconserved element locus data collected from the South American hummingbird genus Topaza demonstrate that phased allele sequences carry sufficient phylogenetic information to infer the genetic structure, lineage divergence, and biogeographic history of a genus that diversified during the last 3 myr. The phylogenetic results support the recognition of two species and suggest a high rate of gene flow across large distances of rainforest habitats but rare admixture across the Amazon River. Our simulations provide evidence that analyzing allele sequences leads to more accurate estimates of tree topology and divergence times than the more common approach of using contig sequences.

          Related collections

          Most cited references45

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          ASTRAL: genome-scale coalescent-based species tree estimation

          Motivation: Species trees provide insight into basic biology, including the mechanisms of evolution and how it modifies biomolecular function and structure, biodiversity and co-evolution between genes and species. Yet, gene trees often differ from species trees, creating challenges to species tree estimation. One of the most frequent causes for conflicting topologies between gene trees and species trees is incomplete lineage sorting (ILS), which is modelled by the multi-species coalescent. While many methods have been developed to estimate species trees from multiple genes, some which have statistical guarantees under the multi-species coalescent model, existing methods are too computationally intensive for use with genome-scale analyses or have been shown to have poor accuracy under some realistic conditions. Results: We present ASTRAL, a fast method for estimating species trees from multiple genes. ASTRAL is statistically consistent, can run on datasets with thousands of genes and has outstanding accuracy—improving on MP-EST and the population tree from BUCKy, two statistically consistent leading coalescent-based methods. ASTRAL is often more accurate than concatenation using maximum likelihood, except when ILS levels are low or there are too few gene trees. Availability and implementation: ASTRAL is available in open source form at https://github.com/smirarab/ASTRAL/. Datasets studied in this article are available at http://www.cs.utexas.edu/users/phylo/datasets/astral. Contact: warnow@illinois.edu Supplementary information: Supplementary data are available at Bioinformatics online.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Ultraconserved elements anchor thousands of genetic markers spanning multiple evolutionary timescales.

            Although massively parallel sequencing has facilitated large-scale DNA sequencing, comparisons among distantly related species rely upon small portions of the genome that are easily aligned. Methods are needed to efficiently obtain comparable DNA fragments prior to massively parallel sequencing, particularly for biologists working with non-model organisms. We introduce a new class of molecular marker, anchored by ultraconserved genomic elements (UCEs), that universally enable target enrichment and sequencing of thousands of orthologous loci across species separated by hundreds of millions of years of evolution. Our analyses here focus on use of UCE markers in Amniota because UCEs and phylogenetic relationships are well-known in some amniotes. We perform an in silico experiment to demonstrate that sequence flanking 2030 UCEs contains information sufficient to enable unambiguous recovery of the established primate phylogeny. We extend this experiment by performing an in vitro enrichment of 2386 UCE-anchored loci from nine, non-model avian species. We then use alignments of 854 of these loci to unambiguously recover the established evolutionary relationships within and among three ancient bird lineages. Because many organismal lineages have UCEs, this type of genetic marker and the analytical framework we outline can be applied across the tree of life, potentially reshaping our understanding of phylogeny at many taxonomic levels.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Multiple alignment of DNA sequences with MAFFT.

              Multiple alignment of DNA sequences is an important step in various molecular biological analyses. As a large amount of sequence data is becoming available through genome and other large-scale sequencing projects, scalability, as well as accuracy, is currently required for a multiple sequence alignment (MSA) program. In this chapter, we outline the algorithms of an MSA program MAFFT and provide practical advice, focusing on several typical situations a biologist sometimes faces. For genome alignment, which is beyond the scope of MAFFT, we introduce two tools: TBA and MAUVE.
                Bookmark

                Author and article information

                Contributors
                Role: Associate Editor
                Journal
                Syst Biol
                Syst. Biol
                sysbio
                Systematic Biology
                Oxford University Press
                1063-5157
                1076-836X
                January 2019
                15 May 2018
                15 May 2018
                : 68
                : 1
                : 32-46
                Affiliations
                [1 ]Department of Biological and Environmental Sciences, University of Gothenburg, SE-413 19 Göteborg, Sweden
                [2 ]Gothenburg Global Biodiversity Centre, Box 461, SE-405 30 Göteborg, Sweden
                [3 ]Universidade Federal Rural de Pernambuco, Serra Talhada, Brazil
                [4 ]Department of Marine Sciences, University of Gothenburg, SE-413 19 Göteborg, Sweden
                [5 ]Museu Paraense Emílio Goeldi, Collection of Birds, Belém, Brazil
                [6 ]Department of Biological Sciences and Museum of Natural Science, Louisiana State University, Baton Rouge, LA, USA
                [7 ]Gothenburg Botanical Garden, SE-413 19 Göteborg, Sweden
                [8 ]Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA, USA
                Author notes
                Correspondence to be sent to: Department of Biological and Environmental Sciences, University of Gothenburg, Carl Skottsbergs Gata 22B, SE-413 19 Göteborg, Sweden E-mail: tobias.andermann@ 123456bioenv.gu.se
                Article
                syy039
                10.1093/sysbio/syy039
                6292485
                29771371
                d8d4b7fd-9a5d-40e6-8abc-48ad6cb5e4c6
                © The Author(s) 2017. Published by Oxford University Press, on behalf of the Society of Systematic Biologists.

                This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License ( http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For permissions, please email: journals.permissions@ 123456oup.com

                History
                : 21 December 2017
                : 9 May 2018
                : 10 May 2018
                Page count
                Pages: 15
                Funding
                Funded by: Swedish Research Council
                Award ID: B0569601
                Award ID: 2012-3917
                Funded by: CNPq
                Award ID: 310593/2009-3
                Funded by: INCT em Biodiversidade e Uso da Terra da Amazônia
                Award ID: 574008/2008-0
                Award ID: 563236/2010-8
                Award ID: 471342/2011-4
                Funded by: FAPESPA
                Award ID: ICAAF 023/2011
                Funded by: NSF-FAPESP
                Award ID: 1241066
                Funded by: European Research Council under the European Union’s Seventh Framework Programme
                Award ID: FP/2007-2013
                Funded by: ERC 331024
                Funded by: Swedish Foundation for Strategic Research
                Funded by: Faculty of Sciences at the University of Gothenburg
                Funded by: Wenner-Gren Foundations
                Funded by: Harvard University 10.13039/100007229
                Funded by: Wallenberg Academy Fellowship
                Categories
                Regular Articles

                Animal science & Zoology
                aves,gene tree,heterozygous sites,mitochondrial genome,snp,species tree,target enrichment,trochilidae

                Comments

                Comment on this article