3
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      PhyloHerb: A high‐throughput phylogenomic pipeline for processing genome skimming data

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Premise

          The application of high‐throughput sequencing, especially to herbarium specimens, is rapidly accelerating biodiversity research. Low‐coverage sequencing of total genomic DNA (genome skimming) is particularly promising and can simultaneously recover the plastid, mitochondrial, and nuclear ribosomal regions across hundreds of species. Here, we introduce PhyloHerb, a bioinformatic pipeline to efficiently assemble phylogenomic data sets derived from genome skimming.

          Methods and Results

          PhyloHerb uses either a built‐in database or user‐specified references to extract orthologous sequences from all three genomes using a BLAST search. It outputs FASTA files and offers a suite of utility functions to assist with alignment, partitioning, concatenation, and phylogeny inference. The program is freely available at https://github.com/lmcai/PhyloHerb/.

          Conclusions

          We demonstrate that PhyloHerb can accurately identify genes using a published data set from Clusiaceae. We also show via simulations that our approach is effective for highly fragmented assemblies from herbarium specimens and is scalable to thousands of species.

          Related collections

          Most cited references35

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          MAFFT Multiple Sequence Alignment Software Version 7: Improvements in Performance and Usability

          We report a major update of the MAFFT multiple sequence alignment program. This version has several new features, including options for adding unaligned sequences into an existing alignment, adjustment of direction in nucleotide alignment, constrained alignment and parallel processing, which were implemented after the previous major update. This report shows actual examples to explain how these features work, alone and in combination. Some examples incorrectly aligned by MAFFT are also shown to clarify its limitations. We discuss how to avoid misalignments, and our ongoing efforts to overcome such limitations.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies

            Motivation: Phylogenies are increasingly used in all fields of medical and biological research. Moreover, because of the next-generation sequencing revolution, datasets used for conducting phylogenetic analyses grow at an unprecedented pace. RAxML (Randomized Axelerated Maximum Likelihood) is a popular program for phylogenetic analyses of large datasets under maximum likelihood. Since the last RAxML paper in 2006, it has been continuously maintained and extended to accommodate the increasingly growing input datasets and to serve the needs of the user community. Results: I present some of the most notable new features and extensions of RAxML, such as a substantial extension of substitution models and supported data types, the introduction of SSE3, AVX and AVX2 vector intrinsics, techniques for reducing the memory requirements of the code and a plethora of operations for conducting post-analyses on sets of trees. In addition, an up-to-date 50-page user manual covering all new RAxML options is available. Availability and implementation: The code is available under GNU GPL at https://github.com/stamatak/standard-RAxML. Contact: alexandros.stamatakis@h-its.org Supplementary information: Supplementary data are available at Bioinformatics online.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              IQ-TREE 2: New Models and Efficient Methods for Phylogenetic Inference in the Genomic Era

              Abstract IQ-TREE (http://www.iqtree.org, last accessed February 6, 2020) is a user-friendly and widely used software package for phylogenetic inference using maximum likelihood. Since the release of version 1 in 2014, we have continuously expanded IQ-TREE to integrate a plethora of new models of sequence evolution and efficient computational approaches of phylogenetic inference to deal with genomic data. Here, we describe notable features of IQ-TREE version 2 and highlight the key advantages over other software.
                Bookmark

                Author and article information

                Contributors
                lmcai@utexas.edu
                cdavis@oeb.harvard.edu
                Journal
                Appl Plant Sci
                Appl Plant Sci
                10.1002/(ISSN)2168-0450
                APS3
                Applications in Plant Sciences
                John Wiley and Sons Inc. (Hoboken )
                2168-0450
                02 June 2022
                May-Jun 2022
                : 10
                : 3 ( doiID: 10.1002/aps3.v10.3 )
                : e11475
                Affiliations
                [ 1 ] Harvard University Herbaria 22 Divinity Avenue, Cambridge Massachusetts 02138 USA
                [ 2 ] Department of Integrative Biology University of Texas at Austin Austin Texas 78712 USA
                [ 3 ] Department of Botany and Plant Sciences University of California Riverside California 92507 USA
                Author notes
                [*] [* ] Correspondence Liming Cai, Department of Integrative Biology, 205 W. 24th Street, Room 201, University of Texas at Austin, Austin, Texas 78712, USA.

                Email: lmcai@ 123456utexas.edu

                Charles C. Davis, Harvard University Herbaria, 22 Divinity Avenue, Cambridge, Massachusetts 02138, USA.

                Email: cdavis@ 123456oeb.harvard.edu

                Author information
                http://orcid.org/0000-0002-8982-2435
                http://orcid.org/0000-0002-4845-260X
                http://orcid.org/0000-0001-8747-1101
                Article
                APS311475
                10.1002/aps3.11475
                9215275
                35774988
                5fdeeb00-1cf4-45a5-9653-e19e86d9a072
                © 2022 The Authors. Applications in Plant Sciences published by Wiley Periodicals LLC on behalf of Botanical Society of America.

                This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes.

                History
                : 07 March 2022
                : 14 November 2021
                : 07 March 2022
                Page count
                Figures: 3, Tables: 1, Pages: 9, Words: 6442
                Categories
                Software Note
                Software Note
                Custom metadata
                2.0
                May-June 2022
                Converter:WILEY_ML3GV2_TO_JATSPMC version:6.1.7 mode:remove_FC converted:22.06.2022

                herbariomics,high‐throughput sequencing,mitochondria,plastome,ribosomal genes

                Comments

                Comment on this article