4
views
0
recommends
+1 Recommend
1 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      The Easter Egg Weevil ( Pachyrhynchus) genome reveals syntenic patterns in Coleoptera across 200 million years of evolution

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Patterns of genomic architecture across insects remain largely undocumented or decoupled from a broader phylogenetic context. For instance, it is unknown whether translocation rates differ between insect orders. We address broad scale patterns of genome architecture across Insecta by examining synteny in a phylogenetic framework from open-source insect genomes. To accomplish this, we add a chromosome level genome to a crucial lineage, Coleoptera. Our assembly of the Pachyrhynchus sulphureomaculatus genome is the first chromosome scale genome for the hyperdiverse Phytophaga lineage and currently the largest insect genome assembled to this scale. The genome is significantly larger than those of other weevils, and this increase in size is caused by repetitive elements. Our results also indicate that, among beetles, there are instances of long-lasting (>200 Ma) localization of genes to a particular chromosome with few translocation events. While some chromosomes have a paucity of translocations, intra-chromosomal synteny was almost absent, with gene order thoroughly shuffled along a chromosome. This large amount of reshuffling within chromosomes with few inter-chromosomal events contrasts with patterns seen in mammals in which the chromosomes tend to exchange larger blocks of material more readily. To place our findings in an evolutionary context, we compared syntenic patterns across Insecta in a phylogenetic framework. For the first time, we find that synteny decays at an exponential rate relative to phylogenetic distance. Additionally, there are significant differences in decay rates between insect orders, this pattern was not driven by Lepidoptera alone which has a substantially different rate.

          Author summary

          Patterns of genomic architecture across insects remain largely undocumented or decoupled from a broader evolutionary context. For instance, it is unknown whether rates of gene order decay differ between insect orders. We address broad scale patterns of genome architecture across Insecta by examining synteny (shared gene order) in a phylogenetic framework from open-source insect genomes (143 complete chromosome assemblies in total). To accomplish this, we add a chromosome level genome to a crucial lineage, Coleoptera (beetles). Our assembly of the Easter Egg Weevil Pachyrhynchus sulphureomaculatus genome is the first chromosome scale genome for the hyperdiverse Phytophaga lineage and currently the largest insect genome assembled to this scale. We are the first to identify in beetles that genes stay localized on chromosomes for hundreds of millions of years, while their order along chromosomes gets completely shuffled over time. We are also the first to empirically demonstrate that synteny decay rates different significantly between insect orders and that this pattern in not driven solely by Lepidoptera (moths and butterflies), which has a substantially different rate.

          Related collections

          Most cited references105

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          The Sequence Alignment/Map format and SAMtools

          Summary: The Sequence Alignment/Map (SAM) format is a generic alignment format for storing read alignments against reference sequences, supporting short and long reads (up to 128 Mbp) produced by different sequencing platforms. It is flexible in style, compact in size, efficient in random access and is the format in which alignments from the 1000 Genomes Project are released. SAMtools implements various utilities for post-processing alignments in the SAM format, such as indexing, variant caller and alignment viewer, and thus provides universal tools for processing read alignments. Availability: http://samtools.sourceforge.net Contact: rd@sanger.ac.uk
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies

            Motivation: Phylogenies are increasingly used in all fields of medical and biological research. Moreover, because of the next-generation sequencing revolution, datasets used for conducting phylogenetic analyses grow at an unprecedented pace. RAxML (Randomized Axelerated Maximum Likelihood) is a popular program for phylogenetic analyses of large datasets under maximum likelihood. Since the last RAxML paper in 2006, it has been continuously maintained and extended to accommodate the increasingly growing input datasets and to serve the needs of the user community. Results: I present some of the most notable new features and extensions of RAxML, such as a substantial extension of substitution models and supported data types, the introduction of SSE3, AVX and AVX2 vector intrinsics, techniques for reducing the memory requirements of the code and a plethora of operations for conducting post-analyses on sets of trees. In addition, an up-to-date 50-page user manual covering all new RAxML options is available. Availability and implementation: The code is available under GNU GPL at https://github.com/stamatak/standard-RAxML. Contact: alexandros.stamatakis@h-its.org Supplementary information: Supplementary data are available at Bioinformatics online.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              fastp: an ultra-fast all-in-one FASTQ preprocessor

              Abstract Motivation Quality control and preprocessing of FASTQ files are essential to providing clean data for downstream analysis. Traditionally, a different tool is used for each operation, such as quality control, adapter trimming and quality filtering. These tools are often insufficiently fast as most are developed using high-level programming languages (e.g. Python and Java) and provide limited multi-threading support. Reading and loading data multiple times also renders preprocessing slow and I/O inefficient. Results We developed fastp as an ultra-fast FASTQ preprocessor with useful quality control and data-filtering features. It can perform quality control, adapter trimming, quality filtering, per-read quality pruning and many other operations with a single scan of the FASTQ data. This tool is developed in C++ and has multi-threading support. Based on our evaluation, fastp is 2–5 times faster than other FASTQ preprocessing tools such as Trimmomatic or Cutadapt despite performing far more operations than similar tools. Availability and implementation The open-source code and corresponding instructions are available at https://github.com/OpenGene/fastp.
                Bookmark

                Author and article information

                Contributors
                Role: ConceptualizationRole: Data curationRole: Formal analysisRole: Funding acquisitionRole: InvestigationRole: MethodologyRole: Project administrationRole: ResourcesRole: SoftwareRole: SupervisionRole: ValidationRole: VisualizationRole: Writing – original draftRole: Writing – review & editing
                Role: ConceptualizationRole: Funding acquisitionRole: InvestigationRole: Project administrationRole: ResourcesRole: Writing – original draftRole: Writing – review & editing
                Role: ConceptualizationRole: Data curationRole: Formal analysisRole: InvestigationRole: MethodologyRole: SoftwareRole: ValidationRole: VisualizationRole: Writing – review & editing
                Role: Data curationRole: Formal analysisRole: MethodologyRole: SoftwareRole: ValidationRole: VisualizationRole: Writing – review & editing
                Role: ConceptualizationRole: Data curationRole: Formal analysisRole: InvestigationRole: MethodologyRole: Writing – review & editing
                Role: Methodology
                Role: Data curationRole: MethodologyRole: SoftwareRole: Writing – review & editing
                Role: ConceptualizationRole: Data curationRole: ResourcesRole: SupervisionRole: Writing – review & editing
                Role: ConceptualizationRole: MethodologyRole: ResourcesRole: Writing – original draft
                Role: Editor
                Journal
                PLoS Genet
                PLoS Genet
                plos
                PLoS Genetics
                Public Library of Science (San Francisco, CA USA )
                1553-7390
                1553-7404
                30 August 2021
                August 2021
                : 17
                : 8
                : e1009745
                Affiliations
                [1 ] Entomology Department, Institute for Biodiversity Science and Sustainability, California Academy of Sciences, San Francisco, California, United States of America
                [2 ] Center for Comparative Genomics, Institute for Biodiversity Science and Sustainability, California Academy of Science, San Francisco, California, United States of America
                [3 ] Coleoptera Research Center, Institute for Biodiversity and Environment, University of Mindanao, Matina, Davao City, Philippines
                [4 ] School of Biology and Ecology, University of Maine, Orono, Maine, United States of America
                [5 ] The Center for Genome Architecture, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America
                The University of North Carolina at Chapel Hill, UNITED STATES
                Author notes

                The authors have declared that no competing interests exist.

                Author information
                https://orcid.org/0000-0002-7473-9727
                https://orcid.org/0000-0002-0980-1651
                https://orcid.org/0000-0003-4173-3756
                https://orcid.org/0000-0003-1336-2505
                https://orcid.org/0000-0001-9163-9544
                https://orcid.org/0000-0002-7782-6041
                Article
                PGENETICS-D-21-00087
                10.1371/journal.pgen.1009745
                8432895
                34460814
                048c64e6-d4ee-483b-9a4e-94969e808132
                © 2021 Van Dam et al

                This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

                History
                : 20 January 2021
                : 27 July 2021
                Page count
                Figures: 8, Tables: 3, Pages: 27
                Product
                Funding
                Funded by: national science foundation, division of environmental biology
                Award ID: 1856402
                Award Recipient :
                MHVD received grant funding from the NSF:DEB:1856402 ( https://www.nsf.gov/). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
                Categories
                Research Article
                Biology and Life Sciences
                Genetics
                Genomics
                Animal Genomics
                Invertebrate Genomics
                Biology and Life Sciences
                Zoology
                Entomology
                Insects
                Beetles
                Biology and Life Sciences
                Organisms
                Eukaryota
                Animals
                Invertebrates
                Arthropoda
                Insects
                Beetles
                Biology and Life Sciences
                Zoology
                Animals
                Invertebrates
                Arthropoda
                Insects
                Beetles
                Biology and Life Sciences
                Computational Biology
                Genome Analysis
                Biology and Life Sciences
                Genetics
                Genomics
                Genome Analysis
                Biology and Life Sciences
                Zoology
                Entomology
                Insects
                Biology and Life Sciences
                Organisms
                Eukaryota
                Animals
                Invertebrates
                Arthropoda
                Insects
                Biology and Life Sciences
                Zoology
                Animals
                Invertebrates
                Arthropoda
                Insects
                Biology and Life Sciences
                Evolutionary Biology
                Evolutionary Systematics
                Phylogenetics
                Animal Phylogenetics
                Biology and Life Sciences
                Taxonomy
                Evolutionary Systematics
                Phylogenetics
                Animal Phylogenetics
                Computer and Information Sciences
                Data Management
                Taxonomy
                Evolutionary Systematics
                Phylogenetics
                Animal Phylogenetics
                Biology and Life Sciences
                Zoology
                Animal Phylogenetics
                Biology and Life Sciences
                Evolutionary Biology
                Evolutionary Systematics
                Phylogenetics
                Phylogenetic Analysis
                Biology and Life Sciences
                Taxonomy
                Evolutionary Systematics
                Phylogenetics
                Phylogenetic Analysis
                Computer and Information Sciences
                Data Management
                Taxonomy
                Evolutionary Systematics
                Phylogenetics
                Phylogenetic Analysis
                Biology and Life Sciences
                Cell Biology
                Chromosome Biology
                Chromosomal Aberrations
                Chromosomal Translocations
                Biology and Life Sciences
                Zoology
                Entomology
                Insects
                Moths and Butterflies
                Biology and Life Sciences
                Organisms
                Eukaryota
                Animals
                Invertebrates
                Arthropoda
                Insects
                Moths and Butterflies
                Biology and Life Sciences
                Zoology
                Animals
                Invertebrates
                Arthropoda
                Insects
                Moths and Butterflies
                Custom metadata
                vor-update-to-uncorrected-proof
                2021-09-10
                Part of the data are within the manuscript and its Supporting Information files and the other part is deposited in https://doi.org/10.5061/dryad.cz8w9gj48 with identical data also in https://www.dnazoo.org/assemblies/Pachyrhynchus_sulphureomaculatus Genome assembly from this project are archived under NCBI accession GCA_019049505.1, RNA reads via SRA, bioproject: PRJNA731935.

                Genetics
                Genetics

                Comments

                Comment on this article