0
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Genome assembly of Luehdorfia taibai, an endangered butterfly endemic to Qinling Moutains in China with extremely small populations

      , , , , ,
      Frontiers in Ecology and Evolution
      Frontiers Media SA

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Conservation genomic resources over the past decade has drastically improved, since genomes can be used to predict diverse parameters vital to conservation management. Luehdorfia taibai is an endemic butterfly only found in restricted aeras in middle-west China and is critically endangered. It was classfied as a vunerlable (VN) species in the “China species red list.” Here we generated 34.38 Gb of raw DNA sequencing reads and obtained a high-qualified draft genome assembly of L. taibai. The final genome is ~683.3 Mb, with contig N50 size of 10.19 Mb. Further, 98.6% of single-copy orthologous genes have been recovered by BUSCO. An estimated 42.34% of the genome of L. taibai consists of repetitive elements. Combined with gene prediction and transcriptome sequencing, genome annotation produced 15,968 protein-coding genes. Additionally, a nearly 1:1 orthology ratio of syntenic blocks between L. taibai and its closest genome Luehdorfia chinensis suggested that the genome structures have not changed much after speciation. The genome of L. taibai have not undergone a whole genome duplication event. Population dynamics analyses indicates that L. taibai has an extremely low heterozygosity of 0.057%, and its population size has declined dramatically over the past 10 thousand years. Our study describes a draft genome assembly of the L. taibai, the first implication of this species. We consider the globally overexploited of the host plants is not the main reason to threaten L. taibai. The genome will provide advice for the conservation to the economically important Luehdorfia lineage and this specific species.

          Related collections

          Most cited references45

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          The Sequence Alignment/Map format and SAMtools

          Summary: The Sequence Alignment/Map (SAM) format is a generic alignment format for storing read alignments against reference sequences, supporting short and long reads (up to 128 Mbp) produced by different sequencing platforms. It is flexible in style, compact in size, efficient in random access and is the format in which alignments from the 1000 Genomes Project are released. SAMtools implements various utilities for post-processing alignments in the SAM format, such as indexing, variant caller and alignment viewer, and thus provides universal tools for processing read alignments. Availability: http://samtools.sourceforge.net Contact: rd@sanger.ac.uk
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            MAFFT Multiple Sequence Alignment Software Version 7: Improvements in Performance and Usability

            We report a major update of the MAFFT multiple sequence alignment program. This version has several new features, including options for adding unaligned sequences into an existing alignment, adjustment of direction in nucleotide alignment, constrained alignment and parallel processing, which were implemented after the previous major update. This report shows actual examples to explain how these features work, alone and in combination. Some examples incorrectly aligned by MAFFT are also shown to clarify its limitations. We discuss how to avoid misalignments, and our ongoing efforts to overcome such limitations.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              edgeR: a Bioconductor package for differential expression analysis of digital gene expression data

              Summary: It is expected that emerging digital gene expression (DGE) technologies will overtake microarray technologies in the near future for many functional genomics applications. One of the fundamental data analysis tasks, especially for gene expression studies, involves determining whether there is evidence that counts for a transcript or exon are significantly different across experimental conditions. edgeR is a Bioconductor software package for examining differential expression of replicated count data. An overdispersed Poisson model is used to account for both biological and technical variability. Empirical Bayes methods are used to moderate the degree of overdispersion across transcripts, improving the reliability of inference. The methodology can be used even with the most minimal levels of replication, provided at least one phenotype or experimental condition is replicated. The software may have other applications beyond sequencing data, such as proteome peptide count data. Availability: The package is freely available under the LGPL licence from the Bioconductor web site (http://bioconductor.org). Contact: mrobinson@wehi.edu.au
                Bookmark

                Author and article information

                Journal
                Frontiers in Ecology and Evolution
                Front. Ecol. Evol.
                Frontiers Media SA
                2296-701X
                August 4 2022
                August 4 2022
                : 10
                Article
                10.3389/fevo.2022.955246
                ecd99281-52be-4fce-9c60-e3a87c2c4b12
                © 2022

                Free to read

                https://creativecommons.org/licenses/by/4.0/

                History

                Comments

                Comment on this article