23
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      The genome sequence and transcriptome of Potentilla micrantha and their comparison to Fragaria vesca (the woodland strawberry)

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Background

          The genus Potentilla is closely related to that of Fragaria, the economically important strawberry genus. Potentilla micrantha is a species that does not develop berries but shares numerous morphological and ecological characteristics with Fragaria vesca. These similarities make P. micrantha an attractive choice for comparative genomics studies with F. vesca.

          Findings

          In this study, the P. micrantha genome was sequenced and annotated, and RNA-Seq data from the different developmental stages of flowering and fruiting were used to develop a set of gene predictions. A 327 Mbp sequence and annotation of the genome of P. micrantha, spanning 2674 sequence contigs, with an N50 size of 335,712, estimated to cover 80% of the total genome size of the species was developed. The genus Potentilla has a characteristically larger genome size than Fragaria, but the recovered sequence scaffolds were remarkably collinear at the micro-syntenic level with the genome of F. vesca, its closest sequenced relative. A total of 33,602 genes were predicted, and 95.1% of bench-marking universal single-copy orthologous genes were complete within the presented sequence. Thus, we argue that the majority of the gene-rich regions of the genome have been sequenced.

          Conclusions

          Comparisons of RNA-Seq data from the stages of floral and fruit development revealed genes differentially expressed between P. micrantha and F. vesca.The data presented are a valuable resource for future studies of berry development in Fragaria and the Rosaceae and they also shed light on the evolution of genome size and organization in this family.

          Related collections

          Most cited references31

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          InParanoid 7: new algorithms and tools for eukaryotic orthology analysis

          The InParanoid project gathers proteomes of completely sequenced eukaryotic species plus Escherichia coli and calculates pairwise ortholog relationships among them. The new release 7.0 of the database has grown by an order of magnitude over the previous version and now includes 100 species and their collective 1.3 million proteins organized into 42.7 million pairwise ortholog groups. The InParanoid algorithm itself has been revised and is now both more specific and sensitive. Based on results from our recent benchmarking of low-complexity filters in homology assignment, a two-pass BLAST approach was developed that makes use of high-precision compositional score matrix adjustment, but avoids the alignment truncation that sometimes follows. We have also updated the InParanoid web site (http://InParanoid.sbc.su.se). Several features have been added, the response times have been improved and the site now sports a new, clearer look. As the number of ortholog databases has grown, it has become difficult to compare among these resources due to a lack of standardized source data and incompatible representations of ortholog relationships. To facilitate data exchange and comparisons among ortholog databases, we have developed and are making available two XML schemas: SeqXML for the input sequences and OrthoXML for the output ortholog clusters.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            RepeatExplorer: a Galaxy-based web server for genome-wide characterization of eukaryotic repetitive elements from next-generation sequence reads.

            Repetitive DNA makes up large portions of plant and animal nuclear genomes, yet it remains the least-characterized genome component in most species studied so far. Although the recent availability of high-throughput sequencing data provides necessary resources for in-depth investigation of genomic repeats, its utility is hampered by the lack of specialized bioinformatics tools and appropriate computational resources that would enable large-scale repeat analysis to be run by biologically oriented researchers. Here we present RepeatExplorer, a collection of software tools for characterization of repetitive elements, which is accessible via web interface. A key component of the server is the computational pipeline using a graph-based sequence clustering algorithm to facilitate de novo repeat identification without the need for reference databases of known elements. Because the algorithm uses short sequences randomly sampled from the genome as input, it is ideal for analyzing next-generation sequence reads. Additional tools are provided to aid in classification of identified repeats, investigate phylogenetic relationships of retroelements and perform comparative analysis of repeat composition between multiple species. The server allows to analyze several million sequence reads, which typically results in identification of most high and medium copy repeats in higher plant genomes.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Rapid recent growth and divergence of rice nuclear genomes.

              By employing the nuclear DNA of the African rice Oryza glaberrima as a reference genome, the timing, natures, mechanisms, and specificities of recent sequence evolution in the indica and japonica subspecies of Oryza sativa were identified. The data indicate that the genome sizes of both indica and japonica have increased substantially, >2% and >6%, respectively, since their divergence from a common ancestor, mainly because of the amplification of LTR-retrotransposons. However, losses of all classes of DNA sequence through unequal homologous recombination and illegitimate recombination have attenuated the growth of the rice genome. Small deletions have been particularly frequent throughout the genome. In >1 Mb of orthologous regions that we analyzed, no cases of complete gene acquisition or loss from either indica or japonica were found, nor was any example of precise transposon excision detected. The sequences between genes were observed to have a very high rate of divergence, indicating a molecular clock for transposable elements that is at least 2-fold more rapid than synonymous base substitutions within genes. We found that regions prone to frequent insertions and deletions also exhibit higher levels of point mutation. These results indicate a highly dynamic rice genome with competing processes for the generation and removal of genetic variation.
                Bookmark

                Author and article information

                Journal
                Gigascience
                Gigascience
                gigascience
                GigaScience
                Oxford University Press
                2047-217X
                April 2018
                15 February 2017
                15 February 2017
                : 7
                : 4
                : 1-14
                Affiliations
                [1 ]Department of Genomics and Biology of Fruit Crops, Research and Innovation Centre, Fondazione Edmund Mach (FEM) Via E. Mach 1, 38010 San Michele all'Adige, Italy
                [2 ]Center for the Development and Improvement of Agri-Food Resources (BIOGEST-SITEIA) University of Modena and Reggio Emilia, P.le Europa 1, 42124 Reggio nell'Emilia (RE), Italy
                [3 ]Unit of Computational Biology, Research and Innovation Centre, Fondazione Edmund Mach (FEM) Via E. Mach 1, 38010 San Michele all'Adige, Italy
                [4 ]Department of Agricultural, Food, and Environmental Sciences, University of Pisa, Pisa I-56124, Italy
                [5 ]Department of Agronomy, Food, Natural Resources, Animals and Environment, University of Padova Agripolis, V.le dell'Università 16, 35020 Legnaro (PD), Italy
                [6 ]Dipartimento di Bioscienze e Centro di Ricerca Pediatrica Romeo ed Enrica Invernizzi, Università degli Studi di Milano, Via Celoria 26, 20133 Milano
                [7 ]Wallace H. Coulter Department of Biomedical Engineering, Georgia Tech, Atlanta, Georgia
                [8 ]Driscoll's Strawberry Associates, Cassin Ranch, 121 Silliman Drive, Watsonville, California
                [9 ]Department of Biodiversity and Molecular Ecology, Research and Innovation Centre, Fondazione Edmund Mach (FEM) Via E. Mach 1, 38010 San Michele all'Adige, Italy
                [10 ]Driscoll's Genetics Limited, East Malling Enterprise Centre, New Road, East Malling, Kent ME19 6BJ, UK
                Author notes
                Corresponding author: Dr Daniel James Sargent, Driscoll's Genetics Limited, East Malling Enterprise Centre, New Road, East Malling, Kent ME19 6BJ, UK. E-mail: sargentdj@ 123456gmail.com
                Article
                giy010
                10.1093/gigascience/giy010
                5893959
                29659812
                417b3fb5-5c97-40d1-93f0-f18d1ce2b01d
                © The Author(s) 2018. Published by Oxford University Press.

                This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.

                History
                : 07 February 2018
                : 27 June 2017
                : 12 November 2017
                Page count
                Pages: 14
                Categories
                Data Note

                long-read sequencing,evolutionary development,angiosperms,genome sequence,transcriptomics

                Comments

                Comment on this article