26
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Experimental validation of in silico predicted RAD locus frequencies using genomic resources and short read data from a model marine mammal

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Background

          Restriction site-associated DNA sequencing (RADseq) has revolutionized the study of wild organisms by allowing cost-effective genotyping of thousands of loci. However, for species lacking reference genomes, it can be challenging to select the restriction enzyme that offers the best balance between the number of obtained RAD loci and depth of coverage, which is crucial for a successful outcome. To address this issue, PredRAD was recently developed, which uses probabilistic models to predict restriction site frequencies from a transcriptome assembly or other sequence resource based on either GC content or mono-, di- or trinucleotide composition. This program generates predictions that are broadly consistent with estimates of the true number of restriction sites obtained through in silico digestion of available reference genome assemblies. However, in practice the actual number of loci obtained could potentially differ as incomplete enzymatic digestion or patchy sequence coverage across the genome might lead to some loci not being represented in a RAD dataset, while erroneous assembly could potentially inflate the number of loci. To investigate this, we used genome and transcriptome assemblies together with RADseq data from the Antarctic fur seal ( Arctocephalus gazella) to compare PredRAD predictions with empirical estimates of the number of loci obtained via in silico digestion and from de novo assemblies.

          Results

          PredRAD yielded consistently higher predicted numbers of restriction sites for the transcriptome assembly relative to the genome assembly. The trinucleotide and dinucleotide models also predicted higher frequencies than the mononucleotide or GC content models. Overall, the dinucleotide and trinucleotide models applied to the transcriptome and the genome assemblies respectively generated predictions that were closest to the number of restriction sites estimated by in silico digestion. Furthermore, the number of de novo assembled RAD loci mapping to restriction sites was similar to the expectation based on in silico digestion.

          Conclusions

          Our study reveals generally high concordance between PredRAD predictions and empirical estimates of the number of RAD loci. This further supports the utility of PredRAD, while also suggesting that it may be feasible to sequence and assemble the majority of RAD loci present in an organism’s genome.

          Electronic supplementary material

          The online version of this article (10.1186/s12864-019-5440-8) contains supplementary material, which is available to authorized users.

          Related collections

          Most cited references22

          • Record: found
          • Abstract: found
          • Article: not found

          RADSeq: next-generation population genetics.

          Next-generation sequencing technologies are making a substantial impact on many areas of biology, including the analysis of genetic diversity in populations. However, genome-scale population genetic studies have been accessible only to well-funded model systems. Restriction-site associated DNA sequencing, a method that samples at reduced complexity across target genomes, promises to deliver high resolution population genomic data-thousands of sequenced markers across many individuals-for any organism at reasonable costs. It has found application in wild populations and non-traditional study species, and promises to become an important technology for ecological population genomics.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Restriction site-associated DNA sequencing, genotyping error estimation and de novo assembly optimization for population genetic inference.

            Restriction site-associated DNA sequencing (RADseq) provides researchers with the ability to record genetic polymorphism across thousands of loci for nonmodel organisms, potentially revolutionizing the field of molecular ecology. However, as with other genotyping methods, RADseq is prone to a number of sources of error that may have consequential effects for population genetic inferences, and these have received only limited attention in terms of the estimation and reporting of genotyping error rates. Here we use individual sample replicates, under the expectation of identical genotypes, to quantify genotyping error in the absence of a reference genome. We then use sample replicates to (i) optimize de novo assembly parameters within the program Stacks, by minimizing error and maximizing the retrieval of informative loci; and (ii) quantify error rates for loci, alleles and single-nucleotide polymorphisms. As an empirical example, we use a double-digest RAD data set of a nonmodel plant species, Berberis alpina, collected from high-altitude mountains in Mexico. © 2014 John Wiley & Sons Ltd.
              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              Bioinformatic processing of RAD-seq data dramatically impacts downstream population genetic inference

                Bookmark

                Author and article information

                Contributors
                +49 521 1062725 , david.vendrami@student.unife.it
                jfor@bas.ac.uk
                joseph.hoffman@uni-bielefeld.de
                Journal
                BMC Genomics
                BMC Genomics
                BMC Genomics
                BioMed Central (London )
                1471-2164
                22 January 2019
                22 January 2019
                2019
                : 20
                : 72
                Affiliations
                [1 ]ISNI 0000 0001 0944 9128, GRID grid.7491.b, Department of Animal Behavior, , University of Bielefeld, ; Postfach 100131, 33615 Bielefeld, Germany
                [2 ]ISNI 0000 0004 0598 3800, GRID grid.478592.5, British Antarctic Survey, ; High Cross, Madingley Road, Cambridge, CB3 OET UK
                Author information
                http://orcid.org/0000-0001-9409-4084
                Article
                5440
                10.1186/s12864-019-5440-8
                6341687
                1432af34-a059-491c-9d17-0b285d33e428
                © The Author(s). 2019

                Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License ( http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

                History
                : 12 April 2018
                : 8 January 2019
                Funding
                Funded by: European Union Marie Curie Seventh Framework Programme
                Award ID: 605051
                Funded by: Deutsche Forschungsgemeinschaft
                Award ID: HO 5122/3-1
                Categories
                Research Article
                Custom metadata
                © The Author(s) 2019

                Genetics
                restriction site associated dna sequencing (radseq),restriction enzyme,reference genome,transcriptome assembly,predrad,antarctic fur seal,pinniped

                Comments

                Comment on this article