
      Genomic exploration and molecular marker development in a large and complex conifer genome using RADseq and mRNAseq.


          Abstract

          We combined restriction site associated DNA sequencing (RADseq) using a hypomethylation-sensitive enzyme and messenger RNA sequencing (mRNAseq) to develop molecular markers for the 16-gigabase genome of Cedrus atlantica, a conifer tree species. With each method, Illumina® reads from one individual were used to generate de novo assemblies. SNPs from the RADseq data set were detected in a panel of one individual and three pools of three individuals each. We developed a flexible script to estimate the ascertainment bias in SNP detection, considering the pooling and sampling effects on the probability of not detecting an existing polymorphism. Gene Ontology (GO) and transposable element (TE) search analyses were applied to both data sets. The RADseq and the mRNAseq assemblies represented 0.1% and 0.6% of the genome, respectively. Genome complexity reduction resulted in 17% of the RADseq contigs potentially coding for proteins. This rate was doubled in the mRNAseq data set, suggesting that RADseq also explores noncoding low-repeat regions. The two methods gave very similar GO-slim profiles. As expected, the two assemblies were poor in TE-like sequences (<4% of contig length). We identified 17,348 single nucleotide polymorphisms (SNPs) in the RADseq data set and 5,714 simple sequence repeats (SSRs) in the transcriptome. A subset of 282 SNPs was validated using the Fluidigm genotyping technology, giving a conversion rate of 50.4%, which falls within the expected range for conifers. Increasing sample size had the greatest effect on reducing ascertainment bias. These results validated the utility of the RADseq approach for highly complex genomes such as those of conifers.
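
          The sampling effect mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' script, which is not reproduced on this page. It assumes that chromosomes are drawn independently from the population and that a SNP goes undetected only when no sampled chromosome carries the minor allele. Pooling effects and sequencing depth are not modelled, and the function name and parameters are hypothetical.

```python
def prob_missed_by_sampling(maf: float, n_individuals: int, ploidy: int = 2) -> float:
    """Probability that a sample contains no copy of the minor allele.

    Assumption (not from the article): chromosomes are sampled
    independently, so each one carries the major allele with
    probability (1 - maf).
    """
    if not 0.0 <= maf <= 0.5:
        raise ValueError("minor allele frequency must lie in [0, 0.5]")
    if n_individuals < 1 or ploidy < 1:
        raise ValueError("n_individuals and ploidy must be positive")
    # All ploidy * n_individuals sampled chromosomes must carry the major allele.
    return (1.0 - maf) ** (ploidy * n_individuals)


if __name__ == "__main__":
    # Discovery panel from the abstract: 1 individual + 3 pools of 3 = 10 individuals.
    for maf in (0.05, 0.10, 0.25, 0.50):
        missed = prob_missed_by_sampling(maf, n_individuals=10)
        print(f"MAF={maf:.2f}  P(not sampled)={missed:.4f}")
```

          Under these assumptions the probability of missing a polymorphism shrinks geometrically with the number of sampled chromosomes, which is consistent with the abstract's statement that increasing sample size had the greatest effect on reducing ascertainment bias.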


          Author and article information

          Journal
          Mol Ecol Resour
          Molecular ecology resources
          1755-0998
          1755-098X
          May 2015
          Volume: 15
          Issue: 3
          Affiliations
          [1] INRA, UR 629 Ecologie des Forêts Méditerranéennes, URFM, Avignon, France.
          Article
          DOI: 10.1111/1755-0998.12329
          PMID: 25224750
          © 2014 John Wiley & Sons Ltd.

          Cedrus atlantica, RADseq, SNP, SSR, next generation sequencing, transcriptome
