
      Next-generation sequencing for molecular ecology: a caveat regarding pooled samples.


          Abstract

          We develop a model based on the Dirichlet-compound multinomial distribution (CMD) and the Ewens sampling formula to predict the fraction of SNP loci that will appear fixed for alternate alleles between two pooled samples drawn from the same underlying population. We apply this model to next-generation sequencing (NGS) data from Baltic Sea herring recently published by Corander et al. (2013, Molecular Ecology, 2931-2940), and show that there are many more fixed loci than expected in the absence of genetic structure. However, we show through coalescent simulations that the degree of population structure required to explain the fraction of alternatively fixed SNPs is extraordinarily high, and that the surplus of fixed loci is more likely a consequence of limited representation of individual gene copies in the pooled samples than of population structure. Our analysis signals that the use of NGS on pooled samples to identify divergent SNPs warrants caution. With pooled samples, it is hard to diagnose when an NGS experiment has gone awry. Especially when NGS data on pooled samples are of low read depth with a limited number of individuals, it may be worthwhile to temper claims of unexpected population differentiation from pooled samples, pending verification with more reliable methods or stricter adherence to recommended sampling designs for pooled sequencing (e.g. Futschik & Schlötterer 2010, Genetics, 186, 207; Gautier et al., 2013a, Molecular Ecology, 3766-3779). Analysis of the data and diagnosis of problems is easier and more reliable (and can be less costly) with individually barcoded samples. Consequently, for some scenarios, individual barcoding may be preferable to pooling of samples.
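The effect described in the abstract, where limited representation of gene copies in a pool inflates the number of loci that appear alternatively fixed, can be illustrated with a small calculation. The sketch below is not the authors' CMD/Ewens model. It is a deliberately simpler binomial version that assumes each pool's reads are drawn with replacement from `k` gene copies that contribute equally, and that both pools come from one population with allele frequency `p`. The function names and parameters are hypothetical, chosen only for illustration.

```python
from math import comb


def prob_pool_fixed(p, k, depth):
    """Probability that all `depth` reads in one pool show allele A.

    Simplifying assumptions (not from the paper): the pool effectively
    contains k gene copies, each independently A with probability p, and
    every read is drawn uniformly with replacement from those k copies.
    """
    total = 0.0
    for j in range(k + 1):
        # Binomial probability that exactly j of the k copies carry allele A.
        weight = comb(k, j) * p**j * (1 - p) ** (k - j)
        # Given j copies of A, all reads are A with probability (j/k)^depth.
        total += weight * (j / k) ** depth
    return total


def prob_alt_fixed(p, k, depth):
    """Probability that two pools from the same population appear fixed
    for alternate alleles (one shows only A, the other only B)."""
    fixed_a = prob_pool_fixed(p, k, depth)
    # By symmetry, "all reads B" is "all reads A" with frequency 1 - p.
    fixed_b = prob_pool_fixed(1 - p, k, depth)
    # Either pool can be the one fixed for A, hence the factor of 2.
    return 2 * fixed_a * fixed_b


if __name__ == "__main__":
    # Same population (p = 0.5), same read depth, different numbers of
    # gene copies effectively represented in each pool.
    for k in (2, 10, 50):
        print(k, prob_alt_fixed(0.5, k, depth=20))
```

Under these assumptions, the probability of apparent alternate fixation falls quickly as more gene copies are represented in each pool, even though there is no population structure at all. A Dirichlet-compound multinomial treatment would additionally allow copies to contribute unequally to the reads, which this sketch does not attempt.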


          Author and article information

          Journal
          Molecular Ecology (Mol. Ecol.)
          ISSN: 1365-294X, 0962-1083
          Feb 2014, Volume 23, Issue 3
          Affiliations
          [1] Fisheries Ecology Division, Southwest Fisheries Science Center, National Marine Fisheries Service, NOAA, 110 Shaffer Road, Santa Cruz, CA, 95060, USA; Department of Applied Math and Statistics (SOE2), University of California, 1156 High Street, Santa Cruz, CA, 95064, USA.
          Article
          DOI: 10.1111/mec.12609
          PMID: 24304095
          © 2013 John Wiley & Sons Ltd.

          Keywords: SNP discovery, compound multinomial distribution, outlier analysis, population divergence
