32
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Extracting abundance information from DNA‐based data

      research-article

      Read this article at

      ScienceOpenPublisherPMC
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          The accurate extraction of species‐abundance information from DNA‐based data (metabarcoding, metagenomics) could contribute usefully to diet analysis and food‐web reconstruction, the inference of species interactions, the modelling of population dynamics and species distributions, the biomonitoring of environmental state and change, and the inference of false positives and negatives. However, multiple sources of bias and noise in sampling and processing combine to inject error into DNA‐based data sets. To understand how to extract abundance information, it is useful to distinguish two concepts. (i) Within‐sample across‐species quantification describes relative species abundances in one sample. (ii) Across‐sample within‐species quantification describes how the abundance of each individual species varies from sample to sample, such as over a time series, an environmental gradient or different experimental treatments. First, we review the literature on methods to recover across‐species abundance information (by removing what we call “species pipeline biases”) and within‐species abundance information (by removing what we call “pipeline noise”). We argue that many ecological questions can be answered with just within‐species quantification, and we therefore demonstrate how to use a “DNA spike‐in” to correct for pipeline noise and recover within‐species abundance information. We also introduce a model‐based estimator that can be used on data sets without a physical spike‐in to approximate and correct for pipeline noise.

          Related collections

          Most cited references87

          • Record: found
          • Abstract: not found
          • Article: not found

          Fitting Linear Mixed-Effects Models Usinglme4

            • Record: found
            • Abstract: not found
            • Article: not found

            QIIME allows analysis of high-throughput community sequencing data.

              • Record: found
              • Abstract: found
              • Article: not found

              Search and clustering orders of magnitude faster than BLAST.

              Biological sequence data is accumulating rapidly, motivating the development of improved high-throughput methods for sequence classification. UBLAST and USEARCH are new algorithms enabling sensitive local and global search of large sequence databases at exceptionally high speeds. They are often orders of magnitude faster than BLAST in practical applications, though sensitivity to distant protein relationships is lower. UCLUST is a new clustering method that exploits USEARCH to assign sequences to clusters. UCLUST offers several advantages over the widely used program CD-HIT, including higher speed, lower memory use, improved sensitivity, clustering at lower identities and classification of much larger datasets. Binaries are available at no charge for non-commercial use at http://www.drive5.com/usearch.

                Author and article information

                Contributors
                dougwyu@mac.com
                Journal
                Mol Ecol Resour
                Mol Ecol Resour
                10.1111/(ISSN)1755-0998
                MEN
                Molecular Ecology Resources
                John Wiley and Sons Inc. (Hoboken )
                1755-098X
                1755-0998
                30 August 2022
                January 2023
                : 23
                : 1 ( doiID: 10.1111/men.v23.1 )
                : 174-189
                Affiliations
                [ 1 ] State Key Laboratory of Genetic Resources and Evolution and Yunnan Key Laboratory of Biodiversity and Ecological Security of Gaoligong Mountain Kunming Institute of Zoology, Chinese Academy of Sciences Kunming Yunnan China
                [ 2 ] Kunming College of Life Sciences University of Chinese Academy of Sciences Kunming Yunnan China
                [ 3 ] School of Mathematics and Statistics UNSW Sydney Sydney New South Wales Australia
                [ 4 ] Evolution and Ecology Research Centre, UNSW Sydney Sydney New South Wales Australia
                [ 5 ] Center for Excellence in Animal Evolution and Genetics Chinese Academy of Sciences Kunming Yunnan China
                [ 6 ] School of Biological Sciences University of East Anglia, Norwich Research Park Norwich Norfolk UK
                Author notes
                [*] [* ] Correspondence

                Douglas W. Yu, State Key Laboratory of Genetic Resources and Evolution and Yunnan Key Laboratory of Biodiversity and Ecological Security of Gaoligong Mountain, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, China.

                Email: dougwyu@ 123456mac.com

                Author information
                https://orcid.org/0000-0001-8551-5609
                Article
                MEN13703 MER-22-0072.R1
                10.1111/1755-0998.13703
                10087802
                35986714
                ee382e5f-6aa4-4f87-b362-6ec178dca78f
                © 2022 The Authors. Molecular Ecology Resources published by John Wiley & Sons Ltd.

                This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes.

                History
                : 31 July 2022
                : 22 February 2022
                : 16 August 2022
                Page count
                Figures: 5, Tables: 1, Pages: 16, Words: 12127
                Funding
                Funded by: Key Research Program of Frontier Sciences, CAS
                Award ID: QYZDY‐SSW‐SMC024
                Funded by: University of Chinese Academy of Sciences , doi 10.13039/501100011332;
                Funded by: Kunming Institute of Zoology , doi 10.13039/501100011191;
                Funded by: State Key Laboratory of Genetic Resources and Evolution , doi 10.13039/501100011239;
                Award ID: GREKF19‐01
                Award ID: GREKF20‐01
                Award ID: GREKF21‐01
                Funded by: University of East Anglia , doi 10.13039/501100000736;
                Funded by: Chinese Academy of Sciences , doi 10.13039/501100002367;
                Award ID: SMC024
                Funded by: idiv.de
                Award ID: sCom
                Funded by: Strategic Priority Research Program, Chinese Academy of Sciences
                Award ID: XDA20050202
                Categories
                Resource Article
                RESOURCE ARTICLES
                Molecular and Statistical Advances
                Custom metadata
                2.0
                January 2023
                Converter:WILEY_ML3GV2_TO_JATSPMC version:6.2.7 mode:remove_FC converted:11.04.2023

                Ecology
                arthropoda,biomonitoring,community composition,dna barcoding,environmental dna,insecta,internal standard,polymerase chain reaction,taxonomic bias

                Comments

                Comment on this article

                Related Documents Log