Automated high throughput animal CO1 metabarcode classification

Abstract

We introduce a method for assigning names to CO1 metabarcode sequences with confidence scores in a rapid, high-throughput manner. We compiled nearly 1 million CO1 barcode sequences appropriate for classifying arthropods and chordates. Compared to our previous Insecta classifier, the current classifier has more than three times the taxonomic coverage, including outgroups, and is based on almost five times as many reference sequences. We show that, unlike the case for other popular rDNA metabarcoding markers, classification performance is similar across the length of the CO1 barcoding region. We show that the RDP classifier can make taxonomic assignments about 19 times faster than the popular top BLAST hit method and reduce the false positive rate from nearly 100% to 34%. This is especially important in large-scale biodiversity and biomonitoring studies, where datasets can become very large and the taxonomic assignment problem is not trivial. We also show that reference databases are becoming more representative of current species diversity but that gaps still exist. We suggest that it would benefit the field as a whole if all investigators involved in metabarcoding studies, through collaborations with taxonomic experts, also planned to barcode representatives of their local biota as a part of their projects.
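To make the idea of a name assignment with a confidence score concrete, the following is a minimal sketch of a naive Bayesian k-mer classifier with bootstrap support, the general style of approach the RDP classifier is known for. It is not the RDP classifier's code and not the authors' pipeline. The function names, the toy sequences, the value of k, the pseudocount scheme and the one-eighth subsampling fraction are all assumptions chosen for illustration.

```python
import math
import random
from collections import Counter, defaultdict


def kmers(seq, k):
    """Return the set of overlapping k-mers (words) in a sequence."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def train(references, k):
    """Build word-count tables from a list of (taxon, sequence) pairs."""
    taxon_counts = Counter()              # reference sequences per taxon
    word_in_taxon = defaultdict(Counter)  # taxon -> word -> sequences containing it
    word_total = Counter()                # word -> sequences containing it, overall
    for taxon, seq in references:
        taxon_counts[taxon] += 1
        for w in kmers(seq, k):
            word_in_taxon[taxon][w] += 1
            word_total[w] += 1
    return {"k": k, "n": len(references), "taxon_counts": taxon_counts,
            "word_in_taxon": word_in_taxon, "word_total": word_total}


def best_taxon(model, words):
    """Return the taxon with the highest summed log word probability."""
    n = model["n"]
    best, best_score = None, -math.inf
    for taxon in sorted(model["taxon_counts"]):
        m = model["taxon_counts"][taxon]
        score = 0.0
        for w in words:
            # Assumed smoothing: a word prior acts as a pseudocount.
            prior = (model["word_total"][w] + 0.5) / (n + 1)
            score += math.log((model["word_in_taxon"][taxon][w] + prior) / (m + 1))
        if score > best_score:
            best, best_score = taxon, score
    return best


def classify(model, query, n_boot=100, seed=0):
    """Assign a taxon and report the fraction of bootstrap trials that agree."""
    rng = random.Random(seed)
    words = sorted(kmers(query, model["k"]))
    if not words:
        raise ValueError("query is shorter than k")
    assignment = best_taxon(model, words)
    subset_size = max(1, len(words) // 8)  # assumed subsampling fraction
    hits = 0
    for _ in range(n_boot):
        sample = [rng.choice(words) for _ in range(subset_size)]
        if best_taxon(model, sample) == assignment:
            hits += 1
    return assignment, hits / n_boot


if __name__ == "__main__":
    # Hypothetical toy references, not real CO1 barcodes.
    refs = [("Insecta", "ACGTACGTACGTTTGACCA"),
            ("Chordata", "GGGCCCTTTAAAGGGCCCTA")]
    model = train(refs, k=4)
    print(classify(model, "ACGTACGTACGTTTGACCA"))
```

The returned confidence is the share of resampled word subsets that reproduce the full-sequence assignment. A top BLAST hit, by contrast, always returns a name and gives no comparable signal for discarding weak assignments.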

Author and article information

Contributors

Teresita M. Porter:

ORCID: http://orcid.org/0000-0002-0227-6874

terrimporter@gmail.com

Journal

Journal ID (nlm-ta): Sci Rep

Journal ID (iso-abbrev): Sci Rep

Title: Scientific Reports

Publisher: Nature Publishing Group UK (London)

ISSN (Electronic): 2045-2322

Publication date (Electronic): 9 March 2018

Publication date PMC-release: 9 March 2018

Publication date Collection: 2018

Volume: 8

Electronic Location Identifier: 4226

Affiliations

[1] The Centre for Biodiversity Genomics & Department of Integrative Biology, University of Guelph, 50 Stone Road East, Guelph, ON N1G 2W1, Canada (ISNI 0000 0004 1936 8198, GRID grid.34429.38)

[2] Great Lakes Forestry Centre, Natural Resources Canada, 1219 Queen Street East, Sault Ste. Marie, ON P6A 2E5, Canada (ISNI 0000 0001 2295 5236, GRID grid.202033.0)

Article

Publisher ID: 22505

DOI: 10.1038/s41598-018-22505-4

PMC ID: 5844909

PubMed ID: 29523803

SO-VID: 0c3bccc1-a6b8-4c19-bfc7-d48c747c9eeb

License:

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

History

Date received: 16 November 2017

Date accepted: 1 February 2018

Custom metadata

ScienceOpen disciplines: Uncategorized
