Successful Recovery of Nuclear Protein-Coding Genes from Small Insects in Museums Using Illumina Sequencing

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

In this paper we explore high-throughput Illumina sequencing of nuclear protein-coding, ribosomal, and mitochondrial genes in small, dried insects stored in natural history collections. We sequenced one tenebrionid beetle and 12 carabid beetles ranging in size from 3.7 to 9.7 mm in length that have been stored in various museums for 4 to 84 years. Although we chose a number of old, small specimens for which we expected low sequence recovery, we successfully recovered at least some low-copy nuclear protein-coding genes from all specimens. For example, in one 56-year-old beetle, 4.4 mm in length, our de novo assembly recovered about 63% of approximately 41,900 nucleotides in a target suite of 67 nuclear protein-coding gene fragments, and 70% using a reference-based assembly. Even in the least successfully sequenced carabid specimen, reference-based assembly yielded fragments that were at least 50% of the target length for 34 of 67 nuclear protein-coding gene fragments. Exploration of alternative references for reference-based assembly revealed few signs of bias created by the reference. For all specimens we recovered almost complete copies of ribosomal and mitochondrial genes. We verified the general accuracy of the sequences through comparisons with sequences obtained from PCR and Sanger sequencing, including of conspecific, fresh specimens, and through phylogenetic analysis that tested the placement of sequences in predicted regions. A few possible inaccuracies in the sequences were detected, but these rarely affected the phylogenetic placement of the samples. Although our sample sizes are low, an exploratory regression study suggests that the dominant factor in predicting success at recovering nuclear protein-coding genes is a high number of Illumina reads, with success at PCR of COI and killing by immersion in ethanol being secondary factors; in analyses of only high-read samples, the primary significant explanatory variable was body length, with small beetles being more successfully sequenced.

Related collections

Most cited references 42

Record: found
Abstract: found
Article: found

Is Open Access

Double indexing overcomes inaccuracies in multiplex sequencing on the Illumina platform

Martin Kircher, Susanna Sawyer, Matthias Meyer (2011)

Due to the increasing throughput of current DNA sequencing instruments, sample multiplexing is necessary for making economical use of available sequencing capacities. A widely used multiplexing strategy for the Illumina Genome Analyzer utilizes sample-specific indexes, which are embedded in one of the library adapters. However, this and similar multiplex approaches come with a risk of sample misidentification. By introducing indexes into both library adapters (double indexing), we have developed a method that reveals the rate of sample misidentification within current multiplex sequencing experiments. With ~0.3% these rates are orders of magnitude higher than expected and may severely confound applications in cancer genomics and other fields requiring accurate detection of rare variants. We identified the occurrence of mixed clusters on the flow as the predominant source of error. The accuracy of sample identification is further impaired if indexed oligonucleotides are cross-contaminated or if indexed libraries are amplified in bulk. Double-indexing eliminates these problems and increases both the scope and accuracy of multiplex sequencing on the Illumina platform.

0 comments Cited 504 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

The genome of the model beetle and pest Tribolium castaneum.

Stephen H. Richards, Richard A. Gibbs, George Weinstock … (2008)

Tribolium castaneum is a member of the most species-rich eukaryotic order, a powerful model organism for the study of generalized insect development, and an important pest of stored agricultural products. We describe its genome sequence here. This omnivorous beetle has evolved the ability to interact with a diverse chemical environment, as shown by large expansions in odorant and gustatory receptors, as well as P450 and other detoxification enzymes. Development in Tribolium is more representative of other insects than is Drosophila, a fact reflected in gene content and function. For example, Tribolium has retained more ancestral genes involved in cell-cell communication than Drosophila, some being expressed in the growth zone crucial for axial elongation in short-germ development. Systemic RNA interference in T. castaneum functions differently from that in Caenorhabditis elegans, but nevertheless offers similar power for the elucidation of gene function and identification of targets for selective insect control.

0 comments Cited 481 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

Next-generation transcriptome assembly.

Jeffrey A Martin, Zhong Wang (2011)

Transcriptomics studies often rely on partial reference transcriptomes that fail to capture the full catalogue of transcripts and their variations. Recent advances in sequencing technologies and assembly algorithms have facilitated the reconstruction of the entire transcriptome by deep RNA sequencing (RNA-seq), even without a reference genome. However, transcriptome assembly from billions of RNA-seq reads, which are often very short, poses a significant informatics challenge. This Review summarizes the recent developments in transcriptome assembly approaches - reference-based, de novo and combined strategies - along with some perspectives on transcriptome assembly in the near future.

0 comments Cited 472 times – based on 0 reviews      Review now

Bookmark

All references

Author and article information

Contributors

Patrick O'Grady: Role: Editor

Journal

Journal ID (nlm-ta): PLoS One

Journal ID (iso-abbrev): PLoS ONE

Journal ID (publisher-id): plos

Journal ID (pmc): plosone

Title: PLoS ONE

Publisher: Public Library of Science (San Francisco, CA USA )

ISSN (Electronic): 1932-6203

Publication date (Electronic): 30 December 2015

Publication date Collection: 2015

Volume: 10

Issue: 12

Electronic Location Identifier: e0143929

Affiliations

[1 ]Department of Integrative Biology, Oregon State University, Corvallis, Oregon, United States of America

[2 ]Center for Genome Research and Biocomputing, Oregon State University, Corvallis, Oregon, United States of America

University of California, Berkeley, UNITED STATES

Author notes

Competing Interests: The authors have declared that no competing interests exist.

Conceived and designed the experiments: KK JMP JSS MAD DRM. Performed the experiments: KK JMP JSS MAD DRM. Analyzed the data: KK JMP JSS DRM. Contributed reagents/materials/analysis tools: KK JMP JSS MAD DRM. Wrote the paper: KK JMP JSS DRM.

* E-mail: coniontis@ 123456gmail.com

Article

Publisher ID: PONE-D-15-39183

DOI: 10.1371/journal.pone.0143929

PMC ID: 4696846

PubMed ID: 26716693

SO-VID: 107567f3-f2d6-45ce-a88a-6f610d760215

License:

This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited

History

Date received : 8 September 2015

Date accepted : 12 October 2015

Page count

Figures: 16, Tables: 15, Pages: 53

Funding

This work was funded in part by the Harold E. and Leona M. Rice Endowment Fund at Oregon State University, as well as National Science Foundation grant DEB-1258220 to DRM.

Custom metadata

Data Availability Raw reads for all museum and reference specimens are submitted to NCBI Sequence Read Archive (accessions SRR2939013– SRR2939027). Focal gene fragments recovered from the de novo assembly of Lagriinae n. gen. and those that were newly sequenced for the phylogeny of Lagriinae are deposited in GenBank (accessions KU233685-KU234083). Focal gene fragments from PCR/Sanger sequencing and the IlluminaMerged sequences of carabids are also deposited in GenBank (accessions KU233685-KU234083). The Tribolium castaneum and Bembidion sp. nr transversale query sequences used to probe our museum specimens for the 67 nuclear protein-coding gene fragments and all alignments used in phylogenetic analyses (including the DeNovo, FarRef, and NearRef sequences), as well as trees from the phylogenetic tests, are deposited in Dryad (data available from the Dryad Digital Repository: http://doi.org/xx).

Successful Recovery of Nuclear Protein-Coding Genes from Small Insects in Museums Using Illumina Sequencing

Read this article at

Abstract

Related collections

PLOS Climate

Most cited references 42

Double indexing overcomes inaccuracies in multiplex sequencing on the Illumina platform

The genome of the model beetle and pest Tribolium castaneum.

Next-generation transcriptome assembly.

Author and article information

Contributors

Journal

Affiliations

Author notes

Article

History

Page count

Funding

Categories

Custom metadata

Comments

Comment on this article

Similar content 95

Cited by 25

Most referenced authors 1,646