Reducing assembly complexity of microbial genomes with single-molecule sequencing

Abstract

Background

The short reads output by first- and second-generation DNA sequencing instruments cannot completely reconstruct microbial chromosomes. Therefore, most genomes have been left unfinished due to the significant resources required to manually close gaps in draft assemblies. Third-generation, single-molecule sequencing addresses this problem by greatly increasing sequencing read length, which simplifies the assembly problem.

Results

To measure the benefit of single-molecule sequencing on microbial genome assembly, we sequenced and assembled the genomes of six bacteria and analyzed the repeat complexity of 2,267 complete bacterial and archaeal genomes. Our results indicate that the majority of known bacterial and archaeal genomes can be assembled without gaps, at finished-grade quality, using a single PacBio RS sequencing library. These single-library assemblies are also more accurate than typical short-read assemblies and hybrid assemblies of short and long reads.

Conclusions

Automated assembly of long, single-molecule sequencing data reduces the cost of microbial finishing to $1,000 for most genomes, and future advances in this technology are expected to drive the cost lower. This is expected to increase the number of completed genomes, improve the quality of microbial genome databases, and enable high-fidelity, population-scale studies of pan-genomes and chromosomal organization.

Author and article information

Contributors

Sergey Koren

Gregory P Harhay

Timothy PL Smith

James L Bono

Dayna M Harhay

Scott D McVey

Diana Radune

Nicholas H Bergman

Adam M Phillippy

Journal

Journal ID (nlm-ta): Genome Biol

Journal ID (iso-abbrev): Genome Biol

Title: Genome Biology

Publisher: BioMed Central

ISSN (Print): 1465-6906

ISSN (Electronic): 1465-6914

Publication date (Print): 2013

Publication date (Electronic): 13 September 2013

Volume: 14

Issue: 9

Page: R101

Affiliations

[1] National Biodefense Analysis and Countermeasures Center, 110 Thomas Johnson Drive, Frederick, MD 21702, USA

[2] USDA, ARS, Meat Animal Research Center, Clay Center, NE 68933, USA

[3] USDA, ARS, Center for Grain and Animal Health Research, Manhattan, KS 66502, USA

Article

Publisher ID: gb-2013-14-9-r101

DOI: 10.1186/gb-2013-14-9-r101

PMC ID: 4053942

PubMed ID: 24034426

SO-VID: dfc77b45-6b80-4776-acbc-4096a1234d29

License:

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

History

Date received: 15 April 2013

Date accepted: 13 September 2013
