VizBin - an application for reference-independent visualization and human-augmented binning of metagenomic data

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

Background

Metagenomics is limited in its ability to link distinct microbial populations to genetic potential due to a current lack of representative isolate genome sequences. Reference-independent approaches, which exploit for example inherent genomic signatures for the clustering of metagenomic fragments (binning), offer the prospect to resolve and reconstruct population-level genomic complements without the need for prior knowledge.

Results

We present VizBin, a Java™-based application which offers efficient and intuitive reference-independent visualization of metagenomic datasets from single samples for subsequent human-in-the-loop inspection and binning. The method is based on nonlinear dimension reduction of genomic signatures and exploits the superior pattern recognition capabilities of the human eye-brain system for cluster identification and delineation. We demonstrate the general applicability of VizBin for the analysis of metagenomic sequence data by presenting results from two cellulolytic microbial communities and one human-borne microbial consortium. The superior performance of our application compared to other analogous metagenomic visualization and binning methods is also presented.

Conclusions

VizBin can be applied de novo for the visualization and subsequent binning of metagenomic datasets from single samples, and it can be used for the post hoc inspection and refinement of automatically generated bins. Due to its computational efficiency, it can be run on common desktop machines and enables the analysis of complex metagenomic datasets in a matter of minutes. The software implementation is available at https://claczny.github.io/VizBin under the BSD License (four-clause) and runs under Microsoft Windows™, Apple Mac OS X™ (10.7 to 10.10), and Linux.

Electronic supplementary material

The online version of this article (doi:10.1186/s40168-014-0066-1) contains supplementary material, which is available to authorized users.

Related collections

Most cited references 12

Record: found
Abstract: found
Article: not found

Community structure and metabolism through reconstruction of microbial genomes from the environment.

Gene W. Tyson, Jarrod Chapman, Philip Hugenholtz … (2004)

Microbial communities are vital in the functioning of all ecosystems; however, most microorganisms are uncultivated, and their roles in natural systems are unclear. Here, using random shotgun sequencing of DNA from a natural acidophilic biofilm, we report reconstruction of near-complete genomes of Leptospirillum group II and Ferroplasma type II, and partial recovery of three other genomes. This was possible because the biofilm was dominated by a small number of species populations and the frequency of genomic rearrangements and gene insertions or deletions was relatively low. Because each sequence read came from a different individual, we could determine that single-nucleotide polymorphisms are the predominant form of heterogeneity at the strain level. The Leptospirillum group II genome had remarkably few nucleotide polymorphisms, despite the existence of low-abundance variants. The Ferroplasma type II genome seems to be a composite from three ancestral strains that have undergone homologous recombination to form a large population of mosaic genomes. Analysis of the gene complement for each organism revealed the pathways for carbon and nitrogen fixation and energy generation, and provided insights into survival strategies in an extreme environment.

0 comments Cited 668 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

Fermentation, hydrogen, and sulfur metabolism in multiple uncultivated bacterial phyla.

Kelly Wrighton, Brian C Thomas, Itai Sharon … (2012)

BD1-5, OP11, and OD1 bacteria have been widely detected in anaerobic environments, but their metabolisms remain unclear owing to lack of cultivated representatives and minimal genomic sampling. We uncovered metabolic characteristics for members of these phyla, and a new lineage, PER, via cultivation-independent recovery of 49 partial to near-complete genomes from an acetate-amended aquifer. All organisms were nonrespiring anaerobes predicted to ferment. Three augment fermentation with archaeal-like hybrid type II/III ribulose-1,5-bisphosphate carboxylase-oxygenase (RuBisCO) that couples adenosine monophosphate salvage with CO(2) fixation, a pathway not previously described in Bacteria. Members of OD1 reduce sulfur and may pump protons using archaeal-type hydrogenases. For six organisms, the UGA stop codon is translated as tryptophan. All bacteria studied here may play previously unrecognized roles in hydrogen production, sulfur cycling, and fermentation of refractory sedimentary carbon.

0 comments Cited 298 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: found

Is Open Access

Genomic insights to SAR86, an abundant and uncultivated marine bacterial lineage

Chris Dupont, Douglas B. Rusch, Shibu Yooseph … (2011)

Bacteria in the 16S rRNA clade SAR86 are among the most abundant uncultivated constituents of microbial assemblages in the surface ocean for which little genomic information is currently available. Bioinformatic techniques were used to assemble two nearly complete genomes from marine metagenomes and single-cell sequencing provided two more partial genomes. Recruitment of metagenomic data shows that these SAR86 genomes substantially increase our knowledge of non-photosynthetic bacteria in the surface ocean. Phylogenomic analyses establish SAR86 as a basal and divergent lineage of γ-proteobacteria, and the individual genomes display a temperature-dependent distribution. Modestly sized at 1.25–1.7 Mbp, the SAR86 genomes lack several pathways for amino-acid and vitamin synthesis as well as sulfate reduction, trends commonly observed in other abundant marine microbes. SAR86 appears to be an aerobic chemoheterotroph with the potential for proteorhodopsin-based ATP generation, though the apparent lack of a retinal biosynthesis pathway may require it to scavenge exogenously-derived pigments to utilize proteorhodopsin. The genomes contain an expanded capacity for the degradation of lipids and carbohydrates acquired using a wealth of tonB-dependent outer membrane receptors. Like the abundant planktonic marine bacterial clade SAR11, SAR86 exhibits metabolic streamlining, but also a distinct carbon compound specialization, possibly avoiding competition.

0 comments Cited 262 times – based on 0 reviews      Review now

Bookmark

All references

Author and article information

Contributors

Cedric C Laczny: cedric.laczny@uni.lu

Tomasz Sternal: sternal.tomasz@gmail.com

Valentin Plugaru: valentin.plugaru@gmail.com

Piotr Gawron: piotr.gawron@uni.lu

Arash Atashpendar: arash.atashpendar.001@student.uni.lu

Houry Hera Margossian: houry.margossian.001@student.uni.lu

Sergio Coronado: sergio.coronado@ext.uni.lu

Laurens van der Maaten: lvdmaaten@gmail.com

Nikos Vlassis: nikos.vlassis@gmail.com

Paul Wilmes: paul.wilmes@uni.lu

Journal

Journal ID (nlm-ta): Microbiome

Journal ID (iso-abbrev): Microbiome

Title: Microbiome

Publisher: BioMed Central (London )

ISSN (Electronic): 2049-2618

Publication date (Electronic): 20 January 2015

Publication date PMC-release: 20 January 2015

Publication date Collection: 2015

Volume: 3

Issue: 1

Electronic Location Identifier: 1

Affiliations

[ ]Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, 4362 Luxembourg

[ ]Institute of Computing Science, Poznan University of Technology, Poznan, 60-965 Poland

[ ]Computer Science and Communications Research Unit, University of Luxembourg, Luxembourg, 1359 Luxembourg

[ ]Pattern Recognition and Bioinformatics Group, Delft University of Technology, CD Delft, 2628 Netherlands

[ ]Adobe Research, Adobe, San Jose, 95110 USA

Article

Publisher ID: 66

DOI: 10.1186/s40168-014-0066-1

PMC ID: 4305225

PubMed ID: 25621171

SO-VID: 8f47bf91-ad9f-4d47-9cc1-c2e7f0c51aea

License:

This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver ( http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

History

Date received : 15 September 2014

Date accepted : 18 December 2014

Custom metadata

Keywords: metagenomics,machine learning,visualization,binning

Data availability:

Keywords: metagenomics, machine learning, visualization, binning

Comments

Comment on this article

scite_

Cited by 152

See all cited by

Most referenced authors 1,642

See all reference authors

- Version 1
- Version 1

VizBin - an application for reference-independent visualization and human-augmented binning of metagenomic data

Read this article at

Abstract

Background

Results

Conclusions

Electronic supplementary material

Related collections

Data-Driven Civil Engineering

Most cited references 12

Community structure and metabolism through reconstruction of microbial genomes from the environment.

Fermentation, hydrogen, and sulfur metabolism in multiple uncultivated bacterial phyla.

Genomic insights to SAR86, an abundant and uncultivated marine bacterial lineage

Author and article information

Contributors

Journal

Affiliations

Article

History

Categories

Custom metadata

Comments

Comment on this article

Similar content 113

Cited by 152

Most referenced authors 1,642