I-Min A. Chen 1 , * , Victor M. Markowitz 1 , Ken Chu 1 , Krishna Palaniappan 1 , Ernest Szeto 1 , Manoj Pillay 1 , Anna Ratner 1 , Jinghua Huang 1 , Evan Andersen 1 , Marcel Huntemann 2 , Neha Varghese 2 , Michalis Hadjithomas 2 , Kristin Tennessen 2 , Torben Nielsen 2 , Natalia N. Ivanova 2 , Nikos C. Kyrpides 2 , *
13 October 2016
The Integrated Microbial Genomes with Microbiome Samples (IMG/M: https://img.jgi.doe.gov/m/) system contains annotated DNA and RNA sequence data of (i) archaeal, bacterial, eukaryotic and viral genomes from cultured organisms, (ii) single cell genomes (SCG) and genomes from metagenomes (GFM) from uncultured archaea, bacteria and viruses and (iii) metagenomes from environmental, host associated and engineered microbiome samples. Sequence data are generated by DOE's Joint Genome Institute (JGI), submitted by individual scientists, or collected from public sequence data archives. Structural and functional annotation is carried out by JGI's genome and metagenome annotation pipelines. A variety of analytical and visualization tools provide support for examining and comparing IMG/M's datasets. IMG/M allows open access interactive analysis of publicly available datasets, while manual curation, submission and access to private datasets and computationally intensive workspace-based analysis require login/password access to its expert review (ER) companion system (IMG/M ER: https://img.jgi.doe.gov/mer/). Since the last report published in the 2014 NAR Database Issue, IMG/M's dataset content has tripled in terms of number of datasets and overall protein coding genes, while its analysis tools have been extended to cope with the rapid growth in the number and size of datasets handled by the system.