WormBase 2016: expanding to enable helminth genomic research

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

WormBase ( www.wormbase.org) is a central repository for research data on the biology, genetics and genomics of Caenorhabditis elegans and other nematodes. The project has evolved from its original remit to collect and integrate all data for a single species, and now extends to numerous nematodes, ranging from evolutionary comparators of C. elegans to parasitic species that threaten plant, animal and human health. Research activity using C. elegans as a model system is as vibrant as ever, and we have created new tools for community curation in response to the ever-increasing volume and complexity of data. To better allow users to navigate their way through these data, we have made a number of improvements to our main website, including new tools for browsing genomic features and ontology annotations. Finally, we have developed a new portal for parasitic worm genomes. WormBase ParaSite ( parasite.wormbase.org) contains all publicly available nematode and platyhelminth annotated genome sequences, and is designed specifically to support helminth genomic research.

Related collections

Most cited references 23

Record: found
Abstract: found
Article: found

Is Open Access

Ensembl BioMarts: a hub for data retrieval across taxonomic space

Rhoda J. Kinsella, Andreas Kähäri, Syed Haider … (2011)

For a number of years the BioMart data warehousing system has proven to be a valuable resource for scientists seeking a fast and versatile means of accessing the growing volume of genomic data provided by the Ensembl project. The launch of the Ensembl Genomes project in 2009 complemented the Ensembl project by utilizing the same visualization, interactive and programming tools to provide users with a means for accessing genome data from a further five domains: protists, bacteria, metazoa, plants and fungi. The Ensembl and Ensembl Genomes BioMarts provide a point of access to the high-quality gene annotation, variation data, functional and regulatory annotation and evolutionary relationships from genomes spanning the taxonomic space. This article aims to give a comprehensive overview of the Ensembl and Ensembl Genomes BioMarts as well as some useful examples and a description of current data content and future objectives. Database URLs: http://www.ensembl.org/biomart/martview/; http://metazoa.ensembl.org/biomart/martview/; http://plants.ensembl.org/biomart/martview/; http://protists.ensembl.org/biomart/martview/; http://fungi.ensembl.org/biomart/martview/; http://bacteria.ensembl.org/biomart/martview/

0 comments Cited 596 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

The generic genome browser: a building block for a model organism system database.

Lincoln D. Stein, Christopher John Mungall, ShengQiang Shu … (2002)

The Generic Model Organism System Database Project (GMOD) seeks to develop reusable software components for model organism system databases. In this paper we describe the Generic Genome Browser (GBrowse), a Web-based application for displaying genomic annotations and other features. For the end user, features of the browser include the ability to scroll and zoom through arbitrary regions of a genome, to enter a region of the genome by searching for a landmark or performing a full text search of all features, and the ability to enable and disable tracks and change their relative order and appearance. The user can upload private annotations to view them in the context of the public ones, and publish those annotations to the community. For the data provider, features of the browser software include reliance on readily available open source components, simple installation, flexible configuration, and easy integration with other components of a model organism system Web site. GBrowse is freely available under an open source license. The software, its documentation, and support are available at http://www.gmod.org.

0 comments Cited 533 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

Gene prediction in novel fungal genomes using an ab initio algorithm with unsupervised training.

Vardges Ter-Hovhannisyan, Alexandre Lomsadze, Yury Chernoff … (2008)

We describe a new ab initio algorithm, GeneMark-ES version 2, that identifies protein-coding genes in fungal genomes. The algorithm does not require a predetermined training set to estimate parameters of the underlying hidden Markov model (HMM). Instead, the anonymous genomic sequence in question is used as an input for iterative unsupervised training. The algorithm extends our previously developed method tested on genomes of Arabidopsis thaliana, Caenorhabditis elegans, and Drosophila melanogaster. To better reflect features of fungal gene organization, we enhanced the intron submodel to accommodate sequences with and without branch point sites. This design enables the algorithm to work equally well for species with the kinds of variations in splicing mechanisms seen in the fungal phyla Ascomycota, Basidiomycota, and Zygomycota. Upon self-training, the intron submodel switches on in several steps to reach its full complexity. We demonstrate that the algorithm accuracy, both at the exon and the whole gene level, is favorably compared to the accuracy of gene finders that employ supervised training. Application of the new method to known fungal genomes indicates substantial improvement over existing annotations. By eliminating the effort necessary to build comprehensive training sets, the new algorithm can streamline and accelerate the process of annotation in a large number of fungal genome sequencing projects.

0 comments Cited 433 times – based on 0 reviews      Review now

Bookmark

All references

Author and article information

Journal

Journal ID (nlm-ta): Nucleic Acids Res

Journal ID (iso-abbrev): Nucleic Acids Res

Journal ID (hwp): nar

Journal ID (publisher-id): nar

Title: Nucleic Acids Research

Publisher: Oxford University Press

ISSN (Print): 0305-1048

ISSN (Electronic): 1362-4962

Publication date (Print): 04 January 2016

Publication date (Electronic): 17 November 2015

Publication date PMC-release: 17 November 2015

Volume: 44

Issue: Database issue , Database issue

Pages: D774-D780

Affiliations

[1 ]European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK

[2 ]Informatics and Bio-computing Platform, Ontario Institute for Cancer Research, Toronto, ON M5G0A3, Canada

[3 ]Division of Biology and Biological Engineering 156–29, California Institute of Technology, Pasadena, CA 91125, USA

[4 ]Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK

[5 ]Department of Genetics, Washington University School of Medicine, St. Louis, MO 63110, USA

[6 ]Howard Hughes Medical Institute, California Institute of Technology, Pasadena, CA 91125, USA

Author notes

[* ]To whom correspondence should be addressed. Tel: +44 1223 494417; Fax: +44 1223 494 468; Email: kevin.howe@ 123456wormbase.org

Author information

Kevin L. Howe http://orcid.org/0000-0002-1751-9226

Bruce J. Bolt http://orcid.org/0000-0002-6536-0950

Paul Kersey http://orcid.org/0000-0002-7054-800X

Article

DOI: 10.1093/nar/gkv1217

PMC ID: 4702863

PubMed ID: 26578572

SO-VID: 33abb4ab-489e-48d0-9953-fa9df3d3d241

License:

This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.

History

Date accepted : 28 October 2015

Date revision received : 26 October 2015

Date received : 30 September 2015

Page count

Pages: 7

Custom metadata

cover-date 04 January 2016

ScienceOpen disciplines: Genetics

Data availability:

ScienceOpen disciplines: Genetics

WormBase 2016: expanding to enable helminth genomic research

Read this article at

Abstract

Related collections

Genomic Prediction

Most cited references 23

Ensembl BioMarts: a hub for data retrieval across taxonomic space

The generic genome browser: a building block for a model organism system database.

Gene prediction in novel fungal genomes using an ab initio algorithm with unsupervised training.

Author and article information

Journal

Affiliations

Author notes

Author information

Article

History

Page count

Categories

Custom metadata

Comments

Comment on this article

Similar content 371

Cited by 167

Most referenced authors 1,505