There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

GenBank is a comprehensive database that contains publicly available DNA sequences for more than 165,000 named organisms, obtained primarily through submissions from individual laboratories and batch submissions from large-scale sequencing projects. Most submissions are made using the web-based BankIt or standalone Sequin programs and accession numbers are assigned by GenBank staff upon receipt. Daily data exchange with the EMBL Data Library in the UK and the DNA Data Bank of Japan helps to ensure worldwide coverage. GenBank is accessible through NCBI's retrieval system, Entrez, which integrates data from the major DNA and protein sequence databases along with taxonomy, genome, mapping, protein structure and domain information, and the biomedical journal literature via PubMed. BLAST provides sequence similarity searches of GenBank and other sequence databases. Complete bimonthly releases and daily updates of the GenBank database are available by FTP. To access GenBank and its related retrieval and analysis services, go to the NCBI Homepage at http://www.ncbi.nlm.nih.gov.

Related collections

Most cited references 10

Record: found
Abstract: not found
Article: not found

dbEST--database for "expressed sequence tags".

M S Boguski, T. M. Lowe, C Tolstoshev (1993)

0 comments Cited 325 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

CDD: a Conserved Domain Database for protein classification

Aron Marchler-Bauer, John B. Anderson, Praveen F Cherukuri … (2004)

The Conserved Domain Database (CDD) is the protein classification component of NCBI's Entrez query and retrieval system. CDD is linked to other Entrez databases such as Proteins, Taxonomy and PubMed®, and can be accessed at http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=cdd. CD-Search, which is available at http://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi, is a fast, interactive tool to identify conserved domains in new protein sequences. CD-Search results for protein sequences in Entrez are pre-computed to provide links between proteins and domain models, and computational annotation visible upon request. Protein–protein queries submitted to NCBI's BLAST search service at http://www.ncbi.nlm.nih.gov/BLAST are scanned for the presence of conserved domains by default. While CDD started out as essentially a mirror of publicly available domain alignment collections, such as SMART, Pfam and COG, we have continued an effort to update, and in some cases replace these models with domain hierarchies curated at the NCBI. Here, we report on the progress of the curation effort and associated improvements in the functionality of the CDD information retrieval system.

0 comments Cited 254 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

Database resources of the National Center for Biotechnology Information

David Wheeler, Tanya Barrett, Dennis A Benson … (2004)

In addition to maintaining the GenBank(R) nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides data retrieval systems and computational resources for the analysis of data in GenBank and other biological data made available through NCBI's website. NCBI resources include Entrez, Entrez Programming Utilities, PubMed, PubMed Central, Entrez Gene, the NCBI Taxonomy Browser, BLAST, BLAST Link (BLink), Electronic PCR, OrfFinder, Spidey, RefSeq, UniGene, HomoloGene, ProtEST, dbMHC, dbSNP, Cancer Chromosomes, Entrez Genomes and related tools, the Map Viewer, Model Maker, Evidence Viewer, Clusters of Orthologous Groups (COGs), Retroviral Genotyping Tools, HIV-1/Human Protein Interaction Database, SAGEmap, Gene Expression Omnibus (GEO), Online Mendelian Inheritance in Man (OMIM), the Molecular Modeling Database (MMDB), the Conserved Domain Database (CDD) and the Conserved Domain Architecture Retrieval Tool (CDART). Augmenting many of the Web applications are custom implementations of the BLAST program optimized to search specialized datasets. All of the resources can be accessed through the NCBI home page at http://www.ncbi.nlm.nih.gov.

0 comments Cited 157 times – based on 0 reviews      Review now

Bookmark

All references

Author and article information

Journal

PubMed ID:: 15608212

PMC ID:: 540017

DOI:: 10.1093/nar/gki063

ScienceOpen disciplines: Chemistry

Keywords: Animals,Base Sequence,DNA,chemistry,classification,Databases, Nucleic Acid,Humans,Internet

Data availability:

ScienceOpen disciplines: Chemistry

Keywords: Animals, Base Sequence, DNA, chemistry, classification, Databases, Nucleic Acid, Humans, Internet

GenBank.

Read this article at

Abstract

Related collections

EPA CompTox Chemicals Dashboard

Most cited references 10

dbEST--database for "expressed sequence tags".

CDD: a Conserved Domain Database for protein classification

Database resources of the National Center for Biotechnology Information

Author and article information

Journal

Comments

Comment on this article

Similar content 4

Cited by 323

Most referenced authors 1,282