Aron Marchler-Bauer , John B. Anderson , Praveen F. Cherukuri , Carol DeWeese-Scott , Lewis Y. Geer , Marc Gwadz , Siqian He , David I. Hurwitz , John D. Jackson , Zhaoxi Ke , Christopher J. Lanczycki , Cynthia A. Liebert , Chunlei Liu , Fu Lu , Gabriele H. Marchler , Mikhail Mullokandov , Benjamin A. Shoemaker , Vahan Simonyan , James S. Song , Paul A. Thiessen , Roxanne A. Yamashita , Jodie J. Yin , Dachuan Zhang , Stephen H. Bryant
17 December 2004
The Conserved Domain Database (CDD) is the protein classification component of NCBI's Entrez query and retrieval system. CDD is linked to other Entrez databases such as Proteins, Taxonomy and PubMed®, and can be accessed at http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=cdd. CD-Search, which is available at http://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi, is a fast, interactive tool to identify conserved domains in new protein sequences. CD-Search results for protein sequences in Entrez are pre-computed to provide links between proteins and domain models, and computational annotation visible upon request. Protein–protein queries submitted to NCBI's BLAST search service at http://www.ncbi.nlm.nih.gov/BLAST are scanned for the presence of conserved domains by default. While CDD started out as essentially a mirror of publicly available domain alignment collections, such as SMART, Pfam and COG, we have continued an effort to update, and in some cases replace these models with domain hierarchies curated at the NCBI. Here, we report on the progress of the curation effort and associated improvements in the functionality of the CDD information retrieval system.