Aron Marchler-Bauer, Chanjuan Zheng, Farideh Chitsaz, Myra K Derbyshire, Lewis Y Geer, Renata C Geer, Noreen R Gonzales, Marc Gwadz, David I Hurwitz, Christopher J Lanczycki, Fu Lu, Shennan Lu, Gabriele H Marchler, James S Song, Narmada Thanki, Roxanne A Yamashita, Dachuan Zhang, Stephen H Bryant.
Abstract
CDD, the Conserved Domain Database, is part of NCBI's Entrez query and retrieval system and is also accessible via http://www.ncbi.nlm.nih.gov/Structure/cdd/cdd.shtml. CDD annotates protein sequences with the locations of conserved domain footprints and of functional sites inferred from these footprints. Pre-computed annotation is available via Entrez, and interactive search services accept single protein or nucleotide queries, as well as batch submissions of protein query sequences, using RPS-BLAST to rapidly identify putative matches. CDD incorporates several protein domain and full-length protein model collections, and maintains an active curation effort that aims to provide fine-grained classifications for major and well-characterized protein domain families, as supported by available protein three-dimensional (3D) structures and the published literature. To date, the majority of protein 3D structures are represented by models tracked by CDD, and CDD curators are characterizing novel families that emerge from protein structure determination efforts.
Year: 2012 PMID: 23197659 PMCID: PMC3531192 DOI: 10.1093/nar/gks1243
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1. This histogram illustrates the distribution of protein 3D structures among conserved domain superfamilies. Although the majority of superfamilies cannot be linked to a 3D structure representative, about one quarter of those that can be linked have only a single representative 3D structure. Data prepared with NCBI FLink (http://www.ncbi.nlm.nih.gov/Structure/flink/flink.cgi).
Figure 2. CD-Search results for a nucleotide query sequence, the complete genome sequence of a Hepatitis B virus. Results were obtained for three different reading frames used for translation of the nucleotide query. Consequently, the display is split into three panels, which are labeled ‘RF +1’, ‘RF +2’ and ‘RF +3’.
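To make the reading-frame labels in Figure 2 concrete, the following minimal Python sketch translates a nucleotide sequence in the three forward frames and keys the results by the same labels. It is an illustration only, not CD-Search's implementation: it assumes the standard genetic code (NCBI translation table 1), and the function name `translate_forward_frames` is hypothetical.

```python
# Standard genetic code (NCBI translation table 1), with codons enumerated
# in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_forward_frames(nucleotides):
    """Translate a nucleotide string in the three forward reading frames.

    Hypothetical helper for illustration. Returns a dict keyed by
    'RF +1', 'RF +2' and 'RF +3'; stop codons are written as '*'.
    """
    seq = nucleotides.upper().replace("U", "T")
    frames = {}
    for offset in range(3):
        # Step through complete codons only; a trailing partial codon is dropped.
        codons = (seq[i:i + 3] for i in range(offset, len(seq) - 2, 3))
        # Codons with ambiguous bases (e.g. 'N') are translated as 'X'.
        frames[f"RF +{offset + 1}"] = "".join(CODON_TABLE.get(c, "X") for c in codons)
    return frames
```

Each of the three translated sequences can then be treated as an independent protein query, which is why a result display organized by reading frame needs one panel per frame.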
URLs and other resources associated with the CDD project
| Resource | Description | URL |
| --- | --- | --- |
| CDD | Database home page | |
| CDD help | CDD help documentation | |
| CDD FTP | CD models and alignments, pre-built search databases | |
| CD-Search | Live and pre-computed RPS-BLAST | |
| Batch CD-Search | Live and pre-computed RPS-BLAST | |
| CDART | Domain architecture viewer | |
| CDART FTP | Data summarizing conserved domain architectures | |
| CDTree/Cn3D | Domain hierarchy viewer and editor | |
| RPS-BLAST | Stand-alone tool for searching databases of profile models, part of the NCBI toolkit distribution | |
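As a rough illustration of how the stand-alone RPS-BLAST tool might be driven from a script, the Python sketch below builds an `rpsblast` command line and parses its tabular output. It assumes the BLAST+ `rpsblast` executable is installed and on the `PATH`, and that a pre-built profile database has been downloaded locally. The file paths, the default E-value cutoff, and the helper names (`build_rpsblast_command`, `parse_tabular_hits`, `run_rpsblast`) are hypothetical and are not taken from the article.

```python
import subprocess

# Default column order of BLAST+ tabular output (-outfmt 6).
OUTFMT6_COLUMNS = [
    "qseqid", "sseqid", "pident", "length", "mismatch", "gapopen",
    "qstart", "qend", "sstart", "send", "evalue", "bitscore",
]
INT_COLUMNS = ("length", "mismatch", "gapopen", "qstart", "qend", "sstart", "send")
FLOAT_COLUMNS = ("pident", "evalue", "bitscore")


def build_rpsblast_command(query_fasta, profile_db, evalue=0.01):
    """Return the argument list for an rpsblast run (hypothetical helper).

    The E-value cutoff is an illustrative assumption, not a recommendation.
    """
    return [
        "rpsblast",
        "-query", query_fasta,
        "-db", profile_db,
        "-evalue", str(evalue),
        "-outfmt", "6",
    ]


def parse_tabular_hits(text):
    """Parse -outfmt 6 text into a list of dicts with typed numeric fields."""
    hits = []
    for line in text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines and comment lines
        hit = dict(zip(OUTFMT6_COLUMNS, line.split("\t")))
        for key in INT_COLUMNS:
            hit[key] = int(hit[key])
        for key in FLOAT_COLUMNS:
            hit[key] = float(hit[key])
        hits.append(hit)
    return hits


def run_rpsblast(query_fasta, profile_db, evalue=0.01):
    """Run rpsblast and return parsed hits; requires a local BLAST+ install."""
    command = build_rpsblast_command(query_fasta, profile_db, evalue)
    result = subprocess.run(command, capture_output=True, text=True, check=True)
    return parse_tabular_hits(result.stdout)
```

A call such as `run_rpsblast("proteins.fasta", "path/to/Cdd")` would return one dictionary per domain hit, with `qstart` and `qend` giving the footprint of the matched model on the query sequence; both arguments here are placeholder paths.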