| Literature DB >> 22139939 |
Harinder Singh1, Jagat Singh Chauhan, M Michael Gromiha, Gajendra P S Raghava.
Abstract
ccPDB (http://crdd.osdd.net/raghava/ccpdb/) is a database of data sets compiled from the literature and Protein Data Bank (PDB). First, we collected and compiled data sets from the literature used for developing bioinformatics methods to annotate the structure and function of proteins. Second, data sets were derived from the latest release of PDB using standard protocols. Third, we developed a powerful module for creating a wide range of customized data sets from the current release of PDB. This is a flexible module that allows users to create data sets using a simple six step procedure. In addition, a number of web services have been integrated in ccPDB, which include submission of jobs on PDB-based servers, annotation of protein structures and generation of patterns. This database maintains >30 types of data sets such as secondary structure, tight-turns, nucleotide interacting residues, metals interacting residues, DNA/RNA binding residues and so on.Entities:
Mesh:
Substances:
Year: 2011 PMID: 22139939 PMCID: PMC3245168 DOI: 10.1093/nar/gkr1150
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Brief description of major data sets created at ccPDB
| Type of data set | Description of data set | Software package |
|---|---|---|
| Secondary structure | Data sets related to secondary structure, helix, strand, coil, etc. | DSSP |
| Tight turns | Data sets created for variuos types of tight turns (e.g. β-turn). | PROMOTIF |
| Nucleotide interacting | Data sets created for small nucleotide and metal (e.g. ATP, GTP, Fe, Mg etc) binding residues. | LPC |
| DNA/RNA binding residues | Data sets of DNA and RNA binding proteins/residues. | PDIdb |
| Metal binding residues | Data set of metal binding proteins/residues (e.g. Zn, Ca interacting residues). | LPC |
Description of each process/step of data set creation module of ccPDB
| Process/step | Description of process | Example |
|---|---|---|
| Protein/chain | Allows users to extract PDB chains having desired structure or function. | Extract ATP binding protein chains from PDB. |
| General filters | Extract chains using various filters like resolution, experimental technique. | Protein chains having resolution better than 3 Å, solved by X-ray crystallography. |
| Combination of sets | Allows to combine two sets of data. | ATP binding protein chains having resolution better than 3 Å, solved by X-ray crystallography. |
| Extract sequences | Extract the amino acid sequences of PDB chains. | Extract sequences of ATP binding proteins. |
| Non-redundant data sets | Creation of non-redundant data sets using BLASTClust. | Generate non-redundant data set at 25% of ATP binding proteins. |
| Annotation of residues | Assigning structure/function of each residue in PDB chains. | Mapping of ATP interacting residues in ATP binding protein chains. |