| Literature DB >> 30689843 |
Piyush Agrawal1,2, Sumeet Patiyal2, Rajesh Kumar1,2, Vinod Kumar1,2, Harinder Singh3, Pawan Kumar Raghav2, Gajendra P S Raghava2.
Abstract
ccPDB 2.0 (http://webs.iiitd.edu.in/raghava/ccpdb) is an updated version of the manually curated database ccPDB that maintains datasets required for developing methods to predict the structure and function of proteins. The number of datasets compiled from literature increased from 45 to 141 in ccPDB 2.0. Similarly, the number of protein structures used for creating datasets also increased from ~74 000 to ~137 000 (PDB March 2018 release). ccPDB 2.0 provides the same web services and flexible tools which were present in the previous version of the database. In the updated version, links of the number of methods developed in the past few years have also been incorporated. This updated resource is built on responsive templates which is compatible with smartphones (mobile, iPhone, iPad, tablets etc.) and large screen gadgets. In summary, ccPDB 2.0 is a user-friendly web-based platform that provides comprehensive as well as updated information about datasets.Entities:
Mesh:
Substances:
Year: 2019 PMID: 30689843 PMCID: PMC6343045 DOI: 10.1093/database/bay142
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Figure 1Dataset types present in ccPDB 2.0.
Figure 2Schematic representation of steps of data set creation module of ccPDB 2.0.
Comparison of datasets compiled from literature and created using PDB at ccPDB and ccPDB 2.0
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|
| 1 | Secondary structure | Eight state | 5877 | 17 608 | 2 | 10 |
| Three state | 5877 | 17 608 | ||||
| 2 | Irregular secondary structure | Beta turn I | 6691 | 16 195 | 9 | 13 |
| Beta turn I’ | 5424 | 7070 | ||||
| Beta turn II | 2324 | 12 429 | ||||
| Beta turn II’ | 6618 | 5393 | ||||
| Beta turn IV | 3197 | 16 183 | ||||
| Beta turn VIa1 | 671 | 1397 | ||||
| Beta turn VIa2 | 215 | 406 | ||||
| Beta turn VIb | 1028 | 2350 | ||||
| Beta turn VIII | 4874 | 11 821 | ||||
| Gamma turn C | 1059 | 2889 | ||||
| Gamma turn I | 5833 | 12 720 | ||||
| Beta buldge-B | 271 | 524 | ||||
| Beta buldge-C | 4926 | 9214 | ||||
| Beta buldge-G | 3694 | 6931 | ||||
| Beta buldge-S | 717 | 1304 | ||||
| Beta buldge-W | 864 | 1752 | ||||
| Hairpin | 4931 | 12 984 | ||||
| Psiloop | 1197 | 2460 | ||||
| 3 | DNA/RNA-interacting residues | DNA | 417 | 560 | 7 | 13 |
| RNA | 282 | 415 | ||||
| 4 | DNA/RNA-interacting proteins | DNA | 417 | 560 | 7 | 13 |
| RNA | 282 | 415 | ||||
| 5 | Nucleotide-interacting residues | ATP | 228 | 313 | 4 | 22 |
| ADP | 300 | 353 | ||||
| GTP | 52 | 83 | ||||
| GDP | 88 | 120 | ||||
| NAD | 133 | 140 | ||||
| FAD | 156 | 172 | ||||
| FMN | 103 | 117 | ||||
| UDP | 51 | 68 | ||||
| 6 | Nucleotide-interacting proteins | ATP | 228 | 313 | 4 | 22 |
| ADP | 300 | 353 | ||||
| GTP | 52 | 83 | ||||
| GDP | 88 | 120 | ||||
| NAD | 133 | 140 | ||||
| FAD | 156 | 172 | ||||
| FMN | 103 | 117 | ||||
| UDP | 51 | 68 | ||||
| 7 | Ligand-binding residues | SO4 | 2604 | 3312 | 0 | 4 |
| PO4 | 1002 | 1299 | ||||
| NAG | 488 | 727 | ||||
| HEM | 167 | 176 | ||||
| BME | 163 | 191 | ||||
| EDO | 1095 | 1507 | ||||
| PLP | 64 | 65 | ||||
| 8 | Ligand-binding proteins | SO4 | 2604 | 3312 | 0 | 4 |
| PO4 | 1002 | 1299 | ||||
| NAG | 488 | 727 | ||||
| HEM | 167 | 176 | ||||
| BME | 163 | 191 | ||||
| EDO | 1095 | 1507 | ||||
| PLP | 64 | 65 | ||||
| 9 | Metal-interacting residues | Fe | 163 | 215 | 0 | 9 |
| Mg | 1384 | 1908 | ||||
| Ca | 1018 | 1402 | ||||
| Mn | 386 | 521 | ||||
| Zn | 1118 | 1660 | ||||
| Co | 149 | 201 | ||||
| Ni | 243 | 355 | ||||
| 10 | Metal-interacting proteins | Fe | 163 | 215 | 0 | 9 |
| Mg | 1384 | 1908 | ||||
| Ca | 1018 | 1402 | ||||
| Mn | 386 | 521 | ||||
| Zn | 1118 | 1660 | ||||
| Co | 149 | 201 | ||||
| Ni | 243 | 355 |
*Number of non-redundant PDB chains generated using BlastClust at 25% sequence similarity and resolution in between 0–3 Å.
Figure 3Architecture of ccPDB 2.0.