| Literature DB >> 20671979 |
Adeel Malik1, Ahmad Firoz, Vivekanand Jha, Shandar Ahmad.
Abstract
Understanding of the three-dimensional structures of proteins that interact with carbohydrates covalently (glycoproteins) as well as noncovalently (protein-carbohydrate complexes) is essential to many biological processes and plays a significant role in normal and disease-associated functions. It is important to have a central repository of knowledge available about these protein-carbohydrate complexes as well as preprocessed data of predicted structures. This can be significantly enhanced by tools de novo which can predict carbohydrate-binding sites for proteins in the absence of structure of experimentally known binding site. PROCARB is an open-access database comprising three independently working components, namely, (i) Core PROCARB module, consisting of three-dimensional structures of protein-carbohydrate complexes taken from Protein Data Bank (PDB), (ii) Homology Models module, consisting of manually developed three-dimensional models of N-linked and O-linked glycoproteins of unknown three-dimensional structure, and (iii) CBS-Pred prediction module, consisting of web servers to predict carbohydrate-binding sites using single sequence or server-generated PSSM. Several precomputed structural and functional properties of complexes are also included in the database for quick analysis. In particular, information about function, secondary structure, solvent accessibility, hydrogen bonds and literature reference, and so forth, is included. In addition, each protein in the database is mapped to Uniprot, Pfam, PDB, and so forth.Entities:
Year: 2010 PMID: 20671979 PMCID: PMC2909730 DOI: 10.1155/2010/436036
Source DB: PubMed Journal: Adv Bioinformatics ISSN: 1687-8027
Figure 1Screen shot of PROCARB Homepage. From this homepage the user can search the database with a four-letter PDB code or by a keyword search.
Figure 2Client-side working of PROCARB modules.
Various types of energies for each N-linked model after energy minimization.
| S. No. | Uniprot Id (Model) | Initial Potential Energy (kcal/mol) | Potential Energy (kcal/mol) | Van der Waals Energy (kcal/mol) | Electrostatic Energy (kcal/mol) |
|---|---|---|---|---|---|
| (1) | P02765 | 7202.80 | −5269.26 | −526.46 | −5619.75 |
| (2) | P03952 | 1432684.90 | −13432.61 | −1664.91 | −13617.06 |
| (3) | P08195 | 128794.29 | −27001.31 | −3014.99 | −29181.87 |
| (4) | P08861 | 37688.55 | −12399.68 | −1643.87 | −12720.00 |
| (5) | P08962 | 7860.19 | −4611.32 | −358.31 | −5632.08 |
| (6) | P10253 | 4277.54 | −25010.03 | −1708.90 | −32940.69 |
| (7) | P14625 | 334135.61 | −28901.57 | −3115.71 | −30375.74 |
| (8) | P15586 | 33799.59 | −25047.64 | −2548.84 | −28518.54 |
| (9) | P17346 | 2615.44 | −4216.14 | 227.87 | −8462.63 |
| (10) | P19823 | 2306.05 | −5570.83 | −396.18 | −7580.99 |
| (11) | P29622 | 761535.82 | −17395.98 | −2115.57 | −17857.89 |
| (12) | P30805 | 809.78 | −6347.63 | −627.03 | −7150.57 |
| (13) | P35613 | 11089.12 | −8539.86 | −814.15 | −9260.29 |
| (14) | P49256 | 20519.42 | −11541.44 | −735.38 | −14684.69 |
| (15) | P50897 | 10651.77 | −16006.13 | −2022.72 | −16075.47 |
| (16) | P51688 | 302177.17 | −19114.24 | −1683.13 | −24207.85 |
| (17) | P52193 | 569701.07 | −16750.64 | −1694.27 | −18734.06 |
| (18) | Q13510 | 27411.06 | −12442.43 | −1252.58 | −14055.71 |
| (19) | Q13586 | 517.28 | −2456.82 | −306.03 | −2525.30 |
| (20) | Q14126 | 627306.18 | −11241.12 | −1366.40 | −11832.11 |
| (21) | Q95114 | 7486375.26 | −17552.50 | −1706.38 | −19594.20 |
| (22) | Q96PD5 | 9317.93 | −7957.20 | −1001.23 | −8462.80 |
| (23) | Q9HB40 | 31337.61 | −21434.10 | −1582.26 | −25924.32 |
| (24) | Q9HDC9 | 8706.65 | −16186.38 | −1371.38 | −18996.95 |
| (25) | Q9Y4L1 | 151149.78 | −22399.76 | −2519.80 | −23513.20 |
| (26) | Q9YGP1 | 6321.75 | −8051.55 | −921.83 | −8160.34 |
Various types of energies for each O-linked model after energy minimization.
| S. No. | Uniprot Id (Model) | Initial Potential Energy (kcal/mol) | Potential Energy (kcal/mol) | Van der Waals Energy (kcal/mol) | Electrostatic Energy (kcal/mol) |
|---|---|---|---|---|---|
| (1) | P00304 | 1011.16 | −3277.8 | −107.88 | −5373.8 |
| (2) | P01217 | −1564.4 | −3973.3 | −492.97 | −4077.3 |
| (3) | P01588 | 937.248 | −8253.1 | −991.77 | −8539.6 |
| (4) | P01866 | 4.1 | −15390 | −2062.7 | −16107 |
| (5) | P05451 | −1936.7 | −7479.7 | −966.71 | −7567.7 |
| (6) | P06027 | 3794.54 | −7010.8 | −823.42 | −7490.5 |
| (7) | P06870 | 27038.2 | −13000 | −1715 | −13041 |
| (8) | P08514 | 140740.16 | −49738.11 | −4626.67 | −57424.53 |
| (9) | P26631 | 4.1 | −2829.4 | −263.85 | −3032.5 |
| (10) | P28314 | −1098 | −17977 | −2188.1 | −18289 |
| (11) | P28512 | 165706808.06 | −3414.87 | −282.99 | −3687.12 |
| (12) | P36912 | 7304.93 | −11640.89 | −1207.84 | −12525.97 |
| (13) | P40225 | −2638.4 | −6971.1 | −954.33 | −7128.9 |
| (14) | P48304 | −2795 | −7659 | −930.64 | −7811.2 |
| (15) | P51671 | 76.0533 | −2922.7 | −436.77 | −3114 |
| (16) | P80370 | −230.89 | −1203.16 | −102.38 | −1272.45 |
| (17) | P81054 | 127345.09 | −19664.21 | −1900.14 | −21238.18 |
| (18) | P81428 | 127345 | −17179 | −1873 | −18763 |
| (19) | P98119 | 3162.77 | −18009.99 | −1034.49 | −23939.79 |
| (20) | Q09163 | 15646.54 | −5165.18 | −500.29 | −5614.36 |
Ramachandran analysis of all N-linked models.
| S. No. | Uniprot Id (Model) | Most favoured (%) | Additionally allowed (%) | Generously allowed (%) | Disallowed (%) |
|---|---|---|---|---|---|
| (1) | P02765 | 56.2 | 31.5 | 9.0 | 3.4 |
| (2) | P03952 | 75.7 | 22.8 | 1.0 | 0.5 |
| (3) | P08195 | 63.0 | 29.6 | 5.1 | 2.2 |
| (4) | P08861 | 75.9 | 22.1 | 2.0 | 0.5 |
| (5) | P08962 | 70.1 | 25.3 | 3.4 | 1.1 |
| (6) | P10253 | 60.8 | 30.4 | 6.8 | 2.0 |
| (7) | P14625 | 68.2 | 25.5 | 5.1 | 1.3 |
| (8) | P15586 | 61.3 | 29.7 | 6.7 | 2.3 |
| (9) | P17346 | 41.1 | 47.3 | 8.5 | 3.1 |
| (10) | P19823 | 70.0 | 26.9 | 3.1 | 0.0 |
| (11) | P29622 | 71.7 | 24.9 | 1.7 | 1.7 |
| (12) | P30805 | 60.6 | 32.1 | 4.6 | 2.8 |
| (13) | P35613 | 44.8 | 38.5 | 13.3 | 3.5 |
| (14) | P49256 | 59.6 | 34.3 | 5.6 | 0.5 |
| (15) | P50897 | 77.9 | 21.2 | 0.8 | 0.4 |
| (16) | P51688 | 66.8 | 25.1 | 4.3 | 3.8 |
| (17) | P52193 | 51.1 | 36.6 | 36.6 | 9.1 |
| (18) | Q13510 | 64.1 | 29.6 | 4.0 | 2.2 |
| (19) | Q13586 | 80.4 | 13.0 | 4.3 | 2.2 |
| (20) | Q14126 | 69.9 | 27.5 | 2.6 | 0.0 |
| (21) | Q95114 | 61.5 | 33.0 | 3.3 | 2.2 |
| (22) | Q96PD5 | 68.9 | 27.4 | 3.0 | 0.7 |
| (23) | Q9HB40 | 61.4 | 32.5 | 4.1 | 1.9 |
| (24) | Q9HDC9 | 46.7 | 39.7 | 9.7 | 3.9 |
| (25) | Q9Y4L1 | 66.9 | 24.4 | 5.9 | 2.8 |
| (26) | Q9YGP1 | 69.4 | 28.2 | 2.4 | 0.0 |
Ramachandran analysis of all O-linked models.
| S. No. | Uniprot Id (Model) | Most favoured (%) | Additionally allowed (%) | Generously allowed (%) | Disallowed (%) |
|---|---|---|---|---|---|
| (1) | P00304 | 37.3 | 54.2 | 4.8 | 3.6 |
| (2) | P01217 | 70.1 | 23.4 | 3.9 | 2.6 |
| (3) | P01588 | 65.3 | 28.6 | 4.8 | 1.4 |
| (4) | P01866 | 70.0 | 25.4 | 2.5 | 2.1 |
| (5) | P05451 | 75.4 | 23.0 | 1.6 | 0.0 |
| (6) | P06027 | 64.8 | 25.6 | 5.6 | 4.0 |
| (7) | P06870 | 77.9 | 19.6 | 1.0 | 1.5 |
| (8) | P08514 | 50.6 | 38.7 | 7.4 | 3.4 |
| (9) | P26631 | 58.3 | 35.4 | 4.2 | 2.1 |
| (10) | P28314 | 76.0 | 23.3 | 0.7 | 0.0 |
| (11) | P28512 | 54.0 | 38.0 | 4.0 | 4.0 |
| (12) | P36912 | 55.7 | 39.7 | 4.0 | 0.6 |
| (13) | P40225 | 80.2 | 16.5 | 3.3 | 0.0 |
| (14) | P48304 | 74.2 | 23.4 | 2.3 | 0.0 |
| (15) | P51671 | 68.8 | 25.0 | 3.1 | 3.1 |
| (16) | P80370 | 45.0 | 55.0 | 0.0 | 0.0 |
| (17) | P81054 | 78.0 | 22.0 | 0.0 | 0.0 |
| (18) | P81428 | 49.3 | 38.2 | 7.4 | 5.1 |
| (19) | P98119 | 45.5 | 38.0 | 11.1 | 5.4 |
| (20) | Q09163 | 46.5 | 36.0 | 8.1 | 9.3 |
AUC scores for protein-carbohydrate complexes submitted to PDB between January 2007 and November 2008. Here PROCARB30-SS: single-sequence-based scores for 30 nonredundant protein-carbohydrate complexes, PROCARB30-PSSM: PSSM-based scores for 30 nonredundant protein-carbohydrate complexes, PROCARB61-SS: single-sequence-based scores for all 61 protein-carbohydrate complexes, and PROCARB61-PSSM: PSSM-based scores for all 61 protein-carbohydrate complexes.
| S. no. | Dataset | AUC Scores |
|---|---|---|
| (1) | PROCARB30-SS | 0.6518 |
| (2) | PROCARB30-PSSM | 0.637 |
| (3) | PROCARB61-SS | 0.6834 |
| (4) | PROCARB61-PSSM | 0.656 |
Figure 3(a) Roc plot for 30 nonredundant protein carbohydrate complexes submitted to PDB between January 2007 and November 2008. (b) Roc plot for 61 protein carbohydrate complexes submitted to PDB between January 2007 and November 2008.
Sample output from Contact Calculator exhibiting residue-carbohydrate details at atomic level.
| Your uploaded file: IFV2.pdb: | |||||||
|---|---|---|---|---|---|---|---|
| TITLE: THE HC FRAGMENT OF TETANUS TOXIN COMPLEXED WITH AN ANAL 2 OF ITS GANGLIOSIDE RECEPTOR GT1B | |||||||
| The protein-carbohydrate contact details at ≤4 Å: (For all chains) | |||||||
|
| |||||||
| Distance (Å) | Residue | Position | Atom | Protein-Chain | Carbohydrate | Atom Chain | |
| 3.739 | ASN | 219 | CB | A |
| O6 | Z |
| 3.753 | ASN | 219 | CG | A |
| O6 | Z |
| 3.937 | ASN | 219 | OD1 | A |
| O6 | Z |
| 3.993 | ASN | 219 | ND2 | A |
| C4 | Z |
| 2.813 | ASN | 219 | ND2 | A |
| O4 | Z |
| 3.660 | ASN | 219 | ND2 | A |
| O5 | Z |
| 3.922 | LEU | 221 | CB | A |
| C6 | Z |
| 3.446 | LEU | 221 | CB | A |
| O6 | Z |
| 3.546 | LEU | 221 | CDl | A |
| C6 | Z |
| 3.827 | LEU | 221 | CDl | A |
| O6 | Z |
| *** | *** | *** | *** | *** | *** | *** | *** |
| *** | *** | *** | *** | *** | *** | *** | *** |
| 3.826 | TRP | 289 | CDI | A |
| CS | Z |
| 3.826 | TRP | 289 | CDI | A |
| O6 | Z |
| 3.934 | TRP | 289 | CH2 | A |
| O1A | Z |
| 3.907 | TYR | 290 | OH | A |
| C6 | Z |
|
| |||||||
| The summary of Protein-Carbohydrate contact positions of your uploaded pdb file: | |||||||
| Total number of contacts = 7 | |||||||
| At Position: 219 221 222 270 271 289 290***** | |||||||