| Literature DB >> 19958487 |
Yu-Feng Huang1, Chun-Chin Huang, Yu-Cheng Liu, Yen-Jen Oyang, Chien-Kang Huang.
Abstract
BACKGROUND: Protein-DNA interactions are essential for fundamental biological activities including DNA transcription, replication, packaging, repair and rearrangement. Proteins interacting with DNA can be classified into two categories of binding mechanisms - sequence-specific and non-specific binding. Protein-DNA specific binding provides a mechanism to recognize correct nucleotide base pairs for sequence-specific identification. Protein-DNA non-specific binding shows sequence independent interaction for accelerated targeting by interacting with DNA backbone. Both sequence-specific and non-specific binding residues contribute to their roles for interaction.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19958487 PMCID: PMC2788376 DOI: 10.1186/1471-2164-10-S3-S23
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Sequence-specific and non-specific binding residues of PDB 2PRT:A. Residues colored by red are sequence-specific binding residues. Residues colored by blue are non-specific binding residues. Residues colored by purple are both sequence-specific and non-specific binding residues.
Overall performance of proposed approach
| Binding type | # of residues | TP | FP | TN | FN | Sensitivity | Specificity | Precision | Accuracy |
|---|---|---|---|---|---|---|---|---|---|
| 60466 | 1764 | 395 | 56553 | 1754 | 50.14% | 99.31% | 81.70% | 96.45% | |
| 60466 | 4652 | 2454 | 49245 | 4115 | 53.06% | 95.25% | 65.47% | 89.14% | |
| 60466 | 5651 | 2206 | 48321 | 4288 | 56.86% | 95.63% | 71.92% | 89.26% |
Performance of the binding site prediction in terms of secondary structure elements
| Binding type | Secondary structure elements | # of residues | TP | FP | TN | FP | Sensitivity | Specificity | Precision | Accuracy |
|---|---|---|---|---|---|---|---|---|---|---|
| 32670 | 1322 | 279 | 30160 | 909 | 59.26% | 99.08% | 82.57% | 96.36% | ||
| 5259 | 22 | 0 | 5077 | 160 | 12.09% | 100.00% | 100.00% | 96.96% | ||
| 22537 | 420 | 116 | 21316 | 685 | 38.01% | 99.46% | 78.36% | 96.45% | ||
| 32670 | 2197 | 1005 | 27458 | 2010 | 52.22% | 96.47% | 66.61% | 90.77% | ||
| 5259 | 257 | 185 | 4524 | 293 | 46.73% | 96.07% | 58.15% | 90.91% | ||
| 22537 | 2198 | 1264 | 17263 | 1812 | 54.81% | 93.18% | 63.49% | 86.35% | ||
| 32670 | 2988 | 858 | 26783 | 2041 | 59.42% | 96.90% | 77.69% | 91.13% | ||
| 5259 | 261 | 181 | 4472 | 345 | 43.07% | 96.11% | 59.05% | 90.00% | ||
| 22537 | 2402 | 1167 | 17066 | 1902 | 55/81% | 93.60% | 67.305 | 86.38% | ||
Overall performance of protein-DNA binding mode prediction
| Protein-DNA binding mode | # of protein chains | Sensitivity | Precision |
|---|---|---|---|
| 146 | 100.00% | 80.22% | |
| 220 | 70.45% | 73.46% | |
| 166 | 68.07% | 88.98% | |
| 38 | 34.21% | 52.00% | |
| 30 | 93.33% | 50.91% |
Figure 2A difficult case (PDB ID 1LMB:4) of binding residue prediction, which can be enhanced with the best aligned template of correct predicted protein-DNA binding mode. Residues colored by red means false positive. Residues colored by blue means false negative. Residues colored by green means true positive. (a) Prediction of sequence-specific binding residues. (b) Prediction of non-specific binding residues. (c) Combination of sequence-specific and non-specific binding residues prediction. (d) Enhanced prediction with the best aligned template of correct protein-DNA binding mode prediction.
Performance delivered by alternative predictors of DNA-binding residues, where the F-measure is the harmonic mean of precision and sensitivity
| Predictor | Sensitivity | Specificity | Accuracy | Precision | F-measure |
|---|---|---|---|---|---|
| 0.501 | 0.993 | 0.965 | 0.817 | 0.622 | |
| 0.530 | 0.953 | 0.891 | 0.655 | 0.586 | |
| 0.569 | 0.956 | 0.893 | 0.719 | 0.635 | |
| 0.682 | 0.660 | 0.664 | 0.308* | 0.425* | |
| 0.410 | 0.871 | 0.780 | 0.439* | 0.424* | |
| 0.652 | 0.728 | 0.722 | 0.186* | 0.289* | |
| 0.791 | 0.786 | 0.800 | -* | -* |
*The numbers with an asterisk are those that have been derived from the numbers reported in the related studies.
Dataset of 253 TF-DNA complexes for DNA-binding residues prediction
| 253 TF-DNA Complexes | |||||||||
|---|---|---|---|---|---|---|---|---|---|
Figure 3Overall framework for DNA-binding residues prediction.
Protein-DNA binding modes and their corresponding Pfam domains
| Protein-DNA Binding mode | Pfam Domain |
|---|---|
| Zipper-type | HLH |
| bZIP_1 | |
| bZIP_2 | |
| Helix-turn-helix | HTH_AraC |
| TetR_N | |
| Trp_repressor | |
| Homeobox | |
| E2F_TDP | |
| GntR | |
| Fe_dep_repress | |
| Ets | |
| HTH_3 | |
| Sigma70_r4 | |
| LacI | |
| Fork_head | |
| Resolvase | |
| Pou | |
| Zinc-coordinating | zf-C4 |
| zf-C2H2 | |
| Zn_clus | |
| GATA | |
| P53 | |
| β-hairpin/ribbon | MetJ |
| RHH_1 | |
| Arc | |
| Others | SRF-TF |
| Runt | |
| PAX | |
| RHD | |
| Autoind_bind | |
| cNMP_binding | |
Illustration of feature set for protein-DNA binding mode prediction
| Feature title | Feature descriptions |
|---|---|
| Class label | 5 protein-DNA binding modes |
| 1. zipper-type | |
| 2. helix-turn-helix (HTH) | |
| 3. zinc-coordinating | |
| 4. β-hairpin/ribbon | |
| 5. others | |
| Non-specific binding | 20 dimensions of amino acid |
| 3 dimensions of secondary structure elements | |
| # of binding residues | |
| Protein chain information | 3 dimensions of secondary structure elements |
| # of total residues in a protein chain | |