| Literature DB >> 25474071 |
Wei Wang, Juan Liu, Xionghui Zhou.
Abstract
BACKGROUND: Protein-DNA interactions are essential for many biological processes. However, the structural mechanisms underlying these interactions are not fully understood. DNA binding proteins can be classified into double-stranded DNA binding proteins (DSBs) and single-stranded DNA binding proteins (SSBs), and they take part in different biological functions. DSBs usually act as transcriptional factors to regulate the genes' expressions, while SSBs usually play roles in DNA replication, recombination, and repair, etc. Understanding the binding specificity of a DNA binding protein is helpful for the research of protein functions.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25474071 PMCID: PMC4243121 DOI: 10.1186/1471-2105-15-S12-S4
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1All detected tunnels of protein 1A73. The graph shows the CAVER package detects out 27 tunnels in 1A73 protein, and show 3D structure for all tunnels with different colours in protein surface.
Index values for all tunnels of 1A73
| Tunnel | Bottleneck-radius | Length | Curvature |
|---|---|---|---|
| 1 | 3.52 | 2.47 | 1.05 |
| 2 | 2.79 | 3.48 | 1.26 |
| 3 | 2.54 | 7.64 | 1.26 |
| 4 | 1.85 | 5.85 | 1.77 |
| 5 | 1.86 | 12.08 | 2.02 |
| 6 | 1.33 | 14.78 | 1.29 |
| 7 | 1.25 | 12.68 | 1.43 |
| 8 | 0.96 | 13.08 | 1.39 |
| 9 | 1.09 | 15.63 | 1.50 |
| 10 | 1.13 | 16.26 | 1.71 |
| 11 | 1.03 | 29.47 | 1.57 |
| 12 | 0.98 | 25.02 | 1.62 |
| 13 | 1.03 | 35.71 | 1.61 |
| 14 | 1.07 | 33.06 | 2.00 |
| 15 | 0.77 | 19.99 | 1.43 |
| 16 | 0.77 | 35.07 | 1.47 |
| 17 | 0.79 | 25.53 | 2.09 |
| 18 | 0.71 | 24.74 | 1.39 |
| 19 | 0.77 | 28.35 | 1.32 |
| 20 | 0.72 | 38.97 | 1.78 |
| 21 | 0.88 | 51.54 | 1.62 |
| 22 | 0.70 | 46.82 | 1.47 |
| 23 | 0.77 | 36.59 | 1.40 |
| 24 | 0.73 | 41.06 | 1.47 |
| 26 | 0.72 | 45.18 | 3.11 |
| 27 | 0.72 | 47.09 | 2.18 |
This table shows the values of bottleneck radius, length and curvature for the all tunnels. Note that the boldface (25#) presents the values of the largest tunnel.
Figure 2The largest tunnel (25#) of protein 1A73. The graph shows the red tunnel is the largest tunnel in terms of maximizing (Length*Bottleneck Radius).
Figure 3Six templates of the OB-fold domain. They show structural similarity but different topologies, and the similarity of sequences are with <30%.
Figure 4Feature distributions of different kinds of DNA-binding proteins. These graphs show the box plot of the four features for the HOLO and APO datasets. Those are (a) tunnel bottleneck radius, (b) tunnel length, (c) tunnel curvature and (d) TM-score.
Correlation of tunnel length and tunnel curvature
| Dataset | Protein types | Pearson-coefficient | P-value |
|---|---|---|---|
| HOLO set | DSBs | 0.6929 | 2.3752e-23 |
| SSBs | 0.4484 | 0.0054 | |
| APO set | DSBs | 0.9293 | 7.7890e-23 |
| SSBs | 0.5599 | 2.5705e-04 |
Performance on HOLO set
| Feature | ACC | SN | SP | AUC | MCC | F1 | |
|---|---|---|---|---|---|---|---|
| Length | 0.7470 | 0.7725 | 0.7258 | 0.7539 | 0.5207 | 0.7681 | |
| Curvature | 0.6808 | 0.7058 | 0.6525 | 0.6818 | 0.3760 | 0.6949 | |
| TM-Score | 0.7054 | 1.0000 | 0.4050 | 0.6629 | 0.5018 | 0.7823 | |
| Length+Curvature | 0.7725 | 0.7925 | 0.7508 | 0.7738 | 0.5633 | 0.7889 | |
| Length+TM-Score | 0.8434 | 0.8308 | 0.8583 | 0.8476 | 0.7012 | 0.8472 | |
| Curvature+TM-Score | 0.7824 | 0.7750 | 0.7925 | 0.7866 | 0.5848 | 0.7903 | |
| Length+Curvature+TM-Score | 0.8686 | 0.8725 | 0.8608 | 0.8710 | 0.7497 | 0.8782 | |
| Baseline | Random 1 feature | 0.5040 | 0.5082 | 0.4859 | 0.4967 | -0.0056 | 0.5553 |
| Random 2 features | 0.4951 | 0.4907 | 0.5137 | 0.5028 | 0.0040 | 0.5697 | |
| Total 3 features | 0.4938 | 0.4914 | 0.5036 | 0.4968 | -0.0043 | 0.5789 | |
Performance on APO set
| Feature | ACC | SN | SP | AUC | MCC | F1 | |
|---|---|---|---|---|---|---|---|
| Length | 0.6533 | 0.4783 | 0.8267 | 0.6548 | 0.3205 | 0.5732 | |
| Curvature | 0.5487 | 0.3033 | 0.8000 | 0.5676 | 0.1119 | 0.4005 | |
| TM-Score | 0.8015 | 0.9525 | 0.6483 | 0.8117 | 0.6424 | 0.8425 | |
| Length+Curvature | 0.6401 | 0.4658 | 0.8192 | 0.6491 | 0.3101 | 0.5689 | |
| Length+TM-Score | 0.8543 | 0.9800 | 0.7242 | 0.8511 | 0.7378 | 0.8848 | |
| Curvature+TM-Score | 0.8518 | 0.9850 | 0.7167 | 0.8592 | 0.7365 | 0.8836 | |
| Length+Curvature+TM-Score | 0.8310 | 0.9533 | 0.7058 | 0.8286 | 0.6934 | 0.8620 | |
| Baseline | Random 1 feature | 0.4990 | 0.4991 | 0.4988 | 0.4990 | -0.0025 | 0.4712 |
| Random 2 features | 0.4991 | 0.4973 | 0.5016 | 0.4998 | -0.0014 | 0.4875 | |
| Total 3 features | 0.5019 | 0.5054 | 0.4973 | 0.5019 | 0.0028 | 0.5035 | |
Performance of the independent test
| Feature | ACC | SN | SP | AUC | MCC | F1 | |
|---|---|---|---|---|---|---|---|
| Length+TM-Score | 0.7191 | 0.8235 | 0.5789 | 0.7012 | 0.7097 | 0.4179 | |
| Curvature+TM-Score | 0.6854 | 0.7451 | 0.6053 | 0.6752 | 0.6389 | 0.3531 | |
| Length+Curvature+TM-Score | 0.7303 | 0.7647 | 0.6842 | 0.7245 | 0.6842 | 0.4489 | |
| Baseline | Random 1 feature | 0.5009 | 0.5059 | 0.4943 | 0.4997 | 0.0004 | 0.4927 |
| Random 2 features | 0.5011 | 0.5014 | 0.5007 | 0.5010 | 0.0022 | 0.5312 | |
| Total 3 features | 0.5006 | 0.5041 | 0.4958 | 0.5007 | -0.0002 | 0.5323 | |
Performance on mixed set
| Feature | ACC | SN | SP | AUC | MCC | F1 | |
|---|---|---|---|---|---|---|---|
| Length | 0.6161 | 0.7132 | 0.5170 | 0.6237 | 0.2446 | 0.6548 | |
| Curvature | 0.5984 | 0.6887 | 0.5063 | 0.6097 | 0.2018 | 0.6374 | |
| TM-Score | 0.7558 | 0.9846 | 0.5230 | 0.7571 | 0.5756 | 0.8104 | |
| Length+Curvature | 0.6114 | 0.7004 | 0.5211 | 0.6169 | 0.2300 | 0.6457 | |
| Length+TM-Score | 0.7984 | 0.8880 | 0.7089 | 0.8076 | 0.6222 | 0.8259 | |
| Curvature+TM-Score | 0.7701 | 0.9407 | 0.5980 | 0.7802 | 0.5844 | 0.8133 | |
| Length+Curvature+TM-Score | 0.8251 | 0.8605 | 0.7904 | 0.8265 | 0.6632 | 0.8372 | |
| Baseline | Random 1 feature | 0.5030 | 0.5070 | 0.4918 | 0.4992 | -0.0011 | 0.5201 |
| Random 2 features | 0.4996 | 0.4986 | 0.5024 | 0.5001 | 0.0013 | 0.5369 | |
| Total 3 features | 0.5014 | 0.5021 | 0.4992 | 0.5007 | 0.0015 | 0.5525 | |