| Literature DB >> 21689482 |
Jingna Si1, Zengming Zhang, Biaoyang Lin, Michael Schroeder, Bingding Huang.
Abstract
BACKGROUND: Protein-DNA interactions play an important role in many fundamental biological activities such as DNA replication, transcription and repair. Identification of amino acid residues involved in DNA binding site is critical for understanding of the mechanism of gene regulations. In the last decade, there have been a number of computational approaches developed to predict protein-DNA binding sites based on protein sequence and/or structural information.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21689482 PMCID: PMC3121123 DOI: 10.1186/1752-0509-5-S1-S7
Source DB: PubMed Journal: BMC Syst Biol ISSN: 1752-0509
Summary of detailed characteristics of the six available web servers for DNA-binding sites prediction.
| Machine learning methods | Properties used in training | Online website | |
|---|---|---|---|
| DISIS | Support Vector Machine (SVM) | Evolutionary profile | |
| DNABindR | Naïve Bayes classifier | Relative solvent accessibility | |
| BindN | SVM | The side chain pKa value | |
| BindN-rf | Random forest | The side chain pKa value | |
| DP-Bind | SVM | Sequence-based BLOSUM62 | |
| DBS-PRED | Neural network | Protein sequence information | |
Figure 1The prediction workflow of the metaDBSite approach. The protein sequence is submitted to the six predictors and the prediction results are retrieved. Then these predicted results are input into the trained SVM and the final prediction (which residues are DNA-binding sites (marked as “+”) is made.
The prediction results of metaDBSite (10-fold SVM cross-validation) and the other six methods alone for the PDNA-316 benchmark dataset.
| Method | Accuracy | Sensitivity | Specificity | Strength | MCC | F-measure |
|---|---|---|---|---|---|---|
| metaDBSite | 0.77 | 0.77 | 0.77 | 0.77 | 0.32 | 0.33 |
| BindN | 0.78 | 0.54 | 0.80 | 0.67 | 0.21 | 0.26 |
| BindN-rf | 0.82 | 0.67 | 0.83 | 0.75 | 0.32 | 0.34 |
| DBS-PRED | 0.75 | 0.53 | 0.76 | 0.65 | 0.17 | 0.23 |
| DISIS | 0.92 | 0.19 | 0.98 | 0.59 | 0.25 | 0.27 |
| DNABindR | 0.73 | 0.66 | 0.74 | 0.70 | 0.23 | 0.26 |
| dpBind | 0.78 | 0.69 | 0.79 | 0.74 | 0.29 | 0.31 |
Figure 2Performance of metaDBSite with DNA-binding site definitions using different distance cut-offs and ASA method on the PDNA-316 benchmark dataset.
Figure 3Representative protein-DNA complex: replication initiation protein and its binding DNA regions (PDB ID: 1REP). A: Predicted DNA-binding residues are in blue. B: The real DNA-binding residues defined with 3.5 Å are in red. The replication initiation protein is shown in green.
Figure 4Screenshot of result page on the metaDBSite web-server. The predicted DNA-binding residues are marked “+” for the sixe predictors and “*” for metaDBSite and are all colored green. The non DNA-binding residues are marked “-”.