Sebastian Schmeier, Boris Jankovic, Vladimir B Bajic.
Abstract
BACKGROUND: Physical interactions between transcription factors (TFs) are necessary for forming regulatory protein complexes and thus play a crucial role in gene regulation. Currently, knowledge about the mechanisms of these TF interactions is incomplete and the number of known TF interactions is limited. Computational prediction of such interactions can help identify potential new TF interactions as well as contribute to better understanding the complex machinery involved in gene regulation.
Year: 2011 PMID: 21750739 PMCID: PMC3130058 DOI: 10.1371/journal.pone.0021887
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Confusion Matrix.
| | | Actual class | |
| | | Positive | Negative |
| Predicted class | Positive | True positive (TP) | False positive (FP) |
| Predicted class | Negative | False negative (FN) | True negative (TN) |
The table indicates the nomenclature for an outcome of a prediction relative to the actual value.
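The nomenclature above can be illustrated with a minimal sketch that tallies the four outcomes from paired labels. The function name `confusion_counts` and the example labels are hypothetical and not taken from the paper; `True` is assumed to mean "interacting TF pair".

```python
def confusion_counts(actual, predicted):
    """Tally TP, FP, FN and TN from two equal-length lists of booleans.

    Hypothetical helper for illustration; True is assumed to mark the
    positive class (an interacting TF pair).
    """
    if len(actual) != len(predicted):
        raise ValueError("actual and predicted must have the same length")

    tp = fp = fn = tn = 0
    for is_positive, predicted_positive in zip(actual, predicted):
        if predicted_positive and is_positive:
            tp += 1  # predicted positive, actually positive
        elif predicted_positive and not is_positive:
            fp += 1  # predicted positive, actually negative
        elif not predicted_positive and is_positive:
            fn += 1  # predicted negative, actually positive
        else:
            tn += 1  # predicted negative, actually negative
    return {"TP": tp, "FP": fp, "FN": fn, "TN": tn}


# Made-up labels, used only to exercise the function.
actual = [True, True, True, False, False, False]
predicted = [True, True, False, True, False, False]
counts = confusion_counts(actual, predicted)
print(counts)
```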
Performance measures.
| Measurement | Equation |
| Precision | TP/(TP+FP) |
| Sensitivity (Recall) | TP/(TP+FN) |
| Specificity | TN/(TN+FP) |
| False Discovery Rate (FDR) | FP/(FP+TP) |
| Accuracy | (TP+TN)/(TP+FP+TN+FN) |
| F-measure | 2 * Precision * Sensitivity/(Precision+Sensitivity) |
The table shows the performance measures used. TP: True positives; FP: False positives; TN: True negatives; FN: False negatives.
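The equations in the table translate directly into code. The sketch below is illustrative only: the function name `performance_measures` and the example counts are assumptions, not values from the paper, and results are returned as fractions rather than percentages.

```python
def performance_measures(tp, fp, tn, fn):
    """Compute the measures from the table for one confusion matrix.

    Hypothetical helper; it does not guard against zero denominators,
    which occur when a class is never predicted or never present.
    """
    precision = tp / (tp + fp)
    sensitivity = tp / (tp + fn)  # also called recall
    specificity = tn / (tn + fp)
    fdr = fp / (fp + tp)  # complement of precision
    accuracy = (tp + tn) / (tp + fp + tn + fn)
    f_measure = 2 * precision * sensitivity / (precision + sensitivity)
    return {
        "precision": precision,
        "sensitivity": sensitivity,
        "specificity": specificity,
        "fdr": fdr,
        "accuracy": accuracy,
        "f_measure": f_measure,
    }


# Made-up counts, chosen only to make the arithmetic easy to follow.
measures = performance_measures(tp=80, fp=10, tn=90, fn=20)
for name, value in measures.items():
    print(f"{name:12s} {100 * value:6.2f}%")
```

Because FDR and precision share the denominator TP+FP, they always sum to 1, which is a convenient sanity check on any reported results.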
Figure 1. Feature vector length versus accuracy, specificity and sensitivity.
For each feature vector length selected through the feature selection algorithm explained above, the figure shows the average accuracy, sensitivity and specificity of the 10-fold CV. The model that uses 97 features (red dashed line) achieves the best accuracy of 82.04%, with a sensitivity of 76.45% and a specificity of 88.61%.
Cross-validation results.
| Fold | Sensitivity | Specificity | Precision | FDR | Accuracy | F-measure |
| 1 | 75.21 | 90.91 | 91.00 | 9.00 | 82.27 | 82.35 |
| 2 | 80.34 | 92.16 | 92.16 | 7.84 | 85.84 | 85.85 |
| 3 | 80.51 | 88.24 | 88.79 | 11.21 | 84.09 | 84.44 |
| 4 | 80.34 | 80.39 | 82.46 | 17.54 | 80.37 | 81.39 |
| 5 | 75.63 | 84.31 | 84.91 | 15.09 | 79.64 | 80.00 |
| 6 | 77.88 | 90.65 | 89.80 | 10.20 | 84.09 | 83.41 |
| 7 | 77.12 | 89.22 | 89.22 | 10.78 | 82.73 | 82.73 |
| 8 | 68.29 | 92.78 | 92.31 | 7.69 | 79.09 | 78.50 |
| 9 | 73.73 | 91.18 | 90.63 | 9.37 | 81.81 | 81.31 |
| 10 | 75.42 | 86.27 | 86.41 | 13.59 | 80.46 | 80.54 |
| Average | 76.45 | 88.61 | 88.77 | 11.23 | 82.04 | 82.05 |
The table shows the individual and average results of the 10-fold CV run using 97 features. All values are percentages. FDR: false discovery rate.
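As a consistency check, the Average row can be recomputed from the ten fold rows. The sketch below copies the values from the table; the names `FOLDS`, `COLUMNS`, `REPORTED_AVERAGE` and `column_means` are hypothetical, and the check assumes the reported averages are plain arithmetic means of the per-fold values.

```python
COLUMNS = ("Sensitivity", "Specificity", "Precision", "FDR", "Accuracy", "F-measure")

# Per-fold results in percent, copied from the table (folds 1 to 10).
FOLDS = [
    (75.21, 90.91, 91.00, 9.00, 82.27, 82.35),
    (80.34, 92.16, 92.16, 7.84, 85.84, 85.85),
    (80.51, 88.24, 88.79, 11.21, 84.09, 84.44),
    (80.34, 80.39, 82.46, 17.54, 80.37, 81.39),
    (75.63, 84.31, 84.91, 15.09, 79.64, 80.00),
    (77.88, 90.65, 89.80, 10.20, 84.09, 83.41),
    (77.12, 89.22, 89.22, 10.78, 82.73, 82.73),
    (68.29, 92.78, 92.31, 7.69, 79.09, 78.50),
    (73.73, 91.18, 90.63, 9.37, 81.81, 81.31),
    (75.42, 86.27, 86.41, 13.59, 80.46, 80.54),
]

# The "Average" row as reported in the table.
REPORTED_AVERAGE = (76.45, 88.61, 88.77, 11.23, 82.04, 82.05)


def column_means(rows):
    """Return the arithmetic mean of each column across all rows."""
    n = len(rows)
    return tuple(sum(column) / n for column in zip(*rows))


means = column_means(FOLDS)
for name, mean, reported in zip(COLUMNS, means, REPORTED_AVERAGE):
    print(f"{name:12s} recomputed={mean:6.2f} reported={reported:6.2f}")
```

Within each fold, precision and FDR also add up to 100%, as expected from their definitions.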