| Literature DB >> 20353553 |
Hifzur R Ansari1, Gajendra P S Raghava.
Abstract
BACKGROUND: Small molecular cofactors or ligands play a crucial role in the proper functioning of cells. Accurate annotation of their target proteins and binding sites is required for the complete understanding of reaction mechanisms. Nicotinamide adenine dinucleotide (NAD+ or NAD) is one of the most commonly used organic cofactors in living cells, which plays a critical role in cellular metabolism, storage and regulatory processes. In the past, several NAD binding proteins (NADBP) have been reported in the literature, which are responsible for a wide-range of activities in the cell. Attempts have been made to derive a rule for the binding of NAD+ to its target proteins. However, so far an efficient model could not be derived due to the time consuming process of structure determination, and limitations of similarity based approaches. Thus a sequence and non-similarity based method is needed to characterize the NAD binding sites to help in the annotation. In this study attempts have been made to predict NAD binding proteins and their interacting residues (NIRs) from amino acid sequence using bioinformatics tools.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20353553 PMCID: PMC2853471 DOI: 10.1186/1471-2105-11-160
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Percentage composition of NAD interacting and non-interacting residues.
Performance of SVM model developed using amino acid sequence (binary pattern) at different window lengths.
| Window size | Kernel parameters | Thr* | Sen (%) | Spe (%) | Acc (%) | MCC |
|---|---|---|---|---|---|---|
| t 2 g 0.1 j 1 c 1 | 0 | 63.41 | 61.27 | 62.34 | 0.25 | |
| t 2 g 0.1 j 1 c 1 | 0 | 64.46 | 65.13 | 64.79 | 0.3 | |
| t 2 g 0.1 j 1 c 1 | 0 | 67.98 | 66.83 | 67.4 | 0.35 | |
| t 2 g 0.1 j 1 c 1 | 0 | 69.09 | 69.32 | 69.21 | 0.38 | |
| t 2 g 0.1 j 1 c 1 | 0 | 69.7 | 71.37 | 70.54 | 0.41 | |
| t 2 g 0.1 j 1 c 10 | 0 | 70.81 | 72.78 | 71.79 | 0.44 | |
| t 2 g 0.1 j 1 c 10 | 0 | 71.56 | 73.89 | 72.73 | 0.45 | |
| t 2 g 0.1 j 1 c 100 | 0 | 71.27 | 72.49 | 71.88 | 0.44 | |
| t 2 g 0.1 j 1 c 10 | 0 | 70.81 | 73.68 | 72.24 | 0.45 |
*(Thr- Threshold, Sen - Sensitivity, Spe - Specificity, Acc - Accuracy, MCC - Matthew's correlation coefficient)
SVM models were trained and tested on a dataset having equal number of positive and negative data. Bold font shows the performance and parameters of selected SVM model.
Performance of SVM models developed using PSSM profile of proteins at different window lengths.
| Window size | Kernel parameters | Thr* | Sen (%) | Spe (%) | Acc (%) | MCC |
|---|---|---|---|---|---|---|
| t 2 g 1.0 j 1 c 10 | 0 | 83.26 | 82.61 | 82.93 | 0.66 | |
| t 2 g 1.0 j 1 c 10 | 0 | 82.59 | 87.51 | 85.05 | 0.7 | |
| t 2 g 0.1 j 1 c 10 | 0 | 82.39 | 84.67 | 83.53 | 0.67 | |
| t 2 g 0.1 j 1 c 10 | 0 | 84.18 | 86.13 | 85.16 | 0.7 | |
| t 2 g 0.1 j 1 c 10 | 0 | 85.28 | 86.25 | 85.77 | 0.72 | |
| t 2 g 0.1 j 1 c 10 | 0 | 85.7 | 86.52 | 86.11 | 0.72 | |
| t 2 g 0.1 j 1 c 10 | 0 | 85.36 | 86.31 | 85.84 | 0.72 | |
| t 2 g 0.1 j 1 c 10 | 0 | 83.69 | 87.99 | 85.84 | 0.72 | |
| t 2 g 0.1 j 1 c 10 | 0 | 85.52 | 87.33 | 86.43 | 0.73 |
SVM models were trained and tested on a dataset having equal number of positive and negative data. Bold font shows the performance and parameters of selected SVM model.
Figure 2Structure of human Aldose reductase (2ACS) showing prediction of NAD interacting residues by NADbinder. NAD shown in magenta, True positives in red and False positives in blue colour (only the portion of protein with residue mentioned is shown here).
Figure 3ROC Plot for SVM models developed using single sequence (binary pattern) for window size from 3 to 21. (W indicates the window length and value in bracket shows Area under curve).
Figure 4ROC Plot for PSSM based SVM models developed using window size from 3 to 21. (W indicates the window length and value in bracket shows Area under curve).