| Literature DB >> 22759579 |
Yi Xiong1, Juan Liu, Wen Zhang, Tao Zeng.
Abstract
BACKGROUND: The heme-protein interactions are essential for various biological processes such as electron transfer, catalysis, signal transduction and the control of gene expression. The knowledge of heme binding residues can provide crucial clues to understand these activities and aid in functional annotation, however, insufficient work has been done on the research of heme binding residues from protein sequence information.Entities:
Year: 2012 PMID: 22759579 PMCID: PMC3380730 DOI: 10.1186/1477-5956-10-S1-S20
Source DB: PubMed Journal: Proteome Sci ISSN: 1477-5956 Impact factor: 2.480
Figure 1Workflow of the proposed iterative feature selection process.
Figure 2Flowchart of generating the PSSMPP profile. Given a heme binding protein sequence (PDB id: 1A6M; Chain: A), a window size of 3 is set for a simple illustration. The central residue is 9 L (residue number in the sequence; residue name), with its two neighbouring residues on both sides (8 Q and 10 V).
The list of the selected subset of physicochemical properties on Pheme-75 dataset
| ID | Description | AUC |
|---|---|---|
| QIAN880117 | Weights for beta-sheet at the window position of -3 | 0.598 |
| AURR980103 | Normalized positional residue frequency at helix termini N" | 0.593 |
| AURR980118 | Normalized positional residue frequency at helix termini C" | 0.583 |
| SUYM030101 | Linker propensity index | 0.573 |
The correlation coefficients among the four physicochemical properties on Pheme-75 dataset
| QIAN880117 | AURR980103 | AURR980118 | SUYM030101 | |
|---|---|---|---|---|
| - | - | - | - | |
| 0.020 | - | - | - | |
| -0.053 | 0.557 | - | - | |
| 0.107 | 0.104 | 0.286 | - |
Figure 3Performance comparison of different features using 5-fold cross validation on Pheme-75 dataset at varying window sizes.
Performance of different features on PHeme-75 dataset using 5-fold cross validation
| Feature | ACC(%) | SN (%) | SP (%) | PR(%) | MCC | F1 | AUC |
|---|---|---|---|---|---|---|---|
| PSSM | 66.3 | 65.3 | 25.4 | 0.272 | 0.374 | 0.762 | |
| PP | 62.9 | 63.4 | 62.7 | 21.4 | 0.184 | 0.319 | 0.681 |
| PSSM+PP | 67.8 | 72.1 | 67.2 | 26.2 | 0.279 | 0.381 | 0.767 |
| PSSMPP | 71.0 |
Performance comparison of different methods on the independent test set of PHeme-72
| Feature | ACC(%) | SN (%) | SP (%) | PR(%) | MCC | F1 | AUC |
|---|---|---|---|---|---|---|---|
| Binary | 67.9 | 61.8 | 68.9 | 24.9 | 0.225 | 0.355 | 0.718 |
| PSSM | 65.9 | 64.3 | 26.0 | 0.280 | 0.386 | 0.768 | |
| PSSMPP | 71.6 |