| Literature DB >> 17880712 |
Ganesan Pugalenthi1, Ke Tang, P N Suganthan, G Archunan, R Sowdhamini.
Abstract
BACKGROUND: Odorant binding proteins (OBPs) are believed to shuttle odorants from the environment to the underlying odorant receptors, for which they could potentially serve as odorant presenters. Although several sequence based search methods have been exploited for protein family prediction, less effort has been devoted to the prediction of OBPs from sequence data and this area is more challenging due to poor sequence identity between these proteins.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17880712 PMCID: PMC2216042 DOI: 10.1186/1471-2105-8-351
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Confusion matrix for RLSC on the training dataset
| Positive | Negative | |
| Positive | 451 | 25 |
| Negative | 35 | 2122 |
Classification results achieved on different feature subsets. The optimal values of σ and λ are also given.
| Features | LOOA | BLOOA | TP rates | TN rates | MCC | ||
| 1463 | 2.614e-5 | 1e-009 | 0.977 | 0.965 | 0.945 | 0.984 | 0.922 |
| 450 | 4.714e-5 | 1e-009 | 0.975 | 0.963 | 0.945 | 0.981 | 0.915 |
| 250 | 6.325e-5 | 1e-008 | 0.970 | 0.961 | 0.948 | 0.975 | 0.901 |
| 100 | 1e-4 | 1e-008 | 0.970 | 0.962 | 0.950 | 0.975 | 0.903 |
| 50 | 1.414e-4 | 1e-008 | 0.967 | 0.958 | 0.945 | 0.971 | 0.891 |
LOOA – Leave-one-out accuracy (LOOA); BLOOA – Balanced LOOA MCC – Matthews Correlation Coefficient; σ – Kernel-parameter λ – Regularization parameter; TN – True negative; TP-True positive
Prediction result of 414 odorant binding proteins by RLSC, PSI-BLAST and HMM methods
| Method | Correctly predicted as odorant binding proteins | Incorrectly predicted as non odorant binding proteins | Classification accuracy |
| RLSC | 402 | 12 | 97.1% |
| PSI-BLAST | 369 | 45 | 89.1% |
| HMM | 360 | 54 | 86.9% |
Amino acid groupings (11 groups) according to their physical and chemical properties
| Hydrophobic (hb) | F, I, W, L, V, M, Y, C, A |
| Hydrophilic (hp) | R, K, N, D, E, P |
| Charged (Ch) | R, H, K, D, E |
| Neutral (Neu) | T, H, G, S, Q |
| Aliphatic (Ali) | I, L, V |
| Aromatic (Aro) | F, W, Y, H |
| Polar (Pol) | N, Q, R, E, D |
| Nonpolar (Npol) | F, M, I, L, V |
| Polar-Nonpolar (PN) | C, K, H, Y, W |
| Small (Sm) | P, V, A, G, T, S, N, D |
| Cysteine (cys) | C |
Figure 1Description of the feature selection method. Redundant features are sequentially removed until the number of remaining features reaches a pre-defined number.