| Literature DB >> 15980444 |
Abstract
The receptors of amine subfamily are specifically major drug targets for therapy of nervous disorders and psychiatric diseases. The recognition of novel amine type of receptors and their cognate ligands is of paramount interest for pharmaceutical companies. In the past, Chou and co-workers have shown that different types of amine receptors are correlated with their amino acid composition and are predictable on its basis with considerable accuracy [Elrod and Chou (2002) Protein Eng., 15, 713-715]. This motivated us to develop a better method for the recognition of novel amine receptors and for their further classification. The method was developed on the basis of amino acid composition and dipeptide composition of proteins using support vector machine. The method was trained and tested on 167 proteins of amine subfamily of G-protein-coupled receptors (GPCRs). The method discriminated amine subfamily of GPCRs from globular proteins with Matthew's correlation coefficient of 0.98 and 0.99 using amino acid composition and dipeptide composition, respectively. In classifying different types of amine receptors using amino acid composition and dipeptide composition, the method achieved an accuracy of 89.8 and 96.4%, respectively. The performance of the method was evaluated using 5-fold cross-validation. The dipeptide composition based method predicted 67.6% of protein sequences with an accuracy of 100% with a reliability index > or =5. A web server GPCRsclass has been developed for predicting amine-binding receptors from its amino acid sequence [http://www.imtech.res.in/raghava/gpcrsclass/ and http://bioinformatics.uams.edu/raghava/gpersclass/ (mirror site)].Entities:
Mesh:
Substances:
Year: 2005 PMID: 15980444 PMCID: PMC1160112 DOI: 10.1093/nar/gki351
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
The performance of amino acid composition and dipeptide composition based method in recognizing the GPCRs at different thresholds
| Threshold | Amino acid composition | Dipeptide composition | ||||||
|---|---|---|---|---|---|---|---|---|
| Sen | Spe | Acc | MCC | Sen | Spe | Acc | MCC | |
| −0.4 | 100 | 81.2 | 90.6 | 0.83 | 100 | 93.3 | 96.6 | 0.93 |
| −0.2 | 100 | 86.7 | 93.2 | 0.88 | 99.4 | 99.4 | 99.4 | 0.98 |
| 0.0 | 99.4 | 96.9 | 98.2 | 0.96 | 99.4 | 100 | 99.7 | 0.99 |
| 0.2 | 97 | 99.4 | 98.2 | 0.96 | 82.0 | 100 | 91.0 | 0.83 |
| 0.4 | 95.2 | 100 | 97.6 | 0.95 | 52.1 | 100 | 76.1 | 0.59 |
Sen, sensitivity; Spe, specificity; Acc, accuracy.
The performance of amino acid and dipeptide composition based method using different SVM kernels
| Amine receptors | Amino acid composition based method | Dipeptide composition based method | ||||||
|---|---|---|---|---|---|---|---|---|
| REF kernel (γ = 500 and | Polynomial kernel ( | RBF kernel (γ = 100 and | Polynomial kernel ( | |||||
| ACC | MCC | ACC | MCC | ACC | MCC | ACC | MCC | |
| Acteylcholine | 87.1 | 0.92 | 90.3 | 0.90 | 93.6 | 0.96 | 93.6 | 0.96 |
| Adrenoceptor | 95.5 | 0.88 | 86.3 | 0.76 | 100 | 0.93 | 100 | 0.91 |
| Dopamine | 92.1 | 0.82 | 84.2 | 0.74 | 92.1 | 0.95 | 92.0 | 0.93 |
| Serotonin | 85.3 | 0.85 | 83.3 | 0.85 | 98.2 | 0.97 | 94.4 | 0.95 |
| Overall | 89.8 | 0.86 | 85.6 | 0.81 | 96.4 | 0.95 | 95.1 | 0.93 |
ACC, accuracy.
Figure 1Expected accuracy of SVM classifier with a Reliability Index (RI) equal to a given value. The fraction of sequences that is predicted at a given RI is also shown on x-axis. (a) Amino acid composition. (b) Dipeptide composition.
Figure 2(a) Snapshot of input page of GPCRsclass server. (b) Snapshot of results obtained after the analysis of submission shown in Figure 1a.