| Literature DB >> 16689693 |
Yun Fei Wang1, Huan Chen, Yan Hong Zhou.
Abstract
A computational system for the prediction and classification of human G-protein coupled receptors (GPCRs) has been developed based on the support vector machine (SVM) method and protein sequence information. The feature vectors used to develop the SVM prediction models consist of statistically significant features selected from single amino acid, dipeptide, and tripeptide compositions of protein sequences. Furthermore, the length distribution difference between GPCRs and non-GPCRs has also been exploited to improve the prediction performance. The testing results with annotated human protein sequences demonstrate that this system can get good performance for both prediction and classification of human GPCRs.Entities:
Mesh:
Substances:
Year: 2005 PMID: 16689693 PMCID: PMC5173243 DOI: 10.1016/s1672-0229(05)03034-2
Source DB: PubMed Journal: Genomics Proteomics Bioinformatics ISSN: 1672-0229 Impact factor: 7.691
Fig. 1The length distributions of human GPCRs (A) and non-GPCRs (B).
Fig. 2The results of discriminating 653 human GPCRs from 10,845 non-GPCRs.
Performance Comparison of Human GPCR Classification
| Family | Our method | Bhasin and Raghava’s method | ||||
|---|---|---|---|---|---|---|
| Acc | Sp | Sn | Acc | Sp | Sn | |
| A | 0.995 | 0.992 | 1 | 0.995 | 0.992 | 1 |
| B | 0.989 | 1 | 0.967 | 0.966 | 1 | 0.900 |
| C | 1 | 1 | 1 | 0.897 | 1 | 0.400 |
| fz_smo | 0.979 | 1 | 0.917 | 0.938 | 1 | 0.750 |
Fig. 3System flowchart for GPCR prediction and classification.
Fig. 4The μ value distribution of single amino acid composition features.
Fig. 5The μ value distribution of dipeptide composition features.