Zhanchao Li, Xuan Zhou, Zong Dai, Xiaoyong Zou.
Abstract
BACKGROUND: Because a priori knowledge of the function of G protein-coupled receptors (GPCRs) provides useful information for pharmaceutical research, determining their function is an important topic in protein science. However, with the rapid increase in GPCR sequences entering databanks, the gap between the number of known sequences and the number of known functions is widening rapidly, and determining function by experimental techniques alone is both time-consuming and expensive. It is therefore important to develop a computational method for the quick and accurate classification of GPCRs.
Year: 2010 PMID: 20550715 PMCID: PMC2905366 DOI: 10.1186/1471-2105-11-325
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1. A random sequence of 40 residues, used as an example to illustrate the derivation of the feature vector.
Figure 2. Flowchart of the current method.
Figure 3. The relationship between the accuracy and the number of features.
Figure 4. The relationship between the number of features and the number of generations.
Figure 5. Fitness values and overall accuracy based on the fittest member of each generation.
Figure 6. Composition of the optimized feature subset.
Comparison of different methods by the jackknife test at superfamily level
| Method | | | | MCC |
|---|---|---|---|---|
| GPCR-CA [ | 91.46 | 92.33 | 90.96 | N/A |
| GPCR-SVMFS | 97.81 | 97.04 | 98.61 | 0.9563 |
Figure 7. Comparison of different methods by the jackknife test at family level.
Success rates obtained with the GPCR-SVMFS predictor by the jackknife test at subfamily level
| GPCR subfamily | Number of proteins | Number of correct predictions | Success rate (%) |
|---|---|---|---|
| Amine | 46 | 43 | 93.48 |
| Peptide | 72 | 71 | 98.61 |
| Rhodopsin | 17 | 15 | 88.24 |
| Olfactory | 19 | 19 | 100.0 |
| Nucleotide | 13 | 10 | 76.92 |
| Other | 34 | 32 | 94.12 |
| Overall | 201 | 190 | 94.53 |
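The percentages in the table above follow directly from the counts: each value is the number of correct predictions divided by the number of proteins, and the overall figure pools the counts across subfamilies rather than averaging the per-subfamily rates. The short Python sketch below recomputes them; the function and variable names are illustrative and not taken from the paper.

```python
# Counts copied from the subfamily-level jackknife table above:
# subfamily -> (number of proteins, number of correct predictions)
counts = {
    "Amine": (46, 43),
    "Peptide": (72, 71),
    "Rhodopsin": (17, 15),
    "Olfactory": (19, 19),
    "Nucleotide": (13, 10),
    "Other": (34, 32),
}


def success_rate(total, correct):
    """Return the percentage of correctly predicted proteins."""
    return 100.0 * correct / total


# Per-subfamily rates, rounded to two decimals as in the table.
per_class = {name: round(success_rate(n, c), 2) for name, (n, c) in counts.items()}

# The overall rate pools all proteins before dividing.
total_proteins = sum(n for n, _ in counts.values())
total_correct = sum(c for _, c in counts.values())
overall = round(success_rate(total_proteins, total_correct), 2)

for name, rate in per_class.items():
    print(f"{name:<12}{rate:6.2f}")
print(f"{'Overall':<12}{overall:6.2f}")
```

Because the overall rate is pooled, large subfamilies such as Peptide weigh more heavily than small ones such as Nucleotide.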
The performance of GPCR-SVMFS and GPCRPred at superfamily level
| Method | | | | MCC |
|---|---|---|---|---|
| GPCRPred [ | 99.50 | 98.60 | 99.80 | 0.9900 |
| GPCR-SVMFSa | 100.0 | 100.0 | 100.0 | 1.0000 |
a To be consistent with the evaluation method of GPCRPred, 5-fold cross-validation was used.
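The MCC column reports the Matthews correlation coefficient, which summarizes a binary confusion matrix as a single value between -1 and 1. As a minimal sketch of the standard definition (the confusion counts below are hypothetical, chosen only for illustration and not taken from the paper), a classifier with no errors yields an MCC of exactly 1, which matches the 1.0000 entry that accompanies three 100.0 values in the table above.

```python
import math


def mcc(tp, tn, fp, fn):
    """Matthews correlation coefficient from binary confusion counts."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        # Common convention when a row or column of the matrix is empty.
        return 0.0
    return (tp * tn - fp * fn) / denom


# Hypothetical counts, for illustration only.
perfect = mcc(tp=50, tn=50, fp=0, fn=0)      # no misclassifications
imperfect = mcc(tp=45, tn=40, fp=10, fn=5)   # a few errors of each kind

print(f"perfect classifier:   {perfect:.4f}")
print(f"imperfect classifier: {imperfect:.4f}")
```

Unlike plain accuracy, the MCC uses all four cells of the confusion matrix, so it stays informative when the positive and negative classes differ in size.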
The performance of GPCR-SVMFS and GPCRPred at family level
| Method | Class A | Class B | Class C | Class D | Class E | Overall |
|---|---|---|---|---|---|---|
| GPCRPred [ | 98.10 | 85.70 | 81.30 | 36.40 | 100.0 | 97.30 |
| GPCR-SVMFSa | 100.0 | 100.0 | 100.0 | 81.82 | 100.0 | 99.74 |
a To be consistent with the evaluation method of GPCRPred, 2-fold cross-validation was used.
The performance of GPCR-SVMFS and GPCRPred at subfamily level
| Class A subfamilies | Number of proteins | GPCRPred | GPCR-SVMFSa |
|---|---|---|---|
| Amine | 221 | 99.10 | 100.0 |
| Peptide | 381 | 99.70 | 99.21 |
| Hormone | 25 | 100.0 | 100.0 |
| Rhodopsin | 183 | 98.90 | 99.45 |
| Olfactory | 87 | 100.0 | 100.0 |
| Prostanoid | 38 | 100.0 | 100.0 |
| Nucleotide | 48 | 85.40 | 93.75 |
| Cannabis | 11 | 100.0 | 90.91 |
| Platelet activating factor | 4 | 100.0 | 100.0 |
| Gonadotrophin releasing hormone | 10 | 100.0 | 100.0 |
| Thyrotropin releasing hormone | 7 | 85.70 | 85.71 |
| Melatonin | 13 | 100.0 | 100.0 |
| Viral | 17 | 33.30 | 76.47 |
| Lysosphingolipids | 9 | 58.80 | 100.0 |
| Overall | 1054 | 97.30 | 98.77 |
a To be consistent with the evaluation method of GPCRPred, 2-fold cross-validation was used.
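The tables use two evaluation schemes: k-fold cross-validation (5-fold and 2-fold in the footnotes) and the jackknife test, which is leave-one-out cross-validation, i.e. k-fold with k equal to the number of samples. The sketch below shows only how the sample indices would be partitioned under each scheme. It is an illustration under simple assumptions (contiguous folds, no shuffling or stratification), not the authors' procedure, and the helper names are hypothetical.

```python
def k_fold_indices(n_samples, k):
    """Yield (train, test) index lists for k contiguous folds.

    Fold sizes differ by at most one. No shuffling is done here; real
    studies usually shuffle or stratify the samples first.
    """
    if not 1 < k <= n_samples:
        raise ValueError("k must satisfy 1 < k <= n_samples")
    base, extra = divmod(n_samples, k)
    start = 0
    for fold in range(k):
        size = base + (1 if fold < extra else 0)
        test = list(range(start, start + size))
        train = list(range(0, start)) + list(range(start + size, n_samples))
        yield train, test
        start += size


def jackknife_indices(n_samples):
    """Leave-one-out: every sample serves as the test set exactly once."""
    return k_fold_indices(n_samples, n_samples)


two_fold = list(k_fold_indices(10, 2))
five_fold = list(k_fold_indices(10, 5))
loo = list(jackknife_indices(10))

for name, splits in [("2-fold", two_fold), ("5-fold", five_fold), ("jackknife", loo)]:
    print(name, "test-set sizes:", [len(test) for _, test in splits])
```

With fewer folds each model is trained on less data, so results from 2-fold, 5-fold and jackknife evaluation are not directly interchangeable, which is why the footnotes match the scheme to the one used by GPCRPred.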
The prediction power of GPCR-SVMFS on an independent dataset at family level
| GPCR family | Number of proteins | Number of correct predictions | Success rate (%) |
|---|---|---|---|
| Rhodopsin-like | 20290 | 19510 | 96.16 |
| Metabotropic | 1194 | 1024 | 85.76 |
| Secretin-like | 1484 | 1017 | 68.53 |
| Overall | 22972 | 21551 | 93.81 |
The prediction power of GPCR-SVMFS on an independent dataset at subfamily level
| GPCR subfamily | Number of proteins | Number of correct predictions | Success rate (%) |
|---|---|---|---|
| Amine | 1840 | 1476 | 80.22 |
| Peptide | 4169 | 3090 | 74.12 |
| Rhodopsin | 1376 | 1208 | 87.79 |
| Olfactory | 9977 | 9075 | 90.96 |
| Nucleotide | 576 | 315 | 54.69 |
| Overall | 17938 | 15164 | 84.54 |