| Literature DB >> 24963340 |
Yajun Sheng1, Xingye Qiu1, Chen Zhang1, Jun Xu1, Yanping Zhang1, Wei Zheng1, Ke Chen2.
Abstract
The protein quaternary structure is very important to the biological process. Predicting their attributes is an essential task in computational biology for the advancement of the proteomics. However, the existing methods did not consider sufficient properties of amino acid. To end this, we proposed a hybrid method Quad-PRE to predict protein quaternary structure attributes using the properties of amino acid, predicted secondary structure, predicted relative solvent accessibility, and position-specific scoring matrix profiles and motifs. Empirical evaluation on independent dataset shows that Quad-PRE achieved higher overall accuracy 81.7%, especially higher accuracy 92.8%, 93.3%, and 90.6% on discrimination for trimer, hexamer, and octamer, respectively. Our model also reveals that six features sets are all important to the prediction, and a hybrid method is an optimal strategy by now. The results indicate that the proposed method can classify protein quaternary structure attributes effectively.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24963340 PMCID: PMC4052169 DOI: 10.1155/2014/715494
Source DB: PubMed Journal: Comput Math Methods Med ISSN: 1748-670X Impact factor: 2.238
The numbers of monomer, dimmer, trimer, tetramer, hexamer, and octamer in our benchmark dataset.
| Total | Monomer | Dimer | Trimer | Tetramer | Hexamer | Octamer |
|---|---|---|---|---|---|---|
| 1040 | 366 | 338 | 53 | 155 | 67 | 61 |
Summary of the considered features, where y denotes one of the three secondary structure states and x denotes one of the 20 common AAs.
| Feature sets | Description |
|---|---|
| Sequence-based (79) | Sequence length (1) |
|
| |
| PSSM-based (203) | From the PSSM matrix |
|
| |
| Secondary structure (217) | Based on the features utilized in the PSI-Pred method (90) |
|
| |
| Average RSA based (23) | Average RSA of the residues with AA type |
|
| |
| Average isoelectric point (1) |
|
|
| |
| Auto-correlation functions based on FH |
|
|
| |
| Auto-correlation functions based on cumulative FH |
|
|
| |
| Sum of hydrophobicities based on FH |
|
|
| |
| R groups (5) | RG |
|
| |
| Electronic groups (5) | EG |
|
| |
| Blast based (30) | Refer to subsection “ |
|
| |
| GLAM2-based (30) | Refer to subsection “ |
|
| |
| GIBBS-based (6) | Refer to subsection “ |
Figure 1The diagram of Quad-PRE.
Predicted results with C = 0.1 and gamma = 0.01.
| Monomer | Dimer | Trimer | Tetramer | Hexamer | Octamer | Average | |
|---|---|---|---|---|---|---|---|
| ACC | 63.0% | 63.8% |
| 87.0% |
|
|
|
| TPR | 77.9% | 30.8% | 24.5% | 18.1% | 23.9% | 39.3% | 35.7% |
| SPC | 54.9% | 79.8% |
|
|
|
|
|
| PPV | 48.4% | 42.3% | 27.1% | 75.7% | 45.8% | 28.2% | 44.6% |
| MCC | 0.316 | 0.116 | 0.220 | 0.328 | 0.299 | 0.284 | 0.260 |
| AUC |
| 0.582 |
|
|
|
|
|
Figure 2The ROC curves of six classes.
Comparison with Garian's method.
| ACC | TPR | SPC | PPV | MCC | AUC | |
|---|---|---|---|---|---|---|
| Quad-PRE |
| 30.8% |
|
|
|
|
| QSE | 46.2% | 73.8% | 32.6% | 34.7% | 0.065 | 0.522 |
Figure 3The ROC curves comparison Quad-PRE with Garian's QSE.
Comparison with results are generated by different feature groups.
| ART_1 | ART_2 | ART_3 | BLAST | GLAM2 | GIBBS | Total | |
|---|---|---|---|---|---|---|---|
| ave-ACC |
| 38.5% | 39.9% | 34.9% | 23.7% | 30.6% |
|
| ave-TPR | 23.9% | 21.8% | 23.2% |
| 22.7% | 15.3% |
|
| ave-SPC |
| 84.7% | 85.1% | 85.3% | 84.5% | 82.6% |
|
| ave-PPV |
| 27.4% | 28.9% | 35.2% | 20.5% | 10.2% |
|
| ave-MCC | 0.153 | 0.090 | 0.111 |
| 0.051 | −0.024 |
|
| ave-AUC |
| 0.662 | 0.661 | 0.660 | 0.573 | 0.510 |
|
Figure 4Comparison with the ROC curves of different classes for different feature groups.