| Literature DB >> 15163352 |
Qinghua Cui1, Tianzi Jiang, Bing Liu, Songde Ma.
Abstract
BACKGROUND: Subcellular localization of a new protein sequence is very important and fruitful for understanding its function. As the number of new genomes has dramatically increased over recent years, a reliable and efficient system to predict protein subcellular location is urgently needed.Entities:
Mesh:
Substances:
Year: 2004 PMID: 15163352 PMCID: PMC420457 DOI: 10.1186/1471-2105-5-66
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Prediction accuracies of traditional subcellular localization for prokaryotic sequences with RBF kernel function
| Location | Accuracy (%) (Self-consistency test) | Accuracy (%) (Jackknife test) |
| Extracellular | 100 | 75.7 (75.7) |
| Periplasmic | 100 | 81.2 (78.7) |
| Cytoplasmic | 100 | 99 (97.5) |
| Total accruacy | 100 | 92.9 (91.4) |
Prediction accuracies of traditional subcellular localization for eukaryotic sequences with RBF kernel function
| Location | Accuracy (%) (Self-consistency test) | Accuracy (%) (Jackknife test) |
| Extracellular | 100 | 86.5 |
| Mitochondrial | 100 | 67.6 |
| Cytoplasmic | 100 | 80 |
| Nuclear | 100 | 91.2 |
| Total accruacy | 100 | 84.14 |
Prediction accuracies of Esub8 with RBF kernel function
| Location | Accuracy (%) (Self-consistency test) | Accuracy (%) (Jackknife test) |
| Chloroplast | 100 | 89.9 |
| Cytoplasm | 100 | 86.2 |
| Extracellular | 100 | 81.5 |
| Golgi apparatus | 100 | 68.2 |
| Lysosome | 100 | 85.0 |
| Mitochondria | 100 | 72.0 |
| Nucleus | 100 | 92.2 |
| Peroxisome | 100 | 72.6 |
| Total accruacy | 100 | 87 |
Comparing the total accuracies with other 6 methods. From 1 to 7, the methods are our method, Reinhardt and Hubbard's method, Chou and Elrod's method, Yuan's method, Hua and Sun's method, Feng and Zhang's method 1 and method 2.
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | |
| P. S. | 100 | - | 90 | - | - | 97.7 | - |
| P. J. | 92.9 | 81 | 87 | 89.1 | 91.4 | 90.4 | 89.6 |
| E. S. | 100 | - | - | - | - | - | - |
| E. J. | 84.14 | 66 | - | 73.0 | 79.4 | - | - |
P. S. denotes prokaryotic sequences prediction accuracies by self-consistency test, and then P. J., prokaryotic sequences prediction accuracies by jackknife test, E. S., eukaryotic sequences prediction accuracies by self-consistency test, E. J. eukaryotic sequences prediction accuracies by jackknife test. En dash denotes there is no result by the corresponding method.
The dataset used in Esub8.
| Subcellular localization | Number of sequences |
| Chloroplast | 1019 |
| Cytoplasm | 2088 |
| Extracellular | 595 |
| Golgi apparatus | 211 |
| Lysosome | 133 |
| Mitochondria | 644 |
| Nucleus | 3199 |
| Peroxisome | 116 |
The final sequences in each location class of the dataset
| Species | Subcellular localization | Number of sequences |
| Prokaryotic | Extracellular | 107 |
| Periplasmic | 202 | |
| Cytoplasmic | 688 | |
| Eukaryotic | Extracellular | 325 |
| Mitochondrial | 321 | |
| Cytoplasmic | 684 | |
| Nuclear | 1097 |