| Literature DB >> 24027753 |
Abstract
Knowing the submitochondrial location of a mitochondrial protein is an important step in understanding its function. We developed a new method for predicting protein submitochondrial locations by introducing a new concept: positional specific physicochemical properties. With the framework of general form pseudoamino acid compositions, our method used only about 100 features to represent protein sequences, which is much simpler than the existing methods. On the dataset of SubMito, our method achieved over 93% overall accuracy, with 98.60% for inner membrane, 93.90% for matrix, and 70.70% for outer membrane, which are comparable to all state-of-the-art methods. As our method can be used as a general method to upgrade all pseudoamino-acid-composition-based methods, it should be very useful in future studies. We implement our method as an online service: SubMito-PSPCP.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24027753 PMCID: PMC3763570 DOI: 10.1155/2013/263829
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Summary of the dataset.
| Submitochondrial locations | Number of proteins | |
|---|---|---|
| SML3-317 | SML3-983 | |
| Inner membrane | 131 | 661 |
| Outer membrane | 41 | 145 |
| Matrix | 145 | 177 |
|
| ||
| Total | 317 | 983 |
Physicochemical properties used in this method.
| AAIndex ID | Property description |
|---|---|
| BULH740101 | Transfer free energy to surface |
| EISD840101 | Consensus normalized hydrophobicity |
| HOPT810101 | Hydrophilicity value |
| RADA880108 | Mean polarity |
| ZIMJ680104 | Isoelectric point |
| MCMT640101 | Refractivity |
| BHAR880101 | Average flexibility indices |
| CHOC750101 | Average volume of buried residue |
| COSI940101 | Electron-ion interaction potential values |
Prediction performance on SML3-983 dataset.
| Submitochondrial location | ACC | MCC |
|---|---|---|
| Inner membrane | 95.46% | 0.77 |
| Outer membrane | 77.93% | 0.83 |
| Matrix | 74.01% | 0.73 |
| Overall | 89.01% |
Performance comparison on SML3-317 dataset.
| Methods | Inner membrane | Matrix | Outer membrane | Overall | |||
|---|---|---|---|---|---|---|---|
| ACC | MCC | ACC | MCC | ACC | MCC | ||
| SubMito [ | 85.50% | 0.79 | 94.50% | 0.77 | 51.20% | 0.64 | 85.20% |
| GPLoc [ | 83.20% | 0.80 | 97.20% | 0.85 | 78.10% | 0.77 | 89.00% |
| SubIdent [ | 91.60% | 0.86 | 97.30% | 0.79 | 82.90% | 0.88 | 93.10% |
| Predict_SubMito [ | 91.80% | 0.79 | 96.40% | 0.79 | 66.10% | 0.63 | 89.70% |
| MitoLoc [ | 97.70% | 0.94 | 99.00% | 0.93 | 68.30% | 0.81 | 94.70% |
| Fan and Li [ | 94.70% | 0.91 | 99.30% | 0.96 | 80.50% | 0.84 | 94.90% |
| TetraMito [ | 100.00% | 0.90 | 96.60% | 0.95 | 65.90% | 0.79 | 94.00% |
| This work | 98.60% | 0.92 | 93.90% | 0.89 | 70.70% | 0.79 | 93.10% |
Independent dataset test of the current method.
| Dataset | Average ACC | Standard deviation of ACC |
|---|---|---|
| SML3-317 | 90.24% | 3.27% |
| SML3-983 | 87.17% | 1.81% |
The values in this table are obtained by 20 times 20% independent dataset test.