| Literature DB >> 17134515 |
Abstract
BACKGROUND: Knowing the submitochondria localization of a mitochondria protein is an important step to understand its function. We develop a method which is based on an extended version of pseudo-amino acid composition to predict the protein localization within mitochondria. This work goes one step further than predicting protein subcellular location. We also try to predict the membrane protein type for mitochondrial inner membrane proteins.Entities:
Mesh:
Substances:
Year: 2006 PMID: 17134515 PMCID: PMC1716183 DOI: 10.1186/1471-2105-7-518
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
The leave one out cross validation result
| Label | Compartment | TP | TN | FP | FN | ACC | MCC |
| 1 | Inner membrane | 112 | 173 | 13 | 19 | 85.5% | 0.791 |
| 2 | Outer membrane | 21 | 273 | 3 | 20 | 51.2% | 0.636 |
| 3 | Matrix | 137 | 143 | 29 | 8 | 94.5% | 0.774 |
| Overall accuracy | 85.2% | ||||||
Prediction result on complete mitochondria proteome of Arabidopsis thaliana
| Locations | Number of sequence | Proportion |
| Inner membrane | 109 | 21% |
| Outer membrane | 64 | 13% |
| Matrix | 323 | 66% |
| Over all | 496 | 100% |
Prediction accuracy for different c
| Location | c = 1 | c = 2 | c = 3 | c = 4 |
| Inner membrane | 80.9% | 83.9% | 82.4% | |
| Outer membrane | 51.2% | 36.5% | 34.1% | |
| Matrix | 93.1% | 94.5% | 93.8% | |
| Over all | 82.4% | 83.0% | 81.1% | |
Prediction accuracy for different number of physicochemical properties
| Location | Using 9 properties | Using 2 properties |
| Inner membrane | 85.5% | |
| Outer membrane | 29.3% | |
| Matrix | 91.0% | |
| Over All | 81.1% | |
The distribution of data set
| Label | Compartment | Number of Sequence |
| 1 | Inner membrane | 131 |
| 2 | Outer membrane | 41 |
| 3 | Matrix | 145 |
| Total | 317 | |
The proteins localized at inner membrane are classified into 2 classes containing different membrane protein type. The "multi-pass membrane protein" has 101 sequences, and the "matrix side membrane protein" has 30 sequences.
The 9 physicochemical properties used in this work
| Properties description | Reference |
| Hydrophilicity value | Hopp-Woods (1981) |
| Mean polarity | Radzicka-Wolfenden (1988) |
| Isoelectric point | Zimmerman et al .(1968) |
| Refractivity | McMeekin et al. (1964) |
| Average flexibility indices | Bhaskaran-Ponnuswamy (1988) |
| Average volume of buried residue | Chothia (1975) |
| Electron-ion interaction potential values | Cosic (1994) |
| Transfer free energy to surface | Bull-Breese (1974) |
| Consensus normalized hydrophobicity | Eisenberg (1984) |
All the information in this table is derived from AAIndex database.
The classifiers parameters and accuracy
| Classifier | C | γ | Leave-one-out accuracy |
| inmem_otmem | 100 | 0.001 | 90.7% |
| inmem_matrx | 100 | 0.005 | 90.9% |
| matrx_otmem | 100 | 0.001 | 91.4% |
| mlps_mtrx | 100 | 0.007 | 92.4% |
The parameter C and γ are manually searched to get as high accuracy as possible. The "inmem" means inner membrane, "otmem" means outer membrane, "matrx" means matrix, "mlps" means multi-pass membrane and "mtrx" means the matrix side.