| Literature DB >> 19309509 |
Kirsti Laurila1, Mauno Vihinen.
Abstract
BACKGROUND: Eukaryotic cells contain numerous compartments, which have different protein constituents. Proteins are typically directed to compartments by short peptide sequences that act as targeting signals. Translocation to the proper compartment allows a protein to form the necessary interactions with its partners and take part in biological networks such as signalling and metabolic pathways. If a protein is not transported to the correct intracellular compartment either the reaction performed or information carried by the protein does not reach the proper site, causing either inactivation of central reactions or misregulation of signalling cascades, or the mislocalized active protein has harmful effects by acting in the wrong place.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19309509 PMCID: PMC2680896 DOI: 10.1186/1471-2164-10-122
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Prediction results for the localization of wild type proteins with the individual predictorsa
| Compartment | tp | rp | tn | fn | Accuracy | Precision | Recall | MCC | Proteins located |
| TargetP (mitochodrial) | 101 | 109 | 1254 | 52 | 0.894 | 0.481 | 0.660 | 0.506 | 210 |
| TMHMM | 377 | 144 | 853 | 142 | 0.811 | 0.724 | 0.726 | 0.581 | 521 |
| Golgipredictor | 23 | 178 | 227 | 20 | 0.558 | 0.114 | 0.535 | 0.056 | 201 |
| PeroxiP | 6 | 24 | 423 | 12 | 0.923 | 0.200 | 0.333 | 0.220 | 30 |
| PTS1 | 3 | 1 | 446 | 15 | 0.966 | 0.750 | 0.167 | 0.343 | 4 |
| PredictNLS | 98 | 23 | 248 | 217 | 0.590 | 0.810 | 0.311 | 0.279 | 121 |
| Summary | 0.790b | 0.513b | 0.455b | 0.331b | 1087 |
atp, the number of positive cases that were correctly predicted; tn, the number of negative cases correctly predicted; fp, the number of positive cases incorrectly predicted; fn, the number of negative cases incorrectly predicted.
baverage value.
Prediction results for the localization of wild type proteins with the Scandinavian protocol.
| Compartmenta | tp | fp | tn | fn | Accuracy | Precision | Recall | MCC | Proteins located |
| Mtm | 10 | 29 | 1337 | 140 | 0.889 | 0.256 | 0.067 | 0.086 | 39 |
| Mps | 0 | 6 | 1363 | 147 | 0.899 | 0.000 | 0.000 | -0.021 | 6 |
| Mma | 91 | 74 | 1293 | 58 | 0.913 | 0.552 | 0.611 | 0.532 | 165 |
| Gtm | 23 | 178 | 1240 | 75 | 0.833 | 0.114 | 0.235 | 0.079 | 201 |
| PM | 221 | 35 | 619 | 641 | 0.554 | 0.863 | 0.256 | 0.268 | 256 |
| S | 246 | 84 | 1093 | 93 | 0.883 | 0.745 | 0.726 | 0.661 | 330 |
| ER | 2 | 1 | 1330 | 183 | 0.879 | 0.667 | 0.011 | 0.074 | 3 |
| N | 98 | 23 | 1097 | 298 | 0.788 | 0.810 | 0.247 | 0.368 | 121 |
| P | 6 | 24 | 1462 | 24 | 0.968 | 0.200 | 0.200 | 0.184 | 30 |
| C | 269 | 166 | 852 | 229 | 0.739 | 0.618 | 0.540 | 0.392 | 435 |
| Summary | 966b | 620b | 1888b | 0.835c | 0.483c | 0.289c | 0.262c | 1586b |
Abbreviations for statistical parameters as in Table 1.
a C, cytosol; Gtm, Golgi, transmembrane; Mma, mitochondrial matrix; Mps, mitochondrial, periplasmic space; Mtm, mitochondrial transmembrane; N, nuclear; P, peroxisomal; PM, plasma membrane; S, secreted
btotal number
caverage value.
Prediction results for the localization of wild type proteins with WoLF PSORT.
| Mutant compartment | |||||||||
| Wild type compartmenta | tp | fp | tn | fn | Accuracy | Precision | Recall | MCC | Proteins located |
| CK | 2 | 38 | 1471 | 5 | 0.972 | 0.050 | 0.286 | 0.110 | 40 |
| CK_PM | 0 | 31 | 1485 | 31 | 0.960 | 0.000 | 0.000 | -0.020 | 31 |
| C | 339 | 304 | 713 | 160 | 0.694 | 0.527 | 0.679 | 0.362 | 643 |
| C_G | 0 | 29 | 1450 | 37 | 0.956 | 0.000 | 0.000 | -0.022 | 29 |
| C_M | 8 | 145 | 1328 | 35 | 0.881 | 0.052 | 0.186 | 0.048 | 153 |
| C_N | 102 | 365 | 963 | 86 | 0.703 | 0.218 | 0.543 | 0.191 | 467 |
| C_P | 3 | 77 | 1425 | 11 | 0.942 | 0.038 | 0.214 | 0.070 | 80 |
| ER | 85 | 253 | 1081 | 97 | 0.769 | 0.251 | 0.467 | 0.217 | 338 |
| ER_G | 4 | 70 | 1392 | 50 | 0.921 | 0.054 | 0.074 | 0.023 | 74 |
| ER_M | 2 | 92 | 1413 | 9 | 0.933 | 0.021 | 0.182 | 0.042 | 94 |
| S | 281 | 321 | 856 | 58 | 0.750 | 0.467 | 0.829 | 0.474 | 602 |
| S_PM | 4 | 148 | 1358 | 6 | 0.898 | 0.026 | 0.400 | 0.081 | 152 |
| G | 13 | 74 | 1344 | 85 | 0.895 | 0.149 | 0.133 | 0.085 | 87 |
| L | 33 | 202 | 1258 | 23 | 0.852 | 0.140 | 0.589 | 0.235 | 235 |
| M | 128 | 325 | 1038 | 25 | 0.769 | 0.283 | 0.837 | 0.394 | 453 |
| M_N | 0 | 33 | 1459 | 24 | 0.962 | 0.000 | 0.000 | -0.019 | 33 |
| M_P | 1 | 112 | 1397 | 6 | 0.922 | 0.009 | 0.143 | 0.018 | 113 |
| N | 325 | 330 | 790 | 71 | 0.735 | 0.496 | 0.821 | 0.467 | 655 |
| P | 12 | 302 | 1184 | 18 | 0.789 | 0.038 | 0.400 | 0.068 | 314 |
| PM | 356 | 185 | 841 | 134 | 0.790 | 0.658 | 0.727 | 0.533 | 542 |
| Summary | 1696b | 3398b | 966b | 0.854c | 0.174c | 0.375c | 0.168c | 5095b | |
Abbreviations for statistical parameters as in Table 1.
aC, cytosol; CK, cytoskeleton; G, Golgi compartment; M, mitochondrial; N, nuclear; P, peroxisomal; PM, plasma membrane; S, secreted
btotal number. Underline sign indicates multiple predictions.
caverage value
Changes in SP localization prediction due to mutations.
| Mutant compartment | ||||||||||||||||
| Wild type compartmenta | Gtm | Mtm/Gtm | Mma/C | Mma | Mma/PM | Mps/S | C | Mma/P | N | PM | Mtm | S | Mtm/PM | Mma/N | P | Total |
| Mtm/Gtm | 0/3 | 1/1 | 1/4 | |||||||||||||
| Mtm/PM | 0/6 | 0/9 | 1/1 | 1/16 | ||||||||||||
| PM | 17/47 | 0/1 | 4/5 | 21/53 | ||||||||||||
| S | 0/4 | 7/8 | 7/12 | |||||||||||||
| C | 0/1 | 8/8 | 19/20 | 1/1 | 1/1 | 0/2 | 29/33 | |||||||||
| Mtm | 0/3 | 1/1 | 1/4 | |||||||||||||
| Mma/Gtm | 0/2 | 0/2 | ||||||||||||||
| Gtm | 0/5 | 17/21 | 0/1 | 2/4 | 19/31 | |||||||||||
| Mtm/C | 0/1 | 1/4 | 1/5 | |||||||||||||
| Mps/C | 1/1 | 1/1 | ||||||||||||||
| Mma | 6/6 | 1/1 | 2/2 | 1/1 | 10/10 | |||||||||||
| Mma/S | 2/2 | 1/1 | 1/1 | 4/4 | ||||||||||||
| Mma/C | 4/4 | 4/4 | ||||||||||||||
| Mps | 2/2 | 2/2 | ||||||||||||||
| Mma/N | 1/1 | 1/1 | 2/2 | |||||||||||||
| P | 1/1 | 1/1 | ||||||||||||||
| N | 0/18 | 1/1 | 1/19 | |||||||||||||
| Total | 17/61 | 0/19 | 17/18 | 3/6 | 2/2 | 4/4 | 7/8 | 1/1 | 21/22 | 25/48 | 0/1 | 3/5 | 4/5 | 1/1 | 0/2 | 105/203 |
The numbers separated by the slash sign are for how many proteins the wild type localization has been correctly predicted, and the number of analyzed mutations, respectively.
aC, cytosol; Gtm, Golgi, transmembrane; Mma, mitochondrial matrix; Mps, mitochondrial, periplasmic space; Mtm, mitochondrial transmembrane; N, nuclear; P, peroxisomal; PM, plasma membrane; S, secreted. Slash sign indicates alternative predicted localizations.
Changes in WoLF PSORT localization prediction due to mutations.
| Mutant compartment | ||||||||||||||||||
| Wild type compartmenta | PM | S | S_PM | C | C_N | C_M | N | N/C_N | N/C/ | ER | ER_ | M | M_N | M_N/ | P | L | CK | Total |
| P | 2/2 | 0/1 | 1/6 | 2/2 | 0/1 | 0/6 | 1/1 | 6/19 | ||||||||||
| S | 0/1 | 0/4 | 0/1 | 4/5 | 1/1 | 4/9 | 0/2 | 9/23 | ||||||||||
| S/S_PM | 0/1 | 0/1 | ||||||||||||||||
| C | 0/6 | 2/3 | 3/9 | 3/3 | 5/6 | 0/1 | 3/6 | 2/2 | 18/36 | |||||||||
| C/C_N | 1/1 | 1/1 | ||||||||||||||||
| C_N | 0/2 | 18/19 | 4/8 | 2/2 | 2/2 | 3/3 | 29/36 | |||||||||||
| C_M | 1/1 | 1/1 | ||||||||||||||||
| N | 0/1 | 0/10 | 6/10 | 7/7 | 2/4 | 15/32 | ||||||||||||
| G/ER_G | 0/1 | 0/1 | ||||||||||||||||
| ER | 0/11 | 0/1 | 1/1 | 0/1 | 2/2 | 3/16 | ||||||||||||
| M | 0/2 | 3/3 | 6/6 | 0/2 | 0/2 | 9/15 | ||||||||||||
| ER_M | 1/1 | 1/1 | ||||||||||||||||
| M/P/M_P | 0/1 | 0/1 | ||||||||||||||||
| Total | 0/21 | 7/21 | 0/1 | 37/53 | 7/11 | 3/3 | 12/20 | 0/3 | 0/1 | 4/7 | 1/1 | 11/28 | 2/2 | 3/3 | 3/3 | 0/3 | 2/2 | 92/183 |
The numbers separated by the slash sign are for how many proteins the wild type localization have been correctly predicted, and the number of analyzed mutations, respectively.
aC, cytosol; CK, cytoskeleton; G, Golgi compartment; M, mitochondrial; N, nuclear; P, peroxisomal; PM, plasma membrane; S, secreted. Slash sign indicates alternative localization predictions and underline sign multiple predicted localizations.
Figure 1Amino acid distribution for the two prediction schemes. Scandinavian protocol (A, B) and WoLF PSORT (C, D) predictions of (A, C) amino acids in wild type proteins that are predicted to be mutated in localization mutants, and (B, D) mutant amino acids in localization mutants.
Figure 2Schematic illustration of the analysis of protein localization with the Scandinavian protocol. The predicted compartments are indicated with corresponding numbers in black circles. The localizations are listed in the middle of the figure. RC, reliability coefficient; SP, signal peptide; TM, transmembrane region.