| Literature DB >> 17597904 |
Paul D Taylor1, Teresa K Attwood, Darren R Flower.
Abstract
Two algorithms, based onBayesian Networks (BNs), for bacterial subcellular location prediction, are explored in this paper: one predicts all locations for Gram+ bacteria and the other all locations for Gram- bacteria. Methods were evaluated using different numbers of residues (from the N-terminal 10 residues to the whole sequence) and residue representation (amino acid-composition, percentage amino acid-composition or normalised amino acid-composition). The accuracy of the best resulting BN was compared to PSORTB. The accuracy of this multi-location BN was roughly comparable to PSORTB; the difference in predictions is low, often less than 2%. The BN method thus represents both an important new avenue of methodological development for subcellular location prediction and a potentially value new tool of true utilitarian value for candidate subunit vaccine selection.Entities:
Year: 2006 PMID: 17597904 PMCID: PMC1891703 DOI: 10.6026/97320630001260
Source DB: PubMed Journal: Bioinformation ISSN: 0973-2063
Results of the Gram+ all compartments predictor. The best performing network is highlighted in bold
| Sequence representation | Sub-sequence length | Cytoplasmic accuracy (%) | Membrane accuracy (%) | Extracellular accuracy (%) | Negative set accuracy (%) |
|---|---|---|---|---|---|
| Amino acid-composition | 10 | 98.84 | 3.73 | 8.20 | 32.25% |
| 20 | 94.15 | 19.20 | 45.08 | 29.43% | |
| 30 | 93.94 | 41.07 | 52.87 | 51.11% | |
| 40 | 93.94 | 52.53 | 52.05 | 77.62% | |
| 50 | 94.89 | 70.93 | 78.28 | 92.25% | |
| All sequence | 94.49 | 59.20 | 48.77 | 94.71% | |
| Amino acids | 10 | 60.11 | 0.00 | 0.37 | 21.14% |
| 20 | 79.71 | 0.80 | 4.41 | 23.51% | |
| 30 | 85.64 | 3.73 | 8.82 | 31.23% | |
| 40 | 87.81 | 23.73 | 13.97 | 42.24% | |
| 50 | 90.06 | 24.27 | 25.00 | 43.25% | |
| All sequence | 99.12 | 26.93 | 36.40 | 47.14% | |
| Normalised amino acid-composition | 10 | 100 | 30.67 | 56.15 | 25.26% |
| 20 | 100 | 31.73 | 61.48 | 29.43% | |
| 30 | 100 | 46.93 | 56.56 | 42.15% | |
| 40 | 100 | 59.47 | 57.38 | 56.45% | |
| 50 | 100 | 65.00 | 75.00 | 78.26% | |
| All sequence | 100 | 65.00 | 55.33 | 76.15% |
Results of the Gram- all compartments predictor. The best performing network is highlighted in bold
| Sequence representation | Sub-sequence length | Cytoplasmic accuracy (%) | Inner Membrane accuracy (%) | Periplasmic accuracy (%) | Outer Membrane accuracy (%) | Extracelluar accuracy (%) | Negative set accuracy (%) |
|---|---|---|---|---|---|---|---|
| Amino acid-composition | 10 | 78.14 | 13.31 | 61.14 | 1.13 | 55.14 | 80.14 |
| 20 | 81.61 | 49.14 | 63.73 | 7.14 | 59.62 | 84.72 | |
| 30 | 85.25 | 52.51 | 69.09 | 34.15 | 76.15 | 83.62 | |
| 40 | 86.14 | 74.62 | 72.24 | 59.25 | 71.17 | 89.25 | |
| 50 | 84.74 | 91.16 | 79.96 | 70.75 | 86.12 | 92.42 | |
| All sequence | 89.14 | 84.57 | 71.15 | 55.25 | 80.24 | 87.59 | |
| Amino acids | 10 | 32.55 | 22.21 | 2.14 | 0.24 | 8.36 | 27.73 |
| 20 | 35.25 | 27.52 | 7.36 | 2.51 | 14.82 | 32.15 | |
| 30 | 29.83 | 32.15 | 12.93 | 7.37 | 17.93 | 34.85 | |
| 40 | 40.24 | 37.86 | 18.74 | 8.52 | 24.99 | 41.84 | |
| 50 | 41.84 | 32.41 | 34.14 | 12.41 | 28.25 | 36.83 | |
| All sequence | 40.34 | 44.83 | 27.41 | 15.84 | 22.51 | 40.14 | |
| Normalised amino acid-composition | 10 | 63.26 | 22.15 | 52.36 | 0.42 | 40.25 | 57.12 |
| 20 | 64.37 | 53.87 | 58.25 | 2.35 | 43.95 | 58.51 | |
| 30 | 69.73 | 68.46 | 65.36 | 12.94 | 51.64 | 68.20 | |
| 40 | 72.63 | 81.36 | 73.36 | 33.73 | 52.72 | 71.14 | |
| 50 | 74.89 | 92.97 | 71.l4 | 60.26 | 66.67 | 74.28 | |
| All sequence | 71.46 | 91.73 | 70.73 | 52.24 | 63.26 | 71.75 |
Accuracy of PSORTB bacterial subcellular location predictor in comparison to the most accurate methods produced
| Gram-type | Subcellular location | PSORTB accuracy (%) | Multi-location predictor accuracy (%) |
|---|---|---|---|
| Gram+ | Cytoplasmic | 96.38 | 94.89 |
| Membranous | 91.47 | 70.93 | |
| Extra-cellular | 70.42 | 78.28 | |
| Negative set | 93.86 | 92.25 | |
| Gram- | Cytoplasmic | 91.37 | 84.74 |
| Inner membrane | 94.68 | 91.96 | |
| Periplasmic | 84.69 | 79.96 | |
| Outer membrane | 83.70 | 70.75 | |
| Extra-cellular | 77.55 | 86.12 | |
| Negative set | 90.02 | 92.42 |