| Literature DB >> 19849839 |
Piero Fariselli1, Castrense Savojardo, Pier Luigi Martelli, Rita Casadio.
Abstract
BACKGROUND: Discriminative models are designed to naturally address classification tasks. However, some applications require the inclusion of grammar rules, and in these cases generative models, such as Hidden Markov Models (HMMs) and Stochastic Grammars, are routinely applied.Entities:
Year: 2009 PMID: 19849839 PMCID: PMC2776008 DOI: 10.1186/1748-7188-4-13
Source DB: PubMed Journal: Algorithms Mol Biol ISSN: 1748-7188 Impact factor: 1.405
Figure 1Graphical structure of a linear-CRF (left) and a linear GRHCRF/HCRF (right).
Prediction of the topology of the Prokaryotic outer membrane proteins.
| CRF-1 (Vit) | 0.26 ± 0.05 | 0.72 ± 0.01 | 0.47 ± 0.02 | 0.59 ± 0.01 | 0.80 ± 0.01 |
| CRF-1 (Pvit) | 0.39 ± 0.05 | 0.77 ± 0.01 | 0.54 ± 0.02 | 0.71 ± 0.01 | 0.80 ± 0.01 |
| CRF-2 (Vit) | 0.34 ± 0.05 | 0.76 ± 0.01 | 0.52 ± 0.03 | 0.63 ± 0.02 | 0.82 ± 0.02 |
| CRF-2 (Pvit) | 0.47 ± 0.05 | 0.80 ± 0.01 | 0.60 ± 0.03 | 0.74 ± 0.02 | 0.82 ± 0.02 |
| CRF-3 (Vit) | 0.29 ± 0.04 | 0.72 ± 0.01 | 0.45 ± 0.02 | 0.60 ± 0.02 | 0.79 ± 0.01 |
| CRF-3 (Pvit) | 0.45 ± 0.04 | 0.76 ± 0.01 | 0.52 ± 0.02 | 0.70 ± 0.02 | 0.79 ± 0.01 |
| GRHCRF | 0.66 ± 0.04 | 0.85 ± 0.01 | 0.70 ± 0.03 | 0.83 ± 0.01 | 0.84 ± 0.01 |
| HMM-B2TMR | 0.58 ± 0.04 | 0.80 ± 0.01 | 0.62 ± 0.02 | 0.82 ± 0.02 | 0.83 ± 0.01 |
C(t), Sn(t) and Sp(t) are reported for the transmembrane segments (t).
Vit = Viterbi decoding, Pvit = posterior-Viterbi decoding.
For GRHCRF and HMM-B2TMR we used the posterior-Viterbi decoding.
Models are detailed in the text. Scoring indices are described in Measure of Accuracy section.
Figure 2Automaton structure designed for the prediction of the topology of the outer-membrane proteins in Prokaryotes with GRHCRFs and HMMs.
Figure 3Three different non-ambigous automata derived from the one depicted in Figure 2. These automata are designed to have a bijective mapping between the states and the labels (after the corresponding re-labeling of the sequences). In the text they are referred as CRF1 (a), CRF2 (b) and CRF3 (c).
Confidence level of the results reported in Table 1.
| GRHCRF vs CRF-1 | 98.0% | 99.5% | 99.5% | 99.8% | 99.5% |
| GRHCRF vs CRF-2 | 96.0% | 99.5% | 99.5% | 99.5% | 99.5% |
| GRHCRF vs CRF-3 | 96.0% | 99.5% | 99.5% | 99.5% | 99.5% |
| GRHCRF vs HMM-B2TMR | 80.0% | 96.0% | 99.0% | 98.0% | 99.5% |
The confidence level on the significance of the differences was computed with a t-test.
For all methods we consider the best results of Table 1.