| Literature DB >> 19429691 |
Michael Remmert1, Dirk Linke, Andrei N Lupas, Johannes Söding.
Abstract
Outer membrane proteins (OMPs) are the transmembrane proteins found in the outer membranes of Gram-negative bacteria, mitochondria and plastids. Most prediction methods have focused on analogous features, such as alternating hydrophobicity patterns. Here, we start from the observation that almost all beta-barrel OMPs are related by common ancestry. We identify proteins as OMPs by detecting their homologous relationships to known OMPs using sequence similarity. Given an input sequence, HHomp builds a profile hidden Markov model (HMM) and compares it with an OMP database by pairwise HMM comparison, integrating OMP predictions by PROFtmb. A crucial ingredient is the OMP database, which contains profile HMMs for over 20,000 putative OMP sequences. These were collected with the exhaustive, transitive homology detection method HHsenser, starting from 23 representative OMPs in the PDB database. In a benchmark on TransportDB, HHomp detects 63.5% of the true positives before including the first false positive. This is 70% more than PROFtmb, four times more than BOMP and 10 times more than TMB-Hunt. In Escherichia coli, HHomp identifies 57 out of 59 known OMPs and correctly assigns them to their functional subgroups. HHomp can be accessed at http://toolkit.tuebingen.mpg.de/hhomp.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19429691 PMCID: PMC2703889 DOI: 10.1093/nar/gkp325
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.ROC plot comparing HHomp with the β-barrel prediction methods PROFtmb, TMB-Hunt and BOMP. The true positives (TPs) set consists of the 2164 proteins annotated as OMPs in TransportDB without a BLAST E-value < 0.01 to any of the 23 proteins in our trainings set. The FPs are 5000 randomly selected non-OMP proteins from SCOP. HHomp detects 63.5% of all TPs before the first FP. The effective error rate is the number of effective FPs divided by the sum of TPs and effective FPs are defined as FPs multiplied with 21 to obtain the same fraction of OMPs in the benchmark set as in Gram-negative genomes (2%) (1).
Number of proteins predicted as OMPs by HHomp and PROFtmb for various genomes
| Organism class | Organism | Proteins | HHomp hits with prob | PROFtmb hits with score | ||
|---|---|---|---|---|---|---|
| 100 (%) | >90 (%) | >10 | >7 | |||
| Archaea | 1841 | 0 | 0 | 0 | 4 (0.2%) | |
| 1784 | 0 | 0 | 1 (0.1%) | 2 (0.1%) | ||
| Gram-positive bacteria | 2618 | 0 | 1 (0.04) | 6 (0.2%) | 13 (0.5%) | |
| 4102 | 0 | 0 | 0 | 8 (0.2%) | ||
| 2272 | 0 | 1 (0.04) | 3 (0.1%) | 20 (0.9%) | ||
| 2771 | 0 | 0 | 4 (0.1%) | 17 (0.6%) | ||
| Gram-negative bacteria | 4240 | 71 (1.7) | 77 (1.8) | 29 (0.7%) | 82 (1.9%) | |
| 2063 | 34 (1.6) | 36 (1.7) | 10 (0.5%) | 26 (1.2%) | ||
| 5288 | 36 (0.7) | 39 (0.8) | 26 (0.5%) | 98 (1.8%) | ||
| 1488 | 29 (1.9) | 31 (2.1) | 7 (0.5%) | 21 (1.4%) | ||
| Cyanobacteria | 2892 | 20 (0.7) | 26 (0.9) | 2 (0.1%) | 32 (1.1%) | |
| Eucarya | 5869 | 1 (0.02) | 4 (0.07) | 20 (0.3%) | 64 (1.0%) | |
| 34 143 | 1 (∼0) | 20 (0.1) | 74 (0.2%) | 362 (1.0%) | ||
Gram-positive bacteria and archaea do not have an outer membrane and should therefore not possess OMPs. Note that the error rate of HHomp at 90% probability is significant lower than the error rate of PROFtmb at a score of 10. In most Gram-negative bacteria, HHomp detects >1.5% OMPs with 100%, over twice more than PROFtmb at a score of 10. In yeast, HHomp correctly predicts the major mitochondrial OMBBs (35,36).
Figure 2.HHomp results page with graphical overview of regions matched to OMPs from the HHomp database, summary results list and detailed alignments.