| Literature DB >> 21414186 |
Ren-Xiang Yan1, Zhen Chen, Ziding Zhang.
Abstract
BACKGROUND: Outer membrane proteins (OMPs) are frequently found in the outer membranes of gram-negative bacteria, mitochondria and chloroplasts and have been found to play diverse functional roles. Computational discrimination of OMPs from globular proteins and other types of membrane proteins is helpful to accelerate new genome annotation and drug discovery.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21414186 PMCID: PMC3072342 DOI: 10.1186/1471-2105-12-76
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Performance of different OMP discrimination methods based on the GS-dataset.
| Method | ||||
|---|---|---|---|---|
| DDa,b | 0.541 | 82.4 | 78.8 | 83.3 |
| NN_AACa,b | 0.716 | 91.0 | 79.3 | 93.8 |
| SVM_AAC_DPCa,b | 0.816 | 93.9 | 90.9 | 94.7 |
| SSEA-OMPc | 0.772 | 90.9 | 72.9 | 98.1 |
| SSEA-OMPd | 0.906 | 96.2 | 91.5 | 98.1 |
aDD, NN_AAC and SVM_AAC_DPC were developed in Suwa's group[5,6,15].
bThe corresponding results are directly cited from [5,6,15].
cBased on the stringent sequence-filtering method. Briefly, only the remaining sequences sharing a sequence identity less than 25%, a BLAST e-value greater than 0.01 and a PSI-BLAST e-value greater than 0.01 with the test protein were kept in the sequence library.
dOnly the first sequence-filtering procedure was employed. Briefly, only the remaining sequences sharing a sequence identity less than 25% and a BLAST e-value greater than 0.01 with the test protein were kept in the sequence library. It should be emphasized that the performance of SSEA-OMP based on the first sequence-filtering procedure could be overestimated. We list the SSEA-OMP performance based on the first sequence-filtering procedure to allow a generally fair comparison between SSEA-OMP and the other three methods, since the performance of the other three methods were characterized by simple sequence identity-based filtering procedure [5,6,15].
Figure 1ROC curve of SSEA-OMP assessed using the GS-dataset. The overall performance of SSEA-OMP was assessed by the Leave-One-Out (LOO) test. In each step of the LOO test, only the remaining sequences sharing a sequence identity less than 25%, a BLAST e-value greater than 0.01 and a PSI-BLAST e-value greater than 0.01 were kept and used as the training dataset.
Figure 2ROC curves of different OMP discrimination methods assessed using the R-dataset. The ROC curves of HHomp, PROFtmb, BOMP and TMB-Hunt were previously reported by Remmert et al. (2009) [7] and the corresponding data points were downloaded from ftp://ftp.tuebingen.mpg.de/pub/protevo/HHomp/benchmark/. To benchmark the overall performance of SSEA-OMP on the R-dataset, we also used the stringent sequence-filtering method. After the sequence filtering in each step of the benchmark experiment, we ensured that any sequence in the reference dataset should share a sequence identity less than 25%, a BLAST e-value greater than 0.01 and a PSI-BLAST e-value greater than 0.01 with the test sequence in the R-dataset.