Lingyun Zou1, Chonghan Nan, Fuquan Hu. 1. Department of Microbiology, College of Basic Medical Sciences, Third Military Medical University (TMMU), Chongqing 40038, China and Department of Tuberculosis, Institute of Infectious TB Prevention, Third Hospital of PLA, Baoji, Shanxi 721006, China.
Abstract
MOTIVATION: Various human pathogens secret effector proteins into hosts cells via the type IV secretion system (T4SS). These proteins play important roles in the interaction between bacteria and hosts. Computational methods for T4SS effector prediction have been developed for screening experimental targets in several isolated bacterial species; however, widely applicable prediction approaches are still unavailable RESULTS: In this work, four types of distinctive features, namely, amino acid composition, dipeptide composition, .position-specific scoring matrix composition and auto covariance transformation of position-specific scoring matrix, were calculated from primary sequences. A classifier, T4EffPred, was developed using the support vector machine with these features and their different combinations for effector prediction. Various theoretical tests were performed in a newly established dataset, and the results were measured with four indexes. We demonstrated that T4EffPred can discriminate IVA and IVB effectors in benchmark datasets with positive rates of 76.7% and 89.7%, respectively. The overall accuracy of 95.9% shows that the present method is accurate for distinguishing the T4SS effector in unidentified sequences. A classifier ensemble was designed to synthesize all single classifiers. Notable performance improvement was observed using this ensemble system in benchmark tests. To demonstrate the model's application, a genome-scale prediction of effectors was performed in Bartonella henselae, an important zoonotic pathogen. A number of putative candidates were distinguished. AVAILABILITY: A web server implementing the prediction method and the source code are both available at http://bioinfo.tmmu.edu.cn/T4EffPred.
MOTIVATION: Various human pathogens secret effector proteins into hosts cells via the type IV secretion system (T4SS). These proteins play important roles in the interaction between bacteria and hosts. Computational methods for T4SS effector prediction have been developed for screening experimental targets in several isolated bacterial species; however, widely applicable prediction approaches are still unavailable RESULTS: In this work, four types of distinctive features, namely, amino acid composition, dipeptide composition, .position-specific scoring matrix composition and auto covariance transformation of position-specific scoring matrix, were calculated from primary sequences. A classifier, T4EffPred, was developed using the support vector machine with these features and their different combinations for effector prediction. Various theoretical tests were performed in a newly established dataset, and the results were measured with four indexes. We demonstrated that T4EffPred can discriminate IVA and IVB effectors in benchmark datasets with positive rates of 76.7% and 89.7%, respectively. The overall accuracy of 95.9% shows that the present method is accurate for distinguishing the T4SS effector in unidentified sequences. A classifier ensemble was designed to synthesize all single classifiers. Notable performance improvement was observed using this ensemble system in benchmark tests. To demonstrate the model's application, a genome-scale prediction of effectors was performed in Bartonella henselae, an important zoonotic pathogen. A number of putative candidates were distinguished. AVAILABILITY: A web server implementing the prediction method and the source code are both available at http://bioinfo.tmmu.edu.cn/T4EffPred.
Authors: Ziv Lifshitz; David Burstein; Michael Peeri; Tal Zusman; Kierstyn Schwartz; Howard A Shuman; Tal Pupko; Gil Segal Journal: Proc Natl Acad Sci U S A Date: 2013-02-04 Impact factor: 11.205
Authors: Svetlana Lockwood; Daniel E Voth; Kelly A Brayton; Paul A Beare; Wendy C Brown; Robert A Heinzen; Shira L Broschat Journal: PLoS One Date: 2011-11-28 Impact factor: 3.240
Authors: Megan Y Nas; Richard C White; Ashley L DuMont; Alberto E Lopez; Nicholas P Cianciotto Journal: Infect Immun Date: 2019-08-21 Impact factor: 3.441