Literature DB >> 33505516

iT3SE-PX: Identification of Bacterial Type III Secreted Effectors Using PSSM Profiles and XGBoost Feature Selection.

Chenchen Ding1, Haitao Han1, Qianyue Li1, Xiaoxia Yang1, Taigang Liu1.   

Abstract

Identification of bacterial type III secreted effectors (T3SEs) has become a popular research topic in the field of bioinformatics due to its crucial role in understanding host-pathogen interaction and developing better therapeutic targets against the pathogens. However, the recognition of all effector proteins by using traditional experimental approaches is often time-consuming and laborious. Therefore, development of computational methods to accurately predict putative novel effectors is important in reducing the number of biological experiments for validation. In this study, we proposed a method, called iT3SE-PX, to identify T3SEs solely based on protein sequences. First, three kinds of features were extracted from the position-specific scoring matrix (PSSM) profiles to help train a machine learning (ML) model. Then, the extreme gradient boosting (XGBoost) algorithm was performed to rank these features based on their classification ability. Finally, the optimal features were selected as inputs to a support vector machine (SVM) classifier to predict T3SEs. Based on the two benchmark datasets, we conducted a 100-time randomized 5-fold cross validation (CV) and an independent test, respectively. The experimental results demonstrated that the proposed method achieved superior performance compared to most of the existing methods and could serve as a useful tool for identifying putative T3SEs, given only the sequence information.
Copyright © 2021 Chenchen Ding et al.

Entities:  

Year:  2021        PMID: 33505516      PMCID: PMC7806399          DOI: 10.1155/2021/6690299

Source DB:  PubMed          Journal:  Comput Math Methods Med        ISSN: 1748-670X            Impact factor:   2.238


  39 in total

Review 1.  Biological applications of support vector machines.

Authors:  Zheng Rong Yang
Journal:  Brief Bioinform       Date:  2004-12       Impact factor: 11.622

2.  Type III secretion of the Salmonella effector protein SopE is mediated via an N-terminal amino acid signal and not an mRNA sequence.

Authors:  M H Karavolos; A J Roe; M Wilson; J Henderson; J J Lee; D L Gally; C M A Khan
Journal:  J Bacteriol       Date:  2005-03       Impact factor: 3.490

Review 3.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.

Authors:  S F Altschul; T L Madden; A A Schäffer; J Zhang; Z Zhang; W Miller; D J Lipman
Journal:  Nucleic Acids Res       Date:  1997-09-01       Impact factor: 16.971

4.  Bastion3: a two-layer ensemble predictor of type III secreted effectors.

Authors:  Jiawei Wang; Jiahui Li; Bingjiao Yang; Ruopeng Xie; Tatiana T Marquez-Lago; André Leier; Morihiro Hayashida; Tatsuya Akutsu; Yanju Zhang; Kuo-Chen Chou; Joel Selkrig; Tieli Zhou; Jiangning Song; Trevor Lithgow
Journal:  Bioinformatics       Date:  2019-06-01       Impact factor: 6.937

5.  Comprehensive assessment and performance improvement of effector protein predictors for bacterial secretion systems III, IV and VI.

Authors:  Yi An; Jiawei Wang; Chen Li; André Leier; Tatiana Marquez-Lago; Jonathan Wilksch; Yang Zhang; Geoffrey I Webb; Jiangning Song; Trevor Lithgow
Journal:  Brief Bioinform       Date:  2018-01-01       Impact factor: 11.622

6.  Using weakly conserved motifs hidden in secretion signals to identify type-III effectors from bacterial pathogen genomes.

Authors:  Xiaobao Dong; Yong-Jun Zhang; Ziding Zhang
Journal:  PLoS One       Date:  2013-02-20       Impact factor: 3.240

7.  Effective identification of Gram-negative bacterial type III secreted effectors using position-specific residue conservation profiles.

Authors:  Xiaojiao Yang; Yanzhi Guo; Jiesi Luo; Xuemei Pu; Menglong Li
Journal:  PLoS One       Date:  2013-12-31       Impact factor: 3.240

8.  Prediction of Protein Structural Class Based on Gapped-Dipeptides and a Recursive Feature Selection Approach.

Authors:  Taigang Liu; Yufang Qin; Yongjie Wang; Chunhua Wang
Journal:  Int J Mol Sci       Date:  2015-12-24       Impact factor: 5.923

9.  Computational prediction shines light on type III secretion origins.

Authors:  Tatyana Goldberg; Burkhard Rost; Yana Bromberg
Journal:  Sci Rep       Date:  2016-10-07       Impact factor: 4.379

10.  A Random Forest Sub-Golgi Protein Classifier Optimized via Dipeptide and Amino Acid Composition Features.

Authors:  Zhibin Lv; Shunshan Jin; Hui Ding; Quan Zou
Journal:  Front Bioeng Biotechnol       Date:  2019-09-04
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.