Literature DB >> 30407530

DeepT3: deep convolutional neural networks accurately identify Gram-negative bacterial type III secreted effectors using the N-terminal sequence.

Li Xue1, Bin Tang2, Wei Chen3, Jiesi Luo4.   

Abstract

MOTIVATION: Various bacterial pathogens can deliver their secreted substrates also called effectors through Type III secretion systems (T3SSs) into host cells and cause diseases. Since T3SS secreted effectors (T3SEs) play important roles in pathogen-host interactions, identifying them is crucial to our understanding of the pathogenic mechanisms of T3SSs. However, the effectors display high level of sequence diversity, therefore making the identification a difficult process. There is a need to develop a novel and effective method to screen and select putative novel effectors from bacterial genomes that can be validated by a smaller number of key experiments.
RESULTS: We develop a deep convolution neural network to directly classify any protein sequence into T3SEs or non-T3SEs, which is useful for both effector prediction and the study of sequence-function relationship. Different from traditional machine learning-based methods, our method automatically extracts T3SE-related features from a protein N-terminal sequence of 100 residues and maps it to the T3SEs space. We train and test our method on the datasets curated from 16 species, yielding an average classification accuracy of 83.7% in the 5-fold cross-validation and an accuracy of 92.6% for the test set. Moreover, when comparing with known state-of-the-art prediction methods, the accuracy of our method is 6.31-20.73% higher than previous methods on a common independent dataset. Besides, we visualize the convolutional kernels and successfully identify the key features of T3SEs, which contain important signal information for secretion. Finally, some effectors reported in the literature are used to further demonstrate the application of DeepT3.
AVAILABILITY AND IMPLEMENTATION: DeepT3 is freely available at: https://github.com/lje00006/DeepT3. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2018. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Mesh:

Substances:

Year:  2019        PMID: 30407530     DOI: 10.1093/bioinformatics/bty931

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  11 in total

1.  Protein Science Meets Artificial Intelligence: A Systematic Review and a Biochemical Meta-Analysis of an Inter-Field.

Authors:  Jalil Villalobos-Alva; Luis Ochoa-Toledo; Mario Javier Villalobos-Alva; Atocha Aliseda; Fernando Pérez-Escamirosa; Nelly F Altamirano-Bustamante; Francine Ochoa-Fernández; Ricardo Zamora-Solís; Sebastián Villalobos-Alva; Cristina Revilla-Monsalve; Nicolás Kemper-Valverde; Myriam M Altamirano-Bustamante
Journal:  Front Bioeng Biotechnol       Date:  2022-07-07

2.  ACNNT3: Attention-CNN Framework for Prediction of Sequence-Based Bacterial Type III Secreted Effectors.

Authors:  Jie Li; Zhong Li; Jiesi Luo; Yuhua Yao
Journal:  Comput Math Methods Med       Date:  2020-04-03       Impact factor: 2.238

3.  DeepT3_4: A Hybrid Deep Neural Network Model for the Distinction Between Bacterial Type III and IV Secreted Effectors.

Authors:  Lezheng Yu; Fengjuan Liu; Yizhou Li; Jiesi Luo; Runyu Jing
Journal:  Front Microbiol       Date:  2021-01-21       Impact factor: 5.640

4.  iT3SE-PX: Identification of Bacterial Type III Secreted Effectors Using PSSM Profiles and XGBoost Feature Selection.

Authors:  Chenchen Ding; Haitao Han; Qianyue Li; Xiaoxia Yang; Taigang Liu
Journal:  Comput Math Methods Med       Date:  2021-01-06       Impact factor: 2.238

5.  Computational Systems Biology of Alfalfa - Bacterial Blight Host-Pathogen Interactions: Uncovering the Complex Molecular Networks for Developing Durable Disease Resistant Crop.

Authors:  Raghav Kataria; Naveen Duhan; Rakesh Kaundal
Journal:  Front Plant Sci       Date:  2022-02-17       Impact factor: 5.753

6.  ProtPlat: an efficient pre-training platform for protein classification based on FastText.

Authors:  Yuan Jin; Yang Yang
Journal:  BMC Bioinformatics       Date:  2022-02-11       Impact factor: 3.169

7.  T1SEstacker: A Tri-Layer Stacking Model Effectively Predicts Bacterial Type 1 Secreted Proteins Based on C-Terminal Non-repeats-in-Toxin-Motif Sequence Features.

Authors:  Zewei Chen; Ziyi Zhao; Xinjie Hui; Junya Zhang; Yixue Hu; Runhong Chen; Xuxia Cai; Yueming Hu; Yejun Wang
Journal:  Front Microbiol       Date:  2022-02-08       Impact factor: 5.640

8.  Advances and Challenges in Studying Type III Secretion Effectors of Attaching and Effacing Pathogens.

Authors:  Sabrina L Slater; Gad Frankel
Journal:  Front Cell Infect Microbiol       Date:  2020-07-07       Impact factor: 5.293

9.  SDM6A: A Web-Based Integrative Machine-Learning Framework for Predicting 6mA Sites in the Rice Genome.

Authors:  Shaherin Basith; Balachandran Manavalan; Tae Hwan Shin; Gwang Lee
Journal:  Mol Ther Nucleic Acids       Date:  2019-08-16       Impact factor: 8.886

10.  ChlamDB: a comparative genomics database of the phylum Chlamydiae and other members of the Planctomycetes-Verrucomicrobiae-Chlamydiae superphylum.

Authors:  Trestan Pillonel; Florian Tagini; Claire Bertelli; Gilbert Greub
Journal:  Nucleic Acids Res       Date:  2020-01-08       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.