Literature DB >> 25800819

A sequence-based two-level method for the prediction of type I secreted RTX proteins.

Jiesi Luo1, Wenling Li, Zhongyu Liu, Yanzhi Guo, Xuemei Pu, Menglong Li.   

Abstract

Many Gram-negative bacteria use the type I secretion system (T1SS) to translocate a wide range of substrates (type I secreted RTX proteins, T1SRPs) from the cytoplasm across the inner and outer membrane in one step to the extracellular space. Since T1SRPs play an important role in pathogen-host interactions, identifying them is crucial for a full understanding of the pathogenic mechanism of T1SS. However, experimental identification is often time-consuming and expensive. In the post-genomic era, it becomes imperative to predict new T1SRPs using information from the amino acid sequence alone when new proteins are being identified in a high-throughput mode. In this study, we report a two-level method for the first attempt to identify T1SRPs using sequence-derived features and the random forest (RF) algorithm. At the full-length sequence level, the results show that the unique feature of T1SRPs is the presence of variable numbers of the calcium-binding RTX repeats. These RTX repeats have a strong predictive power and so T1SRPs can be well distinguished from non-T1SRPs. At another level, different from that of the secretion signal, we find that a sequence segment located at the last 20-30 C-terminal amino acids may contain important signal information for T1SRP secretion because obvious differences were shown between the corresponding positions of T1SRPs and non-T1SRPs in terms of amino acid and secondary structure compositions. Using five-fold cross-validation, overall accuracies of 97% at the full-length sequence level and 89% at the secretion signal level were achieved through feature evaluation and optimization. Benchmarking on an independent dataset, our method could correctly predict 63 and 66 of 74 T1SRPs at the full-length sequence and secretion signal levels, respectively. We believe that this study will be useful in elucidating the secretion mechanism of T1SS and facilitating hypothesis-driven experimental design and validation.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 25800819     DOI: 10.1039/c5an00311c

Source DB:  PubMed          Journal:  Analyst        ISSN: 0003-2654            Impact factor:   4.616


  4 in total

1.  Individually double minimum-distance definition of protein-RNA binding residues and application to structure-based prediction.

Authors:  Wen Hu; Liu Qin; Menglong Li; Xuemei Pu; Yanzhi Guo
Journal:  J Comput Aided Mol Des       Date:  2018-11-26       Impact factor: 3.686

2.  Genome-wide prediction of bacterial effector candidates across six secretion system types using a feature-based statistical framework.

Authors:  Andi Dhroso; Samantha Eidson; Dmitry Korkin
Journal:  Sci Rep       Date:  2018-11-21       Impact factor: 4.379

3.  T1SEstacker: A Tri-Layer Stacking Model Effectively Predicts Bacterial Type 1 Secreted Proteins Based on C-Terminal Non-repeats-in-Toxin-Motif Sequence Features.

Authors:  Zewei Chen; Ziyi Zhao; Xinjie Hui; Junya Zhang; Yixue Hu; Runhong Chen; Xuxia Cai; Yueming Hu; Yejun Wang
Journal:  Front Microbiol       Date:  2022-02-08       Impact factor: 5.640

4.  Protein-Specific Prediction of RNA-Binding Sites Based on Information Entropy.

Authors:  Yue Ji; Lu Bai; Menglong Li
Journal:  Comput Intell Neurosci       Date:  2022-10-03
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.