| Literature DB >> 35069791 |
Mujiexin Liu1, Hui Chen2, Dong Gao3, Cai-Yi Ma3, Zhao-Yue Zhang2,3.
Abstract
Helicobacter pylori (H. pylori) is the most common risk factor for gastric cancer worldwide. The membrane proteins of the H. pylori are involved in bacterial adherence and play a vital role in the field of drug discovery. Thus, an accurate and cost-effective computational model is needed to predict the uncharacterized membrane proteins of H. pylori. In this study, a reliable benchmark dataset consisted of 114 membrane and 219 nonmembrane proteins was constructed based on UniProt. A support vector machine- (SVM-) based model was developed for discriminating H. pylori membrane proteins from nonmembrane proteins by using sequence information. Cross-validation showed that our method achieved good performance with an accuracy of 91.29%. It is anticipated that the proposed model will be useful for the annotation of H. pylori membrane proteins and the development of new anti-H. pylori agents.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35069791 PMCID: PMC8769816 DOI: 10.1155/2022/7493834
Source DB: PubMed Journal: Comput Math Methods Med ISSN: 1748-670X Impact factor: 2.238
Figure 1The workflow diagram of developing the H. pylori membrane protein prediction model.
Figure 2The IFS curves for (a) 2-mer features, (b) gapped 2-mer features, (c) PseAAC features, and (d) merged features.
Figure 3The ROC curves of the 5-fold cross-validation test.
Figure 4(a) The heat map of AAC of the model features. (b) The frequency of the six amino acids in the two classes.