Literature DB >> 33585423

Predicting Cell Wall Lytic Enzymes Using Combined Features.

Xiao-Yang Jing1, Feng-Min Li1.   

Abstract

Due to the overuse of antibiotics, people are worried that existing antibiotics will become ineffective against pathogens with the rapid rise of antibiotic-resistant strains. The use of cell wall lytic enzymes to destroy bacteria has become a viable alternative to avoid the crisis of antimicrobial resistance. In this paper, an improved method for cell wall lytic enzymes prediction was proposed and the amino acid composition (AAC), the dipeptide composition (DC), the position-specific score matrix auto-covariance (PSSM-AC), and the auto-covariance average chemical shift (acACS) were selected to predict the cell wall lytic enzymes with support vector machine (SVM). In order to overcome the imbalanced data classification problems and remove redundant or irrelevant features, the synthetic minority over-sampling technique (SMOTE) was used to balance the dataset. The F-score was used to select features. The S n , S p , MCC, and Acc were 99.35%, 99.02%, 0.98, and 99.19% with jackknife test using the optimized combination feature AAC+DC+acACS+PSSM-AC. The S n , S p , MCC, and Acc of cell wall lytic enzymes in our predictive model were higher than those in existing methods. This improved method may be helpful for protein function prediction.
Copyright © 2021 Jing and Li.

Entities:  

Keywords:  F-score; cell wall lytic enzymes; jackknife test; optimized combination feature; support vector machine; synthetic minority over-sampling technique

Year:  2021        PMID: 33585423      PMCID: PMC7874139          DOI: 10.3389/fbioe.2020.627335

Source DB:  PubMed          Journal:  Front Bioeng Biotechnol        ISSN: 2296-4185


  49 in total

1.  An empirical correlation between secondary structure content and averaged chemical shifts in proteins.

Authors:  Anaika B Sibley; Monique Cosman; V V Krishnan
Journal:  Biophys J       Date:  2003-02       Impact factor: 4.033

2.  Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences.

Authors:  Weizhong Li; Adam Godzik
Journal:  Bioinformatics       Date:  2006-05-26       Impact factor: 6.937

3.  HPSLPred: An Ensemble Multi-Label Classifier for Human Protein Subcellular Location Prediction with Imbalanced Source.

Authors:  Shixiang Wan; Yucong Duan; Quan Zou
Journal:  Proteomics       Date:  2017-09       Impact factor: 3.984

4.  Systematic analysis and prediction of type IV secreted effector proteins by machine learning approaches.

Authors:  Jiawei Wang; Bingjiao Yang; Yi An; Tatiana Marquez-Lago; André Leier; Jonathan Wilksch; Qingyang Hong; Yang Zhang; Morihiro Hayashida; Tatsuya Akutsu; Geoffrey I Webb; Richard A Strugnell; Jiangning Song; Trevor Lithgow
Journal:  Brief Bioinform       Date:  2019-05-21       Impact factor: 11.622

5.  Computational and Biological Methods for Gene Therapy.

Authors:  Liang Cheng
Journal:  Curr Gene Ther       Date:  2019       Impact factor: 4.391

6.  Characterization of Streptococcus gordonii prophage PH15: complete genome sequence and functional analysis of phage-encoded integrase and endolysin.

Authors:  Jan R van der Ploeg
Journal:  Microbiology       Date:  2008-10       Impact factor: 2.777

7.  Prediction of lysine propionylation sites using biased SVM and incorporating four different sequence features into Chou's PseAAC.

Authors:  Zhe Ju; Jian-Jun He
Journal:  J Mol Graph Model       Date:  2017-07-25       Impact factor: 2.518

8.  Identification of Phage Viral Proteins With Hybrid Sequence Features.

Authors:  Xiaoqing Ru; Lihong Li; Chunyu Wang
Journal:  Front Microbiol       Date:  2019-03-26       Impact factor: 5.640

9.  External lysis of Escherichia coli by a bacteriophage endolysin modified with hydrophobic amino acids.

Authors:  Guangmou Yan; Rui Yang; Kejia Fan; Hanlin Dong; Chencheng Gao; Shuang Wang; Ling Yu; Zhe Cheng; Liancheng Lei
Journal:  AMB Express       Date:  2019-07-15       Impact factor: 3.298

10.  iCrotoK-PseAAC: Identify lysine crotonylation sites by blending position relative statistical features according to the Chou's 5-step rule.

Authors:  Sharaf Jameel Malebary; Muhammad Safi Ur Rehman; Yaser Daanial Khan
Journal:  PLoS One       Date:  2019-11-21       Impact factor: 3.240

View more
  2 in total

1.  SortPred: The first machine learning based predictor to identify bacterial sortases and their classes using sequence-derived information.

Authors:  Adeel Malik; Sathiyamoorthy Subramaniyam; Chang-Bae Kim; Balachandran Manavalan
Journal:  Comput Struct Biotechnol J       Date:  2021-12-14       Impact factor: 7.271

2.  Identification of suicidality in patients with major depressive disorder via dynamic functional network connectivity signatures and machine learning.

Authors:  Manxi Xu; Xiaojing Zhang; Yanqing Li; Shengli Chen; Yingli Zhang; Zhifeng Zhou; Shiwei Lin; Tianfa Dong; Gangqiang Hou; Yingwei Qiu
Journal:  Transl Psychiatry       Date:  2022-09-12       Impact factor: 7.989

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.