| Literature DB >> 32827670 |
Chaolu Meng1, Jin Wu2, Fei Guo3, Benzhi Dong4, Lei Xu5.
Abstract
Cell wall lytic enzymes play key roles in biochemical, morphological, genetic research and industry fields. To save time and labor costs, bioinformatic methods are usually adopted to narrow the scope of in vitro experimentation. In this paper, we established a novel machine learning (support vector machine) based identifier called CWLy-pred to identify cell wall lytic enzymes. An improved MRMD feature selection method is also proposed to select the optimal training set to avoid data redundancy. CWLy-pred obtains an accuracy of 93.067%, a sensitivity of 85.3%, a specificity of 94.8%, an MCC of 0.775 and an AUC of 0.900. It outperforms the state-of-the-art identifier in terms of accuracy, sensitivity, specificity and MCC. Our proposed model is based on a feature set of only 6 dimensions; therefore, it not only can overcome overfitting problems but can also supervise biological experiments effectively. CWLy-pred is embedded in a web application at http://server.malab.cn/CWLy-pred/index.jsp, which is accessible for free.Entities:
Keywords: Bioinformatics; Cell wall lytic enzymes; Improved MRMD feature selection method; Machine learning-based identifier; Support vector machine
Year: 2020 PMID: 32827670 DOI: 10.1016/j.ygeno.2020.08.015
Source DB: PubMed Journal: Genomics ISSN: 0888-7543 Impact factor: 5.736