| Literature DB >> 32135196 |
Chaolu Meng1, Jun Zhang2, Xiucai Ye3, Fei Guo4, Quan Zou5.
Abstract
Phage virion protein (PVP) identification plays key role in elucidating relationships between phages and hosts. Moreover, PVP identification can facilitate the design of related biochemical entities. Recently, several machine learning approaches have emerged for this purpose and have shown their potential capacities. In this study, the proposed PVP identifiers are systemically reviewed, and the related algorithms and tools are comprehensively analyzed. We summarized the common framework of these PVP identifiers and constructed our own novel identifiers based upon the framework. Furthermore, we focus on a performance comparison of all PVP identifiers by using a training dataset and an independent dataset. Highlighting the pros and cons of these identifiers demonstrates that g-gap DPC (dipeptide composition) features are capable of representing characteristics of PVPs. Moreover, SVM (support vector machine) is proven to be the more effective classifier to distinguish PVPs and non-PVPs.Entities:
Keywords: G-gap DPC; Machine leaning; Phage virion proteins; Support vector machine
Year: 2020 PMID: 32135196 DOI: 10.1016/j.bbapap.2020.140406
Source DB: PubMed Journal: Biochim Biophys Acta Proteins Proteom ISSN: 1570-9639 Impact factor: 3.036