Literature DB >> 32251507

Learning transferable deep convolutional neural networks for the classification of bacterial virulence factors.

Dandan Zheng1, Guansong Pang2, Bo Liu1, Lihong Chen1, Jian Yang1.   

Abstract

MOTIVATION: Identification of virulence factors (VFs) is critical to the elucidation of bacterial pathogenesis and prevention of related infectious diseases. Current computational methods for VF prediction focus on binary classification or involve only several class(es) of VFs with sufficient samples. However, thousands of VF classes are present in real-world scenarios, and many of them only have a very limited number of samples available.
RESULTS: We first construct a large VF dataset, covering 3446 VF classes with 160 495 sequences, and then propose deep convolutional neural network models for VF classification. We show that (i) for common VF classes with sufficient samples, our models can achieve state-of-the-art performance with an overall accuracy of 0.9831 and an F1-score of 0.9803; (ii) for uncommon VF classes with limited samples, our models can learn transferable features from auxiliary data and achieve good performance with accuracy ranging from 0.9277 to 0.9512 and F1-score ranging from 0.9168 to 0.9446 when combined with different predefined features, outperforming traditional classifiers by 1-13% in accuracy and by 1-16% in F1-score.
AVAILABILITY AND IMPLEMENTATION: All of our datasets are made publicly available at http://www.mgc.ac.cn/VFNet/, and the source code of our models is publicly available at https://github.com/zhengdd0422/VFNet. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2020. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Year:  2020        PMID: 32251507     DOI: 10.1093/bioinformatics/btaa230

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  2 in total

1.  DeePhage: distinguishing virulent and temperate phage-derived sequences in metavirome data with a deep learning approach.

Authors:  Shufang Wu; Zhencheng Fang; Jie Tan; Mo Li; Chunhui Wang; Qian Guo; Congmin Xu; Xiaoqing Jiang; Huaiqiu Zhu
Journal:  Gigascience       Date:  2021-09-08       Impact factor: 6.524

2.  VFDB 2022: a general classification scheme for bacterial virulence factors.

Authors:  Bo Liu; Dandan Zheng; Siyu Zhou; Lihong Chen; Jian Yang
Journal:  Nucleic Acids Res       Date:  2022-01-07       Impact factor: 16.971

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.