Literature DB >> 30407484

BacPaCS-Bacterial Pathogenicity Classification via Sparse-SVM.

Eran Barash1, Neta Sal-Man2, Sivan Sabato1, Michal Ziv-Ukelson1.   

Abstract

MOTIVATION: Bacterial infections are a major cause of illness worldwide. However, most bacterial strains pose no threat to human health and may even be beneficial. Thus, developing powerful diagnostic bioinformatic tools that differentiate pathogenic from commensal bacteria are critical for effective treatment of bacterial infections.
RESULTS: We propose a machine-learning approach for classifying human-hosted bacteria as pathogenic or non-pathogenic based on their genome-derived proteomes. Our approach is based on sparse Support Vector Machines (SVM), which autonomously selects a small set of genes that are related to bacterial pathogenicity. We implement our approach as a tool-'Bacterial Pathogenicity Classification via sparse-SVM' (BacPaCS)-which is fully automated and handles datasets significantly larger than those previously used. BacPaCS shows high accuracy in distinguishing pathogenic from non-pathogenic bacteria, in a clinically relevant dataset, comprising only human-hosted bacteria. Among the genes that received the highest positive weight in the resulting classifier, we found genes that are known to be related to bacterial pathogenicity, in addition to novel candidates, whose involvement in bacterial virulence was never reported.
AVAILABILITY AND IMPLEMENTATION: The code and the resulting model are available at: https://github.com/barashe/bacpacs. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2018. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 30407484     DOI: 10.1093/bioinformatics/bty928

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  5 in total

1.  Predicting the pathogenicity of bacterial genomes using widely spread protein families.

Authors:  Shaked Naor-Hoffmann; Dina Svetlitsky; Neta Sal-Man; Yaron Orenstein; Michal Ziv-Ukelson
Journal:  BMC Bioinformatics       Date:  2022-06-24       Impact factor: 3.307

Review 2.  Forest and Trees: Exploring Bacterial Virulence with Genome-wide Association Studies and Machine Learning.

Authors:  Jonathan P Allen; Evan Snitkin; Nathan B Pincus; Alan R Hauser
Journal:  Trends Microbiol       Date:  2021-01-14       Impact factor: 18.230

3.  HDG-select: A novel GUI based application for gene selection and classification in high dimensional datasets.

Authors:  Shilan S Hameed; Rohayanti Hassan; Wan Haslina Hassan; Fahmi F Muhammadsharif; Liza Abdul Latiff
Journal:  PLoS One       Date:  2021-01-28       Impact factor: 3.240

Review 4.  Typing methods based on whole genome sequencing data.

Authors:  Laura Uelze; Josephine Grützke; Maria Borowiak; Jens Andre Hammerl; Katharina Juraschek; Carlus Deneke; Simon H Tausch; Burkhard Malorny
Journal:  One Health Outlook       Date:  2020-02-18

5.  Wide range of metabolic adaptations to the acquisition of the Calvin cycle revealed by comparison of microbial genomes.

Authors:  Johannes Asplund-Samuelsson; Elton P Hudson
Journal:  PLoS Comput Biol       Date:  2021-02-08       Impact factor: 4.475

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.