Laura de Nies (1), Sara Lopes (1), Susheel Bhanu Busi (1), Valentina Galata (1), Anna Heintz-Buschart (1, 2, 3), Cedric Christian Laczny (1), Patrick May (4), Paul Wilmes (5).
1. Systems Ecology Research Group, Luxembourg Centre for Systems Biomedicine, Esch-sur-Alzette, Luxembourg.
2. Metagenomics Support Unit, German Centre for Integrative Biodiversity Research (iDiv) Halle-Jena-Leipzig, Leipzig, Germany.
3. Department of Soil Ecology, Helmholtz Centre for Environmental Research GmbH-UFZ, Halle (Saale), Germany.
4. Bioinformatics Core, Luxembourg Centre for Systems Biomedicine, Esch-sur-Alzette, Luxembourg.
5. Systems Ecology Research Group, Luxembourg Centre for Systems Biomedicine, Esch-sur-Alzette, Luxembourg. paul.wilmes@uni.lu.
Abstract
BACKGROUND: Pathogenic microorganisms cause disease by invading, colonizing, and damaging their host. Virulence factors including bacterial toxins contribute to pathogenicity. Additionally, antimicrobial resistance genes allow pathogens to evade otherwise curative treatments. To understand causal relationships between microbiome compositions, functioning, and disease, it is essential to identify virulence factors and antimicrobial resistance genes in situ. At present, there is a clear lack of computational approaches to simultaneously identify these factors in metagenomic datasets.
RESULTS: Here, we present PathoFact, a tool for the contextualized prediction of virulence factors, bacterial toxins, and antimicrobial resistance genes with high accuracy (0.921, 0.832 and 0.979, respectively) and specificity (0.957, 0.989 and 0.994). We evaluate the performance of PathoFact on simulated metagenomic datasets and perform a comparison to two other general workflows for the analysis of metagenomic data. PathoFact outperforms all existing workflows in predicting virulence factors and toxin genes. It performs comparably to one pipeline regarding the prediction of antimicrobial resistance while outperforming the others. We further demonstrate the performance of PathoFact on three publicly available case-control metagenomic datasets representing an actual infection as well as chronic diseases in which either pathogenic potential or bacterial toxins are hypothesized to play a role. In each case, we identify virulence factors and AMR genes which differentiated between the case and control groups, thereby revealing novel gene associations with the studied diseases.
CONCLUSION: PathoFact is an easy-to-use, modular, and reproducible pipeline for the identification of virulence factors, bacterial toxins, and antimicrobial resistance genes in metagenomic data. Additionally, our tool combines the prediction of these pathogenicity factors with the identification of mobile genetic elements. This provides further depth to the analysis by considering the genomic context of the pertinent genes. Furthermore, PathoFact's modules for virulence factors, toxins, and antimicrobial resistance genes can be applied independently, thereby making it a flexible and versatile tool. PathoFact, its models, and databases are freely available at https://pathofact.lcsb.uni.lu. Video abstract.
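For readers less familiar with the two metrics reported in the RESULTS, the following is a minimal sketch of how accuracy and specificity are conventionally computed from a binary confusion matrix. It is not taken from PathoFact itself: the function names and the example counts are hypothetical, chosen only to illustrate the standard definitions.

```python
# Illustrative only: standard definitions of accuracy and specificity.
# The function names and the counts below are hypothetical and are not
# drawn from PathoFact or its evaluation data.

def accuracy(tp: int, tn: int, fp: int, fn: int) -> float:
    """Fraction of all predictions that are correct."""
    total = tp + tn + fp + fn
    if total == 0:
        raise ValueError("confusion matrix is empty")
    return (tp + tn) / total


def specificity(tn: int, fp: int) -> float:
    """Fraction of true negatives among all actual negatives."""
    negatives = tn + fp
    if negatives == 0:
        raise ValueError("no negative examples")
    return tn / negatives


# Hypothetical counts for one prediction class (e.g. toxin vs. non-toxin).
example = {"tp": 40, "tn": 50, "fp": 5, "fn": 5}

example_accuracy = accuracy(**example)
example_specificity = specificity(example["tn"], example["fp"])

print(f"accuracy:    {example_accuracy:.3f}")
print(f"specificity: {example_specificity:.3f}")
```

A high specificity, as reported for all three prediction classes, means that few genes lacking the property are wrongly flagged; it says nothing by itself about how many true factors are missed, which is why accuracy is reported alongside it.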
Authors: Laura de Nies; Susheel Bhanu Busi; Mina Tsenkova; Rashi Halder; Elisabeth Letellier; Paul Wilmes Journal: Nat Commun Date: 2022-04-28 Impact factor: 17.694
Authors: Elena G Biosca; José Francisco Català-Senent; Àngela Figàs-Segura; Edson Bertolini; María M López; Belén Álvarez Journal: Viruses Date: 2021-12-17 Impact factor: 5.048
Authors: Jing Yang; Mohammed Eslami; Yi-Pei Chen; Mayukh Das; Dongmei Zhang; Shaorong Chen; Alexandria-Jade Roberts; Mark Weston; Angelina Volkova; Kasra Faghihi; Robbie K Moore; Robert C Alaniz; Alice R Wattam; Allan Dickerman; Clark Cucinell; Jarred Kendziorski; Sean Coburn; Holly Paterson; Osahon Obanor; Jason Maples; Stephanie Servetas; Jennifer Dootz; Qing-Ming Qin; James E Samuel; Arum Han; Erin J van Schaik; Paul de Figueiredo Journal: Proc Natl Acad Sci U S A Date: 2022-04-01 Impact factor: 12.779
Authors: Advait Balaji; Bryce Kille; Anthony D Kappell; Gene D Godbold; Madeline Diep; R A Leo Elworth; Zhiqin Qian; Dreycey Albin; Daniel J Nasko; Nidhi Shah; Mihai Pop; Santiago Segarra; Krista L Ternus; Todd J Treangen Journal: Genome Biol Date: 2022-06-20 Impact factor: 17.906