Literature DB >> 35809965

Assessing putative bias in prediction of anti-microbial resistance from real-world genotyping data under explicit causal assumptions.

Mattia Prosperi1, Christina Boucher2, Jiang Bian3, Simone Marini4.   

Abstract

Whole genome sequencing (WGS) is quickly becoming the customary means for identification of antimicrobial resistance (AMR) due to its ability to obtain high resolution information about the genes and mechanisms that are causing resistance and driving pathogen mobility. By contrast, traditional phenotypic (antibiogram) testing cannot easily elucidate such information. Yet development of AMR prediction tools from genotype-phenotype data can be biased, since sampling is non-randomized. Sample provenience, period of collection, and species representation can confound the association of genetic traits with AMR. Thus, prediction models can perform poorly on new data with sampling distribution shifts. In this work -under an explicit set of causal assumptions- we evaluate the effectiveness of propensity-based rebalancing and confounding adjustment on antibiotic resistance prediction using genotype-phenotype AMR data from the Pathosystems Resource Integration Center (PATRIC). We select bacterial genotypes (encoded as k-mer signatures, i.e., DNA fragments of length k), country, year, species, and AMR phenotypes for the tetracycline drug class, preparing test data with recent genomes coming from a single country. We test boosted logistic regression (BLR) and random forests (RF) with/without bias-handling. On 10,936 instances, we find evidence of species, location and year imbalance with respect to the AMR phenotype. The crude versus bias-adjusted change in effect of genetic signatures on AMR varies but only moderately (selecting the top 20,000 out of 40+ million k-mers). The area under the receiver operating characteristic (AUROC) of the RF (0.95) is comparable to that of BLR (0.94) on both out-of-bag samples from bootstrap and the external test (n = 1085), where AUROCs do not decrease. We observe a 1 %-5 % gain in AUROC with bias-handling compared to the sole use of genetic signatures. In conclusion, we recommend using causally-informed prediction methods for modeling real-world AMR data; however, traditional adjustment or propensity-based methods may not provide advantage in all use cases and further methodological development should be sought.
Copyright © 2022 Elsevier B.V. All rights reserved.

Entities:  

Keywords:  Antimicrobial resistance; Biomedical informatics; Causal methods; Directed acyclic graph; Epidemiology; Explainability; Interpretability; Propensity score

Mesh:

Substances:

Year:  2022        PMID: 35809965      PMCID: PMC9425730          DOI: 10.1016/j.artmed.2022.102326

Source DB:  PubMed          Journal:  Artif Intell Med        ISSN: 0933-3657            Impact factor:   7.011


  21 in total

1.  Using Big Data to Emulate a Target Trial When a Randomized Trial Is Not Available.

Authors:  Miguel A Hernán; James M Robins
Journal:  Am J Epidemiol       Date:  2016-03-18       Impact factor: 4.897

2.  Portable nanopore analytics: are we there yet?

Authors:  Marco Oliva; Franco Milicchio; Kaden King; Grace Benson; Christina Boucher; Mattia Prosperi
Journal:  Bioinformatics       Date:  2020-08-15       Impact factor: 6.937

3.  The PATRIC Bioinformatics Resource Center: expanding data and analysis capabilities.

Authors:  James J Davis; Alice R Wattam; Ramy K Aziz; Thomas Brettin; Ralph Butler; Rory M Butler; Philippe Chlenski; Neal Conrad; Allan Dickerman; Emily M Dietrich; Joseph L Gabbard; Svetlana Gerdes; Andrew Guard; Ronald W Kenyon; Dustin Machi; Chunhong Mao; Dan Murphy-Olson; Marcus Nguyen; Eric K Nordberg; Gary J Olsen; Robert D Olson; Jamie C Overbeek; Ross Overbeek; Bruce Parrello; Gordon D Pusch; Maulik Shukla; Chris Thomas; Margo VanOeffelen; Veronika Vonstein; Andrew S Warren; Fangfang Xia; Dawen Xie; Hyunseung Yoo; Rick Stevens
Journal:  Nucleic Acids Res       Date:  2020-01-08       Impact factor: 16.971

4.  Estimating Individual Treatment Effect in Observational Data Using Random Forest Methods.

Authors:  Min Lu; Saad Sadiq; Daniel J Feaster; Hemant Ishwaran
Journal:  J Comput Graph Stat       Date:  2018-02-01       Impact factor: 2.302

5.  A comparison of 12 algorithms for matching on the propensity score.

Authors:  Peter C Austin
Journal:  Stat Med       Date:  2013-10-07       Impact factor: 2.373

Review 6.  Using Genomics to Track Global Antimicrobial Resistance.

Authors:  Rene S Hendriksen; Valeria Bortolaia; Heather Tate; Gregory H Tyson; Frank M Aarestrup; Patrick F McDermott
Journal:  Front Public Health       Date:  2019-09-04

7.  MEGARes 2.0: a database for classification of antimicrobial drug, biocide and metal resistance determinants in metagenomic sequence data.

Authors:  Enrique Doster; Steven M Lakin; Christopher J Dean; Cory Wolfe; Jared G Young; Christina Boucher; Keith E Belk; Noelle R Noyes; Paul S Morley
Journal:  Nucleic Acids Res       Date:  2020-01-08       Impact factor: 16.971

Review 8.  Microbial Resistance Movements: An Overview of Global Public Health Threats Posed by Antimicrobial Resistance, and How Best to Counter.

Authors:  Sameer Dhingra; Nor Azlina A Rahman; Ed Peile; Motiur Rahman; Massimo Sartelli; Mohamed Azmi Hassali; Tariqul Islam; Salequl Islam; Mainul Haque
Journal:  Front Public Health       Date:  2020-11-04

9.  Metagenomics reveals impact of geography and acute diarrheal disease on the Central Indian human gut microbiome.

Authors:  Tanya M Monaghan; Tim J Sloan; Stephen R Stockdale; Adam M Blanchard; Richard D Emes; Mark Wilcox; Rima Biswas; Rupam Nashine; Sonali Manke; Jinal Gandhi; Pratishtha Jain; Shrejal Bhotmange; Shrikant Ambalkar; Ashish Satav; Lorraine A Draper; Colin Hill; Rajpal Singh Kashyap
Journal:  Gut Microbes       Date:  2020-05-27

10.  CARD 2020: antibiotic resistome surveillance with the comprehensive antibiotic resistance database.

Authors:  Brian P Alcock; Amogelang R Raphenya; Tammy T Y Lau; Kara K Tsang; Mégane Bouchard; Arman Edalatmand; William Huynh; Anna-Lisa V Nguyen; Annie A Cheng; Sihan Liu; Sally Y Min; Anatoly Miroshnichenko; Hiu-Ki Tran; Rafik E Werfalli; Jalees A Nasir; Martins Oloni; David J Speicher; Alexandra Florescu; Bhavya Singh; Mateusz Faltyn; Anastasia Hernandez-Koutoucheva; Arjun N Sharma; Emily Bordeleau; Andrew C Pawlowski; Haley L Zubyk; Damion Dooley; Emma Griffiths; Finlay Maguire; Geoff L Winsor; Robert G Beiko; Fiona S L Brinkman; William W L Hsiao; Gary V Domselaar; Andrew G McArthur
Journal:  Nucleic Acids Res       Date:  2020-01-08       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.