Literature DB >> 26115845

Assessing host-specificity of Escherichia coli using a supervised learning logic-regression-based analysis of single nucleotide polymorphisms in intergenic regions.

Shuai Zhi1, Qiaozhi Li2, Yutaka Yasui3, Thomas Edge4, Edward Topp5, Norman F Neumann6.   

Abstract

Host specificity in E. coli is widely debated. Herein, we used supervised learning logic-regression-based analysis of intergenic DNA sequence variability in E. coli in an attempt to identify single nucleotide polymorphism (SNP) biomarkers of E. coli that are associated with natural selection and evolution toward host specificity. Seven-hundred and eighty strains of E. coli were isolated from 15 different animal hosts. We utilized logic regression for analyzing DNA sequence data of three intergenic regions (flanked by the genes uspC-flhDC, csgBAC-csgDEFG, and asnS-ompF) to identify genetic biomarkers that could potentially discriminate E. coli based on host sources. Across 15 different animal hosts, logic regression successfully discriminated E. coli based on animal host source with relatively high specificity (i.e., among the samples of the non-target animal host, the proportion that correctly did not have the host-specific marker pattern) and sensitivity (i.e., among the samples from a given animal host, the proportion that correctly had the host-specific marker pattern), even after fivefold cross validation. Permutation tests confirmed that for most animals, host specific intergenic biomarkers identified by logic regression in E. coli were significantly associated with animal host source. The highest level of biomarker sensitivity was observed in deer isolates, with 82% of all deer E. coli isolates displaying a unique SNP pattern that was 98% specific to deer. Fifty-three percent of human isolates displayed a unique biomarker pattern that was 98% specific to humans. Twenty-nine percent of cattle isolates displayed a unique biomarker that was 97% specific to cattle. Interestingly, even within a related host group (i.e., Family: Canidae [domestic dogs and coyotes]), highly specific SNP biomarkers (98% and 99% specificity for dog and coyotes, respectively) were observed, with 21% of dog E. coli isolates displaying a unique dog biomarker and 61% of coyote isolates displaying a unique coyote biomarker. Application of a supervised learning method, such as logic regression, to DNA sequence analysis at certain intergenic regions demonstrates that some E. coli strains may evolve to become host-specific.
Copyright © 2015 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  E. coli; Host specificity; Intergenic sequences; Logic regression

Mesh:

Substances:

Year:  2015        PMID: 26115845     DOI: 10.1016/j.ympev.2015.06.007

Source DB:  PubMed          Journal:  Mol Phylogenet Evol        ISSN: 1055-7903            Impact factor:   4.286


  5 in total

1.  Automatic identification of variables in epidemiological datasets using logic regression.

Authors:  Matthias W Lorenz; Negin Ashtiani Abdi; Frank Scheckenbach; Anja Pflug; Alpaslan Bülbül; Alberico L Catapano; Stefan Agewall; Marat Ezhov; Michiel L Bots; Stefan Kiechl; Andreas Orth
Journal:  BMC Med Inform Decis Mak       Date:  2017-04-13       Impact factor: 2.796

Review 2.  Naturalized Escherichia coli in Wastewater and the Co-evolution of Bacterial Resistance to Water Treatment and Antibiotics.

Authors:  Daniel Yu; Kanghee Ryu; Shuai Zhi; Simon J G Otto; Norman F Neumann
Journal:  Front Microbiol       Date:  2022-05-30       Impact factor: 6.064

Review 3.  Whole-Genome Sequencing of Bacterial Pathogens: the Future of Nosocomial Outbreak Analysis.

Authors:  Scott Quainoo; Jordy P M Coolen; Sacha A F T van Hijum; Martijn A Huynen; Willem J G Melchers; Willem van Schaik; Heiman F L Wertheim
Journal:  Clin Microbiol Rev       Date:  2017-10       Impact factor: 26.132

Review 4.  Factors Obscuring the Role of E. coli from Domestic Animals in the Global Antimicrobial Resistance Crisis: An Evidence-Based Review.

Authors:  Fernanda Loayza; Jay P Graham; Gabriel Trueba
Journal:  Int J Environ Res Public Health       Date:  2020-04-28       Impact factor: 3.390

5.  Evidence of Naturalized Stress-Tolerant Strains of Escherichia coli in Municipal Wastewater Treatment Plants.

Authors:  Shuai Zhi; Graham Banting; Qiaozhi Li; Thomas A Edge; Edward Topp; Mykola Sokurenko; Candis Scott; Shannon Braithwaite; Norma J Ruecker; Yutaka Yasui; Tim McAllister; Linda Chui; Norman F Neumann
Journal:  Appl Environ Microbiol       Date:  2016-08-30       Impact factor: 4.792

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.