| Literature DB >> 35695507 |
Mustapha M Mustapha1,2, Vatsala R Srinivasa1,2, Marissa P Griffith1,2, Shu-Ting Cho1, Daniel R Evans1, Kady Waggle1,2, Chinelo Ezeonwuka1,2, Daniel J Snyder3, Jane W Marsh1,2, Lee H Harrison1,2, Vaughn S Cooper3,4, Daria Van Tyne1,4.
Abstract
Healthcare-associated infections (HAIs) cause mortality, morbidity, and waste of health care resources. HAIs are also an important driver of antimicrobial resistance, which is increasing around the world. Beginning in November 2016, we instituted an initiative to detect outbreaks of HAIs using prospective whole-genome sequencing-based surveillance of bacterial pathogens collected from hospitalized patients. Here, we describe the diversity of bacteria sampled from hospitalized patients at a single center, as revealed through systematic analysis of bacterial isolate genomes. We sequenced the genomes of 3,004 bacterial isolates from hospitalized patients collected over a 25-month period. We identified bacteria belonging to 97 distinct species, which were distributed among 14 groups of related species. Within these groups, isolates could be distinguished from one another by both average nucleotide identity (ANI) and principal-component analysis of accessory genes (PCA-A). Core genome genetic distances and rates of evolution varied among species, which has practical implications for defining shared ancestry during outbreaks and for our broader understanding of the origins of bacterial strains and species. Finally, antimicrobial resistance genes and putative mobile genetic elements were frequently observed, and our systematic analysis revealed patterns of occurrence across the different species sampled from our hospital. Overall, this study shows how understanding the population structure of diverse pathogens circulating in a single health care setting can improve the discriminatory power of genomic epidemiology studies and can help define the processes leading to strain and species differentiation. IMPORTANCE Hospitalized patients are at increased risk of becoming infected with antibiotic-resistant organisms. We used whole-genome sequencing to survey and compare over 3,000 clinical bacterial isolates collected from hospitalized patients at a large medical center over a 2-year period. We identified nearly 100 different bacterial species, which we divided into 14 different groups of related species. When we examined how genetic relatedness differed between species, we found that different species were likely evolving at different rates within our hospital. This is significant because the identification of bacterial outbreaks in the hospital currently relies on genetic similarity cutoffs, which are often applied uniformly across organisms. Finally, we found that antibiotic resistance genes and mobile genetic elements were abundant and were shared among the bacterial isolates we sampled. Overall, this study provides an in-depth view of the genomic diversity and evolutionary processes of bacteria sampled from hospitalized patients, as well as genetic similarity estimates that can inform hospital outbreak detection and prevention efforts.Entities:
Keywords: antimicrobial resistance; bacterial evolution; horizontal gene transfer; hospital-acquired infections; pangenome; whole-genome sequencing
Year: 2022 PMID: 35695507 PMCID: PMC9238379 DOI: 10.1128/msystems.01384-21
Source DB: PubMed Journal: mSystems ISSN: 2379-5077 Impact factor: 7.324
FIG 1Species and body site distribution of 3,004 clinical bacterial isolates from hospitalized patients. Isolates were collected from a single hospital over 25 months as part of the Enhanced Detection System for Healthcare-Associated Transmission (EDS-HAT) project. Pie charts show the distribution of isolates belonging to 14 different species groups collected from different types of clinical specimens.
FIG 2Genome length and pangenome size among sampled species. (A) Distribution of genome lengths of isolates belonging to each species group, ordered from shortest to longest median genome length. Vertical lines show median values. (B) Pangenome collection curves for up to 250 genomes from genera containing multiple species and with at least 50 genomes collected. Pangenomes were generated by Roary with an 80% protein identity cutoff. (C) Pangenome collection curves for up to 250 genomes from species with at least 40 genomes collected. Pangenomes were generated by Roary with an 95% protein identity cutoff. Curves show the mean pan-genome size and shading shows the standard deviation.
FIG 3Average nucleotide identity (ANI) and principal-component analysis of accessory genes (PCA-A) distinguish between and within species. (A) Phylogeny and pairwise ANI values for Citrobacter spp. sampled by EDS-HAT. Gray shading indicates ANI values >95%, with darker shading showing higher identity. (B) PCA-A plot for Citrobacter species with >2 isolates. (C) Pairwise ANI distribution of S. marcescens isolate genomes, showing pairwise ANI comparisons between isolates in different clades that fall below the species cutoff (95% ANI, vertical dashed line). (D) PCA-A plot for S. marcescens isolates, showing clear separation of five distinct clades. (E-G) PCA-A plots for dominant sequence types (STs) of C. difficile (E), E. faecium (F), and S. aureus (G).
FIG 4Pairwise SNP distances and genome evolution vary between species. (A) Comparison of within-patient, within-cluster, and between-patient single nucleotide polymorphisms (SNPs) for select species. Pairwise comparisons are shown for all isolate pairs belonging to the same sequence type (ST) within each species. Boxes show the median, 25th and 75th percentiles. (B) Genome evolution rates for dominant STs within C. difficile (CD), vancomycin-resistant E. faecium (VRE), methicillin-resistant S. aureus (MRSA) and P. aeruginosa (PSA). Isolates belonging to the four largest STs (three largest for MRSA) of each species were considered, and nucleotide substitution rate (SNPs/genome/year) was calculated for each ST separately. Individual data points are labeled with the corresponding ST, and boxes show the median, 25th and 75th percentiles. (C) Recombination events per mutation (R/Theta) for select species. Each data point represents a distinct ST, and data are grouped by species. STs with at least 10 isolates are shown. Boxes show the median, 25th and 75th percentiles. PRO=P. mirabilis, SER=S. marcescens, KLP=K. pneumoniae, EC=E. coli, ACIN=A. baumannii.
FIG 5Acquired antimicrobial resistance gene abundance and diversity. (A) Prevalence of resistance genes found in more than one species group. Genes are grouped by antibiotic class, and gray shading shows the prevalence of each gene within and across each group. Darker shading indicates higher prevalence. ACIN = Acinetobacter spp.; KL = Klebsiella spp.; CB = Citrobacter spp.; EC = E. coli; PRV = Providencia spp.; PR = Proteus spp.; SER = Serratia spp.; PSA = P. aeruginosa; PSB = Pseudomonas spp.; STEN = Stenotrophomonas spp.; BC = Burkholderia spp.; VRE = vancomycin-resistant Enterococcus spp.; MRSA = methicillin-resistant S. aureus; CD = C. difficile. (B) Resistance gene co-occurrence. Relative frequency versus number of genomes is plotted for pairs of resistance genes that co-occur at ≥ 50% relative frequency. Blue dots indicate AMR genes in the same drug class, while orange dots indicate genes in different classes. The size of each dot corresponds to the number of different species groups found to carry each pair. AMR gene pairs found in ≥ 4 different species groups are labeled. (C) Distribution of extended-spectrum beta-lactamase (ESBL) and carbapenemase enzymes among E. coli and Klebsiella spp. isolates.
FIG 6Mobile genetic element (MGE) distribution and cargo. (A) Clusters of putative MGEs identified in 3,004 study isolate genomes. Nodes within each cluster correspond to bacterial isolates, and are color coded by species group (color key provided in panel B). (B) Distribution of isolates in the entire data set (left) versus isolates encoding one or more putative MGEs (right). (C) Distribution of putative MGEs resembling plasmid, IS/transposon, or prophage/ICE sequences, determined by nucleotide sequence comparisons and manual curation. (D) Distribution of antimicrobial resistance (AMR) genes detected among 186 putative MGEs. (E) Distribution of clusters of orthologous groups of proteins (COG) categories of MGE genes with COG categories assigned.