| Literature DB >> 34374563 |
Sruti DebRoy1, Misu Sanson2, Brittany Shah2, Shirjana Regmi2, Luis Alberto Vega2, Chioma Odo1, Pranoti Sahasrabhojane1, Allison McGeer3,4, Gregory J Tyrrell5,6, Nahuel Fittipaldi4,7, Samuel A Shelburne1,8,9, Anthony R Flores2,9.
Abstract
Clonal replacement is a major driver for changes in bacterial disease epidemiology. Recently, it has been proposed that episodic emergence of novel, hypervirulent clones of group A Streptococcus (GAS) results from acquisition of a 36-kb DNA region leading to increased expression of the cytotoxins Nga (NADase) and SLO (streptolysin O). We previously described a gene fusion event involving the gene encoding the GAS M protein (emm) and an adjacent M-like protein (enn) in the emm4 GAS population, a GAS emm type that lacks the hyaluronic acid capsule. Using whole-genome sequencing of a temporally and geographically diverse set of 1,126 isolates, we discovered that the North American emm4 GAS population has undergone clonal replacement with emergent GAS strains completely replacing historical isolates by 2017. Emergent emm4 GAS strains contained a handful of small genetic variations, including the emm-enn gene fusion, and showed a marked in vitro growth defect compared to historical strains. In contrast to other previously described GAS clonal replacement events, emergent emm4 GAS strains were not defined by acquisition of exogenous DNA and had no significant increase in transcript levels of nga and slo toxin genes via RNA sequencing and quantitative real-time PCR analysis relative to historic strains. Despite the in vitro growth differences, emergent emm4 GAS strains were hypervirulent in mice and ex vivo growth in human blood compared to historical strains. Thus, these data detail the emergence and dissemination of a hypervirulent acapsular GAS clone defined by small, endogenous genetic variation, thereby defining a novel model for GAS strain replacement. IMPORTANCE Severe invasive infections caused by group A Streptococcus (GAS) result in substantial morbidity and mortality in children and adults worldwide. Previously, GAS clonal strain replacement has been attributed to acquisition of exogenous DNA leading to novel virulence gene acquisition or increased virulence gene expression. Our study of type emm4 GAS identified emergence of a hypervirulent GAS clade defined by variation in endogenous DNA content and lacking augmented toxin gene expression relative to replaced strains. These findings expand our understanding of the molecular mechanisms underlying bacterial clonal emergence.Entities:
Keywords: M type; evolution; exotoxin; genomics; group A Streptococcus; population genetics; virulence
Year: 2021 PMID: 34374563 PMCID: PMC8409732 DOI: 10.1128/mSystems.00495-21
Source DB: PubMed Journal: mSystems ISSN: 2379-5077 Impact factor: 6.496
Description of strains used in this study
| Country | State/region | Years | No. of strains | Disease type | Reference(s) |
|---|---|---|---|---|---|
| Canada | Multiple | 1993−2018 | 221 | Invasive | This study |
| United Kingdom | NA | 2014 | 33 | Invasive/scarlet fever |
|
| United States | Multiple | 1998−2017 | 799 | Invasive | |
| United States | Texas | 2014−2017 | 74 | Invasive/pharyngeal |
Strains selected from a total of 884 strains with representatives from all 10 Canadian provinces.
NA, not available.
Total of 10 ABCs collection sites (https://www.cdc.gov/abcs/index.html).
All GAS isolates beginning in 2015 were sequenced by the Centers for Disease Control and Prevention Streptococcal Laboratory with data publicly available at NCBI (BioProject accession number PRJNA395240).
FIG 1Maximum likelihood phylogenetic reconstruction of 1,126 emm4 GAS disease isolates from North America and the United Kingdom. (A) Unrooted neighbor-joining phylogenetic tree based on 12,436 biallelic SNPs relative to the emm4 reference strain Duke (GenBank accession number CP031770) following correction for recombination (ClonalFrameML). Branch tips colored to indicate individual emm4 strain year of isolation and shape for subclade designation (BAPS). The dashed circle indicates single-locus variant (ST38) strains from the main (ST39) emm4 population. (B) Progressive replacement of historical (SC1) strains by emergent (SC3) over the 22-year period. The total number of isolates and geographic origin are indicated below bar graph. The red value for 2014 indicates that 32 isolates from the United Kingdom were included in the analysis.
FIG 2Distribution of DNases, exotoxins, and macrolide-lincosamide-streptogramin (MLS) resistance within the emm4 population. A rooted maximum likelihood (ML) phylogram of 1,126 emm4 GAS disease strains is shown. Branch tip color and shape indicate year and subclade designation as in Fig. 1 and the associated legend. Subclades (SC1 to 3) are indicated by shaded boxes. DNase, exotoxin, and antimicrobial resistance gene content is indicated by vertical bars to the right of the ML phylogram and defined in the legend headings. Asterisks indicate nodes representing strains with completed genomes (see Table S3 in the supplemental material).
Select virulence genes/loci with changes differentiating emergent and historical clades
| Virulence gene(s) | Type | Locus | No. of strains (%) with virulence gene | |
|---|---|---|---|---|
| Emergent (SC3) ( | Historical (SC1) ( | |||
| R | D1F63_08970; gene fusion between | 614 (100) | 0 (0.0) | |
| SNP | D1F63_09065; single amino acid change (threonine to isoleucine) in regulator of SpeB, RopB | 532 (86.6) | 1 (0.2) | |
|
| Indel | D1F63_02180; single nucleotide insertion leading to frameshift in response regulator, SilA, of the Sil bacteriocin signaling system | 614 (100) | 29 (7.0) |
|
| SNP | D1F63_03645; nonsense mutation in | 602 (98.0) | 2 (0.5) |
| SNP | D1F63_08980; single amino acid change (alanine to valine) in stand-alone regulator Mga | 137 (22.5) | 376 (87.0) | |
Type of chromosomal change: nucleotide insertion/deletion (indel), recombination (R), single nucleotide polymorphism (SNP).
Gene locus in the emm4 GAS reference strain Duke (GenBank accession number CP031770).
FIG 3Highly polymorphic genetic loci identified within the emm4 GAS population after exclusion of recombinant loci. Genes of known or predicted regulators (red) are indicated.
Frequency of mutation in known or predicted transcriptional regulators in emm4 GAS population and by subclade
| Gene | Locus tag | Description | Total | Total | SC3 | SC1 | ||||
|---|---|---|---|---|---|---|---|---|---|---|
| Mutations | FS/NS/SYN | Chi-square | ||||||||
|
| D1F63_01590 | Sensor histidine kinase | 157 | 48/106/3 | 2,635 | 0.0E+00 | 220 (19.5) | 88 (14.4) | 119 (28.5) | <0.001 |
|
| D1F63_09080 | Rgg family transcriptional regulator | 103 | 17/86/0 | 2,056 | 0.0E+00 | 145 (12.9) | 82 (13.4) | 51 (12.2) | NS |
|
| D1F63_00785 | HTH, Mga domain-containing protein | 82 | 29/49/4 | 652 | 1.72E−140 | 137 (12.2) | 55 (9.0) | 73 (17.5) | <0.001 |
|
| D1F63_08980 | Multiple gene activator, Mga | 40 | 4/34/2 | 108 | 4.3E−22 | 114 (10.1) | 95 (15.5) |
| <0.001 |
|
| D1F63_01585 | DNA-binding response regulator | 24 | 0/23/1 | 106 | 1.1E−21 | 28 (2.5) | 10 (1.6) | 16 (3.8) | NS |
|
| D1F63_01190 | DNA-binding response regulator | 20 | 7/13/0 | 61 | 9.6E−12 | 33 (2.9) | 22 (3.5) |
| NS |
|
| D1F63_03645 | HTH, Mga domain-containing protein | 30 | 11/12/7 | 53 | 4.8E−10 | 85 (7.5) |
| 68 (16.3) | <0.001 |
|
| D1F63_01180 | Sensor histidine kinase | 25 | 13/10/2 | 41 | 2.9E−7 | 118 (10.5) | 57 (9.3) |
| NS |
| D1F63_07540 | HTH, Mga domain-containing protein | 24 | 2/17/5 | 29 | 1.1E−4 | 38 (3.4) |
|
| NS | |
Identifier in the reference genome Duke (CP031770).
Putative function and/or conserved functional domains.
Number of unique variations (mutations) in the given gene derived from the emm4 population (n = 1,127).
Number of mutations defined as frameshift or stop (FS), nonsynonymous (NS), or synonymous (SYN).
Chi-square statistic based on total number of variations in the core genome (n = 10,594).
P value based on chi-square test for the total emm4 population following correction for multiple comparisons (Bonferroni).
Number of strains in the emm4 population (total emm4) with at least one of the defined gene mutations.
Number of strains in emergent or historical clades (excluding ST38; n = 16) with at least one of the defined gene mutations. Italic values indicate that the indicated gene was not significantly highly polymorphic in the respective population (Table S5).
P value (Fisher’s exact) comparing proportion of strains with at least one mutation in emergent versus historical clades following correction for multiple comparisons (Bonferroni).
Excludes the clade-defining mutation T104I found in emergent/clade 2 strains.
Excludes the clade-defining mutation A27V found in historical/clade 1 strains.
Excludes the clade-defining nonsense mutation found in emergent/clade 2 strains and a synonymous SNP (n = 289 strains) in historical/clade 1 strains.
Excludes a synonymous SNP (n = 492) occurring in both clades.
FIG 4Emergent emm4 GAS strains have altered in vitro growth and no increased SLO toxin production compared to historical strains. (A) Growth curves comparing historical (blue squares) to emergent (red circles) clade emm4 GAS strains in nutrient-rich (THY; solid) or glucose-deficient (C medium; open) conditions. Error bars represent the standard deviations of four strains grown in biological triplicate. Dashed lines indicate mid-exponential (ME) phase of growth. (B) Principal-component analysis of RNA-seq comparing emergent (red) and historical (blue) clade strains. Individual strains with replicate samples are labeled and indicated using dashed circles. PC 1, principal component 1. (C) Log2 fold change in transcript levels derived from RNA-seq for selected genes derived from RNA-seq comparing four historical and four emergent emm4 GAS strains. Error bars represent standard deviations. (D) Transcript levels (qRT-PCR) of slo from emergent or historical GAS cells grown in nutrient-rich media and harvested at ME as defined in panel A. The asterisk indicates a P value of <0.005 (unpaired t test). (E) Western blot analysis using anti-Nga antibody (α-Nga) of culture supernatants (SN) from historical (blue text) or emergent (red text) emm4 GAS strains. Similar amounts of protein are demonstrated using anti-HPr. ctrl, control. (F) Transcript levels derived from qRT-PCR of slo from representative strains of emm1 (MGAS2221), emm3 (MGAS10870), emm4, emm28 (strain TSPY902), emm87 (strain TSPY1057), and emm89 (strain MSPY1). For emm4 GAS, transcript levels reflect four historical strains (in blue) and four emergent strains (in red). RNA was extracted from ME cultures as described in Methods. TaqMan qRT-PCR data for both panels D and F are means ± standard deviations of two biological replicates, with two technical replicates, done on two separate days. pos., positive; neg., negative.
FIG 5In vivo mouse and ex vivo human blood virulence assays comparing emergent and historical clade emm4 GAS strains. (A) Cumulative Kaplan-Meier survival curves of mice (n = 7 per strains) infected with four historical (blue) or four emergent (red) emm4 GAS strains (2 × 108 CFU). The P value of <0.0001 was determined by the log rank test. Survival curves following infection with other dose ranges are shown in Fig. S4 in the supplemental material. (B) Survival of emergent (red) or historical (blue) strains following exposure to human blood ex vivo. Error bars represent standard deviations of four emergent and four historical emm4 GAS strains assayed in quadruplicate using three independent donors. The two asterisks indicate P < 0.05 (Mann-Whitney U test). Individual strain comparisons are shown in Fig. S4.