| Literature DB >> 34787444 |
Sruti DebRoy1, William C Shropshire1, Chau Nguyen Tran1, Haiping Hao2, Marc Gohel1, Jessica Galloway-Peña3,4,5,6, Blake Hanson7,4, Anthony R Flores8,4, Samuel A Shelburne1,3,4.
Abstract
Although prokaryotic DNA methylation investigations have long focused on immunity against exogenous DNA, it has been recently recognized that DNA methylation impacts gene expression and phase variation in Streptococcus pneumoniae and Streptococcus suis. A comprehensive analysis of DNA methylation is lacking for beta-hemolytic streptococci, and thus we sought to examine DNA methylation in the major human pathogen group A Streptococcus (GAS). Using a database of 224 GAS genomes encompassing 80 emm types, we found that nearly all GAS strains encode a type I restriction modification (RM) system that lacks the hsdS' alleles responsible for impacting gene expression in S. pneumoniae and S. suis. The GAS type I system is located on the core chromosome, while sporadically present type II orphan methyltransferases were identified on prophages. By combining single-molecule real-time (SMRT) analyses of 10 distinct emm types along with phylogenomics of 224 strains, we were able to assign 13 methylation patterns to the GAS population. Inactivation of the type I RM system, occurring either naturally through phage insertion or through laboratory-induced gene deletion, abrogated DNA methylation detectable via either SMRT or MinION sequencing. Contrary to a previous report, inactivation of the type I system did not impact transcript levels of the gene (mga) encoding the key multigene activator protein (Mga) or Mga-regulated genes. Inactivation of the type I system significantly increased plasmid transformation rates. These data delineate the breadth of the core chromosomal type I RM system in the GAS population and clarify its role in immunity rather than impacting Mga regulon expression. IMPORTANCE The advent of whole-genome approaches capable of detecting DNA methylation has markedly expanded appreciation of the diverse roles of epigenetic modification in prokaryotic physiology. For example, recent studies have suggested that DNA methylation impacts gene expression in some streptococci. The data described herein are from the first systematic analysis of DNA methylation in a beta-hemolytic streptococcus and one of the few analyses to comprehensively characterize DNA methylation across hundreds of strains of the same bacterial species. We clarify that DNA methylation in group A Streptococcus (GAS) is primarily due to a type I restriction modification (RM) system present in the core genome and does not impact mga-regulated virulence gene expression, but does impact immunity against exogenous DNA. The identification of the DNA motifs recognized by each type I RM system may assist with optimizing methods for GAS genetic manipulation and help us understand how bacterial pathogens acquire exogenous DNA elements.Entities:
Keywords: Streptococcus pyogenes; immunity; type I RM system
Mesh:
Substances:
Year: 2021 PMID: 34787444 PMCID: PMC8597746 DOI: 10.1128/mSphere.00799-21
Source DB: PubMed Journal: mSphere ISSN: 2379-5042 Impact factor: 4.389
FIG 1Overview of GAS type I RM systems. (A) A schematic figure shows the location of the type I RM system on the GAS chromosome. Locations of selected type II RM systems relative to dnaA are also shown, and the respective GAS strain and emm type are indicated in parentheses. (B) Alignment of HsdS sequences from the 10 most common target recognition domain (TRD) patterns identified in our GAS population. The percentage of amino acid identity across each position was calculated using Jalview (72) and is displayed (0 to 100%) on the y axis of the histogram. The x axis indicates amino acid positions. (C and D) Unrooted, neighborhood-joining trees based on (C) full-length HsdS sequences and (D) TRD sequence alignment from the 224 GAS strains in this study. GAS emm types dominating each HsdS cluster are indicated. The number of strains (C) and TRDs (D) in each cluster is indicated within the colored circles and reflected by circle size.
Comparison of HsdR and HsdM between GAS and other streptococci
| % of identity or similarity to GAS HsdR or HsdM | ||||||
|---|---|---|---|---|---|---|
| VGS |
| |||||
| Identity | Similarity | Identity | Similarity | Identity | Similarity | |
| HsdR | 93 | 96 | 70 | 70 | 64 | 78 |
| HsdM | 97 | 99 | 74 | 86 | 47 | 86 |
The percentages of identity and similarity indicated are at the amino acid level. VGS, viridans group streptococci.
Homologs are present in only a few strains.
FIG 2Correlation between GAS TRD alleles and phylogeny inferred from core gene alignment. The maximum likelihood phylogenetic tree was created from a core gene alignment of the 224 GAS genomes. Inner and outer circles are color coordinated to indicate the TRD present in positions 1 and 2 of HsdS, respectively. The half-site of the target recognized by each TRD is listed in parentheses within the legend. Closed stars indicate strains that were sequenced by PacBio in this study. Open stars are GAS strains for which PacBio data are available on REBASE (73). Major GAS emm types are indicated, and isolates of the same emm type tend to cluster, with the exception of emm12. Black dots on internal nodes indicate >95% bootstrap support. The single black star along the inner circle denotes a GAS strain that clusters with the emm1 strains but belongs to emm204.
FIG 3Methylation events detected in various GAS strains. Comparison of modification quality value (modQV) histograms indicating methylation events detected by PacBio sequencing between (A) emm1 strains MGAS2221 and M1-SC and (B) emm4 strains SAS-M4-Duke and Duke Large. modQV values are indicated on the x axis, and the numbers of bases are displayed on the y axis. The lines are color-coded for each nucleotide.
Bacterial strains used in this study
| Strain | Description | Reference |
|---|---|---|
| MGAS2221 | Invasive clinical isolate, |
|
| M1-SC-1 | Clinical isolate, |
|
| MSPY1 | Clinical isolate, |
|
| SAS-M4-Duke | Clinical isolate, |
|
| RLGH | Clinical isolate, |
|
| Duke Large | Invasive clinical isolate, coisolated with SAS-M4-Duke, inactive CovS |
|
| Duke Δ | Isogenic mutant of SAS-M4-Duke, CovS inactive |
|
| MGAS10870 | Clinical isolate, |
|
| TSPY416 |
| This study |
| TSPY125 | Clinical isolate, | This study |
| TSPY155 | Clinical isolate, |
|
| TSPY453 | Clinical isolate, |
|
| TSPY1309 | Clinical isolate, | This study |
| TSPY136 | Clinical isolate, | This study |
| MGAS6180 | Clinical isolate, |
|
| MGAS6180 Δ | Isogenic | This study |
| TSPY1057 | Clinical isolate, |
|
| TSPY1057 Δ | Isogenic | This study |
FIG 4Characterization of impact of hsdM inactivation on GAS methylation, transformation, and gene expression. (A) Comparison of methylation events detected by the Caller neural network model using ONT long-read sequencing data between wild-type (blue dots) and ΔhsdM mutant (orange dots) strains of MGAS6180 (emm28) and TSPY1057 (emm87), respectively. The dotted red line in each subpanel indicates a probability of 0.5. Welch’s t test of independent samples with a multiple-sample Bonferroni correction was performed for each wild-type strain and respective ΔhsdM group, with **** indicating statistically significant difference between the strains at P ≤ 0.0001. (B) Comparison of transformation efficiency between TSPY1057 and its ΔhsdM mutant. GAS cells were transformed on three separate occasions, and the numbers of colonies detected are shown as log10 CFU/ml per μg DNA. ****, statistically significant difference between the strains at P ≤ 0.0001 by unpaired t test (C and D) TaqMan qRT-PCR analysis of the impact of hsdM deletion on transcript levels of the gene encoding the multigene activator (mga) and Mga-regulated genes in (C) MGAS6180 and (D) TSPY1057. TaqMan qRT-PCR data are means ± standard deviations from two biological replicates, with two technical replicates, done on 2 separate days.
Methylation identified in the genomes of GAS strains from different emm types
| GAS strain | Motif string | Position modified | Type | % of motifs detected | No. of motifs detected | No. of motifs in genome | Mean modQV | |
|---|---|---|---|---|---|---|---|---|
| MGAS2221 | 1 |
| 3 | m6A | 98.0 | 337 | 344 | 79.09 |
|
| 3 | m6A | 97.7 | 336 | 344 | 77.17 | ||
| SF370 | 1 |
| 3 | m6A | 98.8 | 331 | 335 | 75.26 |
|
| 3 | m6A | 97.9 | 328 | 335 | 76.43 | ||
| M1-SC | 1 | ND | ||||||
| MGAS10870 | 3 |
| 3 | m6A | 100.0 | 524 | 524 | 513.45 |
|
| 3 | m6A | 99.8 | 523 | 524 | 493.54 | ||
| SAS-M4-Duke | 4 |
| 3 | m6A | 100.0 | 542 | 542 | 143.86 |
|
| 3 | m6A | 100.0 | 542 | 542 | 142.56 | ||
| RLGH | 4 |
| 3 | m6A | 100.0 | 542 | 542 | 307.29 |
|
| 3 | m6A | 99.8 | 541 | 542 | 299.71 | ||
| Duke Large | 4 |
| 3 | m6A | 100.0 | 542 | 542 | 228.72 |
|
| 3 | m6A | 100.0 | 542 | 542 | 225.93 | ||
| Duke Δ | 4 |
| 3 | m6A | 98.2 | 532 | 542 | 68.37 |
|
| 3 | m6A | 96.5 | 523 | 542 | 68.08 | ||
| TSPY155 | 11 |
| 4 | m6A | 99.8 | 462 | 463 | 283.43 |
|
| 3 | m6A | 99.6 | 461 | 463 | 312.70 | ||
| TSPY136 | 22 |
| 3 | m6A | 100.0 | 241 | 241 | 285.17 |
|
| 3 | m6A | 100.0 | 241 | 241 | 258.73 | ||
| MGAS6180 | 28 |
| 4 | m6A | 99.5 | 414 | 416 | 203.92 |
|
| 3 | m6A | 99.0 | 412 | 416 | 229.83 | ||
| TSPY1309 | 63 |
| 3 | m6A | 100.0 | 537 | 537 | 525.49 |
|
| 3 | m6A | 99.8 | 536 | 537 | 474.90 | ||
| TSPY416 | 68 |
| 3 | m6A | 100.0 | 513 | 513 | 270.36 |
|
| 3 | m6A | 99.8 | 512 | 513 | 259.95 | ||
| TSPY453 | 77 |
| 3 | m6A | 100.0 | 190 | 190 | 412.97 |
|
| 3 | m6A | 100.0 | 190 | 190 | 404.43 | ||
| TSPY125 | 83 |
| 3 | m6A | 98.9 | 345 | 349 | 476.50 |
|
| 2 | m6A | 98.0 | 342 | 349 | 458.08 | ||
| MSPY1 | 89 |
| 3 | m6A | 100.0 | 179 | 179 | 312.57 |
|
| 3 | m6A | 99.4 | 178 | 179 | 280.25 | ||
FIG 5Genes encoding the type I MTase have higher transcript levels than those encoding type II RM MTases. Shown are the results from analysis of transcript levels of genes encoding the MTase of the type I and type II RM systems in representative emm1 (MGAS2221) and emm3 (MGAS10870) strains. The data shown are means ± standard deviations from two biological replicates, with two technical replicates, done on 2 separate days.