| Literature DB >> 35019696 |
Ellen Ruth A Morris1, Jing Wu2, Angela I Bordin1, Sara D Lawhon2, Noah D Cohen1.
Abstract
Streptococcus equi subsp. equi (SEE) is a host-restricted equine pathogen considered to have evolved from Streptococcus equi subsp. zooepidemicus (SEZ). SEZ is promiscuous in host range and is commonly recovered from horses as a commensal. Comparison of a single strain each of SEE and SEZ using whole-genome sequencing, supplemented by PCR of selected genes in additional SEE and SEZ strains, was used to characterize the evolution of SEE. But the known genetic variability of SEZ warrants comparison of the whole genomes of multiple SEE and SEZ strains. To fill this knowledge gap, we utilized whole-genome sequencing to characterize the accessory genome elements (AGEs; i.e., elements present in some SEE strains but absent in SEZ or vice versa) and methylomes of 50 SEE and 50 SEZ isolates from Texas. Consistent with previous findings, AGEs consistently found in all SEE isolates were primarily from mobile genetic elements that might contribute to host restriction or pathogenesis of SEE. Fewer AGEs were identified in SEZ because of the greater genomic variability among these isolates. The global methylation patterns of SEE isolates were more consistent than those of the SEZ isolates. Among homologous genes of SEE and SEZ, differential methylation was identified only in genes of SEE encoding proteins with functions of quorum sensing, exopeptidase activity, and transitional metal ion binding. Our results indicate that effects of genetic mobile elements in SEE and differential methylation of genes shared by SEE and SEZ might contribute to the host specificity of SEE. IMPORTANCE Strangles, caused by the host-specific bacterium Streptococcus equi subsp. equi (SEE), is the most commonly diagnosed infectious disease of horses worldwide. Its ancestor, Streptococcus equi subsp. zooepidemicus (SEZ), is frequently isolated from a wide array of hosts, including horses and humans. A comparison of the genomes of a single strain of SEE and SEZ has been reported, but sequencing of further isolates has revealed variability among SEZ strains. Thus, the importance of this study is that it characterizes genomic and methylomic differences of multiple SEE and SEZ isolates from a common geographic region (viz., Texas). Our results affirm many of the previously described differences between the genomes of SEE and SEZ, including the role of mobile genetic elements in contributing to host restriction. We also provide the first characterization of the global methylome of Streptococcus equi and evidence that differential methylation might contribute to the host restriction of SEE.Entities:
Keywords: DNA methylation; Streptococcus equi; methylome; whole-genome sequencing
Mesh:
Year: 2022 PMID: 35019696 PMCID: PMC8754150 DOI: 10.1128/spectrum.00764-21
Source DB: PubMed Journal: Microbiol Spectr ISSN: 2165-0497
FIG 1Comparison of accessory genome elements (AGE) of SEE (n = 50) and SEZ (n = 50) genomes. The outer ring shows the ClustAGE bins that are ≥200 base pairs in size; these are ordered clockwise from the largest bin to the smallest bin and are differentiated by orange and green to define bin borders. The concentric inner bands show the distribution of AGE within each individual isolate. Bands that are blue represent SEE isolates, and bands that are red represent SEZ isolates. The central ruler of the figure indicates the cumulative size of the AGE in kilobases.
Accessory genome elements identified in all 50 SEE genomes
| RefSeq_4047 | Gene name | Region | PSORTb result | Protein |
|---|---|---|---|---|
| SEQ_0756 | ICESe1 | Cytoplasmic | Transcriptional regulator | |
| SEQ_0757 | ICESe1 | Cytoplasmic | Modification methylase PstI (EC 2.1.1.72) | |
| SEQ_0758 | ICESe1 | Cytoplasmic | Type II site-specific deoxyribonuclease | |
| SEQ_0761 | ICESe1 | Cytoplasmic | USG protein | |
| SEQ_0787 | Prophage Seq2 | Unknown | Phage integrase: site-specific recombinase | |
| SEQ_0816 | Prophage Seq2 | Unknown | Phage protein | |
| SEQ_0817 | Prophage Seq2 | Unknown | Phage protein | |
| SEQ_0818 | Prophage Seq2 | Unknown | Phage endonuclease | |
| SEQ_0819 | Prophage Seq2 | Cytoplasmic | Phage terminase | |
| SEQ_0823 | Prophage Seq2 | Cytoplasmic | Phage portal protein | |
| SEQ_0824 | Prophage Seq2 | Cytoplasmic | Prophage Clp protease-like protein | |
| SEQ_0825 | Prophage Seq2 | Cytoplasmic | Phage capsid protein | |
| SEQ_0826 | Prophage Seq2 | Cytoplasmic | Putative capsid protein (ACLAME 311) | |
| SEQ_0827 | Prophage Seq2 | Cytoplasmic | DNA packaging protein | |
| SEQ_0828 | Prophage Seq2 | Unknown | Phage protein | |
| SEQ_0829 | Prophage Seq2 | Cytoplasmic | Phage protein | |
| SEQ_0830 | Prophage Seq2 | Cytoplasmic | Phage protein | |
| SEQ_0831 | Prophage Seq2 | Cytoplasmic | Phage major tail protein | |
| SEQ_0832 | Prophage Seq2 | Unknown | Phage protein | |
| SEQ_0833 | Prophage Seq2 | Unknown | Phage protein | |
| SEQ_0835 | Prophage Seq2 | Unknown | Phage-related protein | |
| SEQ_1102 | Insertion element | Cytoplasmic | Site-specific recombinase | |
| SEQ_1231 | ICESe2 | Cytoplasmic | Hypothetical protein | |
| SEQ_1233 |
| ICESe2 | Unknown | Hypothetical protein |
| SEQ_1234 |
| ICESe2 | Unknown | Hypothetical protein |
| SEQ_1235 |
| ICESe2 | Cytoplasmic membrane | Heterodimeric efflux ABC transporter |
| SEQ_1236 |
| ICESe2 | Cytoplasmic membrane | Heterodimeric efflux ABC transporter |
| SEQ_1237 |
| ICESe2 | Cytoplasmic membrane | Duplicated ATPase component BL0693 of energizing module of predicted ECF transporter |
| SEQ_1238 |
| ICESe2 | Cytoplasmic membrane | Transmembrane component BL0694 of energizing module of predicted ECF transporter |
| SEQ_1239 |
| ICESe2 | Cytoplasmic membrane | Substrate-specific component BL0695 of predicted ECF transporter |
| SEQ_1240 |
| ICESe2 | Cytoplasmic | Hypothetical protein |
| SEQ_1241 |
| ICESe2 | Cytoplasmic | Hypothetical protein |
| SEQ_1242 |
| ICESe2 | Cytoplasmic | Polyketide synthase modules and related proteins |
| SEQ_1243 |
| ICESe2 | Cytoplasmic | 2,3-Dihydroxybenzoate-AMP ligase (EC 2.7.7.58) of siderophore biosynthesis |
| SEQ_1244 |
| ICESe2 | Cytoplasmic | 4′-Phosphopantetheinyl transferase (EC 2.7.8.-) |
| SEQ_1245 |
| ICESe2 | Cytoplasmic | Iron acquisition yersiniabactin synthesis enzyme YbtT @ thioesterase in siderophore biosynthesis gene cluster |
| SEQ_1246 |
| ICESe2 | Cytoplasmic | Iron-dependent repressor |
| SEQ_1249 | ICESe2 | Unknown | Hypothetical protein | |
| SEQ_1250 | ICESe2 | Cytoplasmic | Hypothetical protein | |
| SEQ_1252 | ICESe2 | Cytoplasmic | Hypothetical protein | |
| SEQ_1253 | ICESe2 | Cell wall/extracellular | Superfamily II DNA and RNA helicase | |
| SEQ_1254 | ICESe2 | Cytoplasmic | Hypothetical protein | |
| SEQ_1257 | ICESe2 | Cytoplasmic | FIG00645039: hypothetical protein with HTH-domain | |
| SEQ_1258 | ICESe2 | Cytoplasmic | Abortive infection protein AbiGI | |
| SEQ_1260 | ICESe2 | Unknown | Hypothetical protein | |
| SEQ_1261 | ICESe2 | Unknown | NLP/P60 family protein | |
| SEQ_1262 | ICESe2 | Cytoplasmic | Modification methylase Cfr9I (EC 2.1.1.113) | |
| SEQ_1263 | ICESe2 | Unknown | TrsE-like protein | |
| SEQ_1264 | ICESe2 | Cytoplasmic membrane | Hypothetical protein | |
| SEQ_1265 | ICESe2 | Cytoplasmic | Hypothetical protein | |
| SEQ_1266 | ICESe2 | Cytoplasmic membrane | Hypothetical protein | |
| SEQ_1267 | ICESe2 | Cytoplasmic membrane | Maff2 family protein | |
| SEQ_1268 | ICESe2 | Cytoplasmic membrane | Hypothetical protein | |
| SEQ_1269 | ICESe2 | Cytoplasmic membrane | ABC-type antimicrobial peptide transport system | |
| SEQ_1270 | ICESe2 | Cytoplasmic membrane | Hypothetical protein | |
| SEQ_1271 | ICESe2 | Cytoplasmic membrane | Hypothetical protein | |
| SEQ_1274 | ICESe2 | Cytoplasmic | Chromosome (plasmid) partitioning protein ParB | |
| SEQ_1275 | ICESe2 | Cytoplasmic membrane | Chromosome (plasmid) partitioning protein ParA | |
| SEQ_1728 |
| Prophage Seq3 | Unknown | Streptococcal pyrogenic exotoxin K (SpeK) |
| SEQ_1739 | Prophage Seq3 | Cytoplasmic membrane | Phage tail length tape-measure protein | |
| SEQ_1740 | Prophage Seq3 | Unknown | Conserved hypothetical protein - phage associated | |
| SEQ_1741 | Prophage Seq3 | Cytoplasmic | Conserved hypothetical protein - phage associated | |
| SEQ_1742 | Prophage Seq3 | Unknown | Phage major tail protein | |
| SEQ_1743 | Prophage Seq3 | Cytoplasmic | Phage major tail protein | |
| SEQ_1744 | Prophage Seq3 | Cytoplasmic membrane | Structural protein | |
| SEQ_1745 | Prophage Seq3 | Unknown | Phage protein | |
| SEQ_1746 | Prophage Seq3 | Unknown | Phage protein | |
| SEQ_1747 | Prophage Seq3 | Cytoplasmic | Phage protein | |
| SEQ_1748 | Prophage Seq3 | Cytoplasmic | Hypothetical phage protein | |
| SEQ_1749 | Prophage Seq3 | Unknown | Phage major capsid protein | |
| SEQ_1750 | Prophage Seq3 | Cytoplasmic | Phage major capsid protein | |
| SEQ_1751 | Prophage Seq3 | Unknown | FIG01114710: hypothetical protein | |
| SEQ_1755 | Prophage Seq3 | Cytoplasmic | Guanosine-3′ | |
| SEQ_1756 | Prophage Seq3 | Unknown | Hypothetical protein | |
| SEQ_1757 | Prophage Seq3 | Cytoplasmic | Phi Mu50B-like protein | |
| SEQ_1758 | Prophage Seq3 | Cytoplasmic | Phage portal protein | |
| SEQ_1762 | Prophage Seq3 | Unknown | Pleiotropic regulator of exopolysaccharide synthesis | |
| SEQ_1763 | Prophage Seq3 | Cytoplasmic | Chromosome segregation ATPase | |
| SEQ_2036 |
| Prophage Seq4 | Extracellular | Streptococcal pyrogenic exotoxin H (SpeH); toximoron (superantigen) @ exotoxin |
| SEQ_2037 |
| Prophage Seq4 | Extracellular | Exotoxin |
| SEQ_2038 | Prophage Seq4 | Unknown | Phage lysin | |
| SEQ_2040 | Prophage Seq4 | Cytoplasmic membrane | Phage holin | |
| SEQ_2041 | Prophage Seq4 | Unknown | Phage holin | |
| SEQ_2042 | Prophage Seq4 | Cytoplasmic | Phage protein | |
| SEQ_2043 | Prophage Seq4 | Unknown | Hypothetical protein |
FIG 2Gene ontology (GO) terms and KEGG pathways (annotated in ClueGO) in the accessory genome elements identified in all SEE (n = 50) genomes. Circle size represents the degree of the positive relationship between the GO terms and the term’s adjusted P value. The related terms are grouped and presented in the same color.
Accessory genome elements identified in all 50 SEZ genomes
| RefSeq_H70 | Gene name | Region | PSORTb result | Protein |
|---|---|---|---|---|
| SZO_01750 |
| Deleted in 4047 | Cytoplasm | Sorbitol-6-phosphate 2-dehydrogenase (EC 1.1.1.140) |
| SZO_14750 | Cytoplasm | Transcriptional regulator | ||
| SZO_15220 |
| Deleted in 4047 | Cytoplasm | 6-Phospho-beta-galactosidase (EC 3.2.1.85) |
| SZO_15240 |
| Deleted in 4047 | Cytoplasm | PTS system |
| SZO_15250 |
| Deleted in 4047 | Cytoplasm | Beta-glucoside bgl operon antiterminator |
| SZO_05610 | Deleted in 4047 | Cytoplasm membrane | ABC transporter ATP-binding protein | |
| SZO_05620 | Deleted in 4047 | Cytoplasm membrane | Daunorubicin resistance transmembrane protein | |
| SZO_05630 | Deleted in 4047 | Cytoplasm membrane | Efflux ABC transporter | |
| SZO_14690 | ESAT-6-like | Cytoplasm membrane | Branched-chain amino acid transport system carrier protein | |
| SZO_14730 |
| Cytoplasm membrane | Competence-stimulating peptide ABC transporter permease protein ComB | |
| SZO_14744 | Cytoplasm membrane | Competence-stimulating peptide ABC transporter ATP-binding protein ComA | ||
| SZO_15230 |
| Deleted in 4047 | Cytoplasm membrane | PTS system |
| SZO_14742 | Extracellular | FIG01116836: hypothetical protein | ||
| SZO_10380 | Unknown | FIG01117834: hypothetical protein | ||
| SZO_14743 | Unknown | FIG01120711: hypothetical protein |
FIG 3Gene ontology (GO) terms and KEGG pathways (annotated in ClueGO) in the accessory genome elements identified in all SEZ (n = 50) genomes. Circle size represents the degree of the positive relationship between the GO terms and the term’s adjusted P value. The related terms are grouped and presented in the same color.
FIG 4Percentage of variance for the core genome relative to the reference. (A) Percentage of variance in 50 SEE isolates from Texas. The variance the SEE isolates had a mean of 0.00010% and exhibited a variance range of 0.00005% to 0.00015%. (B) Percentage of variance in 50 SEZ isolates from Texas. The variance of the SEZ isolates had a mean of 0.01853% and demonstrated a range of 0.00013% to 0.02527%.
FIG 5Gene ontology (GO) terms and KEGG pathways (annotated in ClueGO) for SNPs identified in 75% (≥38) of SEZ genomes. Circle size represents the degree of the positive relationship between the GO terms and the term’s adjusted P value. The related terms are grouped and presented in the same color.
Novel motif sequences from SEE and SEZ genomes
| Motif sequence | Genome ID | Subsp. | Center position | Modification type |
|---|---|---|---|---|
|
| 18-074 |
| 3 | m6A |
| 19-050 | zoo | 4 | m6A | |
|
| 14-006 | zoo | 3 | m6A |
|
| 19-052 | zoo | 5 | m6A |
| 19-044 | zoo | 1 | m6A | |
| 19-050 | zoo | 1 | m6A | |
| 18-066 | zoo | 3 | m6A | |
| 14-151 | zoo | 3 | m6A | |
| 19-048 | zoo | 3 | m6A | |
| 18-059 | zoo | 5 | m6A | |
| 19-043 | zoo | 5 | m6A | |
| 19-044 | zoo | 5 | m6A | |
| 18-059 | zoo | 5 | m6A | |
| 19-043 | zoo | 5 | m6A | |
| 19-044 | zoo | 5 | m6A | |
| 19-047 | zoo | 4 | m6A | |
|
| 19-041 | zoo | 2 | m6A |
| 19-038 | zoo | 3 | m6A | |
|
| 19-047 | zoo | 2 | m6A |
|
| 19-058 | zoo | 2 | m6A |
| 19-056 | zoo | 2 | m6A | |
|
| 19-045 | zoo | 6 | m6A |
| 18-058 | zoo | 3 | m6A | |
|
| 14-007 | zoo | 5 | m4C |
|
| 18-055 | zoo | 5 | m4C |
|
| 17-006 | zoo | 4 | m6A |
|
| 19-036 | zoo | 4 | m6A |
|
| 19-058 | zoo | 2 | m6A |
zoo, zooepidemicus.
Methylation location, type, and motif in 24 SEE genomes
| CDS | Location | Type | Motif sequence |
|---|---|---|---|
| SEQ_0045 | 56855 | m6A | CTGCAG |
| SEQ_0067 | 74697 | m6A | CTGCAG |
| SEQ_0067 | 74700 | m6A | CTGCAG |
| SEQ_0070 | 76364 | m6A | CTGCAG |
| SEQ_0251 | 230695 | m6A | CTGCAG |
| SEQ_0300 | 285354 | m6A | CTGCAG |
| SEQ_0300 | 285357 | m6A | CTGCAG |
| SEQ_0302 | 288740 | m6A | CTGCAG |
| SEQ_0340 | 323013 | m6A | CTGCAG |
| SEQ_0340 | 323016 | m6A | CTGCAG |
| SEQ_0435 | 417164 | m6A | CTGCAG |
| SEQ_0435 | 417167 | m6A | CTGCAG |
| SEQ_0474 | 460537 | m6A | CTGCAG |
| SEQ_0474 | 461395 | m6A | CTGCAG |
| SEQ_0497 | 482852 | m6A | CTGCAG |
| SEQ_0497 | 482855 | m6A | CTGCAG |
| SEQ_0596 | 580039 | m6A | CTGCAG |
| SEQ_0596 | 580042 | m6A | CTGCAG |
| SEQ_0721 | 712040 | m6A | CTGCAG |
| SEQ_0769 | 763220 | m6A | CTGCAG |
| SEQ_0769 | 763223 | m6A | CTGCAG |
| SEQ_0898 | 873274 | m6A | CTGCAG |
| SEQ_0898 | 873277 | m6A | CTGCAG |
| SEQ_0976 | 967141 | m6A | CTGCAG |
| SEQ_1129 | 1118411 | m6A | CTGCAG |
| SEQ_1277 | 1274166 | m6A | CTGCAG |
| SEQ_1277 | 1274169 | m6A | CTGCAG |
| SEQ_1278 | 1276130 | m6A | CTGCAG |
| SEQ_1299 | 1296130 | m6A | CTGCAG |
| SEQ_1299 | 1296133 | m6A | CTGCAG |
| SEQ_1318 | 1318299 | m6A | CTGCAG |
| SEQ_1318 | 1318302 | m6A | CTGCAG |
| SEQ_1407 | 1406622 | m6A | CTGCAG |
| SEQ_1407 | 1406625 | m6A | CTGCAG |
| SEQ_1407 | 1408183 | m6A | CTGCAG |
| SEQ_1410 | 1411644 | m6A | CTGCAG |
| SEQ_1410 | 1411647 | m6A | CTGCAG |
| SEQ_1439 | 1442999 | m6A | CTGCAG |
| SEQ_1439 | 1443002 | m6A | CTGCAG |
| SEQ_1448 | 1453017 | m6A | CTGCAG |
| SEQ_1448 | 1453020 | m6A | CTGCAG |
| SEQ_1597 | 1602240 | m6A | CTGCAG |
| SEQ_1597 | 1602243 | m6A | CTGCAG |
| SEQ_1597 | 1602706 | m6A | CTGCAG |
| SEQ_1615 | 1626084 | m6A | CTGCAG |
| SEQ_1625 | 1634867 | m6A | CTGCAG |
| SEQ_1625 | 1634870 | m6A | CTGCAG |
| SEQ_1627 | 1636655 | m6A | CTGCAG |
| SEQ_1651 | 1658796 | m6A | CTGCAG |
| SEQ_1651 | 1658799 | m6A | CTGCAG |
| SEQ_1895 | 1896398 | m6A | CTGCAG |
| SEQ_1895 | 1896401 | m6A | CTGCAG |
| SEQ_1981 | 1925057 | m6A | CTGCAG |
| SEQ_1981 | 1925060 | m6A | CTGCAG |
| SEQ_1920 | 1928487 | m6A | CTGCAG |
| SEQ_1920 | 1928490 | m6A | CTGCAG |
| SEQ_1937 | 1945433 | m6A | CTGCAG |
| SEQ_1937 | 1945436 | m6A | CTGCAG |
| SEQ_1937 | 1945842 | m6A | CTGCAG |
| SEQ_2009 | 2033472 | m6A | CTGCAG |
| SEQ_2152 | 2161140 | m6A | CTGCAG |
| SEQ_2152 | 2161143 | m6A | CTGCAG |
| SEQ_2161 | 2171880 | m6A | CTGCAG |
| SEQ_2161 | 2171883 | m6A | CTGCAG |
| SEQ_2161 | 2172181 | m6A | CTGCAG |
| SEQ_2161 | 2172184 | m6A | CTGCAG |
| SEQ_2161 | 2173113 | m6A | CTGCAG |
| SEQ_2161 | 2173116 | m6A | CTGCAG |
| SEQ_2210 | 2224386 | m6A | CTGCAG |
| SEQ_2210 | 2224389 | m6A | CTGCAG |
FIG 6Gene ontology (GO) terms and KEGG pathways (annotated in ClueGO) on homologous proteins where methylation is present in SEE (n = 24) genomes but absent in SEZ (n = 24) genomes. Circle size represents the degree of the positive relationship between the GO terms and the term’s adjusted P value. The related terms are grouped and presented in the same color.
FIG 7Reverse transcription C values of SEE (n = 5) and SEZ (n = 5) isolates. C values were normalized to the housekeeping gene (SEQ_1170) and demonstrated no difference in transcription level. Normalized C values for SEQ_1597 (exopeptidase activity) demonstrated a significantly (P < 0.05) lower level of transcription relative to the housekeeping gene in SEE compared to that in SEZ. Normalized C values for SEQ_1918 (quorum sensing) revealed significantly (P < 0.05) higher levels of transcription relative to the housekeeping gene in SEE compared to those in SEZ. Statistics were performed using the Wilcoxon rank sum test, and an asterisk represents a P value of <0.05. Pink dots denote SEE isolates, and blue dots denote SEZ isolates.
Real-time PCR primer sequences
| Gene ID | Target function | Forward primer | Probe | Reverse primer |
|---|---|---|---|---|
| SEQ_1170 | Housekeeping |
| FAM-ATGATGGCTTCGATACGCTCTGGC-NFQ |
|
| SEQ_1597 | Exopeptidase |
| FAM-ATGTGCTTCTCAGTTGAGCCTGGT-NFQ |
|
| SEQ_1918 | Quorum sensing |
| FAM-TAGCAGCGGAAAGGTATCTGGCAA-NFQ |
|