| Literature DB >> 33075055 |
Sean J Buckley1, Mark R Davies2, David J McMillan1.
Abstract
Bacterial "stand-alone" response regulators (RRs) are pivotal to the control of gene transcription in response to changing cytosolic and extracellular microenvironments during infection. The genome of group A Streptococcus (GAS) encodes more than 30 stand-alone RRs that orchestrate the expression of virulence factors involved in infecting multiple tissues, so causing an array of potentially lethal human diseases. Here, we analysed the molecular epidemiology and biological associations in the coding sequences (CDSs) and upstream intergenic regions (IGRs) of 35 stand-alone RRs from a collection of global GAS genomes. Of the 944 genomes analysed, 97% encoded 32 or more of the 35 tested RRs. The length of RR CDSs ranged from 297 to 1587 nucleotides with an average nucleotide diversity (π) of 0.012, while the IGRs ranged from 51 to 666 nucleotides with average π of 0.017. We present new evidence of recombination in multiple RRs including mga, leading to mga-2 switching, emm-switching and emm-like gene chimerization, and the first instance of an isolate that encodes both mga-1 and mga-2. Recombination was also evident in rofA/nra and msmR loci with 15 emm-types represented in multiple FCT (fibronectin-binding, collagen-binding, T-antigen)-types, including novel emm-type/FCT-type pairings. Strong associations were observed between concatenated RR allele types, and emm-type, MLST-type, core genome phylogroup, and country of sampling. No strong associations were observed between individual loci and disease outcome. We propose that 11 RRs may form part of future refinement of GAS typing systems that reflect core genome evolutionary associations. This subgenomic analysis revealed allelic traits that were informative to the biological function, GAS strain definition, and regional outbreak detection.Entities:
Mesh:
Substances:
Year: 2020 PMID: 33075055 PMCID: PMC7571705 DOI: 10.1371/journal.pone.0240834
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Distribution of GAS stand-alone response regulators.
| RR | Spy locus | Distribution (%) | Function | References |
|---|---|---|---|---|
| spy0092 | 944/944 (100) | Zinc acquisition and virulence | [ | |
| spy1634 | 942/944 (99.8) | Short-chain fatty acid metabolism | [ | |
| spy0514 | 944/944 (100) | Carbohydrate catabolite and virulence regulation | [ | |
| spy1777 | 944/944 (100) | Nutritional stress response | [ | |
| spy0037 | 944/944 (100) | Biofilm-related transformation | [ | |
| spy1717 | 944/944 (99.9) | Copper tolerance | [ | |
| spy0898 | 944/944 (100) | Innate immunity defence | [ | |
| spy1870 | 944/944 (100) | Cathelicidin resistance | [ | |
| spy2074 | 944/944 (100) | Heat stress response | [ | |
| spy0846 | 942/944 (99.8) | Zinc resistance efflux | [ | |
| spy1763 | 943/944 (99.9) | Heat shock response repressor | [ | |
| spy1978 | 944/944 (100) | Not known, Under regulatory influence of | [ | |
| spy1293 | 944/944 (100) | Cell adhesion and polysaccharide metabolism | [ | |
| spy2019 | 151/944 (16) | Carbohydrate metabolite-responsive virulence regulator of ‘throat’ strains | [ | |
| spyM18_2077 | 794/944 (84) | Carbohydrate metabolite-responsive virulence regulator of ‘generalist’ strains 3,4,5 | [ | |
| spy1259 | 942/944 (99.8) | ROS resistance via iron sequestration | [ | |
| spy49_0118 | 722/944 (75.5) | Regulation of type-dependent FCT and virulence genes | [ | |
| spy0450 | 940/944 (99.6) | Metal uptake, virulence, and metabolism | [ | |
| spyM3_0097 | 328/944 (34.7) | Regulation of type-dependent FCT genes, | [ | |
| spy0187 | 944/944 (100) | Metal homeostasis, oxidative environment response, and immunity defence | [ | |
| spy0735 | 834/944 (88.3) | Regulator of | [ | |
| spy0627 | 944/944 (100) | Expression of | [ | |
| spy0496 | 940/944 (99.6) | Quorum sensing, biofilm regulator | [ | |
| spy0533 | 940/944 (99.6) | Quorum sensing, biofilm regulator | [ | |
| spy0216 | 944/944 (100) | [ | ||
| spy0124 | 616/944 (65.3) | Regulation of type-dependent FCT genes and toxins | [ | |
| spy2042 | 944/944 (100) | Growth phase-appropriate balance of virulence (e.g. | [ | |
| spy0715 | 944/944 (99.9) | GntR-like | [ | |
| spy1202 | 944/944 (99.7) | GntR-like under regulatory influence of RopB in NZ131 background | [ | |
| spy1258 | 944/944 (100) | TetR-like | [ | |
| spy1602 | 942/944 (99.8) | GntR-like | ||
| spy2177 | 944/944 (99.8) | TetR-like under regulatory influence of RopB in NZ131 background. | [ | |
| spy1857 | 940/944 (99.6) | [ | ||
| spy0887 | 944/944 (99.9) | Influences speB expression | [ | |
| spy0188 | 676/944 (71.6) | Not known | [ |
1Gene name, or SF370 locus tag where not available
2Full length intact
3Virulence-related
4Metabolism-related
5Biofilm formation-related.
Nucleotide variation in the coding sequences of selected GAS stand-alone response regulators.
| Gene | Size (nt) | Alleles | Variant nt positions | Nucleotide diversity (π) | Allelic diversity (D) | Average nt percentage identity [range] | Recombination events |
|---|---|---|---|---|---|---|---|
| 723 | 32 | 41 | 0.0035 | 0.708 | 100 [94–100] | 1 | |
| 993 | 141 | 168 | 0.0173 | 0.981 | 98 [93–100] | 3 | |
| 645 | 87 | 62 | 0.0037 | 0.962 | 100 [99–100] | 0 | |
| 999 | 58 | 41 | 0.0041 | 0.915 | 100 [99–100] | 1 | |
| 1203 | 78 | 407 | 0.0896 | 0.897 | 86 [52–100] | 1 | |
| 894 | 87 | 182 | 0.0737 | 0.954 | 92 [65–100] | 3 | |
| 432 | 63 | 58 | 0.0027 | 0.895 | 100 [99–100] | 0 | |
| 480 | 74 | 57 | 0.0041 | 0.933 | 100 [97–100] | 1 | |
| 741 | 25 | 18 | 0.0025 | 0.616 | 100 [99–100] | 0 | |
| 459 | 61 | 55 | 0.0033 | 0.864 | 100 [97–100] | 3 | |
| 639 | 95 | 89 | 0.0062 | 0.952 | 99 [97–100] | 0 | |
| 1038 | 164 | 202 | 0.0134 | 0.985 | 99 [90–100] | 3 | |
| 717 | 90 | 187 | 0.0031 | 0.953 | 100 [95–100] | 2 | |
| 1587 | 40 | 362 | 0.0181 | 0.278 | 98 [89–100] | 3 | |
| 1491 | 201 | 366 | 0.0103 | 0.939 | 99 [84–100] | 4 | |
| 1491 | 56 | 61 | 0.0026 | 0.838 | 100 [91–100] | 0 | |
| 780 | 90 | 83 | 0.0047 | 0.922 | 100 [99–100] | 0 | |
| 465 | 145 | 87 | 0.0102 | 0.983 | 99 [98–100] | 1 | |
| 885 | 40 | 50 | 0.0052 | 0.559 | 99 [99–100] | 1 | |
| 741 | 44 | 35 | 0.0038 | 0.845 | 100 [99–100] | 0 | |
| 1506 | 161 | 390 | 0.0065 | 0.926 | 97 [1–100] | 0 | |
| 441 | 123 | 388 | 0.0292 | 0.967 | 97 [86–100] | 4 | |
| 849 | 63 | 52 | 0.0027 | 0.917 | 100 [99–100] | 0 | |
| 909 | 77 | 60 | 0.0053 | 0.958 | 99 [99–100] | 1 | |
| 840 | 156 | 140 | 0.0034 | 0.982 | 100 [99–100] | 2 | |
| 1533 | 122 | 201 | 0.0088 | 0.871 | 99 [96–100] | 2 | |
| 864 | 132 | 138 | 0.0036 | 0.945 | 100 [94–100] | 1 | |
| 531 | 114 | 240 | 0.0453 | 0.971 | 95 [78–100] | 17 | |
| 720 | 45 | 118 | 0.0041 | 0.571 | 100 [89–100] | 1 | |
| 519 | 57 | 53 | 0.0019 | 0.703 | 100 [97–100] | 0 | |
| 1032 | 135 | 232 | 0.0193 | 0.98 | 98 [90–100] | 2 | |
| 540 | 54 | 100 | 0.0049 | 0.887 | 99 [65–100] | 0 | |
| 903 | 52 | 37 | 0.0039 | 0.894 | 100 [99–100] | 0 | |
| 297 | 39 | 30 | 0.0011 | 0.602 | 100 [99–100] | 0 | |
| 846 | 32 | 36 | 0.0052 | 0.819 | 99 [95–100] | 0 |
1Gene name, or SF370 locus tag where not available
2Variant nucleotides in the multiple sequence alignment
3Number of putative recombination events inferred by fastGEAR.
Amino acid variation in the coding sequences of selected GAS stand-alone response regulators.
| Gene | Average aa percentage identity [range] | Nonsense mutations | πA/πS | KA/KS | Selection pressure |
|---|---|---|---|---|---|
| 99.3 [15.9–100] | 3 | 0.069 | 0.068 | neg | |
| 94.6 [9.6–100] | 94 | 0.15 | 0.132 | neg | |
| 99.8 [16.6–100] | 3 | 4.548 | 4.456 | pos | |
| 99.8 [34–100] | 2 | 1.062 | 1.008 | pos | |
| 77.2 [7.5–100] | 47 | 0.613 | 0.659 | neg | |
| 88.7 [19.4–100] | 29 | 0.984 | 0.951 | neg | |
| 99.7 [77.5–100] | 1 | 0.153 | 0.155 | neg | |
| 99.5 [96–100] | 0 | 0.185 | 0.107 | neg | |
| 99.4 [97.4–100] | 0 | 0.955 | 0.947 | neg | |
| 95.1 [3.3–100] | 29 | 1.91 | 2.039 | pos | |
| 99.5 [98–100] | 0 | 0.122 | 0.121 | neg | |
| 98.7 [22.9–100] | 1 | 0.131 | 0.13 | neg | |
| 99.2 [31.7–100] | 15 | 0.573 | 0.455 | neg | |
| 93.4 [11.3–100] | 5 | 0.109 | 0.108 | neg | |
| 98.5 [16.5–100] | 9 | 0.785 | 0.909 | neg | |
| 99.4 [9–100] | 4 | 0.027 | 0.025 | neg | |
| 99.6 [88.4–100] | 3 | 0.091 | 0.096 | neg | |
| 97.9 [6.7–100] | 6 | 0.84 | 0.949 | neg | |
| 99.6 [98.4–100] | 3 | 0.101 | 0.1 | neg | |
| 99.8 [98.1–100] | 9 | 0.069 | 0.066 | neg | |
| 90.8 [5.5–100] | 9 | 0.228 | 0.228 | neg | |
| 98.9 [9.5–100] | 4 | 3.086 | 4.503 | pos | |
| 99.6 [97.9–100] | 6 | 0.262 | 0.26 | neg | |
| 99.6 [78.4–100] | 1 | 0.346 | 0.314 | neg | |
| 96.2 [27.3–100] | 36 | 0.413 | 0.548 | neg | |
| 97.8 [15.7–100] | 8 | 0.423 | 0.411 | neg | |
| 99.3 [ | 15 | 0.098 | 0.099 | neg | |
| 94 [7.5–100] | 60 | 0.689 | 0.723 | neg | |
| 99 [6.4–100] | 13 | 0.062 | 0.052 | neg | |
| 98.2 [7.4–100] | 19 | 0.272 | 0.311 | neg | |
| 98.1 [25.1–100] | 2 | 0.204 | 0.203 | neg | |
| 97.8 [5.8–100] | 11 | 0.788 | 0.554 | neg | |
| 99.9 [90.8–100] | 1 | 0.018 | 0.018 | neg | |
| 99 [10.2–100] | 7 | 0.703 | 0.833 | neg | |
| 98.9 [ | 3 | 0.507 | 0.506 | neg |
1Gene name, or SF370 locus tag where not available
2Alleles containing premature stop codon
3pos = positive and neg = negative.
Nucleotide variation in the upstream intergenic regions of selected GAS stand-alone response regulators.
| RR | Size | Alleles | Variant nt positions | Nucleotide diversity (π) | Allelic diversity (D) | Recombination events |
|---|---|---|---|---|---|---|
| 109 | 12 | 12 | 0.0017 | 0.178 | 0 | |
| 200 | 94 | 105 | 0.0002 | 0.966 | 0 | |
| 173 | 18 | 67 | 0.0066 | 0.515 | 0 | |
| 217 | 30 | 27 | 0.0072 | 0.801 | 0 | |
| 131 | 68 | 67 | 0.0148 | 0.879 | 0 | |
| 171 | 90 | 244 | 0.2906 | 0.814 | 0 | |
| 221 | 23 | 29 | 0.0036 | 0.610 | 0 | |
| 239 | 27 | 32 | 0.0041 | 0.643 | 0 | |
| 195 | 15 | 15 | 0.0050 | 0.693 | 0 | |
| 135 | 28 | 85 | 0.0074 | 0.756 | 0 | |
| 134 | 25 | 37 | 0.0052 | 0.580 | 1 | |
| 360 | 133 | 214 | 0.0297 | 0.985 | 2 | |
| 245 | 106 | 213 | 0.0028 | 0.900 | 0 | |
| 675 | 42 | 396 | 0.0233 | 0.310 | 2 | |
| 666 | 133 | 535 | 0.0174 | 0.939 | 3 | |
| 162 | 31 | 71 | 0.0095 | 0.729 | 0 | |
| 384 | 60 | 55 | 0.0143 | 0.904 | 1 | |
| 142 | 27 | 75 | 0.0019 | 0.321 | 0 | |
| 429 | 26 | 126 | 0.0173 | 0.564 | 0 | |
| 51 | 7 | 6 | 0.0014 | 0.149 | 0 | |
| 513 | 132 | 521 | 0.0308 | 0.970 | 1 | |
| 63 | 8 | 8 | 0.0126 | 0.515 | 0 | |
| 88 | 3 | 11 | 0.0006 | 0.147 | 0 | |
| 79 | 14 | 20 | 0.0125 | 0.777 | 0 | |
| 554 | 80 | 109 | 0.0095 | 0.938 | 1 | |
| 262 | 42 | 91 | 0.0021 | 0.759 | 1 | |
| 268 | 44 | 182 | 0.0080 | 0.786 | 0 | |
| 147 | 34 | 190 | 0.0122 | 0.719 | 0 | |
| 55 | 7 | 5 | 0.0040 | 0.215 | 0 | |
| 117 | 20 | 19 | 0.0053 | 0.483 | 0 | |
| 105 | 41 | 40 | 0.0344 | 0.859 | 1 | |
| 134 | 34 | 66 | 0.0111 | 0.528 | 0 | |
| 105 | 6 | 13 | 0.0001 | 0.148 | 0 | |
| 147 | 16 | 15 | 0.0008 | 0.150 | 0 | |
| 155 | 27 | 63 | 0.0024 | 0.793 | 0 |
1Gene name, or SF370 locus tag where not available
2Nucleotide distance between RR genes and upstream gene
3Variant nucleotides in the multiple sequence alignment
4Number of putative recombination events inferred by fastGEAR.
Fig 1Schematic representation of the recombination event observed in the intergenic region of mga suggesting recombination of mga-2 into an mga-1-type background (n = 10).
Numerical values represent both the intra-allele and inter-allele percentage nucleotide identities. Hatched fill = Intergenic regions, Blue = genes of mga-2-type isolates, and Green = genes of mga-1-type isolates. Notes A = recombination flank, B = SF370 spy locus tag, C = emm82C chimeric emm gene.
Fig 2Schematic representation of the genes of the mga regulon depicting observed plasticity.
Fig 3Proposed evolutionary pathway describing the deletion-fusion (chimerization) event observed in emm82C NGAS473.
The non-deleted DNA sequences share 100% percentage nucleotide identity. Green = Coding sequence, Yellow = Intergenic region, Hatched = Deleted DNA, and black arrow located 93 nt from the 5’ end depicts the start of the Centre for Disease Control 180 nt emm82.0 subtyping sequence.
Fig 4Subgenomic comparison of the mga loci of emm12 GAS isolates, SP1LAU and emm12 reference strain MGAS9429, displaying the presence of both mga-1 and mga-2 loci in SP1LAU using Blast Ring Generator (BRIG).
Fig 5Observed nucleotide diversity (π) distribution within the intergenic region and functional domains of A) mga-1 (n = 151) and B) mga-2 (n = 793). IGR = intergenic region, 1 = Common Mga domain, 2 = helix-turn-helix (HTH)-3, 3 = HTH-4, 4 = phosphotransferase system regulatory domain (PRD)-1, 5 = PRD-2, 6 = phosphotransferase system enzyme IIB-like domain.