| Literature DB >> 18405369 |
Debra E Bessen1, Karen F McGregor, Adrian M Whatmore.
Abstract
BACKGROUND: The M type-specific surface protein antigens encoded by the 5' end of emm genes are targets of protective host immunity and attractive vaccine candidates against infection by Streptococcus pyogenes, a global human pathogen. A history of genetic change in emm was evaluated for a worldwide collection of > 500 S. pyogenes isolates that were defined for genetic background by multilocus sequence typing of housekeeping genes.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18405369 PMCID: PMC2359762 DOI: 10.1186/1471-2180-8-59
Source DB: PubMed Journal: BMC Microbiol ISSN: 1471-2180 Impact factor: 3.605
Population-based surveillance of S. pyogenes.*
| Australia, tropical | 125 | NR | NR | NR | 13 | 46 | 41 |
| Rome (Italy) | 114 | 50 | 1 | 48 | n/a | n/a | n/a |
| Germany | 216 | 51 | 0 | 49 | n/a | n/a | n/a |
| Spain | 520 | 32 | 1 | 68 | n/a | n/a | n/a |
| Mexico | 282 | 54 | 1 | 44 | n/a | n/a | n/a |
| USA | > 1,900 | 53 | 1 | 47 | n/a | n/a | n/a |
| Ethiopia | 104 | 26 | 28 | 35 | 0 | 32 | 47 |
| Nepal | 53 | NR | NR | NR | 19 | 30 | 51 |
| Brazil | 87 | 20 | 20 | 61 | 3 | 55 | 42 |
| Brussels (Belgium) | 163 | 55 | 0 | 45 | NR | NR | NR |
| Australia, tropical | 129 | NR | NR | NR | 13 | 53 | 35 |
* emm pattern is inferred based on emm-type (McGregor et al., 2004; reference 23) NR, < 10 isolates recovered; n/a, not applicable.
Summary of S. pyogenes sample (sub)sets analyzed in this study.
| Distribution according to emm pattern: | ||||||
| Undefined or other | ||||||
| Complete | Number of isolates | 582 | 156 | 181 | 240 | 5 |
| Complete | Number of STs represented | 259 | 42 | 91 | 123 | 5 |
| Complete | Number of emm types represented | 156 | 29 | 62 | 61 | 4 |
| Complete | Number of unique combinations of emm type and ST | 280 | 47 | 104 | 124 | 5 |
| Complete | Simpson's diversity index * | 0.993 | 0.950 | 0.985 | 0.990 | n.d. |
| Complete | Simpson's diversity index, 95% confidence intervals | 0.992–0.995 | 0.938–0.963 | 0.979–0.991 | 0.987–0.992 | n.d. |
| emm nt substitution | Number of isolates | 520 | 137 | 155 | 220 | 8 |
| emm nt substitution | Number of emm types represented | 105 | 18 | 40 | 44 | 3 |
| emm nt substitution | Number of emm alleles represented | 188 | 54 | 57 | 71 | 6 |
| emm HGT | Number of isolates | 531 | 143 | 156 | 224 | 8 |
| emm HGT | Number of emm types represented | 105 | 18 | 40 | 44 | 3 |
| emm HGT | Number of STs represented | 219 | 34 | 75 | 108 | 2 |
* Based on unique combinations of emm type and ST. Abbreviations: n.d., not determined; HGT, horizontal gene transfer; nt, nucleotide; ST, sequence type
Synonymous and nonsynonymous nucleotide substitutions within the emm type region (150 nt), based on Clustal W alignments corresponding to each emm type
| emm pattern | No. of emm types (%) | No. of isolates analyzed | Average no. of nonsynonymous substitutions per nonsynonymous site (Ka) # | Average no. of synonymous substitutions per synonymous site (Ks) | Average ratio of Ka to Ks |
| A-C | 18 (17) | 137 | 0.02121 | 0.00431 | 4.92 |
| D | 40 (38) | 155 | 0.00748 | 0.00488 | 1.53 |
| E | 44 (42) * | 220 | 0.00732 | 0.00581 | 1.26 |
| other^ | 3 | 8 | 0.02633 | 0.00721 | 3.65 |
| All | 105 (100) | 520 $ | 0.01037 | 0.00526 | 1.96 |
# Represents average of values from alignments corresponding to each emm type
$ Sequence alignment of (partial) emm genes was performed for 520 of the 531 isolates.
* emmst206 is included with emm96 analysis since the large indel (54 nt) is at the extreme 5' end.
^Includes two emm types associated with multiple emm patterns (emm54, emmst854; patterns A-C and D) and emm31 (pattern uncertain).
Recombinational replacement of emm with a new emm type.
| emm pattern | No, of isolates | No. of STs | No. of emm type-variable STs (%) | No. of emm types * | No. of emm types associated with emm type variable STs (%) | No. of recombinational events | No. of recombinational events per locus per ST |
| A-C | 156 | 42 | 3 (7.1) | 29 | 8 (27.6) | 5 | 0.119 |
| D | 181 | 91 | 9 (9.9) | 62 | 23 (37.1) | 14 | 0.154 |
| E | 240 | 123 | 2 (1.6) | 61 | 4 (6.6) | 2 | 0.016 |
| All | 577 | 256 | 14 | 152 | 35 | 21 | 0.096 average |
* One ST scored as pattern D (ST3) and 1 ST scored as pattern E (ST150) contain isolates with rearranged emm region yielding a new emm type, but could not be assigned a pattern; they are listed here with the pattern D and E groups, respectively. Other emm pattern-undefined isolates are not included in chart.
Figure 1Differences in the number of housekeeping alleles between isolates sharing an . The y-axis shows the numbers of emm types represented by are each category, as defined by the x-axis. (A), The minimum number of differences in housekeeping alleles between isolates sharing an emm type are: zero (singleton STs), one or two (1 CC with multiple STs), three or four (multiple CCs with STs of intermediate distance), and five (multiple CCs whereby all STs are distant; represents HGT). (B), Distribution of the maximum number of differences in housekeeping alleles between isolates sharing an emm type. Clonal complex (CC) is defined by STs sharing at least 5 of 7 housekeeping alleles
emm types associated with distant STs.
| emm pattern | No. of emm types examined * | No. of emm types involved in HGT | No. of HGT events involving emm type | Average no. of HGT events per emm type | Average no. of isolates sampled per emm type | Average no. of countries sampled per emm type | No. of HGT events involving the same emm allele | No. of emm types restricted to 1 ST or CC |
| A-C | 18 | 3 | 3 | 0.17 | 7.94 | 2.78 | 2 | 14 |
| D | 40 | 19^ | 24^ | 0.60 | 3.90 | 2.18 | 15 | 15 |
| E | 44 | 27 | 33 | 0.75 | 5.09 | 2.28 | 15 | 12 |
| other | 3 | 3 | 3 | 1.00 | 2.67 | 1.67 | 0 | 0 |
| Total | 105 | 52 | 63 | n/a | n/a | n/a | 32 | 41 |
* emm types examined are those whereby 2 or more isolates are present in the sample set (i.e., sample set of 531 isolates, excluding the 51 singletons).
^ One emm type (emm93) may not be involved in HGT based on identification of an intermediate ST in another study [49]
Abbreviations: n/a, applicable; CC, clonal complex; HGT, horizontal gene transfer; ST, sequence type
Estimate for the minimum number of recombinational events involving housekeeping genes.
| emm pattern | No. of STs | No. of recombinational events | No. of loci | No. of recombinational events per locus per ST |
| A-C | 42 | 2 | 7 | 0.007 |
| D | 91 | 16 | 7 | 0.025 |
| E | 123 | 15 | 7 | 0.017 |