David J McMillan, Debra E Bessen, Marcos Pinho, Candace Ford, Gerod S Hall, José Melo-Cristino, Mário Ramirez.
Abstract
BACKGROUND: Streptococcus dysgalactiae subspecies equisimilis (SDSE) is an emerging global pathogen that can colonize and infect humans. Although most SDSE isolates possess the Lancefield group G carbohydrate, a significant minority have the group C carbohydrate. Isolates are further sub-typed on the basis of differences within the emm gene. To gain a better understanding of their molecular epidemiology and evolutionary relationships, multilocus sequence typing (MLST) analysis was performed on SDSE isolates collected from Australia, Europe and North America.
Year: 2010 PMID: 20668530 PMCID: PMC2909212 DOI: 10.1371/journal.pone.0011741
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Characteristics of SDSE isolates included in this study.
| Collection site | No. of isolates | No. of different emm types | Diversity index, D (95% CI) | No. of different STs | Diversity index, D (95% CI) | STs unique to collection site | No. of isolates with group G carbohydrate | No. of isolates with group C carbohydrate | No. of invasive isolates | No. of non-invasive isolates |
| | 55 | 17 | 0.926 (0.901–0.956) | 23 | 0.937 (0.908–0.970) | 14 | 47 | 8 | 24 | 25 |
| | 36 | 17 | 0.951 (0.930–0.972) | 22 | 0.956 (0.922–0.989) | 11 | 28 | 8 | 10 | 26 |
| | 72 | 34 | 0.984 (0.976–0.985) | 45 | 0.975 (0.961–0.988) | 33 | 48 | 23 | 70 | 1 |
| | 11 | 8 | 0.927 (0.833–1.020) | 10 | 0.981 (0.936–1.020) | 5 | 6 | 5 | 9 | 0 |
| | 61 | 33 | 0.983 (0.979–0.988) | 37 | 0.972 (0.957–0.988) | 28 | 42 | 18 | 61 | 1 |
| | 15 | 12 | 0.971 (0.951–1.001) | 13 | 0.981 (0.951–1.011) | 10 | 7 | 7 | 1 | 6 |
| | | | 0.961 (0.954–0.967) | | | | | | | |
D, Simpson's Index of Diversity.
CI, Confidence Interval.
n.a., not applicable.
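As a rough illustration of how a diversity index of this kind is obtained from typing data, the sketch below computes Simpson's index of diversity in its unbiased form, D = 1 - Σ n_j(n_j - 1) / (N(N - 1)), where n_j is the number of isolates of type j and N is the total. That this exact estimator was used for the table is an assumption, the confidence intervals are not reproduced, and the ST labels are hypothetical.

```python
from collections import Counter


def simpson_diversity(type_labels):
    """Simpson's index of diversity: 1 - sum(n_j * (n_j - 1)) / (N * (N - 1)).

    type_labels holds one label (e.g. an ST or emm type) per isolate.
    """
    counts = list(Counter(type_labels).values())
    n_total = sum(counts)
    if n_total < 2:
        raise ValueError("at least two isolates are required")
    # Number of ordered isolate pairs that share the same type.
    same_type_pairs = sum(n * (n - 1) for n in counts)
    return 1.0 - same_type_pairs / (n_total * (n_total - 1))


# Hypothetical ST assignments for six isolates (not data from the study).
example_sts = ["ST3", "ST3", "ST8", "ST15", "ST15", "ST15"]
print(round(simpson_diversity(example_sts), 3))
```

A value close to 1 means that two isolates drawn at random are very likely to belong to different types.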
Housekeeping genes used for MLST of SDSE.
| Gene | ORF | Size of partial gene (bp) | No. of alleles | No. of nucleotide variant positions (%) | No. of variant aa positions | π | dn | ds | dn/ds |
| Glucose kinase | SDEG_1515 | 498 | 12 | 50 (10.1) | 7 | 0.021 | 0.0035 | 0.0736 | 0.047 |
| Glutamine transport protein | SDEG_1494 | 450 | 10 | 15 (3.3) | 7 | 0.010 | 0.0059 | 0.0248 | 0.240 |
| Glutamate racemase | SDEG_0413 | 438 | 10 | 12 (2.7) | 1 | 0.012 | 0.0068 | 0.0142 | 0.479 |
| DNA mismatch repair protein | SDEG_2091 | 405 | 10 | 27 (6.7) | 7 | 0.016 | 0.0062 | 0.0480 | 0.129 |
| Transketolase | SDEG_1735 | 459 | 20 | 37 (8.1) | 3 | 0.034 | 0.0008 | 0.1472 | 0.006 |
| Xanthine phosphoribosyl transferase | SDEG_0895 | 450 | 22 | 38 (8.4) | 10 | 0.021 | 0.0049 | 0.0736 | 0.0665 |
| Acetoacetyl-CoA thiolase | SDEG_1700 | 434 | 12 | 18 (4.1) | 5 | 0.011 | 0.0035 | 0.0330 | 0.106 |
Based on ORF number in the GGS_124 genome (GenBank accession number AP010935).
aa, amino acid.
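The derived columns of this table can be recomputed from the others. The sketch below uses only values copied from the table, keyed by ORF number: it divides dn by ds and expresses the variable positions as a percentage of the fragment length. The function names are illustrative, and because the published dn and ds are already rounded, recomputed ratios may differ from the published ones in the last digit.

```python
# (ORF, fragment size in bp, variable nucleotide positions, dn, ds, published dn/ds)
# All values are copied from the table above.
LOCI = [
    ("SDEG_1515", 498, 50, 0.0035, 0.0736, 0.047),
    ("SDEG_1494", 450, 15, 0.0059, 0.0248, 0.240),
    ("SDEG_0413", 438, 12, 0.0068, 0.0142, 0.479),
    ("SDEG_2091", 405, 27, 0.0062, 0.0480, 0.129),
    ("SDEG_1735", 459, 37, 0.0008, 0.1472, 0.006),
    ("SDEG_0895", 450, 38, 0.0049, 0.0736, 0.0665),
    ("SDEG_1700", 434, 18, 0.0035, 0.0330, 0.106),
]


def percent_variable(variable_sites, fragment_bp):
    """Variable nucleotide positions as a percentage of the fragment length."""
    return 100.0 * variable_sites / fragment_bp


def dn_ds_ratio(dn, ds):
    """Ratio of non-synonymous to synonymous substitutions per site."""
    return dn / ds


for orf, size_bp, variable, dn, ds, published in LOCI:
    print(
        f"{orf}: {percent_variable(variable, size_bp):.1f}% variable, "
        f"dn/ds = {dn_ds_ratio(dn, ds):.3f} (published {published})"
    )
```

All seven ratios are below 1, which is conventionally read as a sign that the loci are under purifying selection.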
Figure 1. goeBURST diagram of relationships between 178 global SDSE isolates.
The size of each circle is proportional to the number of isolates with that particular ST on a logarithmic scale. STs assigned to the same CC are linked by straight lines. Blue circles represent isolates that have the group G carbohydrate. Red circles represent isolates expressing the group C carbohydrate. Where isolates of the same ST have different group carbohydrates, the area of each color within the circle is proportional to the number of isolates bearing that carbohydrate. The green circle represents the single isolate expressing the group L carbohydrate.
Relationship between ST and emm type.
| ST | No. of isolates | Associated emm types | No. of emm types |
| 15 | 20 | stC839, stG10, stG166b, stG2078, stG245, stG6, stG652 | 7 |
| 8 | 10 | stC839, stG11, stG480, stG643, stG7860 | 5 |
| 4 | 10 | stC36, stC5344, stG6792, stG97, stG7882 | 5 |
| 3 | 10 | emm57, stC1400, stC839, stG653 | 4 |
| 25 | 9 | stG166b, stG5420, stG6 | 3 |
| 17 | 9 | stC74a, stG2078, stG485 | 3 |
| 63 | 2 | stG6, stG652 | 2 |
| 52 | 2 | stG6, stG643 | 2 |
| 20 | 8 | stC6979, stG62647 | 2 |
| 34 | 3 | stC1400, stG5063 | 2 |
| 29 | 14 | stC74a, stG485 | 2 |
Relationship between emm type and ST.
| emm type | No. of isolates | Associated STs | No. of STs | No. of CCSLV | No. of CCDLV | No. of distant STs |
| stG6 | 13 | 15, 24, 25, 44, 52, 58, 62, 63 | 8 | 4 | 2 | 2 |
| stG480 | 11 | 7, 8, 38, 39, 40, 41, 67 | 7 | 2 | 2 | 2 |
| stC1400 | 8 | 3, 28, 34, 46, 64, 66 | 6 | 3 | 1 | 4 |
| stG485 | 8 | 17, 29, 37, 47, 55, 69 | 6 | 3 | 2 | 2 |
| stG643 | 9 | 8, 12, 22, 48, 52, 73 | 6 | 3 | 4 | 5 |
| stG652 | 7 | 15, 32, 59, 61, 63, 71 | 6 | 3 | 2 | |
| stC6979 | 8 | 9, 19, 20, 54, 80 | 5 | 4 | 4 | 3 |
| stC36 | 6 | 4, 45, 49, 50, 68 | 5 | 2 | 1 | 2 |
| stC74a | 15 | 17, 29, 70, 77 | 4 | 2 | 3 | 3 |
| stC839 | 7 | 3, 8, 15, 78 | 4 | 3 | 2 | 3 |
| stG166b | 5 | 15, 25, 56, 65 | 4 | 2 | 2 | 2 |
| stG11 | 5 | 6, 8, 42 | 3 | 1 | 1 | |
| stG2078 | 9 | 15, 17, 72 | 3 | 2 | 2 | 2 |
| stG245 | 3 | 15, 21, 36 | 3 | 2 | 2 | 2 |
| stG4831 | 3 | 74, 75, 76 | 3 | 1 | 0 | |
| stG62647 | 9 | 20, 33, 60 | 3 | 1 | 1 | |
| stG6792 | 6 | 4, 31, 51 | 3 | 1 | 1 | |
| emm57 | 3 | 3, 57 | 2 | 1 | 0 | 2 |
| stC5344 | 3 | 4, 43 | 2 | 1 | 1 | |
| stC6746 | 2 | 5, 27 | 2 | 1 | 0 | |
| stC9431 | 2 | 13, 14 | 2 | 1 | 1 | |
| stG5063 | 2 | 2, 34 | 2 | 1 | 0 | |
| stG840 | 2 | 26, 30 | 2 | 1 | 1 | |
| stG7882 | 2 | 4, 18 | 2 | 1 | 1 | |
CCSLV – Clonal complex based on Single Locus Variant relationships.
CCDLV – Clonal complex based on Double Locus Variant relationships.
Distant STs: number of STs sharing the same emm type and differing from all other STs harboring that emm type at more than five housekeeping alleles.
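The categories in these footnotes all come down to counting how many of the seven housekeeping loci differ between two allelic profiles. The sketch below is a minimal illustration under that reading: one difference is a single locus variant (SLV), two a double locus variant (DLV), and more than five marks a distant pair. The allele numbers and function names are hypothetical, and the code only classifies a pair of STs; it does not reproduce the goeBURST grouping of STs into clonal complexes.

```python
def allele_differences(profile_a, profile_b):
    """Count the loci at which two allelic profiles carry different alleles."""
    if len(profile_a) != len(profile_b):
        raise ValueError("profiles must cover the same loci")
    return sum(a != b for a, b in zip(profile_a, profile_b))


def pair_relationship(profile_a, profile_b):
    """Classify a pair of seven-locus profiles by their number of differences."""
    diffs = allele_differences(profile_a, profile_b)
    if diffs == 0:
        return "identical"
    if diffs == 1:
        return "SLV"
    if diffs == 2:
        return "DLV"
    if diffs > 5:
        return "distant"
    return "other"


# Hypothetical seven-locus profiles (allele numbers are invented).
st_a = (1, 2, 3, 4, 5, 6, 7)
st_b = (1, 2, 3, 4, 5, 6, 9)   # one locus differs from st_a
st_c = (1, 2, 8, 4, 5, 6, 9)   # two loci differ from st_a
st_d = (9, 9, 9, 9, 9, 9, 7)   # six loci differ from st_a

for name, profile in [("st_b", st_b), ("st_c", st_c), ("st_d", st_d)]:
    print(name, pair_relationship(st_a, profile))
```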
Distribution of housekeeping alleles among GCS and GGS isolates.
| Housekeeping gene locus | % of alleles shared by GCS and GGS | % of alleles restricted to GCS | % of alleles restricted to GGS |
| | 30 | 30 | 40 |
| | 32 | 36 | 32 |
| | 56 | 33 | 11 |
| | 33 | 25 | 43 |
| | 50 | 33 | 17 |
| | 35 | 35 | 30 |
| | 40 | 30 | 30 |
| Total for all alleles | 38 | 36 | 29 |
Presented in order of the locus position on the genome of strain GGS_124.
Excludes gtr06 which is restricted to group L.
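Percentages of this kind follow from simple set operations on the alleles seen in each group. The sketch below is an illustration only: the allele numbers are hypothetical and were chosen so that a ten-allele locus splits 30/30/40, and the function name is not taken from the study.

```python
def allele_sharing(gcs_alleles, ggs_alleles):
    """Percentage of a locus's alleles that are shared, GCS-only, or GGS-only."""
    gcs, ggs = set(gcs_alleles), set(ggs_alleles)
    total = len(gcs | ggs)
    if total == 0:
        raise ValueError("no alleles supplied")
    return {
        "shared": 100 * len(gcs & ggs) / total,
        "gcs_only": 100 * len(gcs - ggs) / total,
        "ggs_only": 100 * len(ggs - gcs) / total,
    }


# Hypothetical allele numbers at one locus (not data from the study):
# alleles 1-3 occur in both groups, 4-6 only in GCS, 7-10 only in GGS.
gcs_alleles = {1, 2, 3, 4, 5, 6}
ggs_alleles = {1, 2, 3, 7, 8, 9, 10}
print(allele_sharing(gcs_alleles, ggs_alleles))
```

Because each percentage is rounded independently in a published table, a row need not sum to exactly 100.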
Figure 2. Venn diagram depicting the distribution of ST and emm type across three continents.
Unbracketed numbers represent the total number of STs or emm types. The numbers in brackets indicate the percentage of total isolates in the entire collection.
Intercontinental clones of SDSE.
| CC | ST | emm type | Group carbohydrate | Australia | Europe | North America |
| 3 | 3 | stC839 | C | x | x | x |
| 4 | 4 | stG6792 | G | x | x | |
| 8 | 8 | stG480 | G | x | x | x |
| 8 | 8 | stG11 | G | x | x | |
| 8 | 38 | stG480 | G | x | x | x |
| 15 | 15 | stG10 | G | x | x | x |
| 15 | 15 | stG652 | G | x | x | |
| 15 | 15 | stG166b | G | x | x | |
| 17 | 17 | stG2078 | G | x | x | x |
| 17 | 12 | stG643 | G | x | x | |
| 20 | 20 | stG62647 | C | x | x | x |
| 25 | 25 | stG5420 | G | x | x | x |
| 25 | 25 | stG6 | G | x | x | |
| 29 | 29 | stC74a | G | x | x | x |
| 49 | 49 | stC36 | C | x | x | |
Figure 3. Maximum parsimony tree of concatenated housekeeping alleles.
The housekeeping alleles for each of the 80 STs for SDSE were concatenated (3,134 nt positions), and a maximum parsimony tree was constructed. The radial, unrooted phylogenetic tree is shown. Bootstrap values (500 replicates) showing branch support equal to or greater than 80% are indicated; bootstrap analysis used a heuristic search and the 50% majority-rule consensus tree is presented. STs representing GCS and GGS are depicted in red and blue, respectively; the single group L isolate (ST1) is depicted in green. CCs having three or more STs are indicated. Of the 3,134 characters, 2,937 are constant, 56 are variable but parsimony-uninformative, and 141 are parsimony-informative. Consistency index (CI) = 0.3350; CI excluding uninformative characters = 0.2669; retention index (RI) = 0.7926.
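The character counts in this caption can be cross-checked against the fragment sizes listed in the housekeeping gene table. The sketch below is plain arithmetic on numbers quoted in this record; the variable names are illustrative.

```python
# Partial gene sizes (bp) from the housekeeping gene table, in table order.
fragment_sizes_bp = [498, 450, 438, 405, 459, 450, 434]
concatenated_length = sum(fragment_sizes_bp)

# Character classes reported in the figure caption.
constant = 2937
parsimony_uninformative = 56
parsimony_informative = 141
total_characters = constant + parsimony_uninformative + parsimony_informative

# Fraction of alignment positions that vary among the STs.
variable_fraction = (parsimony_uninformative + parsimony_informative) / total_characters

print("concatenated length:", concatenated_length)
print("characters accounted for:", total_characters)
print(f"variable positions: {100 * variable_fraction:.1f}%")
```

Both totals come to 3,134, so the seven fragments account for every position in the concatenated alignment.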