| Literature DB >> 23918808 |
Shinya Watanabe, Teruo Kirikae, Tohru Miyoshi-Akiyama.
Abstract
Streptococcus dysgalactiae subsp. equisimilis (SDSE) is an emerging human pathogen that causes life-threatening invasive infections such as streptococcal toxic shock syndrome. Recent epidemiological studies reveal that invasive SDSE infections have been increasing in Asia, Europe, and the United States. Almost all SDSE carry Lancefield group G or C antigen. We have determined the complete genome sequence of a human group C SDSE 167 strain. A comparison of its sequence with that of four SDSE strains, three in Lancefield group G and one in Lancefield group A, showed approximately 90% coverage. Most regions showing little or no homology were located in the prophages. There was no evidence of massive rearrangement in the genome of SDSE 167. Bayesian phylogeny using entire genome sequences showed that the most recent common ancestor of the five SDSE strains appeared 446 years ago. Interestingly, we found that SDSE 167 harbors sugar metabolizing enzymes in a unique region and streptodornase in the phage region, which presumably contribute to the degradation of host tissues and the prompted covRS mutation, respectively. A comparison of these five SDSE strains, which differ in Lancefield group antigens, revealed a gene cluster presumably responsible for the synthesis of the antigenic determinant. These results may provide the basis for molecular epidemiological research of SDSE.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23918808 PMCID: PMC3787669 DOI: 10.1093/gbe/evt117
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
SDSE Strains Completely Sequenced
| Strain | 167 | AC-2713 | ATCC 12394 | GGS_124 | RE378 |
|---|---|---|---|---|---|
| Length (nt) | 2,076,397 | 2,179,445 | 2,159,491 | 2,106,340 | 2,151,145 |
| G + C% | 39.57 | 39.52 | 39.5 | 39.58 | 39.49 |
| CDS | 2,223 | 221 | 2,056 | 2,094 | 1,877 |
| rRNA operon | 5 | 5 | 5 | 5 | 5 |
| tRNA | 56 | 57 | 57 | 57 | 56 |
| Lancefield | C | A | G | G | G |
| Acc. no. | AP012976 | NC_019042 | NC_017567 | NC_012891 | NC_018712 |
| Isolation year | 2003 | 1999 | 1939 | 2006 | 2007 |
| Locus tag of putative enzyme cluster synthesizing Lancefield antigen determinant | SDSE167_0822 to SDSE167_0826 | SDSE_0797 to SDSE_0799 | SDE12394_04095 to 04110 | SDEG_0759 to SDEG_0762 | GGS_0731 to GGS_0734 |
| The Blast hits | Group C SDSE, | Group A SDSE, | Group G SDSE | Group G SDSE | Group G SDSE |
| Reference | This study |
Note.—Overview of SDSE strains used for comparisons with the 167 genome.
FCircular representation of the genome of Streptococcus dysgalactiae subsp. equisimilis strain 167. Circle 1 (outermost circle) indicates the distances from the putative origin of replication. Circles 2 and 3 show annotated CDS encoded by the forward (light blue) and reverse (pink) chromosomal strands, respectively. Circle 4 shows the rrs operons. Circle 5 shows prophages (green). Circle 6 shows unique regions found in 167 other than in phages, including regions encoding enzymes involved in sugar transfer and sugar metabolism and the FCT region. Circle 7 (innermost circle) shows the G + C content with more and less than average (0.40) in purple and brown, respectively.
Distribution of Prophages among SDSE Strains and Genes Carried by Each Prophage
| Strain | Phage No. | Length | Best Hit of Blast | Comment |
|---|---|---|---|---|
| 167 | Φ1 | 37,972 | Streptodornase (Sdn) | |
| Putative cell wall hydrolase, lysin | ||||
| Putative holin | ||||
| Putative hyaluronidase | ||||
| Head maturation protease | ||||
| Site-specific recombinase | ||||
| Putative transcriptional activator | ||||
| Putative C5 methylase MarMP1 | ||||
| Single-strand binding protein | ||||
| Putative replisome organizer | ||||
| Φ2 | 18,822 | Defective | ||
| AC-2713 | Φ1 | 10,880 | gp44 clamp loader subunit | Defective |
| Φ2 | 36,966 | Site-specific recombinase | ||
| Putative cell wall hydrolase, lysin | ||||
| Holin | ||||
| Putative platelet-binding protein | ||||
| ClpP-like protease | ||||
| Putative portal protein | ||||
| Putative DNA methylase | ||||
| Transferase | ||||
| Putative helicase | ||||
| Putative DNA polymerase A domain | ||||
| Φ3 | 5,854 | DNA cytosine methylase | Defective | |
| Transcriptional regulator | ||||
| Putative repressor protein | ||||
| Φ4 | 38,710 | Putative helicase | ||
| Site-specific recombinase | ||||
| Putative cell wall hydrolase, lysin | ||||
| Amidase | ||||
| Holin | ||||
| PblB | ||||
| Putative DNA methylase | ||||
| Endonuclease of the HNH family with predicted DNA-binding module at C-terminus | ||||
| Transferase | ||||
| Putative helicase | ||||
| Phi APSE P51-like protein | ||||
| Putative DNA polymerase A domain | ||||
| Φ5 | 14,532 | Defective | ||
| Φ6 | 5,920 | Csp | Defective | |
| Putative DNA polymerase III delta prime subunit | ||||
| ATCC12394 | Φ1 | 10,872 | gp44 clamp loader subunit | Defective |
| Φ2 | 11,328 | DNA polymerase accessory protein | Defective | |
| Φ3 | 11,990 | IMPB | Defective | |
| Putative methyltransferase-endonuclease | ||||
| Φ4 | 28,611 | Putative DNA polymerase III delta prime subunit | ||
| GGS_124 | Φ1 | 10,897 | gp44 clamp loader subunit | Defective |
| Φ2 | 18,298 | Putative DNA binding protein | Defective | |
| Putative DNA methylase | ||||
| Φ3 | 41,484 | Streptodornase | ||
| Cell wall hydrolase | Defective | |||
| Putative holin protein | ||||
| Putative hyaluronidase | ||||
| Putative platelet-binding protein | ||||
| Putative human platelet-binding | ||||
| Putative endodeoxyribonuclease | ||||
| Putative recombinase | ||||
| Φ4 | 10,774 | Site-specific recombinase | Defective | |
| Φ5 | 35,572 | Putative | ||
| Putative holin | ||||
| Putative endodeoxyribonuclease | ||||
| Putative platelet-binding protein | ||||
| Recombination protein | ||||
| Φ6 | 19,428 | Putative DNA polymerase III delta prime subunit | Defective | |
| RE378 | Φ1 | 8,852 | Putative helicase | Defective |
Note.—Prophage regions of the SDSE strains were identified using Prophage Finder (Bose and Barber 2006). Proteins encoded by the prophage were analyzed by the Blast algorithm, with the best hits listed. Only the Blast results not categorized as “hypothetical protein” are listed in the table.
FGenome rearrangement maps of S. dysgalactiae subsp. equisimilis 167 with four other SDSE strains. Genome rearrangement maps were prepared with entire genomes (a) or after omitting the phage regions (b) using in silico molecular cloning. Sequences were aligned from the predicted replication origin of each genome. Red and green represent the highest and lowest identity of nucleotide sequences, respectively.
FBayesian phylogenetic trees of SDSE. Phylogenetic trees were prepared with entire genomes (a) or after omitting the phage regions (b) using BEAST (Drummond and Rambaut 2007) and visualized using FigTree. Estimated ages of subclades are shown as the median values. ESS values of all parameters in BEAST are more than 200. The posterior probability value for each branch was 1.
Distribution of CRISPR and Cas Genes among the SDSE Strains
| Strain | CRISPR ID | No. of Spacers | Cas Genes |
|---|---|---|---|
| 167 | tmp_1_Possible CRISPR_1 | 1 | No hits |
| tmp_1_Possible CRISPR_2 | 1 | No hits | |
| AC-2713 | tmp_1_Probable CRISPR_2 | 19 | Csn1 |
| tmp_1_Probable CRISPR_3 | 14 | Cas3 | |
| tmp_1_Possible CRISPR_1 | 1 | Csn1 | |
| ATCC12394 | tmp_1_Probable CRISPR_2 | 25 | Csn1 |
| tmp_1_Probable CRISPR_3 | 29 | Cas3 | |
| tmp_1_Possible CRISPR_1 | 2 | No hits | |
| tmp_1_Possible CRISPR_4 | 1 | No hits | |
| GGS_124 | tmp_1_Probable CRISPR_2 | 18 | Csn1 |
| tmp_1_Possible CRISPR_1 | 1 | No hits | |
| tmp_1_Possible CRISPR_3 | 1 | No hits | |
| RE378 | tmp_1_Probable CRISPR_2 | 7 | Csn1 |
| tmp_1_Probable CRISPR_3 | 13 | Cas3 | |
| tmp_1_Possible CRISPR_1 | 1 | No hits | |
| tmp_1_Possible CRISPR_4 | 1 | No hits | |
Note.—CRISPR and Cas genes among the SDSE strains were identified using CRISPRfinder (Grissa et al. 2007). Possible CRISPR, probable CRISPR, number of spacers, and type of Cas are presented.