| Literature DB >> 22666370 |
Tristan Lefébure1, Vince P Richards, Ping Lang, Paulina Pavinski-Bitar, Michael J Stanhope.
Abstract
Streptococcus pyogenes, is an important human pathogen classified within the pyogenic group of streptococci, exclusively adapted to the human host. Our goal was to employ a comparative evolutionary approach to better understand the genomic events concomitant with S. pyogenes human adaptation. As part of ascertaining these events, we sequenced the genome of one of the potential sister species, the agricultural pathogen S. canis, and combined it in a comparative genomics reconciliation analysis with two other closely related species, Streptococcus dysgalactiae and Streptococcus equi, to determine the genes that were gained and lost during S. pyogenes evolution. Genome wide phylogenetic analyses involving 15 Streptococcus species provided convincing support for a clade of S. equi, S. pyogenes, S. dysgalactiae, and S. canis and suggested that the most likely S. pyogenes sister species was S. dysgalactiae. The reconciliation analysis identified 113 genes that were gained on the lineage leading to S. pyogenes. Almost half (46%) of these gained genes were phage associated and 14 showed significant matches to experimentally verified bacteria virulence factors. Subsequent to the origin of S. pyogenes, over half of the phage associated genes were involved in 90 different LGT events, mostly involving different strains of S. pyogenes, but with a high proportion involving the horse specific pathogen S. equi subsp. equi, with the directionality almost exclusively (86%) in the S. pyogenes to S. equi direction. Streptococcus agalactiae appears to have played an important role in the evolution of S. pyogenes with a high proportion of LGTs originating from this species. Overall the analysis suggests that S. pyogenes adaptation to the human host was achieved in part by (i) the integration of new virulence factors (e.g. speB, and the sal locus) and (ii) the construction of new regulation networks (e.g. rgg, and to some extent speB).Entities:
Mesh:
Year: 2012 PMID: 22666370 PMCID: PMC3364286 DOI: 10.1371/journal.pone.0037607
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Streptococcus genomes used in the study.
| Taxa | Locus tag | Species ID | Accession number |
|
| sag_2603 | sag1 | NC_004116 |
|
| sag_A909 | sag2 | NC_004368 |
|
| sag_NEM | sag3 | NC_004368 |
|
| sde_GGS | sde | NC_012891 |
|
| seq_4047 | seq | NC_012471 |
|
| sez | sez1 | NC_012470 |
|
| sez_M | sez2 | NC_011134 |
|
| sgo | sgo | NC_009785 |
|
| smu | smu | NC_004350 |
|
| spn | spn | NC_003028 |
|
| spy_M1 | spy1 | NC_002737 |
|
| spy_Man | spy2 | NC_009332 |
|
| spy_10270 | spy3 | NC_008022 |
|
| spy_10394 | spy4 | NC_006086 |
|
| spy_10750 | spy5 | NC_008024 |
|
| spy_2065 | spy6 | NC_008023 |
|
| spy_315 | spy7 | NC_004070 |
|
| spy_5005 | spy8 | NC_007297 |
|
| spy_6180 | spy9 | NC_007296 |
|
| spy_8232 | spya | NC_003485 |
|
| spy_9429 | spyb | NC_008021 |
|
| spy_NZ | spyc | NC_011375 |
|
| spy_SSI | spyd | NC_004606 |
|
| san | san | NC_009009 |
|
| ssu | ssu | NC_012924 |
|
| sth | sth | NC_006448 |
|
| sub | sub | NC_012004 |
|
| sca | sca | AIDX00000000 |
|
| GALLO | sga | NC_013798 |
|
| smi | smi | NC_013853 |
Gene tree support and concordance factors for three concurrent species trees.
| Strep core (701 genes) | 4 sp core (1072 genes) | ||||
| Topologies | Gene tree | Bootstrap>90% | Gene tree | Bootstrap>90% | Concordance factors |
| ((sde,spy),sca),seq; | 277 (39.5%) | 262 (37.4%) | 442 (41.2%) | 430 (40.1%) | 0.443 (0.409,0.477) |
| ((sca,spy),sde),seq; | 161 (23.0%) | 151 (21.5%) | 319 (29.8%) | 305 (28.5%) | 0.293 (0.262,0.326) |
| ((sca,sde),spy),seq; | 201 (28.7%) | 188 (26.8%) | 281 (26.2%) | 273 (25.5%) | 0.263 (0.233,0.295) |
“Strep core” refers to the core genome of the genus Streptococcus based on the species included in Table 1. “4 sp core” refers to the core genome based on a consideration of only the 4 species involved in these three topologies. The ‘Gene tree’ column refers to the number of gene trees displaying the specified topology, while the ‘Bootstrap’ column presents the number of genes supporting this bipartition with a bootstrap score greater than 90%. The last column includes the concordance factors with 95% confidence intervals in parentheses.
Figure 1Total evidence trees using the Streptococcus one-to-one core genes (701 orthologs, A) and the four species one-to-one core genes (1072 orthologs, B).
On the branches are reported the gene-tree support, as well as the genome wide concordance factors (in bold) with their 95% credibility intervals for the four species analysis. For both trees, total evidence topologies are identical to gene-tree majority rule consensus topologies, as well as the primary concordance topology for the four species tree.
Figure 2The seven topologies and the number of genes mapped to them, found by the Bayesian gene tree concordance analysis.
Figure 3Multidimensional scaling representation of 1072 core genes.
Similarity between pairs of genes is measured as the probability that two genes share the same topology. Genes are colored according to their most probable topology, with black referring to topology 1, red is topology 2, and green topology 3; blue refers to other rare topologies (see Table 2 for topology descriptions).
Genes gained on the Streptococcus pyogenes branch.
| Locus tag | LGT | Gene | Annotated product |
| 19 | sag |
| phosphoribosylaminoimidazole-succinocarboxamide synthase |
| 61 | sag |
| adenylate kinase |
| 110 | butyrate-acetoacetate CoA-transferase, beta subunit | ||
| 155 |
| pyrogenic exotoxin G precursor | |
| 377 | putative positive transcriptional regulator | ||
| 378 |
| shikimate 5-dehydrogenase | |
| 386 | sga | putative efflux protein | |
| 454 | sag |
| peptide chain release factor 2 |
| 458 | sag | acetoin reductase | |
| 461 | sag |
| asparaginyl-tRNA synthetase |
| 526 |
| putative glycosyltransferase | |
| 591 | 5′-nucleotidase | ||
| 684 | putative repressor - phage associated | ||
| 725 |
| hyaluronidase - phage associated | |
| 799 | sag |
| peptide chain release factor 1 |
| 923 | putative holin - phage associated | ||
| 1013 |
| nicotinate-nucleotide pyrophosphorylase | |
| 1014 | putative integral membrane protein | ||
| 1101 |
| hyaluronoglucosaminidase - phage associated | |
| 1141 |
| putative excisionase - phage associated | |
| 1201 |
| putative two-component sensor histidine kinase | |
| 1205 |
| streptococcal pyrogenic exotoxin SpeK - phage associated | |
| 1221 | putative minor capsid protein - phage associated | ||
| 1223 | putative minor capsid protein - phage associated | ||
| 1225 | putative major capsid/head protein - phage associated | ||
| 1229 | putative minor capsid protein - phage associated | ||
| 1235 | putative ABC transporter ATP-binding protein - phage associated | ||
| 1249 | putative single-strand DNA-binding protein - phage associated | ||
| 1301 |
| exotoxin type A precursor - phage associated | |
| 1349 | putative Cro-like repressor - phage associated | ||
| 1465 |
| NADH peroxidase | |
| 1483 |
| putative tagatose-6-phosphate kinase | |
| 1612 | sag |
| putative nucleoside transporter |
| 1624 | sag |
| phosphoglycerate kinase |
| 1645 |
| putative response regulator of salavaricin regulon | |
| 1652 |
| lantibiotic salivaricin A precursor | |
| 1660 |
| putative lactose phosphotransferase system repressor protein | |
| 1720 |
| putative dipeptide ABC transporter, permease protein | |
| 1722 | sag |
| putative dipeptide ABC transporter, ATP-binding protein |
| 1733 | putative two-component response regulator | ||
| 1740 |
| foldase protein PrsA | |
| 1742 |
| pyrogenic exotoxin B | |
| 1744 |
| putative transcription regulator | |
| 1773 |
| urocanate hydratase | |
| 1775 | formiminotetrahydrofolate cyclodeaminase | ||
| 1779 |
| histidine ammonia-lyase |
When the gene donor was identified as another Streptococcus species, the name of the donor is documented in the “LGT” column. Locus tag refers to the MGAS315 genome (SpyM3). Genes marked with an asterisk have significant BLASTp matches to established pathogenic bacteria virulence factors within the Virulence Factors of Pathogenic Bacteria database. LGT column shows donor species for genes gained via lateral gene transfer as detected by AnGST (see Table 1 for species IDs). Gene gains that were not robust to the species tree variation are not shown.
Distribution of 52 phage genes gained on the S. pyogenes branch for nine strains that possessed phage descriptions in their annotations.
| Gene ID | Strain ID | ||||||||
| 315 | SF370 | 5005 | 6180 | 9429 | 10270 | 2096 | 10750 | NZ131 | |
| 1264* | 315.6 | 5005.3 | 6180.2 | 9428.3 | 10270.3 | 2096.2 | NZ131.2 | ||
| 1270* | 315.2 | 370.2 | 5005.2 | 9429.1 | 10270.3 | 10750.3 | NZ131.3 | ||
| 1586* | 315.3 | 370.2 | 9429.2 | 10750.3 | |||||
| 1587* | 315.5 | 6180.1 | 9429.1 | 10270.2 | 10750.2 | ||||
| 1590* | 315.5 | 6180.1 | 9429.1 | 10270.2 | 10750.2 | ||||
| 1687* | 315.4 | 370.1 | 6180.2 | 10270.3 | 2096.1 | ||||
| 1710* | 315.3 | 370.2 | 5005.2 | 9429.2 | 10750.3 | NZ131.2 | |||
| 1792* | 315.6 | 5005.2 | 6180.2 | 10270.3 | NZ131.2 | ||||
| 1818* | 315.4 | 5005.1 | 6180.2 | 10270.3 | NZ131.3 | ||||
| 1862* | 315.4 | 370.1 | 6180.2 | 10270.3 | 2096.1 | ||||
| 1863* | 315.4 | 370.1 | 6180.2 | 10270.3 | 2096.1 | ||||
| 1864* | 315.4 | 370.1 | 6180.2 | 10270.3 | 2096.1 | ||||
| 1865* | 315.4 | 370.1 | 6180.2 | 10270.3 | 2096.1 | ||||
| 1867* | 315.4 | 370.1 | 6180.2 | 10270.3 | 2096.1 | ||||
| 1875* | 315.1 | 370.3 | 6180.1 | 9429.2 | NZ131.3 | ||||
| 1906* | 315.1 | 370.3 | 6180.1 | 9429.2 | NZ131.3 | ||||
| 1915* | 315.4 | 5005.1 | 6180.2 | 10270.3 | NZ131.3 | ||||
| 1939* | 315.3 | 370.2 | 5005.2 | 10750.3 | |||||
| 1957* | 315.2 | 9429.1 | 10270.1 | 10750.1 | |||||
| 1969* | 315.4 | 6180.2 | 10270.3 | NZ131.3 | |||||
| 1970* | 315.4 | 6180.2 | 10270.3 | NZ131.3 | |||||
| 1975* | 5005.3 | 9428.3 | 2096.2 | NZ131.2 | |||||
| 1977* | 315.6 | 5005.3 | 9428.3 | 2096.2 | NZ131.2 | ||||
| 1993* | 315.4 | 370.1 | 6180.2 | 10270.3 | 2096.1 | ||||
| 2094 | 315.3 | 370.2 | 5005.2 | 10750.3 | |||||
| 2121* | 315.1 | NZ131.3 | |||||||
| 2160* | 315.5 | 5005.1 | 6180.1 | ||||||
| 2191* | 315.4 | 6180.2 | 10270.3 | ||||||
| 2234* | 315.6 | 5005.3 | 9428.3 | 2096.2 | |||||
| 2260* | 315.5 | 370.1 | 10270.2 | 2096.1 | 10750.2 | ||||
| 2266* | 315.2 | 370.3 | 5005.2 | 6180.1 | 9429.2 | ||||
| 2276* | 315.6 | 6180.2 | 10270.3 | ||||||
| 2297 | 315.4 | 6180.2 | 10270.3 | 10750.2 | |||||
| 2441* | 315.5 | 9429.1 | 10270.1 | 10750.1 | |||||
| 2621 | 5005.1 | ||||||||
| 2667 | 315.2 | 10270.2 | 10750.2 | NZ131.3 | |||||
| 2722 | 315.3 | 370.2 | 10750.3 | ||||||
| 2730 | 315.2 | 10270.1 | 10750.1 | ||||||
| 2731 | 315.2 | 9429.1 | 10270.1 | 10750.1 | |||||
| 2733* | 5005.1 | 10270.2 | 10750.2 | ||||||
| 2735 | 315.4 | 6180.2 | 10270.3 | ||||||
| 2736 | 315.4 | 6180.2 | 10270.3 | ||||||
| 2745 | 315.2 | 5005.1 | 6180.1 | NZ131.3 | |||||
| 3051 | 315.5 | 10270.2 | 10750.2 | ||||||
| 3069 | 315.5 | 10270.2 | 10750.2 | ||||||
| 3070 | 315.4 | 6180.2 | 10270.3 | ||||||
| 3071 | 315.4 | 6180.2 | 10270.3 | ||||||
| 3072 | 315.4 | 6180.2 | 10270.3 | ||||||
| 3080 | 315.3 | 10750.3 | |||||||
| 3081 | 315.3 | 10750.3 | |||||||
| 3087 | 315.2 | 6180.1 | 9429.1 | ||||||
| 3088 | 315.4 | 6180.2 | |||||||
Columns under strain IDs contain phage IDs. Asterisks mark genes that experienced LGT subsequent to being gained on the S. pyogenes branch. The strain ID prefix MGAS was omitted to save space.
Figure 4Circular map of the S. pyogenes MGAS315 genome showing location of gained and LGT genes.
A color legend for each ring (1 to 15) appears at the top of the figure. The figure highlights the S. pyogenes core-genome distribution (tracks 7 and 8) and islands of gained genes (tracks 3 and 4). MGAS315 is chosen simply as a reference genome to map the results of the analysis.