| Literature DB >> 25888688 |
Guillaume Méric1, Maria Miragaia2, Mark de Been3, Koji Yahara4, Ben Pascoe5, Leonardos Mageiros1, Jane Mikhail1, Llinos G Harris1, Thomas S Wilkinson1, Joana Rolo2, Sarah Lamble6, James E Bray7, Keith A Jolley7, William P Hanage8, Rory Bowden6, Martin C J Maiden7, Dietrich Mack9, Hermínia de Lencastre10, Edward J Feil1, Jukka Corander1, Samuel K Sheppard11.
Abstract
The opportunistic pathogens Staphylococcus aureus and Staphylococcus epidermidis represent major causes of severe nosocomial infection, and are associated with high levels of mortality and morbidity worldwide. These species are both common commensals on the human skin and in the nasal pharynx, but are genetically distinct, differing at 24% average nucleotide divergence in 1,478 core genes. To better understand the genome dynamics of these ecologically similar staphylococcal species, we carried out a comparative analysis of 324 S. aureus and S. epidermidis genomes, including 83 novel S. epidermidis sequences. A reference pan-genome approach and whole genome multilocus-sequence typing revealed that around half of the genome was shared between the species. Based on a BratNextGen analysis, homologous recombination was found to have impacted on 40% of the core genes in S. epidermidis, but on only 24% of the core genes in S. aureus. Homologous recombination between the species is rare, with a maximum of nine gene alleles shared between any two S. epidermidis and S. aureus isolates. In contrast, there was considerable interspecies admixture of mobile elements, in particular genes associated with the SaPIn1 pathogenicity island, metal detoxification, and the methicillin-resistance island SCCmec. Our data and analysis provide a context for considering the nature of recombinational boundaries between S. aureus and S. epidermidis and, the selective forces that influence realized recombination between these species.Entities:
Keywords: Staphylococcus; ecology; evolution; nosocomial infections; recombination
Mesh:
Year: 2015 PMID: 25888688 PMCID: PMC4453061 DOI: 10.1093/gbe/evv066
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
FCore and accessory genome variation in S. aureus and S. epidermidis. (A) Overlap between the core and accessory genomes calculated in 324 S. aureus and S. epidermidis genomes. Core genes were defined as being present in 100% isolates, accessory genes in less than 100% but more than 0%, and absent in 0%. The radius of each circle is proportional to the number of detected genes. (B) Core genome nucleotide divergence of representative pairs of S. aureus and S. epidermidis strains between each other. The numbers indicate the range of calculated nucleotide divergence for at least three pairs of strains, and the error bars indicate standard error of the mean. (C) Tajima’s D values for 1,225 core genes shared by both species. The number of genes with different combinations of positive and negative D values is indicated in each quadrant. (D) Watterson’s estimator values for each 1,225 core gene shared by both species. More genes showed a higher θ value in S. epidermidis than in S. aureus.
FPopulation structure of 181 S. aureus and 143 S. epidermidis isolates. A phylogenetic tree constructed from 1,478 genes (2,066,448 bp) found in both species using an approximation of the maximum-likelihood algorithm, implemented in FastTree2. The scale represents the number of substitutions per site. The branch linking S. aureus (left) to S. epidermidis (right) is 0.265 substitutions per site long, corresponding to 537,276 variable sites. Isolates are colored according to clonal complexes and asterisks denote clonal complexes that appear paraphyletic on this tree, including the S. aureus CC-1 and the S. epidermidis CC-365.
FPairwise core and accessory genome variation in S. aureus and S. epidermidis. Matrices show pairwise comparison between 324 isolates ordered according to the phylogenetic tree presented in figure 1. Core genome similarity is based upon the number of shared alleles at 1,478 loci found in all isolates. Accessory genome similarity based upon gene presence or absence at 2,008 noncore loci. The heat-map coloring ranges from white, through red, to blue (maximum). The minimum number of shared alleles in the core genome was 0 and the maximum was 1,478. The minimum number of shared accessory genes was 232 and the maximum was 1,204.
FPopulation structure and core genome recombination in S. aureus and S. epidermidis. Phylogenetic tree reconstructions from core gene alignments, using an approximation of the maximum-likelihood algorithm, for (A) 181 S. aureus and (B) 143 S. epidermidis genomes. Tree branches are colored according to recombining population clusters inferred by BNG and histograms indicate the total length of recombining sequence inferred. (C) The distributions of unique alleles of recombining (R; light gray) and nonrecombining (NR; white) core genes. Asterisks indicate a statistical significance of P < 0.0001, n.s. denotes no significance.
Genes for Which Single-gene Phylogenies do not Segregate Clearly by Species (S. aureus / S. epidermidis)
| Gene Name | Alias | Homologs in Other Annotated Genomes | Description | Prevalence in | Prevalence in | Frequency of Species Overlap | ||
|---|---|---|---|---|---|---|---|---|
| SERP1173 | IS200 family transposase | 60 | 143 | 1.000 | 1,208,761 | |||
| SERP2221 | cadmium resistance family protein | 75 | 143 | 1.000 | 2,258,410 | |||
| SERP2240 | IS200 family transposase | 60 | 143 | 1.000 | 2,268,628 | |||
| SERP2521 | Methicillin resistance penicillin-binding protein MecA | 110 | 74 | 1.000 | 2,578,662 | |||
| SERP1583 | Mobile element protein | 138 | 136 | 0.993 | 1,612,684 | |||
| SERP2501 | SA0056, SE0055 | hypothetical protein | 98 | 103 | 0.990 | 2,561,164 | ||
| SERP2422 | FIG01108158: hypothetical protein | 181 | 142 | 0.979 | 2,482,195 | |||
| SERP1588 | Mobile element protein | 139 | 139 | 0.971 | 1,617,610 | |||
| SAR1828 | SH1761 ( | DNA-invertase transposon Tn552 | 121 | 75 | 0.960 | 1,911,329 | ||
| SAR0374 | SA1829, SACOL0896 | Hypothetical SAV0790 homolog in superantigen-encoding pathogenicity islands SaPI | 84 | 47 | 0.957 | 414,760 | ||
| SERP1579 | Mobile element protein | 137 | 131 | 0.954 | 1,610,300 | |||
| SERP2526 | Mobile element protein | 137 | 131 | 0.947 | 2,581,912 | |||
| SERP2499 | Cassette chromosome recombinase B | 108 | 93 | 0.903 | 2,560,296 | |||
| SERP2467 | FIG01108090: hypothetical protein | 113 | 123 | 0.885 | 2,521,017 | |||
| SERP2504 | SA0054, SE0030 | FIG01108090: hypothetical protein | 113 | 123 | 0.876 | 2,562,086 | ||
| SERP2496 | SA0059 | hypothetical protein | 107 | 86 | 0.872 | 2,557,063 | ||
| SERP2498 | Cassette chromosome recombinase A | 108 | 84 | 0.857 | 2,558,646 | |||
| SERP0249 | SA0026, SA0034, SE0071, SE0079, SE0090 | IS431mec-like transposase | 137 | 131 | 0.825 | 257,823 | ||
| SAR1893 | SACOL1857 | hypothetical protein [Genomic island νSaβ2] | 110 | 44 | 0.750 | 1,977,960 | ||
| SAR0382 | SA1820, SAV0800 | Putative terminase, superantigen-encoding pathogenicity islands SaPI | 121 | 92 | 0.728 | 421,037 | ||
| SERP2495 | SA0060 | FIG01108228: hypothetical protein | 72 | 48 | 0.667 | 2,555,270 | ||
| SAR0377 | SA1826, SACOL0900 | Hypothetical SAV0794 homolog in superantigen-encoding pathogenicity islands SaPI | 91 | 56 | 0.643 | 417,811 | ||
| SERP2453 | hypothetical protein | 181 | 142 | 0.585 | 2,506,887 | |||
| SERP2431 | SAR0692 | Arsenate reductase (EC 1.20.4.1) | 105 | 139 | 0.581 | 2,489,753 | ||
| SERP0209 | SAR0692 | Arsenate reductase (EC 1.20.4.1) | 105 | 143 | 0.559 | 221,772 | ||
| SAR2594 | SE0212, SA2302 | ABC transporter ATP-binding protein | 181 | 74 | 0.554 | 2,679,952 | ||
| SERP2503 | SA0057 | FIG01107894: hypothetical protein | 108 | 108 | 0.546 | 2,561,562 | ||
| SERP2423 | FIG01108158: hypothetical protein | 181 | 142 | 0.536 | 2,483,315 | |||
| SERP2465 | lipoprotein | 181 | 142 | 0.521 | 2,519,564 | |||
| SERP1460 | Beta-lactamase regulatory sensor-transducer BlaR1 | 132 | 115 | 0.513 | 1,526,822 | |||
| SERP2515 | SA0044 | Disulfide bond regulator | 124 | 111 | 0.468 | 2,571,330 | ||
| SERP0265 | Mobile element protein | 173 | 142 | 0.462 | 272,732 | |||
| SAR2595 | SA2303, SE0213 | Membrane spanning protein | 181 | 61 | 0.443 | 2,680,724 | ||
| SERP2520 | Methicillin resistance regulatory sensor-transducer MecR1 | 104 | 69 | 0.413 | 2,576,556 | |||
| SERP1370 | Cadmium resistance protein | 75 | 143 | 0.350 | 1,428,085 | |||
| SERP2434 | SA0046 | Disulfide bond regulator | 124 | 111 | 0.315 | 2,492,038 | ||
| SERP2452 | Conserved domain protein | 62 | 79 | 0.304 | 2,506,105 | |||
| SERP0479 | Mobile element protein | 162 | 143 | 0.284 | 476,253 | |||
| SERP2491 | SA1014 | hypothetical protein | 181 | 141 | 0.276 | 2,551,832 | ||
| SAR2392 | SA2101 | Hypothetical protein | 181 | 142 | 0.254 | 2,461,526 | ||
| SERP2435 | SA0043 | Zn-dependent hydroxyacylglutathione hydrolase / Polysulfide binding protein | 127 | 110 | 0.181 | 2,493,385 | ||
| SAR0698 | SACOL0036, SE0629 | Putative transposase | 138 | 109 | 0.174 | 743,744 | ||
| SERP2513 | SA0046 | FIG003846: hypothetical protein | 124 | 112 | 0.143 | 2,569,855 | ||
| SERP2443 | FIG01110589: hypothetical protein | 181 | 142 | 0.127 | 2,500,392 | |||
| SERP2429 | SA1591, SAR0690, SE0136 | Arsenical resistance operon repressor | 181 | 134 | 0.119 | 2,488,047 | ||
| SAR0445 | SA0401 | Tandem lipoprotein within Pathogenicity island | 173 | 99 | 0.104 | 473,389 | ||
| SERP2432 | SA0080 | FIG003846: hypothetical protein | 124 | 112 | 0.054 | 2,490,572 | ||
| SERP2451 | Tandem lipoprotein within Pathogenicity island | 181 | 139 | 0.044 | 2,505,694 | |||
| SERP2418 | FIG01109629: hypothetical protein | 180 | 128 | 0.033 | 2,479,100 | |||
| SAR0248 | SA0243, SA0247, SE0321, SERP0198 | Teichoic acid biosynthesis protein B | 179 | 142 | 0.021 | 290,744 | ||
| SERP2445 | FIG01108771: hypothetical protein | 181 | 142 | 0.021 | 2,501,497 | |||
| SERP0369 | Mobile element protein | 176 | 143 | 0.017 | 367,885 | |||
| SERP0478 | Mobile element protein | 176 | 143 | 0.017 | 476,113 | |||
| SERP1393 | Mobile element protein | 176 | 143 | 0.017 | 1,449,378 | |||
| SERP1995 | Mobile element protein | 176 | 143 | 0.017 | 2,014,786 | |||
| SERP2002 | Mobile element protein | 176 | 143 | 0.017 | 2,020,995 | |||
| SERP2268 | Mobile element protein | 176 | 143 | 0.017 | 2,301,670 | |||
| SERP2323 | Mobile element protein | 176 | 143 | 0.017 | 2,360,017 | |||
| SERP2447 | FIG01108158: hypothetical protein | 181 | 142 | 0.017 | 2,502,826 | |||
| SERP2163 | Mobile element protein | 177 | 143 | 0.011 | 2,192,271 | |||
| SERP2390 | Staphopain A precursor (EC 3.4.22.48) | 180 | 141 | 0.006 | 2,441,865 | |||
| SERP0465 | Cobalt–zinc–cadmium resistance protein | 181 | 142 | 0.006 | — | 467,605 |
aAs inferred from the AureusDB website: http://aureusdb.biologie.uni-greifswald.de/ (last accessed April 22, 2015).
bPosition on the reference genome of S. aureus strain MRSA252 [GenBank accession: NC_002952].
cPosition on the reference genome of S. epidermidis strain RP62A [GenBank accession: NC_002976].
FGenomic position of recombined accessory genes in S. aureus and S. epidermidis based upon single-gene phylogenies. The frequency of admixed genes for 62 accessory genes with mixed ancestry phylogenies mapped to the corresponding locus in the reference S. epidermidis (RP62A) and S. aureus (MRSA252) genome. Example gene groups with high levels of interspecies gene flow are indicated.