| Literature DB >> 29720527 |
Jolene R Bowers1, Elizabeth M Driebe2, Valerie Albrecht3, Linda K McDougal3, Mitchell Granade3, Chandler C Roe2, Darrin Lemmer2, J Kamile Rasheed3, David M Engelthaler2, Paul Keim2, Brandi M Limbago3.
Abstract
Strains of Staphylococcus aureus in clonal complex 8 (CC8), including USA300, USA500, and the Iberian clone, are prevalent pathogens in the United States, both inside and outside health care settings. Methods for typing CC8 strains are becoming obsolete as the strains evolve and diversify, and whole-genome sequencing has shown that some strain types fall into multiple sublineages within CC8. In this study, we attempt to clarify the strain nomenclature of CC8, classifying the major strain types based on whole-genome sequence phylogenetics using both methicillin-resistant S. aureus (MRSA) and methicillin-susceptible S. aureus (MSSA) genomes. We show that isolates of the Archaic and Iberian clones from decades ago make up the most basal clade of the main CC8 lineages and that at least one successful lineage of CC8, made up mostly of MSSA, diverged before the other well-known strain types USA500 and USA300. We also show that the USA500 type includes two clades separated by the previously described "Canadian epidemic MRSA" strain CMRSA9, that one clade containing USA500 also contains the USA300 clade, and that the USA300-0114 strain type is not a monophyletic group. Additionally, we present a rapid, simple CC8 strain-typing scheme using real-time PCR assays that target single nucleotide polymorphisms (SNPs) derived from our CC8 phylogeny and show the significant benefit of using more stable genomic markers based on evolutionary lineages over traditional S. aureus typing techniques. This more accurate and accessible S. aureus typing system may improve surveillance and better inform the epidemiology of this very important pathogen.IMPORTANCEStaphylococcus aureus is a major human pathogen worldwide in both community and health care settings. Surveillance for S. aureus strains is important to our understanding of their spread and to informing infection prevention and control. Confusion surrounding the strain nomenclature of one of the most prevalent lineages of S. aureus, clonal complex 8 (CC8), and the imprecision of current tools for typing S. aureus make surveillance and source tracing difficult and sometimes misleading. In this study, we clarify the CC8 strain designations and propose a new typing scheme for CC8 isolates that is rapid and easy to use. This typing scheme is based on relatively stable genomic markers, and we demonstrate its superiority over traditional typing techniques. This scheme has the potential to greatly improve epidemiological investigations of S. aureus.Entities:
Keywords: CC8; MRSA; MSSA; Staphylococcus aureus; assay development; phylogeny; strain typing; whole genome
Mesh:
Year: 2018 PMID: 29720527 PMCID: PMC5932376 DOI: 10.1128/mSphere.00464-17
Source DB: PubMed Journal: mSphere ISSN: 2379-5042 Impact factor: 4.389
FIG 1 WGS-based maximum likelihood phylogeny (using the best-fit model TVMe+ASC) of 497 S. aureus isolate genomes showing the CC8 group in the context of the whole of S. aureus. This analysis includes 1,000 bootstraps of 275,242 total SNPs in a core genome size (the length of the reference genome covered by all samples, excluding repeated regions) of 1.84 Mbp. Regions of chromosomal exchange among lineages resulting in hybrid strains (e.g., ST239) were not excluded. Bootstrap values are 100%, except where indicated. Branches of the phylogeny on which SNPs were selected for assay development are marked with a red triangle.
Characteristics and reference isolates of lineages of CC8
| Traditional strain | Known isolate(s) | Reference(s) | Main | Main | Main | WGS- |
|---|---|---|---|---|---|---|
| ST239 | JKD6008, T0131, | III | t037, t431, t030 | ST239 | ST239 | |
| ST630 | Unknown | V | t377 | ST630 | ST630 | |
| Archaic | Newman, COL, | I | t051 | ST250 | CC8a | |
| Iberian | HPV107, PER34, EMRSA5, E2125, | I | t051 | ST247 | CC8a | |
| NCTC 8325, BR-VRSA | II, III, IVa | t334 | ST8, ST1181 | CC8b | ||
| USA500/Iberian | NRS385 (95938, NR-46071), | IV | t064 | ST8 | CC8c | |
| CMRSA9 | 01S-0965 | VIII | t008 | ST8 | CC8d | |
| USA500 | BD02-25, CA-224 | IV | t008 | ST8 | CC8e | |
| Early Branching | V2200, HUV05 | IV | t008 | ST8 | CC8e | |
| USA300-NAE | FPR3757, TCH1516 | IVa | t008 | ST8 | CC8f | |
| USA300-SAE | M121, CA12 | IVc | t008 | ST8 | CC8e |
t377 is based on this study, and SCCmec type V and t4549 are based on one MRSA isolate (44).
Each SCCmec type is based on one MRSA isolate. Most isolates in this clade are MSSA.
We have made the distinction between USA500 and USA500/Iberian, as recent work (including unpublished data from the CDC and data from Albrecht et al. [4]) has characterized new isolates based on the presence of sea and seb genes (7) and SCCmec type IV. See Materials and Methods for details.
FIG 2 WGS-based maximum likelihood phylogeny (using the best-fit model TVMe+ASC) of 348 genomes of S. aureus (229 MRSA and 119 MSSA) belonging to the Inner CC8 clade (excluding ST239 and ST630 genomes), illustrating the relationship structure of clinically important CC8 groups and showing that genetic marker inference (GMI) strain typing is not always indicative of genetic relationship. MSSA genomes, on branches colored orange (Hex DF5000, RGB [223,80,0]), are interspersed among MRSA genomes. This analysis includes 1,000 bootstraps of 13,988 SNPs. Nodes with bootstrap values of <90% are marked with purple triangles. The core genome size is 2.26 Mbp (78.8% of reference genome FPR3757). Branches of the phylogeny on which SNPs were selected for assay development are marked with a red triangle.
FIG 3 The genetic marker inference (GMI) methodology used for inferring S. aureus strain types by using genetic markers.
Comparison of typing S. aureus isolates by the genetic marker inference assay and real-time PCR SNP assays on unknown (not sequenced) samples
| Genetic marker | No. of isolates in category by real-time PCR SNP assay | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| CC8 | Clade | Clade | Clade | Group | Clade | ST239 | CC8 | Non- | Total | |
| MRSA | ||||||||||
| CC8 | 114 | 7 | 0 | 34 | 11 | 57 | 4 | 1 | 0 | 114 |
| CC8-Unknown | 9 | 7 (5) | 0 | 2 (0) | 0 | 0 | 0 | 0 | 0 | 9 |
| USA500/Iberian | 21 | 0 | 0 | 21 (3) | 0 | 0 | 0 | 0 | 0 | 21 |
| CMRSA9 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 (0) | 0 | 1 |
| USA500 | 22 | 0 | 0 | 11 (11) | 11 (3) | 0 | 0 | 0 | 0 | 22 |
| USA300 | 57 | 0 | 0 | 0 | 0 | 57 (1) | 0 | 0 | 0 | 57 |
| ST239 | 4 | 0 | 0 | 0 | 0 | 0 | 4 (0) | 0 | 0 | 4 |
| Non-CC8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 30 (0) | 30 |
| Total | 114 | 7 | 0 | 34 | 11 | 57 | 4 | 1 | 30 | 144 |
| MSSA | ||||||||||
| CC8 | 61 | 0 | 18 | 4 | 15 | 20 | 0 | 4 | 3 | 64 |
| CC8-Unknown | 45 | 0 | 18 (18) | 4 (4) | 15 (15) | 4 (4) | 0 | 4 (4) | 3 (3) | 48 |
| USA300 | 16 | 0 | 0 | 0 | 0 | 16 (0) | 0 | 0 | 0 | 16 |
| Total | 61 | 0 | 18 | 4 | 15 | 20 | 0 | 4 | 3 | 64 |
Numbers in parentheses are the number of isolates that were subsequently whole-genome sequenced to determine the true strain type.
Assays designed and validated in this study
| Clade, group, | Assay | Primer/probe | Probe labels | Sequence | Product |
|---|---|---|---|---|---|
| Clade CC8 (including ST239 | CC8_B+ | tCC8_F | CGAGTCAGCTAGTGGTCCGTT | 88 | |
| tCC8_R | ATGCATAGCTCTTGCTAAAGTGTA | ||||
| tCC8-A_FB+ | FAM, BHQ-1plus | ACCTATACCTGA | |||
| non-tCC8-G_TB+ | TET, BHQ-1plus | CTATACCTGA | |||
| Inner CC8 clade (excluding ST239 | inCC8_B+ | inCC8_F | TGCCCATAACACATTTGACACTTT | 79 | |
| inCC8_R1 | TTCGGCCACAGCTAAACTCG | ||||
| inCC8_R2 | GTTCGGCTACAGCTAAACTTGC | ||||
| inCC8_FB+ | FAM, BHQ-1plus | ATCGGACC | |||
| non-inCC8_TB+ | TET, BHQ-1plus | TAATCGGACC | |||
| Clade CC8a (Archaic | CC8a_B+ | CC8a_F | CGCCAAATGACTCGCATTGT | 241 | |
| CC8a_R | GCATGTGCCTTTCCGAARTAAA | ||||
| CC8a-C_FB+ | FAM, BHQ-1plus | ATTACTGTAGCAGGG | |||
| nonCC8a-T_TB+ | TET, BHQ-1plus | CTGTAGCAGGG | |||
| Clade CC8b | CC8b_B+ | CC8b_F | GATGACGTGATAACTGTACGTSGAT | 240 | |
| CC8b_R | CGCGATTGAGGGTGAATATTGC | ||||
| CC8b-C_FB+ | FAM, BHQ-1plus | AAGCTAACAAAAT | |||
| nonCC8b-T_TB+ | Tet, BHQ-1plus | CAAAGCTAACAAAAT | |||
| Clade CC8c | NewIber_B+ | NewIber_F | GCGCAACAGGGAAGCAA | 118 | |
| NewIber_R | TGCGGATGTCCTATGTCTGAAAG | ||||
| NewIber-T_FB+ | FAM, BHQ-1plus | TGCACTTACATA | |||
| nonNewIber-C_TB+ | Tet, BHQ-1plus | CACTTACATA | |||
| Group CC8e | CC8e_B+ | CC8e_F | ACCTTATACRGAACATAGCAGACG | 106 | |
| CC8e_R | TCGATGCGCTTCTATCACTTC | ||||
| CC8e-C_FB+ | FAM, BHQ-1plus | TATTAGATGAAGGC | |||
| nonCC8e-T_TB+ | TET, BHQ-1plus | TTTATTAGATGAAGGC | |||
| Clade CC8f | CC8f_B+ | CC8f_F | CCTGAAGAAGAAGAGCGTTTAAGAA | 208 | |
| CC8f_R | RCATCCTACGATGGCCGAATC | ||||
| CC8f-T_FB+ | FAM, BHQ-1plus | TAAACGTCGTAAAG | |||
| nonCC8f-A_TB+ | TET, BHQ-1plus | ACGTAAACGTCGTAAAG | |||
| ST239 | ST239_B+ | ST239_F | CATGACCGCCACTATAACCAGA | 99 | |
| ST239_R | ATGCAACATTAGCAGGAGGATG | ||||
| ST239-C_FB+ | FAM, BHQ-1plus | TACGACTGAC | |||
| non239-T_TB+ | TET, BHQ-1plus | CGACTGAC |
FAM, 6-carboxyfluorescein; BHQ, black hole quencher; TET, tetrachlorofluorescein.
Nucleotides in boldface in each probe sequence are the phylogenetically informative canonical SNP state targeted by the assay.
USA300-NAE isolates will test positive on this assay.