| Literature DB >> 27150362 |
David M Aanensen1, Edward J Feil2, Matthew T G Holden3, Janina Dordel4, Corin A Yeats1, Artemij Fedosejev1, Richard Goater5, Santiago Castillo-Ramírez6, Jukka Corander7, Caroline Colijn8, Monika A Chlebowicz9, Leo Schouls10, Max Heck10, Gerlinde Pluister10, Raymond Ruimy11, Gunnar Kahlmeter12, Jenny Åhman12, Erika Matuschek12, Alexander W Friedrich9, Julian Parkhill13, Stephen D Bentley13, Brian G Spratt14, Hajo Grundmann15.
Abstract
UNLABELLED: The implementation of routine whole-genome sequencing (WGS) promises to transform our ability to monitor the emergence and spread of bacterial pathogens. Here we combined WGS data from 308 invasive Staphylococcus aureus isolates corresponding to a pan-European population snapshot, with epidemiological and resistance data. Geospatial visualization of the data is made possible by a generic software tool designed for public health purposes that is available at the project URL (http://www.microreact.org/project/EkUvg9uY?tt=rc). Our analysis demonstrates that high-risk clones can be identified on the basis of population level properties such as clonal relatedness, abundance, and spatial structuring and by inferring virulence and resistance properties on the basis of gene content. We also show that in silico predictions of antibiotic resistance profiles are at least as reliable as phenotypic testing. We argue that this work provides a comprehensive road map illustrating the three vital components for future molecular epidemiological surveillance: (i) large-scale structured surveys, (ii) WGS, and (iii) community-oriented database infrastructure and analysis tools. IMPORTANCE: The spread of antibiotic-resistant bacteria is a public health emergency of global concern, threatening medical intervention at every level of health care delivery. Several recent studies have demonstrated the promise of routine whole-genome sequencing (WGS) of bacterial pathogens for epidemiological surveillance, outbreak detection, and infection control. However, as this technology becomes more widely adopted, the key challenges of generating representative national and international data sets and the development of bioinformatic tools to manage and interpret the data become increasingly pertinent. This study provides a road map for the integration of WGS data into routine pathogen surveillance. We emphasize the importance of large-scale routine surveys to provide the population context for more targeted or localized investigation and the development of open-access bioinformatic tools to provide the means to combine and compare independently generated data with publicly available data sets.Entities:
Mesh:
Year: 2016 PMID: 27150362 PMCID: PMC4959656 DOI: 10.1128/mBio.00444-16
Source DB: PubMed Journal: mBio Impact factor: 7.867
Abundance, diversity, and proportion of MRSA isolates in each major or minor CC detected in the sample
| Group and CC | Total no. of genomes | No. of reference genomes | Proportion of MRSA genomes | Mean no. of PW SNPs (SE) | Mean yr of PW MRCA (range) | Example clone(s) |
|---|---|---|---|---|---|---|
| Major | ||||||
| CC5 | 78 | 8 | 0.8 | 438 (8.2) | 1951 (1950–1952) | USA100 New York/Japan USA800, pediatric |
| CC22 | 41 | 1 | 0.775 | 266 (6.6) | 1972 (1972–1973) | EMRSA-15, Barnim |
| CC45 | 39 | 0 | 0.231 | 571 (9.4) | 1935 (1933–1936) | USA600, Berlin |
| CC8 | 33 | 5 | 0.642 | 456 (9.1) | 1949 (1948–1950) | Iberian, USA300, USA500, archaic, Central European |
| CC30 | 34 | 2 | 0.065 | 481 (5.8) | 1946 (1945–1947) | EMRSA-16 (ST36), phage type 80/81, SWP, USA200 |
| CC15 | 24 | 0 | 0 | 258 (4.4) | 1974 (1973–1974) | |
| Minor | ||||||
| CC1 | 14 | 2 | 0 | 415 (9.3) | 1954 (1953–1955) | USA400 |
| ST20 | 7 | 0 | 0 | 369 (10.4) | 1960 (1959–1961) | |
| ST25 | 7 | 0 | 0 | 307 (9.5) | 1968 (1966–1969) | |
| ST7 | 6 | 0 | 0 | 159 (6.3) | 1986 (1985–1987) | |
| CC121 | 5 | 0 | 0 | 737 (17.7) | 1913 (1912–2016) | |
| CC88 | 5 | 0 | 0 | 356 (12) | 1961 (1960–1963) | |
| CC12 | 4 | 0 | 0 | 365 (11.9) | 1960 (1959–1962) | |
| CC398 | 4 | 1 | 0 | 326 (11.7) | 1965 (1964–1967) | |
| ST101 | 4 | 0 | 0 | 240 (10.8) | 1976 (1975–1978) | |
| ST72 | 4 | 0 | 0.25 | 275 (10.3) | 1971 (1970–1973) |
Including reference genomes.
Excluding reference genomes.
Mean number of SNPs in all possible pairwise (PW) combinations of genomes. Standard errors were estimated by bootstrapping (as implemented in MEGA v6.0). The standard error is the spread of pairwise values and reflects the degree of substructuring within each CC. A high standard error indicates that some pairs of genomes are closely related and others are more distant (that is, subclusters are apparent within the CC), whereas a low standard error indicates that all pairs of genomes show similar levels of divergence from each other and the phylogeny of the isolates within the complex is starlike. This analysis therefore indicates that, of the major CCs, CC15 exhibits the lowest degree of subclustering, whereas CC45 and CC8 exhibit the highest.
Mean estimated date of the most recent common ancestor (MRCA) of all possible pairwise combinations of genomes. This is based on a mutation rate of 1.3 × 10−6 per site per year (or four SNPs per genome per year). This rate was proposed for ST22 by Holden et al. (8), and similar rates have been calculated for several other lineages (ST30, ST225, ST8-USA300) (6, 7, 9). We note that the mutation rate estimate for ST239 is approximately twice as high (3 × 10−6 per site per year) (12), for reasons that are unclear. The calculation is as follows. Approximating the genome size of S. aureus to 3 Mb, this rate equates to approximately four SNPs per genome per year. If a pair of genomes differs by, say, 500 SNPs, meaning 250 SNP changes, on average, in each of the two genomes, this would therefore correspond to 250/4 = 62.5 years of divergence. As the samples were collected in 2006, this means that the common ancestor of the two genomes would have existed around 1943.
Excluding isolate 11_SE_395, as this is outside the main CC45 cluster.
We have excluded the diverse ST239 genomes corresponding to the Portuguese, Brazilian, and EMRSA-4-7 clones, as this is a hybrid genome.
Excluding ST34, as this is a hybrid genome.
Excluding isolate 296_DE_582 (ST582), as this is a hybrid genome (see text).
FIG 1 Phylogenetic relationship of the invasive S. aureus population circulating in Europe in 2006. A rooted neighbor-joining tree based on 235,226 genomewide core SNPs is shown. Lineages are highlighted and named according to the corresponding CC or ST.
FIG 2 Phylogeny decoration. Colors of branches indicate MSSA (green) and MRSA (red) states. Each isolate is annotated by affiliation with a CC or ST (A), SSCmec type and MSSA (green) or MRSA (red) state (B), or antibiotic resistance profile (C). Red boxes indicate the presence of genetic resistance markers, black dots indicate phenotypic resistance, and gray boxes highlight resistance in reference genomes. (D) Size and composition of the accessory genome based on the number of noncore homologous groups with further categorization according to MGE type. (E) Close-up of phylogenetic trees of the six major lineages.
FIG 3 Phylogenetic reconstruction of CC5. Branch color indicates MSSA (green) or MRSA (red). Clusters described in Results are shaded gray. Symbols at the tips indicate the geographic origins of these isolates. SCCmec IV subtypes are shown for ST125.
FIG 4 Phylogenetic reconstruction of CC22. Branch color indicates MSSA (green) or MRSA (red). The EMRSA-15 cluster is shaded gray. Symbols at the tips of the branches indicate the geographic origins of these isolates. A cluster consisting of isolates from Berlin indicating the possible point of EMRSA-15 introduction into Germany from the United Kingdom is shaded a darker gray. The position of an isolate from Lisbon is shown indicating the possible location of its entry into Portugal.
FIG 5 (A) Phylogenetic reconstruction of CC30 isolates in the sample. Branch color indicates MSSA (green) or MRSA (red). Reference genomes are named and clusters are highlighted and named in accordance with the report of McAdam et al. (7). The dashed line indicates the long branch leading to three ST34 isolates that evolved through the acquisition of a 200-kb homologous replacement within the chromosome from an ST10/ST145-like parent (66). (B) Phylogenetic tree of combined CC30 data obtained from isolates from the study of McAdam et al. (7) and isolates from panel A. Colors and cluster names are as in panel A. Light gray shading indicates isolates carrying SNPs in the hla and agrC genes thought to restrict these lineages to health care settings. (C) Representation of successive clonal radiations within the recent evolutionary history of CC30. These radiations correspond to recognized HRCs, both contemporary and historic. The SWP clone is a historic diversified starlike expansion with relatively long branches. Phage type 80/81 probably emerged from within this starlike expansion, as did the current MSSA HRC, which we have termed EMSSA-ST30. Finally, EMRSA-16 emerged from within the successful EMSSA-ST30 lineage, resulting in a more recent, and tightly clustered, starlike expansion.
FIG 6 Conservation of ncHGs across the invasive S. aureus population in Europe. Isolates are arranged in the tree order of Fig. 2 along the left and top. The ncHGs in the pairwise comparison are displayed as a heat map matrix, and colors represent the total numbers of ncHGs that genome pairs have in common; darker shading indicates more ncHGs in common (see the scale at the bottom). The dark squares corresponding to (from the top) CC45, CC30, CC22, CC8, CC5, and (at the bottom) CC15.
Comparison of antibiotic resistances predicted by in silico and SRL test results against the EDL reference
| Antibiotic | No. of | % Concordance | No. of SRL vs EDL results | % Concordance | No. of | % Concordance | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Total | Traits | False positive | False negative | Discordant | Total | Discordant | Total | Discordant | ||||
| Penicillin | 308 | 269 | 4 | 3 | 7 | 97.73 | 131 | 7 | 94.66 | 308 | 7 | 97.73 |
| Cefoxitin | 308 | 123 | 3 | 1 | 4 | 98.70 | 216 | 3 | 98.61 | 308 | 4 | 98.70 |
| Ciprofloxacin | 308 | 122 | 2 | 3 | 5 | 98.38 | 219 | 4 | 98.17 | 308 | 5 | 98.38 |
| Moxifloxacin | 308 | 118 | 2 | 0 | 2 | 99.35 | ||||||
| Amikacin | 308 | 71 | 21 | 2 | 23 | 92.53 | ||||||
| Gentamicin | 308 | 29 | 0 | 0 | 0 | 100.00 | 243 | 1 | 99.59 | 308 | 0 | 100.00 |
| Tobramycin | 308 | 77 | 7 | 0 | 7 | 97.73 | 79 | 1 | 98.73 | 308 | 7 | 97.73 |
| Erythromycin | 308 | 105 | 5 | 3 | 8 | 97.40 | 260 | 8 | 96.92 | 308 | 8 | 97.40 |
| Clindamycin | 308 | 95 | 3 | 2 | 5 | 98.38 | 172 | 10 | 94.19 | 308 | 5 | 98.38 |
| Tetracycline | 308 | 21 | 1 | 0 | 1 | 99.68 | 133 | 1 | 99.25 | 308 | 1 | 99.68 |
| Tigecycline | 308 | 0 | 0 | 3 | 3 | 99.03 | ||||||
| Fusidic acid | 308 | 14 | 1 | 0 | 1 | 99.68 | 175 | 5 | 97.14 | 308 | 1 | 99.68 |
| Linezolid | 308 | 0 | 0 | 0 | 0 | 100.00 | 194 | 1 | 99.48 | 308 | 0 | 100.00 |
| Mupirocin | 308 | 9 | 0 | 5 | 5 | 98.38 | ||||||
| Rifampin | 308 | 12 | 1 | 0 | 1 | 99.68 | 225 | 4 | 98.22 | 308 | 1 | 99.68 |
| Trimethoprim | 308 | 10 | 0 | 0 | 0 | 100.00 | ||||||
| Teicoplanin | 120 | 0 | 0 | 3 | 3 | 97.50 | 87 | 3 | 96.55 | 120 | 3 | 97.50 |
| Vancomycin | 120 | 0 | 0 | 0 | 0 | 100.00 | 118 | 1 | 99.15 | 120 | 0 | 100.00 |
| Daptomycin | 120 | 0 | 0 | 0 | 0 | 100.00 | ||||||
| Total | 5,288 | 1,075 | 50 | 25 | 75 | 98.58 | 2,252 | 49 | 97.82 | 3,628 | 42 | 98.84 |
Only results for antibiotics tested by SRLs were compared.