| Literature DB >> 28537263 |
Amandine Perrin1,2,3, Elise Larsonneur1,2,4, Ainsley C Nicholson5, David J Edwards6,7, Kristin M Gundlach8, Anne M Whitney5, Christopher A Gulvik5, Melissa E Bell5, Olaya Rendueles1,2, Jean Cury1,2, Perrine Hugon1,2, Dominique Clermont9, Vincent Enouf10, Vladimir Loparev11, Phalasy Juieng11, Timothy Monson8, David Warshauer8, Lina I Elbadawi12,13, Maroya Spalding Walters14, Matthew B Crist14, Judith Noble-Wang14, Gwen Borlaug13, Eduardo P C Rocha1,2, Alexis Criscuolo3, Marie Touchon1,2, Jeffrey P Davis13, Kathryn E Holt6,7, John R McQuiston5, Sylvain Brisse1,2,15.
Abstract
An atypically large outbreak of Elizabethkingia anophelis infections occurred in Wisconsin. Here we show that it was caused by a single strain with thirteen characteristic genomic regions. Strikingly, the outbreak isolates show an accelerated evolutionary rate and an atypical mutational spectrum. Six phylogenetic sub-clusters with distinctive temporal and geographic dynamics are revealed, and their last common ancestor existed approximately one year before the first recognized human infection. Unlike other E. anophelis, the outbreak strain had a disrupted DNA repair mutY gene caused by insertion of an integrative and conjugative element. This genomic change probably contributed to the high evolutionary rate of the outbreak strain and may have increased its adaptability, as many mutations in protein-coding genes occurred during the outbreak. This unique discovery of an outbreak caused by a naturally occurring mutator bacterial pathogen provides a dramatic example of the potential impact of pathogen evolutionary dynamics on infectious disease epidemiology.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28537263 PMCID: PMC5458099 DOI: 10.1038/ncomms15483
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Figure 1Phylogenetic tree of the outbreak isolates.
Maximum likelihood phylogenetic tree inferred from 3,411,033 aligned nucleotide characters (1,137,011 codons) based on cgMLST data. The tree was rooted based on phylogenetic analyses using epidemiologically unrelated E. anophelis strains as an outgroup. Thick branches have bootstrap support>80% (200 replicates). The scale bar represents substitutions per site. Sub-clusters (sc) 1 to 6 are represented by coloured boxes. Counties A to L (and U for ‘unspecified', attributed to the strains from outside of Wisconsin) are represented by coloured circles (see key on the left). Sets of isolates gathered from the same patient are indicated with vertical black lines after the isolate codes. Median Bayesian estimates of the month and year are provided for major internal branches (with 95% HPDs in square brackets). The branching position of the mutS isolate CSID 3015183688, denoted by the dashed branch line, was defined based on a separate analysis (using the same methods) and its branch length was divided by 5 for practical reasons.
Figure 2Temporal-spatial distribution of cases by genetic sub-cluster.
Case counts (n=59, over the three-state area) are presented in two-week intervals, as indicated below the histogram bars, based on the date of initial positive culture. Genetic sub-cluster colours (see key) correspond to those in the phylogenetic figures. Geographic distribution of Wisconsin cases (n=56) is displayed, overall (insets) and by two-week intervals (lower panel). The numbers along the x and y axis of the maps are longitude and latitude, respectively. Letters inside counties correspond to letters on the lower left key on Fig. 1.
Figure 3Mutation spectrum of E. anophelis strains by clade.
Frequency of each observed substitution mutation, reconstructed from FastML analysis, is shown for different parts of the E. anophelis tree.
Figure 4Excision of ICEEa1 can lead to mutY WT in outbreak strains.
Here the insertion site is TTT̂TT. In both the ICE and the mutY, there are inverted repeats (IR, red arrows) separated by ∼5–6 nucleotides. Note that the chromosomal IRs are only partially conserved, as denoted by the interrupted arrows. (a) Upon insertion of the ICE at that site, this will create two heteroduplexes. (b) These will be resolved either by replication or by reparation. One of the two solutions to the heteroduplex resolution leads to the observed outbreak strain sequence. (c) If the ICE excises from the outbreak strain sequence, it will produce one heteroduplex. (d) The resolution of the heteroduplex left after excision of the ICE will lead to the mutY wild-type (WT) gene in one of the two scenarii.
Figure 5Circular representation of gene content variation between the outbreak strain and 30 other E. anophelis genomes.
Circles, from 1 (innermost circle) to 8 (outermost circle), correspond to: Circle 1: scale of the reference genome CSID 3015183678. Circle 2: GC skew (positive GC skew, green; negative GC skew, violet). Circle 3: G+C content (above average, external peaks; below average, internal peaks). Circle 4: non-coding genes (rRNA, tRNA, tmRNA); their positions are also reported in circles 5 and 6. Circle 5: frequency of CSID 3015183678 protein-encoding DNA sequences (CDSs) among the 69 outbreak isolates genomes; note the high conservation, except for a 77 kbp deletion near position 3.8 Mbp. Circle 6: frequency of CSID 3015183678 genes in all other E. anophelis genomes, revealing genomic regions containing CDSs with low frequency in the species as a whole. Circle 7: core genes in all 99 E. anophelis strains. Circle 8: remarkable genomic regions of the outbreak isolates; specific regions are marked in red, deletion in olive. Functional information about CDSs comprised in these regions is given in Table 1. The figure was obtained using BLAST Ring Image Generator (BRIG)73. For more details, see.
Genomic features associated with the Wisconsin outbreak isolates*.
| Region 1 | 3,926,747 | 10,253 | 10,564 | 11 | Type I restriction/modification system; DNA-invertase |
| Region 2 | 292,287 | 354,501 | 62,215 | 62 | ICE |
| Region 3 | 599,595 | 606,529 | 6,935 | 5 | Tetratricopeptide repeat protein |
| Region 4 | 1,200,465 | 1,219,016 | 18,552 | 13 | CTP pyrophosphohydrolase |
| Region 5 | 214,2546 | 216,0415 | 17,870 | 17 | Putative polysaccharide synthesis clusters (capsule and LPS) |
| Region 6 | 217,9815 | 219,3156 | 13,342 | 13 | Putative polysaccharide synthesis clusters (capsule and LPS) |
| Region 7 | 2,367,659 | 2,378,760 | 11,102 | 8 | Putative deoxyribonuclease RhsC |
| Region 8 | 2,705,573 | 2,710,635 | 5,063 | 5 | Glycosyl hydrolase, beta-glycosidase and beta-glucosidase |
| Region 9 | 2,898,750 | 2,904,987 | 6,238 | 5 | Xylulose kinase, xylose isomerase, sodium/glucose co-transporter |
| Region 10 | 3,097,180 | 3,118,179 | 21,000 | 21 | Transposase; FAD-dependent urate hydroxylase (flavoprotein involved in urate degradation to allantoin) |
| Region 11 | 3,477,609 | 3,483,251 | 5,643 | 7 | Hypothetical proteins |
| Region 12 | 3,506,671 | 3,521,185 | 14,515 | 10 | Starch-binding outer membrane protein; Ferrienterobactin receptor precursor; Susd/RagB outer membrane lipoprotein; Nisin biosynthesis protein NisC; Putative lantibiotic biosynthesis protein |
| Region 13 | 3,727,334 | 3,744,334 | 17,001 | 15 | Transposase, IS200-like |
| Deletion | 3,779,205 | 3,856,342 | 77,138 | 75 | Multidrug resistance protein MdtE and efflux pump membrane transporter BepE; HopJ type III effector protein (found in plant pathogens); ABC-2 family transporter protein; Cytochrome c551 peroxidase precursor; H(+)/Cl(−) exchange transporter ClcA; Sulfite exporter TauE/SafE; Bicarbonate transporter BicA; Vitamin B12 transporter BtuB precursor; Putative transporter YycB; beta-lactamase |
CTP, cytidine triphosphate; FAD, flavin adenine dinucleotide; LPS, lipopolysaccharide.
Positions refer to the genome of reference strain CSID 3015183678.
*Present in at least 90% of outbreak genomes and in <20% of the other E. anophelis.
†Absent in four E. anophelis outbreak genomes (patient 30).