| Literature DB >> 28593194 |
David M Gordon1, Sarah Geyik1, Olivier Clermont2, Claire L O'Brien3,4, Shiwei Huang3, Charmalie Abayasekara1,5, Ashwin Rajesh1, Karina Kennedy3,6, Peter Collignon3,6,7, Paul Pavli3,4, Christophe Rodriguez8, Brian D Johnston9, James R Johnson9, Jean-Winoc Decousser10, Erick Denamur2,11.
Abstract
The Escherichia coli lineage known as clonal complex 95 (CC95) is a cosmopolitan human-associated lineage responsible for a significant fraction of extraintestinal infections of humans. Whole-genome sequence data of 200 CC95 strains from various origins enabled determination of the CC95 pangenome. The pangenome analysis revealed that strains of the complex could be assigned to one of five subgroups that vary in their serotype, extraintestinal virulence, virulence gene content, and antibiotic resistance gene profile. A total of 511 CC95 strains isolated from humans living in France, Australia, and the United States were screened for their subgroup membership using a PCR-based method. The CC95 subgroups are nonrandomly distributed with respect to their geographic origin. The relative frequency of the subgroups was shown to change through time, although the nature of the changes varies with continent. Strains of the subgroups are also nonrandomly distributed with respect to source of isolation (blood, urine, or feces) and host sex. Collectively, the evidence indicates that although strains belonging to CC95 may be cosmopolitan, human movement patterns have been insufficient to homogenize the distribution of the CC95 subgroups. Rather, the manner in which CC95 strains evolve appears to vary both spatially and temporally. Although CC95 strains appeared globally as pandemic, fine-scale structure analysis shows epidemic patterns of the CC95 subgroups. Furthermore, the observation that the relative frequency of CC95 subgroups at a single locality has changed over time indicates that the relative fitness of the subgroups has changed. IMPORTANCEEscherichia coli clonal complex 95 represents a cosmopolitan, genetically diverse lineage, and the extensive substructure observed in this lineage is epidemiologically and clinically relevant. The frequency with which CC95 strains are responsible for extraintestinal infection appears to have been stable over the past 15 years. However, the different subgroups identified within this lineage have an epidemic structure depending on the host, sample, continent, and time. Thus, the evolution and spread of strains belonging to CC95 are very different from those of another cosmopolitan human-associated clonal complex, CC131, which has increased significantly in frequency as a cause of extraintestinal infection over the past 15 years due to the evolution and spread of two very closely related, nearly monomorphic lineages.Entities:
Keywords: Escherichia coli; clonal complex 95; epidemiology; population genetics
Year: 2017 PMID: 28593194 PMCID: PMC5451516 DOI: 10.1128/mSphere.00168-17
Source DB: PubMed Journal: mSphere ISSN: 2379-5042 Impact factor: 4.389
FIG 1 Relative frequency of phylogroups, CC95 isolates, and CC95 subgroups among E. coli strains recovered from feces, urine, or blood samples taken from people living in Canberra in 2002 and 2014. (A) Relative frequency of phylogroups among E. coli isolates; (B) relative frequency of CC95 strains among B2 strains; (C) Relative frequency of CC95 subgroups among CC95 strains.
FIG 2 Distribution of the genes in the pangenome of E. coli CC95. The number of genes found in 1 to 200 strains, respectively, is shown. All genomes were annotated using Prokka, and the pangenome was determined using ROARY.
FIG 3 Substructure of 200 E. coli CC95 strains. Group B2 strain ED1a was used as an outgroup (not shown). The single nucleotide polymorphisms (SNPs) were detected using the Harvest Suite (43) of tools and ED1a as the reference strain. Gubbins (53) was used to infer recombination events, and recombinant sites were removed. A maximum likelihood tree was inferred with a general time-reversible (GTR) model of evolution using MEGA 6.0 (54).
FIG 4 Similarity among E. coli CC95 isolates based on the variable genome. Genes present in all genomes or in a single strain were eliminated. Among-strain similarity was quantified using a Jaccard metric, and the dimensions of the matrix were reduced using principal-coordinate (PCO) analysis as implemented in PAST. Turquoise denotes subgroup A strains, purple B strains, orange C strains, pink D strains, green E strains, and black unassigned strains.
Distribution of putative virulence factors among strains belonging to the five CC95 subgroups
| Gene | % in CC95 subgroup: | |||||
|---|---|---|---|---|---|---|
| A ( | B ( | C ( | D ( | E ( | ||
| 100 | 100 | 4 | 0 | 44 | <0.001 | |
| 0 | 58 | 0 | 0 | 0 | <0.001 | |
| Colicin E1 gene | 0 | 0 | 81 | 8 | 44 | <0.001 |
| Colicin Ia gene | 24 | 23 | 83 | 68 | 52 | <0.001 |
| 24 | 39 | 87 | 74 | 36 | <0.001 | |
| 0 | 58 | 0 | 13 | 0 | <0.001 | |
| 0 | 100 | 2 | 0 | 0 | <0.001 | |
| 90 | 13 | 100 | 95 | 100 | <0.001 | |
| 24 | 94 | 87 | 87 | 60 | <0.001 | |
| 24 | 35 | 90 | 86 | 84 | <0.001 | |
| 25 | 35 | 90 | 84 | 84 | <0.001 | |
| Microcin V gene | 24 | 29 | 81 | 74 | 52 | <0.001 |
| 90 | 55 | 100 | 95 | 96 | <0.001 | |
| 90 | 0 | 0 | 3 | 0 | <0.001 | |
| 92 | 84 | 94 | 87 | 88 | 0.607 | |
| Antigen 43 gene | 94 | 87 | 4 | 95 | 4 | <0.001 |
| 0 | 58 | 0 | 0 | 0 | <0.001 | |
| Microcin H47 gene | 22 | 29 | 71 | 48 | 70 | <0.001 |
| 0 | 74 | 2 | 0 | 0 | <0.001 | |
| 0 | 29 | 2 | 5 | 4 | <0.001 | |
| 69 | 42 | 2 | 0 | 4 | <0.001 | |
Contingency table χ2 testing if gene is nonuniformly distributed among CC95 subgroups.
Frequency of plasmid-borne antibiotic resistance determinants among CC95 isolates from humans living in Canberra, Australia, with respect to their subgroup membership and serotype
| Variant | % of subgroup isolates with determinant | ||||||
|---|---|---|---|---|---|---|---|
| A, O1:H7 ( | A, O2:H7 ( | B, O18:H7 ( | C, O1:H7 ( | C, O25b:H4 ( | E, O2:H4 ( | ||
| 0 | 57.9 | 0 | 0 | 50.0 | 0 | <0.001 | |
| 9.1 | 52.6 | 41.7 | 5.6 | 62.5 | 46.7 | 0.005 | |
| 27.3 | 21.1 | 8.3 | 5.6 | 12.5 | 6.7 | 0.48 | |
| 9.1 | 21.1 | 0 | 16.7 | 12.5 | 26.7 | 0.29 | |
| 0 | 0 | 0 | 11.1 | 0 | 26.7 | 0.03 | |
| 0 | 63.2 | 0 | 16.7 | 25.0 | 20.0 | <0.001 | |
| 9.1 | 21.1 | 0 | 16.7 | 12.5 | 26.7 | 0.49 | |
| 0 | 0 | 0 | 88.9 | 25.0 | 0 | <0.001 | |
Contingency table χ2 testing if gene is nonuniformly distributed among CC95 subgroups.
FIG 5 E. coli CC95 subgrouping multiplex PCR. The svg band corresponds to the CC95 control band (positive on all strains), whereas the HP H062v1_5555168, HP Eh660v1_350050, HP Er527v1_680030, yjgJ, and HP Eh252v1_100119 bands correspond to the amplification of subgroup E-, A-, B-, D-, and C-specific genes, respectively. UA, unassignable. Lanes: 1, H252 (O1:H7); 2, 004-008 (O25b:H4); 3, B5H8 (O1:H7); 4, B1I8 (O2a:H4) (note the amplification of the subgroup C-specific gene in addition to the subgroup D-specific gene); 5, S88 (O45a:H7); 6, 034-018 (O1:H7); 7, UTI89 (O18:H7); 8, 003-007 (O18:H7); 9, H660 (O1:H7); 10, B6H13 (O2a:H7); 11, 0032-001 (O2a:H4); 12, H062 (O2a:H4); 13, H151 (O1:H1); 14, NILS21 (O2b:H5). M, molecular size marker (1-kb Plus DNA ladder; Invitrogen).
Assessment of the ability of the subgroup CC95-specific genes together with serotype to correctly assign CC95 strains to their correct subgroup
| Phylogeny assignment | Multiplex PCR assignment(s) | PCR serotype | No. of isolates |
|---|---|---|---|
| A | O2a:H7 | 5 | |
| A | A | O1:H- | 3 |
| A | A | O1:H7 | 9 |
| A | A | O2a:H7 | 16 |
| B | B | O18:H7 | 19 |
| C | O1:H7 | 1 | |
| C | C | O1:H7 | 32 |
| C | C | O25b:H4 | 9 |
| D | D | O1:H7 | 6 |
| D | D | O2a:H4 | 2 |
| D | D | O45:H7 | 2 |
| D | D and C | O1:H7 | 3 |
| D | D and C | O2a:H4 | 1 |
| E | E | O2a:H4 | 16 |
| Other | O1:H1 | 3 | |
| Other | C | O2a:H7 | 1 |
| Other | A | O120:H5 | 1 |
| Other | A | O2b:H5 | 2 |
Only those strains sequenced as part of this study have been included.
Relative abundance of CC95 subgroups isolated from the same source in different countries
| Source | Country | % of isolates of subgroup: | ||||||
|---|---|---|---|---|---|---|---|---|
| A | B | C | D | E | ||||
| Urine | Australia | 41 | 41 | 14 | 30 | 0 | 11 | <0.001 |
| United States | 99 | 66 | 29 | 0 | 2 | 3 | ||
| Blood | Australia | 23 | 33 | 11 | 33 | 0 | 14 | <0.001 |
| France | 173 | 4 | 10 | 3 | 74 | 9 | ||
| Feces | Australia | 27 | 64 | 11 | 14 | 4 | 7 | <0.001 |
| France | 34 | 6 | 26 | 12 | 47 | 9 | ||
Contingency table χ2 testing if CC95 subgroups are nonuniformly distributed with respect to source.
Frequency of specific virulence traits in CC95 subgroup B isolates from different countries
| Virulence trait | % of isolates positive from: | |||
|---|---|---|---|---|
| Australia ( | France ( | United States ( | ||
| 67 | 4 | 71 | <0.001 | |
| 67 | 100 | 95 | 0.005 | |
| 33 | 80 | 5 | <0.001 | |
| 58 | 8 | 95 | <0.001 | |
Contingency table χ2 testing if gene is nonuniformly distributed among CC95 subgroups.