Violette Da Cunha, Mark R Davies, Pierre-Emmanuel Douarre, Isabelle Rosinski-Chupin, Immaculada Margarit, Sebastien Spinali, Tim Perkins, Pierre Lechat, Nicolas Dmytruk, Elisabeth Sauvage, Laurence Ma, Benedetta Romi, Magali Tichit, Maria-José Lopez-Sanchez, Stéphane Descorps-Declere, Erika Souche, Carmen Buchrieser, Patrick Trieu-Cuot, Ivan Moszer, Dominique Clermont, Domenico Maione, Christiane Bouchier, David J McMillan, Julian Parkhill, John L Telford, Gordan Dougan, Mark J Walker, Matthew T G Holden, Claire Poyart, Philippe Glaser.
Abstract
Streptococcus agalactiae (Group B Streptococcus, GBS) is a commensal of the digestive and genitourinary tracts of humans that emerged as the leading cause of bacterial neonatal infections in Europe and North America during the 1960s. Due to the lack of epidemiological and genomic data, the reasons for this emergence are unknown. Here we show by comparative genome analysis and phylogenetic reconstruction of 229 isolates that the rise of human GBS infections corresponds to the selection and worldwide dissemination of only a few clones. The parallel expansion of the clones is preceded by the insertion of integrative and conjugative elements conferring tetracycline resistance (TcR). Thus, we propose that the use of tetracycline from 1948 onwards led, in humans, to the complete replacement of a diverse GBS population by only a few TcR clones particularly well adapted to their host, causing the observed emergence of GBS diseases in neonates.
Year: 2014 PMID: 25088811 PMCID: PMC4538795 DOI: 10.1038/ncomms5544
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Summary of global multi-locus sequence typing studies*.
| Study | Origin | Clinical origin | Number of isolates | CC1 (ST1) | CC10 | CC17 | CC19 (ST19, ST28) | CC23 | CC26 | Other |
|---|---|---|---|---|---|---|---|---|---|---|
| Jones | World | Ca, Ninv, Ainv | 152 | 16% (14%) | 18% | 30% | 17% (13%–2%) | 12% | 2% | 5% |
| Luan | Sweden (1988–1997) | Ninv, Ainv | 158 | 15% (9%) | 13% | 24% | 29% (16%–1%) | 14% | 0 | 4% |
| Manning | Canada, Alberta (1993–2002) | Ca, Ninv | 413 | 23% (19%) | 14% | 16% | 20% (19%–NA) | 22% | NA | 5% |
| Bohnsack | USA (1995–1999) | Ca, Ninv | 899 | 16% (15%) | 9% | 13% | 17% (12%–2%) | 40% | 0 | 5% |
| Sadowi | Poland (1996–2005) | Ca, Ninv, Ainv | 114 | 17% (13%) | 18% | 14% | 12% (9%–2%) | 37% | 1% | 1% |
| Huber | Kenya (2007–2010) | Ca, Ainv | 169 | 12% (9%) | 17% | 21% | 14% (5%–4%) | 27% | 2% | 7% |
| Brochet | Dakar, Bangui (2006–2007) | Ca | 163 | 20% (9%) | 6% | 12% | 28% (4%–15%) | 17% | 15% | 2% |
NA, not available.
Values are percentages of isolates in each clonal complex.
Study: first author of the publication and reference.
Origin: years of isolation, if known, are given in parentheses.
Clinical origin: Ca, carriage; Ninv, neonatal invasive disease; Ainv, adult invasive disease.
CC1 and CC19: the percentages in parentheses correspond to the indicated sequence types (STs).
CC10: corresponds to strains from the closely related clonal complexes (CC) CC6, CC8 and CC10.
Figure 1. Population structure of human GBS is driven by tetracycline resistance acquisition
(a) Whole-genome-based phylogeny of 229 sequenced GBS isolates and strain SS1219 isolated from fish[48]. Phylogenetic relationships were inferred by Maximum Likelihood (ML) using MEGA. The major clonal complexes (CC) 1, 10, 17, 19, 23 and 26, as defined on the GBS MLST web site (http://pubmlst.org/sagalactiae/), correspond to well-defined branches. Isolates are indicated by dots coloured according to their geographical origin. Flanking the whole-genome phylogeny are four Bayesian maximum clade credibility phylogenies (b–e) based on the non-recombinogenic genome for the GBS CC17 (b), CC23 (c), CC19 (d) and CC1 (e) strains. Divergence dates (median estimates with 95% highest posterior density dates in brackets) are provided in blue for the major nodes. Coloured branches relate to the major tetracycline-resistant clones. Arrows indicate the predicted time of insertion of the ICE carrying the tet(M) resistance determinant within the major clones. Capsular serotypes are indicated on the right of each tree according to the indicated colour code.
Figure 2. Distribution of SNPs and recombination across all GBS isolates from the six major CCs
The maps were generated using the SyntView software. Isolates are ordered by distance from the reference genome, which is depicted as the inner circle. CC numbers are indicated in the centre. Each circle corresponds to one strain, with SNPs indicated by short lines; regions with a higher density of SNPs correspond to regions that have recombined relative to the reference genome. Around the outer circle are the relative positions of selected antigenic loci. The reference genomes were BG-NI-011 for CC1, DK-NI-008 for CC10, COH1 for CC17, RBH11 for CC19, CCH210801006 for CC23 and Bangui-IP-105 for CC26.
Diversity within the six clonal complexes.
| | CC1 | CC10 | CC17 | CC19 | CC23 | CC26 |
|---|---|---|---|---|---|---|
| Strain name | BG-NI-011 | DK-NI-008 | COH1 | RBH11 | CCH210801006 | Bangui-IP-105 |
| Genome size | 2,078 (25) | 2,080 (34) | 2,065 (1) | 2,180 (34) | 2,055 (17) | 2,054 (45) |
| No. of isolates | 39 | 18 | 79 | 39 | 36 | 6 |
| Interrogated regions | 914 kb | 1,069 kb | 1,860 kb | 1,427 kb | 980 kb | 1,582 kb |
| Polymorphic positions | 1,244 | 971 | 3,922 | 2,016 | 1,329 | 263 |
| Recombination | 987 kb | 914 kb | 68 kb | 532 kb | 875 kb | 398 kb |
| Depth | 174±23 | 126±7 | 129±16 | 97±21 | 169±16 | 74±34 |
| Mutation rate | 0.64 | — | 0.56 | 0.93 | 0.75 | — |
Genome size: total contig size in kb; the number of contigs is given in parentheses.
Interrogated regions: cumulative size in kb of regions not predicted to have recombined.
Recombination: cumulative size in kb of the recombined regions and the exchanged antigenic loci.
Depth: expressed as the average number of single-nucleotide polymorphisms (SNPs) per Mb from the root to the tips of the tree for strains isolated after 2005.
Mutation rate: estimated by dividing the depth of each lineage by its age predicted by the BEAST analysis, in SNPs per Mb per year (Fig. 1b–e).
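The mutation-rate footnote describes a simple ratio: lineage depth divided by lineage age. A minimal Python sketch of that arithmetic follows. The function names are hypothetical, and the "implied age" is only a back-calculation from the rounded table values, not a figure reported in the source.

```python
def mutation_rate(depth_snps_per_mb: float, age_years: float) -> float:
    """Rate in SNPs per Mb per year: root-to-tip depth divided by lineage age."""
    if age_years <= 0:
        raise ValueError("age_years must be positive")
    return depth_snps_per_mb / age_years


def implied_age(depth_snps_per_mb: float, rate: float) -> float:
    """Invert the same relation: age in years = depth / rate."""
    if rate <= 0:
        raise ValueError("rate must be positive")
    return depth_snps_per_mb / rate


# Depth and rate for CC1 as listed in the table (174 SNPs per Mb, 0.64).
# The result is approximate because both inputs are rounded.
cc1_age = implied_age(174, 0.64)
print(f"CC1 implied lineage age: about {cc1_age:.0f} years")
```

Because the tabulated depth and rate are rounded, an age recovered this way is only indicative; the dated nodes in Fig. 1b–e remain the authoritative estimates.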
Figure 3. Phylogeny of the ‘hypervirulent’ CC17 lineage
(a) ML phylogeny based on the alignment of 3,922 polymorphic positions. Six independent ICE insertions (indicated on the right and by blue arrows), corresponding to six different lineages (indicated by different colours), were identified and are numbered from 11 to 16 (Table 4). A star indicates that Tn5801 has been lost by the isolate. Following the loss of Tn5801, strains CCH210160764 and CCH207800974 acquired unrelated ICEs expressing tet(M) and erm(B), and tet(O) and erm(B), respectively. Nodes with >90% bootstrap support are indicated by black dots. (b) Genetic maps and alignment of Tn916 and Tn5801. Comparisons were performed by BLASTn. The tet(M) gene is coloured in yellow, genes encoding type 4 secretion system components are in blue, and the integrase and excisionase genes, which are not conserved between the two transposons, are in red. Percentage identities are shown on a blue scale and range between 68 and 98% for the tet(M) region.
Figure 4. Phylogeny of clonal complex CC1
(a) ML phylogeny from the alignment of pseudosequences of the 1,244 polymorphic positions in 914 kb of interrogated sequence. The five independent Tn916 or Tn5801 insertions are indicated in blue, and numbers from 1 to 5 refer to their description in Table 4. The two TcR lineages with more than one isolate are coloured in blue and red. Three sub-lineages have acquired an erm resistance gene. Within lineage Tn916-1, 40% of the isolates (12) carry Tn3872 (dark-blue branch and strain Bangui-IP-30). The four observed serotypes (cps) II, IV, V and VI are indicated in violet. A star indicates that Tn916 has been lost by the isolate. Antibiotic resistance genes other than tet(M) are indicated in red. Nodes with >90% bootstrap support are indicated by black dots. (b) Genetic map of Tn3872. Tn917, which carries the erm(B) gene, is in grey, with the erm(B) gene itself in orange; Tn916 genes are coloured as in Fig. 3. Of the 15 SNPs between the Tn916 of strain CZ-NI-006 and the Tn3872 of strain DE-NI-001, 13 lie between positions 10,407 and 13,659 (Tn916 coordinates) and are indicated by black bars. The 76 SNPs between strain Bangui-IP-50 and strain CZ-NI-006, all located between positions 10,407 and 13,497, are in blue.
Figure 5. Correlation of isolation date with maximum likelihood root-to-tip branch length for the five major TcR lineages calculated with Path-O-Gen
These analyses predict origins for these clones that agree with the BEAST analysis, except for the CC23 lineage, where temporal sampling was insufficient to support tree root estimates. X axis, time in years; Y axis, root-to-tip branch length in SNPs per Mb. (a) CC1 lineage Tn916-1; (b) CC19 lineage Tn916-17; (c) CC17 lineage Tn5801-11; (d) CC17 lineage Tn916-12; (e) CC23 lineage Tn5801-23.
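The correlation plotted in this figure can be illustrated with an ordinary least-squares fit of root-to-tip branch length against isolation date: the slope estimates the substitution rate and the x-intercept estimates the date of the lineage root. The sketch below is an illustration only, not Path-O-Gen's implementation; the function name and the data points are hypothetical.

```python
from typing import Sequence, Tuple


def root_to_tip_regression(
    dates: Sequence[float], distances: Sequence[float]
) -> Tuple[float, float, float]:
    """Least-squares fit of root-to-tip distance against isolation date.

    Returns (slope, intercept, root_date): slope is the rate in distance
    units per year and root_date is the x-intercept, where the fitted
    distance falls to zero.
    """
    n = len(dates)
    if n != len(distances) or n < 2:
        raise ValueError("need at least two paired observations")
    mean_x = sum(dates) / n
    mean_y = sum(distances) / n
    sxx = sum((x - mean_x) ** 2 for x in dates)
    if sxx == 0:
        raise ValueError("isolation dates must not all be identical")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(dates, distances))
    slope = sxy / sxx
    if slope == 0:
        raise ValueError("no temporal signal: slope is zero")
    intercept = mean_y - slope * mean_x
    return slope, intercept, -intercept / slope


# Hypothetical isolates: year of isolation and root-to-tip length (SNPs per Mb).
years = [1960, 1980, 2000, 2010]
lengths = [5.0, 15.0, 25.0, 30.0]
slope, intercept, root_date = root_to_tip_regression(years, lengths)
print(f"rate = {slope:.2f} SNPs/Mb/year, estimated root date = {root_date:.0f}")
```

When isolation dates span only a narrow window, the fitted slope is poorly constrained and the extrapolated root date becomes unreliable, which is consistent with the caveat noted for the CC23 lineage.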
Distribution of antibiotic resistance genes among the 229 sequenced isolates.
| Antibiotic resistance | Gene | CC1 | CC10 | CC17 | CC19 | CC23 | CC26 | Other | Total |
|---|---|---|---|---|---|---|---|---|---|
| Tetracycline | | 34 | 13 | 77 | 27 | 27 | 5 | 7 | 190 |
| | | 1 | 2 | 1 | 6 | 1 | 0 | 0 | 11 |
| | | 0 | 1 | 0 | 0 | 1 | 2 | 0 | 4 |
| Erythromycin | | 13 | 0 | 2 | 4 | 0 | 0 | 0 | 19 |
| | | 6 | 0 | 0 | 2 | 2 | 2 | 0 | 12 |
| | | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 |
| | | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 1 |
| Streptomycin | | 1 | 0 | 0 | 4 | 1 | 0 | 0 | 6 |
| Kanamycin | | 0 | 0 | 0 | 3 | 1 | 0 | 0 | 4 |
| Streptothricin | | 0 | 0 | 0 | 3 | 1 | 0 | 0 | 4 |
| Chloramphenicol | | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 2 |
| Lincosamides | | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 |
| Lincosamide-streptogramin | | 1 | 1 | 0 | 2 | 1 | 0 | 1 | 6 |
| Spectinomycin | | 3 | 0 | 0 | 2 | 1 | 2 | 0 | 8 |
| Total number of isolates | | 39 | 19 | 78 | 39 | 36 | 6 | 12 | 229 |

Values are numbers of isolates per clonal complex.
Characteristics of clones deriving from Tn916 and Tn5801 insertions.
| Clone | CC | Sequence type | Serotype | No. of isolates per clone | Integrative and conjugative element | Recombination events |
|---|---|---|---|---|---|---|
| 1 | 1 | 1 | V | 27 (2) | Tn | 92 kb |
| 2 | 1 | 196, 459 | IV | 6 | Tn | 80 kb |
| 3 | 1 | 2 | IV | 1 | Tn | – |
| 4 | 1 | 2 | IV | 1 | Tn | n.a. |
| 5 | 1 | 2 | II | 1 | Tn | n.a. |
| | 1 | | | 3 | No Tn916 or Tn5801 | n.a. |
| 6 | 10 | 10 | V-II | 5 | Tn | 347 kb |
| 7 | 10 | 8 | Ib | 4 | Tn | – |
| 8 | 10 | 12 | II | 1 | Tn | n.a. |
| 9 | 10 | 10 | Ib | 1 | Tn | n.a. |
| 10 | 10 | 10 | II | 2 | Tn | – |
| | 10 | | | 5 | No Tn916 or Tn5801 | n.a. |
| 11 | 17 | 17 | III | 42 (3) | Tn | – |
| 12 | 17 | 17 | III | 23 | Tn | – |
| 13 | 17 | 17 | III | 6 | Tn | – |
| 14 | 17 | 17 | III | 2 | Tn | – |
| 15 | 17 | 17 | III | 1 | Tn | n.a. |
| 16 | 17 | 291 | IV | 5 | Tn | – |
| 17 | 19 | 19 | III V | 20 (3) | Tn | 253 kb |
| 18 | 19 | 19 | III | 4 (1) | Tn | – |
| 19 | 19 | 19 | III | 1 | Tn | n.a. |
| 20 | 19 | 28 | II | 2 | Tn | – |
| 21 | 19 | 28 | II | 3 | Tn | – |
| 22 | 19 | 28 | II | 1 | Tn | n.a. |
| | 19 | | | 8 | No Tn916 or Tn5801 | n.a. |
| 23 | 23 | 23 | Ia | 18 (1) | Tn | 107 kb |
| 24 | 23 | 23, 144 | Ia–V | 8 (1) | Tn | 91 kb |
| 25 | 23 | 23 | III | 1 | Tn | n.a. |
| 26 | 23 | 23 | Ia | 1 | Tn | n.a. |
| | 23 | | | 8 | No Tn916 or Tn5801 | n.a. |
| 27 | 26 | 26 | V | 3 | Tn | – |
| 28 | 26 | 26 | V | 2 | Tn | 77 kb R5 protein |
| 29 | 22 | 22 | II | 2 | Tn | – |
n.a., Not applicable.
Position relative to the completely sequenced 2603 V/R genome[45].
Size of the recombined regions and antigenic loci.
The number of strains predicted to have lost Tn916 or Tn5801 is indicated in parentheses.
Tn916 is inserted in the opposite orientation at the same site.
Tn916 is inserted within genomic islands that are themselves inserted at the reported locations relative to the 2603 V/R genome.
972 kb corresponds to a hot spot of insertion of Tn5801 at the 5′ end of the guaA gene.
A dash indicates no recombination event.
Rows without a clone number give the number of isolates in clonal complexes (CC) 1, 10, 19 and 23 predicted not to have acquired Tn916 or Tn5801.
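As a consistency check on how the clone table is organised, the isolates per clone in each clonal complex, plus the isolates predicted not to have acquired Tn916 or Tn5801, should add up to the number of isolates listed for that complex in the diversity table. The Python sketch below performs this check. The grouping of clones into clonal complexes follows the order of the rows in the table and is an assumption of this sketch, as are the variable and function names.

```python
# Isolates per clone, grouped by clonal complex (grouping assumed from row order).
clone_isolates = {
    "CC1": [27, 6, 1, 1, 1],          # clones 1-5
    "CC10": [5, 4, 1, 1, 2],          # clones 6-10
    "CC17": [42, 23, 6, 2, 1, 5],     # clones 11-16
    "CC19": [20, 4, 1, 2, 3, 1],      # clones 17-22
    "CC23": [18, 8, 1, 1],            # clones 23-26
}

# Isolates predicted not to have acquired Tn916 or Tn5801 (none listed for CC17).
without_ice = {"CC1": 3, "CC10": 5, "CC19": 8, "CC23": 8}

# "No. of isolates" row of the diversity table.
reported_totals = {"CC1": 39, "CC10": 18, "CC17": 79, "CC19": 39, "CC23": 36}


def cc_total(cc: str) -> int:
    """Sum the clone counts for one clonal complex plus its isolates without the ICE."""
    return sum(clone_isolates[cc]) + without_ice.get(cc, 0)


# Collect any clonal complex whose reconstructed total differs from the reported one.
mismatches = {
    cc: (cc_total(cc), reported)
    for cc, reported in reported_totals.items()
    if cc_total(cc) != reported
}
print(mismatches or "all clonal complex totals agree")
```

The comparison uses the diversity table as the reference; the total row of the antibiotic resistance table lists slightly different counts for CC10 and CC17 (19 and 78).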