| Literature DB >> 28348829 |
Ann-Katrin Llarena1, Ji Zhang2, Minna Vehkala3, Niko Välimäki4, Marjaana Hakkinen5, Marja-Liisa Hänninen1, Mati Roasto6, Mihkel Mäesaar6,7, Eduardo Taboada8, Dillon Barker8, Giuliano Garofolo9, Cesare Cammà9, Elisabetta Di Giannatale9, Jukka Corander10, Mirko Rossi1.
Abstract
The decreased costs of genome sequencing have increased the capability to apply whole-genome sequencing to epidemiological surveillance of zoonotic Campylobacter jejuni. However, knowledge of the genetic diversity of this bacteria is vital for inferring relatedness between epidemiologically linked isolates and a necessary prerequisite for correct application of this methodology. To address this issue in C. jejuni we investigated the spatial and temporal signals in the genomes of a major clonal complex and generalist lineage, ST-45 CC, by analysing the population structure and genealogy as well as applying genome-wide association analysis of 340 isolates from across Europe collected over a wide time range. The occurrence and strength of the geographical signal varied between sublineages and followed the clonal frame when present, while no evidence of a temporal signal was found. Certain sublineages of ST-45 formed discrete and genetically isolated clades containing isolates with extremely similar genomes regardless of time and location of sampling. Based on a separate data set, these monomorphic genotypes represent successful C. jejuni clones, possibly spread around the globe by rapid animal (migrating birds), food or human movement. In addition, we observed an incongruence between the genealogy of the strains and multilocus sequence typing (MLST), challenging the existing clonal complex definition and the use of whole-genome gene-by-gene hierarchical nomenclature schemes for C. jejuni.Entities:
Keywords: Campylobacter jejuni; genomic epidemiology; monomorphic clones; whole-genome sequencing
Mesh:
Year: 2016 PMID: 28348829 PMCID: PMC5359405 DOI: 10.1099/mgen.0.000088
Source DB: PubMed Journal: Microb Genom ISSN: 2057-5858
Overview of isolates included in this study
See text for details.
| Human | 144 | UK | PubMLST |
| 36 | Finland | ||
| Animal | 42 | UK | PubMLST |
| 95 | Finland | ||
| 3 | Estonia | This study | |
| 2 | Lithuania | This study | |
| Environment | 5 | Finland | |
| Unknown | 13 | UK | PubMLST |
| Animal | 1 | Estonia | This study |
| Human | 31 | Canada | Public Health Agency of Canada |
| Animal | 8 | Italy | Italian surveillance study |
| 40 | Canada | Public Health Agency of Canada | |
| 6* | Finland | This study | |
| Environment | 11 | Canada | Public Health Agency of Canada |
*Migratory barnacle geese.
Population structure of ST-45 CC with associated geographical signal according to clonal frame
Geographic association with a country/region is indicated in bold type. EST: Estonia, LIT: Lithuania, FIN: Finland, UK: United Kingdom, BAL: Baltics (FIN, LIT, EST), unk: unknown.
| bp. diff.† | STs* | Year | Phyly§ | Recom|| | |||
|---|---|---|---|---|---|---|---|
| Clade | |||||||
| 1* | 1.7×10−2 | 10 | ST-4791, ST-7259 | Para | 0.701 | ||
| 8* | 7.4×10−5 | 12 | ST-7259 | Mono | 0.804 | ||
| 12* | 2.7×10−6 | 2 | ST-7256 | Mono | 0.262 | ||
| Clade | |||||||
| 2* | 1.5×10−3 | 46 | ST-230 | FIN, UK | 2004-2013 | Mono | 0.059 |
| 4 | 2.4×10−5 | 42 | ST-45 | EST, FIN, UK | 1999-2013 | Mono | 0.042 |
| 5 | 1.0×10−5 | 26 | ST-45 | LIT, FIN, UK | 2008-2013 | Mono | 0.075 |
| 6 | 9.2×10−5 | 83 | ST-45, ST-7008 | LIT, FIN, UK | 2000-2013 | Mono | 0.001 |
| 7* | 4.4×10−3 | 45 | ST-25, ST-45, ST-233, ST-686, ST-845, ST-1326, ST-1701, ST-2197 | FIN, | 2004-2013 | Poly | 0.054 |
| 9* | 3.4×10−3 | 42 | ST-45, ST-137, ST-538, ST-2109, ST-6017 | FIN, | 2000-2013 | Poly | 0.078 |
| 10* | 1.6×10−3 | 28 | ST-11 | 1999-2013 | Mono | 0.124 | |
| 11* | 5.1×10−3 | 4 | ST-1003 | UK, FIN | 2004-2010 | Mono | 0.237 |
*BAPS populations with evidence of geographical structuring with the country of origin marked in bold.
†Average genetic distance nucleotide in the aligned, non-recombining core genome.
‡Number of strains in each BAPS cluster.
§Phylogeny of the BAPS population according to the core genome ML tree.
||percentage of core genome alignment with signs of recombination originating from outside the dataset.
¶Variable association within the BAPS cluster, as BAPS 22*–24* at the secondary level were associated with UK origin.
Fig. 1.(a) Phylogram of RAxML tree based on non-recombined core genome. Semicircles from outside to inside represent genetic features listed in the legends. Branches are colored according to hierBAPS clustering. The two major clades (a and b) are indicated. (b) Inner phylogram: RAxML-tree as described for Fig. 1(a). Outer phylogram: RAxML-tree based on a binary matrix of presence–absence of accessory genes. Nodes with bootstrap values >50 % are indicated with a circle. Semicircles from outside to inside: ST, year and country of origin (see legends for indications). Branches are colored according to hierBAPS clustering.
Fig. 2.Manhattan plots illustrating significant hits associated with UK (red, upper) or Baltic countries (blue, lower) mapped to C. jejuni M1 or C. jejuni 4031 genomes, respectively. The dots represent number of words (k-mers), and the regions with higher number of mapped words are indicated with arrows and letters. In the UK Manhattan plot, region A (11 k-mers) mapped to position 153 200–153 295 and region B (12 k-mers) to position 240 500–240 600 on M1. In the lower Baltic plot, region C (18 k-mers) mapped to position 1 276 300–1 277 300 on 4031.
Fig. 3.Bayesian phylogenetic tree of Finnish BAPS 6 isolates inferred with BEAST based on non-recombined core genome of approximately 1.2 Mb. Year of isolation is indicated by the color of the bar: yellow: 2000, blue: 2001, purple: 2004, green: 2008, orange: 2012. Two subclusters of BAPS 6 are scaled up and presented as embedded illustrations. Predicted divergence times with confidence intervals given in years before present are given at their respective nodes. Single-nucleotide variant (SNV) numbers in a cluster are given in yellow and light green boxes near the most common recent ancestor of that cluster (when space allows, otherwise located between the involved leafs). Fewer than five SNVs were detected between epidemiologically linked isolates (two from 2008, two from 2012, two from 2004, three from 2012), indicating the usefulness of WGS in outbreak investigations.
Fig. 4.Comparison of ST-45 of Canadian, Italian and barnacle geese origin and one representative of each hierBAPS cluster level 2 on the b-lineage. Purple open circle: BAPS 4 isolate, green open circles: BAPS 6 isolate, green triangles: barnacle geese isolate, red diamonds: Canadian strains, blue squares; Italian strains. BAPS 4 and 6 are highlighted. Genetic distance (nt substitutions per site) is indicated with a bar.