| Literature DB >> 23279096 |
Samuel K Sheppard1, Xavier Didelot, Keith A Jolley, Aaron E Darling, Ben Pascoe, Guillaume Meric, David J Kelly, Alison Cody, Frances M Colles, Norval J C Strachan, Iain D Ogden, Ken Forbes, Nigel P French, Philip Carter, William G Miller, Noel D McCarthy, Robert Owen, Eva Litrup, Michael Egholm, Jason P Affourtit, Stephen D Bentley, Julian Parkhill, Martin C J Maiden, Daniel Falush.
Abstract
Hybridization between distantly related organisms can facilitate rapid adaptation to novel environments, but is potentially constrained by epistatic fitness interactions among cell components. The zoonotic pathogens Campylobacter coli and C. jejuni differ from each other by around 15% at the nucleotide level, corresponding to an average of nearly 40 amino acids per protein-coding gene. Using whole genome sequencing, we show that a single C. coli lineage, which has successfully colonized an agricultural niche, has been progressively accumulating C. jejuni DNA. Members of this lineage belong to two groups, the ST-828 and ST-1150 clonal complexes. The ST-1150 complex is less frequently isolated and has undergone a substantially greater amount of introgression leading to replacement of up to 23% of the C. coli core genome as well as import of novel DNA. By contrast, the more commonly isolated ST-828 complex bacteria have 10-11% introgressed DNA, and C. jejuni and nonagricultural C. coli lineages each have <2%. Thus, the C. coli that colonize agriculture, and consequently cause most human disease, have hybrid origin, but this cross-species exchange has so far not had a substantial impact on the gene pools of either C. jejuni or nonagricultural C. coli. These findings also indicate remarkable interchangeability of basic cellular machinery after a prolonged period of independent evolution.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23279096 PMCID: PMC3749442 DOI: 10.1111/mec.12162
Source DB: PubMed Journal: Mol Ecol ISSN: 0962-1083 Impact factor: 6.185
Fig. 1Ancestry of Campylobacter jejuni and C. coli. (A) Neighbour-joining tree of 30 C. jejuni and C. coli genomes. Isolates belonging to C. jejuni are shown in blue, and those belonging to C. coli clade 1 are indicated in red, clade 2 in yellow, and clade 3 in green. The scale bar represents a genetic distance of 0.01. (B) Genetic ancestry of 239 543 polymorphic sites among C. jejuni and C. coli isolates inferred using structure assuming 2 populations. Genomes are ordered according to isolate NCTC 11168(Parkhill et al. 2000), and each nucleotide is coloured according to genetic ancestry to C. jejuni (▪) or C. coli (□). (C) Histograms of nucleotide divergence between C. jejuni (isolate 4) and genomes from C. coli clade 1 (red), 2 (yellow), 3 (green) and C. jejuni (blue). Pairwise comparisons between C. jejuni and un-recombined clade 1 (isolate 23 in the example), and clade 2 and 3 genomes (upper panel) show a unimodal distribution with modes between 10 and 12%. Comparison of C. jejuni with C. coli clade 1 isolates from the ST-828 and ST1150 complexes (middle panel) has a bimodal distribution with similar modes at 10–12% but with an earlier mode at <2%. The nucleotide divergence of the earlier mode is similar to comparison between two C. jejuni genomes (bottom panel).
Fig. 2The effect of interspecific recombination on tree clade structure. Maximum likelihood trees, based on the Tamura–Nei model, of 30 Campylobacter jejuni and C. coli genomes are based on concatenated sequences of ribosomal protein (rps) subunit loci genes that show (A) 35 genes with no evidence of homologous recombination and (B) 16 with evidence of recombination in at least one isolate using clonalframe. Isolates belonging to C. jejuni are shown in blue, and those belonging to C. coli clade 1 are indicated in red, clade 2 in yellow, and clade 3 in green. The trees are drawn to scale, with branch lengths measured in the number of substitutions per site. The scale bar represents a genetic distance of 0.01. Trefoil clade structure is resolved in nonrecombining genes.
Fig. 3History of recombination in C. coli clade 1. (A) Genealogy inferred using clonalframe. (B) Recombination and mutation events on each branch of the genealogy, for an example, 1046 bp gene (metC2). Crosses indicate substitutions by either mutation or recombination. Recombination events inferred with high posterior probability are coloured according to whether the imported DNA is more similar to genomes from Campylobacter jejuni (blue) or C. coli clades 2 and 3 (yellow). (C) Total number of inferred homologous and nonhomologous imports on each branch from each species across the genome. Isolate 16 was excluded from this analysis because of poor genome coverage.
Fig. 4Homology dependence of recombination between Campylobacter jejuni and C. coli. The distribution of divergence between C. jejuni (isolate 4) and unintrogressed C. coli clade 1 (isolate 23) for the recombinant genes (white) and the nonrecombinant genes (black). Recombination is rarer in areas of the genome where there is high divergence between species but the effect is slight with a large overlap between the two distributions. Recombination occurs between genes at all levels of divergence.
Campylobacter jejuni genes with homologous sequence (70% blast similarity, >50% of the gene) present among introgressed C. coli clade 1 genomes and hypotheses about their potential function
| Gene | Product | Description |
|---|---|---|
| Transport and metabolism of L-fucose | ||
| Cj0480c | Transcriptional regulator | This is divergently transcribed from the other genes in this unit and is likely to be regulating the rightward reading genes. Cj0480 is an IclR family regulator (Gundogdu |
| Cj0481 (annotated as | Putative dihydropicolinate (DHP) synthase | DHP, also present in |
| Cj0482/0483 ( | Putative altronate or D-galactarate (sugar) hydrolase | Could be a pseudogene because the N-terminus encoded in Cj0482 and the C-terminus in Cj0483 is separated by a stop codon. There are examples where such genes are expressed. Possible fucose hydrolase? |
| Cj0484 | Major facilitator superfamily transport protein | Probably, a substrate-proton symporter to import a substrate driven by the pmf. It has some similarity to phthalate (aromatic) family transporters (Gundogdu |
| Cj0485 | Dehydrogenase/oxidoreductase, FabG family. | This is possibly an alcohol dehydrogenase |
| Cj0486 | Probable L-fucose transporter | This is a sugar transporter of the major facilitator superfamily, with significant similarity to the L-fucose–proton symporter of |
| Cj0489/Cj0490 | Putative aldehyde dehydrogenase | Potentially involved in a step of fucose catabolism? |
| Zinc uptake system | ||
| Cj0263 | Zinc transporter ZupT | There may be a connection between the zinc uptake system genes in supplying zinc for the activity of the protease. A number of proteins contain the Cj1589 domain, so it is difficult to predict the function but there may be a zinc connection with Cj0263. |
| Cj0620 | Zinc-dependent protease | |
| Cj1589 | Zinc-dependent hydrolase, possibly a beta-lactamase or glyoxalase II | |
| Flagellin-associated genes | ||
| Cj1339 ( | Flagellin protein | Flagellin-associated proteins that could be involved in niche colonization. The presence of |
| Cj1338 ( | Flagellin protein | |
| Cj0548 ( | Hook-associated protein | |
| Cj1299 ( | Acyl carrier protein for the O-linked glycosylation locus | |
| Cj1409( | Holo-acyl carrier protein synthase | |
| Miscellaneous | ||
| Cj0555 | Putative malonate (HOOC.CH2.COOH) transporter | This could be involved in growth on malonate, but this is an uncommon plant-derived carbon source. |
| Cj1297 | Putative component of the efflux system | Speculatively associated with antibiotic efflux. |
| Cj1365c | Secreted serine protease | Could be associated with breakdown of specific proteins for growth on amino acids |
| Cj1506c ( | Chemoreceptor for aspartate A | Chemotaxis towards aspartate, as facilitated by CcaA, is involved in the colonization of the intestinal tract (Hartley-Tassell |
| Cj1051c ( | — | A restriction modification enzyme |
| Cj1134 ( | Lauroyl acyltransferase | Enzyme involved in the biosynthesis of LipidA. This will probably be an essential gene |
| Cj1414c ( | Part of the capsule locus | Probable capsule polysaccharide modification gene |
| Cj1187c ( | Arsenical efflux pump | Used for detoxification. Actual substrate cannot be predicted, but Cj1297 may also have a related broad detoxification function |
| Cj0308c ( | Dethiobiotin synthase | Involved in synthesis of the cofactor Biotin. |
Fig. 5A scenario for the evolution of Campylobacter jejuni and C. coli. These species diverged followed by the split of C. coli clades 1, 2 and 3. Recombination from C. jejuni to C. coli clade 1 began at some point before R1, and subsequent clonal expansion of introgressed lineages (828 and 1150 clonal complexes) at R1 and R2 led to the dominance of hybrid lineages in agriculture, human disease and currently available isolate collections. Clade 2 (C2) and 3 (C3) and clade 1 (C1*) populations from wild bird and environmental reservoirs (e.g. represented by isolates 16 and 23) remained unintrogressed. The cross-sectional area and diameter of the lineage ‘trunks’ are based on the abundance of isolates in the PubMLST database and the length of trunks is arbitrarily defined.
Distribution of C. coli lineages among different sources*
| Source | |||||||
|---|---|---|---|---|---|---|---|
| Farm | Clinical | Riparian | |||||
| Clade | Isolates | STs | Isolates | STs | Isolates | STs | |
| 1 | ST-828 complex | 915 | 216 | 481 | 86 | 0 | 0 |
| 1 | ST-1150 complex | 38 | 12 | 0 | 0 | 0 | 0 |
| 1 | Clade 1 other | 201 | 107 | 19 | 17 | 0 | 0 |
| 2 | 0 | 0 | 0 | 0 | 37 | 31 | |
| 3 | 2 | 2 | 0 | 0 | 30 | 25 | |
MLST typed farm isolates (from cattle, chicken, pig, sheep or turkey faeces or meat), riparian isolates (from duck, swan, pigeon and gull faeces and environmental water samples),and clinical isolates (from human blood and faeces) are from published studies and defined locations (Sheppard et al. 2010b). Most clade 1 isolates that are not part of the ST-828 or 1150 complexes, nevertheless, share alleles with them suggesting recent common ancestry. Note that isolates sequenced for the current study were taken from a wider collection including additional riparian and wild bird isolates.
Sheppard et al. (2010b).