| Literature DB >> 34661518 |
Anastasia-Lisa Dieckmann1, Thomas Riedel2,3, Boyke Bunk2, Cathrin Spröer2, Jörg Overmann2,3, Uwe Groß1, Oliver Bader1, Wolfgang Bohne1, Burkhard Morgenstern4, Morteza Hosseini4,5, Andreas E Zautner1.
Abstract
The intriguing recent discovery of Campylobacter coli strains, especially of clade 1, that (i) possess mosaic C. coli/C. jejuni alleles, (ii) demonstrate mixed multilocus sequence types (MLSTs) and (iii) have undergone genome-wide introgression has led to the speculation that these two species may be involved in an accelerated rate of horizontal gene transfer that is progressively leading to the merging of both species in a process coined 'despeciation'. In an MLST-based neighbour-joining tree of a number of C. coli and C. jejuni isolates of different clades, three prominent Campylobacter isolates formed a seemingly separate cluster besides the previously described C. coli and C. jejuni clades. In the light of the suspected, ongoing genetic introgression between the C. coli and C. jejuni species, this cluster of Campylobacter isolates is proposed to present one of the hybrid clonal complexes in the despeciation process of the genus. Specific DNA methylation as well as restriction modification systems are known to be involved in selective uptake of external DNA and their role in such genetic introgression remains to be further investigated. In this study, the phylogeny and DNA methylation of these putative C. coli/C. jejuni hybrid strains were explored, their genomic mosaic structure caused by C. jejuni introgression was demonstrated and basic phenotypic assays were used to characterize these isolates. The genomes of the three hybrid Campylobacter strains were sequenced using PacBio SMRT sequencing, followed by methylome analysis by Restriction-Modification Finder and genome analysis by Parsnp, Smash++ and blast. Additionally, the strains were phenotypically characterized with respect to growth behaviour, motility, eukaryotic cell invasion and adhesion, autoagglutination, biofilm formation, and water survival ability. Our analyses show that the three hybrid Campylobacter strains are clade 1 C. coli strains, which have acquired between 8.1 and 9.1 % of their genome from C. jejuni. The C. jejuni genomic segments acquired are distributed over the entire genome and do not form a coherent cluster. Most of the genes originating from C. jejuni are involved in chemotaxis and motility, membrane transport, cell signalling, or the resistance to toxic compounds such as bile acids. Interspecies gene transfer from C. jejuni has contributed 8.1-9.1% to the genome of three C. coli isolates and initiated the despeciation between C. jejuni and C. coli. Based on their functional annotation, the genes originating from C. jejuni enable the adaptation of the three strains to an intra-intestinal habitat. The transfer of a fused type II restriction-modification system that recognizes the CAYNNNNNCTC/GAGNNNNNRTG motif seems to be the key for the recombination of the C. jejuni genetic material with C. coli genomes.Entities:
Keywords: Campylobacter coli; Campylobacter jejuni; PacBio SMRT sequencing; genome; introgression; methylome
Mesh:
Substances:
Year: 2021 PMID: 34661518 PMCID: PMC8627207 DOI: 10.1099/mgen.0.000679
Source DB: PubMed Journal: Microb Genom ISSN: 2057-5858
Fig. 1.MLST-based phylogenetic tree of and . MLST-based dendrogram including sequences of 41 strains places the three hybrid strains (red) between (yellow) and (clades 1A green, 1B dark green, 1C pale green, clade 2 purple, clade 3 blue).
Fig. 2.Core genome analysis of representative genomes of and by Parsnp. The core genome-based analysis of 20 . and four . genomes identifies the isolates meC0280, meC0281 and meC0467 as clade 1 representatives.
Fig. 3.Similarity analysis of hybrid strains to or references. Genome segments of 1000 bp (Smash++ algorithm) are matched to the genomes of BfR-CA-9557 and NCTC 11168. (a) Segments that are more similar to are shown in turquoise in the track diagram in their position in the genome, segments that are more similar to are shown in red, and equal segments in are shown in yellow. (b) Condensed representations of the Smash++ segments according to their similarity to and in absolute numbers and in per cent.
Fig. 4.Functional subsystems identified in 141 genes of the hybrid strain meC0280 originating from . This pie chart illustrates the functional subsystems of the 141 genes of the hybrid strain meC0280, which presumably originated from . In particular, the two categories motility and chemotaxis as well as membrane transport dominate. Isolate meC0280 was chosen as an example because with 141 genes of putative origin compared to the other two strains, it contains the most genes of putative origin.
Fig. 5.Venn diagram to illustrate the common proportions of genes putatively derived from and taken up by the hybrid strains. The three hybrid strains have 141, 136 or 124 genes integrated into their genome, which are presumably derived from . These genes are relatively different between the individual hybrid isolates although individual functional groups are clearly overrepresented. Just 23 genes originating from are found in all three hybrid genomes.
Putative meC0280 restriction modification systems
|
ORF (REBASE) |
Strand |
Position in genome |
Description |
Type/subunit |
Predicted recognition sequence |
|---|---|---|---|---|---|
|
2066 |
+ |
1511587–1513905 |
Cco280IP Type I restriction-modification system, restriction subunit R (EC 3.1.21.3) |
I/R |
CRAANNNNNNNRTAG |
|
2069 |
+ |
1516316–1517563 |
S.Cco280I Type I restriction-modification system, specificity subunit S (EC 3.1.21.3) |
I/S |
CRAANNNNNNNRTAG |
|
2071 |
+ |
1518755–1520242 |
M.Cco280I Type I restriction-modification system, DNA-methyltransferase subunit M (EC 2.1.1.72) |
I/M |
CRAANNNNNNNRTAG |
|
50 |
+ |
46204–48537 |
Cco280ORFAAP putative type IIS restriction/modification enzyme |
II/RM |
|
|
51 |
+ |
48507–050015 |
Cco280ORFABP putative type IIS restriction/modification enzyme |
II/RM |
|
|
1206 |
− |
889311–890063 |
M.Cco280ORFBP adenine-specific methyltransferase (EC 2.1.1.72) |
II/M |
|
|
1377 |
− |
997265–1000351 |
Cco280IV / Cco280ORFCP adenine-specific DNA methyltransferase |
II/RM |
GGGTDA |
|
1382 |
− |
1002805–1006899 |
Cco280III Type II restriction-modification system, DNA-methyltransferase subunit M (EC 2.1.1.72)/Type II restriction-modification system, restriction subunit R (EC 3.1.21.3) |
II/RM |
GAGNNNNNRTG |
|
2185 |
+ |
1599644–1600738 |
M.Cco280II DNA modification methylase (adenine-specific methyltransferase) (EC 2.1.1.72) |
II/M |
RAATTY |
|
2238 |
− |
1633326–1634678 |
Cco280McrCP McrBC restriction endonuclease system, McrB subunit, putative |
IV/R |
|
|
2239 |
− |
1634665–1636548 |
Cco280McrBP McrBC restriction endonuclease system, McrB subunit, putative |
IV/R |
|
Methylation motifs of meC0280
|
No. |
Motif |
Modified position |
Modification yype |
% Motifs detected |
No. of motifs detected |
No. of motifs in genome |
Mean modification QV* |
Mean motif coverage |
Partner motif |
|---|---|---|---|---|---|---|---|---|---|
|
A1 |
CTAYNNNNNNNTTYG |
3 |
m6A |
100 |
284 |
284 |
253.97 |
180.29 |
CRAANNNNNNNRTAG |
|
A2 |
CRAANNNNNNNRTAG |
4 |
m6A |
100 |
284 |
284 |
231.44 |
177.57 |
CTAYNNNNNNNTTYG |
|
B1 |
CAYNNNNNCTC |
2 |
m6A |
100 |
797 |
797 |
244.40 |
181.57 |
GAGNNNNNRTG |
|
B2 |
GAGNNNNNRTG |
2 |
m6A |
99.62 |
794 |
797 |
232.87 |
179.81 |
CAYNNNNNCTC |
|
C |
RAATTY |
3 |
m6A |
99.74 |
28 407 |
28 482 |
258.96 |
177.96 |
RAATTY |
|
D |
GGGTDA |
6 |
m6A |
99.68 |
1573 |
1578 |
251.71 |
178.39 |
|
The PacBio SMRT Analysis software was used to identify methylation motifs. The sequence consensus of the motif is shown in column 2 whereas the reverse complement of the motif, the partner motif, is shown in column 10. IUPAC ambiguity codes represent non-uniform positions. The position of the modified base within the motif and the type of methylation are indicated in columns 3 and 4. Column 5 denotes the percentage of a motif’s occurrences in the genome (column 7) for which a methylation has been detected (column 6). Column 8 lists the average modification quality (in Phred Q-scores) and column 9 the average coverage of motifs detected as modified.
*QV, quality value
Fig. 6.Methylation motifs in the three hybrid strains. Sequence logos of four methylation motifs detected in the hybrid isolate meC0280. The two motifs in A1 and in A2 as well as B1 and B2 are partner motifs that are methylated at both strands. All motifs are recognized by N-6 adenine-specific methyltransferases. The height of each stack indicates the degree of conservation (bits). The height of the letters represents the relative frequency of the base. The asterisk under a particular letter indicates the modified/methylated base.
Alignments to 21 . - and 27 . -specific gene markers
|
Gene identifier |
Species association of gene identifier |
meC0280 |
meC0281 |
meC0467 | |||
|---|---|---|---|---|---|---|---|
|
Identity (%) |
Coverage (%) |
Identity (%) |
Coverage (%) |
Identity (%) |
Coverage (%) | ||
|
Cc76339_00005 c |
|
89 |
100 |
89 |
100 |
89 |
100 |
|
Cc76339_01340 |
|
92 |
99 |
91 |
99 |
91 |
99 |
|
Cc76339_01460 c |
|
96 |
100 |
96 |
100 |
96 |
100 |
|
Cc76339_01470 c |
|
98 |
100 |
98 |
100 |
98 |
100 |
|
Cc76339_01480 c |
|
96 |
100 |
97 |
100 |
97 |
100 |
|
Cc76339_01490 c |
|
94 |
100 |
94 |
100 |
94 |
100 |
|
Cc76339_01750 |
|
95 |
100 |
95 |
100 |
95 |
100 |
|
Cc76339_02240 |
|
96 |
100 |
96 |
100 |
95 |
100 |
|
Cc76339_03250 |
|
98 |
100 |
97 |
100 |
98 |
100 |
|
Cc76339_04670 |
|
92 |
100 |
92 |
100 |
92 |
100 |
|
Cc76339_09670 |
|
94 |
100 |
94 |
100 |
94 |
100 |
|
Cc76339_10710 |
|
97 |
100 |
98 |
100 |
98 |
100 |
|
Cc76339_10950 |
|
93 |
100 |
92 |
100 |
92 |
100 |
|
Cc76339_11130 |
|
94 |
100 |
96 |
100 |
97 |
100 |
|
Cc76339_11470 |
|
85 |
100 |
85 |
100 |
81 |
100 |
|
Cc76339_11500 c |
|
84 |
98 |
84 |
98 |
84 |
98 |
|
Cc76339_12660 c |
|
94 |
100 |
94 |
100 |
94 |
100 |
|
Cc76339_12670 |
|
92 |
100 |
92 |
100 |
92 |
100 |
|
Cc76339_12940 |
|
93 |
98 |
93 |
98 |
93 |
98 |
|
Cc76339_15800 |
|
96 |
100 |
97 |
100 |
96 |
100 |
|
Cc76339_15900 c |
|
94 |
100 |
94 |
100 |
95 |
100 |
|
11168_Cj0011c |
|
– |
– |
– |
– |
– |
– |
|
11168_Cj0090 |
|
– |
– |
– |
– |
– |
– |
|
11168_Cj0135 |
|
72 |
97 |
72 |
97 |
72 |
97 |
|
11168_Cj0186c |
|
67 |
92 |
67 |
92 |
67 |
92 |
|
11168_Cj0327 |
|
– |
– |
– |
– |
– |
– |
|
11168_Cj0339 |
|
– |
– |
– |
– |
– |
– |
|
11168_Cj0340 |
|
– |
– |
– |
– |
– |
– |
|
11168_Cj0414 |
|
– |
– |
– |
– |
– |
– |
|
11168_Cj0454c |
|
70 |
99 |
70 |
100 |
– |
– |
|
11168_Cj0494 |
|
– |
– |
– |
– |
– |
– |
|
11168_Cj0873c |
|
– |
– |
– |
– |
– |
– |
|
11168_Cj0900c |
|
– |
– |
– |
– |
– |
– |
|
11168_Cj1021c |
|
72 |
57 |
72 |
57 |
72 |
57 |
|
11168_Cj1036c |
|
88 |
61 |
89 |
61 |
89 |
61 |
|
11168_Cj1060c |
|
– |
– |
77 |
42 |
74 |
42 |
|
11168_Cj1162c |
|
– |
– |
– |
– |
– |
– |
|
11168_Cj1666c |
|
– |
– |
– |
– |
– |
– |
|
11168_Cj1714 |
|
– |
– |
– |
– |
– |
– |
|
11168_ctsT |
|
96 |
60 |
96 |
60 |
98 |
60 |
|
11168_kdpD |
|
95 |
48 |
95 |
48 |
95 |
48 |
|
11168_tonB2 |
|
97 |
30 |
97 |
30 |
97 |
30 |
|
Cj81116_1523 |
|
– |
– |
– |
– |
– |
– |
|
Cj_81–176_1820 (CJ81176_0363)* |
|
– |
– |
– |
– |
– | |
|
Cj_81–176_6530 (CJ81176_1246)* |
|
– |
– |
– |
– |
– |
– |
|
Cj_81–176_8530 (FORC46_1556)* |
|
– |
– |
– |
– |
– |
– |
|
Cj_81–176_RS08535 (CJ81176_1613)* |
|
– |
– |
– |
– |
– |
– |
|
Cjdoylei_26997_0913 |
|
– |
– |
– |
– |
– |
– |
Gene identifier in the re-annotated genome.