
A Campylobacter integrative and conjugative element with a CRISPR-Cas9 system targeting competing plasmids: a history of plasmid warfare?

Arnoud H M van Vliet¹, Oliver J Charity², Mark Reuter².


Keywords: CRISPR-Cas; Campylobacter; mobile genetic elements; plasmids


Year: 2021 PMID: 34766904 PMCID: PMC8743540 DOI: 10.1099/mgen.0.000729

Source DB: PubMed Journal: Microb Genom ISSN: 2057-5858


Data Summary

All genome sequences used in this study are available on the National Center for Biotechnology Information (NCBI) Genome database or in the Campylobacter PubMLST website; the assembly accession numbers (NCBI Genome) or genome ID numbers (Campylobacter PubMLST) are listed in Table S1 (available in the online version of this article). Genome assemblies were quality checked based on N50, L50, genome size and number of contigs. CRISPR spacer sequences and predicted targets, Cas9 alignments, and the presence of mobile elements and plasmids are all included in the Supplementary Material. All supplementary tables and figures can be found on Figshare: https://doi.org/10.6084/m9.figshare.16988674.v1.

Understanding pathogen evolution is paramount for enhancing food safety and limiting pathogenic disease in humans and animals. Campylobacter species comprise a group of human and animal pathogens with a remarkable success rate, being the most frequent cause of bacterial food-borne disease in high-income countries. A common theme in Campylobacter evolution is genomic plasticity, and a significant proportion of this plasticity is driven by horizontal gene transfer that results in acquisition of complex traits in one evolutionary event. Understanding the mechanisms of transfer of mobile genetic elements (MGEs) and how MGEs such as integrative conjugative elements exclude other MGEs is fundamental to understanding Campylobacter evolution. CRISPR-Cas9 proteins play a role in bacterial immune systems, mediating the defence against bacteriophages, plasmids and integrative elements. The use of CRISPR-Cas by a mobile element to fight off competing elements, possibly to the advantage or detriment of its host, also increases our understanding of how important selfish genomic islands undergo co-evolution with bacterial pathogens, and generates insight into the complex warfare between MGEs.

Introduction

The genus Campylobacter is a member of the Epsilonproteobacteria, and comprises gram-negative bacteria that are commonly found in the intestines of warm-blooded animals. The best studied members are C. jejuni and C. coli, which are closely related thermophilic species commonly found in birds and animals involved in agriculture, i.e. poultry, cattle and pigs, while they are also found in many wild birds [1, 2]. They jointly represent the most common bacterial human diarrhoeal pathogens in the developed world, with transmission often foodborne via undercooked meat and cross-contamination in kitchen environments [3, 4]. Other related Campylobacter species include the recently described C. hepaticus found in poultry [5], C. upsaliensis, which is a zoonotic species from dogs and cats [6], and the C. lari group consisting of several species isolated from birds and animals connected to coastal environments [7]. Horizontal gene transfer (HGT) plays a major role in the evolution of microbial genomes [8]. Phages and plasmids are contributors to HGT-driven genomic plasticity, with transfer conducted by either transduction or conjugation, or alternatively by natural transformation [9]. One class of mobile genetic elements (MGEs) are the integrative and conjugative elements (ICEs), which are self-transferable elements that can mediate excision, form a circular intermediate and often encode the genes for the Type IV conjugative pili used to transfer to a new recipient host cell [10, 11]. ICEs often contain genes required for reversible site-specific recombination, conjugation and regulation, but also carry ‘cargo’ genes that may confer antimicrobial resistance, virulence properties or metabolic capabilities to recipient cells [12], as well as addiction modules ensuring stable maintenance within the host cell [13]. Although acquisition of new genetic traits via HGT may have significant benefits for the recipient cell, the newly acquired sequences can also be detrimental to the host.
Therefore, cells have developed a diverse set of mechanisms to control entry, integration and expression of foreign DNA [14]. One such system consists of the Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) and the proteins encoded by CRISPR-associated (Cas) genes, which together form an RNA-guided, sequence-specific immune system against invading nucleic acids, often phages, plasmids and other transferable elements [15]. Many CRISPR-Cas systems have the Cas1 and Cas2 proteins mediating spacer acquisition [16] and other Cas proteins involved in expression, maturation/processing, and targeting and interference of the foreign DNA or RNA sequences, commonly phages and plasmids [17]. The RNA-guided endonuclease of the Type II CRISPR-Cas system is the Cas9 (Csn1/Csx12) protein, which mediates processing of CRISPR RNAs and subsequent interference with the targets, in combination with a guide RNA called trans-activating CRISPR RNA (tracrRNA) [18]. Early studies using multilocus sequence typing (MLST) indicated a high level of genetic variability in Campylobacter species such as C. jejuni and C. coli [19], and subsequent comparative genomic analyses have shown that this level of genetic variability is achieved by differences in genetic content and high levels of allelic variability [20-22], probably supported by the natural competence of many Campylobacter species. Along with a variety of small plasmids (<10 kb), there are three major classes of 30–60 kb plasmids in C. jejuni and C. coli (pVir, pTet and pCC42) [23-25], although these are of variable size and gene content [26]. There are also four chromosomally located MGEs first identified in C. jejuni RM1221 [27], of which CJIE1 is a Mu-like prophage, CJIE2 and CJIE4 are related temperate prophages [28-31], and CJIE3 is a putative ICE which can contain the Type VI secretion system (T6SS) [32, 33].
In a previous study, we showed that 98 % of C. jejuni genomes investigated contained a Type II-C CRISPR-Cas system consisting of cas9-cas1-cas2 genes and a relatively short spacer array (4.9±2.7 spacers, N=1942 genomes) [34]. In contrast, only 10 % of C. coli genomes contained a copy of the C. jejuni CRISPR-Cas system, while C. coli genomes from non-agricultural (environmental) isolates contained a closely related, but separate Type II-C CRISPR-Cas system with the full complement of cas9-cas1-cas2 genes, or an orphan cas9 gene without cas1 or cas2 genes [34]. Here we have expanded this survey of CRISPR-Cas systems in C. jejuni and C. coli, and show that there is a third, clearly distinct CRISPR-Cas system in both C. jejuni and C. coli, which is located on a relatively conserved chromosomally located ICE (CampyICE1). We have also investigated a possible role of this CRISPR-Cas system in contributing to plasmid competition in Campylobacter.

Methods

Identification of CRISPR-Cas systems

A collection of complete and draft genome sequences of C. jejuni (N=5829) and C. coli (N=1347) (Table S1) was obtained from the NCBI Genomes database (http://www.ncbi.nlm.nih.gov/genome/browse/) and the pubMLST website (http://pubmlst.org/campylobacter/) [35]. Genome assemblies were quality checked based on N50, L50, genome size and number of contigs, and have been used previously for studying gene distribution in Campylobacter [36, 37]. Genome sequences for non-jejuni/coli Campylobacter species such as C. hepaticus, the C. lari group and C. upsaliensis were obtained from the NCBI genome database using ncbi-genome-download version 0.2.11 (https://github.com/kblin/ncbi-genome-download/). Genome sequences were annotated with Prokka version 1.13 [38], and the annotation was searched for Cas9 orthologues with the C. jejuni Cj1523c (Cas9) amino acid sequence using blastp, while genome sequences were searched using tblastn to identify inactivated copies of cas9 genes. CRISPR arrays were identified as described previously [34], using the CRISPRfinder software (http://crispr.u-psud.fr/Server/) [39] and the CRISPR Recognition Tool CRT [40], further supported by blast searches and manual curation. Conservation of sequences was represented using Weblogo [41].
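The assembly quality check mentions N50 and L50 without defining them. As a minimal sketch, assuming the standard definitions (N50 is the length of the shortest contig in the smallest set of longest contigs that covers at least half the assembly, and L50 is the number of contigs in that set), the two values can be computed as follows. The function name and the toy contig lengths are illustrative and are not taken from the study.

```python
def n50_l50(contig_lengths):
    """Return (N50, L50) for a list of contig lengths in bp."""
    lengths = sorted(contig_lengths, reverse=True)
    half_total = sum(lengths) / 2
    running = 0
    for count, length in enumerate(lengths, start=1):
        running += length
        # The first contig that pushes the running total past half
        # the assembly size defines N50; its rank defines L50.
        if running >= half_total:
            return length, count
    raise ValueError("no contigs supplied")


# Hypothetical toy assembly, not data from the study.
contigs = [800_000, 500_000, 300_000, 100_000]
n50, l50 = n50_l50(contigs)
print(f"N50 = {n50} bp, L50 = {l50} contigs")
```

A fragmented draft assembly gives a low N50 and a high L50, which is why both values are useful alongside genome size and contig count when filtering assemblies.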

Prediction of putative targets of CRISPR spacers

A total of 108 unique and 16 variant families of the CampyICE1 CRISPR spacer sequences were used as queries on the CRISPRTarget website (http://brownlabtools.otago.ac.nz/CRISPRTarget/crispr_analysis.html) [42] to search the Genbank-Phage, Refseq-Plasmid and Refseq-Viral databases. Only Campylobacter targets were included for further analysis. Hits with plasmids from the pVir, pTet and pCC42 families were recorded. Individual genomes that carried plasmid-specific spacers and were positive for either pVir, pTet or pCC42 were searched with blast for the target sequences of the spacers from that genome.

Analysis of MGE and plasmid distribution

Genome sequences were screened using Abricate (https://github.com/tseemann/abricate) version 0.9.8, with each mobile element/plasmid subdivided into 600 nt fragments used as individual queries; each 600 nt query sequence was only scored as positive with a minimum coverage of 70 % and a minimum sequence identity of 80 %. The CJIE1, CJIE2, CJIE3 and CJIE4 elements were obtained from C. jejuni reference strain RM1221 [27]. Nucleotide positions in the RM1221 genome (accession number CP000025) were 207 005–244 247 (CJIE1), 498 503–538 770 (CJIE2), 1 021 082–1 071 873 (CJIE3) and 1 335 703–1 371 932 (CJIE4). The T6SS genes were taken from strain 108 (accession number JX436460). For the CampyICE1 element, genome sequences were screened with the CampyICE1 element from strain CCN26 (accession number NZ_FBML01, nucleotide positions contig 11 : 109 469–134 196 and reverse strand contig 17 : 19 482–78 836), the Clade 1a C. coli strain RM1875 (accession number CP007183, nucleotide positions 1 235 330–1 320 414) and the Clade 2 strain C8C3 (accession number FBQX01, nucleotide positions 905 906–996 822). The pCC42 plasmid sequence was obtained from strain 15-537360 (accession number CP006703), whereas the pTet (accession number CP000549) and pVir (accession number CP000550) plasmid sequences were obtained from C. jejuni 81-176. Other plasmids used were pRM3194 (accession number CP014345), pHELV-1 (accession number CP020479) and pSCJK2-1 (accession number CP038863). Genomes were scored as positive for a mobile element or plasmid if >50 % of the 600 nt queries for that element were positive. Genomes scoring 30–50 % were manually inspected for the distribution of matches and given a final score. Clinker version 0.0.20 [43] was used to generate comparative gene maps of MGEs and plasmids, using default settings. Table S1 includes the presence/absence information of the pCC42, pTet and pVir plasmids, and the CJIE1, CJIE2, CJIE3 and CJIE4 MGEs.
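The fragment-based scoring rule described above can be sketched in a few lines. This is an illustration under stated assumptions, not the authors' pipeline: the function names are hypothetical, the final fragment is allowed to be shorter than 600 nt, genomes below 30 % are assumed to be scored negative, and the per-query coverage and identity values would in practice come from the Abricate output rather than a hand-written dictionary.

```python
def fragment(sequence, size=600):
    """Split a reference element into consecutive queries of `size` nt.

    Assumption: the last fragment may be shorter than `size`.
    """
    return [sequence[i:i + size] for i in range(0, len(sequence), size)]


def query_is_positive(coverage, identity, min_cov=70.0, min_id=80.0):
    """Apply the per-query thresholds (coverage >= 70 %, identity >= 80 %)."""
    return coverage >= min_cov and identity >= min_id


def score_element(hits, n_queries):
    """Classify one genome for one element.

    hits: dict mapping query index -> (coverage %, identity %) of the best match;
          queries without any match are simply absent from the dict.
    """
    positive = sum(1 for cov, ident in hits.values()
                   if query_is_positive(cov, ident))
    percent = 100.0 * positive / n_queries
    if percent > 50:
        return "positive"
    if percent >= 30:
        return "manual inspection"
    return "negative"  # assumption: the text does not state the rule below 30 %


# Hypothetical example: a 1500 nt element gives three queries.
queries = fragment("A" * 1500)
hits = {0: (100.0, 95.0), 1: (90.0, 85.0), 2: (60.0, 99.0)}
print(len(queries), score_element(hits, len(queries)))
```

Scoring many short queries rather than one long alignment makes the call tolerant of elements that are split across contigs in draft assemblies, which is presumably why a fragment-based approach is useful here.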

Phylogenetic trees

Core genome MLST allelic profiles were generated for the 5829 C. jejuni and 1347 C. coli genomes using a 678 gene set described previously [44]. Allele calling was performed using chewBBACA version 2.6 [45] with default settings. Phylogenetic trees were generated using GrapeTree version 1.5.0 [46] with the RapidNJ implementation of neighbour-joining, and annotated using the standard seven-gene MLST clonal complexes as determined using the MLST program version 2.19 (https://github.com/tseemann/mlst). Cas9 protein sequences were aligned with MEGA7 using the MUSCLE algorithm with default settings [47], and phylogenetic trees were reconstructed using the MEGA7 neighbour-joining option, pairwise deletion and the Jones–Taylor–Thornton (JTT) model, with 500 bootstraps. Trees were visualised using MEGA7 [47] and Figtree version 1.4.2 (http://tree.bio.ed.ac.uk/software/figtree/).
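The cgMLST trees are built from distances expressed as the number of differing alleles. A minimal sketch of such a pairwise allelic distance is shown below. It is not the GrapeTree or chewBBACA implementation: the function name is hypothetical, and the choice to skip loci where either profile has a missing call (coded here as 0) is an assumption made for illustration.

```python
def allelic_distance(profile_a, profile_b, missing=0):
    """Count loci at which two cgMLST profiles carry different alleles.

    Assumption: loci with a missing call in either profile are ignored.
    """
    if len(profile_a) != len(profile_b):
        raise ValueError("profiles must cover the same loci")
    return sum(
        1
        for a, b in zip(profile_a, profile_b)
        if a != missing and b != missing and a != b
    )


# Hypothetical five-locus profiles (the study used a 678-gene scheme).
genome_1 = [1, 4, 2, 7, 3]
genome_2 = [1, 5, 2, 0, 9]
print(allelic_distance(genome_1, genome_2))
```

A matrix of these distances for all genome pairs is the kind of input a neighbour-joining method needs to build a tree.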

Results

Campylobacter jejuni and C. coli contain a third type II-C Cas9-encoding gene

A collection of 5829 C. jejuni and 1347 C. coli genomes was searched for the presence of Cas9 orthologues using the NCTC11168 Cj1523c and 76639 BN865_15240c amino acid sequences, representative of the two type II-C Cas9 proteins previously detected in C. jejuni and C. coli [34]. In addition to the cas9 genes representative of the C. jejuni/agricultural C. coli and the non-agricultural C. coli genomes, a third cas9 gene was detected in 134 (2.3 %) of C. jejuni genomes and 92 (6.8 %) of C. coli genomes. It is predicted to encode a 965 aa protein, with four C. jejuni and three C. coli genomes containing an interrupted cas9 gene. This new cas9 gene did not have adjacent cas1 or cas2 genes. Alignment of the predicted new Cas9 proteins from C. jejuni and the C. coli clades with Cas9 proteins from members of the genera Campylobacter and Helicobacter showed that the new Cas9 proteins form a separate cluster (Fig. 1), suggesting these have originated from a more distant common ancestor. Alignment of the additional Cas9 proteins from C. jejuni and the different C. coli genetic clades showed that the three RuvC motifs, the HNH motif and the R-rich region were all conserved (Fig. S1).

Fig. 1.

Phylogenetic tree comparing the CampyICE1 Cas9 proteins with other Campylobacter and Helicobacter Cas9 proteins. The CampyICE1 Cas9 protein (blue) is distinct from the previously described Cas9 proteins of C. jejuni and C. coli (red), other Campylobacter species (green), and selected Helicobacter species (black). C. jejuni subsp. doylei is shown as C. doylei. The tree was drawn using the neighbour-joining method based on an alignment with the MEGA7 MUSCLE plugin. Bootstrap values are indicated at branches which scored >95 %, based on 500 iterations using MEGA7, using the JTT matrix and pairwise deletion. Bar, the number of amino acid substitutions per site. An alignment of a subset of Cas9 proteins with domain annotation is provided in Fig. S1.

The novel CRISPR-Cas system is located on an integrative conjugative mobile element

We first looked for the genomic region containing the gene encoding the new Cas9 protein in completed C. jejuni and C. coli genomes. Only two complete genomes contained the additional cas9 gene; an inactivated copy of the cas9 gene was found on the RM1875 genome, while a complete copy of the gene was present in C8C3. The cas9 gene was flanked by a short CRISPR-repeat region with five to six repeats, similar to the repeat lengths reported previously [34]. Investigation of the surrounding genes showed the downstream presence of a putative Type IV conjugative transfer system, with traG, traN, traL and traE genes, as well as a parM gene encoding the chromosome segregation protein ParM. Upstream of cas9, genes annotated as DNA primase, thymidine kinase, XerC tyrosine recombinase and an integrase were detected, with the integrase flanked by a tRNA-Met gene as an integration site (Fig. 2a), thus matching the common components of an ICE [10]. We have named the cas9-containing ICE CampyICE1.

Fig. 2.

Structure and genetic conservation of CampyICE1 from C. jejuni and C. coli. (a) Schematic overview of the gene structure of CampyICE1 from C. jejuni and C. coli. The relative positions of the three CRISPR arrays and their transcriptional orientation are shown above the blocks of genes. In the CRISPR arrays, repeats are represented by arrowheads, and spacers by diamonds, with the ends of the flanking genes shown. The gene category colours are shown to highlight the large proportion of hypothetical proteins with no known function. (b) Graphical comparison of CampyICE1 elements from C. jejuni and C. coli genomes, presented as output of a comparison of Prokka-generated annotations [38] using Clinker [43]. The colours of the arrows in the figure are used to identify homologous blocks of genes, and are not related to the colours used in (a).

The RM1875 and C8C3 CampyICE1-containing genomic regions were used to search the 134 C. jejuni and 92 C. coli genomes containing the CampyICE1-cas9 gene for additional contigs matching the remaining CampyICE1 sequences, and these contigs were ordered accordingly. We were able to reconstruct the CampyICE1 genomic regions for 81 C. coli and 133 C. jejuni genomes; we annotated these and each showed genetic synteny. The size of the ICE ranged from 70.0 to 129.3 kb (average 87.7 kb, n=214), and each CampyICE1 region started with a gene encoding a putative integrase (in GenBank often annotated as 30S ribosomal subunit protein), followed by a XerC tyrosine recombinase. There were six relatively conserved blocks of genes downstream, of which the third block ends with the cas9 gene, and the fourth and the fifth blocks contain genes encoding conjugation proteins (Fig. 2a). Finally, the mobile element also contained up to three putative CRISPR arrays, each with at most a few repeats. The conservation of the CampyICE1 gene synteny is shown in Fig. 2(b) using three C. jejuni and three C. coli examples. Searches of the GenBank sequence database for orthologues of CampyICE1 allowed the identification of a similar element in C. jejuni subsp. doylei, where the element is split into two parts, but lacks the gene block containing the cas9 gene. There were also regions with sequence and CampyICE1 gene structure similarity in plasmid pCU110 and plasmid pCIG1485E, although both lack the cas9 gene (Fig. S2). Subsequent searches in the genomes of other species in the GenBank database allowed the identification of other plasmids and potential ICEs with similar layouts from diverse species, but none of those contained the cas9 gene (Fig. S2).

Distribution of CampyICE1 and other mobile elements and linkage to MLST-clonal complexes

To assess whether the distribution of CampyICE1 and other MGEs was linked to specific MLST-types or isolation source, we screened a collection of 5829 C. jejuni and 1347 C. coli genomes [36] using BLAST+ for the presence of CampyICE1, CJIE1, CJIE2, CJIE3 and CJIE4, the plasmids pVir, pTet and pCC42, and the CJIE3-associated T6SS (Table 1). The CJIE1 element was the most common in C. jejuni, while CJIE4 was the least common of the MGEs from RM1221, although still more common than CampyICE1. In C. coli, the CJIE1, CJIE2 and CJIE3 elements were present in similar fractions, and again were much more common than CJIE4 and CampyICE1 (Table 1). There was clear variation within the CJIE1–CJIE4 genetic elements, mostly in length but also in gene content (Fig. S3), with the CJIE3 element differing due to the presence or absence of the T6SS. With regard to the three plasmids, pVir was rare in both C. jejuni and C. coli, while pTet was present in approximately a quarter of the C. jejuni and C. coli genomes. The pCC42 plasmid was relatively rare in C. jejuni, but was the most common plasmid in C. coli (Table 1). The plasmids showed more conservation of gene structure and content (Fig. S4), although there were combinations of plasmids and mobile elements that lead to megaplasmids with phage elements or the T6SS [48], which were not separately included in this analysis.

Table 1.

Prevalence of chromosomal and extrachromosomal mobile elements in 5829 C. jejuni and 1347 C. coli genome assemblies

Mobile element	C. jejuni (N=5829)	C. coli (N=1347)
Chromosomal elements
CampyICE1	134 (2.3 %)	92 (6.8 %)
CJIE1	2136 (36.6 %)	254 (18.9 %)
CJIE2	1291 (22.1 %)	225 (16.7 %)
CJIE3 with T6SS*	1137 (19.5 %)	203 (15.1 %)
CJIE3 without T6SS†	537 (9.2 %)	2 (0.1 %)
CJIE4	798 (13.7 %)	79 (5.9 %)
Plasmids
pCC42	253 (4.3 %)	383 (28.4 %)
pTet	1177 (20.2 %)	337 (25.0 %)
pVir	84 (1.4 %)	15 (1.1 %)

*Combined presence of the CJIE3 element and the Type VI secretion system.

†Presence of the CJIE3 element but absence of the Type VI secretion system.

The genomes were clustered in a phylogenetic tree based on a 678 gene core genome (cg)MLST scheme [44], which grouped the genomes mostly according to clonal complexes of the seven-gene MLST for C. jejuni (Fig. 3) and the different C. coli clades (Fig. 4). With the exception of CJIE3 and the associated T6SS in C. jejuni, there was no clear association with specific MLST clonal complexes in either C. jejuni or C. coli. In C. jejuni, CJIE3 without the T6SS was restricted to clonal complexes ST-354 and ST-257, while the CJIE3 with T6SS was mostly found in clonal complexes ST-464, ST-353, ST-573 and ST-403 (Fig. 3). There was no obvious link between isolation source and any of the MGEs, although it should be noted that the dataset used is biased towards human isolates. Similar to the mobile elements, the pVir, pTet and pCC42 plasmids did not show an association with MLST clonal complex in C. jejuni, with clade in C. coli, or with isolation source (Figs 3 and 4). The specific distribution per genome is provided in Table S1.

Fig. 3.

Distribution of mobile elements and plasmids in 5829 C. jejuni genome sequences. The phylogenetic tree was based on core genome MLST. Isolation source category and seven-gene MLST information are included for comparative purposes. The scale bar represents the distance expressed as the number of different cgMLST alleles.

Fig. 4.

Distribution of mobile elements and plasmids in 1347 C. coli genome sequences. The phylogenetic tree was based on core genome MLST. Isolation source category and seven-gene MLST information are included for comparative purposes. The scale bar represents the distance expressed as the number of different cgMLST alleles.

The majority of CampyICE1 CRISPR spacers are predicted to target plasmids

CRISPR arrays consist of the CRISPR repeats and the individual spacers, which are used to generate the crRNAs used for interference, and the tracrRNA [18]. The layout of the CampyICE1 CRISPR arrays is distinct from most other Type II CRISPR-Cas systems, where the CRISPR array and tracrRNA are often found directly next to the Cas genes. In contrast, the CampyICE1 system does not contain the ubiquitous cas1 and cas2 genes, and has a total of three CRISPR arrays spaced over the element (Fig. 2). We were able to identify spacers from 81 C. coli and 133 C. jejuni CampyICE1 elements. The first array contained 3.0±1.5 spacers (N=197, range 1–6), and also contained a putative tracrRNA in the opposite transcriptional orientation (Fig. 5a), while the second CRISPR array contained 3.1±1.7 spacers (N=208, range 1–10) and lacked a potential tracrRNA. The third CRISPR array is shorter and contained 1.0±0.6 spacers (N=182, range 1–3). The tracrRNA and repeat sequence are distinct from the previously described C. jejuni and C. coli CRISPR systems [34], with the changes in the repeat sequence being mirrored in the tracrRNA sequence, and thus unlikely to affect functionality (Fig. 5a, b). The predicted Protospacer Adjacent Motif (PAM) was 5′-A(C/T)A(C/T) (Fig. 5a), which matches well with the 5′-ACAc PAM-motif described for the C. jejuni Cas9 protein [34, 49].
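The degenerate PAM 5′-A(C/T)A(C/T) can be written as a simple pattern and checked against the sequence flanking the 3′ end of a candidate protospacer. The sketch below is illustrative only: the function name and example flanks are hypothetical, and it assumes the motif begins at the first flanking nucleotide, which the text does not state explicitly.

```python
import re

# 5'-A(C/T)A(C/T): A, then C or T, then A, then C or T.
PAM_PATTERN = re.compile(r"A[CT]A[CT]")


def has_pam(flank_3prime):
    """Return True if the 3' flanking sequence starts with the predicted PAM.

    Assumption: the PAM starts at the first nucleotide after the protospacer.
    """
    return PAM_PATTERN.match(flank_3prime.upper()) is not None


# Hypothetical 8 nt flanking sequences, not taken from Table S2.
for flank in ["ACACGGTT", "ATATAAAA", "GGACACTT"]:
    print(flank, has_pam(flank))
```

The four sequences covered by this pattern are ACAC, ACAT, ATAC and ATAT, so the previously described 5′-ACAc motif is one of the allowed variants.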

Fig. 5.

Characteristics of the CampyICE1 CRISPR spacers, protospacers and tracrRNA, and predicted plasmid targeting by the CampyICE1 CRISPR-Cas9 system. (a) A section of the CRISPR array is shown (centre) with the corresponding protospacer (top) with 8 nt flanking sequences which contain the PAM motif at the 3′ end of the protospacer, represented using a sequence logo. The tracrRNA sequence and structure are included below. (b) Comparison of the CRISPR-repeats and predicted tracrRNA part of CampyICE1, C. jejuni and the three C. coli clades. The tracrRNA and CRISPR-repeat show matching changes as indicated by red underlined residues. Asterisks indicate conserved nucleotides, boxes indicate the complementary sequences in CRISPR repeat and tracrRNA. (c) Example of a CampyICE1 CRISPR spacer perfectly matching a segment of the C. jejuni 81-176 pVir plasmid. (d) Schematic representation of the pCC42, pTet and pVir family of plasmids (based on the 15-537360 pCC42 plasmid and the 81-176 pTet and pVir plasmids), with the locations of plasmid-targeting CampyICE1 spacers indicated. For pTet, the approximate location of target gene YSU_08860 (absent from the 81-176 plasmid) is indicated by the dashed, green arrowhead. More information on specific spacers and their targets is provided in Table S2.

Comparison of the spacers from 214 CampyICE1 elements showed that these consisted of 108 unique spacer sequences, and an additional 40 spacers that were subdivided into 16 variant families, where two to six spacers had one or two nucleotide differences to each other and were predicted to match the same targets (Table S2). The spacers were used to search phage and plasmid databases for putative targets, and a total of 62 unique spacers and eight variant families were predicted to target the plasmids pCC42 (31 unique spacers, two variants), pTet (16 unique spacers, six variants) and pVir (15 unique spacers, see Fig. 5c for an example).
Furthermore, there were spacers predicted to target the plasmid pHELV-1 (one unique spacer) and pSCJK2-1 from strain SCJK2 (six unique spacers, two variants). The pHELV-1 and pSCJK2-1 plasmids were not detected in the 5829 C. jejuni and 1347 C. coli genomes used in this study. The predicted targets on the plasmids pCC42, pTet and pVir were plotted against the plasmid maps (Fig. 5d), and showed that targets for pCC42 and pVir were found in multiple genes on these two plasmids, whereas the targets on pTet were limited to two genes, of which YSU_08860 is not universally present on plasmids of the pTet family (Fig. 5d).
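The split into unique spacers and variant families (spacers differing by one or two nucleotides) can be illustrated with a small grouping routine. This is a sketch under assumptions, not the method used in the study: it compares only spacers of equal length, uses the Hamming distance, and merges spacers by single linkage, so a family can contain two members that differ by more than two nucleotides if a third member bridges them. All names and the toy sequences are hypothetical.

```python
from itertools import combinations


def hamming(a, b):
    """Number of mismatches between two equal-length sequences, else None."""
    if len(a) != len(b):
        return None
    return sum(x != y for x, y in zip(a, b))


def variant_families(spacers, max_diff=2):
    """Group spacers into (unique, families) by single-linkage clustering."""
    parent = {s: s for s in spacers}  # also removes exact duplicates

    def find(s):
        # Follow parent links to the family representative.
        while parent[s] != s:
            parent[s] = parent[parent[s]]
            s = parent[s]
        return s

    for a, b in combinations(list(parent), 2):
        d = hamming(a, b)
        if d is not None and d <= max_diff:
            parent[find(a)] = find(b)

    groups = {}
    for s in parent:
        groups.setdefault(find(s), []).append(s)
    unique = sorted(g[0] for g in groups.values() if len(g) == 1)
    families = sorted(sorted(g) for g in groups.values() if len(g) > 1)
    return unique, families


# Hypothetical 10 nt toy spacers; real spacers are longer.
toy = ["AAAAAAAAAA", "AAAAAAAAAT", "AAAAAAAATT", "CCCCCCCCCC"]
unique, families = variant_families(toy)
print(unique, families)
```

Whether single linkage or a stricter all-against-all rule is more appropriate depends on how the families in Table S2 were defined, which the text does not specify.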

Plasmid-mapping CampyICE1 CRISPR spacers are associated with an absence of the corresponding plasmids

To assess whether the CampyICE1 CRISPR-Cas9 system can function to exclude plasmids by using plasmid-mapping spacers, the 226 C. jejuni and C. coli CampyICE1-positive genome assemblies were searched for the presence of plasmid contigs and matches with spacer sequences (Table S3 and Fig. S1). As one possible escape from CRISPR-Cas9 surveillance could be sequence mutations/changes in the plasmids, we also checked whether the predicted plasmid-matching spacer would recognize any sequence in the genome assemblies (which include plasmid contigs). Of the C. coli assemblies, spacers were detected in 81/92 assemblies, and 56 had no plasmid/spacer matches. Of the 25 assemblies where there were plasmid/spacer matches, three had an inactivated CampyICE1 cas9 gene, and 11 did not have sequences matching the spacer(s) or had only partial matches in their genome assembly, suggesting that mutations in the plasmid sequence have made the spacer unusable. This left 11 assemblies with a functional cas9 gene and a spacer matching the pCC42 plasmid. Similarly, for C. jejuni, spacers were detected in 133/134 genomes, and 109 had no plasmid/spacer matches. Of the 24 assemblies where there were plasmid/spacer matches, two had an inactivated CampyICE1 cas9 gene with frameshifts and stop codons, and seven did not have sequences matching the spacer(s) or had only partial matches in their genome assembly. This left 15 assemblies with a functional cas9 gene and a spacer matching the pCC42 (seven) or pTet (eight) plasmid. The matching of spacers, CampyICE1 Cas9 status and plasmid presence/absence is given in Fig. 6, with more detailed data in Tables S3 and S4.
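The counts in this paragraph can be followed more easily as a short tally. The sketch below only restates the numbers given in the text and checks that the three outcome categories add up to the number of assemblies with plasmid/spacer matches; the dictionary keys and function name are labels chosen for illustration.

```python
# Counts as stated in the text for CampyICE1-positive assemblies.
counts = {
    "C. coli": {
        "with_spacers": 81,
        "no_plasmid_spacer_match": 56,
        "inactivated_cas9": 3,
        "target_absent_or_partial": 11,
        "functional_cas9_with_match": 11,
    },
    "C. jejuni": {
        "with_spacers": 133,
        "no_plasmid_spacer_match": 109,
        "inactivated_cas9": 2,
        "target_absent_or_partial": 7,
        "functional_cas9_with_match": 15,
    },
}


def breakdown(c):
    """Return (assemblies with matches, sum of the three outcome categories)."""
    matched = c["with_spacers"] - c["no_plasmid_spacer_match"]
    explained = (c["inactivated_cas9"]
                 + c["target_absent_or_partial"]
                 + c["functional_cas9_with_match"])
    return matched, explained


for species, c in counts.items():
    matched, explained = breakdown(c)
    print(f"{species}: {matched} with matches, {explained} accounted for")
```

For both species the categories sum to the number of assemblies with matches (25 and 24), so 26 of the 214 assemblies with spacers retain a functional cas9 gene together with a spacer that matches a plasmid sequence in the same assembly.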

Fig. 6.

Low prevalence of pVir, pTet and pCC42 plasmids in CampyICE1-positive C. jejuni and C. coli is associated with CRISPR-spacers targeting these plasmids. The C. jejuni and C. coli isolates have been combined in this graph; specific data per isolate and spacer are available in Table S3, while data for C. jejuni and C. coli separately are provided in Table S4.

Discussion

In the last 25 years, CRISPR-Cas has gone from a relatively obscure repeat system in bacteria to a Nobel Prize winning phenomenon [50]. CRISPR-Cas systems are widespread in prokaryotic organisms, and while early reports predicted them to be a bacterial version of the adaptive immune system against phages, it is now clear that they target a wide variety of MGEs, and can also have a diverse set of alternative functions. Recent studies show that CRISPR-Cas systems are not just located on genomes, but can also be found on MGEs. Type IV and Type I CRISPR-Cas systems have been reported on enterobacterial plasmids [51, 52], and have been predicted to function in competition between plasmids [53]. Some bacterial species also contain a variety of CRISPR-Cas systems associated with putative MGEs and genomic islands [54, 55], although data on their potential role in MGE competition are still lacking. To our knowledge, our study is the first to feature an incomplete Type II-C CRISPR-Cas9 system that is associated with an MGE, and where the majority of spacers matched competing plasmids. We have shown that CampyICE1 is highly conserved in both C. jejuni and C. coli, that it has up to three short spacer arrays on the ICE, and that the presence of a functional CampyICE1 CRISPR-Cas system and anti-plasmid spacers is associated with the absence of the three targeted plasmid types in C. jejuni and C. coli. The Type II-C Cas9 protein encoded on CampyICE1 is closely related to the Cas9 proteins found in other Campylobacter and Helicobacter species, but clusters separately, suggesting it may have been co-opted from a genomic location in an ancestral species. Interestingly, CampyICE1 lacks the cas1 and cas2 genes [56], a feature which has also been noted for the hypercompact Cas12j (CasΦ) system found on certain bacteriophages [57]. The Cas12j system resembles the CampyICE1 Cas9 system described here, as both have a limited CRISPR spacer repertoire [58].
The lack of Cas1 and Cas2 components could mean that the CampyICE1 system is incapable of acquiring new spacers, which is supported by the relative lack of spacer diversity in the 214 genomes containing CampyICE1. However, we cannot exclude that the CampyICE1 Cas9 may be able to co-opt the Cas1 and Cas2 proteins from the chromosomal version of the CRISPR-Cas system in C. jejuni and C. coli, although this is speculative. We have previously shown that ~98 % of all C. jejuni genomes have a CRISPR-Cas system, while this is more limited in C. coli, where only ~10 % of genomes have a CRISPR-Cas system [34]. Since the diversity in CRISPR spacers is also low in the chromosomal version of CRISPR-Cas of C. jejuni and C. coli and most spacers cannot (yet) be linked to mobile elements or phages [34, 59–61], the chromosomal copy of Cas9 may perform additional or alternative functions in C. jejuni, such as control or activity in virulence [62-66]. However, this is not the case for the CampyICE1 CRISPR-Cas9 system, as a majority of spacers can be linked to the three main families of plasmids in C. jejuni and C. coli: pTet, pVir and pCC42. In our collection of genomes, 41.7 % of C. coli and 24.3 % of C. jejuni genomes are predicted to contain one or more of these three plasmids, in different combinations. The three plasmids do not show signs of incompatibility, as 93 C. jejuni and 166 C. coli genomes had a combination of two plasmids or all three plasmids together. The role of these plasmids in C. jejuni and C. coli is still unclear, but they can carry virulence factors and contribute to the dissemination of antibiotic resistance. However, plasmids are not absolutely required for this, and plasmid-free isolates are also common. This is similar for the CJIE-elements, where different combinations of the CJIE-elements and CampyICE1 were detected.
The different roles of the CJIE-elements in C. jejuni and C. coli are still not clear, although the T6SS from CJIE3 has been linked with virulence [32, 33, 67, 68], and the DNases of the CJIE1, CJIE2 and CJIE4 elements are associated with reduced biofilm formation and reduced natural transformation [29, 30, 69]. The CRISPR-Cas9 system of the CampyICE1 element has some unique properties, as there are up to three short CRISPR arrays on the mobile element, with the essential tracrRNA not located with the cas9 gene but located in another CRISPR spacer array on CampyICE1. Although the arrays detected were small, there were still 108 unique spacers and 16 spacer families, with a spacer family defined as spacers differing by one or two nucleotides only. The majority of CampyICE1 CRISPR spacers and variants were predicted to target plasmids (69 spacers and 10 variants, 63.7 %), with most spacers predicted to target pCC42, pTet and pVir, the three major plasmids in C. jejuni and C. coli. This is a very high proportion compared to many other CRISPR-Cas studies. For example, a study on type IV CRISPR-Cas systems could only match 12 % of spacers with targets, and this was reduced to only 7 % for the non-type IV CRISPR-Cas systems [53]. In our previous study [34] we were also unable to match most spacers with putative targets, which is common. The presence of CampyICE1, functional CRISPR-Cas9 and anti-plasmid spacers was associated with the absence of the competing plasmids targeted, suggesting that CampyICE1 has used its CRISPR-Cas9 system for ‘plasmid warfare’ as a form of incompatibility. The match is not perfect, as there are several examples of a complete CampyICE1 CRISPR-Cas9 system with plasmid-targeting spacers where the plasmids to which the spacers mapped were present, with 100 % sequence identity between spacer and predicted plasmid contigs (Tables S3 and S4, Fig. 6).
This could potentially mean that the CRISPR-Cas system can prevent acquisition of new plasmids, but for unknown reasons is unable to remove plasmids already present, although this is highly speculative. It also suggests that the CampyICE1 plasmid restriction can be avoided by mutation of the target site, which disrupts the sequence match and makes the system less functional, especially in a bacterium known for its high levels of genetic variation. We also speculate that DNA modification and transcriptional variation/regulation may play a role in spacer-target discrepancies. In summary, we have identified a new putative mobile element in C. jejuni and C. coli that contains a degenerated CRISPR-Cas9 system, which the element is predicted to employ to compete with other families of plasmids. We also show that mobile elements and plasmids are semi-randomly distributed within a large set of C. jejuni and C. coli genomes, and display significant levels of genetic variation within the elements. This fits well with the previously described genetic variability of the genus Campylobacter, and adds to the complexity of mobile elements present within these successful foodborne human pathogens.
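As an illustration only, the spacer-family definition used in this Discussion (spacers differing by one or two nucleotides) can be sketched in a few lines of Python. This is not the pipeline used in the study. The function names, the single-linkage grouping rule, the restriction to equal-length spacers and the toy sequences are all assumptions made for the example. The final line checks that the reported 63.7 % is consistent with (69 + 10) plasmid-targeting spacers and variants out of (108 + 16) spacers and families.

```python
from itertools import combinations


def hamming(a: str, b: str) -> int:
    """Count mismatched positions between two equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    return sum(x != y for x, y in zip(a, b))


def group_spacer_families(spacers, max_mismatches=2):
    """Group spacers so that any two within max_mismatches share a group.

    Assumption: single-linkage grouping via union-find; spacers of
    different lengths are never merged.
    """
    parent = {s: s for s in spacers}  # also removes duplicate spacers

    def find(s):
        # Follow parent links to the group representative (path halving).
        while parent[s] != s:
            parent[s] = parent[parent[s]]
            s = parent[s]
        return s

    for a, b in combinations(parent, 2):
        if len(a) == len(b) and hamming(a, b) <= max_mismatches:
            parent[find(a)] = find(b)

    groups = {}
    for s in parent:
        groups.setdefault(find(s), []).append(s)
    return sorted(sorted(g) for g in groups.values())


# Hypothetical toy sequences, not real CampyICE1 spacers.
toy_spacers = ["ACGTACGTAC", "ACGTACGTAA", "ACGTACGAAA", "TTTTGGGGCC"]
families = group_spacer_families(toy_spacers)

# Share of plasmid-targeting spacers and variants, from the counts in the text.
plasmid_share = 100 * (69 + 10) / (108 + 16)
```

With the toy input, the first three sequences fall into one group and the fourth stays on its own. A stricter rule, such as requiring every member to be within two mismatches of a single reference spacer, would be an equally plausible reading of the definition.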

1. GrapeTree: visualization of core genomic relationships among 100,000 bacterial pathogens.

Authors: Zhemin Zhou; Nabil-Fareed Alikhan; Martin J Sergeant; Nina Luhmann; Cátia Vaz; Alexandre P Francisco; João André Carriço; Mark Achtman
Journal: Genome Res Date: 2018-07-26 Impact factor: 9.043

2. CRISPR RNA-Dependent Binding and Cleavage of Endogenous RNAs by the Campylobacter jejuni Cas9.

Authors: Gaurav Dugar; Ryan T Leenay; Sara K Eisenbart; Thorsten Bischler; Belinda U Aul; Chase L Beisel; Cynthia M Sharma
Journal: Mol Cell Date: 2018-03-01 Impact factor: 17.970

3. Nucleases encoded by the integrated elements CJIE2 and CJIE4 inhibit natural transformation of Campylobacter jejuni.

Authors: Esther J Gaasbeek; Jaap A Wagenaar; Magalie R Guilhabert; Jos P M van Putten; Craig T Parker; Fimme J van der Wal
Journal: J Bacteriol Date: 2009-12-18 Impact factor: 3.490

4. Convergence of Campylobacter species: implications for bacterial evolution.

Authors: Samuel K Sheppard; Noel D McCarthy; Daniel Falush; Martin C J Maiden
Journal: Science Date: 2008-04-11 Impact factor: 47.728

5. Campylobacter jejuni acquire new host-derived CRISPR spacers when in association with bacteriophages harboring a CRISPR-like Cas4 protein.

Authors: Steven P T Hooton; Ian F Connerton
Journal: Front Microbiol Date: 2015-01-05 Impact factor: 5.640

6. A genomic island in Vibrio cholerae with VPI-1 site-specific recombination characteristics contains CRISPR-Cas and type VI secretion modules.

Authors: Maurizio Labbate; Fabini D Orata; Nicola K Petty; Nathasha D Jayatilleke; William L King; Paul C Kirchberger; Chris Allen; Gulay Mann; Ankur Mutreja; Nicholas R Thomson; Yan Boucher; Ian G Charles
Journal: Sci Rep Date: 2016-11-15 Impact factor: 4.379
