| Literature DB >> 29033911 |
Claudio Hidalgo-Cantabrana1,2, Alexandra B Crawley1, Borja Sanchez2, Rodolphe Barrangou1.
Abstract
Diverse CRISPR-Cas systems provide adaptive immunity in many bacteria and most archaea, via a DNA-encoded, RNA-mediated, nucleic-acid targeting mechanism. Over time, CRISPR loci expand via iterative uptake of invasive DNA sequences into the CRISPR array during the adaptation process. These genetic vaccination cards thus provide insights into the exposure of strains to phages and plasmids in space and time, revealing the historical predatory exposure of a strain. These genetic loci thus constitute a unique basis for genotyping of strains, with potential of resolution at the strain-level. Here, we investigate the occurrence and diversity of CRISPR-Cas systems in the genomes of various Bifidobacterium longum strains across three sub-species. Specifically, we analyzed the genomic content of 66 genomes belonging to B. longum subsp. longum, B. longum subsp. infantis and B. longum subsp. suis, and identified 25 strains that carry 29 total CRISPR-Cas systems. We identify various Type I and Type II CRISPR-Cas systems that are widespread in this species, notably I-C, I-E, and II-C. Noteworthy, Type I-C systems showed extended CRISPR arrays, with extensive spacer diversity. We show how these hypervariable loci can be used to gain insights into strain origin, evolution and phylogeny, and can provide discriminatory sequences to distinguish even clonal isolates. By investigating CRISPR spacer sequences, we reveal their origin and implicate phages and prophages as drivers of CRISPR immunity expansion in this species, with redundant targeting of select prophages. Analysis of CRISPR spacer origin also revealed novel PAM sequences. Our results suggest that CRISPR-Cas immune systems are instrumental in mounting diversified viral resistance in B. longum, and show that these sequences are useful for typing across three subspecies.Entities:
Keywords: Bifidobacterium longum; CRISPR-Cas systems; genotyping; probiotics
Year: 2017 PMID: 29033911 PMCID: PMC5626976 DOI: 10.3389/fmicb.2017.01851
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
CRISPR-cas systems in Bifidobacterium longum strains.
| 7 | I-C | 32 | 159 | Y | Y | |||
| 9 | I-C | 32 | 159 | Y | Y | |||
| 379 | I-C | 32 | 1 | Y | Y | |||
| 35624 | I-C | 32 | 164 | Y | Y | |||
| 105-A | II-C | 36 | 34 | Y | Y | |||
| 1-5B | None | None | ||||||
| 1-6B | II-C | 36 | 7 | Y | Y | |||
| 2nd locus | I-E | 28 | 21 | Y | ||||
| 17-1B | I-U | 36 | 44 | Y | Y | |||
| 2-2B | II-C | 36 | 8 | Y | Y | |||
| 2nd locus | I-E | 26 | 11 | Y | ||||
| 35B | II-C | 36 | 12 | Y | Y | |||
| 44B | II-C | 36 | 18 | Y | Y | |||
| 2nd locus | I-E | 28 | 25 | Y | ||||
| 7-1B | II-C | 36 | 34 | Y | Y | |||
| 72B | None | None | ||||||
| AH1206 | None | None | ||||||
| ATCC55813 | None | None | ||||||
| BBMN68 | I-E | 29 | 10 | Y | Y | |||
| 29 | 13 | |||||||
| 29 | 7 | |||||||
| BG7 | II-C | 36 | 34 | Y | Y | |||
| BLO12 | II-C | 36 | 19 | Y | Y | |||
| BXY01 | None | None | ||||||
| CCUG30698 | None | None | ||||||
| CECT 7347 | II-C | 36 | 38 | Y | Y | |||
| CMCC P0001 | None | None | ||||||
| CMW7750 | None | None | ||||||
| D2957 | None | None | ||||||
| DJO10A | II-C | 36 | 43 | Y | Y | |||
| DSM 20219 | None | None | ||||||
| E18 | None | None | ||||||
| EK13 | None | None | ||||||
| EK5 | None | None | ||||||
| F8 | None | None | ||||||
| GT15 | None | None | ||||||
| JCM 1217 | None | None | ||||||
| JDM 301 | None | None | ||||||
| KACC 91563 | II-C | 36 | 33 | Y | Y | |||
| LMG 13197 | None | None | ||||||
| LO-06 | None | None | ||||||
| LO-10 | None | None | ||||||
| LO-21 | None | None | ||||||
| LO-C29 | None | None | ||||||
| LO-K29a | None | None | ||||||
| LO-K29b | None | None | ||||||
| MC-42 | I-E | 29 | 136 | Y | Y | |||
| NCC2705 | None | None | ||||||
| NCIMB8809 | None | None | ||||||
| VMKB44 | II-C | 36 | 52 | Y | Y | |||
| 157F | None | None | ||||||
| ATCC 15697 | None | None | ||||||
| BIB1401242951 | None | None | ||||||
| BIB1401272845a | None | None | ||||||
| BIB1401272845b | None | None | ||||||
| BIC1206122787 | None | None | ||||||
| BIC1307292462 | None | None | ||||||
| BIC1401111250 | None | None | ||||||
| BIC1401212621a | None | None | ||||||
| BIC1401212621b | None | None | ||||||
| BT1 | I-C | 33 | 61 | Y | Y | |||
| CCUG 52486 | None | None | ||||||
| CECT 7210 | None | None | ||||||
| EK3 | I-E | 29 | 69 | Y | Y | |||
| 2nd locus | I-C | 33 | 8 | Y | ||||
| IN-07 | I-C | 33 | 61 | Y | Y | |||
| IN-F29 | I-C | 33 | 76 | Y | Y | |||
| TPY12-1 | None | None | ||||||
| AGR2137 | I-E | 29 | 21 | Y | Y | |||
| 2nd locus | 32 | 4 | ||||||
| BSM11-5 | I-C | 32 | 62 | Y | Y | |||
| DSM 20211 | Undet | 32 | 9 | N | ||||
| LMG 21814 | Undet | 32 | 9 | N |
Figure 1CRISPR-Cas systems in Bifidobacterium longum. Number of CRISPR-Cas systems detected in B. longum strains for each CRISPR-Cas type (A). Comparison between the occurrence and diversity of CRISPR-Cas systems in B. longum strains (outside ring) and Bifidobacterium (inside ring). Percentage was calculated based on the number of positive strains for each subtype divided by total strains analyzed in each study (B). Phylogenetic tree based on the amino acid sequence of Cas1 protein of B. longum and other bifidobacteria species, aligned with MUSCLE algorithm, and depicted with UPGMA using 500 bootstrap replicates. Bootstrap values are recorded on the nodes. The CRISPR-Cas subtypes are written on the right and groups are colored for each subtype (C).
Figure 2CRISPR phylogenetic analyses in B. longum. Phylogenetic tree based on the Cas1 protein of B. longum strains (A), Cas9 protein (B), Cas3 protein (C), and the CRISPR repeats sequence (D). Alignments were performed with MUSCLE algorithm and the tree was depicted with UPGMA using 500 bootstrap replicates. Bootstrap values are recorded on the nodes. The CRISPR-Cas subtypes are written on the right and groups are colored for each subtype.
Figure 3CRISPR loci in B. longum. The CRISPR locus of each strain was annotated and depicted with signature cas genes colored in red, cas3 for Type I (A) and cas9 for Type II (B), and the universal cas1 and cas2 colored in blue and green respectively. Accessory genes are colored in a gray scale regarding their functional category, CRISPR repeats are represented as black lines on the right side of each locus (spacers are not represented) and transposase are represented with checkboard pattern fill. Numbers below CRISPR-Cas systems represent their position in the genome (or contig) and the numbers on top of the repeat-spacer array represent the number of repeats. The CRISPR loci are represented according to their size, bar scale represents 6 Kb.
Figure 4Box and Whisker representation for the number of CRISPR repeats detected in the CRISPR loci of each CRISPR subtype (A). The number of CRISPR repeats in subtype I-C displayed subspecies-dependent difference between the strains of B. longum subsp. longum and B. longum subsp. infantis and B. longum subsp. suis (B). CRISPR GC content and size in B. longum. Box and Whisker representation of the GC content of the CRISPR loci of each CRISPR subtype, dotted line represent the overall GC content of whole genome in bifidobacteria (C).
Bifidobacterium longum strains harboring CRISPR-Cas immune systems.
| 7 | I-C | 49.03 | 60 | Commercial | Lewis et al., | |
| 9 | I-C | 49.03 | 60 | Commercial | Lewis et al., | |
| 379 | I-C | 50.5 | 60.20 | Human gut | Averina et al., | |
| 35624 | I-C | 49.03 | 60 | Human gut | Altmann et al., | |
| 105-A | II-C | 49.56 | 60.10 | Infant feces | Kanesaki et al., | |
| 1-6B | II-C | 49.6 | 59.6 | Infant feces | Shkoporov et al., | |
| I-E | 58.11 | |||||
| 17-1B | I-U | 50.1 | 60.20 | Infant feces | Chaplin et al., | |
| 2-2B | II-C | 49.5 | 59.70 | Infant feces | Shkoporov et al., | |
| I-E | 58.7 | |||||
| 35B | II-C | 49.5 | 60.10 | Infant feces | Shkoporov et al., | |
| 44B | II-C | 49.5 | 59.7 | Infant feces | Shkoporov et al., | |
| I-E | 58.03 | |||||
| 7-1B | II-C | 49.53 | 59.80 | Infant feces | Chaplin et al., | |
| BBMN68 | I-E | 55.1 | 59.90 | Human feces | Hao et al., | |
| BG7 | II-C | 49.56 | 60.01 | Infant feces | Kwon et al., | |
| BLO12 | II-C | 49.6 | 60.00 | Infant feces | Milani et al., | |
| CECT 7347 | II-C | 49.6 | 60 | Commercial | Chenoll et al., | |
| DJO10A | II-C | 49.55 | 60.11 | Adult feces | Lee et al., | |
| KACC 91563 | II-C | 49.7 | 59.81 | Neonates feces | Ham et al., | |
| MC-42 | I-E | 55.13 | 59.80 | Infant feces | Tupikin et al., | |
| VMKB44 | II-C | 49.6 | 60.30 | Infant feces | Chaplin et al., | |
| BT1 | I-C | 49.3 | 59.4 | Infant feces | Chung, | |
| EK3 | I-E | 54.6 | 59.4 | Infant feces | Chaplin et al., | |
| I-C | 55.7 | |||||
| IN-07 | I-C | 56.5 | 60.0 | Infant feces | Matsuki et al., | |
| IN-F29 | I-C | 56.6 | 59.90 | Infant feces | Matsuki et al., | |
| AGR2137 | I-E | 54.6 | 59.90 | Calf feces | Kelly et al., | |
| BSM11-5 | I-C | 55.8 | 59.90 | Infant feces | Bunesova et al., |
Figure 5CRISPR subtype II-C spacers comparison in B. longum. The CRISPR spacers of CRISPR subtype II-C were represented using an Excel Macro tool. The spacers are represented by a square and each unique spacer sequence is indicated as a unique color and a geometric figure. Squares containing an “X” represent deleted or missing spacers (A). The last spacer acquired is represented on the left side while the first spacer is on the right side. The spacers schematic representation showed a common origin (right side) for the strains and the evolution trend in four different clusters numbered from i to iv (B).
Figure 6CRISPR spacers array comparison in B. longum for CRISPR Type I. The CRISPR spacers of CRISPR subtypes I-C (A), subtype I-E (B), and subtype I-U (C) were represented using an Excel Macro tool. The spacers are represented by a square and each unique spacer sequence is indicated as a unique color and geometric figure. Squares containing and “X” represent deleted or missing spacers. The last spacer acquired is represented on the left side while the first spacer is on the right side. Numbers on top of the spacers array indicates the first and last spacer showing the size of the array. The long arrays were reduced for a better representation and are indicated with a double line break.
Figure 7CRISPR spacers targeting prophages in Bifidobacterium genomes. The heatmap represents B. longum CRISPR spacers that matched prophages in Bifidobacterium genomes (A) and B. longum genomes (B). The vertical axis represent the genomes that harbor prophages targeted by B. longum CRISPR spacers. The horizontal axis represent B. longum strains carrying CRISPR spacers that target prophages. The color scales represents the number of targeting events with blue squares representing the absent of matches and red squares representing the highest number of targeting.
Figure 8B. bifidum LMG11583 prophage targeted by B. longum spacers and PAM prediction for subtype I-C. The prophage integrated in B. bifidum LMG11583 chromosome is targeted by several CRISPR spacers, from B. longum subsp. longum and B. longum subsp. infantis, each unique spacer represented with a unique color, and the CRISPR-Cas subtype between brackets (A). The figure on the bottom left shows the protospacer sequence of the prophage matched by each spacer (color legend) and the upstream region containing the Protospacer Adjacent Motif (PAM) underlined, whereas bottom right displayed the consensus PAM represented with the frequency plot of WebLogo server (B).
Figure 9PAM prediction for B. longum CRISPR-Cas systems. PAM prediction for the CRISPR-Cas subtypes I-E (A), I-U (B), and II-C (C) present in B. longum. The left panel displayed the protospacers in bold, and the 5′-end or 3′-end (Type I and Type II respectively) containing the Protospacer Adjacent Motif (PAM) underlined. The right panel displayed the consensus PAM represented with the frequency plot of WebLogo server.