| Literature DB >> 35451949 |
Xiao Han1,2, Xingya Zhou1,2, Zhangming Pei1,2, Catherine Stanton3,4,5, R Paul Ross3,4, Jianxin Zhao1,2,6, Hao Zhang1,2,6,7, Bo Yang1,2,3, Wei Chen1,2,6.
Abstract
The clustered regularly interspaced short palindromic repeat (CRISPR)-CRISPR-associated protein (Cas) system is an important adaptive immune system for bacteria to resist foreign DNA infection, which has been widely used in genotyping and gene editing. To provide a theoretical basis for the application of the CRISPR-Cas system in Bifidobacterium breve, the occurrence and diversity of CRISPR-Cas systems were analysed in 150 B. breve strains. Specifically, 47 % (71/150) of B. breve genomes possessed the CRISPR-Cas system, and type I-C CRISPR-Cas system was the most widely distributed among those strains. The spacer sequences present in B. breve can be used as a genotyping marker. Additionally, the phage assembly-related proteins were important targets of the type I-C CRISPR-Cas system in B. breve, and the protospacer adjacent motif sequences were further characterized in B. breve type I-C system as 5'-TTC-3'. All these results might provide a molecular basis for the development of endogenous genome editing tools in B. breve.Entities:
Keywords: Bifidobacterium breve; CRISPR-Cas system; genotyping; prophage
Mesh:
Substances:
Year: 2022 PMID: 35451949 PMCID: PMC9453068 DOI: 10.1099/mgen.0.000812
Source DB: PubMed Journal: Microb Genom ISSN: 2057-5858
Fig. 1.CRISPR-Cas systems in . (a) Occurrence of CRISPR array. (b) Repeat length of different CRISPR-Cas subtypes. (c) The number of spacers in each CRISPR-Cas subtype. (d) Secondary structure of repeat sequences of different subtypes. Secondary structure prediction was performed by the RNAfold web server. Bases were coloured by base-pair probability.
Fig. 2.Visualization of representative CRISPR-Cas loci. Different cas genes were indicated by arrows with different colours. The repeat-spacer sequence was represented by black rectangles. Long irrelevant sequences were shortened by double disconnection. The CRISPR locus was drawn based on its size, and the bar scale indicates 2 Kb.
Fig. 3.Phylogenetic tree based on the amino acid sequence of Cas1 protein. Different subtypes were indicated by different colours. The neighbor-joining tree was described with 1000 bootstrap replicates. Bootstrap values were recorded on the branch and represented by the size of the circle.
Fig. 4.Visualization of spacers in . Each unique spacer sequence was represented by a square with a unique colour. The first spacer of strain acquisition was shown on the right, and the last spacer was shown on the left.
Fig. 5.Prophages in genomes targeted by spacers. The horizontal axis represented those strains carrying the spacers that targeted the prophages, and the vertical axis represented the strains carrying the prophages targeted by the spacers. The number of targeted events was indicated by different colours. The value was calculated by log2.
Fig. 6.Prophage protein targeted by spacers. The 13 proteins that were most frequently targeted by spacers were displayed. Protein targeting was analysed based on the NCBI nr database through BLASTp search.
Fig. 7.Visualizing DRBB28 prophage protein targeted by spacers and prediction of PAMs in . (a) DRBB28 prophage protein targeted by spacers. Two of the DRBB28 prophages were displayed and the different prophages were separated by double disconnection. Each square represented a targeted event and was coloured according to different targeted proteins. (b) PAMs prediction. Spacers with different text colours target proteins represented as the same colour in Fig. 7(a). The eight nucleotides (grey text) flanking the 5' of each protospacer were used to predict the PAM sequence (underlined). The corresponding results were displayed on the right after visualization using the WebLogo server.