| Literature DB >> 35185818 |
Yuqiao Zhou1, Wei Zhou2, Jinzhi Zhou2, Jinchang Yan3, Dingting Xu4, Xiner Zheng1, Shuaizhou Zong1, Ping Jiang5, Shiyi Tian1, Jianzhong Han1, Daofeng Qu1.
Abstract
Microorganisms have developed many strategies in the process of long-term defense against external attacks, one of which is the clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated proteins (Cas) bacterial immunological system. In this study, the whole genome of 300 strains of Klebsiella was collected, the CRISPR-Cas system in the strains was statistically analyzed, and the types and structures of CRISPR system in Klebsiella were explored, as well as the correlation between CRISPR and mobile genetic elements (MGEs). Through principal component analysis (PCA), we found that Cas gene, plasmids, integron, IS1, IS609, and enzymes of DNA metabolism were closely related to CRISPR-Cas. Compared the structural characteristics of plasmids, the DinG family helicases, Cas6, Csf2, and IS5 were observed near the CRISPR loci in plasmid, which is also confirmed by the results of PCA that they may be important factors affecting the plasmid with CRISPR.Entities:
Keywords: CRISPR-Cas; Klebsiella; PCA; plasmid; type IV
Year: 2022 PMID: 35185818 PMCID: PMC8847753 DOI: 10.3389/fmicb.2021.790673
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
The information of the confirmed CRISPR1∼CRISPR7.
| CRISPR | Type | Repeat sequence (5′–3′) | No. of strains | No. of repeats | Frequency (%) |
| Typical repeat | GAAACACCCCCACGTGCGTGGGGAAGAC | 10 | 127 | 57.21 | |
| Repeat variants | AGAAACACCCCCACGTGCGTGGGGAAGAC | 12 | 5.41 | ||
| AGAAACACCCCCACGCGTGTGGGGAAGAC | 20 | 9.01 | |||
| CRISPR1 | AGAAACACCCCCACGCATGTGGGGAAGAC | 4 | 1.80 | ||
| AGAAACACCCCCACGCGTGTGGGGAAGA | 10 | 4.50 | |||
| Terminal repeat | GAAACACCCCCACGCGTGTGGGGAAGAC | 25 | 11.26 | ||
| GAAACACCCCCACGCATGTGGGGAAGAC | 24 | 10.81 | |||
| Typical repeat | GTCTTCCCCACGCACGTGGGGGTGTTTC | 38 | 405 | 44.90 | |
| Repeat variants | GTCTTCCCCACACGCGTGGGGGTGTTTCT | 166 | 18.40 | ||
| GTCTTCCCCACGCACGTGGGGGTGTTTCT | 149 | 16.52 | |||
| GTCTTCCCCACACGCGTGGGGTGTTTCT | 7 | 0.78 | |||
| CRISPR2 | TCCCCACACGCGTGGGGGTGTTTCT | 10 | 1.11 | ||
| Terminal repeat | GTCTTCCCCACACACGTGGGGGTGTTTC | 12 | 1.33 | ||
| GTCTTCCCCACATGCGTGGGGGTGTTTC | 78 | 8.65 | |||
| GTCTTCCCCACACGCGTGGGGGTGTTTC | 57 | 6.32 | |||
| GTCTTCCCCACGTGCGTGGGGGTGTTTC | 18 | 2.00 | |||
| Typical repeat | GTGTTCCCCGCGCCAGCGGGGATAAACCG | 17 | 612 | 97.45 | |
| CRISPR3 | Repeat variants | GTGTTCCCCGCGCTAGCGGGGATAAACCG | 6 | 0.96 | |
| Terminal repeat | GTGTTCCCCGCGCCAGCGGGGATAAACTGG | 10 | 1.59 | ||
| Typical repeat | GTATTCCCCCCGCATGCGGGGGTTATCGG | 13 | 79 | 36.74 | |
| Repeat variants | AGTATTCCCCCCGTGTGCGGGGGTTATCGG | 5 | 2.33 | ||
| Terminal repeat | CCCCCCGCATGCGGGGGTTATCGG | 19 | 8.84 | ||
| CRISPR4 | GTATGTCCCCCGCTGGCGGGGGTTATCGG | 12 | 5.58 | ||
| GTATTCCCCCGCTTGCGGGGGTTATCGG | 29 | 13.49 | |||
| GTATTCCCCCCGTGTGCGGGGGTTATCGG | 71 | 33.02 | |||
| Typical repeat | CCGATAACCCCCGCATGCGGGGGGAATAC | 11 | 70 | 45.45 | |
| Repeat variants | CGATAACCCCCGCATGCGGGGGGAATAC | 10 | 6.49 | ||
| Terminal repeat | CCGATAACCCCCGCACACGGGGG | 43 | 27.92 | ||
| CRISPR5 | CCGATAACCCCCGCATGCGGGGGGAATACT | 9 | 5.84 | ||
| CCGATAACCCCCGCATGCGGGGG | 11 | 7.14 | |||
| CCGATAACCCCCGCAAGCGGGGGGAATAC | 11 | 7.14 | |||
| Typical repeat | CGGTTTATCCCCGCTGGCGCGGGGAACAC | 12 | 354 | 93.65 | |
| CRISPR6 | Repeat variants | GGTTTATCCCCGCTGGCGCGGGGAACAC | 7 | 1.85 | |
| Terminal repeat | CGGTTTATCCCCGCTCACGCGGGGAACAC | 7 | 1.85 | ||
| CGGTTTATCCCCGCTAGCGCGGGGAACAC | 10 | 2.65 | |||
| Typical repeat | GTTCACTGCCGTACAGGCAGCTTAGAAA | 7 | 44 | 55.00 | |
| Repeat variants | TAAGCTGCCTGTACGGCAGTGAA | 4 | 5.00 | ||
| Terminal repeat | TTTCTAAGCTGCCTGTACGGCAGTGAACA | 4 | 5.00 | ||
| ATTCGGCTTGAGAGCCGTTTCCA | 4 | 5.00 | |||
| GGGAATAAGTCACTGAAAGTAA | 4 | 5.00 | |||
| CRISPR7 | GGAAACGGCACAAAAGCCGAATCGAAGCAAGTTACTGAAAATAA | 4 | 5.00 | ||
| GGAATCGGCGCAAAAGCCGAATCGAAGCAAGTTACTGAAAATAA | 4 | 5.00 | |||
| GCATTCGTCCCAAGAGCCGAA | 4 | 5.00 | |||
| GGGAATAAGGCATTGAAAGTTA | 4 | 5.00 | |||
| GCATTCGTCCCAAGAGCCGAT | 4 | 5.00 |
FIGURE 1(A) The weblogo of repeats of CRISPR1∼CRISPR7. The sequence were first and terminal repeats of CRISPR1∼CRISPR7. (B) The secondary structure of repeats of CRISPR1∼CRISPR7. Secondary structure prediction of the most frequent sequence of the first and terminal repeats of each CRISPR was performed by RNAfold. The free energy of the thermodynamic ensemble was −11.60, −11.70, −15.20 −11.80, −12.70, −14.20, −8.60 kcal/mol.
FIGURE 2Six Groups of CRISPR spacer size variability. The relationship between the size of repeat and spacer among six groups: (A) Group 1 spacers; (B) Group 2 spacers; (C) Group 3 spacers; (D) Group 4 spacers; (E) Group 5 spacers; (F) Group 6 spacers; (G) Group 7 spacers; (H) The x-axis represents the size of the CRISPR spacers, the y-axis represents the number of the CRISPR spacer. The size of repeat and spacer were inversely correlated.
FIGURE 3The evolutionary tree of Cas1 of all strains. The Cas1 has 57 strains, respectively. The Cas1 genes sequence were obtained by searching for the complete genome sequences in Genbank. Strains in one branch indicate most evolutionary similarities, the branch represented that these sequences could be divides into groups by certain values and the percentage of each branch showed the sequence similarity, and the evolutionary distance scale of Cas1 in 0.10.
FIGURE 4Principal component analysis of CRISPR of Klebsiella in relation to mobile genetic elements, regulators and enzymes of DNA metabolism. (A) Correlation vector diagram of each factor of principal component axis (blue line) and Klebsiella strains (red dot). A vector represents the correlation of each factor with the axis of the first principal component. (B) An enlarged view of the point representing Klebsiella.
FIGURE 5(A) Schematic diagram of the Plasmid p15WZ-82_Vir. (B) Schematic diagram of the Plasmid pKpvST101_5. According to the function of the gene classification, use arrows to represent the gene, and color. (C) Local structure of plasmid CRISPR.
FIGURE 6Principal component analysis of the CRISPR plasmid of Klebsiella with mobile genetic elements, regulatory factors, and enzymes of DNA metabolism. (A) Correlation vector diagram of principal component axis (blue line) and each factor of Klebsiella strain (red dot). The vector represents the correlation of each factor to the axis of the first principal component. Panel (B) represents an enlarged view of Klebsiella’s point.