| Literature DB >> 35655130 |
Omar Rawashdeh1, Rabeah Y Rawashdeh2, Temesgen Kebede3, David Kapp3, Anca Ralescu4.
Abstract
BACKGROUND: The existence of protospacer adjacent motifs (PAMs) sequences in bacteriophage genome is critical for the recognition and function of the clustered regularly interspaced short palindromic repeats-Cas (CRISPR-Cas) machinery system. We further elucidate the significance of PAMs and their function, particularly as a part of transcriptional regulatory regions in T4 bacteriophages.Entities:
Keywords: CRISPR-Cas9; Genomic distribution; PAMs; Promoters; T4 phage; Transcription
Mesh:
Substances:
Year: 2022 PMID: 35655130 PMCID: PMC9161530 DOI: 10.1186/s12863-022-01056-8
Source DB: PubMed Journal: BMC Genom Data ISSN: 2730-6844
Fig. 1A schematic illustration of the components and mechanism of CRISPR-Cas9. 1. CRISPR-Cas9 is a fusion between Cas9 nuclease and single guiding RNA (sg RNA). Spacer sequences in sg RNA are phage DNA sequences that were integrated into bacterial DNA from previous exposures. 2. A latter phage attack causes the injection of its DNA genome inside the bacterial cell. 3. CRISPR-Cas9 complex recognizes the target cleaving site in phage genome by the help of the protospacer-adjacent motif (PAM) sequence, at this step CRISPR-Cas9 binds the phage DNA sequences that are complementary to sg RNA which are located the upstream genomic sequence of PAM. 4. Cas9 cuts the double strands upstream of the PAM to induce DNA double-strand breaks. Different PAM sequences are used by several types of CRISPR systems and the PAM sequence NGG was used for illustration. Created with BioRender.com
PAM motifs according to CRISPR-Cas type but regardless of the contained phages
| CRISPR-Cas | From Organism | PAM Sequence | Reference |
|---|---|---|---|
II-A I-C IV | NGG NAG (A/T)GG | [ | |
| NGG | [ | ||
| II-A | NGRRT NGRRN NNGRRT TTGGGT | [ | |
| II- C | NNNNGATT | [ | |
| II-A | NNAGAAW NGGNG | [ | |
V-A II-A | TTTV TCTA | [ | |
V-A Cas12a (AsCas12a, AsCpf1) | GTTV GCTV TATA (eng. Cpf1) TTTV | [ | |
| I-E | C(T/A)T | [ | |
| I-E | 5’-ATG-3’ 5’-AAG-3’ Its complement 5′-CTT-3′ | [ | |
| I-E | AWG | [ | |
| I-A | TCN CCN | [ |
Fig. 2PAM’s motifs distribution in each of the three types of promoters: 40 early (A), 33 middle (B), and 50 late (C). Sixty bases were scanned for each promoter upstream of the transcription start site (which is given the coordinate 0). The 60 bases of a promoter were categorized into six segments represented by stacked bars of different colors. Y axis represents PAM’s count number
Fraction of targeted essential genes per PAM (number of essential genes where the PAM count was detected as an outlier). Only significant genes were used in the calculation of the fractions
| PAM | Fraction of targeted essential genes |
|---|---|
| ATG | 0.125 |
| AWG | 0.2 |
| CCN | 0 |
| CTT | 0 |
| GCTV | 0.4 |
| GTTV | 0.142857143 |
| NAG | 0 |
| NGG | 0 |
| NGGNG | 0.1 |
| NGGRRT | 0 |
| NGRRN | 0 |
| NGRRT | 0 |
| NNAGAAW | 0 |
| NNGRRT | 0 |
| NNNNGATT | 0 |
| NNNNRYAC | 0 |
| TATA | 0 |
| TCN | 0 |
| TCTA | 0.2 |
| TTGAAT | 0.255813953 |
| TTGGGT | 0.28 |
| TTTV | 0 |
Shows all PAMs as triplet DNA codons and the possible amino acids generated from each codon
| PAM | Amino acids |
|---|---|
| ATG | M |
| AWG | K M |
| CCN | P |
| CTT | L |
| GCTV | L A |
| GTTV | V L F |
| NAG | K Q E |
| NGG | R G W |
| NGGNG | R E A G V W |
| NGGRRT | N R S E D G W |
| NGRRN | K N R S E D G W |
| NGRRT | N R S E D G W |
| NNAGAAW | K N T R I Q P L E A G V S |
| NNGRRT | K N T R S M Q P L E D A G V W |
| NGATT | R I D G |
| NRYAC | N T S I H R D A G V Y C |
| TATA | I Y |
| TCN | S |
| TCTA | L S |
| TTGAAT | N E L |
| TTGGGT | G W L |
| TTTV | L F |
Fig. 3Mean ASA (average surface accessibility) values, averaged across the different possible codons per PAM
Fig. 4Maximum mean ASA (average surface accessibility) values, across the different possible codons per PAM