| Literature DB >> 36060308 |
E V Kropocheva1, L A Lisitskaya1, A A Agapov1, A A Musabirov1, A V Kulbachinskiy1, D M Esyunina1.
Abstract
Programmable nucleases are the most important tool for manipulating the genes and genomes of both prokaryotes and eukaryotes. Since the end of the 20th century, many approaches were developed for specific modification of the genome. The review briefly considers the advantages and disadvantages of the main genetic editors known to date. The main attention is paid to programmable nucleases from the family of prokaryotic Argonaute proteins. Argonaute proteins can recognize and cleave DNA sequences using small complementary guide molecules and play an important role in protecting prokaryotic cells from invading DNA. Argonaute proteins have already found applications in biotechnology for targeted cleavage and detection of nucleic acids and can potentially be used for genome editing. © Pleiades Publishing, Inc. 2022, ISSN 0026-8933, Molecular Biology, 2022. © Pleiades Publishing, Inc., 2022.Entities:
Keywords: Argonaute proteins; biosensors; genome editing; guide DNA; programmable nucleases
Year: 2022 PMID: 36060308 PMCID: PMC9427165 DOI: 10.1134/S0026893322060103
Source DB: PubMed Journal: Mol Biol ISSN: 0026-8933 Impact factor: 1.540
Fig. 1. Variety of programmable nucleases. (a–d) Endonucleases that do not require a PAM sequence in the target for catalytic activity; (e, f) endonucleases that require the presence of PAM (protospacer adjacent motif) in the 5'-region from the recognition site; (g, h) endonucleases that require the presence of PAM in the 3'-region from the recognition site. (a) Chimeric endonuclease based on the zinc finger DNA-binding domain (PDB ID: 7ZNF) and FokI (PDB ID: 2FOK) (protein structures are combined for illustration); (b) chimeric endonuclease based on the TALE DNA-binding domain (PDB ID: 4HPZ) and FokI (PDB ID: 2FOK) (protein structures are combined for illustration); (c) meganuclease I-AabMI (PDB ID: 4YIT); (d) Argonaute protein can be an RNA- or DNA-guided endonuclease and cleave RNA or DNA (TtAgo, PDB ID: 4NCB); (e) RNA-dependent DNA-endonuclease TnpB (TnpB ISDra2 structure was predicted using the AlphaFold2 algorithm [1]); (f) RNA-dependent DNA-endonuclease Cas12 (PDB ID: 7EU9) [2]; (g) RNA-dependent DNA-endonuclease Cas9 (PDB ID: 6M0X) [3]; and (h) RNA-dependent DNA-endonuclease IscB (the structure was predicted using the AlphaFold2 algorithm [1]).
Fig. 2. Cleavage of nucleic acids by Argonaute proteins. Argonaute is loaded with a guide single-stranded nucleic acid molecule (guide binding), followed by a target search based on complementary interactions (target recognition). If the target is complementary to the guide, the conformation of the enzyme changes, catalysis occurs (target cleavage) and the cleaved target is released (target release).
Fig. 3. Phylogenetic tree of prokaryotic Argonaute proteins. The tree was built according to the data published in [59]. Multiple alignment of the MID and PIWI domains was performed using the MAFFT program (v7.487) [117]. The tree was constructed using the IQ-TREE (v2.1.4-beta) program [118–120]. The colors of the branches correspond to the groups of Argonautes: longA (green), longB (blue), short (burgundy). In circles, color indicates the features of the structure of PIWI domains (inner circle; active proteins—green, inactive—gray), MID (second circle; phosphate-binding domain—yellow, OH-binding domain—dark blue) and PAZ (outer circle; full-length PAZ—green, shortened—pink); the PAZ domain is missing in “short” Argonautes.
Characteristics of catalytically active prokaryotic and eukaryotic Argonaute proteins
| Host | Argonaute, | Guide, lengthа | 5' end | Target | Reaction temperatureс | Ion | Mismatches | Catalytic activity | References |
|---|---|---|---|---|---|---|---|---|---|
| Prokaryotic Argonaute proteins | |||||||||
|
| AaAgo, WP_010880937.1 | DNA 18‒24 | 5′-P; ? | RNA, (DNA ?) | opt. 55°C; up to 75°C | Mg2+, Mn2+ | ? | Guide-dependent | [ |
(strains TK520, CWBI 1009) | CbAgo, WP_058142162.1; CbcAgo, WP_045143632.1 | DNAe (RNA weakly) 16‒18 | 5′-P; none | DNA* (RNA weakly) | opt. 37°C; 30‒60°C (CbcAgo opt. 37‒42°C; 25‒55°C) | Mg2+, Mn2+, Co2+ | ↓5, 6, 12‒15 nt | Guide-dependent; guide-independent | [ |
|
| CpAgo, WP_003477422.1 | DNA 15‒30 (≥12) | 5′-P; none | DNA (RNA) | opt. 37°C; 4–70°C | Mg2+, Mn2+ | ↓12‒15 nt | Guide-dependent | [ |
|
| FpAgo, WP_012966655.1 | DNA 15‒16 | 5′-P; G1 | DNA | 75–99°C | Mg2+, Mn2+, Co2+, Ni2+ | ↑7 nt; ↓3,8,11‒15 nt | Guide-dependent | [ |
|
| IbAgo, WP_007287731.1 | DNA 15‒30 (≥14) | 5′-P; none | DNA | opt. 37°C; 4–60°C | Mg2+, Mn2+ | ↓14‒16 nt | Guide-dependent | [ |
| KmAgo, WP_010289662.1 | DNA (RNA) 16‒20 (≥12) | 5′-P; none | DNA (RNA) | opt. 37‒60°С; from 25°С | Mn2+, Mg2+, weakly Co2+ | ↓4, 5, 10‒15 nt (DNA‒DNA); ↓4, 8‒11 nt (DNA‒RNA); ↓3‒11 nt (RNA‒DNA) | Guide-dependent; guide-independent (37°С) | [ |
|
| LrAgo, WP_075892274.1 | DNAe 16‒18 (≥10) | 5′-P; none | DNA | opt. 37°C; 30–54°C | Mn2+, Mg2+, weakly Co2+ | ↑4‒8 nt; ↓10‒15 nt | Guide-dependent; guide-independent | [ |
|
| MfAgo, WP_015791216.1 | DNA, RNA 16 | 5′-P DNA and RNA; 5′-OH DNA | DNA | opt. 80‒90°C; from 54°C | Mn2+, Mg2+, Co2+ | ? | Guide-dependent | [ |
|
| MpAgo, WP_014295921.1 | RNA 16‒40 | 5′-OH; none | DNA (RNA) | 37, 55, 60°C | Mn2+, Mg2+, weakly Ni2+ | ↓5, 7, 8 nt; ↓5‒15 nt dinucleotide mismatches | Guide-dependent | [ |
| MjAgo, WP_010870838.1 | DNA 15‒41 | 5′-P; purines | DNA | opt. 75‒95°C; from 37°C | Mg2+ | ? ↓10‒11 dinucleotide mismatches | Guide-dependent; guide-independent | [ |
|
| NgAgo, WP_005580376.1 | DNA 22‒24 | 5′-P; ? | DNA ? (RNA ?) | 37°C | Mg2+ | ? | Guide-dependent | [ |
| PfAgo, WP_011011654.1 | DNA 15‒31 | 5′-P; none | DNA | opt. 87‒99.9°C; from 37°C | Mn2+, Co2+ | ? | Guide-dependent; guide-independent | [ |
|
| SeAgo, WP_011244830.1 | DNAe 14‒24 | 5′-P; none | DNAe | opt. 37°C | Mn2+, weakly Mg2+ | ↑5, 6, 10 nt; ↓7, 12‒15, 18 nt | Guide-dependent; guide-independent | [ |
|
| TpAgo, WP_041081268.1 | RNA 21 | 5′-OH; ? | DNA | 60°C | Mg2+, Mn2+ | ? | Guide-dependent | [ |
| TtAgo, WP_011174533.1 | DNAe 13‒25 | 5′-P; C1 G1' | DNAe (RNA) | opt. 73‒75°C; ⩾20 °C ssDNA; ⩾ 65°C plasmids | Mn2+, Mg2+ | ↓12‒15 nt | Guide-dependent; guide-independent | [ |
| Eukaryotic Argonaute proteins | |||||||||
|
| hAgo2, NP_036286.2 | RNAe (DNA) 20‒21 | 5′-P; U1 A1' | RNAe | 37°C | Mg2+ | ↓2‒8 nt (target binding); ↓8‒11 triple mismatch | Guide-dependent | [ |
| KpAgo, XP_001644461.1 | RNAe (DNA) 12‒17 (11‒25) | 5′-P; U1 | RNAe | 25‒30°C | Mg2 | ↓7‒14 nt (↓↓9‒13 nt) | Guide-dependent | [ | |
|
| SIWI, NP_001098066.2 | RNAe 28‒30 | 5′-P; U1 | RNAe | 26, 37°C | Mg2+, Mn2+ | ↓2‒7 nt (target binding) | Guide-dependent | [ |
aThe type of guide nucleic acid and its length (number of nucleotides). bCharacteristics of the 5'-end of the guide: P—phosphate group, OH—hydroxyl group; the presence of specificity to the nitrogenous base is indicated: none—no specificity, A/U/G/C— types of nitrogenous bases, 1—the first nucleotide of the guide chain; 1'—the first nucleotide of the target chain. Here and further: ?—unknown. cThe temperature at which Argonaute protein could cleave target; opt.—the temperature optimum, the temperature range at which the Argonaute cleaves targets is also given. Discrete temperature values are given for publications where measurements were carried out only under these conditions. dThe effect of mismatches (unpaired nucleotides of the duplex guide–target) on the efficiency of target cleavage; the positions of the nucleotides (nt) are numbered relative to the 5'-end of the guide. ↑—increase in cleavage efficiency, ↓—decrease in cleavage efficiency. eBoth in vitro and in vivo.
Fig. 4. Detection of target sequences in biological samples using Argonaute proteins. (a) Detection of nucleic acids using Argonautes. The Argonaute is programmed to cut out the target sequence from the DNA; in addition, an oligonucleotide probe complementary to this sequence is added to the sample; the probe has a hairpin structure, to the ends of which a fluorescent label (F) and a quencher (Q) are attached. During the first round of catalysis, the Argonaute cuts out the target sequence in the sample, after which the Argonaute binds this sequence as a guide and performs the second round of catalysis, as a result of which the fluorescent probe is cleaved. The presence of the target sequence is assessed by increasing the fluorescence intensity [180, 181]. (b) Mutant alleles analysis in biological samples. To detect the mutant allele, DNA is first amplified, and then PCR products are incubated at 98°C with Argonaute loaded with two DNA guides corresponding to one of the variants of the target sequence; then the products are separated in an agarose gel [182]. (c) Detection of rare DNA variants. The mutant DNA sample is incubated with Argonaute loaded with guides corresponding to the wild-type sequence; only mutant DNA remains intact after catalysis; the amount of such DNA can be estimated using Real-time PCR or other methods [183]. (d) For miRNA detection, catalytically inactive Argonaute is loaded with a guide corresponding to the target sequence and containing a fluorescent label; after incubation with miRNAs immobilized on a chip, the fluorescence signal of guides immobilized in a complex with Argonautes and the target miRNA is detected [184]. (e) Detection of RNA modifications [131]. (f) RNA secondary structure analysis. Structured RNAs are incubated with Argonautes loaded with guides to different sequence sites; cleavage sites can be detected directly or by reverse transcription [126, 127, 153].
Fig. 5. Promising areas of applications of catalytically inactive Argonautes in biotechnology. (a) Argonaute (Ago) can be linked to a fluorescent label (F) and loaded with a guide complementary to the sequence of interest. (b) Argonaute may be combined with enzymes that replace or modify nucleotides in RNA. (c) Combining Argonaute with a transcription factor (TF) may enhance the transcription of a particular gene; a similar effect may be achieved if Argonaute competes with a transcriptional repressor protein; Argonaute with additional domains may enhance the interactions of the enhancer with the promoter of the gene of interest and thereby activate transcription. (d) Pre-mRNA splicing may be regulated by targeting Argonaute to its specific region. (e) Argonaute may be linked to various enzymes of other DNA editing systems to modify specific regions of the genome. (f) Argonaute may be combined with histone and DNA modifying enzymes for epigenome editing (orange circle indicates histone modification, lilac circle indicates cytosine methylation in DNA).