| Literature DB >> 35465164 |
Sophia McDaniel1, Alexis Komor2, Alon Goren1.
Abstract
Single nucleotide variants (SNVs) represent the most common type of polymorphism in the human genome. However, in many cases the phenotypic impacts of such variants are not well understood. Intriguingly, while some SNVs cause debilitating diseases, other variants in the same gene may have no, or limited, impact. The mechanisms underlying these complex patterns are difficult to study at scale. Additionally, current data and research is mainly focused on European populations, and the mechanisms underlying genetic traits in other populations are poorly studied. Novel technologies may be able to mitigate this disparity and improve the applicability of personalized healthcare to underserved populations. In this review we discuss base editing technologies and their potential to accelerate progress in this field, particularly in combination with single-cell RNA sequencing. We further explore how base editing screens can help link SNVs to distinct disease phenotypes. We then highlight several studies that take advantage of single-cell RNA sequencing and CRISPR screens to emphasize the current limitations and future potential of this technique. Lastly, we consider the use of such approaches to potentially accelerate the study of genetic mechanisms in non-European populations.Entities:
Keywords: Base editing; CRISPR; Screens; Single nucleotide variants; Single-cell RNA sequencing
Year: 2022 PMID: 35465164 PMCID: PMC9010703 DOI: 10.1016/j.csbj.2022.03.031
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 6.155
Fig. 1ClinVar distribution of SNV effects in humans and demonstration of the GWAS population biases. a. Graphs demonstrating the approximate percent of identified (by sequencing) human genetic variants that have been clinically classified (left), and the distribution of phenotypic effects, as listed in the ClinVar database [53] (right). b. Estimated ratios of the GWAS in different populations during the last 15 years (the colors on the right represent the different populations). The plot was obtained from https://gwasdiversitymonitor.com/[67].
Summary of sgRNA design and microhomology prediction tools (non-exhaustive).
| sgRNA Design Tools | |||||||
|---|---|---|---|---|---|---|---|
| Design Tool | CRISPR Tool | Cas Types Included | Input | Output | Cell Types | Weblink | Study |
| pegFinder | Prime editing | Cas9 | ∼200 bp sequence context and cleavage site, desired PAM sequence, and desired edited sequence | Top sgRNAs, | Not specified | ||
| CHOPCHOP | CRISPRko, CRISPRa, CRISPRi | Cas9; Cas9n; Cas12a; Cas13 | Target gene, reference genome of interest | A list of scored sgRNAs as well as a visual representation | Not specified | ||
| Elevation | CRISPRko | Cas9 | Gene/transcript ID | Top sgRNAs, on target score, off target score | Not specified | ||
| GUIDES | CRISPRko | Cas9 | Genome of interest, gene of interest | Top sgRNAs, on target score, off target score | Not specified | ||
| CRISPRscan | CRISPRko | Cas9; Cas12a | Genome of interest, gene or transcript of interest | CRISPRscan score | Not specified | ||
| BE-Hive | Base Editing | ABE; BE4 | Target genomic DNA sequence, desired protospacer sequence | Editing efficiency score for each editable base within edit window | HEK293T; mES | ||
| BE-DICT Bystander | Base Editing | ABE; BE4 | Target genomic DNA sequence | Top sgRNAs, editing efficiency score for each editable base within the edit window | Not specified | ||
| DeepBaseEditor | Base Editing | ABE; CBE | Target genomic DNA sequence | Top sgRNAs, DeepCas9 score based on edit efficiency and probability of bystander edits | Not specified | ||
| Microhomology Prediction Tools | |||||||
| CRISPR RGEN Tools: Micrhomology-Predictor | All engineered nucleases | Cas9 | 60–80 bp seq flanking the cleavage site | Score predicting likelihood of out-of-frame deletions | Not specified | ||
| Up to 5 kb sequence | Scores for all possible Cas9 targets in the inquiry | ||||||
| inDelphi | CRISPRko | Cas9 | Sequence context and cleavage site, cell type of interest, and PAM sequence of interest | Score predicting likelihood of out-of-frame deletions | mESC; HEK293; U2Os; K563; HCT116 | ||
Summary of CRISPR-based tools (non-exhaustive).
| CRISPR Derivative | Cas Protein Type | Edit Type | Appended Enzyme | Modification | Study |
|---|---|---|---|---|---|
| CRISPR | CRISPR | Indels | N/A | N/A | |
| CRISPRi | dCas9 | Inhibition | N/A | Both nucleases deactivated | |
| CRISPRa | dCas9 | Activation | VP64 | Both nucleases deactivated and VP64 transcriptional activator domain appended | |
| CRISPRa | dCas9 | Activation | VP64 | Both nucleases deactivated and VP64 transcriptional activator domain appended | |
| ABE | nCas9 | Transition SNP | Modified TadA adenosine deaminase | Both nucleases deactivated and evolved TadA adenosine deaminase appended | |
| BE1 | dCas9 | Transition SNP | APOBEC | Both nucleases deactivated and APOBEC cytosine deaminase appended | |
| BE2 | dCas9 | Transition SNP | APOBEC | UGI appended | |
| BE3 | nCas9 | Transition SNP | APOBEC | HNH domain nucleolytic activity restored | |
| BE4 | nCas9 | Transition SNP | APOBEC | Linker lengths extended and additional UGI appended | |
| BE4max | nCas9 | Transition SNP | APOBEC | NLS appended | |
| PE1 | nCas9 | Small indels, Transition SNPs, Transversion SNPs | M-MLV Reverse Transcriptase | Reverse transcriptase appended; pegRNA appended | |
| PE2 | nCas9 | Small indels, Transition SNPs, Transversion SNPs | M-MLV Reverse Transcriptase | Reverse transcriptase modified | |
| PE3 | nCas9 | Small indels, Transition SNPs, Transversion SNPs | M-MLV Reverse Transcriptase | Additional Cas9 nickase appended; simple sgRNA targeting complementary strand appended |
Fig. 2Structure and mechanism of the CRISPR/Cas system. The CRISPR/Cas system (exemplified by Cas9 above) is composed of an sgRNA (shown in orange), which is complementary to the target DNA and binds to the target DNA, and a Cas protein (shown in grey), which helps bind and cleave the DNA through two nucleolytic domains. The HNH nuclease domain cleaves the complementary (i.e., target) strand while the RuvC nuclease cleaves the non-target strand. By definition, the non-target strand is called the protospacer, and has the same sequence as the sgRNA. The protospacer adjacent motif (PAM) is located directly downstream of the protospacer and is required for the Cas protein to initiate DNA binding [33]. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 3Schematic of the structural makeup and mechanism of base editors. Top: Cytosine base editors (CBEs) are composed of a catalytically impaired Cas9 protein (Cas9n) with a cytidine deaminase fused to the N-terminus and a uracil glycosylase inhibitor fused to the C-terminus. After the Cas9:sgRNA complex binds to the target DNA, the cytidine deaminase may convert any cytidine bases within the edit window to uridines. DNA repair pathways then preferentially replace the non-edited strand and incorporate an adenosine base across from the uridine. Overall, CBEs catalyze a C•G to T•A base pair conversion. Bottom: Adenine base editors (ABEs) are similar to CBEs but have an adenosine deaminase as the DNA base modifying enzyme and have no DNA repair inhibitor on the C-terminus. After the Cas9:sgRNA complex binds, adenosine residues in the edit window may be converted to inosines, which have the base-pairing properties of guanosine. Overall, ABEs catalyze an A•T to G•C base pair conversion.
Fig. 4A general schematic of BE screens and single cell RNA sequencing (scRNAseq) workflows. Note, as there are multiple approaches that can be used for linking scRNAseq with BEs (or CRIPSRko/a/i) screens we provide here a general overview of the main steps. a. A library of sgRNA spacer sequences is designed and generated, then assembled into a viral vector. Lentiviruses are then produced, which are transduced into a population of target cells that express a BE. Expression of both the BE and a sgRNA will result in the introduction of an SNV of interest. The resulting cell population (which harbors a library of SNVs) is then subjected to a challenge to induce growth competition, followed by investigation of the effect of each SNV. b. After subjecting a pool of cells (that harbor a library of mutations) to a perturbation, the cells are passed through a microfluidics device to isolate individual cells into droplets containing a barcoded bead. The cells are lysed, and mRNA is captured by oligonucleotides on the beads. The oligonucleotides are reverse transcribed into a library, the droplets are recombined, then the library is sequenced and analyzed [61].
Summary of example BE screen studies.
| Investigated Genes | Selection Type | Selection Agent | Editor Type | scRNAseq Capture Method | Cell Line(s) | Study |
|---|---|---|---|---|---|---|
| MAP2K1, KRAS, NRAS | Positive | Vemurafenib | BE3 | CROP-seq | A375 | |
| Multiple | Positive/Negative | Cisplatin, Hygromycin | BE3, BE4 | N/A | HT29, MELJUSO | |
| BRCA1 | Negative | Olaparib | BE3 | N/A | HAP1 | |
| DDR Pathway | Positive/Negative | Cisplatin, Olaparib, Doxorubicin, Camptothecin | BE3 | N/A | MFC10A, MFC7, HAP1 |