| Literature DB >> 31890142 |
Guanqing Liu1,2, Yong Zhang1,3, Tao Zhang1,2,4.
Abstract
The Clustered Regularly Interspaced Short Palindromic Repeat (CRISPR)/ CRISPR-associated (Cas) system has emerged as the main technology for gene editing. Successful editing by CRISPR requires an appropriate Cas protein and guide RNA. However, low cleavage efficiency and off-target effects hamper the development and application of CRISPR/Cas systems. To predict cleavage efficiency and specificity, numerous computational approaches have been developed for scoring guide RNAs. Most scores are empirical or trained by experimental datasets, and scores are implemented using various computational methods. Herein, we discuss these approaches, focusing mainly on the features or computational methods they utilise. Furthermore, we summarise these tools and give some suggestions for their usage. We also recommend three versatile web-based tools with user-friendly interfaces and preferable functions. The review provides a comprehensive and up-to-date overview of computational approaches for guide RNA design that could help users to select the optimal tools for their research.Entities:
Keywords: CRISPR; Efficiency; Guide RNA design; Machine-learning; Specificity
Year: 2019 PMID: 31890142 PMCID: PMC6921152 DOI: 10.1016/j.csbj.2019.11.006
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 7.271
Fig. 1Schematic diagram of CRISPR/Cas9 system at both on-target site and off-target site.
Computational methods for evaluation of guide RNA efficiency.
| Tool | Enzymes | Data source | Main features | Quantitative metrics |
|---|---|---|---|---|
| E-CRISP | Cas9 | – | SC, GF | – |
| CRISPRscan | Cas9, Cpf1 | Zebrafish | SC | Spearman correlation = 0.309, from |
| evaluateCrispr | Cas9 | SC | Spearman correlation = 0.074, from | |
| sgRNAScorer | Cas9, Cpf1 | Human | SC, EGF | Spearman correlation = 0.225, from |
| SSC | Cas9 | Human, Mouse | SC | Spearman correlation = 0.274, from |
| WU-CRISPR | Cas9 | Human, Mouse | SC, EP | Spearman correlation = 0.215, from |
| Azimuth | Cas9 | Human, Mouse | SC, GF, EP | Spearman correlation = 0.366, from |
| CRISPRater | Cas9 | Human | SC, GF | Pearson correlation = 0.399, from |
| CRISPRpred | Cas9 | Human, Mouse | SC, EP | ROC-AUC = 0.85, from |
| CASPER | Cas9, Cpf1 | – | SC | Pearson correlation = 0.443, from |
| DeepCpf1 | Cpf1 | Human | SC, EGF | Spearman correlation = 0.748, from |
| TSAM | Cas9 | Human, Mouse, Zebrafish | SC, GF, EP | Spearman correlation = 0.4, from |
| TUSCAN | Cas9 | Human, Mouse, Zebrafish | SC, GF | Spearman correlation = 0.12, from |
| uCRISPR | Cas9 | – | SC, EP | Spearman correlation = 0.3, from |
SC, sequences composition; GF, genetic features; EGF, epigenetic features; EP, energetics properties.
Computational methods for prediction of guide RNA specificity.
| Tool | Enzymes | Methods | Main features | Quantitative metrics |
|---|---|---|---|---|
| CasOT | Cas9 | alignment | unlimited mismatch number, paired-gRNA mode, annotation | slow |
| Cas-OFFinder | costom | alignment | unlimited mismatch number, GPU acceleration, web support | middle, fast (use GPU) |
| sgRNAcas9 | Cas9 | alignment | max 5 mismatches, paired-gRNA mode, annotation, risk evaluation | slow |
| FlashFry | costom | alignment | unlimited mismatch number, multiple on/off-target scores, annotation | fast |
| Crisflash | Cas9 | alignment | unlimited mismatch number, variant data support | fast |
| MIT | Cas9 | scoring | 20 bp sgRNA without PAM | ROC-AUC = 0.87, from |
| CCTop | Cas9, Cpf1 | scoring | empirically score based on number of mismatches | ROC-AUC = 0.77, from |
| CFD | Cas9 | scoring | 20 bp sgRNA with PAM (enable non-canonical PAM) | ROC-AUC = 0.91, from |
| CRISPRoff | Cas9 | scoring | energetics property and sequences composition | ROC-AUC = 0.98, from |
| uCRISPR | Cas9 | scoring | energetics property and sequences composition | Pearson correlation = 0.75, from |
| CRISTA | Cas9 | scoring | machine learning, sequences composition and epigenetic feature | ROC-AUC = 0.96, from |
| Elevation | Cas9 | scoring | machine learning, integrate both CFD model and epigenetic features | ROC-AUC = 0.98, from |
| DeepCRISPR | Cas9 | scoring | deep learning, sequences composition and epigenetic feature | ROC-AUC = 0.98, from |
Fig. 2Timeline of the development of web-based tools for CRISPR guide RNA design.
Web-based CRISPR guide RNA design tools.
| Tool | Website |
|---|---|
| CHOPCHOP | |
| CRISPR RGEN Tools | |
| CRISPOR | |
| E-CRISP | |
| CCTop | |
| CRISPR-ERA | |
| CRISPETa | |
| CRISPRscan | |
| EuPaGDT | |
| CRISPR-P | |
| CRISPR-PLANT | |
| CRISPR-GE | |
| inDelphi* | |
| FORECasT* | |
| Lindel* |
Tools marked by asterisk (*) are used for prediction of CRISPR editing outcomes.