| Literature DB >> 33051688 |
Alexander A Gooden1, Christine N Evans1, Timothy P Sheets1, Michelle E Clapp1, Raj Chari1.
Abstract
With the technology's accessibility and ease of use, CRISPR has been employed widely in many different organisms and experimental settings. As a result, thousands of publications have used CRISPR to make specific genetic perturbations, establishing in itself a resource of validated guide RNA sequences. While numerous computational tools to assist in the design and identification of candidate guide RNAs exist, these are still just at best predictions and generally, researchers inevitably will test multiple sequences for functional activity. Here, we present dbGuide (https://sgrnascorer.cancer.gov/dbguide), a database of functionally validated guide RNA sequences for CRISPR/Cas9-based knockout in human and mouse. Our database not only contains computationally determined candidate guide RNA sequences, but of even greater value, over 4000 sequences which have been functionally validated either through direct amplicon sequencing or manual curation of literature from over 1000 publications. Finally, our established framework will allow for continual addition of newly published and experimentally validated guide RNA sequences for CRISPR/Cas9-based knockout as well as incorporation of sequences from different gene editing systems, additional species and other types of site-specific functionalities such as base editing, gene activation, repression and epigenetic modification.Entities:
Year: 2021 PMID: 33051688 PMCID: PMC7779039 DOI: 10.1093/nar/gkaa848
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
List of sgRNA design tools
| Name | URL | Species | Design? | Validated guides |
|---|---|---|---|---|
| CCTop ( |
| Many | Yes | No |
| CHOPCHOP ( |
| Many | Yes | No |
| CRISPOR ( |
| Many | Yes | No |
| CRISPRseek ( |
| Many | Yes | No |
| CRISPRz ( |
| Zebrafish | No | Yes |
| CRISPR-ERA ( |
| Many | Yes | No |
| CRISPR MultiTargeter ( |
| Many | No | No |
| Cas-Designer ( |
| Many | Yes | No |
| DeepHF ( |
| Human | No | No |
| E-CRISP ( |
| Many | Yes | No |
| EuPaGDT ( |
| Many | Yes | No |
| inDelphi ( |
| Human, Mouse | Yes | No |
| sgRNAcas9 ( |
| Many | Yes | No |
| sgRNA Designer ( |
| Many | No | No |
| sgRNA Scorer 2.0 ( |
| Many | Yes | No |
| WU-CRISPR ( |
| Many | Yes | No |
|
| ||||
| Addgene |
| Many | No | Yes |
| CRISPR gRNA Design Tool (Atum) |
| Many | Yes | No |
| Benchling |
| Many | Yes | No |
| CRISPR Design Tool (Horizon Discovery) |
| Many | No | No |
| CRISPR sgRNA Design Tool (GenScript) |
| Many | No | No |
| CRISPR Cas9 Guide RNA Checker (IDT DNA) |
| Many | Yes | No |
| True Designer Genome Editior (ThermoFisher Scientific) |
| Many | Yes | No |
Figure 1.Structure of the dbGuide database. (A) Components of the dbGuide database. The user interface was developed using html and javascript and the application is managed on an Apache web server using django. All underlying data is stored in a MariaDB (MySQL) database. (B) Schema of the MySQL database. Each species, currently limited to human and mouse, has a table of sgRNA sequences with pre-computed metrics and a table with gene annotation information. sgRNA sequences obtained from publications are stored in a single table and the ‘species’ field is used to determine which species the guide RNA was used. Similarly, sequences from targeted amplicon sequencing data also have the ‘species’ field for this purpose.
Figure 2.Screen shots of the key user interfaces. (A) Opening window of the dbGuide database. The user must first select whether to search within the human/mouse genome and then can specify a HGNC gene symbol, chromosomal coordinate in BED format, or ENSEMBL gene/transcript ID. (B) Window depicting results of a search by gene symbol. Selectable rows of sgRNA sequences are returned with various pre-computed metrics and PubMed identifiers, if the sequence had been used in a peer-reviewed publication. (C) Graphical representation of sgRNA sequences for which targeted amplicon sequencing data were generated. Stacked bar plots display the percentage of NHEJ mutation events that results in an ‘in-frame’ (black) or ‘out of frame’ (blue) amino acid change.