Literature DB >> 25414360

CRISPRdirect: software for designing CRISPR/Cas guide RNA with reduced off-target sites.

Yuki Naito1, Kimihiro Hino1, Hidemasa Bono1, Kumiko Ui-Tei2.   

Abstract

UNLABELLED: CRISPRdirect is a simple and functional web server for selecting rational CRISPR/Cas targets from an input sequence. The CRISPR/Cas system is a promising technique for genome engineering which allows target-specific cleavage of genomic DNA guided by Cas9 nuclease in complex with a guide RNA (gRNA), that complementarily binds to a ∼ 20 nt targeted sequence. The target sequence requirements are twofold. First, the 5'-NGG protospacer adjacent motif (PAM) sequence must be located adjacent to the target sequence. Second, the target sequence should be specific within the entire genome in order to avoid off-target editing. CRISPRdirect enables users to easily select rational target sequences with minimized off-target sites by performing exhaustive searches against genomic sequences. The server currently incorporates the genomic sequences of human, mouse, rat, marmoset, pig, chicken, frog, zebrafish, Ciona, fruit fly, silkworm, Caenorhabditis elegans, Arabidopsis, rice, Sorghum and budding yeast. AVAILABILITY: Freely available at http://crispr.dbcls.jp/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author 2014. Published by Oxford University Press.

Entities:  

Mesh:

Substances:

Year:  2014        PMID: 25414360      PMCID: PMC4382898          DOI: 10.1093/bioinformatics/btu743

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 Introduction

Genome engineering is a promising technique to manipulate endogenous chromosomal DNA in a site-specific manner. A novel system that employs the prokaryotic immune defense system based on the clustered regularly interspaced short palindromic repeats (CRISPR) and CRISPR-associated (Cas) protein has been reported as a prominent genome engineering approach (Cho ; Cong ; Jinek ; Mali ). Recent studies utilize the RNA-guided endonuclease Cas9 from Streptococcus pyogenes and a guide RNA (gRNA), which acts as a guide to define the target site to introduce DNA double-stranded break. A remarkable advantage of the CRISPR/Cas system is that the target DNA sequence is recognized by simple base-pairing complementarity by the gRNA. Thus, the CRISPR/Cas system can be programmed only by changing the gRNA sequence, and the synthesis of the gRNA for targeting a specific gene is easy at low cost. However, it should be a critical issue to avoid the cleavage of the unintended off-target genes, since double-stranded break results in stable and heritable modification of the genome. In this study, we present CRISPRdirect (http://crispr.dbcls.jp/), which provides efficient selection of CRISPR/Cas target sites with reduced numbers of potential off-target candidates. CRISPRdirect investigates the entire genome for perfect matches with each candidate target sequence (20 mer) and their seed sequence (12 or 8 mer) flanking the PAM. Users can also browse the detailed list of potential off-target sites that have partial complementarity with the selected sequence. The server incorporates genomic sequences of human, mouse, rat, marmoset, pig, chicken, frog, zebrafish, Ciona, fruit fly, silkworm, Caenorhabditis elegans, Arabidopsis, rice, Sorghum and budding yeast. Currently, several web servers are available for designing CRISPR/Cas gRNAs (Supplementary Table S1). CRISPR Design (Hsu ; Ran ) performs gRNA selection from an input sequence up to 250 bp, and gRNAs are scored based on predicted off-target interactions. E-CRISP (Heigwer ) ranks gRNAs according to on-target specificity and number of off-targets. E-CRISP, ZiFiT (Sander ), Cas9 Design (Ma ) and CHOPCHOP (Montague ) utilize Bowtie (Langmead and Salzberg, 2012) to perform off-target searches allowing mismatches. On the other hand, DNA2.0 gRNA Design Tool (https://www.dna20.com/eCommerce/startCas9) searches for perfect matches with 12 nt seed to identify off-target sites. These servers except CRISPR Design and ZiFiT can process at least 10 kbp of input sequence. Web servers for checking off-target sites for given 20 nt sequences are also available, such as Cas-OFFinder (Bae ) and GGGenome (http://GGGenome.dbcls.jp/). These web servers are useful for designing gRNAs for a few input sequences, but processing large number of input sequences requires a laborious process. Even in such cases, CRISPRdirect returns the results quickly and provides a convenient interface for automated gRNA design as described in the Data export and API section, making it a powerful tool for using CRISPR/Cas system on a genome-wide scale.

2 Web server implementation

2.1 Overview

The web server accepts an accession number, a genome coordinate or an arbitrary nucleotide sequence up to 10 kbp as input (Fig. 1A) and returns a list of CRISPR/Cas target candidates. Target sequences of 20 nt adjacent to the PAM sequence (e.g. NGG, NRG) are searched from both strands of the input sequence and listed as shown in Figure 1B. The list contains target position, target sequence, additional information on the sequence and the number of target sites in the genome. The additional information on the sequences such as GC content and calculated melting temperature (Tm) are provided, since previous report suggested that sgRNA sequences with very high or low GC content were less effective against their targets (Wang ). The presence or absence of TTTT (four consecutive T’s that cause pol III termination) in the target sequence is also indicated in order to avoid TTTT in gRNA vectors with pol III promoter. A detailed description of the web server is provided in Supplementary Methods.
Fig. 1.

Screenshot from the CRISPRdirect web server. (A) Top page. The server accepts either an accession number or a nucleotide sequence as input. (B) Typical output of CRISPRdirect. A list of CRISPR/Cas target candidates is displayed. (C) A graphical view of target sites demonstrates the position and orientation of each site. (D) The results can be exported as tab-delimited text or in JSON format. (E) Detailed list of potential off-target sites which visualizes the positions of mismatches and gaps

Screenshot from the CRISPRdirect web server. (A) Top page. The server accepts either an accession number or a nucleotide sequence as input. (B) Typical output of CRISPRdirect. A list of CRISPR/Cas target candidates is displayed. (C) A graphical view of target sites demonstrates the position and orientation of each site. (D) The results can be exported as tab-delimited text or in JSON format. (E) Detailed list of potential off-target sites which visualizes the positions of mismatches and gaps

2.2 Off-target evaluation

The number of target sites in the genome (Fig. 1B) is counted using Jellyfish (Marçais and Kingsford, 2011). The column ‘20 mer+PAM’ shows the number of hits with perfect matches for each target sequence (20 mer) adjacent to the PAM. Although the exact length of the completely complementary region necessary for cleavage by CRISPR nucleases is unknown, the mutations within the ‘seed’ sequence at 8–12 nt immediately adjacent to the PAM are known to impair cleavage, suggesting that this region is the most critical determinant of target specificity (Cong ; Fu ; Hsu ; Pattanayak ). Therefore, we built up the columns ‘12 mer+PAM’ and ‘8 mer+PAM’ in order to show the number of hits with perfect matches for their seed sequence (12 or 8 mer, respectively) adjacent to the PAM. Note that the numbers of hits displayed here include both on-target and off-target sites. For instance, one (‘1’) in these columns indicates that the sequence has only one perfect match with the intended target site. Any number greater than one indicates that there are some potential off-target sites. Thus, in terms of avoiding off-target editing, the smaller the number (but not zero) is, the better. Zero (‘0’) in these columns means that the sequence has no match in the genomic sequence; such sequences may possibly span over exon–exon junctions, so their use should be avoided. CRISPRdirect highlights the CRISPR/Cas targets that have relatively fewer off-target sites (Fig. 1B and C). A detailed list of off-target candidates can be investigated by clicking the ‘detail’ link (Fig. 1E). The searches allowing mismatches and gaps (insertions and/or deletions) are performed using GGGenome (http://GGGenome.dbcls.jp/) REST API developed by the authors’ group instead of widely used BLAST (Altschul ), because BLAST may overlook some potential off-targets as mentioned in our previous work describing siDirect (Naito ), a web server for designing functional siRNA with reduced off-target effects. GGGenome quickly searches short nucleotide sequences utilizing suffix arrays and inverse suffix links indexed on solid state drive (SSD). As shown in Supplementary Table S1, off-target searches allowing gaps are not yet available in other existing web tools. However, the most recent report shows that CRISPR/Cas9 system has off-target activity with insertions or deletions between target DNA and gRNA sequences (Lin ). Therefore, we consider that off-target searches allowing mismatches and gaps would be a more suitable procedure to list off-target candidates exhaustively. The positions of the mismatches and gaps are visualized in the list (Fig. 1E), which may help predict the potency of off-target editing. CRISPRdirect incorporates genomic sequences of various organisms to perform off-target searches. Although Xenopus laevis has long been used as a preferred model organism among developmental biologists, we incorporated X.tropicalis genome instead of X.laevis genome, because X.tropicalis is diploid while X.laevis is allotetraploid which makes it difficult to select specific targets. There are some loci that are difficult to select specific targets. Typical examples are the histone clusters (NM_021059, etc.) and ribosomal proteins (NM_022551, etc.), which are known to form multigene families. When designing CRISPR targets for such genes, users should manually investigate a detailed list of potential off-target sites (Fig. 1E) and select the sequence that has fewer off-target hits on unrelated loci. Alternatively, if site-specific gRNA could not be designed within intended region, multiple gRNA approaches would be considerable (Guilinger ; Ran ; Tsai ). For such strategy, graphical view of CRISPRdirect results which visualizes the position and orientation of target sites (Fig. 1C) would be helpful for selecting paired gRNAs.

2.3 Data export and API

The results can be exported as tab-delimited text or in JSON format from the bottom of the result page (Fig. 1D). Users can copy–paste the text results into a spreadsheet or text editor for downstream analysis. The results can also be downloaded as a separate file by clicking the ‘download’ link. Alternatively, tab-delimited text or JSON output can be obtained via API, which is convenient for users to design a number of CRISPR/Cas targets in an automated manner.

Funding

Life Science Database Integration Project, National Bioscience Database Center (NBDC) of Japan Science and Technology Agency (JST) (to Y.N. and H.B.); Grant-in-Aid for Scientific Research from the Ministry of Education, Culture, Sports, Science and Technology (MEXT) of Japan (to Y.N. and K.U.-T.); Cell Innovation Program of MEXT (to K.U.-T.). Conflict of interest: none declared.
  21 in total

1.  siDirect: highly effective, target-specific siRNA design software for mammalian RNA interference.

Authors:  Yuki Naito; Tomoyuki Yamada; Kumiko Ui-Tei; Shinichi Morishita; Kaoru Saigo
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

2.  A fast, lock-free approach for efficient parallel counting of occurrences of k-mers.

Authors:  Guillaume Marçais; Carl Kingsford
Journal:  Bioinformatics       Date:  2011-01-07       Impact factor: 6.937

3.  Targeted genome engineering in human cells with the Cas9 RNA-guided endonuclease.

Authors:  Seung Woo Cho; Sojung Kim; Jong Min Kim; Jin-Soo Kim
Journal:  Nat Biotechnol       Date:  2013-01-29       Impact factor: 54.908

4.  Fast gapped-read alignment with Bowtie 2.

Authors:  Ben Langmead; Steven L Salzberg
Journal:  Nat Methods       Date:  2012-03-04       Impact factor: 28.547

5.  ZiFiT (Zinc Finger Targeter): an updated zinc finger engineering tool.

Authors:  Jeffry D Sander; Morgan L Maeder; Deepak Reyon; Daniel F Voytas; J Keith Joung; Drena Dobbs
Journal:  Nucleic Acids Res       Date:  2010-04-30       Impact factor: 16.971

6.  Multiplex genome engineering using CRISPR/Cas systems.

Authors:  Le Cong; F Ann Ran; David Cox; Shuailiang Lin; Robert Barretto; Naomi Habib; Patrick D Hsu; Xuebing Wu; Wenyan Jiang; Luciano A Marraffini; Feng Zhang
Journal:  Science       Date:  2013-01-03       Impact factor: 47.728

7.  RNA-guided human genome engineering via Cas9.

Authors:  Prashant Mali; Luhan Yang; Kevin M Esvelt; John Aach; Marc Guell; James E DiCarlo; Julie E Norville; George M Church
Journal:  Science       Date:  2013-01-03       Impact factor: 47.728

8.  High-frequency off-target mutagenesis induced by CRISPR-Cas nucleases in human cells.

Authors:  Yanfang Fu; Jennifer A Foden; Cyd Khayter; Morgan L Maeder; Deepak Reyon; J Keith Joung; Jeffry D Sander
Journal:  Nat Biotechnol       Date:  2013-06-23       Impact factor: 54.908

9.  CHOPCHOP: a CRISPR/Cas9 and TALEN web tool for genome editing.

Authors:  Tessa G Montague; José M Cruz; James A Gagnon; George M Church; Eivind Valen
Journal:  Nucleic Acids Res       Date:  2014-05-26       Impact factor: 16.971

10.  RNA-programmed genome editing in human cells.

Authors:  Martin Jinek; Alexandra East; Aaron Cheng; Steven Lin; Enbo Ma; Jennifer Doudna
Journal:  Elife       Date:  2013-01-29       Impact factor: 8.140

View more
  326 in total

1.  Genome-scale engineering of Saccharomyces cerevisiae with single-nucleotide precision.

Authors:  Zehua Bao; Mohammad HamediRad; Pu Xue; Han Xiao; Ipek Tasan; Ran Chao; Jing Liang; Huimin Zhao
Journal:  Nat Biotechnol       Date:  2018-05-07       Impact factor: 54.908

Review 2.  Gene-edited CRISPy Critters for alcohol research.

Authors:  Gregg E Homanics
Journal:  Alcohol       Date:  2018-03-07       Impact factor: 2.405

3.  Genome engineering uncovers 54 evolutionarily conserved and testis-enriched genes that are not required for male fertility in mice.

Authors:  Haruhiko Miyata; Julio M Castaneda; Yoshitaka Fujihara; Zhifeng Yu; Denise R Archambeault; Ayako Isotani; Daiji Kiyozumi; Maya L Kriseman; Daisuke Mashiko; Takafumi Matsumura; Ryan M Matzuk; Masashi Mori; Taichi Noda; Asami Oji; Masaru Okabe; Renata Prunskaite-Hyyrylainen; Ramiro Ramirez-Solis; Yuhkoh Satouh; Qian Zhang; Masahito Ikawa; Martin M Matzuk
Journal:  Proc Natl Acad Sci U S A       Date:  2016-06-29       Impact factor: 11.205

4.  Engineering Saccharomyces cerevisiae for production of simvastatin.

Authors:  Carly M Bond; Yi Tang
Journal:  Metab Eng       Date:  2018-09-10       Impact factor: 9.783

5.  Integrated design, execution, and analysis of arrayed and pooled CRISPR genome-editing experiments.

Authors:  Matthew C Canver; Maximilian Haeussler; Daniel E Bauer; Stuart H Orkin; Neville E Sanjana; Ophir Shalem; Guo-Cheng Yuan; Feng Zhang; Jean-Paul Concordet; Luca Pinello
Journal:  Nat Protoc       Date:  2018-04-12       Impact factor: 13.491

6.  Improved bioethanol production using CRISPR/Cas9 to disrupt the ADH2 gene in Saccharomyces cerevisiae.

Authors:  Ting Xue; Kui Liu; Duo Chen; Xue Yuan; Jingping Fang; Hansong Yan; Luqiang Huang; Youqiang Chen; Wenjin He
Journal:  World J Microbiol Biotechnol       Date:  2018-10-01       Impact factor: 3.312

7.  Distinct cis-acting regions control six6 expression during eye field and optic cup stages of eye formation.

Authors:  Kelley L Ledford; Reyna I Martinez-De Luna; Matthew A Theisen; Karisa D Rawlins; Andrea S Viczian; Michael E Zuber
Journal:  Dev Biol       Date:  2017-04-21       Impact factor: 3.582

8.  CAR1 deletion by CRISPR/Cas9 reduces formation of ethyl carbamate from ethanol fermentation by Saccharomyces cerevisiae.

Authors:  Young-Wook Chin; Woo-Kyung Kang; Hae Won Jang; Timothy L Turner; Hyo Jin Kim
Journal:  J Ind Microbiol Biotechnol       Date:  2016-08-29       Impact factor: 3.346

9.  Functional Studies of Transcriptional Cofactors via Microinjection-Mediated Gene Editing in Xenopus.

Authors:  Yuki Shibata; Lingyu Bao; Liezhen Fu; Bingyin Shi; Yun-Bo Shi
Journal:  Methods Mol Biol       Date:  2019

10.  Generation of Efficient Knock-in Mouse and Human Pluripotent Stem Cells Using CRISPR-Cas9.

Authors:  Tatsuya Anzai; Hiromasa Hara; Nawin Chanthra; Taketaro Sadahiro; Masaki Ieda; Yutaka Hanazono; Hideki Uosaki
Journal:  Methods Mol Biol       Date:  2021
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.