Literature DB >> 27559154

Cas-analyzer: an online tool for assessing genome editing results using NGS data.

Jeongbin Park1, Kayeong Lim2,3, Jin-Soo Kim2,3, Sangsu Bae1,4.   

Abstract

Genome editing with programmable nucleases has been widely adopted in research and medicine. Next generation sequencing (NGS) platforms are now widely used for measuring the frequencies of mutations induced by CRISPR-Cas9 and other programmable nucleases. Here, we present an online tool, Cas-Analyzer, a JavaScript-based implementation for NGS data analysis. Because Cas-Analyzer is completely used at a client-side web browser on-the-fly, there is no need to upload very large NGS datasets to a server, a time-consuming step in genome editing analysis. Currently, Cas-Analyzer supports various programmable nucleases, including single nucleases and paired nucleases.
AVAILABILITY AND IMPLEMENTATION: Free access at http://www.rgenome.net/cas-analyzer/ CONTACT: sangsubae@hanyang.ac.kr or jskim01@snu.ac.krSupplementary information: Supplementary data are available at Bioinformatics online.
© The Author 2016. Published by Oxford University Press.

Entities:  

Mesh:

Substances:

Year:  2016        PMID: 27559154      PMCID: PMC5254075          DOI: 10.1093/bioinformatics/btw561

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 Introduction

Programmable nucleases such as zinc finger nucleases (ZFNs), transcription-activator-like effector nucleases (TALENs), and RNA-guided endonucleases derived from CRISPR-Cas9/Cpf1 systems, which are adaptive immune responses in bacteria and archaea, are widely used for genome editing in many research fields including biology, biotechnology, agriculture, and medical science (Kim and Kim, 2014). The type II Cas9 nuclease from Streptococcus pyogenes (SpCas9) was the first CRISPR nuclease used for genome editing (Cho ; Cong ; Jinek ; Mali ); since that time, various orthogonal Cas9 nucleases such as StCas9 (Cong ), NmCas9 (Hou ) and SaCas9 (Ran ) have been developed. Recently, putative type V Cpf1 nucleases from Acidominococcus and Lachnospiraceae were reported to mediate efficient genome editing in human cells (Kim ; Zetsche ) and mice (Hur ; Kim ). Moreover, dimeric CRISPR nucleases such as RNA-guided nickases (Cho ; Ran ) and RNA-guided FokI nucleases (Tsai ), or biochemical improvement of wild-type SpCas9 (Kleinstiver ; Slaymaker ) have been developed for genome editing to reduce off-target effects. Programmable nucleases introduce DNA double-strand breaks at user-defined target sites in the genome, ultimately inducing targeted gene knockout or knock-in via the cell’s own repair systems [error-prone non-homologous end joining or homology-directed repair (HDR) in the presence of a DNA template, respectively]. The induced mutation rates in cells can be estimated in a straightforward manner by using Surveyor nuclease (Perez ), the T7 endonuclease I (T7E1) assay (Kim ), polyacrylamide gel electrophoresis (Zhu ) or droplet digital PCR (Nelson ). However, these methods do not allow analysis of mutant sequences and are limited by relatively poor sensitivity. Recently we and other groups have used targeted deep sequencing to detect programmable nuclease-induced mutations with high sensitivity and precision and to analyze mutation patterns (Baek ). However, analysis of next generation sequencing (NGS) data is difficult for many researchers. Although a few web-based tools such as CRISPR-GA (Güell ), AGEseq (Xue and Tsai, 2015) and CRISPResso (Pinello ) are available, they are inconvenient to use because their web interfaces require a very long time to upload large data files (Supplementary Material S1). AGEseq and CRISPResso also support a command-line interface, but they are not accessible to researchers who are not familiar with bioinformatics. To address this issue, we present a web-based tool, Cas-Analyzer that is constructed with a JavaScript-based algorithm; thus, it wholly runs on the client-side so that large amounts of sequencing data do not need to be uploaded to the server. Thanks to the improvements in the newest JavaScript engines in the most recent web browsers (Supplementary Table S1), this tool works in a reasonable time. Currently, Cas-Analyzer supports a variety of nucleases, including single nucleases (SpCas9, StCas9, NmCas9, SaCas9, CjCas9 and AsCpf1/LbCpf1) and paired nucleases (ZFNs, TALENs, Cas9 nickases and dCas9-FokI nucleases).

2 Implementation

2.1 File loading

To use Cas-Analyzer, deep sequencing data are needed, which can be obtained by amplifying the target locus of genome edited cells (Supplementary Material S2) followed by NGS. The format of the raw output data is usually Fastq or gzip-compressed, and both data types are acceptable to Cas-Analyzer (Fig. 1A). For the compressed files, we used a JavaScript library ‘pako’ (http://nodeca.github.io/pako/), which is slightly modified to support blocked gzip files. If users upload paired-end sequencing data, Cas-Analyzer first merges paired-end reads by the JavaScript port of Fastq-join, a part of ea-utils (https://code.google.com/archive/p/ea-utils/).
Fig. 1.

Overview of Cas-Analyzer. (A) Uploading NGS data files. Single-end reads, paired-end reads, or already merged sequencing data are allowed. (B) Basic information about the query sequences are required for using Cas-Analyzer. (C) Indicators used in the analysis step. (D) The results are summarized as a table that includes the mutation count and frequency. (E) Insertions and deletions are also visualized as graphs. (F) All filtered sequences from the input data are aligned with the reference sequence

Overview of Cas-Analyzer. (A) Uploading NGS data files. Single-end reads, paired-end reads, or already merged sequencing data are allowed. (B) Basic information about the query sequences are required for using Cas-Analyzer. (C) Indicators used in the analysis step. (D) The results are summarized as a table that includes the mutation count and frequency. (E) Insertions and deletions are also visualized as graphs. (F) All filtered sequences from the input data are aligned with the reference sequence

2.2 Data analysis

Cas-Analyzer analyzes the uploaded data and calculates mutation frequencies in three steps (Fig. 1B–D): (i) Cas-Analyzer first finds the cleavage point in the reference sequence for the selected nuclease. Using the given comparison range (R) parameter, Cas-Analyzer defines 12nt of indicator sequences on both sides of the given reference sequence and then selects the valid sequences, which contain both indicators with up to a 1-nt mismatch, from the uploaded data. (ii) For the selected sequences, Cas-Analyzer then counts the recurrent frequency of each sequence and excludes the sequences below the given minimum frequency (n). (iii) Cas-Analyzer finally classifies the filtered sequences into three different groups: ‘insertion’, ‘deletion’ or ‘WT or substitution’ based on comparing the sequence length with the length of the given reference sequence. Optionally, if a WT marker range (r) is given, the short sequence around the cleavage point will be used as the marker of wild-type. If this marker exists in the query sequence, it will always be classified into the ‘WT or substitution’ group regardless of its length. Additionally, if the donor DNA sequence for HDR is given, Cas-Analyzer defines an HDR indicator (>8nt) by comparing the donor sequence with the reference sequence and classifies all query sequences that have the HDR indicator into the ‘HDR’ category.

2.3 Sequence alignment

For user convenience, after data analysis is complete, the results (a relatively small amount of data) are aligned to the reference sequence by using a JavaScript ported EMBOSS Needle (Rice ). The aligned results are categorized by mutation type and sorted in descending order by count. In addition, the position and size of insertions or deletions are depicted as interactive graphs on the results web page (Fig. 1E and F).

Funding

This work was supported by a grant of the Korea Healthcare technology R&D Project, Ministry for Health & Welfare Affairs (HI16C1012) to S.B. and Institute for Basic Science (IBS-R021-D1) to J.-S.K. Conflict of Interest: none declared. Click here for additional data file.
  25 in total

1.  Double nicking by RNA-guided CRISPR Cas9 for enhanced genome editing specificity.

Authors:  F Ann Ran; Patrick D Hsu; Chie-Yu Lin; Jonathan S Gootenberg; Silvana Konermann; Alexandro E Trevino; David A Scott; Azusa Inoue; Shogo Matoba; Yi Zhang; Feng Zhang
Journal:  Cell       Date:  2013-08-29       Impact factor: 41.582

2.  Targeted genome engineering in human cells with the Cas9 RNA-guided endonuclease.

Authors:  Seung Woo Cho; Sojung Kim; Jong Min Kim; Jin-Soo Kim
Journal:  Nat Biotechnol       Date:  2013-01-29       Impact factor: 54.908

3.  Genome editing assessment using CRISPR Genome Analyzer (CRISPR-GA).

Authors:  Marc Güell; Luhan Yang; George M Church
Journal:  Bioinformatics       Date:  2014-07-01       Impact factor: 6.937

4.  Targeted mutagenesis in mice by electroporation of Cpf1 ribonucleoproteins.

Authors:  Junho K Hur; Kyoungmi Kim; Kyung Wook Been; Gayoung Baek; Sunghyeok Ye; Junseok W Hur; Seuk-Min Ryu; Youn Su Lee; Jin-Soo Kim
Journal:  Nat Biotechnol       Date:  2016-06-06       Impact factor: 54.908

5.  Generation of knockout mice by Cpf1-mediated gene targeting.

Authors:  Yongsub Kim; Seung-A Cheong; Jong Geol Lee; Sang-Wook Lee; Myeong Sup Lee; In-Jeoung Baek; Young Hoon Sung
Journal:  Nat Biotechnol       Date:  2016-06-06       Impact factor: 54.908

6.  In vivo genome editing improves muscle function in a mouse model of Duchenne muscular dystrophy.

Authors:  Christopher E Nelson; Chady H Hakim; David G Ousterout; Pratiksha I Thakore; Eirik A Moreb; Ruth M Castellanos Rivera; Sarina Madhavan; Xiufang Pan; F Ann Ran; Winston X Yan; Aravind Asokan; Feng Zhang; Dongsheng Duan; Charles A Gersbach
Journal:  Science       Date:  2015-12-31       Impact factor: 47.728

7.  Targeted genome editing in human cells with zinc finger nucleases constructed via modular assembly.

Authors:  Hye Joo Kim; Hyung Joo Lee; Hyojin Kim; Seung Woo Cho; Jin-Soo Kim
Journal:  Genome Res       Date:  2009-05-21       Impact factor: 9.043

8.  Efficient genome engineering in human pluripotent stem cells using Cas9 from Neisseria meningitidis.

Authors:  Zhonggang Hou; Yan Zhang; Nicholas E Propson; Sara E Howden; Li-Fang Chu; Erik J Sontheimer; James A Thomson
Journal:  Proc Natl Acad Sci U S A       Date:  2013-08-12       Impact factor: 11.205

9.  In vivo genome editing using Staphylococcus aureus Cas9.

Authors:  F Ann Ran; Le Cong; Winston X Yan; David A Scott; Jonathan S Gootenberg; Andrea J Kriz; Bernd Zetsche; Ophir Shalem; Xuebing Wu; Kira S Makarova; Eugene V Koonin; Phillip A Sharp; Feng Zhang
Journal:  Nature       Date:  2015-04-01       Impact factor: 49.962

10.  RNA-programmed genome editing in human cells.

Authors:  Martin Jinek; Alexandra East; Aaron Cheng; Steven Lin; Enbo Ma; Jennifer Doudna
Journal:  Elife       Date:  2013-01-29       Impact factor: 8.140

View more
  99 in total

1.  CRISPResso2 provides accurate and rapid genome editing sequence analysis.

Authors:  Kendell Clement; Holly Rees; Matthew C Canver; Jason M Gehrke; Rick Farouni; Jonathan Y Hsu; Mitchel A Cole; David R Liu; J Keith Joung; Daniel E Bauer; Luca Pinello
Journal:  Nat Biotechnol       Date:  2019-03       Impact factor: 54.908

2.  Integrated design, execution, and analysis of arrayed and pooled CRISPR genome-editing experiments.

Authors:  Matthew C Canver; Maximilian Haeussler; Daniel E Bauer; Stuart H Orkin; Neville E Sanjana; Ophir Shalem; Guo-Cheng Yuan; Feng Zhang; Jean-Paul Concordet; Luca Pinello
Journal:  Nat Protoc       Date:  2018-04-12       Impact factor: 13.491

3.  Correction of a pathogenic gene mutation in human embryos.

Authors:  Hong Ma; Nuria Marti-Gutierrez; Sang-Wook Park; Jun Wu; Yeonmi Lee; Keiichiro Suzuki; Amy Koski; Dongmei Ji; Tomonari Hayama; Riffat Ahmed; Hayley Darby; Crystal Van Dyken; Ying Li; Eunju Kang; A-Reum Park; Daesik Kim; Sang-Tae Kim; Jianhui Gong; Ying Gu; Xun Xu; David Battaglia; Sacha A Krieg; David M Lee; Diana H Wu; Don P Wolf; Stephen B Heitner; Juan Carlos Izpisua Belmonte; Paula Amato; Jin-Soo Kim; Sanjiv Kaul; Shoukhrat Mitalipov
Journal:  Nature       Date:  2017-08-02       Impact factor: 49.962

Review 4.  Design and analysis of CRISPR-Cas experiments.

Authors:  Ruth E Hanna; John G Doench
Journal:  Nat Biotechnol       Date:  2020-04-13       Impact factor: 54.908

5.  CRISPR/Cas9 genome editing in wheat.

Authors:  Dongjin Kim; Burcu Alptekin; Hikmet Budak
Journal:  Funct Integr Genomics       Date:  2017-09-16       Impact factor: 3.410

6.  Lentiviral delivery of co-packaged Cas9 mRNA and a Vegfa-targeting guide RNA prevents wet age-related macular degeneration in mice.

Authors:  Sikai Ling; Shiqi Yang; Xinde Hu; Di Yin; Yao Dai; Xiaoqing Qian; Dawei Wang; Xiaoyong Pan; Jiaxu Hong; Xiaodong Sun; Hui Yang; Soren Riis Paludan; Yujia Cai
Journal:  Nat Biomed Eng       Date:  2021-01-04       Impact factor: 25.671

Review 7.  Technologies and Computational Analysis Strategies for CRISPR Applications.

Authors:  Kendell Clement; Jonathan Y Hsu; Matthew C Canver; J Keith Joung; Luca Pinello
Journal:  Mol Cell       Date:  2020-07-02       Impact factor: 17.970

8.  Decoding non-random mutational signatures at Cas9 targeted sites.

Authors:  Amir Taheri-Ghahfarokhi; Benjamin J M Taylor; Roberto Nitsch; Anders Lundin; Anna-Lina Cavallo; Katja Madeyski-Bengtson; Fredrik Karlsson; Maryam Clausen; Ryan Hicks; Lorenz M Mayr; Mohammad Bohlooly-Y; Marcello Maresca
Journal:  Nucleic Acids Res       Date:  2018-09-19       Impact factor: 16.971

9.  A robust and practical CRISPR/crRNA screening system for soybean cultivar editing using LbCpf1 ribonucleoproteins.

Authors:  Hyeran Kim; Jisun Choi
Journal:  Plant Cell Rep       Date:  2020-09-18       Impact factor: 4.570

10.  Transcriptomic and physiological analysis of OsCAO1 knockout lines using the CRISPR/Cas9 system in rice.

Authors:  Yu Jin Jung; Hyo Ju Lee; Jihyeon Yu; Sangsu Bae; Yong-Gu Cho; Kwon Kyoo Kang
Journal:  Plant Cell Rep       Date:  2020-09-27       Impact factor: 4.570

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.