Literature DB >> 26438539

CRISPRz: a database of zebrafish validated sgRNAs.

Gaurav K Varshney1, Suiyuan Zhang2, Wuhong Pei1, Ashrifia Adomako-Ankomah1, Jacob Fohtung1, Katherine Schaffer1, Blake Carrington3, Anoo Maskeri1, Claire Slevin1, Tyra Wolfsberg2, Johan Ledin4, Raman Sood3, Shawn M Burgess5.   

Abstract

CRISPRz (http://research.nhgri.nih.gov/CRISPRz/) is a database of CRISPR/Cas9 target sequences that have been experimentally validated in zebrafish. Programmable RNA-guided CRISPR/Cas9 has recently emerged as a simple and efficient genome editing method in various cell types and organisms, including zebrafish. Because the technique is so easy and efficient in zebrafish, the most valuable asset is no longer a mutated fish (which has distribution challenges), but rather a CRISPR/Cas9 target sequence to the gene confirmed to have high mutagenic efficiency. With a highly active CRISPR target, a mutant fish can be quickly replicated in any genetic background anywhere in the world. However, sgRNA's vary widely in their activity and models for predicting target activity are imperfect. Thus, it is very useful to collect in one place validated CRISPR target sequences with their relative mutagenic activities. A researcher could then select a target of interest in the database with an expected activity. Here, we report the development of CRISPRz, a database of validated zebrafish CRISPR target sites collected from published sources, as well as from our own in-house large-scale mutagenesis project. CRISPRz can be searched using multiple inputs such as ZFIN IDs, accession number, UniGene ID, or gene symbols from zebrafish, human and mouse. Published by Oxford University Press on behalf of Nucleic Acids Research 2015. This work is written by (a) US Government employee(s) and is in the public domain in the US.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 26438539      PMCID: PMC4702947          DOI: 10.1093/nar/gkv998

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Clustered regularly interspaced short palindromic repeats (CRISPR) and CRISPR-associated protein (Cas9) is an acquired immunity system found in archea and bacteria that protects against invading viruses, baceteriophages and other exogenous DNA (1–3). The Cas9 is a programmable, RNA-guided endonuclease that functions together with CRISPR RNA crRNA and transactivating crRNA (tracrRNA) to create a DNA target site for cleavage (4). The CRISPR/Cas9 system has been exploited by researchers as an in vivo genome-editing tool in a wide variety of cell types and organisms, including zebrafish (5–9). The bipartite RNA-guided component was further simplified into a single-guide RNA (sgRNA) (4,6) the sequence of which directs Cas9 to a target site and causes a double-stranded break. That cleavage site is then either repaired using error-prone non-homologous end joining (NHEJ) or homology directed repair (HDR) when a donor DNA is present. CRISPR/Cas9 has been used in a variety of applications such as generating gene knockouts via indels, making precise sequence alteration in the genome with knock-in strategies, chromosomal rearrangements, transcriptional activation, transcriptional repression, conditional mutagenesis, genome-wide screens, genomic locus imaging, and human therapeutics (10,11). In zebrafish, CRISPR/Cas9 has been shown to work for both generating gene knockouts (8,12) and integrating exogenous DNA into a specific locus (i.e. knock-ins) (13–15). Efficient guide RNAs have been shown to generate bi-allelic mutations in zebrafish, and phenotypes can often be observed in injected embryos (8), making it possible to perform large-scale phenotypic screening in injected embryos (16). The modular nature of CRISPR/Cas9 also allows targeting multiple genes simultaneously. With approximately 20% of the zebrafish genome duplicated, making double or triple mutants simultaneously is an important application of CRISPR/Cas9 mutagenesis for zebrafish. Recently, we showed that CRISPR/Cas9 is six-times more efficient than two other genome-editing techniques, ZFNs and TALENs, in generating germline mutations (12). Two other large-scale studies measured somatic activities for more than 1000 genomic targets in zebrafish (17,18) and many other studies are using CRISPR/Cas9 in various applications (13,14,16,19–26). Given the increasing popularity of zebrafish as a model organism (27) and the rapid adoption of CRISPR/Cas9 genome editing in zebrafish, an integrated resource for all CRISPR target sites that have been validated is useful to the community. Here, we report the development of CRISPRz, the first database of experimentally validated CRISPR/Cas9 target sequences from our ongoing large-scale genome-editing project as well as other published sources.

MATERIALS AND METHODS

Data sources

CRISPRz contains data from published sources as well as our ongoing genome-editing project (Figure 1). Currently, CRISPRz has 479 validated targets from 10 sources (5,7,8,12,14–16,18,25,28). There are 146 targets that have not been previously published and 333 targets that have been extracted from published sources. The somatic and germline activities were measured using different methods depending on the source. Each target is assigned an activity based on the techniques used for each published CRISPR target (12). We used the % mutagenesis rate based on germline transmission for each target. For our in-house mutagenesis project, the CRISPR targets were designed using a Browser Extensible Data (BED) track that contains 18 367 469 CRISPR targets in the zebrafish genome (13). For each gene, two targets were chosen and the sgRNA was prepared in vitro from a synthesized template. The two sgRNAs and Cas9 was injected into one-cell stage embryos and the embryos were raised to adults. To determine germline mutagenesis activity, the ‘founder’ fish were either outcrossed or inbred to generate F1 embryos and the mutagenesis activity was determined using fluorescent PCR (29). The somatic activity was measured using the CRISPR-STAT assay (30) from 2-day post-fertilization embryos. We also obtained data from published sources: the target sequences, their mutagenesis activity and genotyping primers, the genomic co-ordinates and protoacceptor-motif (PAM).
Figure 1.

Overview of data collection methods. (A) Two sgRNA and Cas9 was injected into 1-cell stage embryos and the somatic and germline mutagenesis activities were measured according to Carrington et al. (30) and Varshney et al. (12). (B) CRISPR target sequences were extracted from the published sources and related information was obtained from the UCSC genome browser.

Overview of data collection methods. (A) Two sgRNA and Cas9 was injected into 1-cell stage embryos and the somatic and germline mutagenesis activities were measured according to Carrington et al. (30) and Varshney et al. (12). (B) CRISPR target sequences were extracted from the published sources and related information was obtained from the UCSC genome browser.

Database structure and implementation

The CRISPRz database is hosted on an Apache web server (2.2.15) running Scientific Linux release 6.6 (Carbon) and utilizes the Common Gateway Interface (CGI), developed using perl programming, that connects with an Oracle 11g relational database. The Web site uses Perl, HTML, Java Script and Template toolkit, and the search function was developed in Perl and CGI. The connectivity between the CGI and the database was implemented using Perl's database Interface (DBI) module, and the Oracle database driver for the DBI module (DBD::oracle). The database is composed of tables that store data content, including the zebrafish gene names and gene symbols, as well as the human and mouse orthologs. The gene annotation data and orthologs were downloaded from the Zebrafish Information Network (http://zfin.org). A set of Perl scripts was developed to add new data and annotation into CRISPRz.

RESULTS

Database navigation

The CRISPRz Web interface has a navigation sidebar that provides links to various sections of the database. Users can search CRISPRz by clicking on the Search link on the left sidebar. We provide a user-friendly search interface that accepts multiple inputs and allows users to search using different identifiers. There are four different ways to search CRISPRz - search by ID, search by single gene, search by gene list and search by the source lab or publication. The search by ID accepts different identifiers such as Ensembl (e.g. ENSDARG00000039077), GenBank (e.g. BC133731), RefSeq (e.g. NM_13108), UniGene (e.g. Dr.75081) or ZFIN (e.g. ZDB-GENE-990415-171). The user can use either a single ID or a list of IDs separated by commas, spaces or carriage returns. The search by single gene accepts gene symbols (e.g. tyr) or gene name (e.g. tyrosinase) from zebrafish, mouse, or human. CRISPRz also has a bulk search option that allows users to use a list of gene names or gene symbols separated by commas, spaces or carriage returns. Since CRISPRz contains data from various labs, users can search by the source lab (e.g. Shawn Burgess or Chen). CRISPRz can also be searched using the first author's last name for the publication (e.g. Jao).

CRISPRz search output

An example of the result of a search is shown in Figure 2. For each query term, the CRISPRz ID, zebrafish gene name, chromosome number, target sequence, PAM site, Cas9 used (e.g. Streptococcus pyogens (S.p.)), the somatic or germline activity (% mutagenesis rate or active/inactive) with the identification method (e.g. sequencing or fluorescent PCR or surveyor assay or T7 endonuclease assay), genotyping primers, source lab and the reference if available are shown. Each gene is linked to the ZFIN gene page and the target sequence is linked to the UCSC genome browser track allowing users to locate the position of the target in the genome. The reference link is connected to PubMed.
Figure 2.

CRISPRz search interface and search output. (A) The CRISPRz database can be queried with accession numbers from a number of sources, including Ensembl, GenBank, RefSeq and ZFIN. Users can also query by human, mouse, or zebrafish gene symbols and gene names, either single entry or multiple entries. CRISPRz can also be searched using a source lab name or using the last name of the first author of a publication. All searches allow for an exact match (is) or a query with wildcards (contains) (B) The zebrafish gene symbol ‘ptp’ was entered in the Search by Single Gene box. Since the ‘contains’ radio button was selected, all zebrafish gene symbols in ZFIN containing the text string ‘ptp’ are returned. Each gene name is linked to ZFIN entries for specific genes.

CRISPRz search interface and search output. (A) The CRISPRz database can be queried with accession numbers from a number of sources, including Ensembl, GenBank, RefSeq and ZFIN. Users can also query by human, mouse, or zebrafish gene symbols and gene names, either single entry or multiple entries. CRISPRz can also be searched using a source lab name or using the last name of the first author of a publication. All searches allow for an exact match (is) or a query with wildcards (contains) (B) The zebrafish gene symbol ‘ptp’ was entered in the Search by Single Gene box. Since the ‘contains’ radio button was selected, all zebrafish gene symbols in ZFIN containing the text string ‘ptp’ are returned. Each gene name is linked to ZFIN entries for specific genes.

CRISPR target tracks

We previously pre-computed all possible CRISPR targets in the zebrafish genome and generated a Browser Extensible Data (BED) track that contains 18 367 469 CRISPR target sites (12). This data hub is hosted on the UCSC genome browser and is available for upload in Ensembl. CRISPRz hosts a link to the data hub on the UCSC genome browser. Users can select a CRISPR target from the UCSC genome browser by searching for their gene name or the gene's genomic coordinates.

Methods and protocols

CRISPRz hosts protocols for making sgRNA, microinjection, genotyping (12), somatic activity measurement by fluorescent PCR (CRISPR-STAT) (30) and a calculator for determining the amount of sgRNA and Cas9 used for microinjection (CRISPR-CALC).

Data submission

We have generated a template in Excel that will allow users to submit their data to CRISPRz by email. All user-submitted data will be reviewed and verified manually for a consistent format and then moved to the database. In the future, we will provide an online submission for entering data.

Accessibility

CRISPRz can be accessed at http://research.nhgri.nih.gov/crisprz. The CRISPRz data can also be downloaded in CSV format.

CONCLUSIONS

CRISPRz was developed in an effort to provide a comprehensive list of validated CRISPR targets from published sources as well as from an ongoing genome-wide knockout project in the zebrafish genome. Data will be added as more validated CRISPR targets are published or contributed from unpublished, in-house projects. The database is also open for data submission from the research community. An effort is being made to cross-reference CRISPRz with the Zebrafish Information Network (ZFIN) database. CRISPRz will host the most up-to-date protocols and methods from the Burgess lab. We believe by providing a list of validated CRIPSR targets, the community will save significant time and resources.
  28 in total

1.  The CRISPR/Cas bacterial immune system cleaves bacteriophage and plasmid DNA.

Authors:  Josiane E Garneau; Marie-Ève Dupuis; Manuela Villion; Dennis A Romero; Rodolphe Barrangou; Patrick Boyaval; Christophe Fremaux; Philippe Horvath; Alfonso H Magadán; Sylvain Moineau
Journal:  Nature       Date:  2010-11-04       Impact factor: 49.962

2.  Expansion of CRISPR/Cas9 genome targeting sites in zebrafish by Csy4-based RNA processing.

Authors:  Wei Qin; Fang Liang; Yan Feng; Haipeng Bai; Ruibin Yan; Song Li; Shuo Lin
Journal:  Cell Res       Date:  2015-08-04       Impact factor: 25.617

Review 3.  Expanding the Biologist's Toolkit with CRISPR-Cas9.

Authors:  Samuel H Sternberg; Jennifer A Doudna
Journal:  Mol Cell       Date:  2015-05-21       Impact factor: 17.970

4.  Rapid reverse genetic screening using CRISPR in zebrafish.

Authors:  Arish N Shah; Crystal F Davey; Alex C Whitebirch; Adam C Miller; Cecilia B Moens
Journal:  Nat Methods       Date:  2015-04-13       Impact factor: 28.547

5.  A CRISPR/Cas9 vector system for tissue-specific gene disruption in zebrafish.

Authors:  Julien Ablain; Ellen M Durand; Song Yang; Yi Zhou; Leonard I Zon
Journal:  Dev Cell       Date:  2015-03-05       Impact factor: 12.270

6.  The Streptococcus thermophilus CRISPR/Cas system provides immunity in Escherichia coli.

Authors:  Rimantas Sapranauskas; Giedrius Gasiunas; Christophe Fremaux; Rodolphe Barrangou; Philippe Horvath; Virginijus Siksnys
Journal:  Nucleic Acids Res       Date:  2011-08-03       Impact factor: 16.971

7.  Engineered CRISPR-Cas9 nucleases with altered PAM specificities.

Authors:  Benjamin P Kleinstiver; Michelle S Prew; Shengdar Q Tsai; Ved V Topkar; Nhu T Nguyen; Zongli Zheng; Andrew P W Gonzales; Zhuyun Li; Randall T Peterson; Jing-Ruey Joanna Yeh; Martin J Aryee; J Keith Joung
Journal:  Nature       Date:  2015-06-22       Impact factor: 49.962

8.  Intron targeting-mediated and endogenous gene integrity-maintaining knockin in zebrafish using the CRISPR/Cas9 system.

Authors:  Jia Li; Bai-bing Zhang; Yong-gang Ren; Shan-ye Gu; Yuan-hang Xiang; Jiu-lin Du
Journal:  Cell Res       Date:  2015-04-07       Impact factor: 25.617

9.  High-throughput gene targeting and phenotyping in zebrafish using CRISPR/Cas9.

Authors:  Gaurav K Varshney; Wuhong Pei; Matthew C LaFave; Jennifer Idol; Lisha Xu; Viviana Gallardo; Blake Carrington; Kevin Bishop; MaryPat Jones; Mingyu Li; Ursula Harper; Sunny C Huang; Anupam Prakash; Wenbiao Chen; Raman Sood; Johan Ledin; Shawn M Burgess
Journal:  Genome Res       Date:  2015-06-05       Impact factor: 9.043

10.  CRISPRscan: designing highly efficient sgRNAs for CRISPR-Cas9 targeting in vivo.

Authors:  Miguel A Moreno-Mateos; Charles E Vejnar; Jean-Denis Beaudoin; Juan P Fernandez; Emily K Mis; Mustafa K Khokha; Antonio J Giraldez
Journal:  Nat Methods       Date:  2015-08-31       Impact factor: 28.547

View more
  22 in total

1.  A subset of SMN complex members have a specific role in tissue regeneration via ERBB pathway-mediated proliferation.

Authors:  Wuhong Pei; Lisha Xu; Zelin Chen; Claire C Slevin; Kade P Pettie; Stephen Wincovitch; Shawn M Burgess
Journal:  NPJ Regen Med       Date:  2020-03-25

Review 2.  Advancements in zebrafish applications for 21st century toxicology.

Authors:  Gloria R Garcia; Pamela D Noyes; Robert L Tanguay
Journal:  Pharmacol Ther       Date:  2016-03-22       Impact factor: 12.310

Review 3.  Zebrafish Genome Engineering Using the CRISPR-Cas9 System.

Authors:  Mingyu Li; Liyuan Zhao; Patrick S Page-McCaw; Wenbiao Chen
Journal:  Trends Genet       Date:  2016-11-08       Impact factor: 11.639

4.  A high-throughput functional genomics workflow based on CRISPR/Cas9-mediated targeted mutagenesis in zebrafish.

Authors:  Gaurav K Varshney; Blake Carrington; Wuhong Pei; Kevin Bishop; Zelin Chen; Chunxin Fan; Lisha Xu; Marypat Jones; Matthew C LaFave; Johan Ledin; Raman Sood; Shawn M Burgess
Journal:  Nat Protoc       Date:  2016-10-27       Impact factor: 13.491

5.  Genome-wide analysis identifies 12 loci influencing human reproductive behavior.

Authors:  Nicola Barban; Rick Jansen; Ronald de Vlaming; Ahmad Vaez; Jornt J Mandemakers; Felix C Tropf; Xia Shen; James F Wilson; Daniel I Chasman; Ilja M Nolte; Vinicius Tragante; Sander W van der Laan; John R B Perry; Augustine Kong; Tarunveer S Ahluwalia; Eva Albrecht; Laura Yerges-Armstrong; Gil Atzmon; Kirsi Auro; Kristin Ayers; Andrew Bakshi; Danny Ben-Avraham; Klaus Berger; Aviv Bergman; Lars Bertram; Lawrence F Bielak; Gyda Bjornsdottir; Marc Jan Bonder; Linda Broer; Minh Bui; Caterina Barbieri; Alana Cavadino; Jorge E Chavarro; Constance Turman; Maria Pina Concas; Heather J Cordell; Gail Davies; Peter Eibich; Nicholas Eriksson; Tõnu Esko; Joel Eriksson; Fahimeh Falahi; Janine F Felix; Mark Alan Fontana; Lude Franke; Ilaria Gandin; Audrey J Gaskins; Christian Gieger; Erica P Gunderson; Xiuqing Guo; Caroline Hayward; Chunyan He; Edith Hofer; Hongyan Huang; Peter K Joshi; Stavroula Kanoni; Robert Karlsson; Stefan Kiechl; Annette Kifley; Alexander Kluttig; Peter Kraft; Vasiliki Lagou; Cecile Lecoeur; Jari Lahti; Ruifang Li-Gao; Penelope A Lind; Tian Liu; Enes Makalic; Crysovalanto Mamasoula; Lindsay Matteson; Hamdi Mbarek; Patrick F McArdle; George McMahon; S Fleur W Meddens; Evelin Mihailov; Mike Miller; Stacey A Missmer; Claire Monnereau; Peter J van der Most; Ronny Myhre; Mike A Nalls; Teresa Nutile; Ioanna Panagiota Kalafati; Eleonora Porcu; Inga Prokopenko; Kumar B Rajan; Janet Rich-Edwards; Cornelius A Rietveld; Antonietta Robino; Lynda M Rose; Rico Rueedi; Kathleen A Ryan; Yasaman Saba; Daniel Schmidt; Jennifer A Smith; Lisette Stolk; Elizabeth Streeten; Anke Tönjes; Gudmar Thorleifsson; Sheila Ulivi; Juho Wedenoja; Juergen Wellmann; Peter Willeit; Jie Yao; Loic Yengo; Jing Hua Zhao; Wei Zhao; Daria V Zhernakova; Najaf Amin; Howard Andrews; Beverley Balkau; Nir Barzilai; Sven Bergmann; Ginevra Biino; Hans Bisgaard; Klaus Bønnelykke; Dorret I Boomsma; Julie E Buring; Harry Campbell; Stefania Cappellani; Marina Ciullo; Simon R Cox; Francesco Cucca; Daniela Toniolo; George Davey-Smith; Ian J Deary; George Dedoussis; Panos Deloukas; Cornelia M van Duijn; Eco J C de Geus; Johan G Eriksson; Denis A Evans; Jessica D Faul; Cinzia Felicita Sala; Philippe Froguel; Paolo Gasparini; Giorgia Girotto; Hans-Jörgen Grabe; Karin Halina Greiser; Patrick J F Groenen; Hugoline G de Haan; Johannes Haerting; Tamara B Harris; Andrew C Heath; Kauko Heikkilä; Albert Hofman; Georg Homuth; Elizabeth G Holliday; John Hopper; Elina Hyppönen; Bo Jacobsson; Vincent W V Jaddoe; Magnus Johannesson; Astanand Jugessur; Mika Kähönen; Eero Kajantie; Sharon L R Kardia; Bernard Keavney; Ivana Kolcic; Päivikki Koponen; Peter Kovacs; Florian Kronenberg; Zoltan Kutalik; Martina La Bianca; Genevieve Lachance; William G Iacono; Sandra Lai; Terho Lehtimäki; David C Liewald; Cecilia M Lindgren; Yongmei Liu; Robert Luben; Michael Lucht; Riitta Luoto; Per Magnus; Patrik K E Magnusson; Nicholas G Martin; Matt McGue; Ruth McQuillan; Sarah E Medland; Christa Meisinger; Dan Mellström; Andres Metspalu; Michela Traglia; Lili Milani; Paul Mitchell; Grant W Montgomery; Dennis Mook-Kanamori; Renée de Mutsert; Ellen A Nohr; Claes Ohlsson; Jørn Olsen; Ken K Ong; Lavinia Paternoster; Alison Pattie; Brenda W J H Penninx; Markus Perola; Patricia A Peyser; Mario Pirastu; Ozren Polasek; Chris Power; Jaakko Kaprio; Leslie J Raffel; Katri Räikkönen; Olli Raitakari; Paul M Ridker; Susan M Ring; Kathryn Roll; Igor Rudan; Daniela Ruggiero; Dan Rujescu; Veikko Salomaa; David Schlessinger; Helena Schmidt; Reinhold Schmidt; Nicole Schupf; Johannes Smit; Rossella Sorice; Tim D Spector; John M Starr; Doris Stöckl; Konstantin Strauch; Michael Stumvoll; Morris A Swertz; Unnur Thorsteinsdottir; A Roy Thurik; Nicholas J Timpson; Joyce Y Tung; André G Uitterlinden; Simona Vaccargiu; Jorma Viikari; Veronique Vitart; Henry Völzke; Peter Vollenweider; Dragana Vuckovic; Johannes Waage; Gert G Wagner; Jie Jin Wang; Nicholas J Wareham; David R Weir; Gonneke Willemsen; Johann Willeit; Alan F Wright; Krina T Zondervan; Kari Stefansson; Robert F Krueger; James J Lee; Daniel J Benjamin; David Cesarini; Philipp D Koellinger; Marcel den Hoed; Harold Snieder; Melinda C Mills
Journal:  Nat Genet       Date:  2016-10-31       Impact factor: 38.330

Review 6.  Zebrafish as a Translational Model: An Experimental Alternative to Study the Mechanisms Involved in Anosmia and Possible Neurodegenerative Aspects of COVID-19?

Authors:  Karla C M Costa; Tamires A V Brigante; Gabriel G Fernandes; Davi S Scomparin; Franciele F Scarante; Danielle P de Oliveira; Alline C Campos
Journal:  eNeuro       Date:  2021-06-02

7.  ge-CRISPR - An integrated pipeline for the prediction and analysis of sgRNAs genome editing efficiency for CRISPR/Cas system.

Authors:  Karambir Kaur; Amit Kumar Gupta; Akanksha Rajput; Manoj Kumar
Journal:  Sci Rep       Date:  2016-09-01       Impact factor: 4.379

8.  GenomeCRISPR - a database for high-throughput CRISPR/Cas9 screens.

Authors:  Benedikt Rauscher; Florian Heigwer; Marco Breinig; Jan Winter; Michael Boutros
Journal:  Nucleic Acids Res       Date:  2016-10-26       Impact factor: 16.971

9.  Guided genetic screen to identify genes essential in the regeneration of hair cells and other tissues.

Authors:  Wuhong Pei; Lisha Xu; Sunny C Huang; Kade Pettie; Jennifer Idol; Alberto Rissone; Erin Jimenez; Jason W Sinclair; Claire Slevin; Gaurav K Varshney; MaryPat Jones; Blake Carrington; Kevin Bishop; Haigen Huang; Raman Sood; Shuo Lin; Shawn M Burgess
Journal:  NPJ Regen Med       Date:  2018-06-04

Review 10.  Opportunities for CRISPR/Cas9 Gene Editing in Retinal Regeneration Research.

Authors:  Leah J Campbell; David R Hyde
Journal:  Front Cell Dev Biol       Date:  2017-11-23
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.