The Zebrafish Insertion Collection (ZInC): a web based, searchable collection of zebrafish mutations generated by DNA insertion.

Gaurav K Varshney, Haigen Huang, Suiyuan Zhang, Jing Lu, Derek E Gildea, Zhongan Yang, Tyra G Wolfsberg, Shuo Lin, Shawn M Burgess.

Abstract

ZInC (Zebrafish Insertion Collection, http://research.nhgri.nih.gov/ZInC/) is a web-searchable collection of insertional mutants in zebrafish. Over the last two decades, the zebrafish has become a popular model organism for studying vertebrate development as well as for modeling human diseases. To facilitate such studies, we are generating a genome-wide knockout resource that targets every zebrafish protein-coding gene. All mutant fish are freely available to the scientific community through the Zebrafish International Resource Center (ZIRC). To assist researchers in finding mutant and insertion information, we developed ZInC, a comprehensive database with a web front-end. It can be queried using multiple types of input, such as ZFIN (Zebrafish Information Network) IDs, UniGene accession numbers and gene symbols from zebrafish, human and mouse. In the future, ZInC may also include data from other insertional mutation projects. ZInC cross-references all integration data with ZFIN (http://zfin.org/).

Year:  2012        PMID: 23180778      PMCID: PMC3531054          DOI: 10.1093/nar/gks946

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Functional genomic studies in model organisms, including vertebrates such as mice, frogs and fish, have contributed immensely to our understanding of various human diseases. Furthermore, it is now possible to systematically create gene mutations and then study the associated phenotypes (i.e. ‘reverse’ genetics). The zebrafish has gained momentum over the past two decades as a genetically tractable model organism in which to study vertebrate development because of its transparent embryos, small size, external (ex utero) development, high fecundity and inexpensive maintenance (1). Its embryonic development is fast, with most of the critical development occurring in the first 5 days. Traditionally, zebrafish genetics has relied on forward genetic screens, and two independent, large-scale mutagenesis screens have been carried out using N-ethyl-N-nitrosourea (ENU) as the mutagen (2,3). ENU is very efficient, but because it introduces single base-pair changes into DNA, the mutated gene typically has to be identified by positional cloning, a laborious and time-consuming method (4,5). To circumvent the positional cloning step, insertional mutagenesis strategies have been developed in different model organisms. Insertional elements such as transposons and retroviruses have become indispensable tools for manipulating genomes in various applications, including not only insertional mutagenesis but also transgenesis and gene therapy (6,7). Insertional mutagenesis using retroviral vectors has been used effectively to disrupt gene function in vertebrates (8,9). We are using high-throughput retroviral-mediated mutagenesis, followed by mapping with next-generation sequencing methods (10), to generate a knockout library of zebrafish. To make this mutagenic resource readily available to the research community, we have developed ZInC, the Zebrafish Insertion Collection, an integrated database and web front-end that displays mutagenic insertions in a simple and interactive way.

INSERTION DATA SOURCE

The experimental data in ZInC are derived from our ongoing retroviral-mediated insertional mutagenesis project (Varshney and Lu et al., unpublished). A graphical representation of the pipeline is included on the ZInC website.

INSERTION DATA PROCESSING

We developed a mapping analysis pipeline to identify insertion sites in the zebrafish genome. Sequence reads were extracted from ELAND or BAM output files generated by paired-end sequencing on the Illumina platform. To map insertion sites, retroviral vector and linker sequences were trimmed, and the trimmed reads were then mapped to the zebrafish genome (Ensembl Zv9.0 assembly) using the short read aligner Bowtie (12). Since ligation-mediated PCR (LM-PCR) was performed to isolate specific integration events, the same integration site could be sequenced multiple times. Therefore, sequence reads that mapped to the same chromosome, with the same genomic alignment start position and on the same DNA strand, were collapsed into a single integration event. The mapped integration sites were then compared to the genomic locations of annotated genes (Ensembl e65) to determine which integration sites are associated with genes, that is, which integration sites lie within exons or introns. Insertion events are flagged as ‘Predicted Mutagenic’ only if they fall within any exon or within the first intron.
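The collapsing and flagging rules described above can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline's actual code: the `Read` and `Feature` records, the function names and the inclusive coordinate convention are assumptions introduced here, and the exon and intron intervals are assumed to have been derived from the gene annotation beforehand.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass(frozen=True)
class Read:
    """A mapped read (hypothetical record; fields follow the text)."""
    chrom: str
    start: int   # genomic alignment start position
    strand: str  # "+" or "-"


@dataclass(frozen=True)
class Feature:
    """An annotated exon or intron (hypothetical record)."""
    kind: str    # "exon" or "intron"
    number: int  # 1-based rank within the transcript
    chrom: str
    start: int   # inclusive
    end: int     # inclusive


def collapse_reads(reads: Iterable[Read]) -> List[Read]:
    """Collapse reads that share chromosome, start and strand into one event."""
    return sorted(set(reads), key=lambda r: (r.chrom, r.start, r.strand))


def is_predicted_mutagenic(site: Read, features: Iterable[Feature]) -> bool:
    """Return True if the site lies in any exon or in a first intron."""
    for f in features:
        if f.chrom != site.chrom or not (f.start <= site.start <= f.end):
            continue
        if f.kind == "exon" or (f.kind == "intron" and f.number == 1):
            return True
    return False
```

With this rule, two reads at the same position but on opposite strands count as two events, and an insertion in the second intron of a gene is recorded but not flagged.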

DATABASE DESIGN AND IMPLEMENTATION

The ZInC website uses a Common Gateway Interface (CGI) written in the Perl programming language that interacts with a relational database hosted in Oracle 11g. The web interface was developed using HTML, Perl, JavaScript and Template Toolkit. Connectivity between the CGI and the Oracle 11g relational database management system was implemented using Perl's Database Interface (DBI) and the Oracle database driver for the DBI module (DBD::Oracle). The ZInC database consists of several tables that hold the data content, including zebrafish gene names and gene symbols, human and mouse gene orthologs, and zebrafish KEGG (Kyoto Encyclopedia of Genes and Genomes) information (13). The tables are populated by downloading data from DAVID (http://david.abcc.ncifcrf.gov/; KEGG only) (14,15) and the Zebrafish Information Network (ZFIN) (http://zfin.org/; all other annotations) (16); these data will be refreshed quarterly, or as available. The list of insertions is also updated as new experimental and bioinformatics analyses become available. A suite of Perl scripts was developed to add new data and annotations to ZInC and to ensure data integrity.
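As a rough illustration of this kind of design, the sketch below uses Python's built-in `sqlite3` module as a stand-in for Oracle and Perl DBI. The table names, column names and placeholder rows are hypothetical; the actual ZInC schema is not described here. The query uses a bound parameter, which plays the same role as a `?` placeholder in DBI.

```python
import sqlite3

# Hypothetical two-table layout: one row per gene, one row per integration.
SCHEMA = """
CREATE TABLE gene (
    zfin_id TEXT PRIMARY KEY,
    symbol  TEXT NOT NULL,
    name    TEXT NOT NULL
);
CREATE TABLE integration (
    id                  INTEGER PRIMARY KEY,
    zfin_id             TEXT NOT NULL REFERENCES gene(zfin_id),
    chrom               TEXT NOT NULL,
    position            INTEGER NOT NULL,
    strand              TEXT NOT NULL,
    predicted_mutagenic INTEGER NOT NULL
);
"""


def build_demo_db() -> sqlite3.Connection:
    """Create an in-memory database with made-up placeholder rows."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    conn.executemany(
        "INSERT INTO gene VALUES (?, ?, ?)",
        [
            ("ZDB-GENE-DEMO-1", "demo1", "demo gene 1"),
            ("ZDB-GENE-DEMO-2", "demo2", "demo gene 2"),
        ],
    )
    conn.execute(
        "INSERT INTO integration (zfin_id, chrom, position, strand, predicted_mutagenic) "
        "VALUES (?, ?, ?, ?, ?)",
        ("ZDB-GENE-DEMO-1", "chr1", 1000, "+", 1),
    )
    conn.commit()
    return conn


def genes_with_integration_counts(conn: sqlite3.Connection, symbol_pattern: str):
    """List matching genes and how many integrations each one has (may be 0)."""
    sql = """
        SELECT g.zfin_id, g.symbol, COUNT(i.id)
        FROM gene AS g
        LEFT JOIN integration AS i ON i.zfin_id = g.zfin_id
        WHERE g.symbol LIKE ?
        GROUP BY g.zfin_id, g.symbol
        ORDER BY g.symbol
    """
    return conn.execute(sql, (symbol_pattern,)).fetchall()
```

The `LEFT JOIN` is a design choice for the sketch: it keeps genes that have no integration in the result, so a front-end can list every matching gene and show an integration link only where one exists.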

DATABASE NAVIGATION

The navigation sidebar on the left side of each page provides links to the different sections of the web resource. As shown in Figure 1, the main entry point is the ‘Search’ link, which leads to a search page that provides access to the integration sites in the database. To facilitate searching, we provide a simple interface that accepts multiple types of input ID, such as Ensembl (e.g. ENSDARG00000010070), GenBank (e.g. BC081408), RefSeq (e.g. NM_001004678), UniGene (e.g. Dr.134464) or ZFIN (e.g. ZDB-GENE-040912-127) identifiers. The user can enter either a single ID or a list of IDs separated by commas, spaces or carriage returns. Users can also search by gene symbol (e.g. smo) or gene name (e.g. smoothened homolog) from zebrafish, human or mouse. A list of gene names or gene symbols can also be used as search input; again, the entries can be separated by commas, spaces or carriage returns. Researchers studying biochemical pathways can search by KEGG pathway (13) terms (e.g. Glycosphingolipid biosynthesis) to find insertions in genes in a specific pathway.
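To make the accepted input formats concrete, the following Python sketch splits a pasted ID list on commas, spaces and line breaks and guesses the identifier type from its shape. The regular expressions are assumptions inferred from the example identifiers above, not the validation rules used by ZInC, and the function names are hypothetical.

```python
import re
from typing import List

# Patterns inferred from the example identifiers in the text (assumptions).
_PATTERNS = [
    ("Ensembl", re.compile(r"^ENSDARG\d{11}$")),
    ("ZFIN", re.compile(r"^ZDB-GENE-\d{6}-\d+$")),
    ("RefSeq", re.compile(r"^[NX]M_\d+$")),
    ("UniGene", re.compile(r"^Dr\.\d+$")),
    ("GenBank", re.compile(r"^[A-Z]{1,2}\d{5,6}$")),
]


def split_ids(raw: str) -> List[str]:
    """Split user input on commas, whitespace and carriage returns."""
    return [tok for tok in re.split(r"[,\s]+", raw.strip()) if tok]


def classify(identifier: str) -> str:
    """Guess the identifier type; anything unrecognized is treated as a symbol or name."""
    for label, pattern in _PATTERNS:
        if pattern.match(identifier):
            return label
    return "symbol or name"
```

Splitting on whitespace suits accession lists only; a multi-word gene name such as ‘smoothened homolog’ would need a separate code path that keeps the words together.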
Figure 1.

ZInC search interface. Insertion sites within genes have been mapped to a variety of common identifiers. Users can query ZInC with accession numbers from a number of sources, including Ensembl, GenBank, RefSeq and ZFIN. Users can also query by human, mouse or zebrafish gene symbols and names, either individual entries or longer lists. Queries can also be performed on KEGG biological pathways. All searches allow for an exact match (is) or a query with wildcards (contains). In this instance, we searched for a mutant in the gene smoothened using the zebrafish symbol ‘smo’ and by choosing the ‘contains’ radio button, the search will return any gene symbol that has the text string ‘smo’ in it.
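The difference between the two match modes in this caption can be sketched as follows. Case-insensitive comparison is an assumption made for the sketch, `match_symbols` is a hypothetical helper, and the symbol list in the usage note is illustrative rather than actual ZInC output.

```python
from typing import Iterable, List


def match_symbols(query: str, symbols: Iterable[str], mode: str = "is") -> List[str]:
    """Return symbols matching the query exactly ("is") or as a substring ("contains")."""
    q = query.lower()
    if mode == "is":
        return [s for s in symbols if s.lower() == q]
    if mode == "contains":
        return [s for s in symbols if q in s.lower()]
    raise ValueError(f"unknown match mode: {mode!r}")
```

For an illustrative list `["smo", "smox", "shha", "cosmo1"]`, the query ‘smo’ returns only `smo` in ‘is’ mode, and `smo`, `smox` and `cosmo1` in ‘contains’ mode.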

The results of a search are shown in Figure 2. For each gene hit by the query term, the ZFIN ID, zebrafish gene symbol and zebrafish gene name are returned, regardless of whether that gene is disrupted by an integration. The presence of an integration is indicated by the link ‘View integration’ in the Integration column; whether the insertion is predicted to be mutagenic (i.e. it lands in an exon or the first intron) is marked in the final column. The results of following the ‘View integration’ link are shown in Figure 3. In brief, this page shows the integration position, allele number, Ensembl gene ID and, when available, a link to order fish through the Zebrafish International Resource Center (ZIRC). The insertion position is linked to the UCSC Genome Browser so that users can see the genomic context of each integration site. Each allele number is linked to the corresponding ZFIN allele page.
Figure 2.

ZInC search results. In Figure 1, the zebrafish gene symbol ‘smo’ was entered in the Search by Single Gene box. Since the ‘contains’ radio button was selected, all zebrafish gene symbols in ZFIN containing the text string ‘smo’ are returned, regardless of whether an integration in the gene is available. IDs in the ZFIN ID column link to ZFIN entries for specific genes. For those genes with an integration, more detailed information is available through the ‘View Integration’ link (Figure 3).

Figure 3.

ZInC integration site details. Clicking on the ‘View Integration’ link on a search results page (Figure 2) results in a detailed view of the integration site. The ‘Integration Position’ column links to the UCSC Genome Browser zoomed in to a 2 Kb window around the integration site, the ‘Ensembl Gene ID’ column links to the Ensembl gene page, and the ‘ORDER FISH’ column links directly to the ZIRC to purchase the desired mutant fish (if available).
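A link of this kind can be built by centering a 2 kb window on the integration position, as in the Python sketch below. The exact URL that ZInC generates is not given here; the `hgTracks` endpoint with `db` and `position` parameters is the general UCSC Genome Browser link form, and using `danRer7` as the database name for the Zv9 assembly is an assumption of this sketch. The function name and the example coordinates are hypothetical.

```python
from urllib.parse import urlencode

UCSC_HGTRACKS = "https://genome.ucsc.edu/cgi-bin/hgTracks"


def ucsc_window_url(chrom: str, position: int, window: int = 2000, db: str = "danRer7") -> str:
    """Build a UCSC Genome Browser URL centered on an integration site."""
    half = window // 2
    start = max(1, position - half)  # clamp at the chromosome start
    end = position + half
    query = urlencode({"db": db, "position": f"{chrom}:{start}-{end}"})
    return f"{UCSC_HGTRACKS}?{query}"
```

For a made-up site at `chr7:5000`, the position parameter becomes `chr7:4000-6000` (with the colon percent-encoded by `urlencode`).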


CONCLUSIONS

ZInC is part of an ongoing project in which we aim to knock out every protein-coding gene in the zebrafish genome. We will update the database at least quarterly with newly identified integration sites. Other groups are also attempting to knock out genes using different insertional elements, such as the Tol2 and Ac/Ds transposons (17,18). An effort is being made to integrate these and other similar data into ZInC, allowing it to serve as a central repository for all integration sites in zebrafish.

ACCESSIBILITY

ZInC can be accessed at http://research.nhgri.nih.gov/zinc. Comprehensive lists of all insertion sites, as well as the protocols and methods required for genotyping, can be downloaded from http://research.nhgri.nih.gov/zinc/?mode=downloads. All data are cross-referenced with gene and mutant data in ZFIN: http://zfin.org.

ACCESSION NUMBERS

As of August 2012, 13 316 insertion sequences have been submitted to the National Center for Biotechnology Information (NCBI) Genome Survey Sequence (GSS) database. The accession numbers are JS426363-JS426454, JS495495-JS496658, JS578512-JS583384, JS672208-JS672893, JS784708-JS785225 and JS876947-JS886733, and the BioSample ID is GSS: LIBGSS_038780. The full list of integrations is available from the Downloads page at: http://research.nhgri.nih.gov/zinc/?mode=downloads.

FUNDING

Intramural Research Program of the National Human Genome Research Institute, National Institutes of Health (to S.B. and T.W.) and R01 grant NIH DK084349 (to S.L.). Funding for open access charge: Intramural Research Program of the National Human Genome Research Institute, National Institutes of Health (to S.B. and T.W.) and R01 grant NIH DK084349 (to S.L.).

Conflict of interest statement. None declared.
  18 in total

1.  A large-scale insertional mutagenesis screen in zebrafish.

Authors:  A Amsterdam; S Burgess; G Golling; W Chen; Z Sun; K Townsend; S Farrington; M Haldi; N Hopkins
Journal:  Genes Dev       Date:  1999-10-15       Impact factor: 11.361

2.  The KEGG resource for deciphering the genome.

Authors:  Minoru Kanehisa; Susumu Goto; Shuichi Kawashima; Yasushi Okuno; Masahiro Hattori
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

3.  A transposon-mediated gene trap approach identifies developmentally regulated genes in zebrafish.

Authors:  Koichi Kawakami; Hisashi Takeda; Noriko Kawakami; Makoto Kobayashi; Naoto Matsuda; Masayoshi Mishina
Journal:  Dev Cell       Date:  2004-07       Impact factor: 12.270

Review 4.  Positional cloning of mutated zebrafish genes.

Authors:  W S Talbot; A F Schier
Journal:  Methods Cell Biol       Date:  1999       Impact factor: 1.441

5.  Insertional mutagenesis and rapid cloning of essential genes in zebrafish.

Authors:  N Gaiano; A Amsterdam; K Kawakami; M Allende; T Becker; N Hopkins
Journal:  Nature       Date:  1996-10-31       Impact factor: 49.962

6.  Trans-kingdom transposition of the maize dissociation element.

Authors:  Alexander Emelyanov; Yuan Gao; Naweed Isaak Naqvi; Serguei Parinov
Journal:  Genetics       Date:  2006-09-01       Impact factor: 4.562

7.  Efficient recovery of ENU-induced mutations from the zebrafish germline.

Authors:  L Solnica-Krezel; A F Schier; W Driever
Journal:  Genetics       Date:  1994-04       Impact factor: 4.562

Review 8.  Genetics and early development of zebrafish.

Authors:  C B Kimmel
Journal:  Trends Genet       Date:  1989-08       Impact factor: 11.639

9.  A genetic screen for mutations affecting embryogenesis in zebrafish.

Authors:  W Driever; L Solnica-Krezel; A F Schier; S C Neuhauss; J Malicki; D L Stemple; D Y Stainier; F Zwartkruis; S Abdelilah; Z Rangini; J Belak; C Boggs
Journal:  Development       Date:  1996-12       Impact factor: 6.868

10.  The identification of genes with unique and essential functions in the development of the zebrafish, Danio rerio.

Authors:  P Haffter; M Granato; M Brand; M C Mullins; M Hammerschmidt; D A Kane; J Odenthal; F J van Eeden; Y J Jiang; C P Heisenberg; R N Kelsh; M Furutani-Seiki; E Vogelsang; D Beuchle; U Schach; C Fabian; C Nüsslein-Volhard
Journal:  Development       Date:  1996-12       Impact factor: 6.868
