Gallus GBrowse: a unified genomic database for the chicken.

Carl J Schmidt, Michael Romanov, Oliver Ryder, Vincent Magrini, Matthew Hickenbotham, Jarret Glasscock, Sean McGrath, Elaine Mardis, Lincoln D Stein.

Abstract

Gallus GBrowse (http://birdbase.net/cgi-bin/gbrowse/gallus/) provides online access to genomic and other information about the chicken, Gallus gallus. The information provided by this resource includes predicted genes and Gene Ontology (GO) terms, links to Gallus In Situ Hybridization Analysis (GEISHA), Unigene and Reactome, the genomic positions of chicken genetic markers, SNPs and microarray probes, and mappings from turkey, condor and zebra finch DNA and EST sequences to the chicken genome. We also provide a BLAT server (http://birdbase.net/cgi-bin/webBlat) for matching user-provided sequences to the chicken genome. These tools make the Gallus GBrowse server a valuable resource for researchers seeking genomic information regarding the chicken and other avian species.

Year:  2007        PMID: 17933775      PMCID: PMC2238981          DOI: 10.1093/nar/gkm783

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

The chicken (Gallus gallus) has played important roles in both scientific research and the general health and welfare of humans. For example, in the field of developmental biology, the chicken embryo model has provided insight into many developmental processes including cell migration (1–3), limb development (4,5) and eye formation (6–8). The discovery of avian oncogenic viruses helped highlight the importance of specific genes in tumorigenesis, and the chicken continues to be a popular model system for cancer and other diseases (9–11). As a food source, the chicken was domesticated in Asia ∼7000–10 000 years ago and has undergone intensive selection for both egg and meat production over the past 60–70 years. In 2005, the United States alone produced and consumed 30 billion pounds and exported another 5 billion pounds of chicken meat (source: USDA National Agricultural Statistics Service). In that same year, 90 billion eggs were produced in the United States. Clearly, the chicken plays an important role as both a model organism and a food resource.

An enormous amount of genomic information and resources is available for the chicken. The genomic sequence of ∼1 billion nucleotides was completed (12) and released in 2004 and then updated in 2006. A total of 3 335 290 SNPs (13) have been deposited in GenBank and over 1000 microsatellite (MS) and other genetic markers have been identified (14,15). At least five microarray platforms are available, and the Gallus In Situ Hybridization Analysis (GEISHA) (16) project is providing detailed descriptions of the embryonic expression pattern of many chicken genes. A centralized, web-accessible chicken database would provide a valuable resource for common access to these data. To begin providing such a resource, we have developed a Generic Model Organism Database (GMOD) (17) Gallus GBrowse site along with a BLAT server for searching the chicken genome. This site provides access to many chicken resources, along with mappings of turkey, condor and zebra finch nucleotide sequences to the chicken genome.

GALLUS GBROWSE DATA

The draft chicken genomic sequence (V2.1), produced by the Genome Sequencing Center at Washington University in St. Louis, was downloaded from the UCSC Genome Browser Gateway. The GMOD GBrowse viewer (17), in combination with a MySQL database management system, is used to store, search and display annotation of the chicken genome. The GBrowse web page provides user access and is organized along the following themes: genes, gene expression platforms, gene expression data, Gene Ontology (GO) and pathways, markers and SNPs, and other avian species.

Genes

The gene positions were defined based upon NCBI RefSeq and Ensembl cDNA predictions. These are provided as separate tracks in the GBrowse. In addition, predicted non-coding RNA genes and exon/intron positions are provided based on Ensembl predictions.

Gene expression platforms

These tracks show the positions of probes from five array platforms in the context of the chicken genome. Probe sequences for the Delmar (18), Avian Macrophage (19,20), Chicken 13K (21) and the Chicken Oligo microarray (http://www.grl.steelecenter.arizona.edu/products.asp) were aligned with the chicken genomic sequence using BLAT (22). The probe positions for the Affymetrix Chicken Genome Array were obtained from the NetAffx alignment file provided by Affymetrix.
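
As an illustration of how a BLAT alignment might become a browser track, the following minimal Python sketch converts one PSL record (BLAT's tabular output) into a GFF3 line. The article does not describe the scripts actually used; the function name psl_to_gff3, the source and feature_type labels and the sample probe hit (including its placeholder chromosome size) are hypothetical.

```python
def psl_to_gff3(psl_line, source="BLAT", feature_type="microarray_probe"):
    """Convert one tab-separated PSL record (21 columns) to a GFF3 line.

    PSL target coordinates are 0-based and half-open; GFF3 coordinates
    are 1-based and inclusive, so only the start needs shifting.
    """
    fields = psl_line.rstrip("\n").split("\t")
    if len(fields) != 21:
        raise ValueError("expected 21 PSL columns, got %d" % len(fields))
    matches, q_size = int(fields[0]), int(fields[10])
    strand, q_name, t_name = fields[8], fields[9], fields[13]
    t_start, t_end = int(fields[15]), int(fields[16])
    # Score used here: percentage of the probe length that matched.
    score = 100.0 * matches / q_size
    return "\t".join([
        t_name, source, feature_type,
        str(t_start + 1), str(t_end),
        "%.1f" % score,
        strand[0],  # first character is the query strand
        ".",
        "Name=%s" % q_name,
    ])


# Hypothetical hit: a 60-mer probe matching chr19 without mismatches.
# The target size (column 15) is a placeholder, not a real chromosome length.
example = "\t".join([
    "60", "0", "0", "0", "0", "0", "0", "0", "+", "probe_0001", "60",
    "0", "60", "chr19", "10000000", "5130000", "5130060", "1",
    "60,", "0,", "5130000,",
])
print(psl_to_gff3(example))
```

A real pipeline would also need to decide how to treat probes with several hits or with gapped alignments (blockCount greater than 1); this sketch reports only the overall target span.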

Gene expression

Currently, two sets of gene expression data are accessed from Gallus GBrowse: GEISHA (16) and Unigene (23). The GEISHA project aims to describe the expression pattern of genes in the chicken embryo between Hamburger and Hamilton stages 1–25. The Unigene information is derived from the Unigene expression profiler, which describes the expression pattern for a gene based on EST analysis.

Gene ontologies and pathways

One set of tracks displays GO (24,25) terms for a given gene. GO terms were obtained from the Gene Ontology Annotation (GOA) Database via the NCBI database gene2go file. Hovering the mouse over the glyph will display the assigned GO term, while clicking on the link will connect to the AmiGO term definition. Reactome (26,27) is a human-centric curated knowledge base of biological pathways; pathways for other species are predicted from gene ortholog relationships. The Gallus GBrowse Reactome glyph links to the gene summary page in the Reactome knowledge base for the corresponding chicken gene. From the Reactome summary page, one can then access all pertinent information regarding the gene, including the reactions, pathways and molecular complexes in which the gene product participates, as well as the gene's orthologs in human and other model species.
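
To make the gene2go step concrete, here is a minimal Python sketch that collects GO terms per gene for one species. It assumes the tab-delimited gene2go column order tax_id, GeneID, GO_ID, Evidence, Qualifier, GO_term, PubMed, Category and the NCBI Taxonomy ID 9031 for Gallus gallus. The function name load_go_terms and the sample rows (including the gene IDs) are illustrative and are not taken from the article.

```python
import csv
import io
from collections import defaultdict

CHICKEN_TAX_ID = "9031"  # NCBI Taxonomy ID for Gallus gallus


def load_go_terms(handle, tax_id=CHICKEN_TAX_ID):
    """Return {GeneID: sorted list of (GO_ID, GO_term)} for one taxon."""
    terms = defaultdict(set)
    reader = csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        if not row or row[0].startswith("#"):
            continue  # skip the header line and blank lines
        if row[0] == tax_id:
            terms[row[1]].add((row[2], row[5]))
    return {gene: sorted(pairs) for gene, pairs in terms.items()}


# Illustrative rows only; the gene IDs are made up.
sample = (
    "#tax_id\tGeneID\tGO_ID\tEvidence\tQualifier\tGO_term\tPubMed\tCategory\n"
    "9031\t100001\tGO:0006915\tIEA\t-\tapoptotic process\t-\tProcess\n"
    "9031\t100001\tGO:0005515\tIEA\t-\tprotein binding\t-\tFunction\n"
    "9606\t200002\tGO:0006915\tIEA\t-\tapoptotic process\t-\tProcess\n"
)
go_by_gene = load_go_terms(io.StringIO(sample))
print(go_by_gene)
```

Using a set per gene removes duplicate annotations that differ only in evidence code or PubMed reference, which is usually what a one-glyph-per-term display needs.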

Markers and SNPs

Markers were obtained from the NCBI UniSTS ftp site, or from a sequence file provided by Dr Martien Groenen (Wageningen University). The genomic locations of these sequences were then determined by BLAT analysis. SNPs were also mapped to the genome by BLAT using the flanking sequence obtained from the NCBI dbSNP database. Because of the high density of SNPs mapped to the chicken genome (>3 000 000), the SNP track is only visualized at a zoom scale of 250 000 nucleotides or lower. Clicking on an individual SNP glyph will link to the NCBI cluster report for that SNP.

Other avian species

To help integrate analysis of the chicken with other avian species, genomic and cDNA data from the turkey (28–31), condor and zebra finch (32,33) have been mapped to the chicken genome by BLAT. Turkey DNA and zebra finch DNA sequences were obtained from NCBI along with the condor MS sequences. The condor 454 sequences were derived from fibroblast ESTs determined using the 454 sequencing technology (34).

DNA

This track visualizes the DNA sequence of the current region. The nucleotide sequence is only presented at a zoom of 100 base pairs. When a larger region is displayed, the %GC content is shown instead.
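
The zoom-dependent behaviour of the SNP and DNA tracks can be sketched in a few lines of Python. This is not GBrowse code: the names gc_percent and display_plan are hypothetical, the two thresholds are the values quoted in the text, and the sketch assumes that views narrower than 100 base pairs also show the nucleotide letters.

```python
SNP_MAX_SPAN = 250_000  # from the text: SNPs drawn at 250 000 nt or lower
DNA_MAX_SPAN = 100      # from the text: bases drawn at a 100 bp view


def gc_percent(seq):
    """Percent G+C among unambiguous bases (A, C, G, T); 0.0 if none."""
    seq = seq.upper()
    called = sum(seq.count(base) for base in "ACGT")
    if called == 0:
        return 0.0
    return 100.0 * (seq.count("G") + seq.count("C")) / called


def display_plan(start, end):
    """Decide what to draw for a region given in inclusive 1-based coordinates."""
    span = end - start + 1
    return {
        "span": span,
        "show_snps": span <= SNP_MAX_SPAN,
        # Assumption: views narrower than 100 bp also show the bases.
        "dna_track": "sequence" if span <= DNA_MAX_SPAN else "gc_content",
    }


# The chromosome 19 region shown in Figure 1.
plan = display_plan(5129862, 5170001)
print(plan)
print(gc_percent("GGCCAATTNN"))
```

For the Figure 1 region (40 140 nucleotides) the sketch shows SNPs and a %GC plot, consistent with the thresholds given in the text.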

QUERY TOOLS

The Gallus GBrowse web page provides an integrated query interface. Specific chromosomal regions of 10 megabases or less can be accessed with known nucleotide coordinates using the Landmark or Region search box (Figure 1). This same search box can be used to locate specific information stored in the GBrowse database. For example, one can search for all genes annotated with the GO term ‘apoptosis’ by inserting ‘GO:apoptosis’ in the Landmark or Region box (Figure 2). This yields a total of 14 genes that have been annotated with ‘apoptosis’ in the chicken genome. A complete listing of all query prefix terms (such as GO) is provided in the Gallus GBrowse help pages.
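
The two kinds of query described above, coordinate ranges and prefixed terms, can be composed programmatically. The Python sketch below only builds query strings and URLs; it does not contact the server. Passing the query through a URL parameter called name, the "chr19" style of chromosome label and the helper names are assumptions that should be checked against the Gallus GBrowse help pages.

```python
from urllib.parse import urlencode

BASE_URL = "http://birdbase.net/cgi-bin/gbrowse/gallus/"
MAX_REGION = 10_000_000  # from the text: regions of 10 megabases or less


def region_query(chrom, start, end):
    """Build a 'chrom:start..end' landmark string, enforcing the 10 Mb limit."""
    if start < 1 or end < start:
        raise ValueError("coordinates must satisfy 1 <= start <= end")
    if end - start + 1 > MAX_REGION:
        raise ValueError("region exceeds 10 megabases")
    return "%s:%d..%d" % (chrom, start, end)


def prefixed_query(prefix, term):
    """Build a prefixed search such as 'GO:apoptosis' or 'NCBI:gene name'."""
    return "%s:%s" % (prefix, term)


def search_url(query):
    # Assumption: the Landmark or Region box maps to a 'name' URL parameter.
    return BASE_URL + "?" + urlencode({"name": query})


print(search_url(region_query("chr19", 5129862, 5170001)))
print(search_url(prefixed_query("GO", "apoptosis")))
```

Validating the region size before building the URL mirrors the 10 megabase limit stated in the text and avoids sending a request that the browser would reject.
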

Figure 1. Gallus GBrowse. A portion of chicken chromosome 19 (nucleotides 5129862–5170001) shown with glyphs depicting predicted genes (chicken), links to Unigene, Reactome and Gene Ontology annotation, SNPs and the location of turkey, zebra finch and condor ESTs that have been mapped to the chicken genome.

Figure 2. Searching Gallus GBrowse for specific entries. The search term GO:apoptosis was entered in the Landmark or Region box followed by pushing the Search button. A total of 14 entries were found; only three are presented here. Using the mouse to click on the chromosome link will open the browser to that location.

One of the more challenging aspects of using many genomic databases is searching based on a gene name. As a convention, Gallus GBrowse uses chicken gene names assigned by NCBI, and entering a search in the syntax ‘NCBI:gene name’ will typically recover the desired information. Another approach can be to use a homologous nucleotide or protein sequence and the BLAT server (below) to identify the chromosomal location of the gene of interest.

A BLAT server is provided (http://birdbase.net/cgi-bin/webBlat) to allow searching the chicken genome with either nucleotide or protein sequences. Two databases are provided, one containing the nucleotide sequence (Chicken Genome untranslated) and the other containing the chicken genome translated in all reading frames (Chicken Genome translated). To successfully execute a BLAT search, the appropriate database must be selected for nucleotide (untranslated) or protein (translated) input sequence. Results from the BLAT analysis are returned as two web links, one showing the alignment of the query sequence with the matched chicken genomic sequence, and the second displaying the Gallus GBrowse viewer focused on the region of the aligned query sequence.
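
Because the BLAT database must match the type of the input sequence, a client-side check can help avoid failed searches. The Python sketch below guesses the sequence type from its letter composition and returns the corresponding database name from the text. The 90% threshold and the helper names are arbitrary assumptions; the heuristic is not part of the BLAT server.

```python
NUCLEOTIDE_DB = "Chicken Genome untranslated"
PROTEIN_DB = "Chicken Genome translated"


def strip_fasta(text):
    """Drop FASTA header lines and whitespace; return the upper-case sequence."""
    lines = [line.strip() for line in text.splitlines()]
    return "".join(
        line for line in lines if line and not line.startswith(">")
    ).upper()


def choose_blat_database(text, threshold=0.9):
    """Pick the database by the fraction of nucleotide letters in the input.

    Heuristic only: a sequence is treated as nucleotide when at least
    `threshold` of its characters are A, C, G, T, U or N.
    """
    seq = strip_fasta(text)
    if not seq:
        raise ValueError("empty sequence")
    nucleotide_letters = sum(seq.count(base) for base in "ACGTUN")
    if nucleotide_letters / len(seq) >= threshold:
        return NUCLEOTIDE_DB
    return PROTEIN_DB


print(choose_blat_database(">query\nATGGCCAAGTTGACC\n"))
print(choose_blat_database("MKTAYIAKQR"))
```

Short peptides composed mostly of alanine, cysteine, glycine and threonine would be misclassified by this rule, so a user-supplied override remains advisable.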

FUTURE DIRECTIONS

Gallus GBrowse will be updated as new relevant information becomes available. One near-term objective is to incorporate the position of repetitive sequence elements into the GBrowse database. An additional goal is incorporating both microarray and high-throughput EST sequencing data to describe gene expression patterns. Initially this will likely reflect a simple interpretation of whether or not a gene was detected above background, allowing users to determine if a given gene is expressed under the experimental conditions of the microarray or sequencing assay. Gallus GBrowse will also be improved by linking genes with the curated ontology efforts of AgBase (35,36). The current GO entries are derived from uncurated, electronic annotation, and the AgBase effort should provide a far more reliable and accurate assignment of GO terms. Finally, a long-term goal is to continue incorporating genomic information from other avian species with the adoption of additional GMOD tools and the Chado database schema. We hope to ultimately provide an integrated resource for comparative avian genomics.

REFERENCES (10 of 36 listed)

Review 1.  Morphogen gradients in vertebrate limb development.

Authors:  C Tickle
Journal:  Semin Cell Dev Biol       Date:  1999-06       Impact factor: 7.727

Review 2.  Avian macrophage: effector functions in health and disease.

Authors:  M A Qureshi; C L Heggen; I Hussain
Journal:  Dev Comp Immunol       Date:  2000 Mar-Apr       Impact factor: 3.636

3.  BLAT--the BLAST-like alignment tool.

Authors:  W James Kent
Journal:  Genome Res       Date:  2002-04       Impact factor: 9.043

4.  The generic genome browser: a building block for a model organism system database.

Authors:  Lincoln D Stein; Christopher Mungall; ShengQiang Shu; Michael Caudy; Marco Mangone; Allen Day; Elizabeth Nickerson; Jason E Stajich; Todd W Harris; Adrian Arva; Suzanna Lewis
Journal:  Genome Res       Date:  2002-10       Impact factor: 9.043

Review 5.  Regulation of vertebrate eye development by Rx genes.

Authors:  Travis J Bailey; Heithem El-Hodiri; Li Zhang; Rina Shah; Peter H Mathers; Milan Jamrich
Journal:  Int J Dev Biol       Date:  2004       Impact factor: 2.203

6.  Genomic resources for songbird research and their use in characterizing gene expression during brain development.

Authors:  Xiaoching Li; Xiu-Jie Wang; Jonathan Tannenhauser; Sheila Podell; Piali Mukherjee; Moritz Hertel; Jeremy Biane; Shoko Masuda; Fernando Nottebohm; Terry Gaasterland
Journal:  Proc Natl Acad Sci U S A       Date:  2007-04-10       Impact factor: 11.205

7.  Sequence and comparative analysis of the chicken genome provide unique perspectives on vertebrate evolution.

Authors:  International Chicken Genome Sequencing Consortium
Journal:  Nature       Date:  2004-12-09       Impact factor: 49.962

8.  AgBase: a unified resource for functional analysis in agriculture.

Authors:  Fiona M McCarthy; Susan M Bridges; Nan Wang; G Bryce Magee; W Paul Williams; Dawn S Luthe; Shane C Burgess
Journal:  Nucleic Acids Res       Date:  2006-11-29       Impact factor: 16.971

9.  Development of a cDNA array for chicken gene expression analysis.

Authors:  Joan Burnside; Paul Neiman; Jianshan Tang; Ryan Basom; Richard Talbot; Mark Aronszajn; David Burt; Jeff Delrow
Journal:  BMC Genomics       Date:  2005-02-04       Impact factor: 3.969

10.  Reactome: a knowledge base of biologic pathways and processes.

Authors:  Imre Vastrik; Peter D'Eustachio; Esther Schmidt; Geeta Joshi-Tope; Gopal Gopinath; David Croft; Bernard de Bono; Marc Gillespie; Bijay Jassal; Suzanna Lewis; Lisa Matthews; Guanming Wu; Ewan Birney; Lincoln Stein
Journal:  Genome Biol       Date:  2007       Impact factor: 13.583
