| Literature DB >> 25628646 |
Niek de Klein1, Mark Ibberson2, Isaac Crespo2, Sophie Rodius3, Francisco Azuaje3.
Abstract
Among a diversity of animal models of disease, the zebrafish is a promising model organism for enabling novel translational biomedical research. To fully achieve the latter, a key requirement is to match molecular readouts measured in zebrafish with information relevant to health and disease in humans. A fundamental step in this direction is to accurately map gene sequences from zebrafish to humans. Despite significant progress in genome annotation, this remains an intricate and time-consuming challenge. Here we discuss major obstacles that we had to overcome to systematically map genes from zebrafish to human. We identified important disparities, as well as partial agreements, between five public zebrafish-to-human homology resources. There is still a need for standardized, comprehensive genomic mappings between zebrafish and humans. Without this, efforts to use zebrafish as a powerful translational research tool will be stalled.Entities:
Keywords: genome annotation; orthology inference; translational research; zebrafish; zebrafish-to-human gene mapping
Year: 2015 PMID: 25628646 PMCID: PMC4290677 DOI: 10.3389/fgene.2014.00470
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
FIGURE 1Mapping zebrafish to human sequences via five annotated resources. Sequence probes from Affymetrix’s GeneChip Zebrafish Genome Array were mapped to probes in GeneChip Human Genome U133A. Each mapping pipeline is based on a single resource independently: (1) HomoloGene (Wheeler et al., 2001; Acland et al., 2014), (2) Biomart (Kasprzyk, 2011), (3) conversion file provided by Affymetrix, (4) ZFIN (Bradford et al., 2011), and (5) BLAST homology searches performed at our laboratory (Altschul et al., 1990, 1997; Ye et al., 2006). In the latter method we focused on the most statistically significant BLAST match per query.
FIGURE 2Gene mapping agreement between the different homology annotation resources. Number of mapped genes are shown for each resource and between-resource intersection.