| Literature DB >> 28663344 |
Yanhui Hu1, Aram Comjean1, Stephanie E Mohr1, Norbert Perrimon2,3,4.
Abstract
One of the most powerful ways to develop hypotheses regarding the biological functions of conserved genes in a given species, such as humans, is to first look at what is known about their function in another species. Model organism databases and other resources are rich with functional information but difficult to mine. Gene2Function addresses a broad need by integrating information about conserved genes in a single online resource.Entities:
Keywords: data mining; functional genomics; human genetic disease; model organism databases; orthologs
Mesh:
Year: 2017 PMID: 28663344 PMCID: PMC5555488 DOI: 10.1534/g3.117.043885
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Figure 1Overview of the Gene2Function (G2F) online resource. For detailed information about the database, logic flow, and information sources, see File S1.
Summary of disease and gene reports displayed in Gene2Function (G2F)
| Column Header | Content | Source of Content |
|---|---|---|
| Disease report | ||
| Gene symbol human | Official gene symbol | NCBI gene |
| Gene ID human | NCBI gene ID | NCBI gene |
| Count disease terms | Number of disease terms | OMIM, EBI GWAS |
| Disease terms | Disease terms | OMIM, EBI GWAS |
| Ortholog overview | Link to G2F gene report | Internal |
| Gene report | ||
| NCBI gene ID | NCBI gene ID | NCBI gene |
| Symbol | Official gene symbol | NCBI gene |
| Human disease counts | Number of disease terms; link to MARRVEL | OMIM, EBI GWAS |
| Species name | Species name | |
| Species-specific gene ID | Species-specific gene ID | Links to HGNC or MOD gene report |
| Species-specific database | Relevant database name | Links to HGNC or MOD home page |
| DIOPT score | DIOPT score | DIOPT |
| Best score | Yes or no, this pair has best score at DIOPT | DIOPT |
| Best score reverse | Yes or no, this pair has best score if opposite search | DIOPT |
| Confidence | DIOPT confidence | DIOPT |
| Publication count | Number of publications on the ortholog | NCBI gene2pubmed |
| GO component counts | Number of cellular component GO terms assigned to the ortholog | NCBI gene2go |
| GO function counts | Number of molecular function GO terms assigned to the ortholog | NCBI gene2go |
| GO process counts | Number of biological processes GO terms assigned to the ortholog | NCBI gene2go |
| Protein interaction counts | Number of protein interactions assigned to the ortholog | BioGrid |
| Genetic interaction counts | Number of genetic interactions assigned to the ortholog | BioGrid |
| Mine phenotype data | Number of phenotype entries from Mines | HumanMine, MouseMine, XenMine, ZebrafishMine, FlyMine, WormBase, SGD |
| Mine expression data | Number of expression entries from Mines | HumanMine, MouseMine, XenMine, ZebrafishMine, FlyMine, WormBase, SGD |
| Mine disruption phenotype | Number of disruption phenotype entries | UniProt |
| 3D structure | Number of 3D structures available for the ortholog | Protein data bank |
| ORF clones | Number of ORF clones | PlasmID clone repository |
| Protein alignment | Multiple or pairwise alignment of orthologs | DIOPT |
OMIM, Online Mendelian Inheritance in Man; EBI, European Bioinformatics Institute; GWAS, genome-wide association study; HGNC, HUGO Gene Nomenclature Committee; MOD, model organism database; DIOPT, DRSC Integrative Ortholog Prediction Tool; GO, gene ontology; SGD, Saccharomyces Genome Database; ORF, open reading frame.
MARRVEL, Model organism Aggregated Resources for Rare Variant ExpLoration (Wang ).
The databases included at G2F are MGI (Blake ), RGD (Shimoyama ), Xenbase (Karpinka ), ZFIN (Howe ), FlyBase (Gramates ), WormBase (Howe ), SGD (Cherry ), and PomBase (McDowall ).
DIOPT score, number of ortholog prediction tools included at DIOPT (Hu ) that cover both species and predict the displayed ortholog match.
In this column, “High” indicates that the ortholog pair has the best score among all pairs with both a forward and a reverse direction score and a DIOPT ≥ 2; “Moderate” indicates that the ortholog pair has the best score with the forward or the reverse search and a DIOPT ≥ 2, or has a DIOPT score ≥ 4 but is not the best score with either a forward or reverse search; and “Low” includes all other predicted ortholog pairs.
Mines (or MODs serving that function): HumanMine, MouseMine, XenMine, ZebrafishMine, FlyMine, WormBase, and SGD (Cherry ; Smith ; Howe ).
Links provided for one of several repositories in the United States and overseas that have ORF clones, many of which are from the ORFeome Collaboration (2016).