John D Osborne, Jared Flatow, Michelle Holko, Simon M Lin, Warren A Kibbe, Lihua Julie Zhu, Maria I Danila, Gang Feng, Rex L Chisholm.
Abstract
BACKGROUND: The human genome has been extensively annotated with Gene Ontology for biological functions, but minimally computationally annotated for diseases.
Year: 2009 PMID: 19594883 PMCID: PMC2709267 DOI: 10.1186/1471-2164-10-S1-S6
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1. Diagram of Disease Ontology annotation of the human genome. A) MMTx was used to annotate GeneRIFs with the Disease Ontology (DO). B) An example GeneRIF suggests that Gene ID: 7040 is annotated with DOID:2585.
Figure 2. Example Gene Annotation by DO, OMIM and GO. An example gene annotation is provided for ATP7B. The gene description, DOID, OMIM, and GO annotation descriptions are provided.
Gene: ATP7B (ATPase, Cu++ transporting, beta polypeptide). GeneID: 540.
Description: This gene is a member of the P-type cation transport ATPase family and encodes a protein with several membrane-spanning domains, an ATPase consensus sequence, a hinge domain, a phosphorylation site, and at least 2 putative copper-binding sites. This protein functions as a monomer, exporting copper out of the cells, such as the efflux of hepatic copper into the bile. Alternate transcriptional splice variants, encoding different isoforms with distinct cellular localizations, have been characterized. Mutations in this gene have been associated with Wilson disease (WD).
DOID: Breast Carcinoma, Carcinoma, Congenital Abnormality, Disorder of copper metabolism, Esophageal carcinoma, Hepatolenticular Degeneration, Liver diseases, Malignant neoplasm of ovary, Primary carcinoma of the liver cells, Stomach Carcinoma.
OMIM: Wilson disease.
GO: ATP binding, ATPase activity, coupled to transmembrane movement of ions, phosphorylative mechanism, Component, Golgi apparatus, Process, cellular copper ion homeostasis, cellular zinc ion homeostasis, colocalizes_with basolateral plasma membrane, colocalizes_with cytoplasmic membrane-bounded vesicle, colocalizes_with perinuclear region of cytoplasm, colocalizes_with trans-Golgi network, copper ion binding, copper ion import, copper ion transmembrane transporter activity, copper ion transport, copper-exporting ATPase activity, cytoplasm, hydrolase activity, hydrolase activity, acting on acid anhydrides, catalyzing transmembrane movement of substances, integral to membrane, integral to plasma membrane, intracellular copper ion transport, ion transport, lactation, late endosome, magnesium ion binding, membrane, membrane fraction, metabolic process, metal ion binding, metal ion transmembrane transporter activity, metal ion transport, mitochondrion, nucleotide binding, protein binding, response to copper ion, sequestering of calcium ion, transport.
Figure 3. Comparison of DO and OMIM Annotation. A) The number of diseases per gene is plotted for the Disease Ontology (DO) analysis and OMIM. B) The number of genes per disease is plotted for the Disease Ontology (DO) analysis and OMIM.
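The two quantities plotted in Figure 3 can be derived from a flat list of gene-to-disease annotation pairs. The sketch below is only an illustration of that counting step: the function name `degree_distributions` and the placeholder identifiers (`G1`, `D1`, ...) are assumptions introduced here, not names or data from the study.

```python
from collections import Counter


def degree_distributions(pairs):
    """Count diseases per gene and genes per disease.

    `pairs` is an iterable of (gene_id, disease_id) annotations.
    Duplicate pairs are collapsed so each annotation counts once.
    """
    unique_pairs = set(pairs)
    diseases_per_gene = Counter(gene for gene, _ in unique_pairs)
    genes_per_disease = Counter(disease for _, disease in unique_pairs)
    return diseases_per_gene, genes_per_disease


# Placeholder annotations, invented for illustration (not study data).
toy_pairs = [("G1", "D1"), ("G1", "D2"), ("G2", "D1"), ("G1", "D1")]
diseases_per_gene, genes_per_disease = degree_distributions(toy_pairs)

# Histogram of the kind plotted per panel: how many genes have k diseases.
genes_with_k_diseases = Counter(diseases_per_gene.values())
print(dict(diseases_per_gene), dict(genes_per_disease), dict(genes_with_k_diseases))
```

Running the same function once on DO-derived pairs and once on OMIM-derived pairs would give the two series compared in each panel.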
First Ten Diseases ordered by the number of gene annotations
| DOID | Disease | Number of gene annotations |
| DOID:162 | Cancer | 943 |
| DOID:462 | Malignant Neoplasms | 903 |
| DOID:4241 | Malignant neoplasm of breast | 698 |
| DOID:4766 | Embryoma | 620 |
| DOID:10283 | Malignant neoplasm of prostate | 543 |
| DOID:2619 | Neoplasm Metastasis | 386 |
| DOID:9352 | Diabetes Mellitus, Non-Insulin-Dependent | 329 |
| DOID:684 | Primary carcinoma of the liver cells | 326 |
| DOID:7148 | Rheumatoid Arthritis | 320 |
| DOID:1994 | Carcinoma of the Large Intestine | 313 |
First Ten Genes ordered by the number of disease annotations
| Gene ID | Symbol | Gene name | Number of disease annotations |
| 3569 | IL6 | interleukin 6 | 168 |
| 4318 | MMP9 | matrix metallopeptidase 9 | 164 |
| 1956 | EGFR | epidermal growth factor receptor | 138 |
| 1029 | CDKN2A | cyclin-dependent kinase inhibitor | 138 |
| 4313 | MMP2 | matrix metallopeptidase 2 | 135 |
| 3586 | IL10 | interleukin 10 | 134 |
| 4524 | MTHFR | 5,10-methylenetetrahydrofolate reductase | 123 |
| 3576 | IL8 | interleukin 8 | 121 |
| 596 | BCL2 | B-cell CLL/lymphoma 2 | 115 |
| 3553 | IL1B | interleukin 1, beta | 109 |
Estimation of recall and precision of disease annotation
| | OMIM | GeneRIF |
| Recall (%) | 21.85 | 90.76 |
| Precision (%) | 98.46 | 96.66 |
The Homayouni gene collection was used to estimate the recall and precision of gene mappings to the Disease Ontology.
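For readers unfamiliar with the two measures, the sketch below computes them under the standard definitions (precision = correct mappings / all predicted mappings; recall = correct mappings / all reference mappings). It is not the authors' evaluation code: the function name `precision_recall` and the toy pairs are assumptions made for illustration, and the record does not state how the reference set was matched against the predictions.

```python
def precision_recall(predicted, reference):
    """Return (precision, recall) in percent for two collections of
    (gene_id, disease_id) pairs, using the standard set-based definitions."""
    predicted, reference = set(predicted), set(reference)
    true_positives = len(predicted & reference)
    precision = 100.0 * true_positives / len(predicted) if predicted else 0.0
    recall = 100.0 * true_positives / len(reference) if reference else 0.0
    return precision, recall


# Toy pairs, invented for illustration only (not data from the study).
reference = {("G1", "D1"), ("G2", "D1"), ("G3", "D2"), ("G4", "D2")}
predicted = {("G1", "D1"), ("G2", "D1"), ("G3", "D2"), ("G5", "D2")}

precision, recall = precision_recall(predicted, reference)
print(f"precision={precision:.2f}% recall={recall:.2f}%")
```

Read this way, the table says that the OMIM mappings are almost always correct but cover few of the reference gene-disease links, while the GeneRIF mappings recover most of them at a similar precision.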
Figure 4. Genes linked to different types of cancers. Ovarian cancer, breast cancer, neuroblastoma and multiple myeloma are represented by large grey dots. Genes annotated to each of these diseases are represented by smaller grey dots, with 357 genes annotated to ovarian cancer, 199 genes annotated to breast cancer, 156 genes annotated to neuroblastoma, and 135 genes annotated to multiple myeloma. The 11 genes (MMP2, MYC, BCL2, KIT, WT1, CXCL12, CDKN1B, IGF1, CCND1, BIRC5 and SKP2) related to all four diseases are highlighted in the shaded circle at the center.
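Finding the genes common to several diseases, as in the center of Figure 4, amounts to a set intersection over per-disease gene lists. The following is a minimal sketch of that step, not the authors' procedure: the function name `shared_genes` is an assumption, and the gene lists are tiny placeholders (two shared symbols borrowed from the caption, the rest invented labels) rather than the actual annotation sets.

```python
def shared_genes(disease_to_genes):
    """Return the set of genes annotated to every disease in the mapping."""
    gene_sets = [set(genes) for genes in disease_to_genes.values()]
    if not gene_sets:
        return set()
    return set.intersection(*gene_sets)


# Placeholder gene lists for illustration only. MMP2 and MYC appear in the
# caption's shared set; the PLACEHOLDER_* entries are invented labels.
toy_annotations = {
    "ovarian cancer": ["MMP2", "MYC", "PLACEHOLDER_A"],
    "breast cancer": ["MMP2", "MYC", "PLACEHOLDER_B"],
    "neuroblastoma": ["MYC", "MMP2", "PLACEHOLDER_A"],
    "multiple myeloma": ["MMP2", "PLACEHOLDER_C", "MYC"],
}

common = shared_genes(toy_annotations)
print(sorted(common))
```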