| Literature DB >> 19073697 |
Csaba Ortutay1, Mauno Vihinen.
Abstract
Disease gene identification is still a challenge despite modern high-throughput methods. Many diseases are very rare or lethal and thus cannot be investigated with traditional methods. Several in silico methods have been developed but they have some limitations. We introduce a new method that combines information about protein-interaction network properties and Gene Ontology terms. Genes with high-calculated network scores and statistically significant gene ontology terms based on known diseases are prioritized as candidate genes. The method was applied to identify novel primary immunodeficiency-related genes, 26 of which were found. The investigation uses the protein-interaction network for all essential immunome human genes available in the Immunome Knowledge Base and an analysis of their enriched gene ontology annotations. The identified disease gene candidates are mainly involved in cellular signaling including receptors, protein kinases and adaptor and binding proteins as well as enzymes. The method can be generalized for any disease group with sufficient information.Entities:
Mesh:
Substances:
Year: 2008 PMID: 19073697 PMCID: PMC2632920 DOI: 10.1093/nar/gkn982
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Protein-interaction network-related scores for essential and immunome genes. An immunome gene was considered essential if mutation in the mouse ortholog causes early lethality. P values for the Kruskal–Wallis rank sum test are shown on the plots. (A) Degree; (B) vulnerability; (C) connectivity.
Figure 2.Identification of novel PID candidate genes by using information about the network properties, GO terms and known immunodeficiency genes. The grey area indicates the 26 candidate genes that have high-protein network scores and GO terms enriched in PID genes and which are not among already known PID genes.
Identified PID candidate genes
| Symbol | Full name | GeneID | Known diseases |
|---|---|---|---|
| CD4 antigen (p55) | 186 940 CD4+ lymphocyte deficiency | ||
| CD9 antigen (p24) | 928 | ||
| cathepsin G | 1511 | ||
| Fas-associating protein with death domain | 8772 | ||
| Protein-tyrosine kinase fyn | 2534 | ||
| Granzyme B | 3002 | ||
| Insulin-like growth factor 1 receptor | 3480 | 147 370 Intrauterine and postnatal growth retardation | |
| Interleukin 2 receptor, β | 3560 | ||
| Insulin receptor | 3643 | 610 549 Diabetes mellitus, insulin-resistant, with acanthosis nigricans | |
| 609 968 Hyperinsulinemic hypoglycemia | |||
| 246 200 Leprechaunism | |||
| 262 190 Rabson–Mendenhall syndrome | |||
| Interleukin-1 receptor-associated kinase 1 | 3654 | ||
| β 1 integrin | 3688 | ||
| Janus kinase 1 | 3716 | ||
| Janus kinase 2 | 3717 | 600 880 Budd–Chiari syndrome | |
| 601 626 Leukemia, acute myelogenous | |||
| 254 450 Myelofibrosis, idiopathic | |||
| 263 300 Myeloproliferative disorder with erythrocytosis—polycythemia vera | |||
| 187 950 Thrombocythemia, essential | |||
| v-Kit Hardy-Zuckerman 4 feline sarcoma viral oncogene homolog | 3815 | 606 764 Gastrointestinal stromal tumor, somatic | |
| 273 300 Germ cell tumors | |||
| 273 300 Mast cell leukemia—mastocytosis with associated hematologic disorder—piebaldism | |||
| Low-density lipoprotein-related protein 1 | 4035 | ||
| Mitogen-activated protein kinase 14 | 1432 | ||
| Platelet-derived growth factor receptor, β-polypeptide | 5159 | 131 440 Myelomonocytic leukemia, chronic—myeloproliferative disorder with eosinophilia | |
| v-Rel reticuloendotheliosis viral oncogene homolog A | 5970 | ||
| Receptor (TNFRSF)-interacting serine-threonine kinase 1 | 8737 | ||
| Suppressor of cytokine signaling 1 | 8651 | ||
| Signal transducer and activator of transcription 3 | 6774 | ||
| Thy-1 cell surface antigen | 7070 | ||
| TNF receptor-associated factor 1 | 7185 | ||
| TNF receptor-associated factor 2 | 7186 | ||
| Tyrosine kinase 2 | 7297 | ||
| X-ray repair complementing defective repair in Chinese hamster cells 5 | 7520 |
For each genes, the HGNC approved symbol, full name, Entrez GeneID and OMIM code for known diseases is provided.