| Literature DB >> 22822426 |
Frank Emmert-Streib1, Ricardo de Matos Simoes, Shailesh Tripathi, Galina V Glazko, Matthias Dehmer.
Abstract
In this paper, we present a Bayesian approach to estimate a chromosome and a disorder network from the Online Mendelian Inheritance in Man (OMIM) database. In contrast to other approaches, we obtain statistic rather than deterministic networks enabling a parametric control in the uncertainty of the underlying disorder-disease gene associations contained in the OMIM, on which the networks are based. From a structural investigation of the chromosome network, we identify three chromosome subgroups that reflect architectural differences in chromosome-disorder associations that are predictively exploitable for a functional analysis of diseases.Entities:
Mesh:
Year: 2012 PMID: 22822426 PMCID: PMC3400933 DOI: 10.1038/srep00513
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Disorders with at least 5 known disease genes which lead to statistically enriched chromosomes for a false discovery rate of FDR = 0.01. The second column provides information about wider disorder categories to which a disease belongs
| disorder (# genes) | category | enriched Chr |
|---|---|---|
| Thalassemia (5) | Hematological | 11, 16 |
| Schizophrenia (9) | Psychiatric | 22 |
| Asthma (13) | Respiratory | 5 |
| Factor VII deficiency (8) | Hematological | 13 |
| Long QT syndrome (7) | Cardiovascular | 21 |
| Mental retardation (24) | Neurological | X |
| Pancreatic cancer (9) | Cancer | 18 |
Disorder categories leading to statistically enriched chromosomes for a false discovery rate of FDR = 0.01. The first column shows the name of the disorder category, the second column gives the number of enriched chromosomes and the third column lists these chromosomes
| disorder category (# genes) | # enriched Chr | enriched Chr |
|---|---|---|
| Bone (62) | - | - |
| Cancer (372) | 4 | 10, 13, 17, 22 |
| Cardiovascular (125) | - | - |
| Connective tissue (1) | 1 | 5 |
| Connective tissue disorder (63) | - | - |
| Dermatological (123) | 2 | 12, 17 |
| Developmental (59) | - | - |
| Ear,Nose,Throat (57) | - | - |
| Endocrine (134) | - | - |
| Gastrointestinal (45) | 1 | 5 |
| Hematological (212) | 2 | 4, X |
| Immunological (137) | - | - |
| Metabolic (345) | - | - |
| multiple (252) | 1 | X |
| Muscular (104) | - | - |
| Neurological (344) | 1 | X |
| Nutritional (26) | - | - |
| Ophthamological (196) | 1 | X |
| Psychiatric (36) | - | - |
| Renal (70) | - | - |
| Respiratory (38) | - | - |
| Skeletal (96) | 1 | 4 |
| Unclassified (32) | 1 | 7 |
Figure 1Shown are the posterior and prior probabilities for the full Bayes and empirical Bayes analysis.
Figure 2Shown are the log-odds for the full Bayes and empirical Bayes analysis.
The number of protein-coding genes (p-genes) at the chromosomes according to the NCBI (accessed May 2012). Further, known disease genes (d-genes) on the chromosomes are listed
| Chr | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 |
| p-genes | 2062 | 1266 | 1092 | 768 | 905 | 1056 | 939 | 704 | 808 |
| d-genes | 164 | 124 | 95 | 64 | 87 | 83 | 78 | 60 | 70 |
Summary of our full Bayesian analysis shown in Fig. 1 and 2. Column two gives the chromosomes for which FOPij(FB) > F holds and column three shows results for LOD(FB) > log(F)
| disorder category | Chr | Chr |
|---|---|---|
| Bone | 4,11,12,18,20 | 4,11,12,18,20 |
| Cancer | 22 | 22 |
| Cardiovascular | 7,21 | 7,21 |
| Connective tissue | 5 | 5 |
| Connective tissue disorder | 6,9,15 | 6,9,15 |
| Dermatological | 12,15,17,18 | 12,15,17,18 |
| Developmental | 12,15 | 9,12,15 |
| Ear,Nose,Throat | 7,13,21 | 7,13,21 |
| Endocrine | 20,Y | 20,Y |
| Gastrointestinal | 5,7,13,18 | 5,7,12,13,18 |
| Hematological | 4,16,22 | 4,16,22 |
| Immunological | 6,21 | 6 |
| Metabolic | - | - |
| multiple | - | - |
| Muscular | 9,21 | 9,21 |
| Neurological | X | X |
| Nutritional | 5,8,16,18 | 3,5,8,16,18 |
| Ophthamological | 15 | 15 |
| Psychiatric | 6,13,15,20,21,22 | 6,13,15,20,21,22,X |
| Renal | 2,16,17,19,X | 16,19 |
| Respiratory | 5,8,14 | 5,8,14 |
| Skeletal | Y | 4,Y |
| Unclassified | 4,7,10,14,18 | 4,7,10,14,18 |
Figure 3The human chromosome network (CNet) where nodes correspond to chromosomes and two chromosomes C and C are connected if the joint consensus condition in Eqn. 16 is fulfiled.
Summary statistics of the chromosome network shown in Fig. 3. Listed are values of the degree, betweenness centrality (bc) and the FDR adjusted p-values of the betweenness centrality values for each of the chromosomes
| Chr | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 |
|---|---|---|---|---|---|---|---|---|---|
| degree | 0 | 0 | 0 | 9 | 5 | 6 | 7 | 3 | 3 |
| bc-value | 0.0 | 0.0 | 0.0 | 26.4 | 1.9 | 2.3 | 7.6 | 0.0 | 0.0 |
| p-value | 1.0 | 1.0 | 1.0 | <10−5 | 1.0 | 1.0 | 0.13 | 1.0 | 1.0 |
| Chr | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 |
| degree | 4 | 4 | 6 | 8 | 6 | 9 | 3 | 3 | 12 |
| bc-value | 0.0 | 0.0 | 2.9 | 8.6 | 3.8 | 15.9 | 17.0 | 0.0 | 32.9 |
| p-value | 1.0 | 1.0 | 1.0 | 0.005 | 1.0 | <10−5 | <10−5 | 1.0 | <10−5 |
| Chr | 19 | 20 | 21 | 22 | X | Y | |||
| degree | 1 | 9 | 7 | 7 | 0 | 0 | |||
| bc-value | 0.0 | 9.2 | 6.2 | 14.1 | 0.0 | 0.0 | |||
| p-value | 1.0 | 0.0009 | 1.0 | <10−5 | 1.0 | 1.0 |
Figure 4Summary of the gene ontology analysis for the three categories ‘biological process’ (BP), ‘cellular component’ (CC) and ‘molecular function’ (MF).
The color of the chromosomal subgroups corresponds to the color of the chromosomal subgroups in Fig. 3.
Statistically enriched GO categories for chromosome Category 1
| GO.ID/BP | GO term | # of genes | p-val |
|---|---|---|---|
| GO:0051925 | regulation of calcium ion transport via … | 8 | 1.9e-07 |
| GO:0003014 | renal system process | 8 | 3.4e-07 |
| GO:0009056 | catabolic process | 52 | 1.5e-06 |
| GO:0001974 | blood vessel remodeling | 6 | 2.4e-06 |
| GO:0050880 | regulation of blood vessel size | 10 | 2.4e-06 |
| GO:0035150 | regulation of tube size | 10 | 2.7e-06 |
| GO:0032412 | regulation of ion transmembrane transpor… | 8 | 4.5e-06 |
| GO.ID/CC | GO term | # of genes | p-val |
| GO:0045202 | synapse | 20 | 1.6e-06 |
| GO:0005891 | voltage-gated calcium channel complex | 5 | 8.6e-06 |
| GO:0030424 | axon | 13 | 8.6e-06 |
| GO:0005913 | cell-cell adherens junction | 6 | 1.2e-05 |
| GO:0044433 | cytoplasmic vesicle part | 16 | 1.3e-05 |
| GO:0016529 | sarcoplasmic reticulum | 6 | 2.4e-05 |
| GO:0016528 | sarcoplasm | 6 | 3.1e-05 |
| GO.ID/MF | GO term | # of genes | p-val |
| GO:0016836 | hydro-lyase activity | 7 | 1.5e-06 |
| GO:0016835 | carbon-oxygen lyase activity | 8 | 1.5e-06 |
Statistically enriched GO categories for chromosome Category 2
| GO.ID/BP | GO term | # of genes | p-val | |
|---|---|---|---|---|
| GO:0010035 | response to inorganic substance | 32 | 1.1e-12 | |
| GO:2000026 | regulation of multicellular organismal d… | 42 | 3.5e-07 | |
| GO:0051094 | positive regulation of developmental pro… | 31 | 3.6e-07 | |
| GO:0071241 | cellular response to inorganic substance | 10 | 3.9e-07 | |
| GO:0050793 | regulation of developmental process | 49 | 4.2e-07 | |
| GO:0000080 | G1 phase of mitotic cell cycle | 8 | 5.4e-07 | |
| GO:0070482 | response to oxygen levels | 18 | 9.5e-07 | |
| GO:0045597 | positive regulation of cell differentiat… | 25 | 9.5e-07 | |
| GO:0051318 | G1 phase | 8 | 1.1e-06 | |
| GO:0001666 | response to hypoxia | 17 | 1.3e-06 | |
| GO.ID/CC | GO term | # of genes | p-val | |
| GO:0031967 | organelle envelope | 38 | 3.6e-08 | |
| GO:0031975 | envelope | 38 | 5.7e-08 | |
| GO:0000323 | lytic vacuole | 20 | 4.7e-07 | |
| GO:0005764 | lysosome | 20 | 4.7e-07 | |
| GO:0005773 | vacuole | 22 | 5.5e-07 | |
| GO:0031090 | organelle membrane | 71 | 2.3e-06 | |
| GO:0042383 | sarcolemma | 10 | 2.8e-06 | |
| GO.ID/MF | GO term | # of genes | p-val | |
| GO:0016705 | oxidoreductase activity, acting on paire… | 17 | 3.1e-08 | |
| GO:0008395 | steroid hydroxylase activity | 7 | 4.3e-08 | |
| GO:0042803 | protein homodimerization activity | 30 | 6.3e-08 | |
| GO:0004935 | adrenergic receptor activity | 5 | 4.6e-07 | |
| GO:0046982 | protein heterodimerization activity | 19 | 1.9e-06 | |
| GO:0004936 | alpha-adrenergic receptor activity | 4 | 2.7e-06 | |
| GO:0051400 | BH domain binding | 4 | 6.2e-06 | |
| GO:0070330 | aromatase activity | 6 | 7.8e-06 | |
| GO:0004937 | alpha1-adrenergic receptor activity | 3 | 9.1e-06 | |
| GO:0016903 | oxidoreductase activity, acting on the a… | 7 | 9.8e-06 | |
Figure 5The human disorder category network estimated from our Bayesian analysis.