| Literature DB >> 21685067 |
Yong Chen1, Tao Jiang, Rui Jiang.
Abstract
MOTIVATION: Pinpointing genes that underlie human inherited diseases among candidate genes in susceptibility genetic regions is the primary step towards the understanding of pathogenesis of diseases. Although several probabilistic models have been proposed to prioritize candidate genes using phenotype similarities and protein-protein interactions, no combinatorial approaches have been proposed in the literature.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21685067 PMCID: PMC3117332 DOI: 10.1093/bioinformatics/btr213
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Illustration of the MAXIF method. The phenome–interactome network is constructed by integrating the given phenotype similarity profile, PPI network and associations between diseases and genes. By maximizing the information flow in the phenome–interactome network, the strength of association between a query disease and a candidate gene is calculated as the net flow leaving the gene. Candidate genes are then prioritized according to their association strength scores. The numbers on each edge indicate the flow/capacity values of the edge. Only flows with positive values are shown.
Fig. 2.Performance of the proposed method. (A) ROC curves for validation experiments against a linkage interval. (B) ROC curves for validation experiments against random genes. (C) ROC curves for validation experiments against random diseases.
Robustness of the method with respect to the parameter α (with β=10 000 and γ=1) in leave-one-out cross-validation experiments
| 0.1 (%) | 0.2 (%) | 0.3 (%) | 0.4 (%) | 0.5 (%) | |
|---|---|---|---|---|---|
| Linkage interval | |||||
| AUC | 94.56 | 94.57 | 95.66 | 94.68 | 95.57 |
| PRE | 65.06 | 65.06 | 60.14 | 64.38 | 64.46 |
| MRR | 5.15 | 5.15 | 3.88 | 4.84 | 4.20 |
| Random genes | |||||
| AUC | 88.73 | 89.64 | 90.76 | 88.49 | 82.93 |
| PRE | 63.03 | 62.96 | 56.89 | 51.18 | 47.76 |
| MRR | 12.03 | 11.12 | 10.02 | 12.27 | 14.84 |
| Random diseases | |||||
| AUC | 87.76 | 87.73 | 90.65 | 88.45 | 86.07 |
| PRE | 17.95 | 19.03 | 22.44 | 23.52 | 22.24 |
| MRR | 12.84 | 12.20 | 9.47 | 11.61 | 12.98 |
Robustness of method with respect to the parameter β (with α=0.3 and γ=1) in leave-one-out cross-validation experiments
| 1 (%) | 10 (%) | 100 (%) | 1000 (%) | 10 000 (%) | |
|---|---|---|---|---|---|
| Linkage interval | |||||
| AUC | 93.52 | 95.35 | 95.84 | 95.66 | 95.66 |
| PRE | 34.90 | 55.05 | 59.46 | 60.14 | 60.14 |
| MRR | 3.96 | 3.57 | 3.76 | 3.88 | 3.88 |
| Random genes | |||||
| AUC | 88.28 | 89.7 | 91.12 | 90.76 | 90.76 |
| PRE | 35.02 | 53.73 | 55.53 | 56.89 | 56.89 |
| MRR | 8.67 | 8.52 | 9.62 | 10.02 | 10.02 |
| Random diseases | |||||
| AUC | 92.39 | 92.53 | 91.04 | 90.65 | 90.65 |
| PRE | 22.20 | 23.76 | 22.76 | 22.44 | 22.44 |
| MRR | 8.13 | 7.49 | 9.13 | 9.47 | 9.47 |
Robustness of the method with respect to the parameter γ (with α=0.3 and β=10 000) in leave-one-out cross-validation experiments
| 1 (%) | 10 (%) | 20 (%) | 30 (%) | 40 (%) | 50 (%) | 60 (%) | 70 (%) | 80 (%) | 90 (%) | 100 (%) | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Linkage interval | |||||||||||
| AUC | 95.66 | 94.91 | 94.59 | 94.51 | 94.55 | 94.69 | 94.63 | 94.60 | 94.59 | 94.57 | 94.57 |
| PRE | 60.14 | 63.82 | 64.38 | 64.66 | 64.66 | 64.74 | 64.94 | 64.98 | 65.06 | 65.06 | 65.06 |
| MRR | 3.88 | 4.51 | 4.83 | 4.95 | 5.02 | 5.06 | 5.10 | 5.12 | 5.12 | 5.14 | 5.14 |
| Random genes | |||||||||||
| AUC | 90.76 | 88.30 | 86.85 | 86.20 | 85.97 | 85.76 | 85.57 | 85.44 | 85.44 | 85.40 | 85.40 |
| PRE | 56.89 | 61.50 | 62.26 | 62.54 | 62.66 | 62.78 | 62.86 | 63.02 | 63.06 | 63.02 | 63.02 |
| MRR | 10.02 | 12.46 | 13.90 | 14.54 | 14.77 | 14.98 | 15.17 | 15.30 | 15.30 | 15.34 | 15.34 |
| Random diseases | |||||||||||
| AUC | 90.65 | 91.9 | 91.75 | 91.67 | 91.63 | 91.61 | 91.58 | 91.56 | 91.55 | 91.54 | 91.54 |
| PRE | 22.44 | 22.84 | 22.88 | 22.80 | 22.80 | 22.76 | 22.80 | 22.72 | 22.72 | 22.72 | 22.72 |
| MRR | 9.47 | 8.18 | 8.31 | 8.38 | 8.41 | 8.43 | 8.46 | 8.47 | 8.48 | 8.49 | 8.49 |
Fig. 3.Comparison with existing methods on leave-one-out cross-validation experiments against random genes. (A) The ROC curve. (B) The precision-recall curve.
Detailed information of the 47 predicted driver genes in the 50 CNA regions of melanoma
| ID | Chr | Start | End | #(Genes) | Symbol | OMIM | Predicted transcriptional factor binding sites in the promoter region of the gene |
|---|---|---|---|---|---|---|---|
| 1 | 1 | 147226092 | 150596000 | 20 | GJA8 | 116200 601885 | SREBP-1a, SREBP-1b, SREBP-1c, E47, c-Myc, Max, USF1, Max1, PPAR-gamma1, PPAR-gamma2 |
| 2 | 3 | 70107048 | 70229264 | 6 | MITF | 103500 193510 | CREB, deltaCREB, Sox9, p300, STAT3, GATA-2, C/EBPbeta, PPAR-gamma1, PPAR-gamma2, GATA-3 |
| 3 | 3 | 120500072 | 120923376 | 19 | GSK3B | CREB, deltaCREB, NF-kappaB1, FOXO1a, NF-kappaB, SREBP-1b, SREBP-1c, SREBP-1a, HNF-4alpha2, FOXO4 | |
| 4 | 3 | 129929104 | 130624856 | 9 | PLXND1 | GATA-2, PPAR-gamma2, PPAR-gamma1, MEF-2A, AhR, Arnt, RFX1, MZF-1, Egr-3, GR-beta | |
| 5 | 3 | 139476272 | 139507824 | 4 | RBP2 | XBP-1, PPAR-gamma2, PPAR-gamma1, HNF-4alpha2, COUP-TF1, TFIID, TBP, RORalpha1, NF-1, FOXJ2 (long isoform) | |
| 6 | 3 | 139674960 | 141259840 | 15 | RBP2 | XBP-1, PPAR-gamma2, PPAR-gamma1, HNF-4alpha2, COUP-TF1, TFIID, TBP, RORalpha1, NF-1, FOXJ2 (long isoform) | |
| 7 | 3 | 173931152 | 174312880 | 3 | NLGN1 | MEF-2A, HNF-1A, Pax-2, Pbx1a, Tal-1beta, ITF-2, C/EBPalpha, IRF-7A, FOXL1, CUTL1 | |
| 8 | 3 | 175468448 | 175633120 | 1 | TBL1XR1 | NF-kappaB1, NF-kappaB, POU2F1, MEF-2A, POU3F2 (N-Oct-5b), POU3F2, POU3F2 (N-Oct-5a), FOXC1, GR-alpha, c-Myb | |
| 9 | 3 | 192266400 | 199124224 | 52 | TFRC | ATF, p53, NF-kappaB, NF-kappaB1, YY1, Pbx1a, E47, GATA-1, ARP-1, USF1 | |
| 10 | 5 | 11347004 | 14472551 | 3 | TRIO | NF-kappaB1, NF-kappaB, p53, FOXO1a, Egr-3, Egr-2, Egr-1, HNF-1A, PPAR-gamma2, PPAR-gamma1 | |
| 11 | 6 | 3859295 | 5698904 | 9 | CDYL | Hlf, Pbx1a, POU3F2 (N-Oct-5b), POU3F2 (N-Oct-5a), POU3F2, GATA-1, E4BP4, CREB, deltaCREB, AREB6 | |
| 12 | 6 | 57103452 | 57277264 | 5 | BMP5 | Sox9, IRF-7A, POU3F2 (N-Oct-5b), POU3F2 (N-Oct-5a), POU3F2, RSRFC4, Nkx6-1, MEF-2A, NF-kappaB, RelA | |
| 13 | 7 | 139232720 | 140249792 | 10 | AGK | NRSF form 1, NRSF form 2, ROU2F2 (Oct-2.1), POU2F2B, POU2F2C, Oct-B2, oct-B2, oct-B3, POU2F1, POU2F1a | |
| 14 | 8 | 121760777 | 128993129 | 31 | TRIB1 | 612797 | C/EBPbeta, NF-kappaB1, NF-kappaB, Bach2, ATF-2, AREB6, c-Myc, Max, FOXD1, Max1 |
| 15 | 11 | 68953208 | 69754234 | 8 | PPP6R3 | ||
| 16 | 12 | 18722300 | 19574988 | 5 | PLEKHA5 | FOXO3b, FOXO3a, FOXC1, Pax-2, FOXO4, FOXO1a, FOXF2, AhR, Arnt, POU2F2B | |
| 17 | 12 | 24388521 | 32822550 | 45 | ARNTL2 | PPAR-gamma1, PPAR-gamma2, MEF-2A, CUTL1, AP-1, Sox9, E47, Bach2, Nkx2-2, c-Fos | |
| 18 | 12 | 67491552 | 67636136 | 3 | LLPH | Cdc5, EIK-1, HNF-1, HNF-1A, c-Ets-1, AREB6, ISGF-3, NCX/Ncx, MRF-2, NRF-2 | |
| 19 | 15 | 53109188 | 58337128 | 33 | RAB27A | 607624 | Bach1, Chx10, SRF, RFX1, Tal-1beta, ITF-2, STAT3, Cart-1, Tal-1, E47 |
| 20 | 15 | 87547925 | 89038234 | 23 | FANCI | 609053 | Sox9, Amef-2, MEF-2A, EIK-1, ZID, NRF-2, Sox5, Roaz, ATF6 |
| 21 | 17 | 68172496 | 73084144 | 92 | SLC9A3R1 | 612287 | NF-kappaB1, HNF-4alpha2, COUP-TF1, NF-kappaB, NRSF form 2, NRSF form 1, FOXD1, PPAR-gamma2, PPAR-gamma1, GATA-1 |
| 22 | 20 | 47893352 | 49179608 | 16 | DPM1 | 608799 | STAT5A, ATF-2, ATF6, CUTL1, E4BP4, XBP-1, POU2F2C, POU2F2B, POU2F2 (Oct-2.1), POU2F2 |
| 23 | 22 | 39399572 | 40948612 | 36 | GTPBP1 | ATF, Bach1, Arnt, AhR, POU2F1, Bach2, SREBP-1b, SREBP-1c, SREBP-1a, POU3F2 | |
| 24 | 5 | 58445032 | 58683084 | 1 | RAB3C | POU3F2, POU3F2 (N-Oct-5a), POU3F2 (N-Oct-5b), Lmo2, Nkx3-1 v2, Nkx3-1, Nkx3-1 v1, Nkx3-1 v3, Nkx3-1 v4, Pax-2 | |
| 25 | 5 | 59075480 | 59432304 | 1 | RAB3C | POU3F2, POU3F2 (N-Oct-5a), POU3F2 (N-Oct-5b), Lmo2, Nkx3-1 v2, Nkx3-1, Nkx3-1 v1, Nkx3-1 v3, Nkx3-1 v4, Pax-2 | |
| 26 | 5 | 112049568 | 112105816 | 1 | C5orf13 | E4BP4, IRF-1, RREB-1, RP58, POU2F2C, oct-B2, Oct-B1, POU2F2B, oct-B3, POU2F1 | |
| 27 | 6 | 162494042 | 163637213 | 2 | QKI | 180300 | Max, c-Myc, MEF-2A, CUTL1, HNF-1A, FOXJ2 (long isoform), Max1, Arnt, AhR, SREBP-1a |
| 28 | 8 | 6333250 | 9359366 | 28 | DEFB1 | NF-kappaB, NF-kappaB1, C/EBPalpha, p53, GR-alpha | |
| 29 | 9 | 21999960 | 22009732 | 2 | MTAP | HNF-4alpha2, COUP-TF1, p53, Nkx3-1 v3, Nkx3-1 v2, Nkx3-1 v1, Nkx3-1 v4, Nkx3-1, HNF-1A, YY1 | |
| 30 | 10 | 89436437 | 89908984 | 3 | STAMBPL1 | CHOP-10, C/EBPalpha, E4BP4, GATA-1, Pbx1a, RORalpha2, STAT5B, Hlf, YY1, NF-AT1 | |
| 31 | 11 | 111213008 | 111961672 | 17 | IL18 | 180300 | p300, AP-1, c-Fos, c-Jun, STAT1, STAT1alpha, STAT1beta, IRF-1, GR-alpha, NF-kappaB |
| 32 | 13 | 19867988 | 96252808 | 203 | AKAP11 | GATA-1, Max1, c-Myc, MyoD, Pbx1a, RFX1, USF1, ATF, POU3F1, RORalpha1 | |
| 33 | 14 | 38302632 | 38988776 | 7 | PNN | p53, Pax-6, FOXO3b, FOXO3a, YY1, GATA-1, Meis-1, HOXA9B, C/EBPalpha, POU2F1 | |
| 34 | 14 | 45505796 | 46786096 | 2 | FKBP3 | FOXJ2 (long isoform), MEF-2A, YY1, LCR-F1, Nkx3-1 v4, Nkx3-1 v3, Nkx3-1 v1, Nkx3-1, Nkx3-1 v2, POU3F2 | |
| 35 | 14 | 102319430 | 103810789 | 19 | PPP2R5C | HNF-1A,HSF1 (long), HSF2,NF-kappaB1, NF-kappaB, NF-kappaB2, POU2F1, c-Rel, CP1C, NF-YA | |
| 36 | 15 | 39444436 | 39948236 | 13 | CHAC1 | FOXD1, FOXF2, STAT1beta, STAT1, STAT1alpha, FOXO1a, STAT4, STAT2, STAT6, STAT5B | |
| 37 | 15 | 40049072 | 40114644 | 4 | CHAC1 | FOXD1, FOXF2, STAT1beta, STAT1, STAT1alpha, FOXO1a, STAT4, STAT2, STAT6, STAT5B | |
| 38 | 15 | 40603172 | 40933120 | 7 | RAD51 | 114480 | C/EBPbeta, NF-kappaB1, p53, POU2F1, NF-kappaB, Arnt, AhR, Nkx3-1, Nkx3-1 v1, Nkx3-1 v2 |
| 39 | 15 | 41605345 | 43473384 | 30 | TP53BP1 | AP-1, STAT1alpha, STAT1beta, STAT1, c-Jun, c-Fos, NF-kappaB1, NF-kappaB, PPAR-gamma1, PPAR-gamma2 | |
| 40 | 16 | 52162032 | 52621120 | 3 | AKTIP | NF-1, NF-1/L, SRF, SRF(504 AA), E4BP4, ATF-2, P53, USF-1, USF1, SREBP-1a | |
| 41 | 16 | 77264113 | 78880878 | 2 | MAF | 608983 610202 | AP-1, p53, c-Fos, c-Jun, YY1, c-Rel, POU3F2 (N-Oct-5b), POU3F2 (N-Oct-5a), POU3F2, Chx10 |
| 42 | 16 | 87889112 | 87959104 | 2 | CYBA | AP-1, NF-kappaB1, NF-kappaB, PPAR-gamma1, PPAR-gamma2, STAT5A, Sp1, c-Jun, c-Fos, XBP-1 | |
| 43 | 18 | 62719491 | 76117153 | 33 | CDH19 | CUTL1, C/EBPbeta, GATA-1, p53, MEF-2A, FOXD3, SRY | |
| 44 | 7 | 129619320 | 130330138 | 6 | TNPO3 | 601744 | STAT1, STAT1alpha, STAT1beta, E47, STAT3, ATF6, LUN-1, LCR-F1, C/EBPalpha, CHOP-10 |
| 45 | 7 | 12105143 | 13052351 | 2 | ETV1 | Elk-1, Chx10, ATF, CUTL1, Cart-1, FOXF2, FOXC1, HNF-1A, GATA-3, GATA-1 | |
| 46 | 17 | 74741058 | 77061186 | 24 | TIMP2 | AP-1, c-Jun, c-Fos, STAT1, C/EBPbeta, p53, Sp1, STAT1beta, STAT1alpha, ATF | |
| 47 | 20 | 49142876 | 50443163 | 4 | NFATC2 | NF-kappaB1, NF-kappaB, Elk-1, p300, c-Rel, GATA-3, GATA-1, GATA-2, STAT5A, GR-alpha | |
| 48 | 10 | 2552329 | 4073842 | 3 | KLF6 | 137215 176807 | NF-kappaB1, AP-1, c-Fos, c-Jun, C/EBPbeta, NF-kappaB, SRF, Egr-1, STAT3, Sp1 |
| 49 | 14 | 55290464 | 55652536 | 2 | SAMD4A | SRF, E4BP4, Pbx1a, Nkx6-1, MEF-2A, POU3F2 (N-Oct-5b), POU3F2, POU3F2 (N-Oct-5a), RSRFC4, Nkx3-1 v1 | |
| 50 | 14 | 57178720 | 57588888 | 2 | ARID4A | Sp1,FOXO3a,FOXO3b,FOXO4,FOXC1,CUTL1,Pbx1a,FOXO1a,YY1,Elk-1 |
Fig. 4.The 47 predicted driver genes. Red lines denote amplified regions and blue lines denote deleted regions. The genes are involved in several functions, such as transcriptional regulation (green), DNA damage and chromosomal instability (blue), metabolism process (deep blue), immune response (deep red), gap junction transport and signaling (black), neuron differentiation and development of the nervous system (purple) and others (red).
Fig. 5.Six large predicted gene modules. The modules are constructed by extracting genes that directly interact with the 47 predicted driver genes from GeneCards and then identifying connected components. Predicted driver genes are marked in red.
Fig. 6.Transcriptional network of the 47 predicted driver genes. The predicted genes are marked green and their transcriptional factors are marked blue. The most enriched transcriptional factors, p53, NF-kappaB1, NF-kappaB, PPAR-gamma1and PPAR-gamma2, are marked red.