| Literature DB >> 23734762 |
Jie Zhu1, Yufang Qin, Taigang Liu, Jun Wang, Xiaoqi Zheng.
Abstract
BACKGROUND: Identification of gene-phenotype relationships is a fundamental challenge in human health clinic. Based on the observation that genes causing the same or similar phenotypes tend to correlate with each other in the protein-protein interaction network, a lot of network-based approaches were proposed based on different underlying models. A recent comparative study showed that diffusion-based methods achieve the state-of-the-art predictive performance.Entities:
Mesh:
Year: 2013 PMID: 23734762 PMCID: PMC3622672 DOI: 10.1186/1471-2105-14-S5-S5
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Workflow of the DP_LCC method. i) The phenome-interactome network is constructed by integrating the PPI network, disease-gene associations and currently known phenotype similarity data. ii) Diffusion profiles of diseases and genes are calculated by a random walk over the PPI network with restart where similarities between phenotypes are incorporated. iii) Candidate genes are then prioritized according to their similarities with the disease diffusion profile. The numbers on the edges of phenome network indicate their pairwise similarities.
Prediction accuracies of DP_LCC and DP_COS at = 0.25
| DP_LCC | DP_COS | |||
|---|---|---|---|---|
| 0.1 | 406 | 0.7591 | 406 | 0.7590 |
| 0.2 | 412 | 0.7616 | 414 | 0.7606 |
| 0.3 | 417 | 0.7625 | 421 | 0.7624 |
| 0.4 | 422 | 0.7638 | 422 | 0.7625 |
| 0.5 | 422 | 0.7634 | 423 | 0.7618 |
| 0.6 | 421 | 0.7627 | 421 | 0.7612 |
| 0.7 | 419 | 0.7619 | 417 | 0.7609 |
| 0.8 | 419 | 0.7620 | 418 | 0.7605 |
Figure 2ROC curves of DP_LCC by different leave-one-out cross-validation methods. ALI: artificial linkage interval approach; Rand: random genes approach.
Performances of different algorithms at different values of
| Method | 0.1 | 0.2 | 0.25 | 0.3 | 0.4 | 0.5 | 0.6 | 0.7 | 0.8 | 0.9 | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| PRE | 370 | 379 | 380 | 380 | 377 | 371 | 370 | 370 | 369 | ||
| AUC | 0.7549 | 0.7542 | 0.7527 | 0.7507 | 0.7493 | 0.7477 | 0.7465 | 0.7451 | 0.7435 | ||
| PRE | 350 | 352 | 351 | 341 | 329 | 309 | 287 | 263 | 229 | ||
| AUC | 0.7372 | 0.7309 | 0.7235 | 0.7027 | 0.6758 | 0.6427 | 0.6017 | 0.5429 | 0.4371 | ||
| PRE_LCC | 405 | 417 | 419 | 419 | 417 | 415 | 414 | 413 | 407 | ||
| AUC_LCC | 0.7487 | 0.7616 | 0.7634 | 0.7643 | 0.7659 | 0.7674 | 0.7688 | 0.7683 | 0.7694 | ||
| PRE_COS | 407 | 421 | 419 | 421 | 418 | 412 | 411 | 412 | 406 | ||
| AUC_COS | 0.7473 | 0.7604 | 0.7618 | 0.7631 | 0.7637 | 0.7638 | 0.7636 | 0.7619 | 0.7606 | ||
The pensions are divided by/1238; run time of our algorithm is 38 minutes. Computations were performed on a single processor of a dual-core Intel (R) Core P8700 2.53 GHz and 2 GB GB of shared memory.
Figure 3Comparison of the proposed method with two diffusion-based methods.
Figure 4Precision-recall curves of three diffusion-based methods.
Top 10 predicted causal genes of 16 multifactoral diseases predicted by DP-LCC
| Phenotype Name | PhenoID | Top-10 predictions for each phenotype by our algorithm | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| PAH | TYROBP | TREM2 | MAPT | PSEN1 | SIGLEC14 | CD300E | NCR2 | CLEC5A | HLA-DQB1 | ||
| Breast cancer | 114480 | PAH | DHCR24 | SMYD2 | SHISA5 | FBXO11 | MTM1 | TSC1 | TSC2 | MTMR12 | STK11 |
| Colon cancer | 114500 | MLH1 | BRCA2 | MSH2 | PMS2 | VHL | DHCR24 | SHISA5 | SMYD2 | FBXO11 | EXO1 |
| ABCG8 | ABCG5 | PPP1R3A | RP1 | CFTR | B2M | F2 | APC | PLN | CD1E | ||
| Gastric cancer | 137215 | MLH1 | PTCH2 | MSH2 | PMS2 | ESR1 | SMO | PTCH1 | EXO1 | PNLIP | IHH |
| Atrial fibrillation | 147050 | RAG2 | RAG1 | CPN1 | IGF1R | KL | IL2 | IFNA1 | CPN2 | MEN1 | PRSS1 |
| TP53 | RET | DHCR24 | SMYD2 | SHISA5 | FBXO11 | STK11 | NTRK1 | CDKN2A | BCL7A | ||
| Schizophrenia | 181500 | PAH | GLO1 | TYROBP | TREM2 | MAPT | MTM1 | MTMR12 | SIGLEC14 | CD300E | NCR2 |
| Leukemia/lymphoma | 190685 | POMT1 | POMT2 | PAH | TYROBP | TREM2 | CHRNA1 | NSD1 | CHRNE | FGFR2 | CHRND |
| Lung cancer | 211980 | TP53 | CDKN2A | SFTPA2 | SFTPA1 | ZFP91 | ZNF227 | TBRG1 | KIAA1984 | CDKN2AIP | ANKRD12 |
| Zellweger syndrome | 214100 | PEX19 | ETFB | ETFA | PEX6 | PEX1 | PEX12 | POMT2 | POMT1 | SLC25A17 | PEX11A |
| Leukemia | 253310 | BSCL2 | CHRNA1 | CHRNG | DOK7 | C5orf62 | TMEM19 | USE1 | CHRND | RAPSN | MUSK |
| Asthma | 600807 | MARCO | SCGB3A1 | RFXAP | RFX5 | HPS1 | HPS4 | RFXANK | IL6 | MPO | IL2 |
| Leukemia | 601626 | MPL | THPO | AMPD1 | PDGFRB | HES5 | FANCE | FANCD2 | BRCA2 | ATXN2L | MYH2 |
| Obesity | 601665 | MC3R | ATRNL1 | MC1R | MC5R | ASIP | MC2R | NPY | NPY5R | SIGLEC6 | GHRHR |
| Tuberculosis | 607948 | NDUFV2 | UMOD | HSPA4 | AGRP | ASIP | IFNGR2 | HPN | TUBA4A | GGCX | LTB |