| Literature DB >> 24260251 |
Chad Kimmel1, Shyam Visweswaran.
Abstract
BACKGROUND: Candidate gene prioritization aims to identify promising new genes associated with a disease or a biological process from a larger set of candidate genes. In recent years, network-based methods - which utilize a knowledge network derived from biological knowledge - have been utilized for gene prioritization. Biological knowledge can be encoded either through the network's links or nodes. Current network-based methods can only encode knowledge through links. This paper describes a new network-based method that can encode knowledge in links as well as in nodes.Entities:
Mesh:
Year: 2013 PMID: 24260251 PMCID: PMC3834271 DOI: 10.1371/journal.pone.0079564
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1A small knowledge network.
Figure 2The components, inputs and output of KNGP algorithm.
Figure 3Pseudocode for the KNGP algorithm.
Specification of node weights for each group in the synthetic networks.
| Dataset | Node Weights | ||||
| Group 1 | Group 2 | Group 3 | Group 4 | Group 5 | |
| 1 | rand(0,1) | rand(0,1) | rand(0,1) | rand(0,1) | rand(0,1) |
| 2 |
|
| rand(0,0.5) | rand(0,0.5) | rand(0,0.5) |
| 3 |
|
| rand(0.5,1) | rand(0.5,1) | rand(0.5,1) |
| 4 |
|
| rand(0,1) | rand(0,1) | rand(0,1) |
Specification of link weights for each group in the synthetic networks.
| Dataset | Link Weights | ||||
| Group 1 | Group 2 | Group 3 | Group 4 | Group 5 | |
| 1 |
|
| rand(0,0.5) | rand(0,0.5) | rand(0,0.5) |
| 2 | rand(0,1) | rand(0,1) | rand(0,1) | rand(0,1) | rand(0,1) |
| 3 |
|
| rand(0.5,1) | rand(0.5,1) | rand(0.5,1) |
| 4 |
|
| rand(0,1) | rand(0,1) | rand(0,1) |
Specification of link weights between groups in the synthetic networks.
| Dataset | Link Weights | |||||||||
| Group 1–2 | Group 1–3 | Group 1–4 | Group 1–5 | Group 2–3 | Group 2–4 | Group 2–5 | Group 3–4 | Group 3–5 | Group 4–5 | |
| 1 |
| rand(0,0.50) | rand(0,0.5) | rand(0,0.5) | rand(0,0.5) | rand(0,0.5) | rand(0,0.5) | rand(0,0.5) | rand(0,0.5) | rand(0,0.5) |
| 2 | rand(0,1) | rand(0,1) | rand(0,1) | rand(0,1) | rand(0,1) | rand(0,1) | rand(0,1) | rand(0,1) | rand(0,1) | rand(0,1) |
| 3 |
| rand(0.50,1) | rand(0.5,1) | rand(0.5,1) | rand(0.5,1) | rand(0.5,1) | rand(0.5,1) | rand(0.5,1) | rand(0.5,1) | rand(0.5,1) |
| 4 |
|
| rand(0,1) | rand(0,1) | rand(0,1) | rand(0,1) | rand(0,1) | rand(0,1) | rand(0,1) | rand(0,1) |
Number of genes known to associated with each of the 19 experimental diseases.
| Disease | Number of genes |
| Rheumatoid Arthritis | 24 |
| Parkinson's Disease | 21 |
| Celiac Disease | 16 |
| Esophageal Cancer | 8 |
| Hepatitis C | 8 |
| Crohn's Disease | 17 |
| Breast Cancer | 27 |
| Asthma | 29 |
| Alzheimer's Disease | 21 |
| Ulcerative Colitis | 24 |
| Endometriosis | 5 |
| Lymphoma | 7 |
| Osteoarthritis | 8 |
| Epilepsy | 6 |
| Atherosclerosis | 43 |
| Pancreatitis | 6 |
| Cirrhosis | 7 |
| Myocardial Infarction | 32 |
| Tuberculosis | 12 |
Figure 4Evaluation protocol.
AUCs for various values of f for the four synthetic datasets.
| Dataset |
|
|
|
|
|
|
| 1 | 0.602 | 0.651 | 0.991 |
|
|
|
| 2 |
| 0.999 | 0.996 | 0.877 | 0.467 | 0.461 |
| 3 | 0.898 | 0.901 | 0.941 |
| 0.924 | 0.922 |
| 4 | 0.974 | 0.978 |
| 0.975 | 0.897 | 0.895 |
AUCs for biological networks that contain weighted links.
| Disease | PPI+GOM | PPI+GOB | PPI+GOC |
| Rheumatoid Arthritis | 0.750 | 0.830 | 0.798 |
| Parkinson's Disease | 0.652 | 0.668 | 0.668 |
| Celiac Disease | 0.744 | 0.814 | 0.795 |
| Esophageal Cancer | 0.840 | 0.871 | 0.858 |
| Hepatitis C | 0.502 | 0.764 | 0.759 |
| Crohn's Disease | 0.850 | 0.862 | 0.846 |
| Breast Cancer | 0.866 | 0.872 | 0.865 |
| Asthma | 0.797 | 0.856 | 0.825 |
| Alzheimer's Disease | 0.807 | 0.843 | 0.828 |
| Ulcerative Colitis | 0.740 | 0.706 | 0.738 |
| Endometriosis | 0.747 | 0.953 | 0.944 |
| Lymphoma | 0.770 | 0.875 | 0.872 |
| Osteoarthritis | 0.840 | 0.778 | 0.837 |
| Epilepsy | 0.579 | 0.622 | 0.612 |
| Atherosclerosis | 0.880 | 0.840 | 0.827 |
| Pancreatitis | 0.852 | 0.715 | 0.865 |
| Cirrhosis | 0.525 | 0.689 | 0.683 |
| Myocardial Infarction | 0.884 | 0.892 | 0.880 |
| Tuberculosis | 0.800 | 0.887 | 0.876 |
|
|
|
|
|
AUCs for biological networks with link weights only, node weights only and combined link and node weights.
| Disease | PPI+GOC link weight network | GO node weight network | PPI+GOC and GO combined network |
| Rheumatoid Arthritis | 0.798 | 0.770 | 0.835 |
| Parkinson's Disease | 0.668 | 0.724 | 0.734 |
| Celiac Disease | 0.795 | 0.775 | 0.807 |
| Esophageal Cancer | 0.858 | 0.876 | 0.853 |
| Hepatitis C | 0.759 | 0.774 | 0.756 |
| Crohn's Disease | 0.846 | 0.808 | 0.847 |
| Breast Cancer | 0.865 | 0.855 | 0.867 |
| Asthma | 0.825 | 0.794 | 0.845 |
| Alzheimer's Disease | 0.828 | 0.868 | 0.863 |
| Ulcerative Colitis | 0.738 | 0.701 | 0.740 |
| Endometriosis | 0.944 | 0.758 | 0.986 |
| Lymphoma | 0.872 | 0.910 | 0.918 |
| Osteoarthritis | 0.837 | 0.803 | 0.858 |
| Epilepsy | 0.612 | 0.710 | 0.718 |
| Atherosclerosis | 0.827 | 0.885 | 0.896 |
| Pancreatitis | 0.865 | 0.755 | 0.878 |
| Cirrhosis | 0.683 | 0.579 | 0.666 |
| Myocardial Infarction | 0.880 | 0.885 | 0.907 |
| Tuberculosis | 0.876 | 0.833 | 0.943 |
|
|
|
|
|
| p-value | 0.02/0.02 |
Top five ranked candidate proteins for asthma.
|
|
| Q13224 (GRIN2B) |
| P24394 (IL4R) |
| P29460 (IL12B) |
| P48357 (LEPR) |