| Literature DB >> 29244004 |
Pakeeza Akram1, Li Liao2.
Abstract
BACKGROUND: Identification of common genes associated with comorbid diseases can be critical in understanding their pathobiological mechanism. This work presents a novel method to predict missing common genes associated with a disease pair. Searching for missing common genes is formulated as an optimization problem to minimize network based module separation from two subgraphs produced by mapping genes associated with disease onto the interactome.Entities:
Keywords: Comorbidity; Disease module separation; Interactome; Missing gene; Optimization
Mesh:
Year: 2017 PMID: 29244004 PMCID: PMC5731604 DOI: 10.1186/s12864-017-4272-7
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Illustration of network separation calculation
Average ROC Scores with standard deviation, precision and recall for various comorbidity ranges
| Comorbidity Range | |||||
|---|---|---|---|---|---|
| 0-8000 | 0-1 | 1-2 | 2-3 | >3 | |
| Number of Disease Pairs | 605 | 133 | 248 | 76 | 148 |
| Average ROC Score (Shortest Distance) | 0.947 | 0.966 | 0.950 | 0.952 | 0.920 |
| Stddev (Shortest Distance) | 0.094 | 0.063 | 0.089 | 0.072 | 0.124 |
| Average ROC Score (Average Distance) | 0.491 | 0.513 | 0.495 | 0.508 | 0.458 |
| Stdev (Average Distance) | 0.279 | 0.279 | 0.288 | 0.269 | 0.269 |
| Average ROC Score (Randomization) | 0.601 | 0.606 | 0.614 | 0.555 | 0.599 |
| Stedev (Randomization) | 0.278 | 0.282 | 0.287 | 0.258 | 0.2468 |
| Average Precision (Shortest Distance) | 0.88 | 0.88 | 0.85 | 0.89 | 0.96 |
| Stddev (Shortest Distance) | 0.27 | 0.28 | 0.31 | 0.25 | 0.15 |
| Average Precision (Average Distance) | 0.72 | 0.72 | 0.71 | 0.69 | 0.64 |
| Stdev (Average Distance) | 0.311 | 0.31 | 0.32 | 0.33 | 0.30 |
| Average Precision (Randomization) | 0.66 | 0.70 | 0.63 | 0.66 | 0.72 |
| Stedev (Randomization) | 0.29 | 0.28 | 0.29 | 0.30 | 0.29 |
| Average Recall (Shortest Distance) | 0.91 | 0.94 | 0.93 | 0.93 | 0.88 |
| Stddev (Shortest Distance) | 0.13 | 0.11 | 0.13 | 0.09 | 0.16 |
| Average Recall (Average Distance) | 0.69 | 0.72 | 0.70 | 0.70 | 0.64 |
| Stdev (Average Distance) | 0.30 | 0.28 | 0.30 | 0.31 | 0.30 |
| Average Recall (Randomization) | 0.78 | 0.80 | 0.79 | 0.73 | 0.76 |
| Stedev (Randomization) | 0.26 | 0.25 | 0.26 | 0.26 | 0.25 |
Fig. 2Bar chart for average ROC Score, average Precision and average Recall across comorbidity range
Fig. 3Histogram of ROC Scores. A: comorbidity range 0 ~ 1; B: comorbidity range 1~2; C: comorbidity range 2 ~3; D: comorbidity range > 3; E: comorbidity range 0 ~ 10,000; F: randomized common genes; G: SAB based on average distance
Effect of the size of training set and the range of RR on prediction performance
| ROC Score Range | Comorbidity Range | ||||
|---|---|---|---|---|---|
| 0-8000 | 0-1 | 1-2 | 2-3 | >3 | |
| i) 0 ~ 5 Common Genes | |||||
| 0.5-0.6 | 2 | 0 | 2 | 0 | 0 |
| 0.7-0.8 | 5 | 2 | 3 | 0 | 0 |
| 0.9-1.0 | 174 | 46 | 81 | 18 | 29 |
| Total | 181 | 48 | 86 | 18 | 29 |
| ii) 5 ~ 10 Common Genes | |||||
| 0.5-0.6 | 0 | 0 | 0 | 0 | 0 |
| 0.7-0.8 | 2 | 0 | 2 | 0 | 0 |
| 0.9-1.0 | 121 | 36 | 48 | 15 | 22 |
| Total | 123 | 36 | 50 | 15 | 22 |
| iii) 10 - 15 Common Genes | |||||
| 0.5-0.6 | 0 | 0 | 0 | 0 | 0 |
| 0.7-0.8 | 1 | 0 | 0 | 0 | 1 |
| 0.9-1.0 | 46 | 12 | 21 | 4 | 9 |
| Total | 47 | 12 | 21 | 4 | 10 |
| iv) 15 or more Common Genes | |||||
| 0.5-0.6 | 10 | 1 | 3 | 0 | 6 |
| 0.7-0.8 | 24 | 1 | 6 | 4 | 13 |
| 0.9-1.0 | 220 | 35 | 82 | 35 | 68 |
| Total | 254 | 37 | 91 | 39 | 87 |