| Literature DB >> 31984368 |
Huaiying Lin1,2, Ruichen Rong1,2, Xiang Gao1, Kashi Revanna1,2, Michael Zhao1,2, Petar Bajic3, David Jin4, Chengjun Hu5, Qunfeng Dong1,2.
Abstract
OBJECTIVE: To provide an open-source software package for determining temporal correlations between disease states using longitudinal electronic medical records (EMR).Entities:
Keywords: electronic medical record; survival analysis; temporal correlation
Year: 2019 PMID: 31984368 PMCID: PMC6952009 DOI: 10.1093/jamiaopen/ooz031
Source DB: PubMed Journal: JAMIA Open ISSN: 2574-2531
Figure 1.Flowchart of DCN (from left to right). The software constructs all possible disease pair cohorts from medical records, performs the proposed two methods of survival analysis and displays the Cox-PH regression results in an interactive network.
Figure 2.Time to Drop-off Distribution for disease pair: Kidney transplant to Osteoporosis. X axis denotes the time in days when the drop-off happens, while Y axis denotes the density function of drop-off distribution. Bottom frequency table shows the total drop-off frequency, event frequency, and survival frequency during the study period (10 years).
Figure 3.Screenshot of the interactive network display. Panel A—“Information” shows the exposure disease, outcome disease, coefficient, raw and adjusted P-values, hazard ratio test P-value, and a mini-sized survival plot. Panel B lists out additional evaluation plots which can be found through the links below. Panel C—“Filter” panel provides the users with options to filter edges based on sample size, adjusted P-values and coefficients. Panel D—“Shortest Path” finds the shortest path between any two given diseases. Panel E—“Display” provides several options to display the network in Panel G. Panel F provides download links for both Cox-PH regression and RF survival analysis results in CSV format. Panel G displays the interactive disease network based on Cox-PH regression result. Panel H shows in-degree, out-degree and total degree information for all disease nodes shown in the network. Users can search for a disease of interest through the search box. Panel I provides an example of a Cox-Snell residual plot, while Panel J provides an example of a drop-off distribution plot. More instructions on how to use the network visualization tool are available at https://github.com/qunfengdong/DCN.
Example disease pairs of statistically significant temporal relationship detected by DCN
| From-disease | To-disease | Adjusted hazard ratio | Cohort size | HR test | Cox-PH regression | Literature support |
|---|---|---|---|---|---|---|
| Acute myocardial infarction | Atrial fibrillation | 2.273 | 2002 | 0.198 | 1E–05 |
|
| Alcohol abuse | Depression | 2.398 | 2912 | 0.009 | 3E–24 |
|
| Alzheimer disease | Depression | 2.030 | 2058 | 0.708 | 2E–07 |
|
| Asthma | Chronic obstructive pulmonary disease | 2.255 | 9548 | 0.053 | 2E–24 |
|
| Cataract | Glaucoma | 2.495 | 9482 | 0.013 | 2E–31 |
|
| Cerebrovascular disease | Septicemia | 6.164 | 1098 | 0.821 | 9E–08 | No obvious literature support |
| Cerebrovascular disease | Senile dementia | 4.879 | 1098 | 0.569 | 2E–08 |
|
| Chronic kidney disease | Kidney transplant | 9.094 | 9622 | 0.735 | 9E–39 | Well-established relationship |
| Chronic kidney disease | Septicemia | 2.281 | 9622 | 0.916 | 5E–20 |
|
| Chronic obstructive pulmonary disease | Alcohol abuse | 3.490 | 10322 | 0.474 | 7E–09 |
|
| Chronic obstructive pulmonary disease | Asthma | 3.364 | 10 322 | 0.016 | 4E–99 |
|
| Crohn disease | Ulcerative colitis | 22.203 | 1492 | 0.538 | 5E–09 |
|
| Crohn disease | Anemia | 2.037 | 1492 | 0.309 | 7E–11 |
|
| Depression | Alcohol abuse | 2.853 | 10 158 | 0.448 | 6E–08 |
|
| Depression | Alzheimer disease | 2.799 | 10 158 | 0.636 | 2E–08 |
|
| Depression | Senile dementia | 2.065 | 10 158 | 0.480 | 9E–10 |
|
| Eye conditions (cataracts) | Glaucoma | 2.961 | 12 850 | 0.000 | 1E–70 |
|
| Glaucoma | Cataract | 2.269 | 7092 | 0.544 | 6E–40 |
|
| Heart failure | Acute myocardial infarction | 2.243 | 11 764 | 0.849 | 1E–08 |
|
| Heart failure | Chronic obstructive pulmonary disease | 2.032 | 11 764 | 0.966 | 7E–35 |
|
| Ischemic heart disease | Hyperlipidemia | 2.152 | 12 618 | 0.000 | 9E–146 |
|
| Kidney transplant | Osteoporosis | 3.202 | 2438 | 0.764 | 2E–14 |
|
| Kidney transplant | Diabetes | 3.009 | 2438 | 0.000 | 7E–39 |
|
| Kidney transplant | Septicemia | 2.758 | 2438 | 0.506 | 1E–08 |
|
| Kidney transplant | Anemia | 2.741 | 2438 | 0.000 | 1E–46 |
|
| Kidney transplant | Hyperlipidemia | 2.426 | 2438 | 0.007 | 4E–46 |
|
| Obesity | Sleep Apnea | 2.072 | 11 202 | 0.002 | 5E–36 |
|
| Senile dementia | Parkinson’s disease | 2.949 | 5088 | 0.662 | 6E–06 |
|
| Senile dementia | Depression | 2.194 | 5088 | 0.077 | 2E–22 |
|
| Ulcerative colitis | Crohn disease | 29.319 | 1622 | 0.975 | 2E–13 |
|
| Valvular disease | Atrial fibrillation | 2.455 | 12 690 | 0.000 | 3E–70 |
|
Note: From left to right, it lists the exposure disease (From-Disease), outcome disease (To-Disease), the confounding factor adjusted hazard ratio estimated from Cox-PH regression, cohort size, hazard ratio test for proportional hazards assumption for Cox regression, Bonferroni adjusted Cox-PH regression P-value, and literature support or evaluation of the relationship for this disease pair.
Figure 4.Network display of the selected statistically significant disease pairs. Each disease is a node (orange). Node size is proportional to its in-degree. Temporal relationships between disease pairs are edges (gray) with arrow heads denoting direction. The thickness of the edges are proportional to their coefficients.