| Literature DB >> 26563282 |
Chih-Wei Huang1,2, Richard Lu1,2, Usman Iqbal1,2, Shen-Hsien Lin1,2, Phung Anh Alex Nguyen1,2, Hsuan-Chia Yang2,3, Chun-Fu Wang4, Jianping Li4, Kwan-Liu Ma5, Yu-Chuan Jack Li6,7,8, Wen-Shan Jian9,10,11.
Abstract
BACKGROUND: Electronic medical records (EMRs) contain vast amounts of data that is of great interest to physicians, clinical researchers, and medial policy makers. As the size, complexity, and accessibility of EMRs grow, the ability to extract meaningful information from them has become an increasingly important problem to solve.Entities:
Mesh:
Year: 2015 PMID: 26563282 PMCID: PMC4643519 DOI: 10.1186/s12911-015-0218-7
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Fig. 1Data transformation processes. The data transformation steps behind the visual analysis process followed the analysis process from the raw patient records to the final visualization
Fig. 2The data and control flow of visual analysis process from user perspective. The data flows through a sequence of operators, which were adjustable and associated with different interactions by the user
Fig. 3Time course of 14,567 CKD patients clustered by comorbidities. 14,567 CKD patients clustered according to comorbidities on the timeline. The x-axis showed the timeline covering 12 years before and after each patient who got a diagnosis of CKD, while the y-axis presents the clusters of trajectories for each CKD patient
Fig. 4Classifying patients into uniform cohorts. The flow showed an overview of the data analysis process in this study. The visual analysis process was based on CKD research dataset
Factor association rules
| Diseases / Procedures (Abbreviation) | ICD-9 / Procedure code |
|---|---|
| Cerebrovascular Disease(CVD) | 430–438 |
| Chronic Kidney Disease(CKD) | 585, 586 |
| Congestive Heart Failure(CHF) | 398.91, 402, 404, 425.4-425.9,428 |
| Coronary Artery Disease(CAD) | 410–414 |
| Diabetes mellitus(DM) | 250 |
| Glomerulonephritis(GN) | 582 |
| Hemodialysis(HD) | 58001C,58019C,58020C,58021C,58022C,58023C,58024C,58025C,58027C,58029C,58030B |
| Hyperlipidemia | 272 |
| Hypertension(HTN) | 401 |
| Peritoneal Dialysis(PD) | 58002C,58009B,58010B,58011C,58012B,58017C,58028C |
| Polycystic Kidney Disease(PKD) | 75312 |
| Proteinuria | 791 |
| Renal stone | 592 |
| Renal transplantation(RTPL) | V420 |
| Systemic Lupus Erythematosus(SLE) | 7100 |
Fig. 5Frequency-based Cohort Clustering: Sankey Diagrams for CKD Cohort Sizes of < 250. The trajectories were simplified where larger cohorts were kept and smaller ones were merged into a single “others” group. The light green for others without CKD and light orange for others with CKD
Fig. 6The system visualization displayed with the threshold adjust to 150. The user could enlarge/lower the threshold to reveal different size of cohorts
Fig. 7Explore causal relationship (12,960 patients). a There were 70.2 % of the patients who took HD in the first year of CKD did not develop any other factors, while the rest of them either took PD, RTPL, or died. b After filtering the unconfident associations, the remaining associations only covers 17.4 % of the population. c To perform hierarchical clustering on the patients at the pre-CKD stage and generate ten groups of similar patients. d When we highlighted the group who had a common factor of systemic lupus erythematosus (SLE), we found that none of them took the more serious procedures such as renal transplantation or died. Note: There are three groups labelled “*” because of the groups have no common factor shared by all members in the group