Literature DB >> 31452969

Rubik: Knowledge Guided Tensor Factorization and Completion for Health Data Analytics.

Yichen Wang1, Robert Chen1, Joydeep Ghosh2, Joshua C Denny3, Abel Kho4, You Chen3, Bradley A Malin3, Jimeng Sun1.   

Abstract

Computational phenotyping is the process of converting heterogeneous electronic health records (EHRs) into meaningful clinical concepts. Unsupervised phenotyping methods have the potential to leverage a vast amount of labeled EHR data for phenotype discovery. However, existing unsupervised phenotyping methods do not incorporate current medical knowledge and cannot directly handle missing, or noisy data. We propose Rubik, a constrained non-negative tensor factorization and completion method for phenotyping. Rubik incorporates 1) guidance constraints to align with existing medical knowledge, and 2) pairwise constraints for obtaining distinct, non-overlapping phenotypes. Rubik also has built-in tensor completion that can significantly alleviate the impact of noisy and missing data. We utilize the Alternating Direction Method of Multipliers (ADMM) framework to tensor factorization and completion, which can be easily scaled through parallel computing. We evaluate Rubik on two EHR datasets, one of which contains 647,118 records for 7,744 patients from an outpatient clinic, the other of which is a public dataset containing 1,018,614 CMS claims records for 472,645 patients. Our results show that Rubik can discover more meaningful and distinct phenotypes than the baselines. In particular, by using knowledge guidance constraints, Rubik can also discover sub-phenotypes for several major diseases. Rubik also runs around seven times faster than current state-of-the-art tensor methods. Finally, Rubik is scalable to large datasets containing millions of EHR records.

Entities:  

Keywords:  Computational Phenotyping; Constraint Optimization; Healthcare Analytics; Tensor Analysis

Year:  2015        PMID: 31452969      PMCID: PMC6709413          DOI: 10.1145/2783258.2783395

Source DB:  PubMed          Journal:  KDD        ISSN: 2154-817X


  6 in total

1.  Detecting time-evolving phenotypic topics via tensor factorization on electronic health records: Cardiovascular disease case study.

Authors:  Juan Zhao; Yun Zhang; David J Schlueter; Patrick Wu; Vern Eric Kerchberger; S Trent Rosenbloom; Quinn S Wells; QiPing Feng; Joshua C Denny; Wei-Qi Wei
Journal:  J Biomed Inform       Date:  2019-08-22       Impact factor: 6.317

2.  Temporal phenotyping for transitional disease progress: An application to epilepsy and Alzheimer's disease.

Authors:  Yejin Kim; Samden Lhatoo; Guo-Qiang Zhang; Luyao Chen; Xiaoqian Jiang
Journal:  J Biomed Inform       Date:  2020-06-18       Impact factor: 6.317

3.  COPA: Constrained PARAFAC2 for Sparse & Large Datasets.

Authors:  Ardavan Afshar; Ioakeim Perros; Evangelos E Papalexakis; Elizabeth Searles; Joyce Ho; Jimeng Sun
Journal:  Proc ACM Int Conf Inf Knowl Manag       Date:  2018-10

4.  Privacy-Preserving Tensor Factorization for Collaborative Health Data Analysis.

Authors:  Jing Ma; Qiuchen Zhang; Jian Lou; Joyce C Ho; Li Xiong; Xiaoqian Jiang
Journal:  Proc ACM Int Conf Inf Knowl Manag       Date:  2019-11

5.  Phenotype Inference with Semi-Supervised Mixed Membership Models.

Authors:  Victor A Rodriguez; Adler Perotte
Journal:  Proc Mach Learn Res       Date:  2019-08

6.  Phenotype Instance Verification and Evaluation Tool (PIVET): A Scaled Phenotype Evidence Generation Framework Using Web-Based Medical Literature.

Authors:  Jette Henderson; Junyuan Ke; Joyce C Ho; Joydeep Ghosh; Byron C Wallace
Journal:  J Med Internet Res       Date:  2018-05-04       Impact factor: 5.428

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.