Literature DB >> 33659966

TASTE: Temporal and Static Tensor Factorization for Phenotyping Electronic Health Records.

Ardavan Afshar1, Ioakeim Perros2, Haesun Park1, Christopher deFilippi3, Xiaowei Yan4, Walter Stewart5, Joyce Ho6, Jimeng Sun1.   

Abstract

Phenotyping electronic health records (EHR) focuses on defining meaningful patient groups (e.g., heart failure group and diabetes group) and identifying the temporal evolution of patients in those groups. Tensor factorization has been an effective tool for phenotyping. Most of the existing works assume either a static patient representation with aggregate data or only model temporal data. However, real EHR data contain both temporal (e.g., longitudinal clinical visits) and static information (e.g., patient demographics), which are difficult to model simultaneously. In this paper, we propose Temporal And Static TEnsor factorization (TASTE) that jointly models both static and temporal information to extract phenotypes. TASTE combines the PARAFAC2 model with non-negative matrix factorization to model a temporal and a static tensor. To fit the proposed model, we transform the original problem into simpler ones which are optimally solved in an alternating fashion. For each of the sub-problems, our proposed mathematical re-formulations lead to efficient sub-problem solvers. Comprehensive experiments on large EHR data from a heart failure (HF) study confirmed that TASTE is up to 14× faster than several baselines and the resulting phenotypes were confirmed to be clinically meaningful by a cardiologist. Using 60 phenotypes extracted by TASTE, a simple logistic regression can achieve the same level of area under the curve (AUC) for HF prediction compared to a deep learning model using recurrent neural networks (RNN) with 345 features.

Entities:  

Keywords:  Computational Phenotyping; Predictive modeling; Tensor Factorization

Year:  2020        PMID: 33659966      PMCID: PMC7924914          DOI: 10.1145/3368555.3384464

Source DB:  PubMed          Journal:  Proc ACM Conf Health Inference Learn (2020)


  1 in total

1.  LogPar: Logistic PARAFAC2 Factorization for Temporal Binary Data with Missing Values.

Authors:  Kejing Yin; Ardavan Afshar; Joyce C Ho; William K Cheung; Chao Zhang; Jimeng Sun
Journal:  KDD       Date:  2020-08
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.