Literature DB >> 34109054

LogPar: Logistic PARAFAC2 Factorization for Temporal Binary Data with Missing Values.

Kejing Yin1, Ardavan Afshar2, Joyce C Ho3, William K Cheung1, Chao Zhang2, Jimeng Sun4.   

Abstract

Binary data with one-class missing values are ubiquitous in real-world applications. They can be represented by irregular tensors with varying sizes in one dimension, where value one means presence of a feature while zero means unknown (i.e., either presence or absence of a feature). Learning accurate low-rank approximations from such binary irregular tensors is a challenging task. However, none of the existing models developed for factorizing irregular tensors take the missing values into account, and they assume Gaussian distributions, resulting in a distribution mismatch when applied to binary data. In this paper, we propose Logistic PARAFAC2 (LogPar) by modeling the binary irregular tensor with Bernoulli distribution parameterized by an underlying real-valued tensor. Then we approximate the underlying tensor with a positive-unlabeled learning loss function to account for the missing values. We also incorporate uniqueness and temporal smoothness regularization to enhance the interpretability. Extensive experiments using large-scale real-world datasets show that LogPar outperforms all baselines in both irregular tensor completion and downstream predictive tasks. For the irregular tensor completion, LogPar achieves up to 26% relative improvement compared to the best baseline. Besides, LogPar obtains relative improvement of 13.2% for heart failure prediction and 14% for mortality prediction on average compared to the state-of-the-art PARAFAC2 models.

Entities:  

Keywords:  PARAFAC2 factorization; binary tensor completion; computational phenotyping; tensor factorization

Year:  2020        PMID: 34109054      PMCID: PMC8186437          DOI: 10.1145/3394486.3403213

Source DB:  PubMed          Journal:  KDD        ISSN: 2154-817X


  9 in total

1.  Learning the parts of objects by non-negative matrix factorization.

Authors:  D D Lee; H S Seung
Journal:  Nature       Date:  1999-10-21       Impact factor: 49.962

2.  Temporal phenotyping of medically complex children via PARAFAC2 tensor factorization.

Authors:  Ioakeim Perros; Evangelos E Papalexakis; Richard Vuduc; Elizabeth Searles; Jimeng Sun
Journal:  J Biomed Inform       Date:  2019-02-08       Impact factor: 6.317

3.  Rank-One Matrix Completion With Automatic Rank Estimation via L1-Norm Regularization.

Authors:  Qiquan Shi; Haiping Lu; Yiu-Ming Cheung
Journal:  IEEE Trans Neural Netw Learn Syst       Date:  2017-12-11       Impact factor: 10.451

4.  Tensor Factorization for Low-Rank Tensor Completion.

Authors:  Pan Zhou; Canyi Lu; Zhouchen Lin; Chao Zhang
Journal:  IEEE Trans Image Process       Date:  2017-10-12       Impact factor: 10.856

5.  COPA: Constrained PARAFAC2 for Sparse & Large Datasets.

Authors:  Ardavan Afshar; Ioakeim Perros; Evangelos E Papalexakis; Elizabeth Searles; Joyce Ho; Jimeng Sun
Journal:  Proc ACM Int Conf Inf Knowl Manag       Date:  2018-10

6.  TASTE: Temporal and Static Tensor Factorization for Phenotyping Electronic Health Records.

Authors:  Ardavan Afshar; Ioakeim Perros; Haesun Park; Christopher deFilippi; Xiaowei Yan; Walter Stewart; Joyce Ho; Jimeng Sun
Journal:  Proc ACM Conf Health Inference Learn (2020)       Date:  2020-04

7.  Limestone: high-throughput candidate phenotype generation via tensor factorization.

Authors:  Joyce C Ho; Joydeep Ghosh; Steve R Steinhubl; Walter F Stewart; Joshua C Denny; Bradley A Malin; Jimeng Sun
Journal:  J Biomed Inform       Date:  2014-07-16       Impact factor: 6.317

8.  Discriminative and Distinct Phenotyping by Constrained Tensor Factorization.

Authors:  Yejin Kim; Robert El-Kareh; Jimeng Sun; Hwanjo Yu; Xiaoqian Jiang
Journal:  Sci Rep       Date:  2017-04-25       Impact factor: 4.379

9.  MIMIC-III, a freely accessible critical care database.

Authors:  Alistair E W Johnson; Tom J Pollard; Lu Shen; Li-Wei H Lehman; Mengling Feng; Mohammad Ghassemi; Benjamin Moody; Peter Szolovits; Leo Anthony Celi; Roger G Mark
Journal:  Sci Data       Date:  2016-05-24       Impact factor: 6.444

  9 in total
  2 in total

1.  Exploring dynamic metabolomics data with multiway data analysis: a simulation study.

Authors:  Lu Li; Huub Hoefsloot; Albert A de Graaf; Evrim Acar; Age K Smilde
Journal:  BMC Bioinformatics       Date:  2022-01-10       Impact factor: 3.169

2.  Tracing Evolving Networks Using Tensor Factorizations vs. ICA-Based Approaches.

Authors:  Evrim Acar; Marie Roald; Khondoker M Hossain; Vince D Calhoun; Tülay Adali
Journal:  Front Neurosci       Date:  2022-04-25       Impact factor: 5.152

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.