Literature DB >> 33251040

Algorithms and Applications to Weighted Rank-one Binary Matrix Factorization.

Haibing Lu1, X I Chen2, Junmin Shi3, Jaideep Vaidya4, Vijayalakshmi Atluri4, Yuan Hong5, Wei Huang6.   

Abstract

Many applications use data that are better represented in the binary matrix form, such as click-stream data, market basket data, document-term data, user-permission data in access control, and others. Matrix factorization methods have been widely used tools for the analysis of high-dimensional data, as they automatically extract sparse and meaningful features from data vectors. However, existing matrix factorization methods do not work well for the binary data. One crucial limitation is interpretability, as many matrix factorization methods decompose an input matrix into matrices with fractional or even negative components, which are hard to interpret in many real settings. Some matrix factorization methods, like binary matrix factorization, do limit decomposed matrices to binary values. However, these models are not flexible to accommodate some data analysis tasks, like trading off summary size with quality and discriminating different types of approximation errors. To address those issues, this article presents weighted rank-one binary matrix factorization, which is to approximate a binary matrix by the product of two binary vectors, with parameters controlling different types of approximation errors. By systematically running weighted rank-one binary matrix factorization, one can effectively perform various binary data analysis tasks, like compression, clustering, and pattern discovery. Theoretical properties on weighted rank-one binary matrix factorization are investigated and its connection to problems in other research domains are examined. As weighted rank-one binary matrix factorization in general is NP-hard, efficient and effective algorithms are presented. Extensive studies on applications of weighted rank-one binary matrix factorization are also conducted.

Entities:  

Keywords:  Discrete data; clustering; compression; pattern discovery

Year:  2020        PMID: 33251040      PMCID: PMC7695232          DOI: 10.1145/3386599

Source DB:  PubMed          Journal:  ACM Trans Manag Inf Syst        ISSN: 2158-656X


  4 in total

1.  Learning the parts of objects by non-negative matrix factorization.

Authors:  D D Lee; H S Seung
Journal:  Nature       Date:  1999-10-21       Impact factor: 49.962

2.  Improving molecular cancer class discovery through sparse non-negative matrix factorization.

Authors:  Yuan Gao; George Church
Journal:  Bioinformatics       Date:  2005-11-01       Impact factor: 6.937

3.  Structured Low-Rank Matrix Factorization: Global Optimality, Algorithms, and Applications.

Authors:  Benjamin D Haeffele; Rene Vidal
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2019-02-19       Impact factor: 6.226

4.  Non-Negative Matrix Factorizations for Multiplex Network Analysis.

Authors:  Vladimir Gligorijevic; Yannis Panagakis; Stefanos Zafeiriou
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2018-04-04       Impact factor: 6.226

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.