Literature DB >> 33632116

Sparse data embedding and prediction by tropical matrix factorization.

Amra Omanović1, Hilal Kazan2, Polona Oblak1, Tomaž Curk3.   

Abstract

BACKGROUND: Matrix factorization methods are linear models, with limited capability to model complex relations. In our work, we use tropical semiring to introduce non-linearity into matrix factorization models. We propose a method called Sparse Tropical Matrix Factorization (STMF) for the estimation of missing (unknown) values in sparse data.
RESULTS: We evaluate the efficiency of the STMF method on both synthetic data and biological data in the form of gene expression measurements downloaded from The Cancer Genome Atlas (TCGA) database. Tests on unique synthetic data showed that STMF approximation achieves a higher correlation than non-negative matrix factorization (NMF), which is unable to recover patterns effectively. On real data, STMF outperforms NMF on six out of nine gene expression datasets. While NMF assumes normal distribution and tends toward the mean value, STMF can better fit to extreme values and distributions.
CONCLUSION: STMF is the first work that uses tropical semiring on sparse data. We show that in certain cases semirings are useful because they consider the structure, which is different and simpler to understand than it is with standard linear algebra.

Entities:  

Keywords:  Data embedding; Matrix completion; Matrix factorization; Sparse data; Tropical factorization; Tropical semiring

Mesh:

Year:  2021        PMID: 33632116      PMCID: PMC7908717          DOI: 10.1186/s12859-021-04023-9

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  12 in total

1.  Learning the parts of objects by non-negative matrix factorization.

Authors:  D D Lee; H S Seung
Journal:  Nature       Date:  1999-10-21       Impact factor: 49.962

2.  Fast optimal leaf ordering for hierarchical clustering.

Authors:  Z Bar-Joseph; D K Gifford; T S Jaakkola
Journal:  Bioinformatics       Date:  2001       Impact factor: 6.937

3.  Metagenes and molecular pattern discovery using matrix factorization.

Authors:  Jean-Philippe Brunet; Pablo Tamayo; Todd R Golub; Jill P Mesirov
Journal:  Proc Natl Acad Sci U S A       Date:  2004-03-11       Impact factor: 11.205

4.  Data Fusion by Matrix Factorization.

Authors:  Marinka Žitnik; Blaž Zupan
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2015-01       Impact factor: 6.226

5.  Max-min distance nonnegative matrix factorization.

Authors:  Jim Jing-Yan Wang; Xin Gao
Journal:  Neural Netw       Date:  2014-10-26

6.  Integrative analysis of complex cancer genomics and clinical profiles using the cBioPortal.

Authors:  Jianjiong Gao; Bülent Arman Aksoy; Ugur Dogrusoz; Gideon Dresdner; Benjamin Gross; S Onur Sumer; Yichao Sun; Anders Jacobsen; Rileen Sinha; Erik Larsson; Ethan Cerami; Chris Sander; Nikolaus Schultz
Journal:  Sci Signal       Date:  2013-04-02       Impact factor: 8.192

7.  The cBio cancer genomics portal: an open platform for exploring multidimensional cancer genomics data.

Authors:  Ethan Cerami; Jianjiong Gao; Ugur Dogrusoz; Benjamin E Gross; Selcuk Onur Sumer; Bülent Arman Aksoy; Anders Jacobsen; Caitlin J Byrne; Michael L Heuer; Erik Larsson; Yevgeniy Antipin; Boris Reva; Arthur P Goldberg; Chris Sander; Nikolaus Schultz
Journal:  Cancer Discov       Date:  2012-05       Impact factor: 39.397

8.  Discovery of multi-dimensional modules by integrative analysis of cancer genomic data.

Authors:  Shihua Zhang; Chun-Chi Liu; Wenyuan Li; Hui Shen; Peter W Laird; Xianghong Jasmine Zhou
Journal:  Nucleic Acids Res       Date:  2012-08-08       Impact factor: 16.971

9.  Orthogonal matrix factorization enables integrative analysis of multiple RNA binding proteins.

Authors:  Martin Stražar; Marinka Žitnik; Blaž Zupan; Jernej Ule; Tomaž Curk
Journal:  Bioinformatics       Date:  2016-01-18       Impact factor: 6.937

10.  Optimization and expansion of non-negative matrix factorization.

Authors:  Xihui Lin; Paul C Boutros
Journal:  BMC Bioinformatics       Date:  2020-01-06       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.