Literature DB >> 24467759

Probabilistic non-negative matrix factorization: theory and application to microarray data analysis.

Belhassen Bayar1, Nidhal Bouaynaya, Roman Shterenberg.   

Abstract

Non-negative matrix factorization (NMF) has proven to be a useful decomposition technique for multivariate data, where the non-negativity constraint is necessary to have a meaningful physical interpretation. NMF reduces the dimensionality of non-negative data by decomposing it into two smaller non-negative factors with physical interpretation for class discovery. The NMF algorithm, however, assumes a deterministic framework. In particular, the effect of the data noise on the stability of the factorization and the convergence of the algorithm are unknown. Collected data, on the other hand, is stochastic in nature due to measurement noise and sometimes inherent variability in the physical process. This paper presents new theoretical and applied developments to the problem of non-negative matrix factorization (NMF). First, we generalize the deterministic NMF algorithm to include a general class of update rules that converges towards an optimal non-negative factorization. Second, we extend the NMF framework to the probabilistic case (PNMF). We show that the Maximum a posteriori (MAP) estimate of the non-negative factors is the solution to a weighted regularized non-negative matrix factorization problem. We subsequently derive update rules that converge towards an optimal solution. Third, we apply the PNMF to cluster and classify DNA microarrays data. The proposed PNMF is shown to outperform the deterministic NMF and the sparse NMF algorithms in clustering stability and classification accuracy.

Mesh:

Year:  2014        PMID: 24467759     DOI: 10.1142/S0219720014500012

Source DB:  PubMed          Journal:  J Bioinform Comput Biol        ISSN: 0219-7200            Impact factor:   1.122


  2 in total

1.  High dimensionality reduction by matrix factorization for systems pharmacology.

Authors:  Adel Mehrpooya; Farid Saberi-Movahed; Najmeh Azizizadeh; Mohammad Rezaei-Ravari; Farshad Saberi-Movahed; Mahdi Eftekhari; Iman Tavassoly
Journal:  Brief Bioinform       Date:  2022-01-17       Impact factor: 11.622

2.  A sequential Monte Carlo approach to gene expression deconvolution.

Authors:  Oyetunji E Ogundijo; Xiaodong Wang
Journal:  PLoS One       Date:  2017-10-19       Impact factor: 3.240

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.