Delbert Dueck1, Quaid D Morris, Brendan J Frey. 1. Department of Electrical and Computer Engineering, University of Toronto Toronto, Ontario, Canada M5S 3G4. delbert@psi.toronto.edu
Abstract
MOTIVATION: We address the problem of multi-way clustering of microarray data using a generative model. Our algorithm, probabilistic sparse matrix factorization (PSMF), is a probabilistic extension of a previous hard-decision algorithm for this problem. PSMF allows for varying levels of sensor noise in the data, uncertainty in the hidden prototypes used to explain the data and uncertainty as to the prototypes selected to explain each data vector. RESULTS: We present experimental results demonstrating that our method can better recover functionally-relevant clusterings in mRNA expression data than standard clustering techniques, including hierarchical agglomerative clustering, and we show that by computing probabilities instead of point estimates, our method avoids converging to poor solutions.
MOTIVATION: We address the problem of multi-way clustering of microarray data using a generative model. Our algorithm, probabilistic sparse matrix factorization (PSMF), is a probabilistic extension of a previous hard-decision algorithm for this problem. PSMF allows for varying levels of sensor noise in the data, uncertainty in the hidden prototypes used to explain the data and uncertainty as to the prototypes selected to explain each data vector. RESULTS: We present experimental results demonstrating that our method can better recover functionally-relevant clusterings in mRNA expression data than standard clustering techniques, including hierarchical agglomerative clustering, and we show that by computing probabilities instead of point estimates, our method avoids converging to poor solutions.
Authors: Pablo Tamayo; Daniel Scanfeld; Benjamin L Ebert; Michael A Gillette; Charles W M Roberts; Jill P Mesirov Journal: Proc Natl Acad Sci U S A Date: 2007-03-27 Impact factor: 11.205
Authors: Cécile Bazot; Nicolas Dobigeon; Jean-Yves Tourneret; Aimee K Zaas; Geoffrey S Ginsburg; Alfred O Hero Journal: BMC Bioinformatics Date: 2013-03-19 Impact factor: 3.169