Yuan Gao1, George Church. 1. Department of Genetics, Harvard Medical School Boston, MA 02115, USA. g1m1c1@receptor.med.harvard.edu
Abstract
MOTIVATION: Identifying different cancer classes or subclasses with similar morphological appearances presents a challenging problem and has important implication in cancer diagnosis and treatment. Clustering based on gene-expression data has been shown to be a powerful method in cancer class discovery. Non-negative matrix factorization is one such method and was shown to be advantageous over other clustering techniques, such as hierarchical clustering or self-organizing maps. In this paper, we investigate the benefit of explicitly enforcing sparseness in the factorization process. RESULTS: We report an improved unsupervised method for cancer classification by the use of gene-expression profile via sparse non-negative matrix factorization. We demonstrate the improvement by direct comparison with classic non-negative matrix factorization on the three well-studied datasets. In addition, we illustrate how to identify a small subset of co-expressed genes that may be directly involved in cancer.
MOTIVATION: Identifying different cancer classes or subclasses with similar morphological appearances presents a challenging problem and has important implication in cancer diagnosis and treatment. Clustering based on gene-expression data has been shown to be a powerful method in cancer class discovery. Non-negative matrix factorization is one such method and was shown to be advantageous over other clustering techniques, such as hierarchical clustering or self-organizing maps. In this paper, we investigate the benefit of explicitly enforcing sparseness in the factorization process. RESULTS: We report an improved unsupervised method for cancer classification by the use of gene-expression profile via sparse non-negative matrix factorization. We demonstrate the improvement by direct comparison with classic non-negative matrix factorization on the three well-studied datasets. In addition, we illustrate how to identify a small subset of co-expressed genes that may be directly involved in cancer.
Authors: Giulio Caravagna; Alex Graudenzi; Daniele Ramazzotti; Rebeca Sanz-Pamplona; Luca De Sano; Giancarlo Mauri; Victor Moreno; Marco Antoniotti; Bud Mishra Journal: Proc Natl Acad Sci U S A Date: 2016-06-28 Impact factor: 11.205
Authors: Yan Feng; Timothy J Mitchison; Andreas Bender; Daniel W Young; John A Tallarico Journal: Nat Rev Drug Discov Date: 2009-07 Impact factor: 84.694
Authors: Jerry L Prince; Maureen Stone; Arnold D Gomez; Jordan R Green; Christopher J Hartnick; Thomas J Brady; Timothy G Reese; Van J Wedeen; Georges El Fakhri Journal: IEEE Trans Med Imaging Date: 2018-09-18 Impact factor: 10.048