| Literature DB >> 27428058 |
Dong Wang1, Jin-Xing Liu1,2, Ying-Lian Gao3, Jiguo Yu1, Chun-Hou Zheng1, Yong Xu2.
Abstract
Recent research has demonstrated that characteristic gene selection based on gene expression data remains faced with considerable challenges. This is primarily because gene expression data are typically high dimensional, negative, non-sparse and noisy. However, existing methods for data analysis are able to cope with only some of these challenges. In this paper, we address all of these challenges with a unified method: nonnegative matrix factorization via the L2,1-norm (NMF-L2,1). While L2,1-norm minimization is applied to both the error function and the regularization term, our method is robust to outliers and noise in the data and generates sparse results. The application of our method to plant and tumor gene expression data demonstrates that NMF-L2,1 can extract more characteristic genes than other existing state-of-the-art methods.Entities:
Mesh:
Year: 2016 PMID: 27428058 PMCID: PMC4948826 DOI: 10.1371/journal.pone.0158494
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240