| Literature DB >> 24511361 |
Genevera I Allen1, Christine Peterson2, Marina Vannucci2, Mirjana Maletić-Savatić3.
Abstract
High-dimensional data common in genomics, proteomics, and chemometrics often contains complicated correlation structures. Recently, partial least squares (PLS) and Sparse PLS methods have gained attention in these areas as dimension reduction techniques in the context of supervised data analysis. We introduce a framework for Regularized PLS by solving a relaxation of the SIMPLS optimization problem with penalties on the PLS loadings vectors. Our approach enjoys many advantages including flexibility, general penalties, easy interpretation of results, and fast computation in high-dimensional settings. We also outline extensions of our methods leading to novel methods for non-negative PLS and generalized PLS, an adoption of PLS for structured data. We demonstrate the utility of our methods through simulations and a case study on proton Nuclear Magnetic Resonance (NMR) spectroscopy data.Entities:
Keywords: NMR spectroscopy; generalized PCA; generalized PLS; non-negative PLS; sparse PCA; sparse PLS
Year: 2013 PMID: 24511361 PMCID: PMC3914316 DOI: 10.1002/sam.11169
Source DB: PubMed Journal: Stat Anal Data Min ISSN: 1932-1864 Impact factor: 1.051