Literature DB >> 30294406

Double Sparsity Kernel Learning with Automatic Variable Selection and Data Extraction.

Jingxiang Chen1, Chong Zhang2, Michael R Kosorok1, Yufeng Liu3.   

Abstract

Learning in the Reproducing Kernel Hilbert Space (RKHS) has been widely used in many scientific disciplines. Because a RKHS can be very flexible, it is common to impose a regularization term in the optimization to prevent overfitting. Standard RKHS learning employs the squared norm penalty of the learning function. Despite its success, many challenges remain. In particular, one cannot directly use the squared norm penalty for variable selection or data extraction. Therefore, when there exists noise predictors, or the underlying function has a sparse representation in the dual space, the performance of standard RKHS learning can be suboptimal. In the literature, work has been proposed on how to perform variable selection in RKHS learning, and a data sparsity constraint was considered for data extraction. However, how to learn in a RKHS with both variable selection and data extraction simultaneously remains unclear. In this paper, we propose a unified RKHS learning method, namely, DOuble Sparsity Kernel (DOSK) learning, to overcome this challenge. An efficient algorithm is provided to solve the corresponding optimization problem. We prove that under certain conditions, our new method can asymptotically achieve variable selection consistency. Simulated and real data results demonstrate that DOSK is highly competitive among existing approaches for RKHS learning.

Entities:  

Keywords:  Data extraction; Kernel classification; Kernel regression; Reproducing kernel Hilbert space; Selection consistency; Variable selection

Year:  2018        PMID: 30294406      PMCID: PMC6168218          DOI: 10.4310/SII.2018.v11.n3.a1

Source DB:  PubMed          Journal:  Stat Interface        ISSN: 1938-7989            Impact factor:   0.582


  10 in total

1.  Expert system for predicting protein localization sites in gram-negative bacteria.

Authors:  K Nakai; M Kanehisa
Journal:  Proteins       Date:  1991

2.  Adaptive regularization using the entire solution surface.

Authors:  S Wu; X Shen; C J Geyer
Journal:  Biometrika       Date:  2009-09       Impact factor: 2.445

3.  Discussion of "Sure Independence Screening for Ultra-High Dimensional Feature Space.

Authors:  Hao Helen Zhang
Journal:  J R Stat Soc Series B Stat Methodol       Date:  2008-11       Impact factor: 4.488

4.  A Selective Overview of Variable Selection in High Dimensional Feature Space.

Authors:  Jianqing Fan; Jinchi Lv
Journal:  Stat Sin       Date:  2010-01       Impact factor: 1.261

5.  Multiple Response Regression for Gaussian Mixture Models with Known Labels.

Authors:  Wonyul Lee; Ying Du; Wei Sun; D Neil Hayes; Yufeng Liu
Journal:  Stat Anal Data Min       Date:  2012-12-01       Impact factor: 1.051

6.  Multicategory Large-Margin Unified Machines.

Authors:  Chong Zhang; Yufeng Liu
Journal:  J Mach Learn Res       Date:  2013-05-01       Impact factor: 3.654

7.  Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays.

Authors:  U Alon; N Barkai; D A Notterman; K Gish; S Ybarra; D Mack; A J Levine
Journal:  Proc Natl Acad Sci U S A       Date:  1999-06-08       Impact factor: 11.205

8.  Linear or Nonlinear? Automatic Structure Discovery for Partially Linear Models.

Authors:  Hao Helen Zhang; Guang Cheng; Yufeng Liu
Journal:  J Am Stat Assoc       Date:  2011-09-01       Impact factor: 5.033

9.  On Quantile Regression in Reproducing Kernel Hilbert Spaces with Data Sparsity Constraint.

Authors:  Chong Zhang; Yufeng Liu; Yichao Wu
Journal:  J Mach Learn Res       Date:  2016-04       Impact factor: 3.654

10.  One-step Sparse Estimates in Nonconcave Penalized Likelihood Models.

Authors:  Hui Zou; Runze Li
Journal:  Ann Stat       Date:  2008-08-01       Impact factor: 4.028

  10 in total
  1 in total

1.  Group-based local adaptive deep multiple kernel learning with lp norm.

Authors:  Shengbing Ren; Fa Liu; Weijia Zhou; Xian Feng; Chaudry Naeem Siddique
Journal:  PLoS One       Date:  2020-09-17       Impact factor: 3.240

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.