| Literature DB >> 35707509 |
Kuangnan Fang1,2, Peng Wang1, Xiaochen Zhang1, Qingzhao Zhang1,2,3.
Abstract
In the application of high-dimensional data classification, several attempts have been made to achieve variable selection by replacing the ℓ 2 -penalty with other penalties for the support vector machine (SVM). However, these high-dimensional SVM methods usually do not take into account the special structure among covariates (features). In this article, we consider a classification problem, where the covariates are ordered in some meaningful way, and the number of covariates p can be much larger than the sample size n. We propose a structured sparse SVM to tackle this type of problems, which combines the non-convex penalty and cubic spline estimation procedure (i.e. penalizing second-order derivatives of the coefficients) to the SVM. From a theoretical point of view, the proposed method satisfies the local oracle property. Simulations show that the method works effectively both in feature selection and classification accuracy. A real application is conducted to illustrate the benefits of the method.Entities:
Keywords: Structured sparse; local oracle property; support vector machine; variable selection
Year: 2020 PMID: 35707509 PMCID: PMC9041777 DOI: 10.1080/02664763.2020.1849053
Source DB: PubMed Journal: J Appl Stat ISSN: 0266-4763 Impact factor: 1.416