Literature DB >> 35757042

Detection of outliers in high-dimensional data using nu-support vector regression.

Abdullah Mohammed Rashid1, Habshah Midi1,2, Waleed Dhhan3,4, Jayanthi Arasan2.   

Abstract

Support Vector Regression (SVR) is gaining in popularity in the detection of outliers and classification problems in high-dimensional data (HDD) as this technique does not require the data to be of full rank. In real application, most of the data are of high dimensional. Classification of high-dimensional data is needed in applied sciences, in particular, as it is important to discriminate cancerous cells from non-cancerous cells. It is also imperative that outliers are identified before constructing a model on the relationship between the dependent and independent variables to avoid misleading interpretations about the fitting of a model. The standard SVR and the μ-ε-SVR are able to detect outliers; however, they are computationally expensive. The fixed parameters support vector regression (FP-ε-SVR) was put forward to remedy this issue. However, the FP-ε-SVR using ε-SVR is not very successful in identifying outliers. In this article, we propose an alternative method to detect outliers i.e. by employing nu-SVR. The merit of our proposed method is confirmed by three real examples and the Monte Carlo simulation. The results show that our proposed nu-SVR method is very successful in identifying outliers under a variety of situations, and with less computational running time.
© 2021 Informa UK Limited, trading as Taylor & Francis Group.

Entities:  

Keywords:  High-dimensional data; outliers; robustness; statistical learning theory; support vector regression

Year:  2021        PMID: 35757042      PMCID: PMC9225439          DOI: 10.1080/02664763.2021.1911965

Source DB:  PubMed          Journal:  J Appl Stat        ISSN: 0266-4763            Impact factor:   1.416


  4 in total

1.  New support vector algorithms

Authors: 
Journal:  Neural Comput       Date:  2000-05       Impact factor: 2.026

2.  Training nu-support vector regression: theory and algorithms.

Authors:  Chih-Chung Chang; Chih-Jen Lin
Journal:  Neural Comput       Date:  2002-08       Impact factor: 2.026

3.  Regression approaches for microarray data analysis.

Authors:  Mark R Segal; Kam D Dahlquist; Bruce R Conklin
Journal:  J Comput Biol       Date:  2003       Impact factor: 1.479

4.  Systematic artifacts in support vector regression-based compound potency prediction revealed by statistical and activity landscape analysis.

Authors:  Jenny Balfer; Jürgen Bajorath
Journal:  PLoS One       Date:  2015-03-05       Impact factor: 3.240

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.