Literature DB >> 27279668

Quantile-based classifiers.

C Hennig1, C Viroli2.   

Abstract

Classification with small samples of high-dimensional data is important in many application areas. Quantile classifiers are distance-based classifiers that require a single parameter, regardless of the dimension, and classify observations according to a sum of weighted componentwise distances of the components of an observation to the within-class quantiles. An optimal percentage for the quantiles can be chosen by minimizing the misclassification error in the training sample. It is shown that this choice is consistent for the classification rule with the asymptotically optimal quantile and that under some assumptions, as the number of variables goes to infinity, the probability of correct classification converges to unity. The effect of skewness of the distributions of the predictor variables is discussed. The optimal quantile classifier gives low misclassification rates in a comprehensive simulation study and in a real-data application.

Keywords:  High-dimensional data; Median-based classifier; Misclassification rate; Skewness

Year:  2016        PMID: 27279668     DOI: 10.1093/biomet/asw015

Source DB:  PubMed          Journal:  Biometrika        ISSN: 0006-3444            Impact factor:   2.445


  1 in total

1.  On rank distribution classifiers for high-dimensional data.

Authors:  Olusola Samuel Makinde
Journal:  J Appl Stat       Date:  2020-05-20       Impact factor: 1.416

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.