Literature DB >> 33816847

The k conditional nearest neighbor algorithm for classification and class probability estimation.

Hyukjun Gweon1, Matthias Schonlau2, Stefan H Steiner2.   

Abstract

The k nearest neighbor (kNN) approach is a simple and effective nonparametric algorithm for classification. One of the drawbacks of kNN is that the method can only give coarse estimates of class probabilities, particularly for low values of k. To avoid this drawback, we propose a new nonparametric classification method based on nearest neighbors conditional on each class: the proposed approach calculates the distance between a new instance and the kth nearest neighbor from each class, estimates posterior probabilities of class memberships using the distances, and assigns the instance to the class with the largest posterior. We prove that the proposed approach converges to the Bayes classifier as the size of the training data increases. Further, we extend the proposed approach to an ensemble method. Experiments on benchmark data sets show that both the proposed approach and the ensemble version of the proposed approach on average outperform kNN, weighted kNN, probabilistic kNN and two similar algorithms (LMkNN and MLM-kHNN) in terms of the error rate. A simulation shows that kCNN may be useful for estimating posterior probabilities when the class distributions overlap. ©2019 Gweon et al.

Entities:  

Keywords:  Nearest neighbor; Nonparametric classification; Probabilistic classifier

Year:  2019        PMID: 33816847      PMCID: PMC7924495          DOI: 10.7717/peerj-cs.194

Source DB:  PubMed          Journal:  PeerJ Comput Sci        ISSN: 2376-5992


  2 in total

1.  Patient activation and advocacy: which literacy skills matter most?

Authors:  Laurie T Martin; Matthias Schonlau; Ann Haas; Kathryn Pitkin Derose; Lindsay Rosenfeld; Stephen L Buka; Rima Rudd
Journal:  J Health Commun       Date:  2011

2.  Distance-based image classification: generalizing to new classes at near-zero cost.

Authors:  Thomas Mensink; Jakob Verbeek; Florent Perronnin; Gabriela Csurka
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2013-11       Impact factor: 6.226

  2 in total
  2 in total

1.  Optimal 1-NN prototypes for pathological geometries.

Authors:  Ilia Sucholutsky; Matthias Schonlau
Journal:  PeerJ Comput Sci       Date:  2021-04-09

2.  Swarm intelligence-based model for improving prediction performance of low-expectation teams in educational software engineering projects.

Authors:  Bilal I Al-Ahmad; Ala' A Al-Zoubi; Md Faisal Kabir; Marwan Al-Tawil; Ibrahim Aljarah
Journal:  PeerJ Comput Sci       Date:  2022-01-19
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.