Literature DB >> 33816895

Nearest labelset using double distances for multi-label classification.

Hyukjun Gweon1, Matthias Schonlau2, Stefan H Steiner2.   

Abstract

Multi-label classification is a type of supervised learning where an instance may belong to multiple labels simultaneously. Predicting each label independently has been criticized for not exploiting any correlation between labels. In this article we propose a novel approach, Nearest Labelset using Double Distances (NLDD), that predicts the labelset observed in the training data that minimizes a weighted sum of the distances in both the feature space and the label space to the new instance. The weights specify the relative tradeoff between the two distances. The weights are estimated from a binomial regression of the number of misclassified labels as a function of the two distances. Model parameters are estimated by maximum likelihood. NLDD only considers labelsets observed in the training data, thus implicitly taking into account label dependencies. Experiments on benchmark multi-label data sets show that the proposed method on average outperforms other well-known approaches in terms of 0/1 loss, and multi-label accuracy and ranks second on the F-measure (after a method called ECC) and on Hamming loss (after a method called RF-PCT). ©2019 Gweon et al.

Entities:  

Keywords:  Label correlations; Nearest neighbor; Multi-label classification

Year:  2019        PMID: 33816895      PMCID: PMC7924696          DOI: 10.7717/peerj-cs.242

Source DB:  PubMed          Journal:  PeerJ Comput Sci        ISSN: 2376-5992


  1 in total

1.  Multilabel classification with principal label space transformation.

Authors:  Farbound Tai; Hsuan-Tien Lin
Journal:  Neural Comput       Date:  2012-05-17       Impact factor: 2.026

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.