Literature DB >> 35455193

Preference-Driven Classification Measure.

Jan Kozak1, Barbara Probierz1, Krzysztof Kania2, Przemysław Juszczuk1.   

Abstract

Classification is one of the main problems of machine learning, and assessing the quality of classification is one of the most topical tasks, all the more difficult as it depends on many factors. Many different measures have been proposed to assess the quality of the classification, often depending on the application of a specific classifier. However, in most cases, these measures are focused on binary classification, and for the problem of many decision classes, they are significantly simplified. Due to the increasing scope of classification applications, there is a growing need to select a classifier appropriate to the situation, including more complex data sets with multiple decision classes. This paper aims to propose a new measure of classifier quality assessment (called the preference-driven measure, abbreviated p-d), regardless of the number of classes, with the possibility of establishing the relative importance of each class. Furthermore, we propose a solution in which the classifier's assessment can be adapted to the analyzed problem using a vector of preferences. To visualize the operation of the proposed measure, we present it first on an example involving two decision classes and then test its operation on real, multi-class data sets. Additionally, in this case, we demonstrate how to adjust the assessment to the user's preferences. The results obtained allow us to confirm that the use of a preference-driven measure indicates that other classifiers are better to use according to preferences, particularly as opposed to the classical measures of classification quality assessment.

Entities:  

Keywords:  classification measure; machine learning; preference-driven classification; quality measure; quality of classification

Year:  2022        PMID: 35455193      PMCID: PMC9032202          DOI: 10.3390/e24040531

Source DB:  PubMed          Journal:  Entropy (Basel)        ISSN: 1099-4300            Impact factor:   2.738


  11 in total

1.  Comparison of the predicted and observed secondary structure of T4 phage lysozyme.

Authors:  B W Matthews
Journal:  Biochim Biophys Acta       Date:  1975-10-20

2.  Comparing two K-category assignments by a K-category correlation coefficient.

Authors:  J Gorodkin
Journal:  Comput Biol Chem       Date:  2004-12       Impact factor: 2.877

3.  H-Accuracy, an Alternative Metric to Assess Classification Models in Medicine.

Authors:  Andrea Campagner; Luca Sconfienza; Federico Cabitza
Journal:  Stud Health Technol Inform       Date:  2020-06-16

4.  The precision-recall plot is more informative than the ROC plot when evaluating binary classifiers on imbalanced datasets.

Authors:  Takaya Saito; Marc Rehmsmeier
Journal:  PLoS One       Date:  2015-03-04       Impact factor: 3.240

5.  Confidence interval for micro-averaged F 1 and macro-averaged F 1 scores.

Authors:  Kanae Takahashi; Kouji Yamamoto; Aya Kuchiba; Tatsuki Koyama
Journal:  Appl Intell (Dordr)       Date:  2021-07-31       Impact factor: 5.086

6.  Understanding and using sensitivity, specificity and predictive values.

Authors:  Rajul Parikh; Annie Mathai; Shefali Parikh; G Chandra Sekhar; Ravi Thomas
Journal:  Indian J Ophthalmol       Date:  2008 Jan-Feb       Impact factor: 1.848

7.  A comparison of MCC and CEN error measures in multi-class prediction.

Authors:  Giuseppe Jurman; Samantha Riccadonna; Cesare Furlanello
Journal:  PLoS One       Date:  2012-08-08       Impact factor: 3.240

8.  100% classification accuracy considered harmful: the normalized information transfer factor explains the accuracy paradox.

Authors:  Francisco J Valverde-Albacete; Carmen Peláez-Moreno
Journal:  PLoS One       Date:  2014-01-10       Impact factor: 3.240

Review 9.  A comprehensive survey of error measures for evaluating binary decision making in data science.

Authors:  Frank Emmert-Streib; Salisou Moutari; Matthias Dehmer
Journal:  Wiley Interdiscip Rev Data Min Knowl Discov       Date:  2019-02-08

10.  The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation.

Authors:  Davide Chicco; Giuseppe Jurman
Journal:  BMC Genomics       Date:  2020-01-02       Impact factor: 3.969

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.