
Classifier uncertainty: evidence, potential impact, and probabilistic treatment.

Niklas Tötsch, Daniel Hoffmann.

Abstract

Classifiers are often tested on relatively small data sets, which should lead to uncertain performance metrics. Nevertheless, these metrics are usually taken at face value. We present an approach to quantify the uncertainty of classification performance metrics, based on a probability model of the confusion matrix. Application of our approach to classifiers from the scientific literature and a classification competition shows that uncertainties can be surprisingly large and limit performance evaluation. In fact, some published classifiers may be misleading. The application of our approach is simple and requires only the confusion matrix. It is agnostic of the underlying classifier. Our method can also be used for the estimation of sample sizes that achieve a desired precision of a performance metric.
© 2021 Tötsch and Hoffmann.
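
The idea of attaching uncertainty to a metric computed from a confusion matrix can be illustrated with a minimal sketch. It assumes a Dirichlet-multinomial model with a uniform prior over the four cells, which is one common choice and not necessarily the exact probability model used by the authors. The function names (`sample_dirichlet`, `metric_posterior`, `credible_interval`) and the example counts are hypothetical.

```python
import random


def sample_dirichlet(alphas, rng):
    """Draw one sample from a Dirichlet distribution via normalized gamma draws."""
    draws = [rng.gammavariate(a, 1.0) for a in alphas]
    total = sum(draws)
    return [d / total for d in draws]


def metric_posterior(tp, fn, tn, fp, n_samples=10000, prior=1.0, seed=0):
    """Posterior samples of accuracy, TPR and TNR given confusion-matrix counts.

    Assumption: the four cell probabilities follow a Dirichlet posterior
    with a symmetric prior (prior=1.0 is uniform). This is an illustrative
    model, not a reproduction of the paper's method.
    """
    rng = random.Random(seed)
    alphas = [tp + prior, fn + prior, tn + prior, fp + prior]
    samples = {"accuracy": [], "tpr": [], "tnr": []}
    for _ in range(n_samples):
        p_tp, p_fn, p_tn, p_fp = sample_dirichlet(alphas, rng)
        samples["accuracy"].append(p_tp + p_tn)
        samples["tpr"].append(p_tp / (p_tp + p_fn))
        samples["tnr"].append(p_tn / (p_tn + p_fp))
    return samples


def credible_interval(values, mass=0.95):
    """Equal-tailed credible interval taken from sorted posterior samples."""
    ordered = sorted(values)
    lower = int((1.0 - mass) / 2.0 * len(ordered))
    upper = int((1.0 + mass) / 2.0 * len(ordered)) - 1
    return ordered[lower], ordered[upper]


if __name__ == "__main__":
    # Hypothetical test set of 100 samples: 45 TP, 5 FN, 40 TN, 10 FP.
    posterior = metric_posterior(45, 5, 40, 10)
    for name, values in posterior.items():
        low, high = credible_interval(values)
        print(f"{name}: 95% credible interval [{low:.3f}, {high:.3f}]")
```

Only the four counts are needed, so the sketch is independent of the classifier that produced them. Rerunning it with the same proportions but larger counts shows how the interval narrows with test-set size, which is the basis for a rough sample-size estimate.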


Keywords:  Bayesian modeling; Classification; Machine learning; Reproducibility; Statistics; Uncertainty

Year:  2021        PMID: 33817044      PMCID: PMC7959610          DOI: 10.7717/peerj-cs.398

Source DB:  PubMed          Journal:  PeerJ Comput Sci        ISSN: 2376-5992


