
Classification-algorithm evaluation: five performance measures based on confusion matrices.

A D Forbes

Abstract

OBJECTIVE: The objective of this paper is to introduce, explain, and extend methods for comparing the performance of classification algorithms using error tallies obtained on properly sized, populated, and labeled data sets.
METHODS: Two distinct contexts of classification are defined, involving "objects-by-inspection" and "objects-by-segmentation." In the former context, the total number of objects to be classified is unambiguously and self-evidently defined. In the latter, there is troublesome ambiguity. All five of the measures of performance here considered are based on confusion matrices, tables of counts revealing the extent of an algorithm's "confusion" regarding the true classifications. A proper measure of classification-algorithm performance must meet four requirements. A proper measure should obey six additional constraints.
RESULTS: Four traditional measures of performance are critiqued in terms of the requirements and constraints. Each measure meets the requirements, but fails to obey at least one of the constraints. A nontraditional measure of algorithm performance, the normalized mutual information (NMI), is therefore introduced. Based on the NMI, methods for comparing algorithm performance using confusion matrices are devised.
CONCLUSIONS: The five performance measures lead to similar inferences when comparing a trio of QRS-detection algorithms using a large data set. The modified NMI is preferred, however, because it obeys each of the constraints and is the most conservative measure of performance.
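As a rough illustration of how a confusion matrix can be reduced to a single information-based score, the sketch below computes a normalized mutual information in plain Python. It assumes one common normalization, the mutual information between true and assigned classes divided by the entropy of the true classes. The abstract does not state the paper's exact formula or its "modified" variant, so the function name, the row/column convention, and the normalization are all assumptions made for illustration.

```python
import math


def normalized_mutual_information(confusion):
    """Return I(true; assigned) / H(true) for a confusion matrix.

    Assumed convention: confusion[i][j] is the number of objects whose
    true class is i and which the algorithm assigned to class j.
    This normalization is an assumption, not taken from the abstract.
    """
    total = sum(sum(row) for row in confusion)
    if total == 0:
        raise ValueError("confusion matrix is empty")

    row_sums = [sum(row) for row in confusion]        # true-class totals
    col_sums = [sum(col) for col in zip(*confusion)]  # assigned-class totals

    # Entropy of the true-class distribution (natural log; the base cancels).
    h_true = -sum((r / total) * math.log(r / total) for r in row_sums if r > 0)
    if h_true == 0:
        raise ValueError("only one true class is present; the ratio is undefined")

    # Mutual information between true and assigned labels.
    mi = 0.0
    for i, row in enumerate(confusion):
        for j, n in enumerate(row):
            if n > 0:
                mi += (n / total) * math.log(n * total / (row_sums[i] * col_sums[j]))
    return mi / h_true


def accuracy(confusion):
    """Fraction of objects on the diagonal (a traditional measure)."""
    total = sum(sum(row) for row in confusion)
    return sum(confusion[i][i] for i in range(len(confusion))) / total


perfect = [[50, 0], [0, 50]]          # every object classified correctly
uninformative = [[25, 25], [25, 25]]  # assignments independent of the truth
always_first = [[90, 0], [10, 0]]     # everything assigned to class 0

for name, matrix in [("perfect", perfect),
                     ("uninformative", uninformative),
                     ("always_first", always_first)]:
    print(name, accuracy(matrix), normalized_mutual_information(matrix))
```

Under these assumptions, the `always_first` matrix shows why an information-based score can be more conservative than a simple tally: an algorithm that assigns every object to the majority class is right 90% of the time, yet its assignments carry no information about the true class.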

Year: 1995; PMID: 7623060; DOI: 10.1007/bf01617722

Source DB: PubMed; Journal: J Clin Monit; ISSN: 0748-1977



