Literature DB >> 26023687

Optimal Thresholding of Classifiers to Maximize F1 Measure.

Zachary C Lipton1, Charles Elkan2, Balakrishnan Naryanaswamy3.   

Abstract

This paper provides new insight into maximizing F1 measures in the context of binary classification and also in the context of multilabel classification. The harmonic mean of precision and recall, the F1 measure is widely used to evaluate the success of a binary classifier when one class is rare. Micro average, macro average, and per instance average F1 measures are used in multilabel classification. For any classifier that produces a real-valued output, we derive the relationship between the best achievable F1 value and the decision-making threshold that achieves this optimum. As a special case, if the classifier outputs are well-calibrated conditional probabilities, then the optimal threshold is half the optimal F1 value. As another special case, if the classifier is completely uninformative, then the optimal behavior is to classify all examples as positive. When the actual prevalence of positive examples is low, this behavior can be undesirable. As a case study, we discuss the results, which can be surprising, of maximizing F1 when predicting 26,853 labels for Medline documents.

Entities:  

Keywords:  F score; F1 measure; binary classification; evaluation methodology; multilabel learning; supervised learning; text classification

Year:  2014        PMID: 26023687      PMCID: PMC4442797          DOI: 10.1007/978-3-662-44851-9_15

Source DB:  PubMed          Journal:  Mach Learn Knowl Discov Databases


  1 in total

1.  Predicting accurate probabilities with a ranking loss.

Authors:  Aditya Krishna Menon; Xiaoqian J Jiang; Shankar Vembu; Charles Elkan; Lucila Ohno-Machado
Journal:  Proc Int Conf Mach Learn       Date:  2012
  1 in total
  34 in total

1.  Understanding the learned behavior of customized convolutional neural networks toward malaria parasite detection in thin blood smear images.

Authors:  Sivaramakrishnan Rajaraman; Kamolrat Silamut; Md A Hossain; I Ersoy; Richard J Maude; Stefan Jaeger; George R Thoma; Sameer K Antani
Journal:  J Med Imaging (Bellingham)       Date:  2018-07-18

Review 2.  Artificial Intelligence for Mental Health and Mental Illnesses: an Overview.

Authors:  Sarah Graham; Colin Depp; Ellen E Lee; Camille Nebeker; Xin Tu; Ho-Cheol Kim; Dilip V Jeste
Journal:  Curr Psychiatry Rep       Date:  2019-11-07       Impact factor: 5.285

3.  Machine Learning for Supplementing Behavioral Assessment.

Authors:  Jordan D Bailey; Jonathan C Baker; Mark J Rzeszutek; Marc J Lanovaz
Journal:  Perspect Behav Sci       Date:  2021-01-09

4.  Natural Language Processing to Ascertain Cancer Outcomes From Medical Oncologist Notes.

Authors:  Kenneth L Kehl; Wenxin Xu; Eva Lepisto; Haitham Elmarakeby; Michael J Hassett; Eliezer M Van Allen; Bruce E Johnson; Deborah Schrag
Journal:  JCO Clin Cancer Inform       Date:  2020-08

5.  Optimizing Nondecomposable Data Dependent Regularizers via Lagrangian Reparameterization Offers Significant Performance and Efficiency Gains.

Authors:  Sathya N Ravi; Abhay Venkatesh; Glenn M Fung; Vikas Singh
Journal:  Proc Conf AAAI Artif Intell       Date:  2020-06-16

6.  Integrating Natural Language Processing and Machine Learning Algorithms to Categorize Oncologic Response in Radiology Reports.

Authors:  Po-Hao Chen; Hanna Zafar; Maya Galperin-Aizenberg; Tessa Cook
Journal:  J Digit Imaging       Date:  2018-04       Impact factor: 4.056

7.  Sensorimotor Peak Alpha Frequency Is a Reliable Biomarker of Prolonged Pain Sensitivity.

Authors:  Andrew J Furman; Mariya Prokhorenko; Michael L Keaser; Jing Zhang; Shuo Chen; Ali Mazaheri; David A Seminowicz
Journal:  Cereb Cortex       Date:  2020-11-03       Impact factor: 5.357

8.  Thesaurus-based word embeddings for automated biomedical literature classification.

Authors:  Dimitrios A Koutsomitropoulos; Andreas D Andriopoulos
Journal:  Neural Comput Appl       Date:  2021-05-11       Impact factor: 5.606

9.  A novel framework for designing a multi-DoF prosthetic wrist control using machine learning.

Authors:  Chinmay P Swami; Nicholas Lenhard; Jiyeon Kang
Journal:  Sci Rep       Date:  2021-07-22       Impact factor: 4.379

10.  Computerized Assessment of Psychosis Risk.

Authors:  Vijay A Mittal; Lauren M Ellman; Gregory P Strauss; Elaine F Walker; Philip R Corlett; Jason Schiffman; Scott W Woods; Albert R Powers; Steven M Silverstein; James A Waltz; Richard Zinbarg; Shuo Chen; Trevor Williams; Joshua Kenney; James M Gold
Journal:  J Psychiatr Brain Sci       Date:  2021-06-29
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.