Literature DB >> 24350304

Unachievable Region in Precision-Recall Space and Its Effect on Empirical Evaluation.

Kendrick Boyd1, Vítor Santos Costa2, Jesse Davis3, C David Page1.   

Abstract

Precision-recall (PR) curves and the areas under them are widely used to summarize machine learning results, especially for data sets exhibiting class skew. They are often used analogously to ROC curves and the area under ROC curves. It is known that PR curves vary as class skew changes. What was not recognized before this paper is that there is a region of PR space that is completely unachievable, and the size of this region depends only on the skew. This paper precisely characterizes the size of that region and discusses its implications for empirical evaluation methodology in machine learning.

Entities:  

Year:  2012        PMID: 24350304      PMCID: PMC3858955     

Source DB:  PubMed          Journal:  Proc Int Conf Mach Learn


  2 in total

1.  SVMs modeling for highly imbalanced classification.

Authors:  Yuchun Tang; Yan-Qing Zhang; Nitesh V Chawla; Sven Krasser
Journal:  IEEE Trans Syst Man Cybern B Cybern       Date:  2008-12-09

2.  ARTS: accurate recognition of transcription starts in human.

Authors:  Sören Sonnenburg; Alexander Zien; Gunnar Rätsch
Journal:  Bioinformatics       Date:  2006-07-15       Impact factor: 6.937

  2 in total
  14 in total

1.  Score As You Lift (SAYL): A Statistical Relational Learning Approach to Uplift Modeling.

Authors:  Houssam Nassif; Finn Kuusisto; Elizabeth S Burnside; David Page; Jude Shavlik; Vítor Santos Costa
Journal:  Mach Learn Knowl Discov Databases       Date:  2013

2.  Semantic Segmentation of White Matter in FDG-PET Using Generative Adversarial Network.

Authors:  Kyeong Taek Oh; Sangwon Lee; Haeun Lee; Mijin Yun; Sun K Yoo
Journal:  J Digit Imaging       Date:  2020-08       Impact factor: 4.056

3.  Doublet identification in single-cell sequencing data using scDblFinder.

Authors:  Pierre-Luc Germain; Aaron Lun; Carlos Garcia Meixide; Will Macnair; Mark D Robinson
Journal:  F1000Res       Date:  2021-09-28

4.  Semi-supervised Learning for the BioNLP Gene Regulation Network.

Authors:  Thomas Provoost; Marie-Francine Moens
Journal:  BMC Bioinformatics       Date:  2015-07-13       Impact factor: 3.169

5.  Identification of long non-coding transcripts with feature selection: a comparative study.

Authors:  Giovanna M M Ventola; Teresa M R Noviello; Salvatore D'Aniello; Antonietta Spagnuolo; Michele Ceccarelli; Luigi Cerulo
Journal:  BMC Bioinformatics       Date:  2017-03-23       Impact factor: 3.169

6.  Using Machine Learning Algorithms to Predict Risk for Development of Calciphylaxis in Patients with Chronic Kidney Disease.

Authors:  Ross S Kleiman; Eric R LaRose; Jonathan C Badger; David Page; Michael D Caldwell; James A Clay; Peggy L Peissig
Journal:  AMIA Jt Summits Transl Sci Proc       Date:  2018-05-18

7.  Analyzing Factors Associated with Fatal Road Crashes: A Machine Learning Approach.

Authors:  Ali J Ghandour; Huda Hammoud; Samar Al-Hajj
Journal:  Int J Environ Res Public Health       Date:  2020-06-09       Impact factor: 3.390

8.  Detection of long non-coding RNA homology, a comparative study on alignment and alignment-free metrics.

Authors:  Teresa M R Noviello; Antonella Di Liddo; Giovanna M Ventola; Antonietta Spagnuolo; Salvatore D'Aniello; Michele Ceccarelli; Luigi Cerulo
Journal:  BMC Bioinformatics       Date:  2018-11-06       Impact factor: 3.169

9.  Improving palliative care with deep learning.

Authors:  Anand Avati; Kenneth Jung; Stephanie Harman; Lance Downing; Andrew Ng; Nigam H Shah
Journal:  BMC Med Inform Decis Mak       Date:  2018-12-12       Impact factor: 2.796

10.  Area under precision-recall curves for weighted and unweighted data.

Authors:  Jens Keilwagen; Ivo Grosse; Jan Grau
Journal:  PLoS One       Date:  2014-03-20       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.