Literature DB >> 24259520

Learning classification models with soft-label information.

Quang Nguyen1, Hamed Valizadegan, Milos Hauskrecht.   

Abstract

OBJECTIVE: Learning of classification models in medicine often relies on data labeled by a human expert. Since labeling of clinical data may be time-consuming, finding ways of alleviating the labeling costs is critical for our ability to automatically learn such models. In this paper we propose a new machine learning approach that is able to learn improved binary classification models more efficiently by refining the binary class information in the training phase with soft labels that reflect how strongly the human expert feels about the original class labels.
MATERIALS AND METHODS: Two types of methods that can learn improved binary classification models from soft labels are proposed. The first relies on probabilistic/numeric labels, the other on ordinal categorical labels. We study and demonstrate the benefits of these methods for learning an alerting model for heparin induced thrombocytopenia. The experiments are conducted on the data of 377 patient instances labeled by three different human experts. The methods are compared using the area under the receiver operating characteristic curve (AUC) score.
RESULTS: Our AUC results show that the new approach is capable of learning classification models more efficiently compared to traditional learning methods. The improvement in AUC is most remarkable when the number of examples we learn from is small.
CONCLUSIONS: A new classification learning framework that lets us learn from auxiliary soft-label information provided by a human expert is a promising new direction for learning classification models from expert labels, reducing the time and cost needed to label data.

Entities:  

Keywords:  data labeling by human experts; machine learning; soft-label information

Mesh:

Year:  2013        PMID: 24259520      PMCID: PMC3994863          DOI: 10.1136/amiajnl-2013-001964

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  12 in total

1.  A Pattern Mining Approach for Classifying Multivariate Temporal Data.

Authors:  Iyad Batal; Hamed Valizadegan; Gregory F Cooper; Milos Hauskrecht
Journal:  Proceedings (IEEE Int Conf Bioinformatics Biomed)       Date:  2011-11-12

2.  Mining Recent Temporal Patterns for Event Detection in Multivariate Time Series Data.

Authors:  Iyad Batal; Dmitriy Fradkin; James Harrison; Fabian Moerchen; Milos Hauskrecht
Journal:  KDD       Date:  2012

3.  An evaluation of clinicians' subjective prior probability estimates.

Authors:  J G Dolan; D R Bordley; A I Mushlin
Journal:  Med Decis Making       Date:  1986 Oct-Dec       Impact factor: 2.583

4.  The accuracy of experienced physicians' probability estimates for patients with sore throats. Implications for decision making.

Authors:  R M Poses; R D Cebul; M Collins; S S Fager
Journal:  JAMA       Date:  1985-08-16       Impact factor: 56.272

5.  The meaning and use of the area under a receiver operating characteristic (ROC) curve.

Authors:  J A Hanley; B J McNeil
Journal:  Radiology       Date:  1982-04       Impact factor: 11.105

6.  Feature importance analysis for patient management decisions.

Authors:  Michal Valko; Milos Hauskrecht
Journal:  Stud Health Technol Inform       Date:  2010

7.  Impact of the patient population on the risk for heparin-induced thrombocytopenia.

Authors:  T E Warkentin; J A Sheppard; P Horsewood; P J Simpson; J C Moore; J G Kelton
Journal:  Blood       Date:  2000-09-01       Impact factor: 22.113

8.  Physicians' estimates of the probability of myocardial infarction in emergency room patients with chest pain.

Authors:  W M Tierney; J Fitzgerald; R McHenry; B J Roth; B Psaty; D L Stump; F K Anderson
Journal:  Med Decis Making       Date:  1986 Jan-Mar       Impact factor: 2.583

9.  Modeling treatment of ischemic heart disease with partially observable Markov decision processes.

Authors:  M Hauskrecht; H Fraser
Journal:  Proc AMIA Symp       Date:  1998

10.  Outlier detection for patient monitoring and alerting.

Authors:  Milos Hauskrecht; Iyad Batal; Michal Valko; Shyam Visweswaran; Gregory F Cooper; Gilles Clermont
Journal:  J Biomed Inform       Date:  2012-08-27       Impact factor: 6.317

View more
  12 in total

1.  Hierarchical Active Learning with Proportion Feedback on Regions.

Authors:  Zhipeng Luo; Milos Hauskrecht
Journal:  Mach Learn Knowl Discov Databases       Date:  2019-01-23

2.  Active Learning of Multi-Class Classifiers with Auxiliary Probabilistic Information.

Authors:  Yanbing Xue; Milos Hauskrecht
Journal:  Proc Int Fla AI Res Soc Conf       Date:  2018-05

3.  Hierarchical Active Learning with Group Proportion Feedback.

Authors:  Zhipeng Luo; Milos Hauskrecht
Journal:  IJCAI (U S)       Date:  2018-07

4.  Clinical time series prediction: Toward a hierarchical dynamical system framework.

Authors:  Zitao Liu; Milos Hauskrecht
Journal:  Artif Intell Med       Date:  2014-11-06       Impact factor: 5.326

5.  Efficient Learning of Classification Models from Soft-label Information by Binning and Ranking.

Authors:  Yanbing Xue; Milos Hauskrecht
Journal:  Proc Int Fla AI Res Soc Conf       Date:  2017-05

6.  Sparse Multidimensional Patient Modeling using Auxiliary Confidence Labels.

Authors:  Eric Heim; Milos Hauskrecht
Journal:  Proceedings (IEEE Int Conf Bioinformatics Biomed)       Date:  2015-11

7.  Active Learning of Classification Models with Likert-Scale Feedback.

Authors:  Yanbing Xue; Milos Hauskrecht
Journal:  Proc SIAM Int Conf Data Min       Date:  2017

8.  Active Learning of Multi-class Classification Models from Ordered Class Sets.

Authors:  Yanbing Xue; Milos Hauskrecht
Journal:  Proc Conf AAAI Artif Intell       Date:  2019-07-17

9.  Modeling multivariate clinical event time-series with recurrent temporal mechanisms.

Authors:  Jeong Min Lee; Milos Hauskrecht
Journal:  Artif Intell Med       Date:  2021-01-18       Impact factor: 5.326

10.  Quantifying Parkinson's disease motor severity under uncertainty using MDS-UPDRS videos.

Authors:  Mandy Lu; Qingyu Zhao; Kathleen L Poston; Edith V Sullivan; Adolf Pfefferbaum; Marian Shahid; Maya Katz; Leila Montaser Kouhsari; Kevin Schulman; Arnold Milstein; Juan Carlos Niebles; Victor W Henderson; Li Fei-Fei; Kilian M Pohl; Ehsan Adeli
Journal:  Med Image Anal       Date:  2021-07-21       Impact factor: 13.828

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.