Literature DB >> 26949568

Sparse Multidimensional Patient Modeling using Auxiliary Confidence Labels.

Eric Heim1, Milos Hauskrecht1.   

Abstract

In this work, we focus on the problem of learning a classification model that performs inference on patient Electronic Health Records (EHRs). Often, a large amount of costly expert supervision is required to learn such a model. To reduce this cost, we obtain confidence labels that indicate how sure an expert is in the class labels she provides. If meaningful confidence information can be incorporated into a learning method, fewer patient instances may need to be labeled to learn an accurate model. In addition, while accuracy of predictions is important for any inference model, a model of patients must be interpretable so that clinicians can understand how the model is making decisions. To these ends, we develop a novel metric learning method called Confidence bAsed MEtric Learning (CAMEL) that supports inclusion of confidence labels, but also emphasizes interpretability in three ways. First, our method induces sparsity, thus producing simple models that use only a few features from patient EHRs. Second, CAMEL naturally produces confidence scores that can be taken into consideration when clinicians make treatment decisions. Third, the metrics learned by CAMEL induce multidimensional spaces where each dimension represents a different "factor" that clinicians can use to assess patients. In our experimental evaluation, we show on a real-world clinical data set that our CAMEL methods are able to learn models that are as or more accurate as other methods that use the same supervision. Furthermore, we show that when CAMEL uses confidence scores it is able to learn models as or more accurate as others we tested while using only 10% of the training instances. Finally, we perform qualitative assessments on the metrics learned by CAMEL and show that they identify and clearly articulate important factors in how the model performs inference.

Entities:  

Year:  2015        PMID: 26949568      PMCID: PMC4774858          DOI: 10.1109/BIBM.2015.7359703

Source DB:  PubMed          Journal:  Proceedings (IEEE Int Conf Bioinformatics Biomed)        ISSN: 2156-1125


  10 in total

1.  A support vector machine approach for detection of microcalcifications.

Authors:  Issam El-Naqa; Yongyi Yang; Miles N Wernick; Nikolas P Galatsanos; Robert M Nishikawa
Journal:  IEEE Trans Med Imaging       Date:  2002-12       Impact factor: 10.048

2.  Progress toward meaningful use: hospitals' adoption of electronic health records.

Authors:  Ashish K Jha; Matthew F Burke; Catherine DesRoches; Maulik S Joshi; Peter D Kralovec; Eric G Campbell; Melinda B Buntin
Journal:  Am J Manag Care       Date:  2011-12       Impact factor: 2.229

3.  Absolute identification by relative judgment.

Authors:  Neil Stewart; Gordon D A Brown; Nick Chater
Journal:  Psychol Rev       Date:  2005-10       Impact factor: 8.934

4.  Conditional outlier detection for clinical alerting.

Authors:  Milos Hauskrecht; Michal Valko; Iyad Batal; Gilles Clermont; Shyam Visweswaran; Gregory F Cooper
Journal:  AMIA Annu Symp Proc       Date:  2010-11-13

5.  Learning classification models with soft-label information.

Authors:  Quang Nguyen; Hamed Valizadegan; Milos Hauskrecht
Journal:  J Am Med Inform Assoc       Date:  2013-11-20       Impact factor: 4.497

6.  Feature importance analysis for patient management decisions.

Authors:  Michal Valko; Milos Hauskrecht
Journal:  Stud Health Technol Inform       Date:  2010

7.  Impact of the patient population on the risk for heparin-induced thrombocytopenia.

Authors:  T E Warkentin; J A Sheppard; P Horsewood; P J Simpson; J C Moore; J G Kelton
Journal:  Blood       Date:  2000-09-01       Impact factor: 22.113

8.  Outlier detection for patient monitoring and alerting.

Authors:  Milos Hauskrecht; Iyad Batal; Michal Valko; Shyam Visweswaran; Gregory F Cooper; Gilles Clermont
Journal:  J Biomed Inform       Date:  2012-08-27       Impact factor: 6.317

9.  Learning classification with auxiliary probabilistic information.

Authors:  Quang Nguyen; Hamed Valizadegan; Milos Hauskrecht
Journal:  Proc IEEE Int Conf Data Min       Date:  2011

10.  A comprehensive comparison of random forests and support vector machines for microarray-based cancer classification.

Authors:  Alexander Statnikov; Lily Wang; Constantin F Aliferis
Journal:  BMC Bioinformatics       Date:  2008-07-22       Impact factor: 3.169

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.