The problem of bias in training data in regression problems in medical decision support.

B Mac Namee, P Cunningham, S Byrne, O I Corrigan.

Abstract

This paper describes a bias problem encountered in a machine learning approach to outcome prediction in anticoagulant drug therapy. The outcome to be predicted is a measure of the clotting time for the patient; this measure is continuous, so the prediction task is a regression problem. Artificial neural networks (ANNs) are a powerful mechanism for learning to predict such outcomes from training data. However, experiments have shown that an ANN is biased towards values that occur more commonly in the training data and is thus less likely to be correct in predicting extreme values. This issue of bias in training data in regression problems is similar to the associated problem with minority classes in classification. In classification, however, the bias issue is well documented and is an ongoing area of research. In this paper, we consider stratified sampling and boosting as solutions to this bias problem and evaluate them on this outcome prediction problem and on two other datasets. Both approaches produce some improvements, with boosting showing the most promise.
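
The abstract does not say how the stratified sampling was carried out, so the following is only a minimal sketch of one common way to apply the idea to a continuous target: bin the target values, then resample so that each occupied bin is equally represented in the training set. The function name `stratified_resample`, the equal-width binning, and the `n_bins` parameter are illustrative assumptions, not the authors' procedure.

```python
import numpy as np

def stratified_resample(X, y, n_bins=5, rng=None):
    """Resample (X, y) so every occupied target-value bin is equally represented.

    Assumption: equal-width bins over the range of y. The binning scheme
    actually used in the paper is not given in the abstract.
    """
    rng = np.random.default_rng(rng)
    X = np.asarray(X)
    y = np.asarray(y, dtype=float)

    # Interior bin edges; np.digitize maps each y to a bin index in 0..n_bins-1.
    edges = np.linspace(y.min(), y.max(), n_bins + 1)[1:-1]
    bin_ids = np.digitize(y, edges)

    # Row indices belonging to each bin, skipping bins with no samples.
    groups = [np.flatnonzero(bin_ids == b) for b in range(n_bins)]
    groups = [idx for idx in groups if idx.size > 0]

    # Draw with replacement so each bin matches the size of the largest bin.
    target = max(idx.size for idx in groups)
    chosen = np.concatenate(
        [rng.choice(idx, size=target, replace=True) for idx in groups]
    )
    rng.shuffle(chosen)
    return X[chosen], y[chosen]
```

Oversampling rare bins with replacement duplicates the few extreme cases rather than creating new information, so a model trained on the resampled data should still be evaluated on an untouched test set.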

Year:  2002        PMID: 11779685     DOI: 10.1016/s0933-3657(01)00092-6

Source DB:  PubMed          Journal:  Artif Intell Med        ISSN: 0933-3657            Impact factor:   5.326


  5 in total

1.  Randomized trial of model predictive control for improved anemia management.

Authors:  Michael E Brier; Adam E Gaweda; Andrew Dailey; George R Aronoff; Alfred A Jacobs
Journal:  Clin J Am Soc Nephrol       Date:  2010-02-25       Impact factor: 8.237

2.  Inter-electrode correlations measured with EEG predict individual differences in cognitive ability.

Authors:  Nicole Hakim; Edward Awh; Edward K Vogel; Monica D Rosenberg
Journal:  Curr Biol       Date:  2021-10-11       Impact factor: 10.834

3.  A comparative analysis of multi-level computer-assisted decision making systems for traumatic injuries.

Authors:  Soo-Yeon Ji; Rebecca Smith; Toan Huynh; Kayvan Najarian
Journal:  BMC Med Inform Decis Mak       Date:  2009-01-14       Impact factor: 2.796

4.  The precision-recall plot is more informative than the ROC plot when evaluating binary classifiers on imbalanced datasets.

Authors:  Takaya Saito; Marc Rehmsmeier
Journal:  PLoS One       Date:  2015-03-04       Impact factor: 3.240

5.  Incremental learning with SVM for multimodal classification of prostatic adenocarcinoma.

Authors:  José Fernando García Molina; Lei Zheng; Metin Sertdemir; Dietmar J Dinter; Stefan Schönberg; Matthias Rädle
Journal:  PLoS One       Date:  2014-04-03       Impact factor: 3.240
