Literature DB >> 21216571

Cost-sensitive case-based reasoning using a genetic algorithm: application to medical diagnosis.

Yoon-Joo Park1, Se-Hak Chun, Byung-Chun Kim.   

Abstract

OBJECTIVE: The paper studies the new learning technique called cost-sensitive case-based reasoning (CSCBR) incorporating unequal misclassification cost into CBR model. Conventional CBR is now considered as a suitable technique for diagnosis, prognosis and prescription in medicine. However it lacks the ability to reflect asymmetric misclassification and often assumes that the cost of a positive diagnosis (an illness) as a negative one (no illness) is the same with that of the opposite situation. Thus, the objective of this research is to overcome the limitation of conventional CBR and encourage applying CBR to many real world medical cases associated with costs of asymmetric misclassification errors.
METHODS: The main idea involves adjusting the optimal cut-off classification point for classifying the absence or presence of diseases and the cut-off distance point for selecting optimal neighbors within search spaces based on similarity distribution. These steps are dynamically adapted to new target cases using a genetic algorithm. We apply this proposed method to five real medical datasets and compare the results with two other cost-sensitive learning methods-C5.0 and CART.
RESULTS: Our finding shows that the total misclassification cost of CSCBR is lower than other cost-sensitive methods in many cases. Even though the genetic algorithm has limitations in terms of unstable results and over-fitting training data, CSCBR results with GA are better overall than those of other methods. Also the paired t-test results indicate that the total misclassification cost of CSCBR is significantly less than C5.0 and CART for several datasets.
CONCLUSION: We have proposed a new CBR method called cost-sensitive case-based reasoning (CSCBR) that can incorporate unequal misclassification costs into CBR and optimize the number of neighbors dynamically using a genetic algorithm. It is meaningful not only for introducing the concept of cost-sensitive learning to CBR, but also for encouraging the use of CBR in the medical area. The result shows that the total misclassification costs of CSCBR do not increase in arithmetic progression as the cost of false absence increases arithmetically, thus it is cost-sensitive. We also show that total misclassification costs of CSCBR are the lowest among all methods in four datasets out of five and the result is statistically significant in many cases. The limitation of our proposed CSCBR is confined to classify binary cases for minimizing misclassification cost because our proposed CSCBR is originally designed to classify binary case. Our future work extends this method for multi-classification which can classify more than two groups.
Copyright © 2010 Elsevier B.V. All rights reserved.

Mesh:

Year:  2011        PMID: 21216571     DOI: 10.1016/j.artmed.2010.12.001

Source DB:  PubMed          Journal:  Artif Intell Med        ISSN: 0933-3657            Impact factor:   5.326


  3 in total

Review 1.  Role of Soft Computing Approaches in HealthCare Domain: A Mini Review.

Authors:  Shalini Gambhir; Sanjay Kumar Malik; Yugal Kumar
Journal:  J Med Syst       Date:  2016-10-29       Impact factor: 4.460

2.  Accurate prediction of coronary artery disease using reliable diagnosis system.

Authors:  Indrajit Mandal; N Sairam
Journal:  J Med Syst       Date:  2012-02-12       Impact factor: 4.460

3.  An Efficient Predictive Model for Myocardial Infarction Using Cost-sensitive J48 Model.

Authors:  Atefeh Daraei; Hodjat Hamidi
Journal:  Iran J Public Health       Date:  2017-05       Impact factor: 1.429

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.