
The class imbalance problem detecting adverse drug reactions in electronic health records.

Sara Santiso, Arantza Casillas, Alicia Pérez.

Abstract

This work addresses adverse drug reaction extraction under class imbalance. Adverse drug reactions are infrequent events in electronic health records; nevertheless, they must be documented. Text mining techniques can help retrieve this valuable information from text. The class imbalance was tackled using different sampling methods, cost-sensitive learning, ensemble learning and one-class classification, with Random Forest as the classifier. The adverse drug reaction extraction model was inferred from a dataset of real electronic health records with an imbalance ratio of 1:222; that is, for each drug-disease pair that is an adverse drug reaction, there are approximately 222 pairs that are not. Applying a sampling technique before cost-sensitive learning gave the best result. On the test set, the f-measure was 0.121 for the minority class and 0.996 for the majority class.
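The quantities named in the abstract (imbalance ratio, sampling followed by cost-sensitive weighting, per-class f-measure) can be illustrated with a minimal sketch. It is not the authors' pipeline: the abstract does not say which sampling method or cost values were used, so random undersampling and inverse-frequency class weights are assumptions here, as are all function names and the toy labels. The Random Forest classifier itself is omitted.

```python
import random
from collections import Counter


def imbalance_ratio(labels, minority=1):
    """Number of majority examples per minority example."""
    n_min = Counter(labels)[minority]
    return (len(labels) - n_min) / n_min


def undersample(labels, ratio, minority=1, seed=0):
    """Randomly drop majority examples (assumed sampling method) until
    at most `ratio` majority examples remain per minority example.
    Returns the sorted indices of the examples that are kept."""
    rng = random.Random(seed)
    min_idx = [i for i, y in enumerate(labels) if y == minority]
    maj_idx = [i for i, y in enumerate(labels) if y != minority]
    keep = min(len(maj_idx), int(ratio * len(min_idx)))
    return sorted(min_idx + rng.sample(maj_idx, keep))


def class_weights(labels):
    """Inverse-frequency misclassification costs (assumed scheme):
    weight = n_samples / (n_classes * class_count)."""
    counts = Counter(labels)
    n, k = len(labels), len(counts)
    return {c: n / (k * m) for c, m in counts.items()}


def f_measure(y_true, y_pred, positive):
    """Harmonic mean of precision and recall for one class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        return 0.0
    precision, recall = tp / (tp + fp), tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Toy labels with the 1:222 ratio reported in the abstract (1 = ADR pair).
labels = [1] * 10 + [0] * 2220
kept = undersample(labels, ratio=10)       # sampling first ...
sampled = [labels[i] for i in kept]
weights = class_weights(sampled)           # ... then cost-sensitive weights
```

In this sketch the weights computed after sampling would be passed to a classifier that supports per-class costs. Because the f-measure is computed per class, a model can score close to 1 on the majority class while scoring much lower on the rare class, which is the pattern the reported 0.996 and 0.121 show.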

Keywords:  adverse drug reactions; class imbalance; decision support systems; electronic health records; text mining

Year: 2018    PMID: 30230408    DOI: 10.1177/1460458218799470

Source DB: PubMed    Journal: Health Informatics J    ISSN: 1460-4582    Impact factor: 2.681


  2 in total

1.  A Keyword-Enhanced Approach to Handle Class Imbalance in Clinical Text Classification.

Authors:  Andrew E Blanchard; Shang Gao; Hong-Jun Yoon; J Blair Christian; Eric B Durbin; Xiao-Cheng Wu; Antoinette Stroup; Jennifer Doherty; Stephen M Schwartz; Charles Wiggins; Linda Coyle; Lynne Penberthy; Georgia D Tourassi
Journal:  IEEE J Biomed Health Inform       Date:  2022-06-03       Impact factor: 7.021

2.  A Network-Based Drug Repurposing Method Via Non-Negative Matrix Factorization.

Authors:  Shagahyegh Sadeghi; Jianguo Lu; Alioune Ngom
Journal:  Bioinformatics       Date:  2021-12-07       Impact factor: 6.937

