Literature DB >> 28736771

On Interestingness Measures for Mining Statistically Significant and Novel Clinical Associations from EMRs.

Orhan Abar1, Richard J Charnigo2, Abner Rayapati3, Ramakanth Kavuluru4.   

Abstract

Association rule mining has received significant attention from both the data mining and machine learning communities. While data mining researchers focus more on designing efficient algorithms to mine rules from large datasets, the learning community has explored applications of rule mining to classification. A major problem with rule mining algorithms is the explosion of rules even for moderate sized datasets making it very difficult for end users to identify both statistically significant and potentially novel rules that could lead to interesting new insights and hypotheses. Researchers have proposed many domain independent interestingness measures using which, one can rank the rules and potentially glean useful rules from the top ranked ones. However, these measures have not been fully explored for rule mining in clinical datasets owing to the relatively large sizes of the datasets often encountered in healthcare and also due to limited access to domain experts for review/analysis. In this paper, using an electronic medical record (EMR) dataset of diagnoses and medications from over three million patient visits to the University of Kentucky medical center and affiliated clinics, we conduct a thorough evaluation of dozens of interestingness measures proposed in data mining literature, including some new composite measures. Using cumulative relevance metrics from information retrieval, we compare these interestingness measures against human judgments obtained from a practicing psychiatrist for association rules involving the depressive disorders class as the consequent. Our results not only surface new interesting associations for depressive disorders but also indicate classes of interestingness measures that weight rule novelty and statistical strength in contrasting ways, offering new insights for end users in identifying interesting rules.

Entities:  

Keywords:  association rule mining; electronic medical records; rule interestingness measures

Year:  2016        PMID: 28736771      PMCID: PMC5521989          DOI: 10.1145/2975167.2985843

Source DB:  PubMed          Journal:  ACM BCB


  11 in total

1.  THE ENVIRONMENT AND DISEASE: ASSOCIATION OR CAUSATION?

Authors:  A B HILL
Journal:  Proc R Soc Med       Date:  1965-05

2.  The US National Comorbidity Survey Replication (NCS-R): design and field procedures.

Authors:  Ronald C Kessler; Patricia Berglund; Wai Tat Chiu; Olga Demler; Steven Heeringa; Eva Hiripi; Robert Jin; Beth-Ellen Pennell; Ellen E Walters; Alan Zaslavsky; Hui Zheng
Journal:  Int J Methods Psychiatr Res       Date:  2004       Impact factor: 4.035

3.  Association rules and data mining in hospital infection control and public health surveillance.

Authors:  S E Brossette; A P Sprague; J M Hardin; K B Waites; W T Jones; S A Moser
Journal:  J Am Med Inform Assoc       Date:  1998 Jul-Aug       Impact factor: 4.497

Review 4.  Mental disorders and medical comorbidity.

Authors:  Benjamin G Druss; Elizabeth Reisinger Walker
Journal:  Synth Proj Res Synth Rep       Date:  2011-02

5.  Validation of an association rule mining-based method to infer associations between medications and problems.

Authors:  A Wright; A McCoy; S Henkin; M Flaherty; D Sittig
Journal:  Appl Clin Inform       Date:  2013-03-06       Impact factor: 2.342

6.  Calculating confidence intervals for relative risks (odds ratios) and standardised ratios and rates.

Authors:  J A Morris; M J Gardner
Journal:  Br Med J (Clin Res Ed)       Date:  1988-05-07

7.  An empirical evaluation of supervised learning approaches in assigning diagnosis codes to electronic medical records.

Authors:  Ramakanth Kavuluru; Anthony Rios; Yuan Lu
Journal:  Artif Intell Med       Date:  2015-05-15       Impact factor: 5.326

8.  Proton Pump Inhibitor Usage and the Risk of Myocardial Infarction in the General Population.

Authors:  Nigam H Shah; Paea LePendu; Anna Bauer-Mehren; Yohannes T Ghebremariam; Srinivasan V Iyer; Jake Marcus; Kevin T Nead; John P Cooke; Nicholas J Leeper
Journal:  PLoS One       Date:  2015-06-10       Impact factor: 3.240

9.  Quantifying the impact of chronic conditions on a diagnosis of major depressive disorder in adults: a cohort study using linked electronic medical records.

Authors:  Euijung Ryu; Alanna M Chamberlain; Richard S Pendegraft; Tanya M Petterson; William V Bobo; Jyotishman Pathak
Journal:  BMC Psychiatry       Date:  2016-04-26       Impact factor: 3.630

10.  Comparing high-dimensional confounder control methods for rapid cohort studies from electronic health records.

Authors:  Yen Sia Low; Blanca Gallego; Nigam Haresh Shah
Journal:  J Comp Eff Res       Date:  2015-12-04       Impact factor: 1.744

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.