Literature DB >> 26133479

Using local lexicalized rules to identify heart disease risk factors in clinical notes.

George Karystianis1, Azad Dehghan1, Aleksandar Kovacevic2, John A Keane3, Goran Nenadic4.   

Abstract

Heart disease is the leading cause of death globally and a significant part of the human population lives with it. A number of risk factors have been recognized as contributing to the disease, including obesity, coronary artery disease (CAD), hypertension, hyperlipidemia, diabetes, smoking, and family history of premature CAD. This paper describes and evaluates a methodology to extract mentions of such risk factors from diabetic clinical notes, which was a task of the i2b2/UTHealth 2014 Challenge in Natural Language Processing for Clinical Data. The methodology is knowledge-driven and the system implements local lexicalized rules (based on syntactical patterns observed in notes) combined with manually constructed dictionaries that characterize the domain. A part of the task was also to detect the time interval in which the risk factors were present in a patient. The system was applied to an evaluation set of 514 unseen notes and achieved a micro-average F-score of 88% (with 86% precision and 90% recall). While the identification of CAD family history, medication and some of the related disease factors (e.g. hypertension, diabetes, hyperlipidemia) showed quite good results, the identification of CAD-specific indicators proved to be more challenging (F-score of 74%). Overall, the results are encouraging and suggested that automated text mining methods can be used to process clinical notes to identify risk factors and monitor progression of heart disease on a large-scale, providing necessary data for clinical and epidemiological studies.
Copyright © 2015 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Heart disease; Risk factors; Rule-based modelling; Text mining; Vocabularies

Mesh:

Year:  2015        PMID: 26133479      PMCID: PMC4974302          DOI: 10.1016/j.jbi.2015.06.013

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  19 in total

1.  Automated encoding of clinical documents based on natural language processing.

Authors:  Carol Friedman; Lyudmila Shagina; Yves Lussier; George Hripcsak
Journal:  J Am Med Inform Assoc       Date:  2004-06-07       Impact factor: 4.497

2.  Extracting medical information from narrative patient records: the case of medication-related information.

Authors:  Louise Deléger; Cyril Grouin; Pierre Zweigenbaum
Journal:  J Am Med Inform Assoc       Date:  2010 Sep-Oct       Impact factor: 4.497

3.  Identifying risk factors for metabolic syndrome in biomedical text.

Authors:  Marcelo Fiszman; Graciela Rosemblat; Caroline B Ahlers; Thomas C Rindflesch
Journal:  AMIA Annu Symp Proc       Date:  2007-10-11

4.  Feature engineering combined with machine learning and rule-based methods for structured information extraction from narrative clinical discharge summaries.

Authors:  Yan Xu; Kai Hong; Junichi Tsujii; Eric I-Chao Chang
Journal:  J Am Med Inform Assoc       Date:  2012-05-14       Impact factor: 4.497

5.  Practical applications for natural language processing in clinical research: The 2014 i2b2/UTHealth shared tasks.

Authors:  Özlem Uzuner; Amber Stubbs
Journal:  J Biomed Inform       Date:  2015-10-24       Impact factor: 6.317

6.  Medication information extraction with linguistic pattern matching and semantic rules.

Authors:  Irena Spasic; Farzaneh Sarafraz; John A Keane; Goran Nenadic
Journal:  J Am Med Inform Assoc       Date:  2010 Sep-Oct       Impact factor: 4.497

7.  Identification and extraction of family history information from clinical reports.

Authors:  Sergey Goryachev; Hyeoneui Kim; Qing Zeng-Treitler
Journal:  AMIA Annu Symp Proc       Date:  2008-11-06

8.  Combining rules and machine learning for extraction of temporal expressions and events from clinical narratives.

Authors:  Aleksandar Kovacevic; Azad Dehghan; Michele Filannino; John A Keane; Goran Nenadic
Journal:  J Am Med Inform Assoc       Date:  2013-04-20       Impact factor: 4.497

9.  High sensitivity cardiac troponin and the under-diagnosis of myocardial infarction in women: prospective cohort study.

Authors:  Anoop S V Shah; Megan Griffiths; Kuan Ken Lee; David A McAllister; Amanda L Hunter; Amy V Ferry; Anne Cruikshank; Alan Reid; Mary Stoddart; Fiona Strachan; Simon Walker; Paul O Collinson; Fred S Apple; Alasdair J Gray; Keith A A Fox; David E Newby; Nicholas L Mills
Journal:  BMJ       Date:  2015-01-21

10.  Recognition of medication information from discharge summaries using ensembles of classifiers.

Authors:  Son Doan; Nigel Collier; Hua Xu; Hoang Duy Pham; Minh Phuong Tu
Journal:  BMC Med Inform Decis Mak       Date:  2012-05-07       Impact factor: 2.796

View more
  8 in total

Review 1.  Identifying risk factors for heart disease over time: Overview of 2014 i2b2/UTHealth shared task Track 2.

Authors:  Amber Stubbs; Christopher Kotfila; Hua Xu; Özlem Uzuner
Journal:  J Biomed Inform       Date:  2015-07-22       Impact factor: 6.317

2.  Clinical trial cohort selection based on multi-level rule-based natural language processing system.

Authors:  Long Chen; Yu Gu; Xin Ji; Chao Lou; Zhiyong Sun; Haodan Li; Yuan Gao; Yang Huang
Journal:  J Am Med Inform Assoc       Date:  2019-11-01       Impact factor: 4.497

3.  Automatic prediction of coronary artery disease from clinical narratives.

Authors:  Kevin Buchan; Michele Filannino; Özlem Uzuner
Journal:  J Biomed Inform       Date:  2017-06-27       Impact factor: 6.317

4.  Practical applications for natural language processing in clinical research: The 2014 i2b2/UTHealth shared tasks.

Authors:  Özlem Uzuner; Amber Stubbs
Journal:  J Biomed Inform       Date:  2015-10-24       Impact factor: 6.317

5.  Automatic mining of symptom severity from psychiatric evaluation notes.

Authors:  George Karystianis; Alejo J Nevado; Chi-Hun Kim; Azad Dehghan; John A Keane; Goran Nenadic
Journal:  Int J Methods Psychiatr Res       Date:  2017-12-22       Impact factor: 4.035

6.  Automated Analysis of Domestic Violence Police Reports to Explore Abuse Types and Victim Injuries: Text Mining Study.

Authors:  George Karystianis; Armita Adily; Peter W Schofield; David Greenberg; Louisa Jorm; Goran Nenadic; Tony Butler
Journal:  J Med Internet Res       Date:  2019-03-12       Impact factor: 5.428

Review 7.  Natural Language Processing of Clinical Notes on Chronic Diseases: Systematic Review.

Authors:  Seyedmostafa Sheikhalishahi; Riccardo Miotto; Joel T Dudley; Alberto Lavelli; Fabio Rinaldi; Venet Osmani
Journal:  JMIR Med Inform       Date:  2019-04-27

8.  A rule-based approach to identify patient eligibility criteria for clinical trials from narrative longitudinal records.

Authors:  George Karystianis; Oscar Florez-Vargas; Tony Butler; Goran Nenadic
Journal:  JAMIA Open       Date:  2019-08-20
  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.