Literature DB >> 26305514

Mining heart disease risk factors in clinical text with named entity recognition and distributional semantic models.

Jay Urbain1.   

Abstract

We present the design, and analyze the performance of a multi-stage natural language processing system employing named entity recognition, Bayesian statistics, and rule logic to identify and characterize heart disease risk factor events in diabetic patients over time. The system was originally developed for the 2014 i2b2 Challenges in Natural Language in Clinical Data. The system's strengths included a high level of accuracy for identifying named entities associated with heart disease risk factor events. The system's primary weakness was due to inaccuracies when characterizing the attributes of some events. For example, determining the relative time of an event with respect to the record date, whether an event is attributable to the patient's history or the patient's family history, and differentiating between current and prior smoking status. We believe these inaccuracies were due in large part to the lack of an effective approach for integrating context into our event detection model. To address these inaccuracies, we explore the addition of a distributional semantic model for characterizing contextual evidence of heart disease risk factor events. Using this semantic model, we raise our initial 2014 i2b2 Challenges in Natural Language of Clinical data F1 score of 0.838 to 0.890 and increased precision by 10.3% without use of any lexicons that might bias our results.
Copyright © 2015 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Biomedical text mining; Clinical informatics; Diabetes; Distributional semantic models; Heart disease risk factors; Named entity recognition; Natural language processing; Translational research

Mesh:

Year:  2015        PMID: 26305514      PMCID: PMC4984540          DOI: 10.1016/j.jbi.2015.08.009

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  3 in total

1.  Semantic interoperation and electronic health records: context sensitive mapping from SNOMED CT to ICD-10.

Authors:  James R Campbell; Hazel Brear; Rita Scichilone; Susan White; Kathy Giannangelo; Brian Carlsen; Harold Solbrig; Kin Wah Fung
Journal:  Stud Health Technol Inform       Date:  2013

2.  Identifying synonymy between SNOMED clinical terms of varying length using distributional analysis of electronic health records.

Authors:  Aron Henriksson; Mike Conway; Martin Duneld; Wendy W Chapman
Journal:  AMIA Annu Symp Proc       Date:  2013-11-16

3.  Passage relevance models for genomics search.

Authors:  Jay Urbain; Ophir Frieder; Nazli Goharian
Journal:  BMC Bioinformatics       Date:  2009-03-19       Impact factor: 3.169

  3 in total
  10 in total

1.  Feature extraction for phenotyping from semantic and knowledge resources.

Authors:  Wenxin Ning; Stephanie Chan; Andrew Beam; Ming Yu; Alon Geva; Katherine Liao; Mary Mullen; Kenneth D Mandl; Isaac Kohane; Tianxi Cai; Sheng Yu
Journal:  J Biomed Inform       Date:  2019-02-07       Impact factor: 6.317

2.  Automatic prediction of coronary artery disease from clinical narratives.

Authors:  Kevin Buchan; Michele Filannino; Özlem Uzuner
Journal:  J Biomed Inform       Date:  2017-06-27       Impact factor: 6.317

3.  Practical applications for natural language processing in clinical research: The 2014 i2b2/UTHealth shared tasks.

Authors:  Özlem Uzuner; Amber Stubbs
Journal:  J Biomed Inform       Date:  2015-10-24       Impact factor: 6.317

4.  Assessing Information Congruence of Documented Cardiovascular Disease between Electronic Dental and Medical Records.

Authors:  Jay Patel; Danielle Mowery; Anand Krishnan; Thankam Thyvalikakath
Journal:  AMIA Annu Symp Proc       Date:  2018-12-05

Review 5.  Clinical concept extraction: A methodology review.

Authors:  Sunyang Fu; David Chen; Huan He; Sijia Liu; Sungrim Moon; Kevin J Peterson; Feichen Shen; Liwei Wang; Yanshan Wang; Andrew Wen; Yiqing Zhao; Sunghwan Sohn; Hongfang Liu
Journal:  J Biomed Inform       Date:  2020-08-06       Impact factor: 6.317

6.  Using Natural Language Processing to Measure and Improve Quality of Diabetes Care: A Systematic Review.

Authors:  Alexander Turchin; Luisa F Florez Builes
Journal:  J Diabetes Sci Technol       Date:  2021-03-19

7.  A new synonym-substitution method to enrich the human phenotype ontology.

Authors:  Maria Taboada; Hadriana Rodriguez; Ranga C Gudivada; Diego Martinez
Journal:  BMC Bioinformatics       Date:  2017-10-10       Impact factor: 3.169

8.  Combining information from a clinical data warehouse and a pharmaceutical database to generate a framework to detect comorbidities in electronic health records.

Authors:  Emmanuelle Sylvestre; Guillaume Bouzillé; Emmanuel Chazard; Cécil His-Mahier; Christine Riou; Marc Cuggia
Journal:  BMC Med Inform Decis Mak       Date:  2018-01-24       Impact factor: 2.796

Review 9.  Systematic Evaluation of Research Progress on Natural Language Processing in Medicine Over the Past 20 Years: Bibliometric Study on PubMed.

Authors:  Jing Wang; Huan Deng; Bangtao Liu; Anbin Hu; Jun Liang; Lingye Fan; Xu Zheng; Tong Wang; Jianbo Lei
Journal:  J Med Internet Res       Date:  2020-01-23       Impact factor: 5.428

10.  Biomedical Text Categorization Based on Ensemble Pruning and Optimized Topic Modelling.

Authors:  Aytuğ Onan
Journal:  Comput Math Methods Med       Date:  2018-07-22       Impact factor: 2.238

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.