Literature DB >> 26122527

The role of fine-grained annotations in supervised recognition of risk factors for heart disease from EHRs.

Kirk Roberts1, Sonya E Shooshan2, Laritza Rodriguez2, Swapna Abhyankar2, Halil Kilicoglu2, Dina Demner-Fushman2.   

Abstract

This paper describes a supervised machine learning approach for identifying heart disease risk factors in clinical text, and assessing the impact of annotation granularity and quality on the system's ability to recognize these risk factors. We utilize a series of support vector machine models in conjunction with manually built lexicons to classify triggers specific to each risk factor. The features used for classification were quite simple, utilizing only lexical information and ignoring higher-level linguistic information such as syntax and semantics. Instead, we incorporated high-quality data to train the models by annotating additional information on top of a standard corpus. Despite the relative simplicity of the system, it achieves the highest scores (micro- and macro-F1, and micro- and macro-recall) out of the 20 participants in the 2014 i2b2/UTHealth Shared Task. This system obtains a micro- (macro-) precision of 0.8951 (0.8965), recall of 0.9625 (0.9611), and F1-measure of 0.9276 (0.9277). Additionally, we perform a series of experiments to assess the value of the annotated data we created. These experiments show how manually-labeled negative annotations can improve information extraction performance, demonstrating the importance of high-quality, fine-grained natural language annotations.
Copyright © 2015 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Machine learning; Natural language annotation; Natural language processing

Mesh:

Year:  2015        PMID: 26122527      PMCID: PMC4988795          DOI: 10.1016/j.jbi.2015.06.010

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  26 in total

Review 1.  Evaluating the state of the art in coreference resolution for electronic medical records.

Authors:  Ozlem Uzuner; Andreea Bodnari; Shuying Shen; Tyler Forbush; John Pestian; Brett R South
Journal:  J Am Med Inform Assoc       Date:  2012-02-24       Impact factor: 4.497

2.  Finding temporal order in discharge summaries.

Authors:  Philip Bramsen; Pawan Deshpande; Yoong Keok Lee; Regina Barzilay
Journal:  AMIA Annu Symp Proc       Date:  2006

3.  A comparative study of current Clinical Natural Language Processing systems on handling abbreviations in discharge summaries.

Authors:  Yonghui Wu; Joshua C Denny; S Trent Rosenbloom; Randolph A Miller; Dario A Giuse; Hua Xu
Journal:  AMIA Annu Symp Proc       Date:  2012-11-03

4.  Practical applications for natural language processing in clinical research: The 2014 i2b2/UTHealth shared tasks.

Authors:  Özlem Uzuner; Amber Stubbs
Journal:  J Biomed Inform       Date:  2015-10-24       Impact factor: 6.317

5.  Knowledge-rich temporal relation identification and classification in clinical notes.

Authors:  Jennifer D'Souza; Vincent Ng
Journal:  Database (Oxford)       Date:  2014-11-19       Impact factor: 3.451

6.  Identification and extraction of family history information from clinical reports.

Authors:  Sergey Goryachev; Hyeoneui Kim; Qing Zeng-Treitler
Journal:  AMIA Annu Symp Proc       Date:  2008-11-06

7.  Identifying patient smoking status from medical discharge records.

Authors:  Ozlem Uzuner; Ira Goldstein; Yuan Luo; Isaac Kohane
Journal:  J Am Med Inform Assoc       Date:  2007-10-18       Impact factor: 4.497

Review 8.  Evaluating temporal relations in clinical text: 2012 i2b2 Challenge.

Authors:  Weiyi Sun; Anna Rumshisky; Ozlem Uzuner
Journal:  J Am Med Inform Assoc       Date:  2013-04-05       Impact factor: 4.497

9.  Sentiment Analysis of Suicide Notes: A Shared Task.

Authors:  John P Pestian; Pawel Matykiewicz; Michelle Linn-Gust; Brett South; Ozlem Uzuner; Jan Wiebe; K Bretonnel Cohen; John Hurdle; Christopher Brew
Journal:  Biomed Inform Insights       Date:  2012-01-30

10.  DNorm: disease name normalization with pairwise learning to rank.

Authors:  Robert Leaman; Rezarta Islamaj Dogan; Zhiyong Lu
Journal:  Bioinformatics       Date:  2013-08-21       Impact factor: 6.937

View more
  8 in total

1.  Practical applications for natural language processing in clinical research: The 2014 i2b2/UTHealth shared tasks.

Authors:  Özlem Uzuner; Amber Stubbs
Journal:  J Biomed Inform       Date:  2015-10-24       Impact factor: 6.317

2.  Psychiatric symptom recognition without labeled data using distributional representations of phrases and on-line knowledge.

Authors:  Yaoyun Zhang; Olivia Zhang; Yonghui Wu; Hee-Jin Lee; Jun Xu; Hua Xu; Kirk Roberts
Journal:  J Biomed Inform       Date:  2017-06-15       Impact factor: 6.317

3.  Counting trees in Random Forests: Predicting symptom severity in psychiatric intake reports.

Authors:  Elyne Scheurwegs; Madhumita Sushil; Stéphan Tulkens; Walter Daelemans; Kim Luyckx
Journal:  J Biomed Inform       Date:  2017-06-07       Impact factor: 6.317

4.  Inferring the Interactions of Risk Factors from EHRs.

Authors:  Travis Goodwin; Sanda M Harabagiu
Journal:  AMIA Jt Summits Transl Sci Proc       Date:  2016-07-19

Review 5.  Natural Language Processing of Clinical Notes on Chronic Diseases: Systematic Review.

Authors:  Seyedmostafa Sheikhalishahi; Riccardo Miotto; Joel T Dudley; Alberto Lavelli; Fabio Rinaldi; Venet Osmani
Journal:  JMIR Med Inform       Date:  2019-04-27

6.  Evaluating shallow and deep learning strategies for the 2018 n2c2 shared task on clinical text classification.

Authors:  Michel Oleynik; Amila Kugic; Zdenko Kasáč; Markus Kreuzthaler
Journal:  J Am Med Inform Assoc       Date:  2019-11-01       Impact factor: 4.497

7.  Constructing fine-grained entity recognition corpora based on clinical records of traditional Chinese medicine.

Authors:  Tingting Zhang; Yaqiang Wang; Xiaofeng Wang; Yafei Yang; Ying Ye
Journal:  BMC Med Inform Decis Mak       Date:  2020-04-06       Impact factor: 2.796

8.  Applications of Machine Learning Using Electronic Medical Records in Spine Surgery.

Authors:  John T Schwartz; Michael Gao; Eric A Geng; Kush S Mody; Christopher M Mikhail; Samuel K Cho
Journal:  Neurospine       Date:  2019-12-31
  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.