Literature DB >> 24508177

Automatic recognition of disorders, findings, pharmaceuticals and body structures from clinical text: an annotation and machine learning study.

Maria Skeppstedt1, Maria Kvist2, Gunnar H Nilsson3, Hercules Dalianis4.   

Abstract

Automatic recognition of clinical entities in the narrative text of health records is useful for constructing applications for documentation of patient care, as well as for secondary usage in the form of medical knowledge extraction. There are a number of named entity recognition studies on English clinical text, but less work has been carried out on clinical text in other languages. This study was performed on Swedish health records, and focused on four entities that are highly relevant for constructing a patient overview and for medical hypothesis generation, namely the entities: Disorder, Finding, Pharmaceutical Drug and Body Structure. The study had two aims: to explore how well named entity recognition methods previously applied to English clinical text perform on similar texts written in Swedish; and to evaluate whether it is meaningful to divide the more general category Medical Problem, which has been used in a number of previous studies, into the two more granular entities, Disorder and Finding. Clinical notes from a Swedish internal medicine emergency unit were annotated for the four selected entity categories, and the inter-annotator agreement between two pairs of annotators was measured, resulting in an average F-score of 0.79 for Disorder, 0.66 for Finding, 0.90 for Pharmaceutical Drug and 0.80 for Body Structure. A subset of the developed corpus was thereafter used for finding suitable features for training a conditional random fields model. Finally, a new model was trained on this subset, using the best features and settings, and its ability to generalise to held-out data was evaluated. This final model obtained an F-score of 0.81 for Disorder, 0.69 for Finding, 0.88 for Pharmaceutical Drug, 0.85 for Body Structure and 0.78 for the combined category Disorder+Finding. The obtained results, which are in line with or slightly lower than those for similar studies on English clinical text, many of them conducted using a larger training data set, show that the approaches used for English are also suitable for Swedish clinical text. However, a small proportion of the errors made by the model are less likely to occur in English text, showing that results might be improved by further tailoring the system to clinical Swedish. The entity recognition results for the individual entities Disorder and Finding show that it is meaningful to separate the general category Medical Problem into these two more granular entity types, e.g. for knowledge mining of co-morbidity relations and disorder-finding relations.
Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Clinical text processing; Corpora development; Disorder; Finding; Named entity recognition; Swedish

Mesh:

Year:  2014        PMID: 24508177     DOI: 10.1016/j.jbi.2014.01.012

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  17 in total

Review 1.  Clinical Natural Language Processing in 2014: Foundational Methods Supporting Efficient Healthcare.

Authors:  A Névéol; P Zweigenbaum
Journal:  Yearb Med Inform       Date:  2015-08-13

Review 2.  Recent Advances in Clinical Natural Language Processing in Support of Semantic Analysis.

Authors:  S Velupillai; D Mowery; B R South; M Kvist; H Dalianis
Journal:  Yearb Med Inform       Date:  2015-08-13

3.  Identifying the Clinical Laboratory Tests from Unspecified "Other Lab Test" Data for Secondary Use.

Authors:  Xuequn Pan; James J Cimino
Journal:  AMIA Annu Symp Proc       Date:  2015-11-05

4.  Comparing Different Methods for Named Entity Recognition in Portuguese Neurology Text.

Authors:  Fábio Lopes; César Teixeira; Hugo Gonçalo Oliveira
Journal:  J Med Syst       Date:  2020-02-28       Impact factor: 4.460

5.  A Relation-Oriented Model With Global Context Information for Joint Extraction of Overlapping Relations and Entities.

Authors:  Huihui Han; Jian Wang; Xiaowen Wang
Journal:  Front Neurorobot       Date:  2022-07-04       Impact factor: 3.493

6.  Finding Cervical Cancer Symptoms in Swedish Clinical Text using a Machine Learning Approach and NegEx.

Authors:  Rebecka Weegar; Maria Kvist; Karin Sundström; Søren Brunak; Hercules Dalianis
Journal:  AMIA Annu Symp Proc       Date:  2015-11-05

7.  Machine Learning for Geriatric Clinical Care: Opportunities and Challenges.

Authors:  Nazila Javadi-Pashaki; Mohammad Javad Ghazanfari; Samad Karkhah
Journal:  Ann Geriatr Med Res       Date:  2021-06-21

8.  Fine-grained information extraction from German transthoracic echocardiography reports.

Authors:  Martin Toepfer; Hamo Corovic; Georg Fette; Peter Klügl; Stefan Störk; Frank Puppe
Journal:  BMC Med Inform Decis Mak       Date:  2015-11-12       Impact factor: 2.796

Review 9.  Clinical Natural Language Processing in languages other than English: opportunities and challenges.

Authors:  Aurélie Névéol; Hercules Dalianis; Sumithra Velupillai; Guergana Savova; Pierre Zweigenbaum
Journal:  J Biomed Semantics       Date:  2018-03-30

10.  Design of an extensive information representation scheme for clinical narratives.

Authors:  Louise Deléger; Leonardo Campillos; Anne-Laure Ligozat; Aurélie Névéol
Journal:  J Biomed Semantics       Date:  2017-09-11
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.