Literature DB >> 19696949

Automated identification of diagnosis and co-morbidity in clinical records.

C Cano1, A Blanco, L Peshkin.   

Abstract

OBJECTIVES: Automated understanding of clinical records is a challenging task involving various legal and technical difficulties. Clinical free text is inherently redundant, unstructured, and full of acronyms, abbreviations and domain-specific language which make it challenging to mine automatically. There is much effort in the field focused on creating specialized ontology, lexicons and heuristics based on expert knowledge of the domain. However, ad-hoc solutions poorly generalize across diseases or diagnoses. This paper presents a successful approach for a rapid prototyping of a diagnosis classifier based on a popular computational linguistics platform.
METHODS: The corpus consists of several hundred of full length discharge summaries provided by Partners Healthcare. The goal is to identify a diagnosis and assign co-morbidi-ty. Our approach is based on the rapid implementation of a logistic regression classifier using an existing toolkit: LingPipe (http://alias-i.com/lingpipe). We implement and compare three different classifiers. The baseline approach uses character 5-grams as features. The second approach uses a bag-of-words representation enriched with a small additional set of features. The third approach reduces a feature set to the most informative features according to the information content.
RESULTS: The proposed systems achieve high performance (average F-micro 0.92) for the task. We discuss the relative merit of the three classifiers. Supplementary material with detailed results is available at: http:// decsai.ugr.es/~ccano/LR/supplementary_ material/
CONCLUSIONS: We show that our methodology for rapid prototyping of a domain-unaware system is effective for building an accurate classifier for clinical records.

Entities:  

Mesh:

Year:  2009        PMID: 19696949     DOI: 10.3414/ME0615

Source DB:  PubMed          Journal:  Methods Inf Med        ISSN: 0026-1270            Impact factor:   2.176


  2 in total

1.  Modelling and extraction of variability in free-text medication prescriptions from an anonymised primary care electronic medical record research database.

Authors:  George Karystianis; Therese Sheppard; William G Dixon; Goran Nenadic
Journal:  BMC Med Inform Decis Mak       Date:  2016-02-09       Impact factor: 2.796

Review 2.  Extracting information from the text of electronic medical records to improve case detection: a systematic review.

Authors:  Elizabeth Ford; John A Carroll; Helen E Smith; Donia Scott; Jackie A Cassell
Journal:  J Am Med Inform Assoc       Date:  2016-02-05       Impact factor: 4.497

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.