| Literature DB >> 22195128 |
Youngjun Kim1, John Hurdle, Stéphane M Meystre.
Abstract
Clinical text is rich in acronyms and abbreviations, and they are highly ambiguous. As a pre-processing step before subsequent NLP analysis, we are developing and evaluating clinical abbreviation disambiguation methods. The evaluation of two sequential steps, the detection and the disambiguation of abbreviations, is reported here, for various types of clinical notes. For abbreviations detection, our result indicated the SPECIALIST Lexicon LRABR needed to be revised for better abbreviation detection. Our semi-supervised method using generated training data based on expanded form matching for 12 frequent abbreviations in our clinical notes reached over 90% accuracy in five-fold cross validation and unsupervised approach produced comparable results with the semi-supervised methods.Mesh:
Year: 2011 PMID: 22195128 PMCID: PMC3243121
Source DB: PubMed Journal: AMIA Annu Symp Proc ISSN: 1559-4076