| Literature DB >> 25954575 |
Min Jiang1, Yonghui Wu1, Anushi Shah2, Priyanka Priyanka3, Joshua C Denny2, Hua Xu1.
Abstract
Extraction of medication information embedded in clinical text is important for research using electronic health records (EHRs). However, most of current medication information extraction systems identify drug and signature entities without mapping them to standard representation. In this study, we introduced the open source Java implementation of MedEx, an existing high-performance medication information extraction system, based on the Unstructured Information Management Architecture (UIMA) framework. In addition, we developed new encoding modules in the MedEx-UIMA system, which mapped an extracted drug name/dose/form to both generalized and specific RxNorm concepts and translated drug frequency information to ISO standard. We processed 826 documents by both systems and verified that MedEx-UIMA and MedEx (the Python version) performed similarly by comparing both results. Using two manually annotated test sets that contained 300 drug entries from medication list and 300 drug entries from narrative reports, the MedEx-UIMA system achieved F-measures of 98.5% and 97.5% respectively for encoding drug names to corresponding RxNorm generic drug ingredients, and F-measures of 85.4% and 88.1% respectively for mapping drug names/dose/form to the most specific RxNorm concepts. It also achieved an F-measure of 90.4% for normalizing frequency information to ISO standard. The open source MedEx-UIMA system is freely available online at http://code.google.com/p/medex-uima/.Entities:
Year: 2014 PMID: 25954575 PMCID: PMC4419757
Source DB: PubMed Journal: AMIA Jt Summits Transl Sci Proc
Figure 1.An overview of the MedEx-UIMA system
Figure 2.The example of determination of most specific RxNORM code
Evaluation results of MedEx-UIMA on extracting and encoding drug and frequency information
| Tasks | Precision | Recall | F-measure |
|---|---|---|---|
| Drug encoding - least-specific RxCUIs (Clinical narratives) | 98.8 % | 96.3 % | 97.5 % |
| Drug encoding - most specific RxCUIs (Clinical narratives) | 89.3% | 87.0% | 88.1 % |
| Frequency normalization (Clinical narratives) | 91.9% | 88.9% | 90.4% |
| Drug encoding - least-specific RxCUIs (Medication list) | 99.0% | 98.0% | 98.5% |
| Drug encoding - most-specific RxCUIs (Medication list) | 85.8% | 85.0% | 85.4% |