| Literature DB >> 21347153 |
Hua Xu1, Son Doan, Kelly A Birdwell, James D Cowan, Andrew J Vincz, David W Haas, Melissa A Basford, Joshua C Denny.
Abstract
Clinical research often requires extracting detailed drug information, such as medication names and dosages, from Electronic Health Records (EHR). Since medication information is often recorded as both structured and unstructured formats in the EHR, extracting all the relevant drug mentions and determining the daily dose of a medication for a selected patient at a given date can be a challenging and time-consuming task. In this paper, we present an automated approach using natural language processing to calculate daily doses of medications mentioned in clinical text, using tacrolimus as a test case. We evaluated this method using data sets from four different types of unstructured clinical data. Our results showed that the system achieved precisions of 0.90-1.00 and recalls of 0.81-1.00.Entities:
Year: 2010 PMID: 21347153 PMCID: PMC3041548
Source DB: PubMed Journal: Summit Transl Bioinform ISSN: 2153-6430
Figure 1.An overview of the MedEx system.
Examples of dose-related findings.
| 1.“prograf 1mg 5 tabs twice daily” | 2.“TACROLIM US 7MG BID” | |
| 1mg | ||
| 5 tabs | ||
| 7MG | ||
| twice daily | BID |
Examples of normalization of dose-related
| Strength | “1mg” | Qty:1, Unit:mg |
| DoseAmount | “5 tabs” | Qty:5, Unit:tablet |
| Dose | “7MG” | Qty:7, Unit:mg |
| Frequency | “twice daily” | Freq:2, Qty:1, Unit:day |
Precisions and recalls of extracting dose-related findings and determining daily doses of tacrolimus using four different types of clinical text.
| 0.96 | 0.96 | 0.93 | 0.95 | 0.94 | ||
| 0.71 | 0.74 | 0.82 | 0.84 | 0.81 | ||
| 0.92 | 0.92 | 0.90 | 0.93 | 0.90 | ||
| 1.00 | 1.00 | 0.84 | 0.98 | 0.86 | ||
| 1.00 | 0.95 | 0.97 | 0.98 | 0.95 | ||
| 0.92 | 0.92 | 0.90 | 0.97 | 0.92 | ||
| N/A | N/A | 1.00 | 1.00 | 1.00 | ||
| N/A | N/A | 1.00 | 1.00 | 1.00 | ||
Figure 2.Examples of sentences with multiple sets of dosing information.