OBJECTIVE: To develop an algorithm for building longitudinal medication dose datasets using information extracted from clinical notes in electronic health records (EHRs). MATERIALS AND METHODS: We developed an algorithm that converts medication information extracted using natural language processing (NLP) into a usable format and builds longitudinal medication dose datasets. We evaluated the algorithm on 2 medications extracted from clinical notes of Vanderbilt's EHR and externally validated the algorithm using clinical notes from the MIMIC-III clinical care database. RESULTS: For the evaluation using Vanderbilt's EHR data, the performance of our algorithm was excellent; F1-measures were ≥0.98 for both dose intake and daily dose. For the external validation using MIMIC-III, the algorithm achieved F1-measures ≥0.85 for dose intake and ≥0.82 for daily dose. DISCUSSION: Our algorithm addresses the challenge of building longitudinal medication dose data using information extracted from clinical notes. Overall performance was excellent, but the algorithm can perform poorly when incorrect information is extracted by NLP systems. Although it performed reasonably well when applied to the external data source, its performance was worse due to differences in the way the drug information was written. The algorithm is implemented in the R package, "EHR," and the extracted data from Vanderbilt's EHRs along with the gold standards are provided so that users can reproduce the results and help improve the algorithm. CONCLUSION: Our algorithm for building longitudinal dose data provides a straightforward way to use EHR data for medication-based studies. The external validation results suggest its potential for applicability to other systems.
OBJECTIVE: To develop an algorithm for building longitudinal medication dose datasets using information extracted from clinical notes in electronic health records (EHRs). MATERIALS AND METHODS: We developed an algorithm that converts medication information extracted using natural language processing (NLP) into a usable format and builds longitudinal medication dose datasets. We evaluated the algorithm on 2 medications extracted from clinical notes of Vanderbilt's EHR and externally validated the algorithm using clinical notes from the MIMIC-III clinical care database. RESULTS: For the evaluation using Vanderbilt's EHR data, the performance of our algorithm was excellent; F1-measures were ≥0.98 for both dose intake and daily dose. For the external validation using MIMIC-III, the algorithm achieved F1-measures ≥0.85 for dose intake and ≥0.82 for daily dose. DISCUSSION: Our algorithm addresses the challenge of building longitudinal medication dose data using information extracted from clinical notes. Overall performance was excellent, but the algorithm can perform poorly when incorrect information is extracted by NLP systems. Although it performed reasonably well when applied to the external data source, its performance was worse due to differences in the way the drug information was written. The algorithm is implemented in the R package, "EHR," and the extracted data from Vanderbilt's EHRs along with the gold standards are provided so that users can reproduce the results and help improve the algorithm. CONCLUSION: Our algorithm for building longitudinal dose data provides a straightforward way to use EHR data for medication-based studies. The external validation results suggest its potential for applicability to other systems.
Authors: Guergana K Savova; James J Masanz; Philip V Ogren; Jiaping Zheng; Sunghwan Sohn; Karin C Kipper-Schuler; Christopher G Chute Journal: J Am Med Inform Assoc Date: 2010 Sep-Oct Impact factor: 4.497
Authors: Hannah L Weeks; Cole Beck; Elizabeth McNeer; Michael L Williams; Cosmin A Bejan; Joshua C Denny; Leena Choi Journal: J Am Med Inform Assoc Date: 2020-03-01 Impact factor: 4.497
Authors: Katherine C Konvinse; Jason A Trubiano; Rebecca Pavlos; Ian James; Christian M Shaffer; Cosmin A Bejan; Ryan J Schutte; David A Ostrov; Mark A Pilkinton; Misha Rosenbach; Jeffrey P Zwerner; Kristina B Williams; Jack Bourke; Patricia Martinez; Francois Rwandamuriye; Abha Chopra; Mark Watson; Alec J Redwood; Katie D White; Simon A Mallal; Elizabeth J Phillips Journal: J Allergy Clin Immunol Date: 2019-02-16 Impact factor: 10.793
Authors: Leena Choi; Cole Beck; Elizabeth McNeer; Hannah L Weeks; Michael L Williams; Nathan T James; Xinnan Niu; Bassel W Abou-Khalil; Kelly A Birdwell; Dan M Roden; C Michael Stein; Cosmin A Bejan; Joshua C Denny; Sara L Van Driest Journal: Clin Pharmacol Ther Date: 2020-02-11 Impact factor: 6.875
Authors: Michael L Williams; Hannah L Weeks; Cole Beck; Kelly A Birdwell; Sara L Van Driest; Leena Choi Journal: Br J Clin Pharmacol Date: 2022-01-27 Impact factor: 3.716