Silvia P Canelón1, Heather H Burris2, Lisa D Levine3, Mary Regina Boland4. 1. Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania, United States. 2. Department of Pediatrics, University of Pennsylvania Perelman School of Medicine, United States. 3. Maternal and Child Health Research Center, Department of Obstetrics and Gynecology, University of Pennsylvania Perelman School of Medicine, United States. 4. Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania, United States; Institute for Biomedical Informatics, University of Pennsylvania, United States; Center for Excellence in Environmental Toxicology, University of Pennsylvania, United States; Department of Biomedical and Health Informatics, Children's Hospital of Philadelphia, United States. Electronic address: bolandm@upenn.edu.
Abstract
OBJECTIVE: To develop an algorithm that infers patient delivery dates (PDDs) and delivery-specific details from Electronic Health Records (EHRs) with high accuracy; enabling pregnancy-level outcome studies in women's health. MATERIALS AND METHODS: We obtained EHR data from 1,060,100 female patients treated at Penn Medicine hospitals or outpatient clinics between 2010-2017. We developed an algorithm called MADDIE: Method to Acquire Delivery Date Information from Electronic Health Records that infers a PDD for distinct deliveries based on EHR encounter dates assigned a delivery code, the frequency of code usage, and the time differential between code assignments. We validated MADDIE's PDDs against a birth log independently maintained by the Department of Obstetrics and Gynecology. RESULTS: MADDIE identified 50,560 patients having 63,334 distinct deliveries. MADDIE was 98.6 % accurate (F1-score 92.1 %) when compared to the birth log. The PDD was on average 0.68 days earlier than the true delivery date for patients with only one delivery (± 1.43 days) and 0.52 days earlier for patients with more than one delivery episode (± 1.11 days). DISCUSSION: MADDIE is the first algorithm to successfully infer PDD information using only structured delivery codes and identify multiple deliveries per patient. MADDIE is also the first to validate the accuracy of the PDD using an external gold standard of known delivery dates as opposed to manual chart review of a sample. CONCLUSION: MADDIE augments the EHR with delivery-specific details extracted with high accuracy and relies only on structured EHR elements while harnessing temporal information and the frequency of code usage to identify accurate PDDs.
OBJECTIVE: To develop an algorithm that infers patient delivery dates (PDDs) and delivery-specific details from Electronic Health Records (EHRs) with high accuracy; enabling pregnancy-level outcome studies in women's health. MATERIALS AND METHODS: We obtained EHR data from 1,060,100 female patients treated at Penn Medicine hospitals or outpatient clinics between 2010-2017. We developed an algorithm called MADDIE: Method to Acquire Delivery Date Information from Electronic Health Records that infers a PDD for distinct deliveries based on EHR encounter dates assigned a delivery code, the frequency of code usage, and the time differential between code assignments. We validated MADDIE's PDDs against a birth log independently maintained by the Department of Obstetrics and Gynecology. RESULTS: MADDIE identified 50,560 patients having 63,334 distinct deliveries. MADDIE was 98.6 % accurate (F1-score 92.1 %) when compared to the birth log. The PDD was on average 0.68 days earlier than the true delivery date for patients with only one delivery (± 1.43 days) and 0.52 days earlier for patients with more than one delivery episode (± 1.11 days). DISCUSSION: MADDIE is the first algorithm to successfully infer PDD information using only structured delivery codes and identify multiple deliveries per patient. MADDIE is also the first to validate the accuracy of the PDD using an external gold standard of known delivery dates as opposed to manual chart review of a sample. CONCLUSION: MADDIE augments the EHR with delivery-specific details extracted with high accuracy and relies only on structured EHR elements while harnessing temporal information and the frequency of code usage to identify accurate PDDs.
Authors: Jessica R Meeker; Heather H Burris; Ray Bai; Lisa D Levine; Mary Regina Boland Journal: J Am Med Inform Assoc Date: 2022-01-12 Impact factor: 7.942
Authors: Sara Jones; Katie R Bradwell; Lauren E Chan; Courtney Olson-Chen; Jessica Tarleton; Kenneth J Wilkins; Qiuyuan Qin; Emily Groene Faherty; Yan Kwan Lau; Catherine Xie; Yu-Han Kao; Michael N Liebman; Federico Mariona; Anup Challa; Li Li; Sarah J Ratcliffe; Julie A McMurry; Melissa A Haendel; Rena C Patel; Elaine L Hill Journal: medRxiv Date: 2022-08-06