Yiye Zhang1, Shuojia Wang2, Alison Hermann3, Rochelle Joly4, Jyotishman Pathak5. 1. Division of Health Informatics, Department of Population Health Sciences, Weill Cornell Medicine, New York, NY, USA; Department of Emergency Medicine, Weill Cornell Medicine, New York, NY, USA. Electronic address: yiz2014@med.cornell.edu. 2. School of Public Health, Zhejiang University, HangZhou, Zhejiang, China; Tencent Jarvis Lab, Shenzhen, China. 3. Department of Psychiatry, Weill Cornell Medicine, New York, NY, USA. 4. Department of Obstetrics and Gynecology, Weill Cornell Medicine, New York, NY, USA. 5. Division of Health Informatics, Department of Population Health Sciences, Weill Cornell Medicine, New York, NY, USA; Department of Psychiatry, Weill Cornell Medicine, New York, NY, USA.
Abstract
OBJECTIVE: There is a scarcity in tools to predict postpartum depression (PPD). We propose a machine learning framework for PPD risk prediction using data extracted from electronic health records (EHRs). METHODS: Two EHR datasets containing data on 15,197 women from 2015 to 2018 at a single site, and 53,972 women from 2004 to 2017 at multiple sites were used as development and validation sets, respectively, to construct the PPD risk prediction model. The primary outcome was a diagnosis of PPD within 1 year following childbirth. A framework of data extraction, processing, and machine learning was implemented to select a minimal list of features from the EHR datasets to ensure model performance and to enable future point-of-care risk prediction. RESULTS: The best-performing model uses from clinical features related to mental health history, medical comorbidity, obstetric complications, medication prescription orders, and patient demographic characteristics. The model performances as measured by area under the receiver operating characteristic curve (AUC) are 0.937 (95% CI 0.912 - 0.962) and 0.886 (95% CI 0.879-0.893) in the development and validation datasets, respectively. The model performances were consistent when tested using data ending at multiple time periods during pregnancy and at childbirth. LIMITATIONS: The prevalence of PPD in the study data represented a treatment prevalence and is likely lower than the illness prevalence. CONCLUSIONS: EHRs and machine learning offer the ability to identify women at risk for PPD early in their pregnancy. This may facilitate scalable and timely prevention and intervention, reducing negative outcomes and the associated burden.
OBJECTIVE: There is a scarcity in tools to predict postpartum depression (PPD). We propose a machine learning framework for PPD risk prediction using data extracted from electronic health records (EHRs). METHODS: Two EHR datasets containing data on 15,197 women from 2015 to 2018 at a single site, and 53,972 women from 2004 to 2017 at multiple sites were used as development and validation sets, respectively, to construct the PPD risk prediction model. The primary outcome was a diagnosis of PPD within 1 year following childbirth. A framework of data extraction, processing, and machine learning was implemented to select a minimal list of features from the EHR datasets to ensure model performance and to enable future point-of-care risk prediction. RESULTS: The best-performing model uses from clinical features related to mental health history, medical comorbidity, obstetric complications, medication prescription orders, and patient demographic characteristics. The model performances as measured by area under the receiver operating characteristic curve (AUC) are 0.937 (95% CI 0.912 - 0.962) and 0.886 (95% CI 0.879-0.893) in the development and validation datasets, respectively. The model performances were consistent when tested using data ending at multiple time periods during pregnancy and at childbirth. LIMITATIONS: The prevalence of PPD in the study data represented a treatment prevalence and is likely lower than the illness prevalence. CONCLUSIONS: EHRs and machine learning offer the ability to identify women at risk for PPD early in their pregnancy. This may facilitate scalable and timely prevention and intervention, reducing negative outcomes and the associated burden.
Authors: Molly M Long; Robert J Cramer; Jennika Jenkins; Linda Bennington; James F Paulson Journal: Arch Womens Ment Health Date: 2018-07-02 Impact factor: 3.633
Authors: Benedict Weobong; Augustinus H A ten Asbroek; Seyi Soremekun; Lu Gram; Seeba Amenga-Etego; Samuel Danso; Seth Owusu-Agyei; Martin Prince; Betty R Kirkwood Journal: BMJ Open Date: 2015-08-27 Impact factor: 2.692
Authors: Thomas R Campion; Evan T Sholle; Jyotishman Pathak; Stephen B Johnson; John P Leonard; Curtis L Cole Journal: J Am Med Inform Assoc Date: 2022-03-15 Impact factor: 4.497
Authors: José Alberto Benítez-Andrades; María Teresa García-Ordás; María Álvarez-González; Raquel Leirós-Rodríguez; Ana F López Rodríguez Journal: Digit Health Date: 2022-07-05