Ivana Marić1, Abraham Tsur2, Nima Aghaeepour3, Andrea Montanari4, David K Stevenson5, Gary M Shaw5, Virginia D Winn6. 1. Department of Pediatrics, Stanford University School of Medicine, Stanford, CA. Electronic address: ivanam@stanford.edu. 2. Department of Obstetrics and Gynecology, Stanford University School of Medicine, Stanford, CA; Department of Obstetrics and Gynecology, The Sheba Medical Center, Tel Hashomer, Israel. 3. Department of Anesthesiology, Perioperative and Pain Medicine, Stanford University School of Medicine, Stanford, CA. 4. Department of Electrical Engineering and Department of Statistics, Stanford University, Stanford, CA. 5. Department of Pediatrics, Stanford University School of Medicine, Stanford, CA. 6. Department of Obstetrics and Gynecology, Stanford University School of Medicine, Stanford, CA.
Abstract
BACKGROUND: Early prediction of preeclampsia is challenging because of poorly understood causes, various risk factors, and likely multiple pathogenic phenotypes of preeclampsia. Statistical learning methods are well-equipped to deal with a large number of variables, such as patients' clinical and laboratory data, and to select the most informative features automatically. OBJECTIVE: Our objective was to use statistical learning methods to analyze all available clinical and laboratory data that were obtained during routine prenatal visits in early pregnancy and to use them to develop a prediction model for preeclampsia. STUDY DESIGN: This was a retrospective cohort study that used data from 16,370 births at Lucile Packard Children Hospital at Stanford, CA, from April 2014 to January 2018. Two statistical learning algorithms were used to build a predictive model: (1) elastic net and (2) gradient boosting algorithm. Models for all preeclampsia and early-onset preeclampsia (<34 weeks gestation) were fitted with the use of patient data that were available at <16 weeks gestational age. The 67 variables that were considered in the models included maternal characteristics, medical history, routine prenatal laboratory results, and medication intake. The area under the receiver operator curve, true-positive rate, and false-positive rate were assessed via cross-validation. RESULTS: Using the elastic net algorithm, we developed a prediction model that contained a subset of the most informative features from all variables. The obtained prediction model for preeclampsia yielded an area under the curve of 0.79 (95% confidence interval, 0.75-0.83), sensitivity of 45.2%, and false-positive rate of 8.1%. The prediction model for early-onset preeclampsia achieved an area under the curve of 0.89 (95% confidence interval, 0.84-0.95), true-positive rate of 72.3%, and false-positive rate of 8.8%. CONCLUSION: Statistical learning methods in a retrospective cohort study automatically identified a set of significant features for prediction and yielded high prediction performance for preeclampsia risk from routine early pregnancy information.
BACKGROUND: Early prediction of preeclampsia is challenging because of poorly understood causes, various risk factors, and likely multiple pathogenic phenotypes of preeclampsia. Statistical learning methods are well-equipped to deal with a large number of variables, such as patients' clinical and laboratory data, and to select the most informative features automatically. OBJECTIVE: Our objective was to use statistical learning methods to analyze all available clinical and laboratory data that were obtained during routine prenatal visits in early pregnancy and to use them to develop a prediction model for preeclampsia. STUDY DESIGN: This was a retrospective cohort study that used data from 16,370 births at Lucile Packard Children Hospital at Stanford, CA, from April 2014 to January 2018. Two statistical learning algorithms were used to build a predictive model: (1) elastic net and (2) gradient boosting algorithm. Models for all preeclampsia and early-onset preeclampsia (<34 weeks gestation) were fitted with the use of patient data that were available at <16 weeks gestational age. The 67 variables that were considered in the models included maternal characteristics, medical history, routine prenatal laboratory results, and medication intake. The area under the receiver operator curve, true-positive rate, and false-positive rate were assessed via cross-validation. RESULTS: Using the elastic net algorithm, we developed a prediction model that contained a subset of the most informative features from all variables. The obtained prediction model for preeclampsia yielded an area under the curve of 0.79 (95% confidence interval, 0.75-0.83), sensitivity of 45.2%, and false-positive rate of 8.1%. The prediction model for early-onset preeclampsia achieved an area under the curve of 0.89 (95% confidence interval, 0.84-0.95), true-positive rate of 72.3%, and false-positive rate of 8.8%. CONCLUSION: Statistical learning methods in a retrospective cohort study automatically identified a set of significant features for prediction and yielded high prediction performance for preeclampsia risk from routine early pregnancy information.
Authors: Shilong Li; Zichen Wang; Luciana A Vieira; Amanda B Zheutlin; Boshu Ru; Emilio Schadt; Pei Wang; Alan B Copperman; Joanne L Stone; Susan J Gross; Yu-Han Kao; Yan Kwan Lau; Siobhan M Dolan; Eric E Schadt; Li Li Journal: NPJ Digit Med Date: 2022-06-06
Authors: Camilo Espinosa; Martin Becker; Ivana Marić; Ronald J Wong; Gary M Shaw; Brice Gaudilliere; Nima Aghaeepour; David K Stevenson Journal: Trends Mol Med Date: 2021-02-08 Impact factor: 15.272
Authors: Mira N Moufarrej; Sevahn K Vorperian; Ronald J Wong; Ana A Campos; Cecele C Quaintance; Rene V Sit; Michelle Tan; Angela M Detweiler; Honey Mekonen; Norma F Neff; Courtney Baruch-Gravett; James A Litch; Maurice L Druzin; Virginia D Winn; Gary M Shaw; David K Stevenson; Stephen R Quake Journal: Nature Date: 2022-02-09 Impact factor: 49.962
Authors: Cecilia Villalaín; Ignacio Herraiz; Paula Domínguez-Del Olmo; Pablo Angulo; José Luis Ayala; Alberto Galindo Journal: Front Cardiovasc Med Date: 2022-07-01
Authors: Kaspar Ratnik; Kristiina Rull; Oliver Aasmets; Triin Kikas; Ele Hanson; Kalle Kisand; Krista Fischer; Maris Laan Journal: Front Cardiovasc Med Date: 2022-07-27
Authors: Seung Mi Lee; Yonghyun Nam; Eun Saem Choi; Young Mi Jung; Vivek Sriram; Jacob S Leiby; Ja Nam Koo; Ig Hwan Oh; Byoung Jae Kim; Sun Min Kim; Sang Youn Kim; Gyoung Min Kim; Sae Kyung Joo; Sue Shin; Errol R Norwitz; Chan-Wook Park; Jong Kwan Jun; Won Kim; Dokyoon Kim; Joong Shin Park Journal: Sci Rep Date: 2022-09-22 Impact factor: 4.996
Authors: Maide Ozen; Nima Aghaeepour; Ivana Marić; Ronald J Wong; David K Stevenson; Lauren L Jantzie Journal: Pediatr Res Date: 2022-10-10 Impact factor: 3.953