| Literature DB >> 32662348 |
Ran Chu1, Wei Chen2, Guangmin Song3, Shu Yao1, Lin Xie1, Li Song1, Yue Zhang1, Lijun Chen1, Xiangli Zhang1, Yuyan Ma1, Xia Luo1, Yuan Liu1, Ping Sun1, Shuquan Zhang1, Yan Fang1, Taotao Dong1, Qing Zhang1, Jin Peng1, Lu Zhang1, Yuan Wei1, Wenxia Zhang1, Xuantao Su2, Xu Qiao2, Kun Song1, Xingsheng Yang1, Beihua Kong1.
Abstract
Background Women with congenital heart disease are considered at high risk for adverse events. Therefore, we aim to establish 2 prediction models for mothers and their offspring, which can predict the risk of adverse events occurred in pregnant women with congenital heart disease. Methods and Results A total of 318 pregnant women with congenital heart disease were included; 213 women were divided into the development cohort, and 105 women were divided into the validation cohort. Least absolute shrinkage and selection operator was used for predictor selection. After validation, multivariate logistic regression analysis was used to develop the model. Machine learning algorithms (support vector machine, random forest, AdaBoost, decision tree, k-nearest neighbor, naïve Bayes, and multilayer perceptron) were used to further verify the predictive ability of the model. Forty-one (12.9%) women experienced adverse maternal events, and 93 (29.2%) neonates experienced adverse neonatal events. Seven high-risk factors were discovered in the maternal model, including New York Heart Association class, Eisenmenger syndrome, pulmonary hypertension, left ventricular ejection fraction, sinus tachycardia, arterial blood oxygen saturation, and pregnancy duration. The machine learning-based algorithms showed that the maternal model had an accuracy of 0.76 to 0.86 (area under the receiver operating characteristic curve=0.74-0.87) in the development cohort, and 0.72 to 0.86 (area under the receiver operating characteristic curve=0.68-0.80) in the validation cohort. Three high-risk factors were discovered in the neonatal model, including Eisenmenger syndrome, preeclampsia, and arterial blood oxygen saturation. The machine learning-based algorithms showed that the neonatal model had an accuracy of 0.75 to 0.80 (area under the receiver operating characteristic curve=0.71-0.77) in the development cohort, and 0.72 to 0.79 (area under the receiver operating characteristic curve=0.69-0.76) in the validation cohort. Conclusions Two prenatal risk assessment models for both adverse maternal and neonatal events were established, which might assist clinicians in tailoring precise management and therapy in pregnant women with congenital heart disease.Entities:
Keywords: congenital heart disease; machine learning; prediction model; pregnancy
Year: 2020 PMID: 32662348 PMCID: PMC7660735 DOI: 10.1161/JAHA.120.016371
Source DB: PubMed Journal: J Am Heart Assoc ISSN: 2047-9980 Impact factor: 5.501
Figure 1Flowchart of the study. (A) datasets; (B) model development and validation; (C) model training and validation based on machine learning. AUC indicates area under the receiver operating characteristic curve; CHD, congenital heart disease; LASSO, least absolute shrinkage and selection operator; NPV, negative predictive value; and PPV, positive predictive value.
Patient’s Characteristics (Before Delivery)
| Characteristic | Total (n=318) | Development Cohort (n=213) | Validation Cohort (n=105) |
|---|---|---|---|
| Age at delivery, y | 27 (16–45) | 26 (16–45) | 29 (19–41) |
| Parity | |||
| 0 | 220 (69.2) | 160 (75.1) | 60 (57.1) |
| ≥1 | 98 (30.8) | 53 (24.9) | 45 (42.9) |
| Cardiac functional status | |||
| NYHA class I–II | 248 (78.0) | 159 (74.6) | 89 (84.8) |
| NYHA class III–IV | 70 (22.0) | 54 (25.4) | 16 (15.2) |
| Maternal congenital lesion | |||
| Atrial septal defect | 123 (38.7) | 76 (35.7) | 47 (44.8) |
| Ventricular septal defect | 101 (31.8) | 75 (35.2) | 26 (24.8) |
| Persistent ductus arteriosus | 28 (8.8) | 20 (9.4) | 8 (7.6) |
| Tetralogy of Fallot | 23 (7.2) | 13 (6.1) | 10 (9.5) |
| Ventricular outflow tract obstruction | 19 (6.0) | 9 (4.2) | 10 (9.5) |
| Other | 24 (7.5) | 20 (9.4) | 4 (3.8) |
| PAH, mm Hg | |||
| PAH <30 | 153 (48.1) | 97 (45.5) | 56 (53.3) |
| 30 ≤PAH <60 | 87 (27.4) | 58 (27.2) | 29 (27.6) |
| 60 ≤PAH <90 | 39 (12.3) | 31 (14.6) | 8 (7.6) |
| 90 ≤PAH | 39 (12.3) | 27 (12.7) | 12 (11.4) |
| Eisenmenger syndrome | 26 (8.2) | 18 (8.5) | 8 (7.6) |
| Cardiac surgery before pregnancy | |||
| Corrected | 111 (34.9) | 72 (33.8) | 39 (37.1) |
| Uncorrected | 207 (65.1) | 141 (66.2) | 66 (62.9) |
| Preeclampsia | 31 (9.7) | 22 (10.3) | 9 (8.6) |
| Gestational diabetes mellitus | 13 (4.1) | 8 (3.8) | 5 (4.8) |
| Mode of delivery | |||
| Vaginal | 32 (10.1) | 21 (9.9) | 11 (10.5) |
| Cesarean | 286 (89.9) | 192 (90.1) | 94 (89.5) |
| Pregnancy duration, wk | |||
| 28 ≤GW <32 | 47 (14.8) | 30 (14.1) | 17 (16.2) |
| 32 ≤GW <36 | 6 (1.9) | 5 (2.3) | 1 (1.0) |
| 36 ≤GW | 265 (83.3) | 178 (83.6) | 87 (82.9) |
| Adverse maternal cardiac event | 41 (12.9) | 29 (13.6) | 12 (11.4) |
| Adverse neonatal event | 93 (29.2) | 63 (29.6) | 30 (28.6) |
Values are median (range) or n (%). GW indicates gestational weeks; NYHA, New York Heart Association; and PAH, pulmonary hypertension.
Ventricular outflow tract obstruction including aortic valve stenosis and pulmonary valve stenosis.
Other including Marfan syndrome, mitral regurgitation, single ventricle, atrioventricular septal defect, congenitally corrected transposition of the great arteries, transposition of the great arteries, and so on.
Figure 2Nomogram lists of the maternal model (A) and neonatal model (B); ROC curves of the maternal model (C) and neonatal model (D). Example of the maternal model in (A): A 37‐week pregnant woman with CHD (0 points) who had NYHA class III (12.5 points) without ES (0 points), and had a PAH of 35 mm Hg (2.5 points) and a left ventricular ejection fraction of 40% (70 points) with symptoms of sinus tachycardia (25 points) and an SaO2 of 98% (1 point) has a total score of 111 points; the corresponding probability of experiencing adverse events in this pregnancy was more than 50%. Example of the neonatal model (B): the pregnant woman described above who did not have ES (0 points) but had preeclampsia (15.5 points) with an SaO2 of 98% (2.5 points) has a total score of 18 points, and the corresponding probability of experiencing adverse neonatal events was more than 60%. AUC indicates area under the receiver operating characteristic curve; CHD, congenital heart disease; ES, Eisenmenger syndrome; GW, gestational week; LVEF, left ventricular ejection fraction; NYHA, New York Heart Association; PAH, pulmonary hypertension; ROC, receiver operating characteristic; and SaO2, arterial blood oxygen saturation.
Prediction of the 2 Models by LR and ML Analysis
| Threshold | AUC | Sensitivity | Specificity | PPV | NPV | Overall Accuracy | |
|---|---|---|---|---|---|---|---|
| Maternal model in development cohort (n=213) | |||||||
| LR | 0.13±0.09 | 0.85±0.14 | 0.80±0.11 | 0.73±0.29 | 0.95±0.05 | 0.42±0.21 | 0.79±0.08 |
| SVM | 0.17±0.05 | 0.85±0.12 | 0.84±0.08 | 0.65±0.27 | 0.94±0.05 | 0.40±0.16 | 0.81±0.08 |
| RF | 0.41±0.05 | 0.81±0.17 | 0.80±0.11 | 0.69±0.24 | 0.95±0.04 | 0.39±0.15 | 0.78±0.09 |
| DT | 0.58±0.05 | 0.74±0.10 | 0.83±0.08 | 0.58±0.20 | 0.93±0.03 | 0.37±0.12 | 0.79±0.07 |
| KNN | 0.33 | 0.77±0.16 | 0.86±0.09 | 0.63±0.31 | 0.94±0.05 | 0.41±0.19 | 0.83±0.08 |
| NB | 0.10±0.30 | 0.87±0.18 | 0.82±0.09 | 0.78±0.32 | 0.96±0.05 | 0.41±0.18 | 0.82±0.08 |
| Ada | 0.49±0.005 | 0.79±0.21 | 0.78±0.09 | 0.68±0.28 | 0.94±0.05 | 0.34±0.12 | 0.76±0.07 |
| MLP | 0.47±0.35 | 0.76±0.17 | 0.91±0.09 | 0.53±0.27 | 0.92±0.04 | 0.59±0.34 | 0.86±0.09 |
| Neonatal model in development cohort (n=213) | |||||||
| LR | 0.35±0.05 | 0.77±0.12 | 0.92±0.06 | 0.51±0.21 | 0.82±0.07 | 0.74±0.22 | 0.79±0.08 |
| SVM | 0.31±0.04 | 0.77±0.12 | 0.92±0.06 | 0.51±0.19 | 0.82±0.07 | 0.75±0.19 | 0.80±0.07 |
| RF | 0.50±0.05 | 0.76±0.11 | 0.92±0.06 | 0.51±0.19 | 0.82±0.07 | 0.75±0.19 | 0.80±0.07 |
| DT | 0.75±0.03 | 0.71±0.10 | 0.92±0.06 | 0.48±0.19 | 0.81±0.07 | 0.73±0.22 | 0.79±0.07 |
| KNN | 0.34±0.08 | 0.75±0.12 | 0.85±0.11 | 0.56±0.23 | 0.83±0.09 | 0.66±0.23 | 0.76±0.10 |
| NB | 0.08±0.22 | 0.74±0.08 | 0.92±0.06 | 0.51±0.19 | 0.82±0.07 | 0.75±0.19 | 0.80±0.07 |
| Ada | 0.50±0.004 | 0.77±0.12 | 0.84±0.10 | 0.59±0.19 | 0.83±0.08 | 0.65±0.20 | 0.77±0.10 |
| MLP | 0.33±0.04 | 0.72±0.17 | 0.85±0.11 | 0.51±0.20 | 0.81±0.08 | 0.63±0.25 | 0.75±0.11 |
| Maternal model in validation cohort (n=105) | |||||||
| LR | 0.80 (0.64–0.97) | 0.78 | 0.67 | 0.95 | 0.29 | 0.77 | |
| SVM | 0.80 (0.64–0.97) | 0.87 | 0.58 | 0.94 | 0.37 | 0.84 | |
| RF | 0.79 (0.64–0.94) | 0.86 | 0.58 | 0.94 | 0.35 | 0.83 | |
| DT | 0.68 (0.53–0.84) | 0.85 | 0.50 | 0.93 | 0.30 | 0.81 | |
| KNN | 0.79 (0.64–0.94) | 0.88 | 0.67 | 0.95 | 0.42 | 0.86 | |
| NB | 0.78 (0.61–0.95) | 0.82 | 0.67 | 0.95 | 0.32 | 0.80 | |
| Ada | 0.78 (0.61–0.95) | 0.73 | 0.67 | 0.94 | 0.24 | 0.72 | |
| MLP | 0.74 (0.59–0.89) | 0.92 | 0.50 | 0.93 | 0.46 | 0.88 | |
| Neonatal model in validation cohort (n=105) | |||||||
| LR | 0.76 (0.66–0.87) | 0.91 | 0.50 | 0.82 | 0.68 | 0.79 | |
| SVM | 0.76 (0.66–0.87) | 0.91 | 0.50 | 0.82 | 0.68 | 0.79 | |
| RF | 0.76 (0.66–0.87) | 0.81 | 0.67 | 0.86 | 0.59 | 0.77 | |
| DT | 0.69 (0.59–0.79) | 0.91 | 0.47 | 0.81 | 0.67 | 0.78 | |
| KNN | 0.75 (0.65–0.86) | 0.75 | 0.67 | 0.85 | 0.51 | 0.72 | |
| NB | 0.76 (0.65–0.86) | 0.91 | 0.50 | 0.82 | 0.68 | 0.79 | |
| Ada | 0.76 (0.65–0.86) | 0.73 | 0.70 | 0.86 | 0.51 | 0.72 | |
| MLP | 0.74 (0.63–0.85) | 0.75 | 0.67 | 0.85 | 0.51 | 0.72 | |
Ada indicates AdaBoost; AUC, area under the receiver operating characteristic curve; DT, decision tree; KNN, k‐nearest neighbor; LR, logistic regression; ML, machine learning; MLP, multilayer perceptron; NB, naïve Bayes; NPV, negative predictive value; PPV, positive predictive value; RF, random forest; and SVM, support vector machine.
Figure 3ROC curves of the LR and ML analysis. (A), Training cohort of the maternal model; (B) validation cohort of the maternal model; (C) training cohort of the neonatal model; (D) validation cohort of the neonatal model. Ada indicates AdaBoost; AUC, area under the receiver operating characteristic curve; DT, decision tree; KNN, k‐nearest neighbor; LR, logistic regression; MLP, multilayer perceptron; NB, naïve Bayes; RF, random forest; ROC, receiver operating characteristic; and SVM, support vector machine.