| Literature DB >> 31940391 |
Varvara Turova1, Irina Sidorenko2, Laura Eckardt3, Esther Rieger-Fackeldey4, Ursula Felderhoff-Müser3, Ana Alves-Pinto1, Renée Lampe1.
Abstract
Intracerebral hemorrhage in preterm infants is a major cause of brain damage and cerebral palsy. The pathogenesis of cerebral hemorrhage is multifactorial. Among the risk factors are impaired cerebral autoregulation, infections, and coagulation disorders. Machine learning methods allow the identification of combinations of clinical factors to best differentiate preterm infants with intra-cerebral bleeding and the development of models for patients at risk of cerebral hemorrhage. In the current study, a Random Forest approach is applied to develop such models for extremely and very preterm infants (23-30 weeks gestation) based on data collected from a cohort of 229 individuals. The constructed models exhibit good prediction accuracy and might be used in clinical practice to reduce the risk of cerebral bleeding in prematurity.Entities:
Mesh:
Year: 2020 PMID: 31940391 PMCID: PMC6961932 DOI: 10.1371/journal.pone.0227419
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Procedure adopted to select data samples of affected neonates to be input into the model.
The procedure considers the day of occurrence of IVH.
| Day of IVH diagnosis | 1 | 2–3 | 4–7 | 8–14 | 15–21 |
|---|---|---|---|---|---|
| 0 | 1 | 3 | 7 | 14 |
Number of observation days and measurement samples per infant for the control and affected groups.
| No IVH | With IVH | |
|---|---|---|
| 13 | 14 | |
| 9.8±0.8 | 2.7±2.8 | |
| 8 | 9.2 | |
| 2.4±1.4 | 2.4±1.5 | |
| 2373 | 801 |
Variables and metrics used in evaluation of model performance.
| Metrics/Variable | Definition |
|---|---|
| Total number of data sets | |
| Number of data sets for infants with IVH | |
| Number of data sets for infants without IVH | |
| Number of data sets classified by the ML-method as of infants with IVH | |
| Number of data sets classified by the ML-method as of infants without IVH | |
| c1true × c1method/n | |
| c2true × c2method/n | |
| (n1 + n2)/n | |
| Number of correctly identified data sets/n | |
| (OA-EA)/(1-EA) | |
| Mean value of OA | |
| Number of correctly identified data sets for infants with IVH | |
| Number of correctly identified data sets for infants without IVH | |
| n1true/c1true | |
| n2true/c2true |
Comparison of variable means for the control and affected groups.
For the affected group only records taken before hemorrhage were considered for the model.
| Variable | Extremely preterm (23–26 WG), n = 137 | Very preterm (27–30 WG), n = 92 | ||||
|---|---|---|---|---|---|---|
| Control | Affected | p-value | Control | Affected | p-value | |
| 5.57±1.93 | 5.45±1.69 | 0.057 | 6.41±2.04 | 5.85±2.03 | <0.001 | |
| 6.94±1.60 | 7.05±1.52 | 0.059 | 7.86±1.13 | 7.30±1.24 | <0.001 | |
| 7.98±1.31 | 7.92±0.91 | <0.001 | 8.54±0.57 | 8.00±0.71 | <0.001 | |
| 0.442±0.078 | 0.422±0.078 | <0.001 | 0.467±0.080 | 0.458±0.088 | 0.495 | |
| 198.48±80.48 | 180.28±75.00 | 0.002 | 206.21±90.9 | 165.82±78.4 | <0.001 | |
| 20.27±17.62 | 11.28±8.43 | <0.001 | 9.12±5.58 | 10.84±9.41 | 0.128 | |
| 0.448±0.506 | 0.825±0.942 | <0.001 | 0.646±0.828 | 1.55±3.705 | 0.007 | |
| 7.28±0.07 | 7.25±0.09 | <0.001 | 7.32±0.06 | 7.27±0.09 | <0.001 | |
| 43.07±12.52 | 47.73±14.39 | <0.001 | 42.16±9.37 | 42.26±10.5 | 0.722 | |
| 91.56±4.39 | 90.27±4.33 | <0.001 | 94.23±5.76 | 89.87±6.74 | <0.001 | |
| 47.65±9.49 | 48.40±11.18 | 0.389 | 44.09±7.61 | 48.65±9.05 | <0.001 | |
| 36.80±8.96 | 30.50±8.15 | <0.001 | 40.45±8.58 | 37.16±7.58 | <0.001 | |
| 10.94±5.43 | 8.89±4.79 | <0.001 | 15.45±5.33 | 17.95±7.57 | <0.001 | |
ap-value for the null hypothesis that the group means for the control and affected groups are equal.
Fig 1Optimal number of features in RF-models for extremely preterm infants with 23 to 26 WG.
Fig 2Optimal number of features in RF-models for very preterm infants with 27 to 30 WG.
Fig 3Variable importance plot for extremely preterm infants with 23 to 26 WG.
Fig 4Variable importance plot for very preterm infants with 27 to 30 WG.
Performance results for extremely preterm newborns.
| Model variables | Mean accuracy | Mean kappa | Accuracy of prediction | 95% CI | No information rate | p-value | Sensitivity | Specificity |
|---|---|---|---|---|---|---|---|---|
| 0.877 | 0.735 | 0.89 | (0.795, 0.952) | 0.644 | <0.001 | 0.923 | 0.872 | |
| 0.910 | 0.807 | 0.918 | (0.83, 0.969) | 0.644 | <0.001 | 0.923 | 0.915 | |
| 0.913 | 0.812 | 0.959 | (0.885, 0.991) | 0.644 | <0.001 | 0.923 | 0.979 | |
| 0.893 | 0.743 | 0.918 | (0.804, 0.977) | 0.714 | <0.001 | 0.929 | 0.914 | |
| 0.861 | 0.664 | 0.918 | (0.804, 0.977) | 0.714 | 0.005 | 1.0 | 0.886 | |
| 0.879 | 0.747 | 0.872 | (0.743, | 0.617 | <0.001 | 0.889 | 0.862 | |
| 0.889 | 0.756 | 0.883 | (0.774, | 0.667 | <0.001 | 0.9 | 0.875 |
Performance results for very preterm newborns.
| Model variables | Mean accuracy | Mean kappa | Accuracy of prediction | 95% CI | No information rate | p-value | Sensitivity | Specificity |
|---|---|---|---|---|---|---|---|---|
| 0.898 | 0.76 | 0.972 | (0.855, 0.999) | 0.694 | 3e-05 | 1.0 | 0.96 | |
| 0.9 | 0.77 | 0.944 | (0.813, | 0.694 | 0.0003 | 1.0 | 0.92 | |
| 0.879 | 0.72 | 0.889 | (0.739, 0.969) | 0.694 | 0.006 | 0.818 | 0.92 | |
| 0.892 | 0.752 | 0.944 | (0.813, 0.993) | 0.694 | 0.0003 | 1.0 | 0.92 | |
| 0.891 | 0.765 | 0.909 | (0.708, 0.989) | 0.636 | 0.004 | 0.875 | 0.929 | |
| 0.864 | 0.676 | 0.931 | (0.772, | 0.69 | 0.002 | 0.889 | 0.95 |
Fig 5Probability density functions for pO-records of affected (pink curves) and control (blue curves) patients. Left panel: Extremely preterm infants. Right panel: Very preterm infants.
Fig 6Probability density functions for pCO-records of affected (pink curves) and control (blue curves) patients. Left panel: Extremely preterm infants. Right panel: Very preterm infants.
Obstetric characteristics of the cohort.
| Parameter | No IVH | With IVH | p-value |
|---|---|---|---|
| 26.6±2.1 | 26.3±2.0 | 0.19 | |
| 853±249 | 881±298 | 0.58 | |
| 43.2 | 56.4 | 0.06 | |
| 36.9 | 38.9 | 0.79 | |
| 6.3 | 9.3 | 0.46 | |
| 31.5 | 22.9 | 0.18 |
ap-value based on the two-sided Wilcoxon rank-sum test for the null hypothesis that the group means for the two groups are equal.