| Literature DB >> 35984833 |
Jianfei Song1, Zhenyu Li2, Guijin Yao1, Songping Wei1, Ling Li1, Hui Wu2.
Abstract
Neonatal necrotizing enterocolitis (NEC) occurs worldwide and is a major source of neonatal morbidity and mortality. Researchers have developed many methods for predicting NEC diagnosis and prognosis. However, most people use statistical methods to select features, which may ignore the correlation between features. In addition, because they consider a small dimension of characteristics, they neglect some laboratory parameters such as white blood cell count, lymphocyte percentage, and mean platelet volume, which could be potentially influential factors affecting the diagnosis and prognosis of NEC. To address these issues, we include more perinatal, clinical, and laboratory information, including anemia-red blood cell transfusion and feeding strategies, and propose a ridge regression and Q-learning strategy based bee swarm optimization (RQBSO) metaheuristic algorithm for predicting NEC diagnosis and prognosis. Finally, a linear support vector machine (linear SVM), which specializes in classifying high-dimensional features, is used as a classifier. In the NEC diagnostic prediction experiment, the area under the receiver operating characteristic curve (AUROC) of dataset 1 (feeding intolerance + NEC) reaches 94.23%. In the NEC prognostic prediction experiment, the AUROC of dataset 2 (medical NEC + surgical NEC) reaches 91.88%. Additionally, the classification accuracy of the RQBSO algorithm on the NEC dataset is higher than the other feature selection algorithms. Thus, the proposed approach has the potential to identify predictors that contribute to the diagnosis of NEC and stratification of disease severity in a clinical setting.Entities:
Mesh:
Year: 2022 PMID: 35984833 PMCID: PMC9390903 DOI: 10.1371/journal.pone.0273383
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.752
Relevant studies involving ML methods for NEC diagnosis and prognosis.
| Author | Number of features in use | classifier | AUROC |
|---|---|---|---|
|
| |||
| Pantalone, J. M. et al. | 14 | RF | 87.7% |
| Lure, A. C. et al. | 16 | RF | 98% |
| Jaskari, J. et al. | 14 | RF | 80.6% |
| Gao, W. J. et al. | 23 | GBM | 93.37% |
|
| |||
| Ji, J. et al. | 9 | LDA | 84.38% |
| Sylvester, K. G. et al. | 27 | LDA | 81.7% |
| Pantalone, J. M. et al. | 14 | RF | 75.9% |
| Gao, W. J. et al. | 23 | GBM | 94.13% |
Abbreviations: RF, random forest; GBM, light gradient boosting machine; LDA, linear discriminant analysis.
Fig 1The flowchart of the proposed method.
Fig 2The structure of the used RQBSO algorithm.
Fig 3The pseudocode of the used RQBSO algorithm.
Fig 4(a) solutions generated by the first strategy, (b) solutions generated by the second strategy.
Main perinatal and clinical characteristics of two datasets.
| Dataset 1 (n = 447) | Dataset 2 (n = 296) | |||
|---|---|---|---|---|
| FI (n = 151) | NEC (n = 296) | Medical NEC (n = 205) | Surgical NEC (n = 91) | |
|
| ||||
| GA (median [IQR], weeks) | 31.71[30.14–33.85] | 31.71[30.00–34.00] | 32.00[30.50–34.29] | 31.00[28.86–33.71] |
| BW (median [IQR], g) | 1660[1320–1920] | 1600[1100–1790] | 1660[1400–2100] | 1450[1200–1850] |
| Female (n [%]) | 48[47.1] | 59[48] | 91 [44.4] | 42[46.2] |
| BW for GA | ||||
| SGA (n [%]) | 10[6.6] | 41[13.9] | 30[14.6] | 11[12.1] |
| AGA (n [%]) | 137[90.7] | 250[84.5] | 173[84.4] | 77[84.6] |
| LGA (n [%]) | 4[2.7] | 5[1.6] | 2[1.0] | 3[3.3] |
| Vaginal delivery (n [%]) | 72[47.7] | 127[42.9] | 81[39.5] | 43[47.3] |
| Apgar 1-min (median [IQR]) | 7[6–8] | 7[6–8] | 7[6–8] | 7[5–8] |
| Apgar 5-min (median [IQR]) | 8[8–9] | 9[8–9] | 9[8–9] | 8[7–9] |
| PPROM (n [%]) | 47[31.1] | 98[33.1] | 70[34.1] | 28[30.8] |
| Corrected GA at clinical onset (median [IQR], weeks) | 34.43[33.14–35.86] | 34.07[32.61–35.86] | 34.14[32.71–36.00] | 34.00[32.29–35.71] |
|
| ||||
| Early Use of Antibiotics | 95[62.9] | 172[58.1] | 111[54.1] | 61[67.0] |
| MV (n [%]) | 73[48.3] | 165[55.7] | 94[45.9] | 71[78.0] |
| PDA (n [%]) | 93[61.6] | 200[67.6] | 136[66.3] | 64[70.3] |
| IVH (n [%]) | 62[41.1] | 68[23.0] | 38[18.5] | 30[33.0] |
| Infectious diseases (n [%]) | 60[39.7] | 107[36.1] | 64[31.2] | 43[47.3] |
| Anemia-RBC transfusion | ||||
| Not anemia (n [%]) | 94[63.6] | 203[68.6] | 166[81.0] | 37[40.7] |
| Anemia-not transfusion (n [%]) | 25[16.6] | 28[9.5] | 11[5.4] | 17[18.6] |
| Anemia-transfusion (n [%]) | 32[19.8] | 65[21.9] | 28[13.6] | 37[40.7] |
|
| ||||
| Type of milk | ||||
| human milk (n [%]) | 63[41.7] | 54[18.2] | 38[18.5] | 16[17.6] |
| Formula milk (n [%]) | 59[39.1] | 158[53.4] | 110[53.7] | 48[52.7] |
| Combination (n [%]) | 29[19.2] | 84[28.4] | 57[27.8] | 27[29.7] |
| HMF | 44[29.1] | 29[9.8] | 19[9.3] | 10[11.0] |
| Enteral nutrition start | ||||
| Quick (n [%]) | 124[82.1] | 221[74.7] | 160[78.0] | 61[67.0] |
| Medium (n [%]) | 25[16.6] | 59[19.9] | 32[15.6] | 27[29.7] |
| Slow (n [%]) | 2[1.3] | 16[5.4] | 13[6.4] | 3[3.3] |
| daily milk increment | ||||
| Quick (n [%]) | 53[35.1] | 73[24.7] | 58[28.3] | 15[16.5] |
| Slow (n [%]) | 98[64.9] | 223[75.3] | 147[71.7] | 76[83.5] |
| Probiotics | 119[78.8] | 124[41.9] | 75[36.6] | 49[53.8] |
|
| ||||
| Bowel sound attenuation | 60[39.7] | 182[61.5] | 121[59.0] | 61[67.0] |
| bloody stools | 81[53.6] | 105[35.5] | 75[36.6] | 30[33.0] |
| gastric residual | 39[25.8] | 141[47.6] | 97[47.3] | 44[48.4] |
| abdominal distension | 55[36.4] | 160[54.1] | 91[44.4] | 69[75.8] |
|
| ||||
| WBC at birth | 8.87[2.48–44.36] | 10.91[3.48–52.29] | 11.12[3.48–39.46] | 10.38[4.20–52.29] |
| NEUT% at birth | 0.57[0.12–0.90] | 0.58[0.06–0.93] | 0.58[0.06–0.93] | 0.57[0.15–0.84] |
| LY% at birth | 0.32[0.08–0.80] | 0.33[0.03–0.90] | 0.34[0.03–0.90] | 0.33[0.06–0.74] |
| MO% at birth | 0.08[0.01–0.18] | 0.07[0–0.22] | 0.06[0.00–0.19] | 0.07[0.00–0.22] |
| NEUT# at birth | 4.78[0.5–32.4] | 5.94[0.31–43.26] | 6.04[0.31–35.00] | 5.46[1.17–43.26] |
| LY# at birth | 2.94[0.93–15.93] | 3.4[0.7–30.67] | 3.40[0.70–30.67] | 3.46[0.79–29.90] |
| MO# at birth | 0.67[0.05–5.90] | 0.67[0–4.97] | 0.62[0.00–3.40] | 0.79[0.01–4.97] |
| RBC at birth | 4.61[2.71–6.26] | 4.57[2.54–6.13] | 4.58[2.54–6.13] | 4.43[3.07–5.92] |
| HGB at birth | 172[99–237] | 172[86–226] | 173[86–226] | 170[117–220] |
| HCT at birth | 51.4[29.6–69.3] | 51.4[29–69.3] | 51.7[29.0–69.3] | 50.4[33.6–67.6] |
| MCV at birth | 111.9[98.6–129.3] | 112.9[79.2–132.9] | 112.4[97.0–209.0] | 114.4[79.2–130.6] |
| MCH at birth | 37.8[32.8–44.1] | 37.9[15.6–44.7] | 37.8[15.6–44.7] | 38.0[25.8–43.6] |
| RDW at birth | 16.45[13.9–21.1] | 16.6[13.1–26.9] | 16.7[13.1–26.9] | 16.3[13.4–25.3] |
| PLT at birth | 227[116–406] | 219[42–509] | 218[42–509] | 220[69–460] |
| PCT at birth | 0.23[0.11–0.41] | 0.23[0.09–0.55] | 0.23[0.09–0.55] | 0.24[0.10–0.46] |
| MPV at birth | 10.2[9.2–11.8] | 10.7[8.5–13.0] | 10.6[8.5–13.0] | 11.0[8.9–12.4] |
| PDW at birth | 11.1[9.3–14.6] | 11.9[8.4–18.9] | 11.8[8.4–18.9] | 12.0[8.6–15.6] |
| WBC at clinical onset | 9.71[3.44–25.37] | 9.42[0.95–48.85] | 9.72[2.07–48.85] | 8.64[0.95–27.79] |
| NEUT% at clinical onset | 0.41[0.12–0.84] | 0.61[0.14–0.91] | 0.60[0.14–0.88] | 0.62[0.18–0.91] |
| LY% at clinical onset | 0.43[0.10–0.73] | 0.27[0.06–0.73] | 0.27[0.06–0.71] | 0.26[0.07–0.73] |
| MO% at clinical onset | 0.09[0.01–0.24] | 0.08[0–0.58] | 0.08[0.00–0.58] | 0.07[0.00–0.26] |
| NEUT# at clinical onset | 3.90[0.94–16.76] | 5.61[0.39–43.02] | 5.77[0.51–43.02] | 5.02[0.39–23.50] |
| LY# at clinical onset | 3.97[0.72–8.62] | 2.47[0.06–9.53] | 2.57[0.21–7.86] | 2.25[0.06–9.53] |
| MO# at clinical onset | 0.86[0.04–3.37] | 0.68[0.01–4.43] | 0.74[0.01–4.43] | 0.54[0.05–3.69] |
| RBC at clinical onset | 3.71[2.29–5.50] | 3.86[2.41–6.08] | 3.87[2.50–6.08] | 3.74[2.41–5.03] |
| HGB at clinical onset | 125[77–180] | 135[77–310] | 136[77–310] | 126[86–185] |
| HCT at clinical onset | 37.4[23.0–51.4] | 39.6[23.7–63.0] | 40.3[23.7–63.0] | 38.4[25.3–55.4] |
| MCV at clinical onset | 101.75[83.80–113.20] | 102.65[83.60–122.80] | 103.3[85.1–122.8] | 101.2[83.6–119.4] |
| MCH at clinical onset | 34.6[28.1–39.7] | 34.9[26.7–41.0] | 35.3[26.7–41.0] | 34.0[27.0–40.6] |
| RDW at clinical onset | 15.9[13.2–20.8] | 16.01[10.30–24.30] | 16.0[10.4–24.3] | 16.3[10.3–22.4] |
| PLT at clinical onset | 317.5[105.0–823.0] | 261.5[4.0–799.0] | 257[5–609] | 272[4–799] |
| PCT at clinical onset | 0.36[0.15–0.85] | 0.32[0.01–0.91] | 0.31[0.11–0.68] | 0.33[0.01–0.91] |
| MPV at clinical onset | 11.2[9.2–13.2] | 12[9–14] | 11.96[9.50–14.00] | 12[9–14] |
| PDW at clinical onset | 13.0[8.9–20.3] | 14.2[9.6–23.0] | 14.2[9.8–23.0] | 14.5[9.6–22.8] |
| WBC change | 0.01[-0.72, 2.44] | -0.12[-0.92, 5.82] | -0.09[-0.92, 5.82] | -0.18[-0.92, 2.53] |
| NEUT% change | -0.28[-0.76, 6.00] | 0.07[-0.83, 11.81] | 0.05[-0.83, 11.81] | 0.12[-0.78, 4.06] |
| LY% change | 0.25[-0.82, 5.37] | -0.19[-0.86, 11.96] | -0.19[-0.86, 11.96] | -0.19[-0.86, 7.10] |
| MO% change | 0.17[-0.90, 17.82] | 0.25[-1.00, 800.00] | 0.33[-1.00, 800.00] | 0.18[-1.00, 500.00] |
| NEUT# change | -0.15[-0.92, 10.62] | -0.12[-0.94, 35.09] | -0.10[-0.94, 35.09] | -0.15[-0.94, 4.98] |
| LY# change | 0.25[-0.77, 6.16] | -0.27[-0.98, 3.99] | -0.24[-0.98, 2.81] | -0.35[-0.97, 3.99] |
| MO# change | 0.29[-0.95, 11.27] | 0.03[-0.99, 6400.00] | 0.14[-0.99, 6400.00] | -0.20[-0.96, 73.27] |
| RBC change | -0.18[-0.55, 0.31] | -0.13[-0.42, 0.57] | -0.13[-0.42, 0.49] | -0.15[-0.42, 0.57] |
| HGB change | -0.26[-0.63, 0.18] | -0.20[-0.55, 1.40] | -0.19[-0.48, 1.40] | -0.25[-0.55, 0.33] |
| HCT change | -0.25[-0.63, 0.15] | -0.21[-0.50, 0.36] | -0.20[-0.50, 0.36] | -0.25[-0.50, 0.30] |
| MCV change | -0.09[-0.27, -0.01] | -0.08[-0.33, 0.23] | -0.07[-0.49, 0.11] | -0.10[-0.28, 0.23] |
| MCH change | -0.07[-0.29, 0.03] | -0.06[-0.33, 1.43] | -0.06[-0.33, 1.43] | -0.09[-0.31, 0.25] |
| RDW change | -0.03[-0.21, 0.23] | -0.04[-0.39, 0.47] | -0.04[-0.39, 0.43] | -0.01[-0.37, 0.47] |
| PLT change | 0.46[-0.62, 2.38] | 0.20[-0.97, 4.21] | 0.18[-0.97, 4.21] | 0.23[-0.97, 1.93] |
| PCT change | 0.64[-0.35, 2.55] | 0.40[-0.95, 2.33] | 0.40[-0.65, 2.33] | 0.41[-0.95, 2.00] |
| MPV change | 0.09[-0.16, 0.25] | 0.09[-0.14, 0.34] | 0.09[-0.09, 0.34] | 0.09[-0.10, 0.30] |
| PDW change | 0.16[-0.18, 0.81] | 0.20[-0.25, 0.97] | -0.16[-0.45, 0.63] | -0.14[-0.41, 0.53] |
Abbreviations: BW, birth weight; FPIES, Food protein-induced enterocolitis; GA, gestational age; MV, mechanical ventilation; HMF, human milk fortifier; PPROM, Preterm premature rupture of membranes; PDA, patent ductus arteriosus; IVH, intraventricular hemorrhage; IQR, interquartile range; RBC, red blood cell.
aAnemia is determined based on the hemoglobin concentration, the days after birth, the respiratory status and clinical manifestations based on the recommendations of Canadian Pediatric Society; The usual volume of transfusion was 10 to 20 ml kg−1 over 3 to 5 h and feeding volumes are routinely decreased during transfusions.
bSlow, never start or start later than postnatal day 4; Medium, start on postnatal day 3 or 4; Quick, start within postnatal day 2.
cSlow, the daily milk increment is less than 20 ml per kilogram of body weight until reaching full feeding volumes; quick, more than 20 ml per kilogram of body weight.
dlaboratory values change is percentage change of each indicator at clinical onset compared with those at birth.
Hyper-parameters used by RQBSO algorithm.
| Parameter | value | |
|---|---|---|
|
| alphas | 503.15 |
|
| flip | 5 |
| nBees | 10 | |
| maxIteration | 10 | |
| localIteration | 10 | |
|
| γ | 0.1 |
Fig 5Comparison of ROC and PRC curve of RQBSO and other algorithms.
(a, b) correspond to the ROC curve of dataset 1 and dataset 2. The numbers in parentheses indicate the AUROC value. The x-axis represents sensitivity, or true positive rate (TPR). The y-axis is 1-Specificity, or false positive rate (FPR). (c, d) represents the PRC curve of dataset 1 and dataset 2. The numbers in parentheses indicate the AUPRC value. The x-axis represents recall. The y-axis is precision.
The performance comparison of different feature selection models.
| RQBSO | mRMR | ReliefF | GA | BSO | RFE | LASSO | Ridge | |
|---|---|---|---|---|---|---|---|---|
|
| ||||||||
| Acc (%) |
| 82.88 | 82.87 | 85.43 | 85.72 | 86.53 | 85.67 | 84.57 |
| Rec (%) |
| 86.26 | 88.02 | 89.86 | 94.27 | 89.82 | 89.39 | 89.39 |
| Pre (%) |
| 86.76 | 85.48 | 87.55 | 87.46 | 89.11 | 88.22 | 86.80 |
| F1-Score (%) |
| 86.36 | 86.69 | 88.57 | 89.03 | 89.41 | 88.75 | 88.00 |
|
| ||||||||
| Acc (%) |
| 75.40 | 76.76 | 75.53 | 81.36 | 75.02 | 77.19 | 73.37 |
| Rec (%) | 68.93 | 47.68 | 50.89 | 39.82 |
| 43.57 | 49.64 | 37.50 |
| Pre (%) |
| 70.95 | 68.71 | 70.05 | 89.31 | 65.12 | 72.25 | 65.82 |
| F1-Score (%) |
| 53.97 | 56.19 | 45.66 | 63.05 | 50.31 | 55.59 | 43.82 |
Feature importance ranking of dataset 1.
| Rank | Feature | Importance score |
|---|---|---|
| 1 | Placenta abnormalities | 0.041254 |
| 2 | PDW at birth | 0.041254 |
| 3 | Type of milk | 0.040842 |
| 4 | LY% change | 0.040842 |
| 5 | Signs of peritoneal irritation | 0.040429 |
| 6 | Feeding volume at NEC onset | 0.039604 |
| 7 | Drowsiness | 0.039191 |
| 8 | NEUT% at clinical onset | 0.039191 |
| 9 | Meconium amniotic fluid | 0.038366 |
| 10 | Probiotics | 0.037954 |
| 11 | Early onset sepsis | 0.036716 |
| 12 | Acidosis | 0.036716 |
| 13 | HCT at clinical onset | 0.036304 |
| 14 | PDA | 0.035891 |
| 15 | Daily milk increment | 0.034653 |
| 16 | WBC change | 0.034653 |
| 17 | Gastric residual | 0.034241 |
| 18 | PS | 0.031766 |
| 19 | Inotropic | 0.030941 |
| 20 | Abdominal distension | 0.030528 |
| 21 | LY# change | 0.030116 |
| 22 | LY# at clinical onset | 0.029703 |
| 23 | MO# at birth | 0.028878 |
| 24 | MO% at birth | 0.028053 |
| 25 | DIC | 0.027640 |
| 26 | MCH at clinical onset | 0.025578 |
| 27 | NEUT# change | 0.025165 |
| 28 | LY% at birth | 0.021865 |
| 29 | Temperature instability | 0.021040 |
| 30 | Bloody stools | 0.020627 |
Feature importance ranking of dataset 2.
| Rank | Feature | Importance score |
|---|---|---|
| 1 | Anemia-RBC transfusion | 0.069979 |
| 2 | Signs of peritoneal irritation | 0.069979 |
| 3 | Acidosis | 0.069279 |
| 4 | Tachycardia | 0.068579 |
| 5 | WBC change | 0.068579 |
| 6 | LY% at birth | 0.066480 |
| 7 | WBC at clinical onset | 0.065780 |
| 8 | Early onset sepsis | 0.061582 |
| 9 | Apgar 5-min | 0.059482 |
| 10 | PICC | 0.052484 |
| 11 | Total number of RBC transfusions | 0.049685 |
| 12 | MCH at clinical onset | 0.049685 |
| 13 | Postnatal age at clinical onset | 0.047586 |
| 14 | PLT change | 0.045486 |
| 15 | Caffeine | 0.044787 |
| 16 | Para | 0.032190 |
| 17 | NEUT# at clinical onset | 0.029391 |
| 18 | Fever | 0.025192 |
| 19 | MCV at clinical onset | 0.023793 |
Fig 6Comparison of ROC and PRC curve of different classifiers.
(a, b) correspond to the ROC curve of dataset 1 and dataset 2. The numbers in parentheses indicate the AUROC value. The x-axis represents sensitivity, or true positive rate (TPR). The y-axis is 1-Specificity, or false positive rate (FPR). (c, d) represents the PRC curve of dataset 1 and dataset 2. The numbers in parentheses indicate the AUPRC value. The x-axis represents recall. The y-axis is precision.