| Literature DB >> 34983431 |
Jing Jiao1,2, Yanran Du3, Xiaokang Li1,2, Yi Guo4,5, Yunyun Ren6, Yuanyuan Wang7,8.
Abstract
BACKGROUND: To develop a non-invasive method for the prenatal prediction of neonatal respiratory morbidity (NRM) by a novel radiomics method based on imbalanced few-shot fetal lung ultrasound images.Entities:
Keywords: Class imbalance; Ensemble learning; Fetal lung ultrasound image; Neonatal respiratory distress syndrome; Prenatal ultrasonic diagnosis; Transient tachypnea
Mesh:
Year: 2022 PMID: 34983431 PMCID: PMC8725479 DOI: 10.1186/s12880-021-00731-z
Source DB: PubMed Journal: BMC Med Imaging ISSN: 1471-2342 Impact factor: 1.930
Fig. 1The Workflow of the entire study. Stage I: For each acquired fetal lung ultrasound image, the ROI inside the fetal lung is delineated by one physician and confirmed by another physician. Stage II: 308 radiomics features are extracted in the ROI of each image. Feature selection is performed on these radiomics features to select the most useful features. Stage III: the selected radiomics features are combined with the clinical features as the input to the classifier. With building and comparing classification models with different methods, the best model is finally selected to predict the risk value of NRM
Fig. 2The flowchart of the selection process of the study population. Pregnant women who met the following criteria were enrolled in the study: (1) singleton pregnancy; (2) those with complete medical information who had undergone maternity examination and subsequent delivery in our hospital; (3) fetuses with no known congenital malformation or chromosomal abnormality; (4) those with no diabetes before pregnancy; and (5) those who had not been prescribed steroids before delivery. Finally, a total of 210 singleton pregnant women with 210 fetal lung ultrasound images were enrolled in our study and randomly divided into the training set and test set at a ratio of approximately 8:2. It is worth noting that we kept the same proportion of NRM and normal in both sets. The training set contains 167 images, of which 40 are NRM and 127 are normal. The test set contains 43 images, of which 11 are NRM and 32 are normal
Fig. 3Examples of NRM and normal fetal lung ultrasound images and manual delineation. a An NRM fetal lung ultrasound image. b A normal fetal lung ultrasound image. c and d are the manual delineations of the ROIs
The summary of the feature set we designed for predicting NRM
| Feature type | Feature name | Feature number |
|---|---|---|
| Clinical information | (1) GA, (2) GDM | 2 |
| Greyscale histogram features | (3) Energy, (4) Entropy, (5) Kurtosis, (6) Mean, (7) Median absolute deviation, (8) Median, (9) Range, (10) Uniformity, (11) Variance, (12) Root mean square, (13) Skewness, (14) Deviation, (15) Histogram kurtosis, (16) Histogram mean, (17) Histogram variance, (18) Histogram skewness | 16 |
| ROI textural features | (19) Mean of contrast, (20) SD of contrast, (21) Mean of covariance, (22) SD of covariance, (23) Mean of non-similarity, (24) SD of non-similarity | 6 |
| GLCM textural features | (25) Energy, (26) Entropy, (27) Dissimilarity, (28) Contrast, (29) Inversed difference, (30) Correlation 1, (31) Correlation 2, (32) Homogeneity, (33) Autocorrelation, (34) Cluster shade, (35) Cluster prominence, (36) Maximum probability, (37) Sum of squares, (38) Sum average, (39) Sum variance, (40) Sum entropy, (41) Difference variance, (42) Difference entropy, (43) Information measures of correlation 1, (44) Information measures of correlation 2, (45) Maximal correlation coefficient, (46) Inverse difference normalized, (47) Inverse difference moment normalized | 23 |
| GLRLM textural features | (48) Short-run emphasis, (49) Long-run emphasis, (50) Grey-level non-uniformity, (51) Run length non-uniformity, (52) Run percentage, (53) Low grey-level run emphasis, (54) High grey-level run emphasis, (55) Short-run low grey-level emphasis, (56) Short-run high grey-level emphasis, (57) Long-run low grey-level emphasis, (58) Long-run high grey-level emphasis, (59) Grey-level variance, (60) Run-length variance | 13 |
| GLSZM textural features | (61) Small zone emphasis, (62) Large zone emphasis, (63) Grey-level non-uniformity, (64) Zone size non-uniformity, (65) Zone percentage, (66) Low grey-level zone emphasis, (67) High grey-level zone emphasis, (68) Small zone low grey-level emphasis, (69) Small zone high grey-level emphasis, (70) Large zone low grey-level emphasis, (71) Large zone high grey-level emphasis, (72) Grey-level variance, (73) Zone-size variance | 13 |
| NGTDM textural features | (74) Coarseness, (75) Contrast, (76) Busyness, (77) Complexity, (78) Strength | 5 |
| Wavelet features | (79–154) Approximation, (155–230) Horizontal, (231–306) Vertical, (307–382) Diagonal | 304 |
| Total feature number | 382 |
(1) Clinical information: GA and GDM are strongly correlated with NRM [7, 8]. GA was determined by the last menstrual period and verified by first-trimester dating ultrasound (crown-rump length). According to the presence of GDM during pregnancy, these pregnant women were divided into Yes and No groups
(2) Greyscale histogram features: Describe the greyscale and histogram distribution of the ROI in fetal lung ultrasound images [13]
(3) Textural features: Describe detailed, invisible greyscale changes and associations in fetal lung ultrasound images
(a) ROI textural features: Describe the distribution of greyscale inside the ROI [14]
(b) Grey-level co-occurrence matrix (GLCM) textural features: Describe the specified spatial linear relationship between the frequencies of two greyscale intensities inside the ROI [15]
(c) Grey-level run-length matrix (GLRLM) textural features: Describe the roughness of the texture by calculating the run-length of the collinear image pixels of the same grey-level in a given direction inside the ROI [16, 17]
(d) Grey-level size zone matrix (GLSZM) textural features: Describe the uniformity of the small pixel population of the ROI [15, 18]
(e) Neighbourhood grey-tone difference matrix (NGTDM) textural features: Describe the difference between the greyscale of each image pixel and the greyscale of its neighbours inside the ROI [19]
(4) Wavelet features: Describe information that is not directly reflected by the greyscale and textural features of the original image. Every fetal lung ultrasound image was decomposed into four components: approximate, horizontal, vertical, and diagonal by wavelet transform (first-level decomposition). Then, the 76 features mentioned above were extracted separately on each component. Finally, a total of 304 wavelet features were extracted
Approximate, horizontal, vertical, and diagonal were decomposed from the image by wavelet transform (first-level decomposition)
GA: gestational age, GDM: gestational diabetes mellitus, ROI: region of interest (fetal lung region), SD: standard deviation, GLCM: grey-level co-occurrence matrix, GLRLM: grey-level run-length matrix, GLSZM: grey-level size zone matrix, NGTDM: neighbourhood grey-tone difference matrix
Characteristics of the training set and test set
| Characteristics | Training set ( | Test set ( | ||||
|---|---|---|---|---|---|---|
| Normal | NRM | Normal | NRM | |||
| No. of images | 127 | 40 | – | 32 | 11 | |
| GA* | 36.49 ± 0.85 | 34.37 ± 2.42 | 36.78 ± 1.64 | 34.53 ± 2.37 | ||
| Birth weight (g)* | 3096 ± 385 | 2978 ± 490 | 3145 ± 423 | 3024 ± 540 | ||
| Yes | 48 (37.80%) | 26 (65.00%) | – | 10 (31.25%) | 7 (63.64%) | – |
| No | 79 (62.20%) | 14 (35.00%) | – | 22 (68.75%) | 4 (36.36%) | – |
| 0.35 | 0.94 | |||||
| Spontaneous vaginal delivery | 56 (44.09%) | 21 (52.50%) | – | 15 (46.88%) | 5 (45.45%) | – |
| Caesarean delivery | 71 (55.91%) | 19 (47.50%) | – | 17 (53.12%) | 6 (54.55%) | – |
| 0.87 | 0.43 | |||||
| Female | 59 (46.46%) | 18 (45.00%) | – | 16 (50.00%) | 7 (63.64%) | – |
| Male | 68 (53.54%) | 22 (55.00%) | – | 16 (50.00%) | 4 (36.36%) | – |
| – | – | |||||
| 5 min ≤ 7 | 0(0.00%) | 4 (10.00%) | – | 0 (0.00%) | 0 (0.00%) | – |
| 5 min > 7 | 1 27 (100.00%) | 36 (90.00%) | – | 32 (100.00%) | 11 (100.00%) | – |
The p value < 0.05 is shown in blod
The t test was performed for continuous variables and the χ2 test was performed for categorical variables
GA gestational age, GDM gestational diabetes mellitus
*Data are means ± standard deviations
Feature names and means of the features selected
| Feature name | Mean ± std | |
|---|---|---|
| Normal | NRM | |
| Energy | 0.543 ± 0.070 | 0.551 ± 0.063 |
| Inverse difference moment normalized | 0.999 ± 0.0004 | 0.998 ± 0.0005 |
| High grey-level run emphasis | 298 ± 62.5 | 279 ± 57.0 |
| Run-length variance | (2.04 ± 0.996) × 10−5 | (2.30 ± 0.844) × 10−5 |
| Inverse difference moment normalized of approximation | 0.801 ± 0.113 | 0.773 ± 0.114 |
| Information measure of correlation 1 of approximation | 0.989 ± 0.002 | 0.990 ± 0.002 |
| Energy of horizontal | 0.362 ± 0.036 | 0.374 ± 0.042 |
| Sum entropy of vertical | (4.81 ± 2.87) × 104 | |
| Long-run high grey-level emphasis of vertical | 432 ± 78.4 | 462 ± 95.3 |
| Energy of diagonal | (1.40 ± 0.724) × 103 | (1.20 ± 0.841) × 103 |
Approximate, horizontal, vertical, and diagonal were decomposed from the image by wavelet transform (first-level decomposition)
Fig. 4Box plots of the top 3 features of the 10 selected features. a–c are the box plots of the high grey-level run emphasis, energy of horizontal and long-run high grey-level emphasis of vertical features extracted from the ROIs of the normal and NRM samples. The normal fetal lung has higher mean values for the features of high grey-level run emphasis (298 ± 62.5) and energy of diagonal (1400 ± 724) than the NRM. For the long-run high grey-level emphasis of vertical feature, the mean value of the normal fetal lung is 432, which is smaller than that of the NRM of 462
The classification performance of different modelling methods
| Method | Training set (mean ± std) | Test set | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| bACC | AUC | SENS | SPEC | PPV | NPV | bACC | AUC | SENS | SPEC | PPV | NPV | |
| SVM | 0.66 ± 0.05 | 0.76 ± 0.07 | 0.82 ± . 0.02 | 0.68 | 0.78 | |||||||
| AdaBoost | 0.76 ± 0.14 | 0.72 ± 0.16 | 0.68 ± 0.18 | 0.84 ± 0.09 | 09 ± . 0.09 | 0.73 | 0.79 | 0.55 | 0.91 | 0.68 | 0.85 | |
| Cost-sensitive SVM | 0.73 ± 0.10 | 0.43 ± 0.21 | 0.89 ± 0.09 | 0.61 ± 0.17 | 0.84 ± 0.04 | 0.45 | 0.84 | |||||
| SVM | 0.71 ± 0.17 | 0.79 ± 0.10 | 0.67 ± 0.17 | 0.45 ± 0.05 | 0.88 ± 0.04 | 0.76 | 0.85 | 0.73 | 0.78 | 0.53 | 0.89 | |
| AdaBoost | 0.66 ± 0.14 | 0.71 ± 0.08 | 0.55 ± 0.15 | 0.76 ± 0.07 | 0.85 ± 0.03 | 0.74 | 0.82 | 0.73 | 0.5 | 0.89 | ||
| SMOTEBoost | 0.71 ± 0.11 | 0.52 ± 0.14 | 0.89 ± 0.10 | 0.72 ± 0.18 | 0.85 ± 0.02 | 0.72 | 0.80 | 0.55 | 0.88 | 0.61 | 0.85 | |
| RUSBoost | 0.82 ± . 0.12 | 0.74 + 0.02 | 0.84 | 0.64 | ||||||||
The best results of each metric are shown in bold, and the worst results are shown in italics. Performance evaluation results obtained by bootstrap K-fold cross-validation in the training set
The classification performance of RUSBoost with different features on the original imbalanced few-shot dataset
| Feature | Training set (mean ± std) | Test set | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| bACC | AUC | SENS | SPEC | PPV | NPV | bACC | AUC | SENS | SPEC | PPV | NPV | |
| GA | 0.72 ± 0.11 | 0.60 ± 0.11 | 0.87 ± 0.08 | 0.71 | ||||||||
| GA & GDM | 0.72 ± 0.24 | 0.89 ± 0.15 | 0.64 | 0.85 | ||||||||
| GA, GDM & Radiomics | 0.83 ± 0.13 | 0.82 ± 0.12 | 0.83 | 0.87 | 0.82 | 0.84 | 0.64 | 0.93 | ||||
| GA, GDM & Radiomics | 0.76 ± 0.08 | 0.81 ± 0.09 | 0.70 ± 0.07 | 0.82 ± 0.15 | 0.55 ± 0.04 | 0.89 | 0.82 | 0.63 | ||||
The best results of each metric are shown in bold, and the worst results are shown in italics. Performance evaluation results obtained by bootstrap K-fold cross-validation in the training set.
Fig. 5Examples of the lung region delineations in the lung ultrasound images of a normal fetus and a fetus with NRM. a and b are the irregular and square ROI selection in the ultrasound image of a normal fetus. c and d are the irregular and square ROI selection in the ultrasound image of a fetus with NRM
Fig. 6The confusion matrix and ROC curves tested in the test set with different combinations of features. a and b are confusion matrices of the model using only clinical data. c is confusion matrices of the model using clinical data combined with delineated ROI. d is confusion matrices of the model using clinical data combined with square ROIs. e shows ROC curves and AUC values for different combinations of features
Fig. 7The distribution of the samples. a The sample distribution of the original dataset with terrible class aliasing. b The sample distribution of the balanced dataset augmented by ADASYN
Comparison of our method with previously reported methods
| Method | Size of training set | Test set | ||||
|---|---|---|---|---|---|---|
| bACC | SENS | SPEC | PPV | NPV | ||
| TDxII [ | - | 0.86 | ||||
| Bonet [ | N = 390 (NRM: -) | 0.85 | 0.84 | 0.86 | 0.63 | 0.94 |
| quantusFLM [ | N = 730 (NRM: 13.8%) | 0.51 | 0.96 | |||
| Our method (irregular ROI) | N = 167 (NRM: 24.0%) | 0.83 | 0.82 | 0.84 | 0.64 | |
| Our method (square ROI) | N = 167 (NRM: 24.0%) | 0.82 | 0.63 | |||
The best results of each metric are shown in bold, and the worst results are shown in italics
TDxII, surfactant/albumin ratio, the best index in the report of amniocentesis results
In Bonet's work, SENS, SPEC, PPV, and NPV were calculated at different gestational week groups, and the table shows the mean values