| Literature DB >> 31888592 |
Zhidong Zhao1,2, Yanjun Deng3, Yang Zhang4, Yefei Zhang3, Xiaohong Zhang3, Lihuan Shao3.
Abstract
BACKGROUND: Fetal heart rate (FHR) monitoring is a screening tool used by obstetricians to evaluate the fetal state. Because of the complexity and non-linearity, a visual interpretation of FHR signals using common guidelines usually results in significant subjective inter-observer and intra-observer variability.Entities:
Keywords: Computer aided diagnosis system; Continuous wavelet transform; Convolutional neural network; Fetal acidemia; Fetal heart rate
Mesh:
Year: 2019 PMID: 31888592 PMCID: PMC6937790 DOI: 10.1186/s12911-019-1007-5
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Summary of related works conducted for the intelligent assessment of the fetal state using FHR signals obtained from CTG
| Author | Database | Distribution (N/P) | Method | Performance(%) | ||
|---|---|---|---|---|---|---|
| Feature extraction | Feature selection | Classifier | ||||
| Krupa et al. 2011 [ | Private | 30/60 | EMD | / | SVM | Acc:87 Se:95 Sp:70 |
| Spilka et al.2012 [ | Private | 123/94 | 33 Set1, Set2, Set3 | PCA,IG | NB,SVM,DT | Se:73.4 Sp:76.3 Fm:71.5 |
| Czabanski et al. 2012 [ | Private | 146/43 | 7 Set1 | / | WFS+ LS-SVM | Acc:92.0 QI:88.2 |
| Fanelli et al. 2013 [ | Private | 61/61 | 2 Set3 | / | ST | AUC:75 |
| Xu et al. 2014 [ | Private | 255/255 | 64 Set1, Set2, Set3 | GA | SVM | Se:83 Sp:66 AUC:74 |
| Dash et al. 2014 [ | Private | 60/23 | 8 Set1 | / | GM,NB | Se: 61 Sp:82 |
| Spilka et al. 2014 [ | CTU-UHB | 175/377 | 33 Set1,Set2, Set3 | / | LCA + RF | Se:72 Sp:78 |
| Doret et al. 2015 [ | Private | 30/15 | 12 Set2, Set3 | / | ST | AUC:87 |
| Comert et al. 2016 [ | CTU-UHB | 60/40 | 18 Set1, Set2 | / | ANN | Acc: 87.0 Se:88.7 Sp:85.1 |
| Stylios et al. 2016 [ | CTU-UHB | 508/44 | 54 Set1, Set2, Set3 | AUC | LS-SVM | Se:68.5 Sp:77.7 |
| Comert et al. 2016 [ | CTU-UHB | 272/280 | 11 Set2, Set3 | / | ANN | Acc: 92.40 Se:95.89 Sp:74.75 |
| Georgoulas et al. 2017 [ | CTU-UHB | 508/44 | 33 Set1, Set2, Set3 | AUC | LS-SVM | Se:72.12 Sp:65.30 |
| Comert et al. 2018 [ | CTU-UHB | 439/113 | IBTF | GA/ | LS-SVM | Se:63.45 Sp:65.88 |
| Li et al. 2018 [ | Private | 3012/1461 | FHR + 1D CNN | Acc:93.24 | ||
| Comert et al. 2018 [ | CTU-UHB | 508/44 | STFT+2D CNN | Se:56.15 Sp:96.51 QI:73.61 | ||
Note: The best performance is indicated in bold
Fig. 1An overview of our proposed CAD system for intelligent prediction of fetal acidemia
An overview of the available information in the open access CTU-UHB CTG database
| Information | Mean | Min | Max |
|---|---|---|---|
| Maternal age (MA, year) | 29.6 | 18 | 46 |
| Gestational age (GA, week) | 40.0 | 37 | 43 |
| pH | 7.23 | 6.85 | 7.47 |
| Base deficit in extracelluar fluid (BDecf, mmol/L) | 4.60 | −3.40 | 26.11 |
| pCO2 | 7.07 | 0.70 | 12.30 |
| Base excess (BE) | −6.38 | −26.80 | −0.20 |
| Apgar 1 min | 8.3 | 1 | 10 |
| Apgar 5 min | 9.1 | 4 | 10 |
| Gravidity | 1.4 | 1 | 11 |
| Parity | 0.4 | 0 | 7 |
| Diabetes | No = 515, Yes = 37 | ||
| Birth weight (BW, g) | 3401 | 1970 | 4750 |
| Infant sex | Male = 286, Female = 266 | ||
| Delivery type | Vaginal = 506, Cesarean section = 46 | ||
Fig. 2Signal preprocessing of No.1001 FHR recording (internal database number)
Fig. 3The FHR signals (left) and corresponding time-frequency images (right) of the normal (top) and pathological (bottom) classes using the CWT with the mother wavelet of db2 and a wavelet scale of 24
Fig. 4The CNN architecture proposed in this work. Note: L = layer; FM = output feature map or number of neurons (width ×height ×depth)
The detailed parameter settings for each layer of the proposed CNN model
| Layer | Type | Parameter/Method | Value/Approach |
|---|---|---|---|
| 1 | Image input layer | Data augmentation | Random crop |
| Data normalization | Zero center | ||
| 2 | Convolution layer | Stride | [ |
| Padding | 0 | ||
| Learning rate of the weight | 1 | ||
| Learning rate of the bias | 1 | ||
| L2 regularization for the weight | 1 | ||
| L2 regularization for the bias | 1 | ||
| 3 | Activation layer | Method | ReLU |
| 4 | Normalization layer | Alpha | 1 × 10−3 |
| Beta | 0.75 | ||
| K | 2 | ||
| 5 | Pooling Layer | Method | Max pooling |
| Pool size | 2 × 2 | ||
| Stride | [ | ||
| Padding | 0 | ||
| 6 | Fully-connected layer | Learning rate of the weight | 1 |
| Learning rate of the bias | 1 | ||
| L2 regularization for the weight | 1 | ||
| L2 regularization for the bias | 1 | ||
| 7 | Dropout layer | Probability | 0.5 |
| 8 | Classification layer | Softmax | Cross-entropy |
The detailed training settings of the proposed CNN model
| Parameter | Value/Approach | |
|---|---|---|
| Backpropagation algorithm | Stochastic gradient descent | |
| Momentum | 0.9 | |
| Initial learning rate | 0.01 | |
| Learning rate drop | Factor | 0.1 |
| Period | 10 epochs | |
| L2 regularizer factor | 1 × 10–4 | |
Fig. 5The training Acc (top) and loss (bottom) change with iteration during the CNN training process
Fig. 6Comparison of the averaged classification performances using different kernel sizes and numbers of filters across ten folds. From left top to right top: Acc, Se, and Sp; from left bottom to right bottom: QI, AUC, and time
Fig. 7Comparison of the averaged classification performances using different max epochs and mini-batch sizes across ten-folds. From left top to right top: Acc, Se, and Sp; from left bottom to right bottom: QI, AUC, and time
Comparison of the averaged classification performances of different layers of CNN model across ten folds
| Layers | Type | Performance | |||||
|---|---|---|---|---|---|---|---|
| Acc (%) | Se (%) | Sp (%) | QI (%) | AUC (%) | Training Time (second) | ||
| 6 | I – C – P – C – F - O | 91.88 | 92.55 | 89.74 | 91.13 | 91.15 | 162.3 |
| 7 | I – C – P – C – P – F - O | 91.21 | 92.13 | 89.25 | 90.68 | 90.69 | 178.8 |
| 8 | I – C – P – C – P – F – F – O | 90.76 | 91.71 | 88.67 | 90.18 | 90.19 | 201.3 |
| 9 | I – C – P – C – P – C – F – F - O | 91.34 | 92.34 | 89.56 | 90.94 | 90.95 | 225.4 |
| 10 | I – C – P – C – P – C – P – F – F - O | 90.82 | 91.88 | 89.11 | 90.48 | 90.50 | 248.2 |
Note: The best performance is indicated in bold. I = image input layer, C = convolution + ReLU + normalization layer, P = max pooling layer, F = fully-connected + dropout layer, O = classification output layer
Comparison of the averaged classification performances of different image resolutions using the same optimization method across ten folds
| Measurement | Acc (%) | Se (%) | Sp (%) | QI (%) | AUC (%) | Time (second) |
|---|---|---|---|---|---|---|
| Dataset | ||||||
| Set1 | 88.47 | 89.12 | 82.33 | 85.66 | 77.28 | 150 |
| Set2 | 94.22 | 96.92 | 86.11 | 91.36 | 92.03 | 317 |
| Set3 | 96.44 | 97.02 | 92.04 | 94.50 | 94.66 | 587 |
| Set4 |
Note: The best performance is indicated in bold
Fig. 8ROC curve of the proposed algorithm using different image resolutions and same optimization method
Average classification performance for different input layers
| Scheme | Performance (Validation) | ||||
|---|---|---|---|---|---|
| Acc (%) | Se (%) | Sp(%) | QI(%) | AUC(%) | |
| HHT | 79.50 | 79.71 | 79.29 | 79.52 | 79.63 |
| Gabor Transformation | 76.38 | 80.56 | 72.33 | 76.25 | 77.22 |
| STFT | 83.27 | 86.78 | 78.83 | 82.91 | 83.10 |
| CWT | 98.34 | 98.22 | 94.87 | 96.53 | 97.82 |