| Literature DB >> 36046934 |
Mila Glavaški1, Lazar Velicki.
Abstract
AIM: To assess the effects of different test-set design strategies for magnetic resonance (MR) image classification using deep learning.Entities:
Mesh:
Year: 2022 PMID: 36046934 PMCID: PMC9468729
Source DB: PubMed Journal: Croat Med J ISSN: 0353-9504 Impact factor: 2.415
Figure 1Examples of uninformative images excluded from further experiments.
Data augmentation and model design
| Option | Value |
|---|---|
| Library | FastAI |
| Image size | 224 |
| Batch size | 20 |
| Image normalization | imagenet_norm |
| Data augmentation | random flip
flips limited to horizontal flips
random rotation between -10 and 10 degrees with probability 0.75
random zoom between 1.0 and 1.1 with probability 0.75
random lightning and contrast change controlled by 0.2 with probability 0.75
random symmetric warp of magnitude between -0.2 and 0.2 with probability 0.75 |
| Architecture | ResNet-50 |
| Model fitting method | One cycle policy |
| Optimizer | Adam |
| Regularization | L2 |
| Loss function | cross entropy loss |
Figure 2The first set of experiments.
Figure 3An example of images in one batch – when complete data set was used (left) and when uniform data subset was used (right).
Figure 4The second set of experiments. Each experiment was conducted with and without data augmentation. HFI – heart failure with infarction; HF –heart failure without infarction; LVH – left ventricle hypertrophy; H – healthy.
Experimental settings and effects of different test-set designs
| Classification | Number of classes | Chest images | Test set | Error rate (%) | Training loss<<Test loss |
|---|---|---|---|---|---|
| Disease groups | 4 | + | 10% random images from each class | 2.0643 | - |
| Disease groups | 4 | + | images of one patient from each class | 46.7975* | + |
| Disease groups | 4 | - | 10% random images from each class | 1.2866 | - |
| Disease groups | 4 | - | images of one patient from each class | 49.0891* | + |
| Sex | 2 | + | 10% random images from each class | 0.9597 | - |
| Sex | 2 | + | images of one patient from each class | 16.0180* | + |
| Sex | 2 | - | 10% random images from each class | 0.3040 | - |
| Sex | 2 | - | images of one patient from each class | 23.5596* | + |
| Patients | 45 | + | 10% random images from each class | 1.5293 | - |
| Patients | 45 | - | 10% random images from each class | 0.3939 | - |
*We used early stopping in models where the test set consisted of the images of one patient from each class. In the cases where test sets consisted of 10% of random images from each class, both training loss and test loss were decreasing constantly, so there was no need or opportunity for early stopping.
Subset analysis using other pretrained models, with and without data augmentation*
| Pretrained models | Error rate (%) with data augmentation | Error rate (%) without data augmentation | Training loss<<Test loss |
|---|---|---|---|
| ResNet-50 | 46.9799 | 30.2013 |
|
| ResNet-152 | 38.2550 | 32.2148 |
|
| SqueezeNet 1.1 | 48.9933 | 25.5034 |
|
| DenseNet-161 | 45.6376 | 27.5168 |
|
| DenseNet-169 | 32.8859 | 29.5302 |
|
| DenseNet-201 | 40.2685 | 33.5570 |
|
| VGG-16 with batch normalization | 34.8993 | 32.8859 |
|
| VGG-19 with batch normalization | 29.5302 | 14.0940 |
|
| AlexNet | 53.6913 | 53.0201 |
|
*Early stopping was applied in all training runs.
Figure 5An example of neighboring slices. Image CAP_SCD0001401_MR__hrt_raw_20120813121634880_31 of patient 14 (left) and image CAP_SCD0001401_MR__hrt_raw_20120813121634905_32 of patient 14 (right).
Figure 6An example of top losses in subset analysis using ResNet-50 for pathological group classification.