| Literature DB >> 30457991 |
Andrew G Taylor1,2, Clinton Mielke2, John Mongan1,2.
Abstract
BACKGROUND: Pneumothorax can precipitate a life-threatening emergency due to lung collapse and respiratory or circulatory distress. Pneumothorax is typically detected on chest X-ray; however, treatment is reliant on timely review of radiographs. Since current imaging volumes may result in long worklists of radiographs awaiting review, an automated method of prioritizing X-rays with pneumothorax may reduce time to treatment. Our objective was to create a large human-annotated dataset of chest X-rays containing pneumothorax and to train deep convolutional networks to screen for potentially emergent moderate or large pneumothorax at the time of image acquisition. METHODS ANDEntities:
Mesh:
Year: 2018 PMID: 30457991 PMCID: PMC6245672 DOI: 10.1371/journal.pmed.1002697
Source DB: PubMed Journal: PLoS Med ISSN: 1549-1277 Impact factor: 11.069
Hyperparameters explored during model development and training.
| Parameter | Values | Description |
|---|---|---|
| Arch | VGG16, VGG19, ResNet-50, Xception, Inception | Pretrained architecture on ImageNet |
| Pooling | Global average, global max, flatten | Pooling method after final filter layers |
| fc1 | 4, 8, 16, 32, 64, 128 | Neuron count for first fully connected layer after pooling |
| fc2 | 0, 4, 8, 16, 32, 64, 128 | Neuron count for the second fully connected layer |
| LR | 0.001, 0.005, 0.01, 0.02 | Learning rate |
| LR schedule | Constant, cyclic, plateau | Experiments with dynamic learning rates |
| Batch size | 4, 8, 16, 32, 64, 128 | Batch size for the training |
| Dropout | 0, 0.25, 0.5, 0.75 | Dropout setting applied to fully connected layers |
| Augmentation zoom | 0, 0.25, 0.5, 0.75, 1.0 | Maximum fractional zoom range for images. 1.0 = 100% increase in size |
| Augmentation shear | 0, 0.1, 0.3, 0.5 | Fractional affine shear for image augmentation generator |
| Augmentation rotation | 0, 30, 45, 60, 90 | Maximum rotational angle in degrees for image augmentation |
| Optimizer | sgd, adam, nadam, adadelta, rmsprop | Optimization algorithm used for training |
| Batch normalization | Yes/no | A batch normalization layer was optionally inserted before the pooling layer |
| ImgShape | 256, 512, 1,024 | The size of the downsampled image in pixels |
Top 16 models classifying large and moderate pneumothorax, excluding small pneumothoraces in training.
| Training | Validation | Arch | Layer neuron count | Batch size | Dropout | Pool | Augmentation | LR | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| AUC | Sens | Spec | PPV | AUC | Sens | Spec | PPV | fc1 | fc2 | Zoom | Shear | |||||
| 0.95 | 0.85 | 0.90 | 0.44 | 0.94 | 0.79 | 0.91 | 0.43 | VGG19 | 16 | 4 | 16 | 0 | Flat | 1 | 0.3 | 0.001 |
| 0.96 | 0.84 | 0.94 | 0.55 | 0.94 | 0.70 | 0.93 | 0.45 | VGG16 | 32 | 8 | 16 | 0.25 | Max | 0.50 | 0.3 | 0.001 |
| 0.97 | 0.87 | 0.93 | 0.54 | 0.94 | 0.74 | 0.93 | 0.47 | VGG19 | 16 | 16 | 16 | 0 | Avg | 0.50 | 0.3 | 0.01 |
| 0.98 | 0.88 | 0.97 | 0.71 | 0.94 | 0.69 | 0.97 | 0.64 | Inception | 64 | 32 | 16 | 0.25 | Avg | 0.50 | 0.5 | 0.02 |
| 0.97 | 0.87 | 0.95 | 0.61 | 0.93 | 0.70 | 0.94 | 0.51 | Inception | 4 | 4 | 16 | 0.50 | Avg | 1 | 0.5 | 0.001 |
| 0.93 | 0.75 | 0.93 | 0.51 | 0.93 | 0.68 | 0.94 | 0.49 | VGG19 | 64 | 4 | 16 | 0.25 | Avg | 1 | 0.1 | 0.005 |
| 0.97 | 0.83 | 0.96 | 0.65 | 0.92 | 0.69 | 0.96 | 0.60 | Inception | 4 | 0 | 16 | 0.50 | Max | 0.75 | 0.3 | 0.02 |
| 0.97 | 0.81 | 0.96 | 0.64 | 0.92 | 0.63 | 0.96 | 0.60 | Xception | 4 | 0 | 4 | 0 | Max | 0.50 | 0.1 | 0.01 |
| 0.95 | 0.78 | 0.96 | 0.62 | 0.92 | 0.64 | 0.96 | 0.56 | VGG19 | 32 | 8 | 4 | 0 | Flat | 0.50 | 0.1 | 0.001 |
| 0.95 | 0.68 | 0.98 | 0.75 | 0.92 | 0.55 | 0.97 | 0.64 | ResNet | 0 | 0 | 8 | 0.25 | Max | 0.75 | 0.1 | 0.001 |
| 0.94 | 0.79 | 0.94 | 0.54 | 0.92 | 0.72 | 0.94 | 0.52 | Xception | 16 | 8 | 4 | 0.25 | Avg | 0.75 | 0.1 | 0.02 |
| 0.95 | 0.77 | 0.96 | 0.64 | 0.92 | 0.64 | 0.96 | 0.60 | VGG19 | 32 | 0 | 8 | 0.75 | Flat | 0.50 | 0.3 | 0.001 |
| 0.97 | 0.86 | 0.93 | 0.55 | 0.91 | 0.69 | 0.94 | 0.47 | Xception | 32 | 0 | 8 | 0 | Flat | 1 | 0.1 | 0.01 |
| 0.95 | 0.81 | 0.92 | 0.49 | 0.91 | 0.68 | 0.93 | 0.44 | VGG16 | 64 | 8 | 16 | 0.25 | Flat | 1 | 0.1 | 0.001 |
| 0.94 | 0.67 | 0.96 | 0.61 | 0.91 | 0.60 | 0.96 | 0.58 | ResNet | 16 | 16 | 16 | 0.25 | Avg | 0.75 | 0.5 | 0.01 |
| 0.95 | 0.84 | 0.92 | 0.50 | 0.88 | 0.63 | 0.92 | 0.39 | VGG19 | 32 | 16 | 16 | 0 | Flat | 0.75 | 0.1 | 0.005 |
Arch, architecture; AUC, area under the receiver operating characteristic curve; LR, learning rate; PPV, positive predictive value; Sens, sensitivity; Spec, specificity.
Fig 1Diagnostic performance ROC curve and confusion matrix of most sensitive top model evaluated on test set (small pneumothoraces excluded).
Mod, moderate; PTX, pneumothorax; ROC, receiver operating characteristic.
Fig 2Diagnostic performance ROC curve and confusion matrix of most specific top model evaluated on test set (small pneumothoraces excluded).
Mod, moderate; PTX, pneumothorax; ROC, receiver operating characteristic.
High sensitivity model test set performance, stratified by pneumothorax size.
| Annotation | Predicted correctly | Predicted incorrectly | Percentage correct | 95% CI |
|---|---|---|---|---|
| Negative | 1,239 | 113 | 92 | 90–93 |
| Trace PTX | 162 | 39 | 81 | 75–85 |
| Small PTX | 113 | 176 | 39 | 34–45 |
| Moderate PTX | 101 | 23 | 81 | 73–87 |
| Large PTX | 24 | 0 | 100 | 86–100 |
PTX, pneumothorax.
High specificity model test set performance, stratified by pneumothorax size.
| Annotation | Predicted correctly | Predicted incorrectly | Percentage correct | 95% CI |
|---|---|---|---|---|
| Negative | 1,311 | 41 | 97 | 96–98 |
| Trace PTX | 193 | 8 | 96 | 92–98 |
| Small PTX | 80 | 209 | 28 | 23–33 |
| Moderate PTX | 97 | 27 | 78 | 70–85 |
| Large PTX | 21 | 3 | 88 | 69–96 |
PTX, pneumothorax.
Performance of the models on the NIH ChestX-ray14 external dataset.
| Measure | High sensitivity model | High specificity model | ||
|---|---|---|---|---|
| Positive (any PTX) | Negative (no PTX) | Positive (any PTX) | Negative (no PTX) | |
| Predicted positive | 2,602 | 16,269 | 1,481 | 3,737 |
| Predicted negative | 2,700 | 90,549 | 3,821 | 103,081 |
| Sensitivity | 0.49 | 0.28 | ||
| Specificity | 0.85 | 0.97 | ||
| PPV | 0.14 | 0.28 | ||
| NPV | 0.97 | 0.96 | ||
| AUC | 0.75 | 0.75 | ||
AUC, area under the receiver operating characteristic curve; NIH, National Institutes of Health; NPV, negative predictive value; PPV, positive predictive value; PTX, pneumothorax.