| Literature DB >> 34153076 |
Yiyun Chen1, Craig S Roberts1, Wanmei Ou1, Tanaz Petigara1, Gregory V Goldmacher1, Nicholas Fancourt2, Maria Deloria Knoll3.
Abstract
BACKGROUND: The World Health Organization (WHO)-defined radiological pneumonia is a preferred endpoint in pneumococcal vaccine efficacy and effectiveness studies in children. Automating the WHO methodology may support more widespread application of this endpoint.Entities:
Year: 2021 PMID: 34153076 PMCID: PMC8216551 DOI: 10.1371/journal.pone.0253239
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Conclusions of CXR-reading by radiologists and pediatricians in training (PERCH) and testing (WHO) datasets.
| Image Class | Training Dataset: PERCH (N = 4172) | Test Dataset: WHO (N = 431) | |||||
|---|---|---|---|---|---|---|---|
| Final Conclusion (N = 4,172) | Round-1 Conclusions by Primary Readers | Round-2 Conclusions by Arbitrators | WHO-Original (n = 222) | WHO-CRES (n = 209) | |||
| (N = 4,172) | (n = 2,358) | ||||||
| Concordant (n = 1,814) | Discordant (n = 2,358) | Concordant (n = 1,144) | Discordant (n = 1,214) | ||||
| Primary Endpoint Pneumonia | 1,075 (25.8%) | 458(11%) | 617(14.8%) | 228(9.7%) | 389(32%) | 90 (40.5%) | 71(34.0%) |
| Other Infiltrates | 993 (23.8%) | 361(8.7%) | 632(15.1%) | 276(11.7%) | 356(29.3%) | 44 (19.8%) | 26 (12.4%) |
| Normal | 1,692 (40.6%) | 854(20.5%) | 838(20.1%) | 521(22.9%) | 317(26.1%) | 75 (33.8%) | 106 (50.7%) |
| Uninterpretable | 412 (9.8%) | 141(3.4%) | 271(6.4%) | 119(5%) | 152(12.5%) | 13 (5.9%) | 6 (2.9%) |
AUROC scores (averaged across 10-fold) on the validation set, and WHO test set by level of inter-observer agreement of the image labels.
| Category | Validation Results | Test Results | ||||||
|---|---|---|---|---|---|---|---|---|
| PERCH | PERCH | WHO | WHO-Original | WHO-CRES | ||||
| Concordant (n = 150) | Discordant (n = 150) | High (n = 120) | Low (n = 88) | High + Low (n = 208) | High (n = 203) | |||
| Primary Endpoint Pneumonia | 0.928 (0.919,0.938) | 0.944 (0.930,0.957) | 0.859 (0.837,0.879) | 0.977 (0.974,0.981) | 0.993 (0.990,0.996) | 0.845 (0.817,0.873) | 0.952 (0.943,0.960) | 0.996 (0.995,0.998) |
| Other Infiltrates | 0.780 (0.764,0.797) | 0.810 (0.788,0.832) | 0.741 (0.715,0766) | 0.891 (0.879,0.903) | 0.969 (0.957,0.980) | 0.726 (0.692,0.759) | 0.856 (0.838,0.875) | 0.935 (0.919,0.950) |
| Normal | 0.897 (0.887,0.907) | 0.896 (0.880,0.911) | 0.788 (0.765,0.812) | 0.951 (0.945,0.957) | 0.995 (0.992,0.997) | 0.749 (0.714,0.784) | 0.921 (0.909,0.932) | 0.974 (0.968,0.980) |
* Average sample size of the 10-fold validation set.
Fig 1Comparison of model predicted probabilities and 95% confidence intervals by endpoint and level of human reader agreement.
Fig 2Comparison of model performance to radiologist and pediatricians on discordant images.
The four operating points represent the conclusions given by the 4 readings. The lines represent model’s performance, with the average of 10-fold validation in blue color. The top rows shows the receiver operating characteristic (ROC) curve and the bottom shows the Precision-Recall (PR) curve.
Fig 3(a). Activation map of PEP. Frontal radiographs of the chest in a child with WHO-defined primary endpoint pneumonia; the child is rotated to the right with dense opacity in the right upper lobe; the model localizes consolidation with a predicted probability p = 0.980; the discriminative visualization shows fine-grained features important to the predicted class. (b). Activation map of other-infiltrates. Frontal radiograph of the chest presents patchy opacity consistent with non-endpoint infiltrate. The model correctly classifies the image as infiltrate with a probability of p = 0.917 and localizes the areas of opacity. The class discriminative visualization highlights important class features.
Final conclusion and model prediction on discordant CXR images (N = 1,062) by key features.
| Model = Final conclusion (n = 637) | Model ≠ Final conclusion (n = 425) | |
|---|---|---|
| Final conclusion (n,%) | ||
| Primary Endpoint Pneumonia (PEP) | 257(40.4%) | 132(31.1%) |
| Other Infiltrates (OI) | 149(23.4%) | 207(48.7%) |
| Normal | 231(36.3%) | 86(20.2%) |
| CXR + (PEP or OI) | 558/789(70.7%) | 187/273(68.5%) |
| Predicted Probability (mean, SD) | ||
| PEP | 85.6%(0.16) | 70.0%(0.20) |
| OI | 60.6%(0.13) | 60.5%(0.12) |
| Normal | 76.9%(0.15) | 69.9%(0.16) |
| Gender (n,%) | ||
| Male | 351(55.1%) | 233(54.8%) |
| Female | 286(44.9%) | 192(45.2%) |
| Age in months (mean, SD) | 10.50(10.69) | 11.19(10.96) |
| Countries (n,%) | ||
| Bangladesh | 70(11.0%) | 52(12.2%) |
| Gambia | 93(14.6%) | 70(16.5%) |
| Kenya | 82(12.9%) | 68(16.0%) |
| Mali | 81(12.7%) | 48(11.3%) |
| South Africa | 186(29.2%) | 105(24.7%) |
| Thailand | 33(5.2%) | 27(6.4%) |
| Zambia | 92(14.4%) | 55(12.9%) |
* p<0.0001 for Pearson’s chi-squared test or two-proportion z-test.
** Differences between PEP and OI are ignored so a greater number of images have concordant results.