| Literature DB >> 32964138 |
Pranav Rajpurkar1, Chloe O'Connell2, Amit Schechter1, Nishit Asnani1, Jason Li1, Amirhossein Kiani1, Robyn L Ball3, Marc Mendelson4, Gary Maartens4, Daniël J van Hoving4, Rulan Griesel4, Andrew Y Ng1, Tom H Boyles4, Matthew P Lungren3.
Abstract
Tuberculosis (TB) is the leading cause of preventable death in HIV-positive patients, and yet often remains undiagnosed and untreated. Chest x-ray is often used to assist in diagnosis, yet this presents additional challenges due to atypical radiographic presentation and radiologist shortages in regions where co-infection is most common. We developed a deep learning algorithm to diagnose TB using clinical information and chest x-ray images from 677 HIV-positive patients with suspected TB from two hospitals in South Africa. We then sought to determine whether the algorithm could assist clinicians in the diagnosis of TB in HIV-positive patients as a web-based diagnostic assistant. Use of the algorithm resulted in a modest but statistically significant improvement in clinician accuracy (p = 0.002), increasing the mean clinician accuracy from 0.60 (95% CI 0.57, 0.63) without assistance to 0.65 (95% CI 0.60, 0.70) with assistance. However, the accuracy of assisted clinicians was significantly lower (p < 0.001) than that of the stand-alone algorithm, which had an accuracy of 0.79 (95% CI 0.77, 0.82) on the same unseen test cases. These results suggest that deep learning assistance may improve clinician accuracy in TB diagnosis using chest x-rays, which would be valuable in settings with a high burden of HIV/TB co-infection. Moreover, the high accuracy of the stand-alone algorithm suggests a potential value particularly in settings with a scarcity of radiological expertise.Entities:
Keywords: Diagnosis; Machine learning
Year: 2020 PMID: 32964138 PMCID: PMC7481246 DOI: 10.1038/s41746-020-00322-2
Source DB: PubMed Journal: NPJ Digit Med ISSN: 2398-6352
Fig. 1Flow diagram of patients.
Patient screening, recruitment, and inclusion for each dataset are summarized.
Cohort demographic and clinical characteristics by dataset and TB diagnosis.
| Dataset 1 Jooste/Khayelitsha ( | Dataset 2 Khayelitsha ( | All participants ( | |||
|---|---|---|---|---|---|
| TB ( | No TB ( | TB ( | No TB ( | ||
| Variable | Mean (SD) | Mean (SD) | Mean (SD) | Mean (SD) | Mean (SD) |
| Age (years) | 35.9 (9.5) | 38.6 (10.5) | 36.5 (9.4) | 37.9 (9.7) | 37.3 (9.8) |
| Temperature (°C) | 38.0 (1.3) | 37.5 (1.3) | 37.1 (1.2) | 37.0 (1.2) | 37.3 (1.3) |
| Oxygen saturation (%) | 96 (5) | 94 (7) | 96 (4) | 95 (5) | 95 (6) |
| Hemoglobin (mg/dL) | 8.8 (2.3) | 10.7 (2.3) | 9.0 (2.4) | 10.3 (2.7) | 9.8 (2.6) |
| WBC count (1000/µL) | 8.7 (5.1) | 11.7 (6.7) | 9.7 (9.7) | 11.8 (11.4) | 10.7 (9) |
| CD4 count (cells/mm3) | 127 (117) | 203 (200) | 116 (151) | 203 (274) | 167 (208) |
Fig. 2Diagnostic accuracy of the assisted physicians, stand-alone algorithm, and unassisted physicians.
Each cross represents the stand-alone algorithm’s performance on test data that was assigned as assisted cases for the correspondent physician.
Algorithm performance under four training strategies.
| Strategy | Accuracy (95% CI) | AUC (95% CI) |
|---|---|---|
| Default (w/ Clinical Variables, w/ CheXpert Pretraining, w/ Multi-Label Loss) | 0.78 (0.70, 0.85) | 0.83 (0.75, 0.91) |
| Default w/o Clinical Variables | 0.61 (0.51, 0.69) | 0.57 (0.46, 0.68) |
| Default w/o CheXpert Pre-training | 0.64 (0.55, 0.72) | 0.71 (0.62, 0.81) |
| Default trained on dataset one, validated on dataset two | 0.67 (0.62, 0.72) | 0.71 (0.65, 0.76) |
| Default trained on dataset two, validated on dataset one | 0.60 (0.55, 0.66) | 0.70 (0.64, 0.76) |
The default uses clinical variables, pretraining on CheXpert, and the multi-label loss.
Fig. 3Diagram of the deep learning algorithm architecture.
The architecture takes both the x-ray image and 8 clinical covariates as input and predicts TB and 6 clinical findings.
Fig. 4Experimental setup.
Test cases were randomly assigned to be diagnosed assisted/unassisted by the clinicians. Each clinician analyzed half the cases without algorithm assistance and half of them with algorithm assistance. The web interface for unassisted and assisted viewing is illustrated. Upon examining each case, the clinician made a prediction for the likelihood of a positive TB diagnosis.