| Literature DB >> 36116509 |
Andrew J McNeil1, David W House2, Placide Mbala-Kingebeni3, Olivier Tshiani Mbaya4, Lori E Dodd5, Edward W Cowen6, Véronique Nussenblatt7, Tyler Bonnett8, Ziche Chen2, Inga Saknite9, Benoit M Dawant10, Eric R Tkaczyk11.
Abstract
Entities:
Year: 2022 PMID: 36116509 PMCID: PMC9534148 DOI: 10.1016/j.jid.2022.08.044
Source DB: PubMed Journal: J Invest Dermatol ISSN: 0022-202X Impact factor: 7.590
Figure 1Representative monkeypox AI lesion predictions on photos of previously unseen patients. (a) Two patients from our photograph set. Green contours show true positive lesions, blue shows false positive lesions (outlined by the AI but not the human), and magenta shows false negative lesions (outlined by human but not the AI). Unmarked photos are on the left. Upper photo AI lesion counts: 220 lesions, manual counts from three human raters: 239 (rater 1), 233 (rater 2), and 259 (rater 3). Lower photo AI lesion count: 131. Manual counts: 137 (rater 1), 134 (rater 2), and 143 (rater 3). (b) Two patients from publicly available photographs. Predicted lesion contours by our AI model are shown in yellow. The AI model is the same used to test Patient ID 15 (N=17, n=61). Upper photo from the CDC Public Health Image Library (Mahy 1997). AI lesion counts: 58, manual counts by rater 1: 52. Lower photo from the Nigeria Centre for Disease Control, recently made available on WHO website (NCDC 2022), used with permission. AI lesion counts: 26, manual counts by rater 1: 29. Written informed consent was obtained for research and publication of photos from all patients.
Figure 2Comparison of lesion count performance by AI and human raters. Limits of agreement (LoA, shown with dashed lines) are the boundaries within which 95% of future measurement differences are expected to fall. LoA width = upper LoA – lower LoA. We also show the slope and coefficient of determination (R2) for the linear regression fit (red dashed line) between estimated counts for each pair. The solid black line is the line of agreement. (a) Bland-Altman and correlation plots for the AI against the ground truth (human rater 1). (b) Rater 2 against ground truth. (c) Rater 3 against ground truth.
Summary of pairwise comparisons between different human raters and AI algorithm.
| Rater Pair | Bias | Upper LoA | Lower LoA | LoA Width | Slope | R2 |
|---|---|---|---|---|---|---|
| AI vs 1 | -5.86 | 28.56 | -40.29 | 68.85 | 0.78 | 0.94 |
| 2 vs 1 | -3.24 | 15.98 | -22.46 | 38.44 | 1.02 | 0.97 |
| 3 vs 1 | 9.68 | 48.05 | -28.69 | 76.74 | 1.07 | 0.92 |
| 3 vs 2 | 12.92 | 53.88 | -28.03 | 81.91 | 1.03 | 0.90 |