| Literature DB >> 31173130 |
Allison Park1, Chris Chute1, Pranav Rajpurkar1, Joe Lou1, Robyn L Ball2,3, Katie Shpanskaya4, Rashad Jabarkheel4, Lily H Kim4, Emily McKenna5, Joe Tseng5, Jason Ni5, Fidaa Wishah5, Fred Wittber5, David S Hong6, Thomas J Wilson6, Safwan Halabi5, Sanjay Basu5, Bhavik N Patel5, Matthew P Lungren5, Andrew Y Ng1, Kristen W Yeom5.
Abstract
Importance: Deep learning has the potential to augment clinician performance in medical imaging interpretation and reduce time to diagnosis through automated segmentation. Few studies to date have explored this topic. Objective: To develop and apply a neural network segmentation model (the HeadXNet model) capable of generating precise voxel-by-voxel predictions of intracranial aneurysms on head computed tomographic angiography (CTA) imaging to augment clinicians' intracranial aneurysm diagnostic performance. Design, Setting, and Participants: In this diagnostic study, a 3-dimensional convolutional neural network architecture was developed using a training set of 611 head CTA examinations to generate aneurysm segmentations. Segmentation outputs from this support model on a test set of 115 examinations were provided to clinicians. Between August 13, 2018, and October 4, 2018, 8 clinicians diagnosed the presence of aneurysm on the test set, both with and without model augmentation, in a crossover design using randomized order and a 14-day washout period. Head and neck examinations performed between January 3, 2003, and May 31, 2017, at a single academic medical center were used to train, validate, and test the model. Examinations positive for aneurysm had at least 1 clinically significant, nonruptured intracranial aneurysm. Examinations with hemorrhage, ruptured aneurysm, posttraumatic or infectious pseudoaneurysm, arteriovenous malformation, surgical clips, coils, catheters, or other surgical hardware were excluded. All other CTA examinations were considered controls. Main Outcomes and Measures: Sensitivity, specificity, accuracy, time, and interrater agreement were measured. Metrics for clinician performance with and without model augmentation were compared.Entities:
Mesh:
Year: 2019 PMID: 31173130 PMCID: PMC6563570 DOI: 10.1001/jamanetworkopen.2019.5600
Source DB: PubMed Journal: JAMA Netw Open ISSN: 2574-3805
Figure 1. Study Design
A, Crossover study design. Clinicians were divided into 2 groups to perform reads with and without model augmentation in random order, with a 2-week washout period between. B, Unaugmented read, with original CTA scan in axial, coronal, and sagittal view. C, Augmented read, with model segmentation overlay on CTA in axial, coronal, and sagittal view. Readers had the option to toggle overlays off and view the scan as shown in B. AI indicates artificial intelligence; CTA, computed tomographic angiography.
Figure 2. Data Set Selection Flow Diagram and Patient Demographics
Of 9455 computed tomography angiogram (CTA) examinations performed between 2003 and 2017 at Stanford University Medical Center, 818 were selected according to an exclusion criteria validated by a board-certified neuroradiologist. These examinations were split into the training set, development set, and test set to be used for training models, selecting the best model, and assessing the selected model, respectively.
Clinician Performance Metrics With and Without Augmentation
| Metric | Microaverage (95% CI) | Mean Increase (95% CI) | |||
|---|---|---|---|---|---|
| Without Augmentation | With Augmentation | Unadjusted | Adjusted | ||
| Sensitivity | 0.831 (0.794 to 0.862) | 0.890 (0.858 to 0.915) | 0.059 (0.028 to 0.091) | .001 | .01 |
| Specificity | 0.960 (0.937 to 0.974) | 0.975 (0.957 to 0.986) | 0.016 (−0.010 to 0.041) | .10 | .16 |
| Accuracy | 0.893 (0.782 to 0.912) | 0.932 (0.913 to 0.946) | 0.038 (0.014 to 0.062) | .004 | .02 |
P values were adjusted for multiple hypothesis testing using the Benjamini-Hochberg correction.
Figure 3. Change in Individual Clinicians' Performance Metric
Horizontal lines depict the change in performance metric for each clinician with and without model augmentation. The orange dot represents performance without model, and the blue dot represents performance with model augmentation.