| Literature DB >> 33904984 |
Biluo Shen1,2, Zhe Zhang3,4, Xiaojing Shi1,2, Caiguang Cao1,2, Zeyu Zhang1,5, Zhenhua Hu6,7, Nan Ji8,9,10, Jie Tian11,12,13,14.
Abstract
PURPOSE: Surgery is the predominant treatment modality of human glioma but suffers difficulty on clearly identifying tumor boundaries in clinic. Conventional practice involves neurosurgeon's visual evaluation and intraoperative histological examination of dissected tissues using frozen section, which is time-consuming and complex. The aim of this study was to develop fluorescent imaging coupled with artificial intelligence technique to quickly and accurately determine glioma in real-time during surgery.Entities:
Keywords: Convolutional neural networks; Deep learning; Fluorescence imaging; Gliomas; Intraoperative pathology
Mesh:
Substances:
Year: 2021 PMID: 33904984 PMCID: PMC8440289 DOI: 10.1007/s00259-021-05326-y
Source DB: PubMed Journal: Eur J Nucl Med Mol Imaging ISSN: 1619-7070 Impact factor: 9.236
Fig. 1a Number of specimens per class with pathological examination results as the gold standard. Only tumor specimens had pathological results of tumor grade and Ki-67. b Specimens were resected by the guidance of NIR-II fluorescence imaging. Specimen images of WL and FL were obtained and fed into the CNNs for differentiation of tumor versus non-tumor. Images of tumor specimens were then fed into the CNNs for the classification of the grade and Ki-67 level. The prediction process by the models costed less than 1 s on CPU and can be faster on GPU; therefore, the time spent can be omitted
Fig. 2Diagnostic performance of the CNNs and neurosurgeons (readers). a Receiver operating characteristic (ROC) curves calculated for the CNNs and neurosurgeons on FL and WL images for the classification of tumor tissue versus non-tumor. The lines represent the ROC achieved by the CNNs. Individual neurosurgeon performance is indicated by crosses. Results of FL and WL are colored blue and orange, respectively. b Specificity and sensitivity of the CNNs and averaged individual neurosurgeons (readers) are plotted, using pathological examination results as the gold standard, and compared on FL and WL. The error bars represent the 95% confidence interval computed based on 1000 iterations of the bootstrap method. P values were computed using a two-sided permutation test with 10,000 random resampling of the data. All results were obtained on the test set (N = 608)
Diagnostic performance of FL-CNN, WL-CNN and averaged individual neurosurgeons (reader) and the differences between FL-CNN and other three approaches, using pathological examination results as the gold standard
| Specificity | Sensitivity | PPV | NPV | ||
|---|---|---|---|---|---|
| FL-CNN | 0.822 | 0.938 | 0.910 | 0.872 | 0.924 |
| (0.729, 0.877) | (0.922, 0.966) | (0.866, 0.936) | (0.835, 0.925) | (0.900, 0.940) | |
| Reader (FL) | 0.606 | 0.912 | 0.817 | 0.783 | 0.862 |
| (0.541, 0.672) | (0.881, 0.938) | (0.778, 0.848) | (0.711, 0.843) | (0.834, 0.883) | |
| +0.216* | +0.025 | +0.094* | +0.090* | +0.062* | |
| (0.125, 0.303) | (−0.038, 0.088) | (0.059, 0.129) | (0.008, 0.171) | (0.019, 0.104) | |
| Reader (WL) | 0.827 | 0.820 | 0.901 | 0.705 | 0.859 |
| (0.773, 0.875) | (0.782, 0.855) | (0.868, 0.929) | (0.645, 0.762) | (0.832, 0.884) | |
| −0.005 | +0.118* | +0.009 | +0.168* | +0.065* | |
| (−0.096, 0.087) | (0.053, 0.185) | (−0.031, 0.049) | (0.094, 0.239) | (0.017, 0.113) | |
| WL-CNN | 0.803 | 0.821 | 0.889 | 0.699 | 0.853 |
| (0.599, 0.872) | (0.804, 0.863) | (0.792, 0.926) | (0.644, 0.760) | (0.815, 0.878) | |
| +0.019 | +0.118* | +0.021 | +0.174* | +0.071* | |
| (−0.072, 0.111) | (0.050, 0.185) | (−0.019, 0.062) | (0.102, 0.247) | (0.022, 0.118) | |
*Denotes that there are significant differences between these two results (P < 0.05)
Fig. 3Gradient-weighted Class Activation Maps (GCAMs) visualize the learned feature representations for classification of tumor tissue versus non-tumor. FL/WL images of tumor/non-tumor tissues were randomly sampled. Colored GCAM, GCAM saliency, saliency × image, GCAM heatmap, and heatmap × image are shown. Saliency × image/heatmap × image was generated by putting GCAM saliency/heatmap on the image, which makes GCAM saliency/heatmap more recognizable. The target layer of GCAM was the last stage of the CNNs. Target classes were used to guide backpropagation and generate CAM. For preprocessing, non-square images were zero-padded to square, which led a black pad on the top/bottom or left/right
Fig. 4Performance of the CNNs for the classification of tumor grade and Ki-67 level. a Predicted grade of specimens are compared to the gold standard pathological examination results. AUC, accuracy, sensitivity, specificity, and F1 score of FL-CNN and WL-CNN are compared. b Predicted Ki-67 level of specimens is compared to the gold standard pathology results. AUC, accuracy, sensitivity, specificity, and F1 score of FL-CNN and WL-CNN are compared. All error bars represent the 95% confidence interval computed based on 1000 iterations of the bootstrap method. N = 296 for grade and N = 262 for Ki-67
Performance of the proposed FL-CNN and WL-CNN on the classification of the grade and Ki-67 level of tumor specimens compared to the gold standard pathological examination results
| AUC | Accuracy | Sensitivity | Specificity | |||
|---|---|---|---|---|---|---|
| Grade | FL-CNN | 0.810 | 0.875 | 0.606 | 0.909 | 0.519 |
| (0.712, 0.874) | (0.826, 0.905) | (0.429, 0.758) | (0.867, 0.939) | (0.374, 0.650) | ||
| WL-CNN | 0.688 | 0.696 | 0.545 | 0.715 | 0.286 | |
| (0.600, 0.758) | (0.639, 0.747) | (0.379, 0.714) | (0.655, 0.768) | (0.188, 0.387) | ||
| +0.127* | +0.179* | +0.061 | +0.194* | +0.234* | ||
| (0.016, 0.235) | (0.108, 0.250) | (−0.152, 0.273) | (0.122, 0.266) | (0.110, 0.360) | ||
| Ki-67 | FL-CNN | 0.625 | 0.611 | 0.778 | 0.433 | 0.673 |
| (0.560, 0.688) | (0.545, 0.649) | (0.388, 0.911) | (0.302, 0.748) | (0.502, 0.744) | ||
| WL-CNN | 0.689 | 0.679 | 0.844 | 0.504 | 0.731 | |
| (0.620, 0.751) | (0.615, 0.729) | (0.740, 0.921) | (0.349, 0.595) | (0.656, 0.781) | ||
| −0.063 | −0.073* | −0.089 | −0.056 | −0.065* | ||
| (−0.161, 0.034) | (−0.152, 0.007) | (−0.200, 0.022) | (−0.173, 0.063) | (−0.141, 0.007) | ||
*Denotes that there are significant differences between these two results (P < 0.05)