| Literature DB >> 32098314 |
Wen-Yu Chuang1,2, Shang-Hung Chang3, Wei-Hsiang Yu4, Cheng-Kun Yang4, Chi-Ju Yeh1, Shir-Hwa Ueng1,5, Yu-Jen Liu1, Tai-Di Chen1, Kuang-Hua Chen1, Yi-Yin Hsieh1, Yi Hsia1,6, Tong-Hong Wang7, Chuen Hsueh1,5, Chang-Fu Kuo8, Chao-Yuan Yeh4.
Abstract
Pathologic diagnosis of nasopharyngeal carcinoma (NPC) can be challenging since most cases are nonkeratinizing carcinoma with little differentiation and many admixed lymphocytes. Our aim was to evaluate the possibility to identify NPC in nasopharyngeal biopsies using deep learning. A total of 726 nasopharyngeal biopsies were included. Among them, 100 cases were randomly selected as the testing set, 20 cases as the validation set, and all other 606 cases as the training set. All three datasets had equal numbers of NPC cases and benign cases. Manual annotation was performed. Cropped square image patches of 256 × 256 pixels were used for patch-level training, validation, and testing. The final patch-level algorithm effectively identified NPC patches, with an area under the receiver operator characteristic curve (AUC) of 0.9900. Using gradient-weighted class activation mapping, we demonstrated that the identification of NPC patches was based on morphologic features of tumor cells. At the second stage, whole-slide images were sequentially cropped into patches, inferred with the patch-level algorithm, and reconstructed into images with a smaller size for training, validation, and testing. Finally, the AUC was 0.9848 for slide-level identification of NPC. Our result shows for the first time that deep learning algorithms can identify NPC.Entities:
Keywords: artificial intelligence; cancer identification; convolutional neural network; deep learning; digital pathology; gradient-weighted class activation mapping; nasopharyngeal carcinoma
Year: 2020 PMID: 32098314 PMCID: PMC7072217 DOI: 10.3390/cancers12020507
Source DB: PubMed Journal: Cancers (Basel) ISSN: 2072-6694 Impact factor: 6.639
Figure 1Expansion of the training data successfully prevented misclassification of patches. (A) An example of benign nasopharyngeal tissue in our testing set, with prominent germinal centers (G) and benign epithelial cells (asterisks) (original magnification ×40). (B) A magnified inset of (A) (dotted square). (C) The original patch-level model misclassified regions of germinal centers and benign epithelial cells as nasopharyngeal carcinoma (red color: high probability). (D) The performance of the model was markedly improved after expanding the training data with germinal centers and benign epithelial cells.
Figure 2The learning curves and receiver operator characteristic (ROC) curves of our algorithms. (A) The learning curves of our final patch-level model. (B) The ROC curves of our patch-level models. The area under ROC curve (AUC) increased from 0.9675 ± 0.020 to 0.9900 ± 0.004 after expanding the training data with germinal centers and benign epithelial cells (p = 0.000018). (C) The learning curves of our slide-level model. (D) The ROC curve of our slide-level model. The results of pathology residents (red crosses), chief resident (green cross), and attending pathologists (blue crosses) were included for comparison.
Figure 3Results of gradient-weighted class activation mapping (Grad-CAM) on patches (256 × 256 pixels) classified as nasopharyngeal carcinoma (NPC) by our patch-level model. The lower row is the result of Grad-CAM, and the upper row is the corresponding hematoxylin and eosin (H&E) images for comparison. The numbers above each H&E image represent the probability of NPC produced by our patch-level algorithm. Note that the most important region (red color) for classifying a patch as NPC correlated with the location of a clearly identifiable cancer cell (arrows).
Figure 4Correlation between our patch-level model and Epstein–Barr virus (EBV)-encoded small RNA (EBER) in situ hybridization. (A) An example of nasopharyngeal carcinoma in our testing set (original magnification ×40). (B) A magnified inset of (A) (dotted square). Note the tumor cells (T), lymphoid stroma (L), and benign epithelium (E). (C) The tumor cells were highlighted by EBER in situ hybridization. (D) The tumor areas identified by our patch-level model (red color: high probability) correlated well with (C).
Figure 5Correlation between our patch-level model and H&E morphology. More examples of nasopharyngeal carcinoma in our testing set. Each right image is derived from the left corresponding H&E image using our patch-level model (original magnification ×100). Despite the large number of admixed lymphocytes, the tumor cells were successfully identified by our patch-level mode (red color: high probability). Note that the germinal centers (G) and benign epithelial cells (arrowheads) were correctly classified as benign.