| Literature DB >> 36268083 |
Mio Yamaguchi1, Tomoaki Sasaki2,3, Kodai Uemura2, Yuichiro Tajima2, Sho Kato3,4, Kiyoshi Takagi1, Yuto Yamazaki5, Ryoko Saito-Koyama6, Chihiro Inoue5, Kurara Kawaguchi3, Tomoya Soma3,7, Toshio Miyata3, Takashi Suzuki1,5,8.
Abstract
Background: A diagnosis with histological classification by pathologists is very important for appropriate treatments to improve the prognosis of patients with breast cancer. However, the number of pathologists is limited, and assisting the pathological diagnosis by artificial intelligence becomes very important. Here, we presented an automatic breast lesions detection model using microscopic histopathological images based on a Single Shot Multibox Detector (SSD) for the first time and evaluated its significance in assisting the diagnosis.Entities:
Keywords: Artificial intelligence; Breast cancer; Deep learning; Pathology
Year: 2022 PMID: 36268083 PMCID: PMC9577133 DOI: 10.1016/j.jpi.2022.100147
Source DB: PubMed Journal: J Pathol Inform
Fig. 1Summary of the present study.
We built the data set. The micrographs were taken from glass slides by a pathologist and annotation and image label (benign, non-invasive carcinoma, and invasive carcinoma) were provided for each image. The Single Shot Multibox Detector (SSD) model was trained using 1361 images and evaluated using 315 images. The model performance was evaluated by the intersection over union (IoU) and diagnostic accuracy using detection of the model. To investigate the significance of our model in assisting the diagnosis, 3 pathologists and 5 medical students diagnosed images with or without assistance of the model.
The dataset summary.
| Number of pictures | Number of annotations | |||||||
|---|---|---|---|---|---|---|---|---|
| Benign | Non-invasive carcinoma | Invasive carcinoma | Total | Benign | Non-invasive carcinoma | Invasive carcinoma | Total | |
| Training | 608 | 337 | 416 | 1361 | 3516 | 1485 | 1294 | 6295 |
| Test | 164 | 40 | 111 | 315 | 853 | 154 | 247 | 1254 |
| Examination set-1 | 14 | 15 | 14 | 43 | – | – | – | – |
| Examination set-2 | 14 | 15 | 14 | 43 | – | – | – | – |
Fig. 2Example images of annotation by pathologists and detection by the trained SSD model.
Examples of the annotation (left) and the model detection (right). The blue, red, and green boxses indicated benign, non-invasive carcinoma, and invasive carcinoma, respectively. A: The image of accurately detection by the model. B: The image that some boxes with different labels were detected in same region. C: The image that the model accurately detected the benign area by bounding boxes with different shape against the annotation.
Model performance for diagnosis of images in different threshold of confidence score.
| Threshold of detection confidence score. | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0.1 | 0.2 | 0.3 | 0.4 | 0.5 | 0.6 | 0.7 | 0.8 | 0.9 | Benign; 0.4 | ||||
| 3-class task | Diagnostic accuracy (%) | 81.6 | 83.8 | 85.4 | 85.4 | 84.8 | 82.9 | 82.5 | 80.0 | 77.8 | |||
| F1-score | Benign | 83.9 | 87.0 | 88.5 | 89.1 | 87.9 | 87.7 | 86.5 | 84.6 | Recall | 90.9 | ||
| Precision | 90.9 | ||||||||||||
| F1-score | 90.9 | ||||||||||||
| Non-invasive carcinoma | 61.3 | 64.7 | 68.8 | 70.2 | 67.4 | 68.8 | 66.0 | 67.4 | Recall | 85.0 | |||
| Precision | 68.0 | ||||||||||||
| F1-score | 75.6 | ||||||||||||
| Invasive carcinoma | 88.3 | 88.2 | 85.4 | 83.7 | 81.3 | 79.8 | 74.4 | 68.2 | Recall | 85.6 | |||
| Precision | 94.1 | ||||||||||||
| F1-score | 89.6 | ||||||||||||
| 2-class task | Diagnostic accuracy (%) | 84.4 | 86.7 | 87.9 | 88.9 | 88.3 | 86.7 | 86.3 | 84.8 | 81.9 | |||
| Recall (%) | 79.3 | 84.7 | 87.9 | 90.8 | 91.3 | 91.0 | 91.5 | 91.9 | 93.5 | 90.1 | |||
| Precision (%) | 91.4 | 88.1 | 86.8 | 85.4 | 83.4 | 80.1 | 78.8 | 74.8 | 66.9 | 90.1 | |||
| F1-score | 84.9 | 86.4 | 87.3 | 88.1 | 87.2 | 85.2 | 84.7 | 82.5 | 78.0 | 90.1 | |||
Model performance for the breast lesions detection.
| Intersection over Union (IoU) | |||
|---|---|---|---|
| Benign | Non-invasive carcinoma | Invasive carcinoma | 3-class average |
| 0.52 (0.32) | 0.44 (0.39) | 0.62 (0.34) | |
Threshold of detection confidence score; Benign 0.4, Non-invasive carcinoma 0.5, Invasive carcinoma 0.1. Data were presented as mean (STD).
Impact of the SSD model on the diagnosis by pathologists and medical students.
| Examination set-1 (Unassisted) | Examination set-2 (Assisted) | ||||||
|---|---|---|---|---|---|---|---|
| Model only | Pathologists (n = 3) | Medical students (n = 5) | Model only | Pathologists (n = 3) | Medical students (n = 5) | ||
| 3-class task | Diagnostic accuracy (%) | 86.0 | 97.7 | 67.4 | 86.0 | 93.8 | 84.7 |
| 2-class task | Diagnostic accuracy (%) | 88.4 | 98.4 | 79.1 | 88.4 | 96.1 | 88.4 |
| Precision (%) | 92.9 | 98.9 | 78.0 | 100 | 96.7 | 89.6 | |
| Recall (%) | 89.7 | 98.9 | 96.6 | 82.8 | 97.7 | 93.8 | |
| F1- score | 91.2 | 98.9 | 86.2 | 90.6 | 97.1 | 91.6 | |
| Time taken for diagnosis (min) | – | 12.9 | 20.2 | – | 17.9 | 23.3 | |
| Time differences between unassisted or assisted tasks (min) | – | – | – | – | +5.0 | +3.1 | |
Threshold of detection confidence score; Benign 0.4, Non-invasive carcinoma 0.5, Invasive carcinoma 0.1. Data of pathologists and medical students were presented as average scores.
Fig. 3Performance of diagnosis by the SSD model, pathologists, and medical students with or without assistance of the model.
A–D: The average diagnostic accuracy of the model, pathologists (n = 3), and medical students (n = 5) without assistance (A, B) or with assistance (C, D) of the model in the 3-class (A, C) or 2-class (B, D) classification tasks. A student t-test was used to examine differences in the diagnostic accuracies and the data were presented as the mean ± S.D. *P < 0.05, ***P < 0.001 vs model, respectively. N.S.; not significant. E, F: The change of each accuracy score with or without assistance of the model in 3-class (E) and 2-class (F) classification tasks. G: The change of the time taken for diagnosis with or without assistance of the model.