| Literature DB >> 32240400 |
Qiwen Xu1,2, Xin Wang1,2, Huabei Jiang3,4,5.
Abstract
We have developed a computer-aided diagnosis system based on a convolutional neural network that aims to classify breast mass lesions in optical tomographic images obtained using a diffuse optical tomography system, which is suitable for repeated measurements in mass screening. Sixty-three optical tomographic images were collected from women with dense breasts, and a dataset of 1260 2D gray scale images sliced from these 3D images was built. After image preprocessing and normalization, we tested the network on this dataset and obtained 0.80 specificity, 0.95 sensitivity, 90.2% accuracy, and 0.94 area under the receiver operating characteristic curve (AUC). Furthermore, a data augmentation method was implemented to alleviate the imbalance between benign and malignant samples in the dataset. The sensitivity, specificity, accuracy, and AUC of the classification on the augmented dataset were 0.88, 0.96, 93.3%, and 0.95, respectively.Entities:
Keywords: Breast cancer; Convolutional neural network; Diffuse optical tomography; Image classification; Machine learning
Year: 2019 PMID: 32240400 PMCID: PMC7099566 DOI: 10.1186/s42492-019-0012-y
Source DB: PubMed Journal: Vis Comput Ind Biomed Art ISSN: 2524-4442
Fig. 1Examples of diffuse optical tomography breast images. a 3D image of a benign breast tumor with a size of 3.0*1.3*1.3, b 2D image sliced from Fig. 1a, c 3D image of a malignant breast tumor with a size of 2.4*1.3*1.2, d 2D image sliced from Fig. 1c
2D diffuse optical tomography image dataset and its division into training and test sets
| Benign | Malignant | |
|---|---|---|
| Training set | 645 | 300 |
| Test set | 215 | 100 |
Fig. 2Architecture of the convolutional neural network
Augmented dataset and its division into training and test sets
| Benign | Malignant | |
|---|---|---|
| Training set | 645 | 600 |
| Test set | 215 | 100 |
Comparison summary of the convolutional neural network performance on original and augmented data
| Accuracy | Specificity | Sensitivity | Area under curve | |
|---|---|---|---|---|
| Original data | 90.2% | 0.80 | 0.95 | 0.94 |
| Augmented data | 93.3% | 0.88 | 0.96 | 0.95 |
Fig. 3a Misclassification rate curve of training set and testing set during training, b Receiver operating characteristic (ROC) comparison of original data and augmented data, c ROC comparison of convolutional neural network (CNN) with different learning rate, d ROC of CNN evaluated by a 10-fold cross-validation