| Literature DB >> 32940026 |
Yankui Sun1, Haoran Zhang1, Xianlin Yao1.
Abstract
SIGNIFICANCE: Automatic and accurate classification of three-dimensional (3-D) retinal optical coherence tomography (OCT) images is essential for assisting ophthalmologist in the diagnosis and grading of macular diseases. Therefore, more effective OCT volume classification for automatic recognition of macular diseases is needed. AIM: For OCT volumes in which only OCT volume-level labels are known, OCT volume classifiers based on its global feature and deep learning are designed, validated, and compared with other methods. APPROACH: We present a general framework to classify OCT volume for automatic recognizing macular diseases. The architecture of the framework consists of three modules: B-scan feature extractor, two-dimensional (2-D) feature map generation, and volume-level classifier. Our architecture could address OCT volume classification using two 2-D image machine learning classification algorithms. Specifically, a convolutional neural network (CNN) model is trained and used as a B-scan feature extractor to construct a 2-D feature map of an OCT volume and volume-level classifiers such as support vector machine and CNN with/without attention mechanism for 2-D feature maps are described.Entities:
Keywords: attention mechanism; convolutional neural network; image classification; optical coherence tomography; transfer learning
Year: 2020 PMID: 32940026 PMCID: PMC7493033 DOI: 10.1117/1.JBO.25.9.096004
Source DB: PubMed Journal: J Biomed Opt ISSN: 1083-3668 Impact factor: 3.170
Fig. 1Architecture of our OCT volume classification.
Fig. 2Network architecture of ResNet-50.
Fig. 3CNN classifier structure.
Performances of our proposed methods on two classes of the Duke dataset (%).
| Methods | ACC | SE | SP |
|---|---|---|---|
| ResNet50+SVM | |||
| ResNet50+CNN | |||
| FT-ResNet50+SVM | |||
| FT-ResNet50+CNN | |||
| FTA-ResNet50+SVM | |||
| FTA-ResNet50+CNN | |||
| FTA-ResNet50+CNN_CBAM |
Fig. 4Voting results: the relationship between accuracy, sensitivity, specificity, and threshold.
Performance comparisons with state-of-the-arts on two classes of the Duke dataset (%).
| Methods | ACC | SE | SP |
|---|---|---|---|
| Voting strategy | |||
| Santos et al. | |||
| Qiu et al. | |||
| FTA-ResNet50+CNN_CBAM |
Fig. 5Representative B-scan examples: (a) 50th B-scan in AMD20 and (b) 50th B-scan in NOR65.
Fig. 6Visualizations of B-scan feature vectors: 50th B-scan (red) in AMD20 and 50th B-scan (blue) in NOR65.
Fig. 7Visualization example of 2-D feature maps (a) AMD20 and (b) NOR65.
Fig. 8Visualization of the feature vectors of AMD20 and NOR65 from 35 to 65 B-scans.
Fig. 9FAST scan mode.
Fig. 10Examples of B-scan images from Tsinghua dataset (a) AMD and (b) DME.
Classification performance of our methods on Tsinghua dataset (%), where 40% AMD and DME volumes are training set, AMD is negative, and DME is positive.
| Methods | ACC | SE | SP |
|---|---|---|---|
| FTA-ResNet50 +SVM | |||
| FTA-ResNet50 +CNN | |||
| FTA-ResNet50+CNN_CBAM |