| Literature DB >> 35418051 |
Mate E Maros1, Thomas Ganslandt1,2, Hee E Kim3, Alejandro Cosa-Linan1, Nandhini Santhanam1, Mahboubeh Jannesari1.
Abstract
BACKGROUND: Transfer learning (TL) with convolutional neural networks aims to improve performances on a new task by leveraging the knowledge of similar tasks learned in advance. It has made a major contribution to medical image analysis as it overcomes the data scarcity problem as well as it saves time and hardware resources. However, transfer learning has been arbitrarily configured in the majority of studies. This review paper attempts to provide guidance for selecting a model and TL approaches for the medical image classification task.Entities:
Keywords: Convolutional neural network; Deep learning; Fine-tuning; Medical image analysis; Transfer learning
Mesh:
Year: 2022 PMID: 35418051 PMCID: PMC9007400 DOI: 10.1186/s12880-022-00793-7
Source DB: PubMed Journal: BMC Med Imaging ISSN: 1471-2342 Impact factor: 1.930
Fig. 1Visual abstract summarizing the scope of our study
Overview of five backbone models
| Model type | Model | Released year | Parameters (all) | Parameters (FE only) | Trainable layers (FE + FC layers) | Dataset |
|---|---|---|---|---|---|---|
| Shallow and linear | LeNet5 | 1998 | 60,000 | 1,716 | 4 (2 + 2) | MNIST |
| AlexNet | 2012 | 62.3 M | 3.7 M | 8 (5 + 3) | ImageNet | |
| VGG16 | 2014 | 134.2 M | 14.7 M | 16 (13 + 3) | ||
| Deep | GoogLeNet | 2014 | 5.3 M | 5.3 M | 22 (21 + 1) | |
| ResNet50 | 2015 | 25.6 M | 23.5 M | 51 (50 + 1) |
FE: feature extraction, FC: fully connected layers; MNIST database: Modified National Institute of Standards and Technology database of handwritten digits with 60,000 training and 10,000 test images, ImageNet database: organized according to the WordNet hierarchy with over 14 million hand-annotated images for visual object recognition research
Fig. 2Four types of transfer learning approach. The last classifier block needs to be replaced by a thinner layer or trained from scratch (ML: Machine learning; FC: Fully connected layers)
Fig. 3Flowchart of the literature search
Fig. 4Studies of transfer learning in medical image classification over time (y-axis) with respect to a the number of publications, b applied backbone model and c transfer learning type
Fig. 5The overview of data characteristics of selected publications. a The correlation of anatomical body parts and imaging modalities. b The number of classes c The histogram of the quantity of medical image datasets
Fig. 6Scatter plots of model performance with data size, image modality, backbone model and transfer learning type. Color keys in a and b indicate the medical image modality, whereas color keys in c and d represent backbone models. Transfer learning types are in any of four marker shapes for all subfigures