| Literature DB >> 34777977 |
Yunhong Gong1, Yanan Sun1, Dezhong Peng1,2,3,4, Peng Chen5, Zhongtai Yan6, Ke Yang6.
Abstract
The COVID-19 pandemic has caused a global alarm. With the advances in artificial intelligence, the COVID-19 testing capabilities have been greatly expanded, and hospital resources are significantly alleviated. Over the past years, computer vision researches have focused on convolutional neural networks (CNNs), which can significantly improve image analysis ability. However, CNN architectures are usually manually designed with rich expertise that is scarce in practice. Evolutionary algorithms (EAs) can automatically search for the proper CNN architectures and voluntarily optimize the related hyperparameters. The networks searched by EAs can be used to effectively process COVID-19 computed tomography images without expert knowledge and manual setup. In this paper, we propose a novel EA-based algorithm with a dynamic searching space to design the optimal CNN architectures for diagnosing COVID-19 before the pathogenic test. The experiments are performed on the COVID-CT data set against a series of state-of-the-art CNN models. The experiments demonstrate that the architecture searched by the proposed EA-based algorithm achieves the best performance yet without any preprocessing operations. Furthermore, we found through experimentation that the intensive use of batch normalization may deteriorate the performance. This contrasts with the common sense approach of manually designing CNN architectures and will help the related experts in handcrafting CNN models to achieve the best performance without any preprocessing operations.Entities:
Keywords: Batch normalization; COVID-19; Evolutionary algorithms; Variable-length chromosomes
Year: 2021 PMID: 34777977 PMCID: PMC8421016 DOI: 10.1007/s40747-021-00513-8
Source DB: PubMed Journal: Complex Intell Systems ISSN: 2199-4536
Fig. 1Traditional neural network architecture with the BN operations
Fig. 2Illustration about fixed-length and variable-length chromosomes
Choice of CNN hyperparameters
| Block type | Encode parameters |
|---|---|
| Convolutional block | The number of input feature maps, the number of output feature maps, the filter size, the stride size, the padding size and the convolutional type |
| Batch normalization block | Whether to use batch normalization block or not |
| Pooling block | The kernel size, the stride size and the pooling type |
| Fully connected block | The number of input neurons and the number of output neurons |
Fig. 3Example to illustrate the crossover operation with variable-length chromosomes. In this example, parents are split into two parts (see a). The head-chromosome is made up of C, BNC and P. All F are all in the tail-chromosome and the first F in each parent are aligned at the same position. b Fully shows the crossover operation. For head-chromosome, a crossover point is random selected in each parents separately. After the head-chromosomes before crossover points are exchanged, the F in the same position are exchanged, respectively. c Describes the offspring generated by the selected parents through crossover operation
Fig. 4Example shows the balance operation to deal with a long offspring and a short offspring generated by the proposed crossover operation. The numbers of units in head-chromosomes are 18 and 5, respectively (a). b Detailedly give the steps of the balance operation, where and . Due to the long offspring , the shrink strategy choices a shrink position and the shrunken length ; For the short offspring, where , a fragment with consecutive genes is selected in global-best chromosome, followed by a growth point . Finally, the selected fragment is inserted after the growth point. c Gives the picture that the balanced offspring with both after the balance operation
Detailed information about COVID-CT
| COVID-19 | Non-COVID-19 | |
|---|---|---|
| Train set | 88 | 231 |
| Test set | 173 | 168 |
| Val set | 88 | 64 |
| Total | 349 | 463 |
Fig. 5Examples of CT images from COVID-CT data set
Fig. 6Evolution curve of EAVL-COVID on COVID-CT
ORIGIN searched by EAVL-COVID and its variants on the COVID-CT data set
| The ORIGIN searched by EAVL-COVID and its variants | ||
|---|---|---|
| ORIGIN | VARIANT#1 | VARIANT#2 |
| Inputs | ||
| maxpooling | maxpooling | maxpooling |
| conv-91 | *conv-91 | conv-91 |
| conv-69 | *conv-69 | conv-69 |
| conv-51 | *conv-51 | conv-51 |
| conv-79 | *conv-79 | conv-79 |
| conv-72 | *conv-72 | conv-72 |
| conv-53 | *conv-53 | conv-53 |
| conv-99 | *conv-99 | conv-99 |
| maxpooling | maxpooling | maxpooling |
| conv-77 | *conv-77 | conv-77 |
| conv-58 | *conv-58 | conv-58 |
| maxpooling | maxpooling | maxpooling |
| FC-347 | ||
| FC-344 | ||
| FC-256 | ||
| FC-193 | ||
| softmax | ||
*The convolutional layer followed a BN layer
Performance comparison between randomly initialized networks (RAND.) and ImageNet pretrained networks (PRET.)
| Parameter numbers | RAND. | PRET. | ||
|---|---|---|---|---|
| BASELINE [ | VGG-16 | 138,357,544 | 0.632 | 0.773 |
| ResNet-18 | 11,689,512 | 0.652 | 0.734 | |
| ResNet-50 | 25,557,032 | 0.687 | 0.775 | |
| DenseNet-121 | 7,978,856 | 0.773 | 0.814 | |
| DenseNet-169 | 14,149,480 | 0.807 | ||
| EfficientNet-b0 | 5,288,548 | 0.695 | 0.774 | |
| EfficientNet-b1 | 7,794,184 | 0.726 | 0.797 | |
| EAVL-COVID | VARIANT#2 | 808,222 | 0.680 | 0.663 |
| VARIANT#1 | 810,565 | 0.749 | 0.738 | |
| ORIGIN | 810,093 | 0.815 |
Performance of ablative experiments on different data sets
| COVID-CT | CIFAR10 | CIFAR100 | STL10 | |||||
|---|---|---|---|---|---|---|---|---|
| Accuracy | # of layers | Accuracy | # of layers | Accuracy | # of layers | Accuracy | # of layers | |
| VGG16BN | 0.632 | 16 | 0.611 | 16 | 0.602 | 16 | 0.589 | 16 |
| RS | 0.618 | 11 | 0.583 | 12 | 0.512 | 12 | 0.544 | 12 |
| EVO-CNN | 0.758 | 10 | 0.784 | 12 | 0.587 | 12 | 0.582 | 12 |
| EAVL-COVID | 13 | 11 | 15 | 13 | ||||
Performance of EAVL-CNN and its variants on different data sets
| Optimizer | Architecture | CIFAR10 | CIFAR100 | STL10 |
|---|---|---|---|---|
| SGD | VARIANT #2 | |||
| VARIANT #1 | ||||
| ORIGIN | ||||
| Adadelta | VARIANT #2 | |||
| VARIANT #1 | ||||
| ORIGIN | ||||
| RMSprop | VARIANT #2 | |||
| VARIANT #1 | ||||
| ORIGIN | ||||
| Adam | VARIANT #2 | |||
| VARIANT #1 | ||||
| ORIGIN |