| Literature DB >> 33519321 |
Shui-Hua Wang1,2,3, Deepak Ranjan Nayak4, David S Guttery5, Xin Zhang6, Yu-Dong Zhang2,7.
Abstract
AIM: : COVID-19 is a disease caused by a new strain of coronavirus. Up to 18th October 2020, worldwide there have been 39.6 million confirmed cases resulting in more than 1.1 million deaths. To improve diagnosis, we aimed to design and develop a novel advanced AI system for COVID-19 classification based on chest CT (CCT) images.Entities:
Keywords: COVID-19; Chest CT; Deep fusion; Discriminant correlation analysis; Micro-averaged F1; pretrained model; transfer learning
Year: 2020 PMID: 33519321 PMCID: PMC7837204 DOI: 10.1016/j.inffus.2020.11.005
Source DB: PubMed Journal: Inf Fusion ISSN: 1566-2535 Impact factor: 12.975
Abbreviation List.
| Abbreviation | Full Name |
|---|---|
| (M)DA | (multiple-way) data augmentation |
| BCS | between-class scatter |
| BSC | between-set covariance |
| CAP | community-acquired pneumonia |
| CCA | canonical correlation analysis |
| CCT | chest computed tomography |
| DAF | data augmentation factor |
| DCA | discriminant correlation analysis |
| DCFDCA | deep CCT fusion by discriminant correlation analysis |
| DDM | diagonally dominant matrix |
| DL | deep learning |
| DLF | decision-level fusion |
| FCL | fully-connected layer |
| FLF | feature-level fusion |
| GSAF | greedy selection algorithm for fusion |
| HC | healthy control |
| ISP | incompatible size problem |
| L2TFL | ( |
| MRL | maximum removable layer |
| MV | Majority voting |
| NLR | number of layers to be removed |
| OHNN | one-hidden layer neural network |
| PF | parallel fusion |
| PTM | pre-trained model |
| SAPNF | selection algorithm of pretrained networks for fusion |
| SF | serial fusion |
| SPT | secondary pulmonary tuberculosis |
| SVD | singular value decomposition |
| TL | transfer learning |
Symbol List.
| Symbol | Meaning |
| raw dataset | |
| raw slice CCT image | |
| size of | |
| index of | |
| labeling | |
| radiologist | |
| class (viz., COVID, CAP, SPT, HC) | |
| grayscale operation | |
| minimum grayscale of an image | |
| maximum grayscale of an image | |
| crop operation | |
| crop values in unit of pixel from four directions | |
| preprocessed dataset | |
| preprocessed slice CCT image | |
| size of | |
| storage compression ratio | |
| size compression ratio | |
| data, labeling, and classifier of source domain | |
| data, labeling, and classifier of target domain | |
| error function | |
| learning rate | |
| number of PTMs | |
| a specified model among all six models. See | |
| a pretrained model | |
| retrained | |
| 1st model | |
| 2nd model | |
| number of last layers to be removed | |
| number of last layers to be removed at 1st model | |
| number of last layers to be removed at 2nd model | |
| number of learnable layers of | |
| number of learnable layers of | |
| maximum removable layer | |
| layers from | |
| features learnt from network | |
| remove layer function | |
| add fully-connected layer function | |
| retrain function | |
| activation function | |
| node number of first added FCL layer | |
| node number of second added FCL layer | |
| training, validation, and test set of a dataset | |
| number of hidden neurons | |
| initialized OHNN | |
| trained OHNN | |
| sort function in descending way | |
| rank list | |
| indicator by | |
| indicator vector | |
| proposed L2TFL operation | |
| output on validation set | |
| output on test set | |
| measuring indicator function | |
| fused feature | |
| deep fusion function | |
| features to be fused from two models | |
| serial fusion operation | |
| parallel fusion operation | |
| length of feature | |
| number of trained features. | |
| cross-covariance operation | |
| transformation matrix of CCA for model 1 | |
| transformation matrix of CCA for model 1 | |
| transformed features by CCA | |
| concatenation of CCA features | |
| summation of CCA features | |
| feature extracted from | |
| between-class scatter matrix | |
| matrix of orthogonal eigenvectors | |
| diagonal matrix of real and non-negtive eigenvalue in decreasing order. | |
| rank function | |
| projection of | |
| between-set covariance matrix of transformed feature sets | |
| transform matrix of DCA for model 1 | |
| transform matrix of DCA for model 2 | |
| transformed feature sets by DCA | |
| concatenation of DCA features | |
| summation of DCA features | |
| number of different DA techniques | |
| number of generated images by each offline MDA technique | |
| one training image | |
| training set | |
| validation set | |
| mean of Gaussian noise injected | |
| variance of Gaussian noise injected | |
| noise injection operation | |
| horizontal shift transform function | |
| vertical shift transform function | |
| Gamma correction operation | |
| image rotation operation | |
| random translation operation | |
| random horizontal shift | |
| random vertical shift | |
| maximum shift range | |
| uniform distribution | |
| image scaling operation | |
| mirror function | |
| concatenation operation | |
| collection of generated MDA images with original image | |
| set of all augmented images | |
| data augmentation factor | |
| non-test set of preprocessed dataset | |
| test set of preprocessed dataset | |
| number of samples of non-test set in | |
| number of samples of test set in | |
| ideal confusion matrix over validation set | |
| ideal confusion matrix over test set | |
| number of runs on validation set | |
| number of runs on test set | |
| micro-averaged F1 | |
| micro-averaged precision | |
| micro-averaged sensitivity | |
| run index | |
| fold index |
Subjects and images of four categories.
| Category | Patients (n) | CCT Images (n) |
|---|---|---|
| COVID-19 | 125 | 284 |
| CAP | 123 | 281 |
| SPT | 134 | 293 |
| HC | 139 | 306 |
(n = number).
Fig. 1Illustration of preprocessing. (CAP: community-acquired pneumonia; SPT: secondary pulmonary tuberculosis; HC: healthy control; CCT: chest CT; HS: histogram stretching).
Fig. 2Samples of X4. (CAP: community-acquired pneumonia; SPT: secondary pulmonary tuberculosis; HC: healthy control).
Storage and size per preprocessing step.
| Preprocessing Step | Variable | Storage* | Size* | |||
|---|---|---|---|---|---|---|
| Raw | 1024 | 1024 | 3 | 12,582,912 | ||
| Grayscale | 1024 | 1024 | 1 | 4194,304 | ||
| HS | 1024 | 1024 | 1 | 4194,304 | ||
| Crop | 724 | 724 | 1 | 2096,704 | ||
| DS | 227 | 227 | 1 | 206,116 |
* Storage and size are measured per image.
Fig. 3Idea of transfer learning. (PTM: pretrained mode; CAP: community-acquired pneumonia; SPT: secondary pulmonary tuberculosis; HC: healthy control).
Candidate pretrained models.
| PTM | PTM Symbol | Parameters (millions) | Input Size |
|---|---|---|---|
| AlexNet | 61.0 | 227×227 | |
| DenseNet201 | 20.0 | 224×224 | |
| ResNet50 | 25.6 | 224×224 | |
| ResNet101 | 44.6 | 224×224 | |
| VGG16 | 138 | 224×224 | |
| VGG19 | 144 | 224×224 |
Proposed L2TFL algorithm.
| Step 1 Read one raw PTM network |
| Step 2 Remove the last NLR |
| Step 3 Add two new fully connected layers, |
| Step 4 Freeze early layers, |
| Step 5 Let last two layers retrainable, |
| Step 6 Retrain the whole network, and obtain the new network |
| Step 7 Output learnt features |
Fig. 4A simplistic example of L2TFL algorithm for ResNet18 (Here NLR L = 2). (ReLU: rectified linear unit; FCL: fully-connected layer; L2TFL: (L, 2) transfer feature learning; NLR: number of layers to be removed).
Proposed GSAF for PTM selection.
| Step 1 Input: Training set |
| Step 2 for |
| for |
| Step 2.1 PTM Retrain |
| Import |
| Use L2TFL via data |
| Obtain |
| Step 2.2 Feature Extraction |
| Generate features |
| Step 2.3 Train OHNN |
| Initialize OHNN |
| Train OHNN |
| Obtain |
| Step 2.4 Obtain Indicator |
| Obtain performance indicator |
| end |
| end |
| Step 3 Generate and sort the indicator vector |
| Step 4 Obtain the rank list |
| Step 5 Choose the top two best models (determine PTM and NLR): |
Proposed SAPNF for PTM selection.
| Step 1 Input: Training set |
| Step 2 for |
| for |
| for |
| for |
| Import |
| Use L2TFL via data |
| Obtain |
| Import |
| Use L2TFL via data |
| Obtain |
| Generate features |
| Generate features |
| Obtain |
| Initialize OHNN |
| Train OHNN |
| Obtain |
| Obtain performance indicator |
| end |
| end |
| end |
| end |
| Step 3 Generate and sort the indicator vector |
| Step 4 Obtain the rank list |
| Step 5 Choose the top two best models (determine PTM and NLR) |
Fig. 6Diagram of proposed offline MDA technology. (DA: data augmentation; MDA: multiple-way DA).
Fig. 7Confusion matrix of multiple class conditions.
Pseudocode of our CCSHNet algorithm.
| Grayscaling: |
| HS: |
| Crop: |
| Downsampling: |
| Split |
| for |
| for |
|
|
| Split nontest set |
|
|
| Training Set: |
| Validation Set: |
|
|
| for |
| Training image: |
| end |
| DA enhanced training set: |
| Enhanced training set labels: |
| See |
| Record |
| End |
| Validation confusion matrix at |
| end |
| Validation confusion matrix. See |
| Indicator is chosen as micro-averaged F1. |
| Obtain |
| Obtain the rank list |
| Output the top two models, i.e., best PTM and NLR combinations. |
| Output |
| Select the two optimal models |
| Feature learning by L2TFL with NLR |
| Deep CCT fusion by DCFDCA. |
| OHNN |
| Training set is |
| Test set is |
| for |
| We initialized a random seed |
| Trained CCSHNet Model |
| Prediction: |
| Test confusion matrix at |
| Calculate Indicators. See |
| End |
| Test confusion matrix: See |
| Calculate indicators. |
Hyperparameter Setting.
| Parameter | Value |
|---|---|
| 1024 | |
| 1024 | |
| 3 | |
| 150 | |
| 150 | |
| 150 | |
| 150 | |
| 6 | |
| 512 | |
| 4 | |
| 3 | |
| 10 | |
| 14 | |
| 30 | |
| 0 | |
| 0.01 | |
| 20 | |
| 422 | |
| 10 | |
| 10 |
Training, validation, and test set.
| Non-test (9 folds for training | Test | Total | |
|---|---|---|---|
| COVID-19 | |||
| CAP | |||
| SPT | |||
| HC |
Fig. 8Results of proposed MDA.
Top best three models on validation set.
| Model | Class | Sen (%) | Prc (%) | F1 (%) |
|---|---|---|---|---|
| DensetNet201 | C1 | 94.63 | 96.45 | 95.53 |
| C2 | 93.16 | 96.68 | 94.88 | |
| C3 | 98.12 | 96.15 | 97.12 | |
| C4 | 99.18 | 96.16 | 97.65 | |
| MA | 96.35 | |||
| DensetNet201 | C1 | 94.93 | 97.07 | 95.99 |
| C2 | 93.91 | 95.57 | 94.73 | |
| C3 | 97.26 | 94.09 | 95.65 | |
| C4 | 97.92 | 97.52 | 97.72 | |
| MA | 96.06 | |||
| ResNet101 | C1 | 96.91 | 96.57 | 96.74 |
| C2 | 96.22 | 94.45 | 95.33 | |
| C3 | 94.44 | 95.75 | 95.09 | |
| C4 | 95.79 | 96.50 | 96.14 | |
| MA | 95.83 |
(MA: micro-averaged; Sen: Sensitivity; Prc: Precision).
GSAF against SAPNF on validation set.
| Selection | Selected | Class | Sen (%) | Prc (%) | F1 (%) |
|---|---|---|---|---|---|
| GSAF | DenseNet201 | C1 | 94.80 | 96.58 | 95.68 |
| C2 | 93.42 | 96.59 | 94.98 | ||
| C3 | 97.90 | 96.22 | 97.05 | ||
| C4 | 99.06 | 96.11 | 97.56 | ||
| MA | 96.37 | ||||
| SAPNF | DenseNet201 | C1 | 96.43 | 98.07 | 97.24 |
| C2 | 95.95 | 97.03 | 96.49 | ||
| C3 | 97.64 | 96.82 | 97.23 | ||
| C4 | 98.53 | 96.83 | 97.67 | ||
| MA | 97.18 |
(MA: micro-averaged; Sen: Sensitivity; Prc: Precision).
Fig. 9Grad-CAM result of a COVID-19 slice. (The “jet” pseudo-color was used. Red colors mean part and parcel areas for AI diagnosis, and blue colors less important areas for AI decision.).
Fig. 10Grad-CAM result on a normal case.
Performance of proposed CCSHNet on test set (%).
| Class | Sen (%) | Prc (%) | F1 (%) |
|---|---|---|---|
| C1 | 95.61 | 97.32 | 96.46 |
| C2 | 96.25 | 96.42 | 96.33 |
| C3 | 98.30 | 96.99 | 97.64 |
| C4 | 97.86 | 97.38 | 97.62 |
| MA | 97.04 |
(MA: micro-averaged; Sen: Sensitivity; Prc: Precision).
Comparison results of state-of-the-art methods.
| Method | Class | Sen (%) | Prc (%) | F1 (%) |
|---|---|---|---|---|
| RCBO | C1 | 71.93 | 84.19 | 77.58 |
| C2 | 72.86 | 72.73 | 72.79 | |
| C3 | 73.56 | 76.41 | 74.96 | |
| C4 | 80.66 | 68.91 | 74.32 | |
| MA | 74.85 | |||
| ELM-BA | C1 | 62.63 | 67.61 | 65.03 |
| C2 | 64.29 | 65.10 | 64.69 | |
| C3 | 71.86 | 66.77 | 69.22 | |
| C4 | 63.93 | 63.52 | 63.73 | |
| MA | 65.71 | |||
| 6L-CNN | C1 | 72.46 | 83.94 | 77.78 |
| C2 | 78.93 | 77.82 | 78.37 | |
| C3 | 81.86 | 75.00 | 78.28 | |
| C4 | 89.84 | 87.54 | 88.67 | |
| MA | 80.94 | |||
| RN-18 | C1 | 82.81 | 82.66 | 82.73 |
| C2 | 81.07 | 74.43 | 77.61 | |
| C3 | 74.24 | 76.98 | 75.58 | |
| C4 | 82.13 | 86.38 | 84.20 | |
| MA | 80.04 | |||
| RN-50-AD | C1 | 87.72 | 85.03 | 86.36 |
| C2 | 87.68 | 91.26 | 89.44 | |
| C3 | 93.39 | 89.89 | 91.60 | |
| C4 | 84.92 | 87.65 | 86.26 | |
| MA | 88.41 | |||
| GAN-GN | C1 | 91.75 | 89.86 | 90.80 |
| C2 | 92.86 | 91.87 | 92.36 | |
| C3 | 89.83 | 89.98 | 89.91 | |
| C4 | 91.64 | 94.27 | 92.93 | |
| MA | 91.50 | |||
| SMO | C1 | 97.02 | 92.63 | 94.77 |
| C2 | 89.11 | 95.23 | 92.07 | |
| C3 | 94.92 | 94.92 | 94.92 | |
| C4 | 94.26 | 92.89 | 93.57 | |
| MA | 93.86 | |||
| CSS | C1 | 94.04 | 92.25 | 93.14 |
| C2 | 93.75 | 95.11 | 94.42 | |
| C3 | 91.36 | 93.58 | 92.45 | |
| C4 | 94.43 | 92.75 | 93.58 | |
| MA | 93.39 | |||
| NiNet | C1 | 87.89 | 91.59 | 89.70 |
| C2 | 80.89 | 85.47 | 83.12 | |
| C3 | 83.22 | 82.11 | 82.66 | |
| C4 | 92.30 | 85.95 | 89.01 | |
| MA | 86.18 | |||
| FCONet | C1 | 92.28 | 95.64 | 93.93 |
| C2 | 96.79 | 94.43 | 95.59 | |
| C3 | 94.75 | 95.88 | 95.31 | |
| C4 | 94.92 | 92.94 | 93.92 | |
| MA | 94.68 | |||
| COVNet | C1 | 89.82 | 86.63 | 88.20 |
| C2 | 89.82 | 92.63 | 91.21 | |
| C3 | 93.73 | 90.66 | 92.17 | |
| C4 | 87.38 | 90.96 | 89.13 | |
| MA | 90.17 | |||
| DeCovNet | C1 | 91.05 | 90.58 | 90.81 |
| C2 | 93.75 | 90.99 | 92.35 | |
| C3 | 90.51 | 86.97 | 88.70 | |
| C4 | 88.69 | 95.58 | 92.01 | |
| MA | 90.94 | |||
| CCSHNet | C1 | 95.61 | 97.32 | 96.46 |
| C2 | 96.25 | 96.42 | 96.33 | |
| C3 | 98.30 | 96.99 | 97.64 | |
| C4 | 97.86 | 97.38 | 97.62 | |
| MA | 97.04 |
Fig. 11Comparison plot of MA F1 for our algorithm compared to 12 state-of-the-art methods.