| Literature DB >> 33881930 |
Bin Zhang1, Ma-Yi-di-Li Ni-Jia-Ti2, Ruike Yan1, Nan An3, Lv Chen1, Shuyi Liu1, Luyan Chen1, Qiuying Chen1, Minmin Li1, Zhuozhi Chen1, Jingjing You1, Yuhao Dong4, Zhiyuan Xiong5,6, Shuixing Zhang1.
Abstract
OBJECTIVES: To develop and validate a radiomic model to predict the rapid progression (defined as volume growth of pneumonia lesions > 50% within seven days) in patients with coronavirus disease 2019 (COVID-19).Entities:
Mesh:
Year: 2021 PMID: 33881930 PMCID: PMC8173680 DOI: 10.1259/bjr.20201007
Source DB: PubMed Journal: Br J Radiol ISSN: 0007-1285 Impact factor: 3.039
Figure 1.(a) Representative follow-up chest CT images in a male patient (29-year-old) with COVID-19 pneumonia. (b) Flowchart of study inclusion. Note: T2 = date of next CT examination, T1 = date of this CT examination, V2 = the pneumonia volume on the next CT examination, V1 = the pneumonia volume on this CT examination.
Figure 2.Histogram of the distribution of number of CT scans and patients under different time interval between two adjacent CT scans.
The parameters of CT image acquisitions
| CT scanner | Manufacturer | Tube voltage | Tube current | Pitch | Slice Thickness |
|---|---|---|---|---|---|
| 64 scanner | GE | 120 kV | 260 mA | 0.984 | 0.625 mm |
| 16 scanner | Siemens | 130 kV | automatic | 1.5 | 1.0 or 0.6 mm |
| 128 scanner | Philips | 120 kV | automatic | 0.7 | 1.0 or 0.67 mm |
The parameters used in different transforms in Pyradiomics
| Transform | Original | Wavelet | LoG | Square | SquareRoot | Exponential | Logarithm |
|---|---|---|---|---|---|---|---|
| Parameters | default | default | sigma = 3.0 | default | default | default | default |
The number of radiomics features in different categories under different image types
| Image type Feature type | Original | Wavelet | LoG | Square | SquareRoot | Exponential | Logarithm |
|---|---|---|---|---|---|---|---|
| Shape-based features | 14 | – | – | – | – | – | – |
| First-order statistics | 18 | 144 | 18 | 18 | 18 | 18 | 18 |
| GLCM | 24 | 192 | 24 | 24 | 24 | 24 | 24 |
| GLDM | 14 | 112 | 14 | 14 | 14 | 14 | 14 |
| GLRLM | 16 | 128 | 16 | 16 | 16 | 16 | 16 |
| GLSZM | 16 | 128 | 16 | 16 | 16 | 16 | 16 |
| NGTDM | 5 | 40 | 5 | 5 | 5 | 5 | 5 |
GLCM, Gray-level co-occurrence matrix; GLDM, Gray-level dependence matrix; GLRLM, Gray-level run length matrix; GLSZM, Gray-level size zone matrix; NGTDM, Neighboring gray tone difference matrix.
Figure 3.The ROC curves of the training dataset and testing dataset with the optimal model. The left panel shows the mean ROC curve and the 95% CI for the training dataset (a). The right panel shows the mean ROC curve and the 95% CI for the validation dataset (b).
The performance of 20 combinations of machine learning methods in predicting the occurrence of rapid progression in patients with COVID-19
| Model | No. of | Train_ | Train_ | Train_ | Train_ | Train_ | Train_AUC | Test_ | Test_ | Test_ | Test_ | Test_ | Test_AUC |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| LASSO + SVM | 11 | 69.6% | 92.1% | 63.5% | 40.6% | 96.7% | 0.854 | 71.6% | 87.5% | 67.2% | 42.4% | 95.1% | 0.844 |
| LASSO + LR | 11 | 75.7% | 81.0% | 74.3% | 46.0% | 93.5% | 0.860 | 77.0% | 62.5% | 81.0% | 47.6% | 88.7% | 0.825 |
| LASSO + DT | 11 | 98.7% | 100.0% | 98.3% | 94.0% | 100.0% | 1.000 | 75.7% | 43.8% | 84.5% | 43.8% | 84.5% | 0.645 |
| LASSO + RF | 11 | 97.0% | 87.3% | 99.6% | 98.2% | 96.7% | 0.998 | 75.7% | 0 | 96.6% | 0 | 77.8% | 0.736 |
| Relief + SVM | 18 | 65.2% | 73.0% | 63.1% | 34.9% | 89.6% | 0.777 | 71.6% | 81.3% | 69.0% | 41.9% | 93.0% | 0.802 |
| Relief + LR | 18 | 73.0% | 81.0% | 70.8% | 42.9% | 93.2% | 0.831 | 77.0% | 68.8% | 79.3% | 47.8% | 90.2% | 0.842 |
| Relief + DT | 18 | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 1.000 | 73.0% | 12.5% | 89.7% | 25.0% | 78.8% | 0.511 |
| Relief + RF | 18 | 90.9% | 60.3% | 99.1% | 95.0% | 90.2% | 0.920 | 81.1% | 12.5% | 100.0% | 100.0% | 80.6% | 0.713 |
| LVW + SVM | 2 | 60.1% | 74.6% | 56.2% | 31.5% | 89.1% | 0.758 | 54.1% | 50.0% | 55.2% | 23.5% | 80.0% | 0.642 |
| LVW + LR | 5 | 68.2% | 77.8% | 65.7% | 38.0% | 91.6% | 0.789 | 71.6% | 75.0% | 70.7% | 41.4% | 91.1% | 0.780 |
| LVW + DT | 24 | 99.0% | 100.0% | 98.7% | 95.5% | 100.0% | 0.999 | 68.9% | 31.3% | 79.3% | 29.4% | 80.7% | 0.554 |
| LVW + RF | 6 | 98.0% | 90.5% | 100.0% | 100.0% | 97.5% | 0.999 | 77.0% | 12.5% | 94.8% | 40.0% | 79.7% | 0.671 |
| L1-norm-SVM+SVM | 17 | 69.9% | 93.7% | 63.5% | 41.0% | 97.4% | 0.885 | 74.3% | 87.5% | 70.7% | 45.2% | 95.4% | 0.857 |
| L1-norm-SVM+LR | 17 | 77.7% | 84.1% | 76.0% | 48.6% | 94.7% | 0.889 | 81.1% | 75.0% | 82.8% | 54.6% | 92.3% | 0.851 |
| L1-norm-SVM+DT | 17 | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 1.000 | 81.1% | 31.3% | 94.8% | 62.5% | 83.3% | 0.630 |
| L1-norm-SVM+RF | 17 | 98.7% | 93.7% | 100.0% | 100.0% | 98.3% | 1.000 | 81.1% | 31.3% | 94.8% | 62.5% | 83.3% | 0.728 |
| RFE + SVM | 44 | 88.2% | 100.0% | 85.0% | 64.3% | 100.0% | 0.954 | 73.0% | 62.5% | 75.9% | 41.7% | 88.0% | 0.776 |
| RFE + LR | 44 | 83.1% | 92.1% | 80.7% | 56.3% | 97.4% | 0.930 | 77.0% | 62.5% | 81.0% | 47.6% | 88.7% | 0.834 |
| RFE + DT | 44 | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 1.000 | 70.3% | 18.8% | 84.5% | 25.0% | 79.0% | 0.516 |
| RFE + RF | 44 | 97.3% | 87.3% | 100.0% | 100.0% | 96.7% | 0.998 | 78.4% | 6.3% | 98.3% | 50.0% | 79.2% | 0.658 |
ACC, Accuracy; AUC, Area under the curve; DT, Decision tree;LASSO, Least absolute shrinkage and selection operator; LR, Logistic regression; LVW, Las vegas wrapper; NPV, Negative predictive value; PPV, Positive predictive value; RF, Random forest;SP, Specificity; SVM, Support vector machine.