| Literature DB >> 35362751 |
Yeshaswini Nagaraj1,2, Gonda de Jonge3, Anna Andreychenko4, Gabriele Presti5, Matthias A Fink6,7, Nikolay Pavlov4, Carlo C Quattrocchi5, Sergey Morozov4, Raymond Veldhuis8, Matthijs Oudkerk9,10, Peter M A van Ooijen11,12.
Abstract
OBJECTIVE: To develop an automatic COVID-19 Reporting and Data System (CO-RADS)-based classification in a multi-demographic setting.Entities:
Keywords: COVID-19; Deep learning; Diagnostic imaging; SARS-CoV-2; Tomography X-ray computed
Mesh:
Year: 2022 PMID: 35362751 PMCID: PMC8973680 DOI: 10.1007/s00330-022-08730-6
Source DB: PubMed Journal: Eur Radiol ISSN: 0938-7994 Impact factor: 7.034
Fig. 1The overall workflow followed to validate machine learning models for automated COVID-19 suspicion staging based on the CO-RADS protocol. The scans with COVID-19 suspicion were selected from the Italian and Russian subcohorts retrospectively and annotated by experienced radiologists from different countries. In the first step, the datasets were processed using deep learning–based noise reduction (DLNR) and 3D segmentation masks were generated for each scan. Next, radiomic features were extracted and classified using ML algorithms. In the final step, statistical evaluation of the standard performance metrics and visualization of class-specific features were carried out to enhance the explainability of the models
Fig. 2Data flowchart of Russian and Italian subcohorts included for the study with the training and test split. Below the flowchart, a description of different evaluation settings of data scenarios is depicted. Note that n refers to the number of patients. CO-RADS–COVID-19 Reporting and Data Systems, CT 0, 1, 2, 3, 4–severity–based Russian annotations
Acquisition and reconstruction parameters of Italian and Russian subcohorts
| Italy | Russia | |||
|---|---|---|---|---|
| Acquisition parameters | ||||
| Scanner | SOMATOM force | Toshiba Aquilion64 | ||
| Scan mode | Spiral | Helical | ||
| Pitch | 1.2–1.5 | 1.484 | ||
| Tube voltage [kVp] | 100–150 | 120 | ||
| Tube current [mAs] | 33–296 | 80–500 (automatically adjusted to achieve noise level of 10 HU for 5.0-mm-thick slices) | ||
| Contrast | No | No | ||
| API (inspiration/expiration/mixed) | Inspiration | Suspended inspiration | ||
| Direction | Craniocaudal | Caudocranial | ||
| Upper limit | Pulmonary apex | 5 cm above lungs | ||
| Lower limit | Lower diaphragmatic limit | 5 cm below lungs | ||
| Reconstruction parameters | ||||
| Slice thickness [mm] | 1–3 | 1.0 | ||
| Slice increment [mm] | 1 | 0.8 | ||
| FOV [mm] | 350–500 | 350–500 | ||
| Reconstruction kernel | BL64–BR40 | FC07 | ||
| Reconstruction method | Iterative | QDS + (FBP) | ||
| Window width [HU] | 300/400 | 1500/1600 | 400 | 1500 |
| Window center [HU] | 30/40 | − 500/− 600 | 40 | − 500 |
Data characteristics and CT features of participants in Italian and Russian subcohorts
| Parameter | Italy | Russia | Total |
|---|---|---|---|
| 308 | 365 | 673 | |
| Age | – | 18–97 years | – |
| No. of CT scans per scoring | |||
| CO-RADS 1 | 87 (29%) | 140 (38%) | 227 |
| CO-RADS 2 | 15 (5%) | 30 (8%) | 45 |
| CO-RADS 3 | 74 (25%) | 39 (11%) | 113 |
| CO-RADS 4 | 45 (15%) | 64 (18%) | 109 |
| CO-RADS 5 | 76 (26%) | 88 (24%) | 164 |
| CT patterns | |||
| GGO | 194 (65%) | 200 (55%) | 394 |
| Pleural effusion | 77 (26%) | 3(0.8%) | 80 |
| Consolidation | 101 (34%) | 102(28%) | 203 |
| Crazy paving | 74 (25%) | 30(8%) | 104 |
| Vascular thickening | 63 (21%) | 106(29%) | 169 |
Number of samples (n) includes CO-RADS-0
Inter-observer variability between radiologists for each CO-RADS classification
| Type | Kappa | (95% CI) | (95% CI) | |
|---|---|---|---|---|
| Overall | 0.868 | 0.705 | 1.20 | < 0.001 |
| CO-RADS 1 | 0.938 | 0.765 | 1.10 | < 0.001 |
| CO-RADS 2 | 0.769 | 0.589 | 0.934 | < 0.001 |
| CO-RADS 3 | 0.913 | 0.741 | 1.00 | < 0.001 |
| CO-RADS 4 | 0.721 | 0.560 | 0.92 | < 0.001 |
| CO-RADS 5 | 1.00 | 0.841 | 1.17 | < 0.001 |
CI confidence interval
Area under the receiver operating curve (AUC) of all the machine learning (ML) algorithms for respective CO-RADS classification in each setting
| Settings | Type | Logistic regression | Multilayer perceptron | Random forest |
|---|---|---|---|---|
| Setting 1 | CO-RADS ≥ 2 | 0.82 (± 0.08) | 0.85 (± 0.06) | 0.89 (± 0.07) |
| CO-RADS 3 + 4 + 5 | 0.83 (± 0.06) | 0.85 (± 0.06) | 0.89 (± 0.06) | |
| CO-RADS 4 + 5 | 0.85 (± 0.06) | 0.86 (± 0.06) | 0.84 (± 0.06) | |
| CO-RADS 5 | 0.82 (± 0.08) | 0.81 (± 0.07) | 0.88 (± 0.06) | |
| Setting 2 | CO-RADS ≥ 2 | 0.86 (± 0.06) | 0.89 (± 0.06) | 0.97 (± 0.04) |
| CO-RADS 3 + 4 + 5 | 0.83 (± 0.06) | 0.92 (± 0.06) | 0.94 (± 0.06) | |
| CO-RADS 4 + 5 | 0.84 (± 0.06) | 0.88 (± 0.05) | 0.88 (± 0.04) | |
| CO-RADS 5 | 0.80 (± 0.04) | 0.79 (± 0.04) | 0.92 (± 0.07) | |
| Setting 3 | CO-RADS ≥ 2 | 0.77 (± 0.09) | 0.75 (± 0.07) | 0.79 (± 0.08) |
| CO-RADS 3 + 4 + 5 | 0.75 (± 0.06) | 0.71 (± 0.06) | 0.73 (± 0.06) | |
| CO-RADS 4 + 5 | 0.65 (± 0.06) | 0.70 (± 0.07) | 0.79 (± 0.09) | |
| CO-RADS 5 | 0.66 (± 0.06) | (0.71 ± 0.06) | 0.71 (± 0.06) | |
| Setting 4 | CO-RADS ≥ 2 | 0.78 (± 0.08) | 0.82 (± 0.06) | 0.83 (± 0.07) |
| CO-RADS 3 + 4 + 5 | 0.77 (± 0.06) | 0.78 (± 0.05) | 0.78 (± 0.03) | |
| CO-RADS 4 + 5 | 0.73 (± 0.06) | 0.73 (± 0.06) | 0.75 (± 0.05) | |
| CO-RADS 5 | 0.70 (± 0.06) | 0.73 (± 0.06) | 0.75 (± 0.04) | |
| Setting 5 | CO-RADS ≥ 2 | 0.69 (± 0.06) | 0.67 (± 0.06) | 0.68 (± 0.06) |
| CO-RADS 3 + 4 + 5 | 0.73 (± 0.08) | 0.72 (± 0.07) | 0.73 (± 0.06) | |
| CO-RADS 4 + 5 | 0.70 (± 0.06) | 0.68 (± 0.07) | 0.69 (± 0.07) | |
| CO-RADS 5 | 0.70 (± 0.07) | 0.69 (± 0.10) | 0.71 (± 0.08) | |
| Setting 6 | CO-RADS ≥ 2 | 0.74 (± 0.06) | 0.76 (± 0.06) | 0.76 (± 0.06) |
| CO-RADS 3 + 4 + 5 | 0.71 (± 0.04) | 0.76 (± 0.06) | 0.77 (± 0.07) | |
| CO-RADS 4 + 5 | 0.79 (± 0.06) | 0.73 (± 0.08) | 0.79 (± 0.07) | |
| CO-RADS 5 | 0.74 (± 0.09) | 0.75 (± 0.09) | 0.74 (± 0.08) |
Performance metrics of the best machine learning (ML) algorithms for respective CO-RADS classification in each setting
| Settings | Type | TP | FP | Precision | Recall | AUC | |
|---|---|---|---|---|---|---|---|
| Setting 1 | CO-RADS ≥ 2 | 50 | 5 | 0.91 | 0.91 | 0.91 | 0.89 (± 0.07) |
| CO-RADS 3 + 4 + 5 | 44 | 7 | 0.86 | 0.92 | 0.89 | 0.89 (± 0.06) | |
| CO-RADS 4 + 5 | 35 | 9 | 0.80 | 0.92 | 0.85 | 0.86 (± 0.06) | |
| CO-RADS 5 | 19 | 8 | 0.70 | 0.86 | 0.78 | 0.88 (± 0.06) | |
| Setting 2 | CO-RADS ≥ 2 | 52 | 2 | 0.96 | 0.95 | 0.95 | 0.97 (± 0.04) |
| CO-RADS 3 + 4 + 5 | 45 | 5 | 0.90 | 0.94 | 0.92 | 0.94 (± 0.06) | |
| CO-RADS 4 + 5 | 36 | 9 | 0.80 | 0.95 | 0.87 | 0.88 (± 0.06) | |
| CO-RADS 5 | 20 | 4 | 0.83 | 0.91 | 0.87 | 0.92 (± 0.07) | |
| Setting 3 | CO-RADS ≥ 2 | 159 | 12 | 0.93 | 0.76 | 0.83 | 0.79 (± 0.07) |
| CO-RADS 3 + 4 + 5 | 124 | 15 | 0.89 | 0.64 | 0.74 | 0.75 (± 0.06) | |
| CO-RADS 4 + 5 | 100 | 52 | 0.66 | 0.83 | 0.73 | 0.79 (± 0.09) | |
| CO-RADS 5 | 55 | 72 | 0.43 | 0.72 | 0.54 | 0.71 (± 0.06) | |
| Setting 4 | CO-RADS ≥ 2 | 163 | 7 | 0.96 | 0.78 | 0.86 | 0.83 (± 0.07) |
| CO-RADS 3 + 4 + 5 | 169 | 37 | 0.82 | 0.86 | 0.84 | 0.78 (± 0.03) | |
| CO-RADS 4 + 5 | 102 | 52 | 0.65 | 0.84 | 0.73 | 0.75 (± 0.04) | |
| CO-RADS 5 | 72 | 82 | 0.47 | 0.95 | 0.63 | 0.75 (± 0.08) | |
| Setting 5 | CO-RADS ≥ 2 | 145 | 45 | 0.76 | 0.66 | 0.71 | 0.69 (± 0.07) |
| CO-RADS 3 + 4 + 5 | 142 | 58 | 0.71 | 0.74 | 0.73 | 0.73 (± 0.06) | |
| CO-RADS 4 + 5 | 114 | 71 | 0.62 | 0.75 | 0.68 | 0.69 (± 0.06) | |
| CO-RADS 5 | 65 | 93 | 0.41 | 0.74 | 0.53 | 0.71 (± 0.08) | |
| Setting 6 | CO-RADS ≥ 2 | 156 | 21 | 0.88 | 0.70 | 0.79 | 0.76 (± 0.06) |
| CO-RADS 3 + 4 + 5 | 160 | 49 | 0.77 | 0.84 | 0.80 | 0.77 (± 0.07) | |
| CO-RADS 4 + 5 | 120 | 43 | 0.74 | 0.79 | 0.76 | 0.79 (± 0.06) | |
| CO-RADS 5 | 66 | 68 | 0.49 | 0.75 | 0.59 | 0.75 (± 0.09) |
Fig. 3The receiver operating curves of machine learning algorithms showing their ability to classify normal chest CT from other CO-RADS stages on the hold-out dataset (scenario 1). The performance of the classifiers increased after noise reduction
Fig. 4Top twenty feature visualization using SHAP for CO-RADS classification in each data setting. The features are arranged in a descending order of feature importance (SHAP values). Using this feature importance map, one can observe how each feature contributes to the machine learning model’s predictions and identifies the common features
Fig. 5The class-specific feature summery plot for CO-RADS 5 using SHAP in each setting. The feature impact on the classification is observed by a positive SHAP value indicated by red color. For example, “wavelet (LH) GLCM Imc1” shows positive impact on CO-RADS 5 prediction in most of the settings