| Literature DB >> 33927208 |
Parnian Afshar1, Shahin Heidarian2, Nastaran Enshaei1, Farnoosh Naderkhani1, Moezedin Javad Rafiee3, Anastasia Oikonomou4, Faranak Babaki Fard5, Kaveh Samimi6, Konstantinos N Plataniotis7, Arash Mohammadi8.
Abstract
Novel Coronavirus (COVID-19) has drastically overwhelmed more than 200 countries affecting millions and claiming almost 2 million lives, since its emergence in late 2019. This highly contagious disease can easily spread, and if not controlled in a timely fashion, can rapidly incapacitate healthcare systems. The current standard diagnosis method, the Reverse Transcription Polymerase Chain Reaction (RT- PCR), is time consuming, and subject to low sensitivity. Chest Radiograph (CXR), the first imaging modality to be used, is readily available and gives immediate results. However, it has notoriously lower sensitivity than Computed Tomography (CT), which can be used efficiently to complement other diagnostic methods. This paper introduces a new COVID-19 CT scan dataset, referred to as COVID-CT-MD, consisting of not only COVID-19 cases, but also healthy and participants infected by Community Acquired Pneumonia (CAP). COVID-CT-MD dataset, which is accompanied with lobe-level, slice-level and patient-level labels, has the potential to facilitate the COVID-19 research, in particular COVID-CT-MD can assist in development of advanced Machine Learning (ML) and Deep Neural Network (DNN) based solutions.Entities:
Mesh:
Year: 2021 PMID: 33927208 PMCID: PMC8085195 DOI: 10.1038/s41597-021-00900-3
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Available COVID-19 CT scan datasets. NA stands for not available.
| Dataset | Number of cases | Label type | Data Source | CT volume | Label Level | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| COVID | CAP | Normal | Classification | Segmentation | Multiple | Single | Available | Not available | Patient-level | Slice-level | Lobe-level | |
| Reference[ | 49 | NA | NA | ✓ | ✓ | ✓ | ✓ | |||||
| Reference[ | 20 | NA | NA | ✓ | ✓ | ✓ | ✓ | |||||
| Reference[ | 20 | NA | NA | ✓ | ✓ | ✓ | ✓ | |||||
| Reference[ | 856 | NA | 254 | ✓ | ✓ | ✓ | ✓ | |||||
| Reference[ | 216 | NA | 55 | ✓ | ✓ | ✓ | ✓ | |||||
| Reference[ | 60 | NA | 60 | ✓ | ✓ | ✓ | ✓ | |||||
| Reference[ | 95 | NA | 282 | ✓ | ✓ | ✓ | ✓ | ✓ | ||||
| Reference[ | 2,980 | NA | NA | ✓ | ✓ | ✓ | ✓ | |||||
| COVID-CT-MD | 169 | 60 | 76 | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | |||
CT scan settings used to acquire the COVID-CT-MD dataset.
| Diagnosis | Slice Thickness (mm) | Peak Kilovoltage (kVp) | Exposure Time (ms) | X-ray Tube Current (mA) | SID (mm) | SOD (mm) | Exposure values (mAs) |
|---|---|---|---|---|---|---|---|
| 2 | 110–130 | 600 | 153–343 | 940 | 535 | 61.2–180.0 | |
| 2 | 110–120 | 420–600 | 94–500 | 940–1040 | 535–570 | 38.4–175.24 | |
| 2 | 110 | 600 | 132–343 | 940 | 535 | 60.4–163.71 |
Fig. 1The distribution of the Exposure values for COVID-19, CAP and Normal cases.
The statistical parameters (mean and standard deviation) of the Exposure values.
| Diagnosis | Exposure mean | Exposure standard deviation |
|---|---|---|
| 111.43 | 23.70 | |
| 96.64 | 29.75 | |
| 109.18 | 23.97 |
Gender and age distribution in COVID-CT-MD.
| Diagnosis | Cases | Gender | Age (year) |
|---|---|---|---|
| 169 | 108 M/61 F | 51.96 ± 14.39 | |
| 60 | 35 M/25 F | 57.7 ± 21.7 | |
| 76 | 40 M/36 F | 43.4 ± 14.1 |
Fig. 2(a) The number of cases separated by the patient’s gender. (b) The distribution of age for COVID-19, CAP and Normal cases.
The number of cases, Slices, and Infection Ratio in the labeled dataset.
| Diagnosis | Cases | Slices Demonstrating Infection | Slice without infection | Infection Ratio |
|---|---|---|---|---|
| 54 | 3779 | 4269 | 7.0%–86.2% | |
| 25 | 1178 | 2718 | 7.8%–56.8% |
Fig. 3(a) The distribution of the Infection Ratio in the labeled dataset for COVID-19 and CAP cases. (b) The histogram of the Infection Ratio in the labeled dataset for COVID-19 and CAP cases.
Number of cases and slices, respectively, demonstrating infection in each lobe.
| Diagnosis | LLL | LUL | RLL | RML | RUL |
|---|---|---|---|---|---|
| 42&1669 | 38&1120 | 45&2008 | 26&420 | 29&826 | |
| 13&374 | 5&117 | 18&519 | 7&186 | 9&208 | |
| 56&2079 | 43&1237 | 63&2527 | 33&606 | 38&1034 |
LLL: Left Lower Lobe–LUL: Left Upper Lobe–RLL: Right Lower Lobe and Lingula–RML: Right Middle Lobe–RUL: Right Upper Lobe.
Fig. 4Average Infection Ratio in each lobe of the lung for COVID-19 and CAP cases in the labeled dataset.
Fig. 5Structure of the data included in COVID-CT-MD dataset.
| Measurement(s) | Low Dose Computed Tomography of the Chest • viral infectious disease |
| Technology Type(s) | digital curation • image processing technique |
| Factor Type(s) | sex • gender • age group • weight • clinical characteristics • covid-19 RT-PCR result • follow-up data |
| Sample Characteristic - Organism | Homo sapiens |