| Literature DB >> 35562705 |
Seongwon Na1, Yu Sub Sung2,3, Yousun Ko4, Youngbin Shin4, Junghyun Lee5, Jiyeon Ha6, Su Jung Ham7, Kyoungro Yoon8, Kyung Won Kim9.
Abstract
BACKGROUND: Despite the dramatic increase in the use of medical imaging in various therapeutic fields of clinical trials, the first step of image quality check (image QC), which aims to check whether images are uploaded appropriately according to the predefined rules, is still performed manually by image analysts, which requires a lot of manpower and time.Entities:
Keywords: Artificial intelligence; Computed tomography series; Deep learning; Image quality check; Transfer learning; Two-dimensional convert method
Mesh:
Year: 2022 PMID: 35562705 PMCID: PMC9107169 DOI: 10.1186/s12880-022-00815-4
Source DB: PubMed Journal: BMC Med Imaging ISSN: 1471-2342 Impact factor: 2.795
Summary of demographic variables for development, internal validation, and external validation
| Variables | Development set (n = 923/1042)* | Internal validation set (n = 101/179) | External validation set (n = 301/448) | |||
|---|---|---|---|---|---|---|
| Female (%, female:male) | 46.3 (427:496) | 36.6 (37:64) | 37.5 (113:188) | |||
| Age | 60.3 ± 14.0 | 60.5 ± 12.1 | 65.4 ± 14.6 | |||
| Height (cm) | 163.0 ± 9.1 | 163.8 ± 8.9 | 162.7 ± 9.8 | |||
| Weight (g) | 62.4 ± 12.2 | 64.2 ± 13.8 | 62.1 ± 13.2 | |||
| BMI | 23.5 ± 3.7 | 23.8 ± 3.8 | 23.4 ± 4.0 | |||
* Number of patients/CT scans
Fig. 1Details of deep learning model training and internal validation
Fig. 2Overview of the ImageQC-net pipeline
Fig. 3The image reconstruction method used in this study. The 3D CT image data with multiple slices are transformed into a 2D representative images using the maximum intensity projection in a coronal image (MIPcoronal) (a), average intensity projection (AIPcoronal) (b), and mid-coronal plane (c). In this example, we used a contrast-enhanced abdomen and pelvis CT
Performance of individual DLMs for body part classification: comparison among the algorithm models based on preprocessing methods and planes
| Pre-processing Method | Internal validation set | External validation set | ||||||
|---|---|---|---|---|---|---|---|---|
| Precision (%) | Recall (%) | Accuracy (%) | F1-score (%) | Precision (%) | Recall (%) | Accuracy (%) | F1-score (%) | |
| Axial | 100 | 100 | 100 | 100 | 94.99 | 94.15 | 94.86 | 94.6 |
| Sagittal | 100 | 100 | 100 | 100 | 97.52 | 96.85 | 97.09 | 97.2 |
| Coronal | 100 | 100 | 100 | 100 | 95.24 | 91.73 | 93.75 | 93.45 |
| Axial | 100 | 100 | 100 | 100 | 98.85 | 98.33 | 98.66a | 98.6 |
| Sagittal | 100 | 100 | 100 | 100 | 99.33 | 99.33 | 99.33ab | 99.3 |
| Coronal | 100 | 100 | 100 | 100 | 98.85 | 98.33 | 98.66a | 98.6 |
| Axial | 100 | 100 | 100 | 100 | 96.24 | 96.51 | 96.2 | 96.4 |
| Sagittal | 100 | 100 | 100 | 100 | 97.7 | 96.32 | 97.32 | 97 |
| Coronal | 100 | 100 | 100 | 100 | 96.24 | 96.51 | 96.2 | 96.3 |
aThe DLMs with the highest performance in each plane are selected to apply for ensemble AI model
bThe DLM with the highest performance from all pre-processing methods is selected as the best performing individual DLM
Fig. 4Comparison between the individual DLM and the ensemble AI model for body art classification based on the external dataset. a Confusion matrices of the best performing individual DLM and the ensemble AI model. b Abdominal CT misclassified as chest CT in both the best performing individual DLM and the ensemble AI model. c Neck CT misclassified as abdominal CT from the best performing DLM. d Abdominopelvic CT misclassified as abdominal CT from the best performing DLM
Performance of individual DLMs for contrast-enhancement classification: comparison among the algorithm models in each body part
| Body part | Preprocessing | Internal validation set | External validation set | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Precision (%) | Recall (%) | Accuracy (%) | F1-score (%) | Precision (%) | Recall (%) | Accuracy (%) | F1-score (%) | ||
| Brain | Mid-axial | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 |
| Neck | 100 | 100 | 100 | 100 | 98.38 | 98.33 | 98.33 | 98.3 | |
| Chest | 100 | 100 | 100 | 100 | 98.38 | 98.48 | 98.41 | 98.4 | |
| Abdomen | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | |
| Abdomen and pelvis | 100 | 100 | 100 | 100 | 97.78 | 97.87 | 98.34 | 97.9 | |
| Brain | Mid-sagittal | 100 | 100 | 100 | 100 | 99.13 | 98.33 | 98.85 | 98.7 |
| Neck | 100 | 100 | 100 | 100 | 84.09 | 76.66 | 76.66 | 80 | |
| Chest | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | |
| Abdomen | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | |
| Abdomen and pelvis | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | |
| Brain | Mid-coronal | 100 | 100 | 100 | 100 | 99.13 | 98.33 | 98.85 | 98.7 |
| Neck | 100 | 100 | 100 | 100 | 91.17 | 91.66 | 91.66 | 91.4 | |
| Chest | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | |
| Abdomen | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | |
| Abdomen and pelvis | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | |
Fig. 5Comparison between the individual DLM and the ensemble AI model for contrast-enhancement classification based on the external dataset. a Confusion matrices of the best performing individual DLM and the ensemble AI model. b Misclassified case in the individual DLM. In the neck mid-plane CT image, the neck vessels are small; thus, the DLM algorithm may not identify contrast enhancement
Fig. 6overall DLM and ensemble AI model classification performance for the external dataset. a Overall DLM classification performance for the external dataset. b Overall Ensemble AI model classification performance for the external dataset. c Ensemble AI model external dataset misclassified case
Fig. 7ImageQC-net GUI software. a ImageQC-net GUI software. b Results available to users