| Literature DB >> 35468985 |
Taeyong Park1, Min A Yoon2, Young Chul Cho3, Su Jung Ham1, Yousun Ko3, Sehee Kim4, Heeryeol Jeong5, Jeongjin Lee5.
Abstract
Although CT radiomics has shown promising results in the evaluation of vertebral fractures, the need for manual segmentation of fractured vertebrae limited the routine clinical implementation of radiomics. Therefore, automated segmentation of fractured vertebrae is needed for successful clinical use of radiomics. In this study, we aimed to develop and validate an automated algorithm for segmentation of fractured vertebral bodies on CT, and to evaluate the applicability of the algorithm in a radiomics prediction model to differentiate benign and malignant fractures. A convolutional neural network was trained to perform automated segmentation of fractured vertebral bodies using 341 vertebrae with benign or malignant fractures from 158 patients, and was validated on independent test sets (internal test, 86 vertebrae [59 patients]; external test, 102 vertebrae [59 patients]). Then, a radiomics model predicting fracture malignancy on CT was constructed, and the prediction performance was compared between automated and human expert segmentations. The algorithm achieved good agreement with human expert segmentation at testing (Dice similarity coefficient, 0.93-0.94; cross-sectional area error, 2.66-2.97%; average surface distance, 0.40-0.54 mm). The radiomics model demonstrated good performance in the training set (AUC, 0.93). In the test sets, automated and human expert segmentations showed comparable prediction performances (AUC, internal test, 0.80 vs 0.87, p = 0.044; external test, 0.83 vs 0.80, p = 0.37). In summary, we developed and validated an automated segmentation algorithm that showed comparable performance to human expert segmentation in a CT radiomics model to predict fracture malignancy, which may enable more practical clinical utilization of radiomics.Entities:
Mesh:
Year: 2022 PMID: 35468985 PMCID: PMC9038736 DOI: 10.1038/s41598-022-10807-7
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1Flow diagram of the study.
Details of CT protocols.
| Scanner | Somatom Sensation 16, Somatom Definition Edge, Flash, Force, AS or AS+ (Siemens Healthineers) | LightSpeed VCT, Optima CT660 or Discovery CT750HD (GE Healthcare) |
| Tube voltage (kVp) | 120 | 120 |
| Time–current product | Care Dose 4D with quality reference mAs of 200 | auto mA and Smart mA (minimum of 100 and maximum of 400 mA) with a noise index set to 21.0 HU |
| Detector collimation (mm) | 0.6 | 1.25 |
| Rotation time (s) | 0.5 | 0.5 |
| Pitch | 1.0 | 0.97 |
| Reconstruction | Axial plane at 1 mm slice thicknesses with 0.7 mm increments using a standard kernel (B30 filter) | Axial plane at 1.25 mm slice thicknesses with 0.8 mm increments using a standard kernel |
| Voxel size (mm) | 0.293 × 0.293 × 1 (FOV 150 × 150) (most commonly used) (range, 0.287 × 0.287 × 1 [FOV, 147 × 147] − 0.324 × 0.324 × 1 [FOV, 166 × 166]) | |
| Matrix | 512 × 512 | |
Figure 2The proposed convolutional neural network (CNN) to segment fractured vertebral bodies on CT. (a) Overview of the development of the CNN and its detailed architecture. (b) Overall process of vertebral detection. (c) MBConV block used for the encoding path. (d) Attention block used for the decoding path.
Baseline demographic and clinical characteristics of patients in the training and test sets. Data for age are means ± standard deviation.
| Training set (n = 158) | Test set (n = 118) | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Benign (n = 84) | Malignant (n = 74) | Internal (n = 59) | External (n = 59) | ||||||
| Benign (n = 27) | Malignant (n = 32) | Benign (n = 29) | Malignant (n = 30) | ||||||
| Fractured bodies | 188 | 153 | 37 | 49 | 56 | 46 | |||
| (acute: chronic = 116:72) | (acute: chronic = 30:7) | (acute: chronic = 39:17) | |||||||
| Vertebral levels (thoracic:lumbar) | Acute: 35:81, chronic: 31:41, malignant: 100:53 | Acute: 9:21, chronic: 2:5, malignant: 28:21 | Acute: 9:30, chronic: 6:11, malignant: 27:19 | ||||||
| Age (years) | 72 ± 14 | 59 ± 12 | < 0.001 | 67 ± 17 | 59 ± 14 | 0.08 | 66 ± 17 | 60 ± 15 | 0.15 |
| Sex (men:women) | 25:59 | 41:33 | < 0.001 | 9:18 | 19:13 | 0.05 | 9:20 | 19:11 | 0.01 |
| Origins of malignant fractures | Lung (n = 16), hepatobiliary (n = 13), multiple myeloma (n = 7), kidney (n = 6), colorectal (n = 6), breast (n = 5), stomach (n = 4), thyroid cancer (n = 3), neuroendocrine (n = 2), urothelial (n = 2), and others (n = 10) | Lung (n = 8), hepatobiliary (n = 6), breast (n = 5), prostate (n = 3), and others (n = 10) | Lung (n = 12), breast (n = 3), prostate (n = 3), hepatobiliary (n = 3), pancreas (n = 2), multiple myeloma (n = 2), and others (n = 5) | ||||||
Accuracy of automated segmentation algorithm for fractured vertebral body segmentation. All results are shown as median and interquartile ranges in brackets. DSC indicates dice similarity coefficient; CSA, cross-sectional area; ASD, average surface distance. ap-value for comparison between chronic benign, acute benign and malignant fractures.
| Internal test set | External test set | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Overall | Chronic Benign | Acute Benign | Malignant | Overall | Chronic Benign | Acute Benign | Malignant | |||
| (n = 86) | (n = 7) | (n = 30) | (n = 49) | (n = 102) | (n = 17) | (n = 39) | (n = 46) | |||
| DSC | 0.94 [0.92, 0.95] | 0.95 [0.94, 0.95] | 0.94 [0.93, 0.95] | 0.93 [0.90, 0.95] | 0.02 | 0.93 [0.92, 0.95] | 0.94 [0.92, 0.95] | 0.94 [0.93, 0.95] | 0.93 [0.88, 0.94] | < 0.001 |
| CSA error (%) | 2.66 [1.32, 4.43] | 3.24 [0.15, 3.41] | 3.14 [1.62, 4.23] | 2.63 [1.19, 5.02] | 0.40 | 2.97 [1.09, 4.96] | 2.05 [0.64, 3.97] | 2.51 [0.93, 4.01] | 3.92 [1.91, 7.68] | 0.01 |
| ASD (mm) | 0.40 [0.32, 0.55] | 0.38 [0.32, 0.40] | 0.35 [0.31, 0.39] | 0.48 [0.38, 0.62] | < 0.001 | 0.54 [0.42, 0.72] | 0.48 [0.38, 0.63] | 0.47 [0.36, 0.55] | 0.63 [0.48, 0.99] | < 0.001 |
Figure 3Representative images of automated segmentation of fractured vertebral bodies from the internal test set. (a) a 76-year-old woman with an acute benign fracture (voxel size, 0.287 × 0.287 × 1 mm), (b) a 19-year-old man with a malignant fracture from metastatic Ewing sarcoma/PNET (voxel size, 0.293 × 0.293 × 1 mm), and (c) a 68-year-old man with a malignant fracture from metastatic renal cell carcinoma (voxel size, 0.309 × 309 × 1 mm). When the osseous margin of the vertebral body could not be fully traced because of bone destruction, an imaginary line was drawn based on the contralateral normal appearing cortex or the most adjacent intact vertebral body as shown in (c). The green shaded area denotes segmentation by the human experts and the red shaded area denotes automated segmentation. The last two columns show three-dimensional volume meshes by the human experts (green) and the automated algorithm (red).
List of 12 radiomics features used to develop a radiomics model to predict malignancy of fracture. LoG indicates Laplacian of Gaussian Filtered features.
| Feature family | Feature name | LASSO coefficient (β) |
|---|---|---|
| Intercept | − 0.176 | |
| Morphological features | Approximate volume | − 0.483 |
| Major axis length | − 0.544 | |
| Local intensity features | Global intensity peak | 0.837 |
| Intensity-based statistical features | Minimum gray level | 0.745 |
| Intensity histogram features | Intensity histogram mean | 0.730 |
| Intensity histogram robust mean absolute deviation | 0.104 | |
| Gray level co-occurrence matrix | Joint entropy | − 0.088 |
| Gray level size zone matrix | Small zone low gray level emphasis | − 0.994 |
| Neighboring gray level dependence matrix | High dependence emphasis | 0.329 |
| LoG local intensity features | Local intensity peak | − 0.444 |
| LoG intensity-based statistical features | 75th percentile | − 0.105 |
| LoG filtered intensity histogram features | Maximum histogram gradient | − 0.051 |
Diagnostic performance of the radiomics prediction model. Numbers in brackets indicate 95% confidence interval.
| Internal test set (n = 59) | External test set. (n = 59) | |||||
|---|---|---|---|---|---|---|
| Human expert segmentation | Automated segmentation | Human expert segmentation | Automated segmentation | |||
| AUC | 0.87 [0.78, 0.96] | 0.80 [0.69, 0.91] | 0.044 | 0.80 [0.69, 0.92] | 0.83 [0.72, 0.94] | 0.37 |
| Accuracy (%) | 78 (46/59) [67, 89] | 71 (42/59) [60, 83] | 0.22 | 76 (45/59) [65, 87] | 76 (45/59) [65, 87] | > 0.999 |
| Sensitivity (%) | 78 (25/32) [64, 92] | 72 (23/32) [56, 88] | 0.63 | 77 (23/30) [62, 92] | 80 (24/30) [66, 94] | > 0.999 |
| Specificity (%) | 78 (21/27) [62, 94] | 70 (19/27) [53, 88] | 0.50 | 76 (22/29) [60, 91] | 72 (21/29) [56, 89] | > 0.999 |
| Positive predictive value (%) | 81 (25/31) [67, 95] | 74 (23/31) [59, 90] | N/A | 77 (23/30) [62, 92] | 75 (24/32) [60, 90] | N/A |
| Negative predictive value (%) | 75 (21/28) [59, 91] | 68 (19/28) [51, 85] | N/A | 76 (22/29) [60, 91] | 78 (21/27) [62, 94] | N/A |