| Literature DB >> 35204639 |
Yunchao Yin1, Derya Yakar1, Rudi A J O Dierckx1, Kim B Mouridsen1,2, Thomas C Kwee1, Robbert J de Haas1.
Abstract
Background: The exact focus of computed tomography (CT)-based artificial intelligence techniques when staging liver fibrosis is still not exactly known. This study aimed to determine both the added value of splenic information to hepatic information, and the correlation between important radiomic features and information exploited by deep learning models for liver fibrosis staging by CT-based radiomics.Entities:
Keywords: artificial intelligence; liver; machine learning; multidetector computed tomography; radiomics
Year: 2022 PMID: 35204639 PMCID: PMC8870954 DOI: 10.3390/diagnostics12020550
Source DB: PubMed Journal: Diagnostics (Basel) ISSN: 2075-4418
Figure 1Flowchart of the study population. Abbreviation: CT = computed tomography.
Figure 2Overall algorithm scheme. First, the computed tomography (CT) volume was preprocessed; then, the liver and spleen were segmented as the region of interest. The radiomic features were extracted from the segmented liver and spleen. The machine learning classifiers were trained by hepatic features and hepatic–splenic features, respectively, to predict the probability array of liver fibrosis stages, namely F0–F4.
Demographics of the study population.
| Variable | Total Cohort | Liver Fibrosis Stage | |||||
|---|---|---|---|---|---|---|---|
| F0 | F1 | F2 | F3 | F4 | |||
| Total number of patients | 252 | 134 | 8 | 10 | 18 | 82 | |
| Median age | 59 | 63 | 64 | 57 | 48 | 60 | |
| Gender | Male | 140 (55.6%) | 68 (50.7%) | 3 (37.5%) | 7 (70.0%) | 11 (61.1%) | 51 (62.2%) |
| Female | 112 (44.4%) | 66 (49.3%) | 5 (62.5%) | 3 (30.0%) | 7 (38.9%) | 31 (37.8%) | |
| Etiology of liver fibrosis | Alcoholic | 26 | - | 0 | 0 | 1 | 25 |
| Autoimmune hepatitis | 5 | - | 0 | 1 | 1 | 3 | |
| HBV | 3 | - | 0 | 0 | 0 | 3 | |
| HCV | 10 | - | 0 | 0 | 1 | 9 | |
| PSC | 3 | - | 0 | 2 | 1 | 0 | |
| Steato- | 8 | - | 0 | 0 | 0 | 8 | |
| Wilson disease | 1 | - | 0 | 0 | 0 | 1 | |
| Other | 17 | - | 1 | 0 | 7 | 9 | |
| Unknown | 45 | - | 7 | 7 | 7 | 24 | |
Abbreviations: HBV = hepatitis B virus; HCV = hepatitis C virus; PSC = primary sclerosing cholangitis.
Figure 3Data distribution based on radiomic features after dimension reduction using the t-distributed stochastic neighbor embedding (t-SNE) method. Features are reduced to two dimensions represented by the two axes. Each dot represents a computed tomography (CT) scan, and its color represents the liver fibrosis stage as shown at the upper corner of the graph. A relative clustering of different fibrosis stages is observed, which means that CT scans of different fibrosis stages can be differentiated based on the two-dimensional features after dimension reduction.
Performance of machine learning classifiers for liver fibrosis staging.
| Machine Learning Classifier Type | Training Features | Accuracy | Microaveraged AUC | AUC (95% CI) | Accuracy (%; 95% CI) | ||||
|---|---|---|---|---|---|---|---|---|---|
| Significant Fibrosis | Advanced Fibrosis | Cirrhosis | Significant Fibrosis | Advanced Fibrosis | Cirrhosis | ||||
| Logistic regression classifier with L1 penalty | Selected liver features | 76 | 0.94 | 0.95 | 0.92 | 0.91 | 86 | 80 | 82 |
| Selected liver & spleen features | 84 | 0.95 * | 0.93 | 0.88 | 0.94 | 92 | 86 | 90 | |
| Logistic regression classifier with L2 penalty | Selected liver features | 78 | 0.95 | 0.96 | 0.91 | 0.93 | 88 | 82 | 84 |
| Selected liver & spleen features | 80 | 0.95 ** | 0.95 | 0.90 | 0.94 | 88 | 81 | 86 | |
| AdaBoosting | Selected liver features | 74 | 0.82 | 0.74 | 0.72 | 0.82 | 76 | 80 | 74 |
| Selected liver & spleen features | 76 | 0.84 *** | 0.57 | 0.61 | 0.84 | 78 | 82 | 57 | |
| Gradient Boosting | Selected liver features | 76 | 0.88 | 0.85 | 0.83 | 0.88 | 80 | 80 | 85 |
| Selected liver & spleen features | 78 | 0.91 **** | 0.84 | 0.88 | 0.86 | 80 | 84 | 84 | |
| XGBoosting | Selected liver features | 80 | 0.86 | 0.78 | 0.79 | 0.90 | 84 | 86 | 78 |
| Selected liver & spleen features | 82 | 0.93 ***** | 0.86 | 0.90 | 0.90 | 84 | 88 | 86 | |
* p = 0.243 when comparing liver features with liver and spleen features; ** p = 0.278 when comparing liver features with liver and spleen features; *** p = 0.025 when comparing liver features with liver and spleen features; **** p = 0.024 when comparing liver features with liver and spleen features; ***** p < 0.001 when comparing liver features with liver and spleen features. Abbreviations: AUC = area under the receiver operating characteristic curve; CI = confidence interval.
Top five weighted radiomic features of trained logistic regression classifiers. The feature names are based on the Pyradiomics library used for research [16].
| Weight Ranking | Radiomic Feature Names | |||||||
|---|---|---|---|---|---|---|---|---|
| L1 Penalty and F0 | L1 Penalty and F4 | L2 Penalty and F0 | L2 Penalty and F4 | |||||
| Feature Category | Feature | Feature Category | Feature | Feature Category | Feature | Feature Category | Feature | |
| 1st | GLRLM * | Low Gray Level Run Emphasis * | First order * | Robust Mean Absolute Deviation * | Shape * | Flatness * | NGTDM * | Coarseness * |
| 2nd | Shape * | Flatness * | Shape # | Maximum 2D Diameter Slice # | Shape * | Least Axis Length * | Shape # | Maximum 2D Diameter Slice # |
| 3rd | Shape * | Maximum 2D Diameter Row * | GLRLM * | Run Length Nonuniformity Normalized * | Shape * | Maximum 2D Diameter Row * | First order * | Robust Mean Absolute Deviation * |
| 4th | GLSZ * | Zone Entropy * | NGTDM * | Coarseness * | GLRLM * | Low Gray Level Run Emphasis * | Shape # | Surface Area # |
| 5th | Shape * | Least Axis Length * | GLCM # | Cluster Shade # | GLRLM * | Short Run Low Gray Level Emphasis * | Shape # | Minor Axis Length # |
* liver radiomic features; # splenic radiomic features. Abbreviations: GLRLM = gray level run length matrix; NGTDM = neighboring gray tone difference matrix; GLSZM = gray level size zone matrix; GLCM = gray level co-occurrence matrix.
Top five weighted radiomic features of trained tree-based ensemble classifiers. The feature names are based on the Pyradiomics library used for research [16].
| Weight Ranking | Radiomic Feature Names | |||||
|---|---|---|---|---|---|---|
| AdaBoosting | Gradient Boosting | XGBoosting | ||||
| Feature Category | Feature | Feature Category | Feature | Feature Category | Feature | |
| 1st | First order * | Total Energy * | NGTDM * | Strength * | NGTDM * | Strength * |
| 2nd | First order # | Median # | Shape * | Maximum 2D Diameter Row * | GLSZM * | Size Zone Nonuniformity * |
| 3rd | GLRLM # | Low Gray Level Run Emphasis # | GLCM # | Imc2 # | GLRLM # | Gray Level Variance # |
| 4th | GLSZM * | Size Zone Nonuniformity * | Shape * | Mesh Volume * | GLRLM * | Run Length Nonuniformity * |
| 5th | Shape * | Flatness * | GLSZM * | Zone Variance * | GLSZM # | Large Area High Gray Level Emphasis # |
* liver radiomic features; # splenic radiomic features. Abbreviations: NGTDM = neighboring gray tone difference matrix; GLSZM = gray level size zone matrix; GLRLM = gray level run length matrix; GLCM = gray level co-occurrence matrix.