| Literature DB >> 30178143 |
Wei Li1, Yang Huang1, Bo-Wen Zhuang1, Guang-Jian Liu2, Hang-Tong Hu1, Xin Li3, Jin-Yu Liang1, Zhu Wang1, Xiao-Wen Huang1, Chu-Qing Zhang4, Si-Min Ruan1, Xiao-Yan Xie1, Ming Kuang1,5, Ming-De Lu1,5, Li-Da Chen6, Wei Wang7.
Abstract
OBJECTIVE: To assess significant liver fibrosis by multiparametric ultrasomics data using machine learning.Entities:
Keywords: Data mining; Decision support techniques; Liver fibrosis; Machine learning; Ultrasonography
Mesh:
Year: 2018 PMID: 30178143 PMCID: PMC6510867 DOI: 10.1007/s00330-018-5680-z
Source DB: PubMed Journal: Eur Radiol ISSN: 0938-7994 Impact factor: 5.315
Fig. 1The diagram shows the four-step process for the construction of an ultrasomic-based predictive model. (I) Ultrasomic images were obtained from different US modalities (conventional images, original radiofrequency data and dynamic CEMF images). (II) Big ultrasonic data were extracted as conventional radiomic features, original radiofrequency features and dynamic CEMF features. (III) Big data mining (correlation, cluster and predictive performance) was performed to select the optimal predictor, and the classification performance of ultrasomic models was tested via machine learning. (IV) In this study, we prospectively enrolled 144 patients with liver fibrosis to establish an ultrasomic model for the prediction of fibrosis stages
Demographic and clinical characteristics of patients
| Characteristic ( | Value |
|---|---|
| Age (years) # | 48.24 ± 13.75 |
| Gender (male/female) | 114/30 |
| Body mass index (kg/m2)# | 20.20 ± 3.16 |
| Hepatitis B surface antigen (+/-) | 98/46 |
| Platelet count (×109/l)* | 5.70 (4.69-7.22) |
| ALT level (U/l)* | 31.5 (20.5-50.0) |
| AST level (U/l)* | 30.0 (23.0-42.5) |
| AST/ALT* | 1.0 (0.734-1.262) |
| Albumin level (g/l)* | 41.8 (38.6-44.45) |
| Total bilirubin level (μmol/l)* | 12.55 (9.45-18.30) |
| γ-GL level (U/l)* | 53.5 (32.0-122.5) |
| Prothrombin time (s)* | 12.60 (12.10-13.40) |
| APRI* | 0.396 (0.281-0.733) |
| FIB4* | 1.344 (0.885-2.248) |
| HBV-DNA (<100/>100 cps/ml) | 62/82 |
| Fibrosis (F0/F1/F2/F3/F4) | 15/33/38/23/35 |
| Inflammation (A0/A1/A2/A3) | 9/70/50/15 |
| Steatosis (S0/S1/ S2/S3) | 101/36/5/2 |
Note: Unless otherwise indicated, data are number of patients
ALT alanine aminotransferase, AST aspartate aminotransferase, GL gamma-glutamyl transpeptidase, APRI aspartate aminotransferase to platelet ratio index, FIB4 fibrosis-4 index
*Data are medians, with interquartile range in parentheses
#Data are means ± standard deviation
Fig. 2The correlation heat map shows associations between conventional radiomics, ORF and CEMF features. Only a few parameters were highly correlated (blue) or highly anti-correlated (red). In general, we found that these features were only slightly or moderately correlated between ORF and CEMF features (a, Spearman rho, -0.26 to 0.24) and CEMF and conventional radiomic features (b, Spearman rho, -0.26 to 0.32). However, the variables between ORF and conventional radiomic features were more highly correlated (c, Spearman rho, -0.34 to 0.47)
Fig. 3Diagnostic value of the parameters in the diagnosis of fibrosis, activity and steatosis stages. The boxplot shows that ORF and CEMF parameters, with no significant difference, were the strongest predictors and were much higher than conventional radiomic features for liver fibrosis stages (a, both p < 0.01, ANOVA test). CEMF exhibited the highest diagnostic value for activity stages (b, both p < 0.05, ANOVA test), and ORF performed the best for steatosis stages (c, both p < 0.01, ANOVA test)
Fig. 4Boxplot showing the classification performance with the six machine-learning methods with all parameters. The p value is for a permutation test. The AdaBoost, RF and SVM classifier outperformed the other classifiers (all p < 0.001). These three classifiers showed less variation with a smaller quartile value and dispersion degree
Fig. 5The results of three machine-learning method analyses in classifying each combination of ultrasomic features. All three machine-learning methods-adaptive boosting (a), random forest (b) and support vector machine (c) showed that the ultrasomics models achieved much better performance than the models of a single modality and models of conventional radiomics and ORF features (all p < 0.001)
Training and validation results from machine learning-based classification of ultrasomics features
| Features | Adaboost | Random forest | Support vector machine | |||
|---|---|---|---|---|---|---|
| Training | Validation | Training | Validation | Training | Validation | |
| CR, ORF and CEMF | 0.97 ± 0.02 | 0.85 ± 0.01 | 1.00 | 0.85 ± 0.01 | 0.94 ± 0.01 | 0.85 ± 0.01 |
| CR and CEMF | 0.97 ± 0.03 | 0.82 ± 0.04 | 1.00 | 0.83 ± 0.02 | 0.91 ± 0.02 | 0.80 ± 0.03 |
| ORF and CEMF | 0.97 ± 0.02 | 0.84 ± 0.02 | 1.00 | 0.85 ± 0.03 | 0.91 ± 0.02 | 0.82 ± 0.04 |
| CR and ORF | 0.97 ± 0.01 | 0.78 ± 0.03 | 1.00 | 0.78 ± 0.03 | 0.91 ± 0.03 | 0.79 ± 0.04 |
| CR | 0.95 ± 0.04 | 0.68 ± 0.06 | 1.00 | 0.72 ± 0.05 | 0.84 ± 0.04 | 0.71 ± 0.05 |
| ORF | 0.95 ± 0.02 | 0.77 ± 0.02 | 1.00 | 0.73 ± 0.04 | 0.90 ± 0.03 | 0.74 ± 0.04 |
| CEMF | 0.97 ± 0.02 | 0.75 ± 0.03 | 1.00 | 0.77 ± 0.04 | 0.91 ± 0.05 | 0.74 ± 0.06 |
Note: Performance metrics are from hold-out samples (based on ten-fold cross-validation). Data in the table are mean ± standard deviation
CR conventional radiomics, ORF original radiofrequency, CEMF contrast-enhanced micro-flow
Performance metrics from machine learning-based classification of ultrasomic features
| Features | Adaboost | Random forest | Support vector machine | ||||||
|---|---|---|---|---|---|---|---|---|---|
| AUC | Sensitivity (%) | Specificity (%) | AUC | Sensitivity (%) | Specificity (%) | AUC | Sensitivity (%) | Specificity (%) | |
| CR, ORF and CEMF | 0.85 ± 0.01 | 87.5 | 76.9 | 0.85 ± 0.01 | 87.5 | 76.9 | 0.85 ± 0.01 | 93.8 | 69.2 |
| CR and CEMF | 0.82 ± 0.04 | 59.4 | 100 | 0.83 ± 0.02 | 71.9 | 92.3 | 0.80 ± 0.03 | 81.3 | 76.9 |
| ORF and CEMF | 0.84 ± 0.02 | 92.9 | 71.4 | 0.85 ± 0.03 | 92.9 | 71.4 | 0.82 ± 0.04 | 100 | 71.4 |
| CR and ORF | 0.78 ± 0.03 | 59.4 | 100 | 0.78 ± 0.03 | 56.3 | 92.3 | 0.79 ± 0.04 | 81.3 | 84.6 |
| CR | 0.68 ± 0.06 | 43.8 | 100 | 0.72 ± 0.05 | 50.0 | 92.3 | 0.71 ± 0.05 | 90.6 | 46.2 |
| ORF | 0.77 ± 0.02 | 81.3 | 69.2 | 0.73 ± 0.04 | 62.5 | 84.6 | 0.74 ± 0.04 | 87.5 | 69.2 |
| CEMF | 0.75 ± 0.03 | 78.1 | 69.2 | 0.77 ± 0.04 | 84.4 | 76.9 | 0.74 ± 0.06 | 90.6 | 53.9 |
Note: Performance metrics are validation results from hold-out samples (based on ten-fold cross-validation). Data of AUCs in the table are mean ± standard deviation
CR conventional radiomics, ORF original radiofrequency, CEMF contrast-enhanced micro-flow, AUC area under the receiver-operating characteristic curve