| Literature DB >> 29730602 |
Kun Wang1,2, Xue Lu1, Hui Zhou2,3, Yongyan Gao4, Jian Zheng1,5, Minghui Tong6, Changjun Wu7, Changzhu Liu8, Liping Huang9, Tian'an Jiang10, Fankun Meng11, Yongping Lu12, Hong Ai13, Xiao-Yan Xie14, Li-Ping Yin15, Ping Liang3, Jie Tian2,3, Rongqin Zheng1.
Abstract
OBJECTIVE: We aimed to evaluate the performance of the newly developed deep learning Radiomics of elastography (DLRE) for assessing liver fibrosis stages. DLRE adopts the radiomic strategy for quantitative analysis of the heterogeneity in two-dimensional shear wave elastography (2D-SWE) images.Entities:
Keywords: cirrhosis; hepatitis B; ultrasonography
Mesh:
Year: 2018 PMID: 29730602 PMCID: PMC6580779 DOI: 10.1136/gutjnl-2018-316204
Source DB: PubMed Journal: Gut ISSN: 0017-5749 Impact factor: 23.059
Figure 1Illustration of the two-dimensional shear wave elastography (2D-SWE) measurement and the deep learning Radiomics of elastography (DLRE) flow chart. (A) The top shows the 2D-SWE region of interest (ROI) (pseudocolour area), Q-Box (white circle area within 2D-SWE ROI) and DLRE ROI (red square area). The obtained 2D-SWE values are displayed on the right yellow box. The bottom is the corresponding B-mode ultrasound image. (B) An input layer (DLRE ROI) is analysed by using four convolution-pooling procedures (C1-P1 to C4-P4), and then last pooled maps are fully connected with 32 neural nodes to calculate its probability for classification. The neural nodes and other parameters of the convolutional neural network (CNN) model were automatically optimised by using all 2D-SWE images in the training cohort.
Figure 2The results of the multicentre patient enrolments. In total, 398 out of 654 patients from 12 Chinese hospitals were enrolled in this study. 2D-SWE, two-dimensional shear wave elastography.
Baseline characters of patients
| Variables | All patients | Training cohort | Validation cohort | P values |
| Number of patients (%) | 398 | 266 (66.8%) | 132 (33.2%) | – |
| Age (years) | 38.6±12.1 | 38.8±12.1 | 38.1±12.1 | 0.997 |
| Gender (male) | 265 (66.6%) | 175 (65.8%) | 90 (68.2%) | 0.634 |
| BMI (kg/m2) | 22.8±3.26 | 22.8±3.3 | 22.8±3.2 | 0.951 |
| FBG (g/L) | 5.2±1.2 | 5.2±1.1 | 5.1±1.4 | 0.860 |
| PLT (109/L) | 177.8±64.0 | 175.0±61.2 | 183.4±69.1 | 0.218 |
| AST (IU/L) | 43.9±46.5 | 43.5±42.2 | 44.6±54.5 | 0.832 |
| ALT (IU/L) | 54.9±53.9 | 54.9±57.5 | 54.9±46.0 | 0.997 |
| GGT (IU/L) | 47.5±62.1 | 49.2±64.6 | 43.9±56.7 | 0.428 |
| TB (µmol/L) | 15.9±8.6 | 16.0±9.2 | 15.7±7.3 | 0.724 |
| DB (µmol/L) | 5.3±5.2 | 5.6±5.9 | 4.7±2.9 | 0.113 |
| IB (µmol/L) | 10.9±5.8 | 10.8±6.1 | 11.0±5.2 | 0.727 |
| ALP (IU/L) | 83.8±33.5 | 82.1±32,5 | 87.5±35.2 | 0.131 |
| ALB (g/L) | 44.8±26.5 | 43.4±8.5 | 47.6±44.4 | 0.134 |
| PT (%) | 90.7±13.1 | 90.9±13.4 | 90.1±12.3 | 0.557 |
| HBV status | ||||
| HBeAg (+/−) | 150/248 | 96/170 | 54/78 | 0.428 |
| HBeAg (+/−) CI | 66/143 | 45/97 | 21/46 | 0.913 |
| Fibrosis stages | ||||
| F0-1 | 65 (16.3%) | 43 (16.1%) | 22 (16.7%) | 0.993 |
| F2 | 109 (27.4%) | 72 (27.1%) | 37 (28.0%) | 0.944 |
| F3 | 126 (31.7%) | 85 (32.0%) | 41 (31.1%) | 0.946 |
| F4 | 98 (24.6%) | 66 (24.8%) | 32 (24.2%) | 0.994 |
| Inflammation grades | ||||
| A0 | 1 (0.3%) | 1 (0.4%) | 0 (0%) | – |
| A1 | 137 (34.4%) | 84 (31.6%) | 53 (40.2%) | 0.132 |
| A2 | 148 (37.2%) | 106 (39.8%) | 42 (31.8%) | 0.149 |
| A3 | 112 (28.1%) | 75 (28.2%) | 37 (28.0%) | 0.939 |
Qualitative variables are in n (%), and quantitative variables are in mean±SD, when appropriate.
HBV status was categorised according to 2017 European Association for the Study of the Liver (EASL) guideline.
P values were calculated between the training and validation cohorts.
ALB, albumin; ALP, alkaline phosphatase; ALT, alanine aminotransferase; AST, aspartate aminotransferase; BMI, body mass index; DB, direct bilirubin; FBG, fasting blood glucose; GGT, gamma-glutamyl transpeptidase; HBeAg, hepatitis Be antigen; HBeAg (+/−) CI, hepatitis Be antigen (+/−) chronic infection; IB, indirect bilirubin; PLT, platelet count; PT, prothrombin activity; TB, total bilirubin.
Figure 3Comparison of ROC curves between DLRE, 2D-SWE and biomarkers for the assessment of liver fibrosis stages in training and validation cohorts, respectively. (A, D) F0-F3 versus F4 (F4) in training and validation cohorts. (B, E) F0-F2 versus F3-F4 (≥F3) in training and validation cohorts. (C, F) F0-F1 versus F2-F4 (≥F2) in training and validation cohorts. 2D-SWE, two-dimensional shear wave elastography; APRI, aspartate transaminase-to-platelet ratio index; DLRE, deep learning Radiomics of elastography; FIB-4, fibrosis index based on four factors.
Diagnostic performance of DLRE, 2D-SWE, APRI and FIB-4 for the assessment of liver fibrosis stages in training and validation cohorts
| n (P) | AUC | Sensitivity (%) | Specificity (%) | PPV (%) | NPV (%) | LR+ | LR− | ||
| Cirrhosis (F4) | |||||||||
| DLRE | T | 266 (24.8%) | 1.00 | 100.0 | 100.0 | 100.0 | 100.0 | – | 0.0 |
| V | 132 (24.2%) | 0.97 | 96.9 | 88.0 | 72.1 | 99.9 | 8.1 | 0.0 | |
| 2D-SWE | T | 266 (24.8%) | 0.87*** | 80.3 | 79.0 | 55.8 | 92.4 | 3.8 | 0.3 |
| V | 132 (24.2%) | 0.86** | 87.5 | 76.0 | 53.8 | 95.0 | 3.7 | 0.2 | |
| APRI | T | 266 (24.8%) | 0.69*** | 63.6 | 73.5 | 44.2 | 86.0 | 2.4 | 0.5 |
| V | 132 (24.2%) | 0.70*** | 78.1 | 60.0 | 38.5 | 89.6 | 2.0 | 0.4 | |
| FIB-4 | T | 266 (24.8%) | 0.73*** | 63.6 | 77.0 | 47.7 | 86.5 | 2.8 | 0.5 |
| V | 132 (24.2%) | 0.75*** | 50.0 | 93.0 | 69.6 | 85.3 | 7.1 | 0.5 | |
| Advanced fibrosis (≥F3) | |||||||||
| DLRE | T | 266 (56.8%) | 0.99 | 97.4 | 95.7 | 95.6 | 71.9 | 22.4 | 0.0 |
| V | 132 (55.3%) | 0.98 | 90.4 | 98.3 | 98.5 | 89.2 | 53.3 | 0.1 | |
| 2D-SWE | T | 266 (56.8%) | 0.81*** | 81.5 | 70.4 | 78.3 | 74.3 | 2.8 | 0.3 |
| V | 132 (55.3%) | 0.85*** | 79.5 | 78.0 | 81.7 | 75.4 | 3.6 | 0.3 | |
| APRI | T | 266 (56.8%) | 0.65*** | 55.6 | 74.8 | 74.3 | 56.2 | 2.2 | 0.6 |
| V | 132 (55.3%) | 0.68*** | 72.6 | 64.4 | 71.6 | 65.5 | 2.0 | 0.4 | |
| FIB-4 | T | 266 (56.8%) | 0.65*** | 64.2 | 65.2 | 70.8 | 58.1 | 1.9 | 0.6 |
| V | 132 (55.3%) | 0.70*** | 48.0 | 86.4 | 81.4 | 57.3 | 3.5 | 0.6 | |
| Significance fibrosis (≥F2) | |||||||||
| DLRE | T | 266 (83.8%) | 0.99 | 100.0 | 97.7 | 99.6 | 100.0 | 43.0 | 0.0 |
| V | 132 (83.3%) | 0.85 | 69.1 | 90.9 | 97.4 | 37.0 | 7.6 | 0.3 | |
| 2D-SWE | T | 266 | 0.74*** | 50.7 | 88.4 | 95.8 | 25.7 | 4.4 | 0.6 |
| V | 132 | 0.77 | 49.1 | 95.5 | 98.2 | 27.3 | 10.8 | 0.5 | |
| APRI | T | 266 | 0.54*** | 37.7 | 74.4 | 88.4 | 18.7 | 1.5 | 0.8 |
| V | 132 | 0.60*** | 62.7 | 63.6 | 89.6 | 25.5 | 1.7 | 0.6 | |
| FIB-4 | T | 266 (83.8%) | 0.56*** | 51.1 | 62.8 | 87.7 | 19.9 | 1.4 | 0.8 |
| V | 132 (83.3%) | 0.62** | 60.9 | 68.2 | 90.5 | 25.9 | 1.9 | 0.6 | |
Statistical quantifications were demonstrated with 95% CI, when applicable.
AUC of DLRE was statistically compared with AUC of 2D-SWE, APRI and FIB-4, respectively, in the same fibrosis stage (*P<0.05; **P<0.01; ***P<0.001).
2D-SWE, two-dimensional shear wave elastography; APRI, aspartate transaminase-to-platelet ratio index; AUC, area under the receiver operating characteristic curve; DLRE, deep learning Radiomics of elastography; FIB-4, fibrosis index based on four factors; LR+, positive diagnostic likelihood ratio; LR−, negative diagnostic likelihood ratio; n, number of patients; NPV, negative predictive value; P, prevalence; PPV, positive predictive value; T, training cohort; V, validation cohort.
Intrastrategy and interstrategy comparisons of DLRE and 2D-SWE for their relationship of diagnostic accuracy versus the number of image/measurement acquisitions in assessing liver fibrosis stages in training and validation cohorts
| n (P) | AUC | Sensitivity (%) | Specificity (%) | PPV (%) | NPV (%) | LR+ | LR− | ||
|
| |||||||||
| DLRE | |||||||||
| 1 image | T | 266 (24.8%) | 0.94** | 95.5 | 84.5 | 67.0 | 98.3 | 6.2 | 0.1 |
| V | 132 (24.2%) | 0.84*** | 75.0 | 82.0 | 57.1 | 91.1 | 4.2 | 0.3 | |
| 3 images | T | 266 (24.8%) | 1.00 | 100.0 | 100.0 | 100.0 | 100.0 | – | 0.0 |
| V | 132 (24.2%) | 0.96 | 84.4 | 95.0 | 84.4 | 95.0 | 16.9 | 0.2 | |
| 5 images | T | 266 (24.8%) | 1.00 | 100.0 | 100.0 | 100.0 | 100.0 | – | 0.0 |
| V | 132 (24.2%) | 0.97 | 96.9 | 88.0 | 72.1 | 98.9 | 8.1 | 0.0 | |
| 2D-SWE | |||||||||
| 1 measurement | T | 266 (24.8%) | 0.87****** | 84.9 | 75.5 | 53.3 | 93.8 | 3.5 | 0.2 |
| V | 132 (24.2%) | 0.86 | 93.8 | 74.0 | 53.6 | 97.4 | 3.6 | 0.1 | |
| 3 measurements | T | 266 (24.8%) | 0.88****** | 86.4 | 74.5 | 52.8 | 94.3 | 3.4 | 0.2 |
| V | 132 (24.2%) | 0.85***** | 87.5 | 76.0 | 53.8 | 95.0 | 3.7 | 0.2 | |
| 5 measurements | T | 266 (24.8%) | 0.87****** | 80.3 | 79.0 | 55.8 | 92.4 | 3.8 | 0.3 |
| V | 132 (24.2%) | 0.86***** | 87.5 | 76.0 | 53.8 | 95.0 | 3.7 | 0.2 | |
|
| |||||||||
| DLRE | |||||||||
| 1 image | T | 266 (83.8%) | 0.91* | 86.8 | 81.7 | 86.2 | 82.5 | 4.8 | 0.2 |
| V | 132 (83.3%) | 0.82** | 75.3 | 79.7 | 82.1 | 72.3 | 3.7 | 0.3 | |
| 3 images | T | 266 (83.8%) | 0.96 | 90.1 | 88.7 | 91.3 | 81.2 | 8.0 | 0.1 |
| V | 132 (83.3%) | 0.95 | 94.5 | 86.4 | 89.6 | 92.7 | 7.0 | 0.1 | |
| 5 images | T | 266 (83.8%) | 0.99 | 97.4 | 95.7 | 95.6 | 71.9 | 22.4 | 0.0 |
| V | 132 (83.3%) | 0.98 | 90.4 | 98.3 | 98.5 | 89.2 | 53.3 | 0.1 | |
| 2D-SWE | |||||||||
| 1 measurement | T | 266 (83.8%) | 0.81****** | 72.2 | 75.7 | 79.6 | 67.4 | 3.0 | 0.4 |
| V | 132 (83.3%) | 0.83 | 68.5 | 89.8 | 89.3 | 69.7 | 6.7 | 0.4 | |
| 3 measurements | T | 266 (83.8%) | 0.81****** | 80.79 | 68.7 | 77.2 | 73.1 | 2.6 | 0.3 |
| V | 132 (83.3%) | 0.84***** | 78.1 | 79.7 | 82.6 | 74.6 | 3.8 | 0.3 | |
| 5 measurements | T | 266 (56.8%) | 0.81****** | 81.5 | 70.4 | 78.3 | 74.3 | 2.8 | 0.3 |
| V | 132 (55.3%) | 0.85****** | 79.5 | 78.0 | 81.7 | 75.4 | 3.6 | 0.3 | |
|
| |||||||||
| DLRE | |||||||||
| 1 image | T | 266 (83.8%) | 0.95 | 89.7 | 93.0 | 98.5 | 63.5 | 12.9 | 0.1 |
| V | 132 (83.3%) | 0.74 | 76.4 | 72.7 | 93.3 | 38.1 | 2.8 | 0.3 | |
| 3 images | T | 266 (83.8%) | 0.97 | 97.8 | 95.3 | 99.1 | 89.1 | 21.0 | 0.0 |
| V | 132 (83.3%) | 0.82 | 67.3 | 95.5 | 98.7 | 36.8 | 14.8 | 0.3 | |
| 5 images | T | 266 (83.8%) | 0.99 | 100.0 | 97.7 | 99.5 | 51.2 | 43.0 | 0.0 |
| V | 132 (83.3%) | 0.85 | 69.1 | 90.9 | 97.4 | 37.0 | 7.6 | 0.3 | |
| 2D-SWE | |||||||||
| 1 measurement | T | 266 (83.8%) | 0.72****** | 59.2 | 79.1 | 93.6 | 27.2 | 2.8 | 0.5 |
| V | 132 (83.3%) | 0.76 | 60.0 | 86.4 | 95.7 | 30.2 | 4.4 | 0.5 | |
| 3 measurements | T | 266 (83.8%) | 0.74****** | 54.3 | 83.7 | 94.5 | 26.1 | 3.3 | 0.6 |
| V | 132 (83.3%) | 0.77 | 51.8 | 95.5 | 98.3 | 28.4 | 11.4 | 0.5 | |
| 5 measurements | T | 266 (83.8%) | 0.74****** | 50.7 | 88.4 | 95.8 | 25.7 | 4.4 | 0.6 |
| V | 132 (83.3%) | 0.77 | 49.1 | 95.5 | 98.2 | 27.3 | 10.8 | 0.5 | |
Statistical quantifications were demonstrated with 95% CI, when applicable.
Intrastrategy comparison: for either DLRE or 2D-SWE, AUCs obtained by analysing three images/measurements were compared with these obtained by analysing one and five images/measurements, respectively (*P<0.05; **P<0.01; ***P<0.001).
Interstrategy comparison: for using the same number of images/measurements, AUCs obtained by DLRE were compared with these obtained by 2D-SWE in each liver fibrosis classification, respectively (****P<0.05; *****P<0.01; ******P<0.001).
2D-SWE, two-dimensional shear wave elastography; AUC, area under the receiver operating characteristic curve; DLRE, deep learning Radiomics of elastography; LR+, positive diagnostic likelihood ratio; LR−, negative diagnostic likelihood ratio; n, number of patients; NPV, negative predictive value; P, prevalence; PPV, positive predictive value; T, training cohort; V, validation cohort.
Figure 4Comparison of receiver operating characteristic (ROC) curves between deep learning Radiomics of elastography (DLRE) and two-dimensional shear wave elastography (2D-SWE) using different number of image acquisitions/measurements (1, 3 and 5) of each patient for the assessment of liver fibrosis stages. (A, D) F0-F3 versus F4 (F4) in training and validation cohorts. (B, E) F0-F2 versus F3-F4 (≥F3) in training and validation cohorts. (C, F) F0-F1 versus F2-F4 (≥F2) in training and validation cohorts.
Figure 5Comparison of receiver operating characteristic (ROC) curves between different combinations of hospitals for training deep learning Radiomics of elastography (DLRE) in the classification of liver fibrosis stages. (A, D) F0-F3 versus F4 (F4) in training (combination of hospitals B, D, G, E, H and J) and validation cohorts. (B, E) F0-F2 versus F3-F4 (≥F3) in training (combination of hospitals A, C and K) and validation cohorts. (C, F) F0-F1 versus F2-F4 (≥F2) in training (combination of hospitals A, G and K) and validation cohorts. Note: three ROC curves completely overlap each other in (A) and (C), as they all reach the optimal profile (area under the receiver operating characteristic curve (AUC)=1).
Comparisons using different combinations of hospitals for training DLRE to classify liver fibrosis stages in training and validation cohorts
| n (P) | AUC | Sensitivity (%) | Specificity (%) | PPV (%) | NPV (%) | LR+ | LR− | ||
| Cirrhosis (F4) | |||||||||
| Combo1 | T | 221 (27.6%) | 1.00 | 100.0 | 100.0 | 100.0 | 100.0 | – | 0.0 |
| V | 177 (20.9%) | 0.95 | 86.5 | 94.3 | 80.0 | 96.4 | 15.1 | 0.1 | |
| Combo 2 | T | 221 (27.6%) | 1.00 | 100.0 | 100.0 | 100.0 | 100.0 | – | 0.0 |
| V | 177 (20.9%) | 0.98 | 93.6 | 93.3 | 84.6 | 97.4 | 13.9 | 0.1 | |
| Combo 3 | T | 221 (27.6%) | 1.00 | 100.0 | 100.0 | 100.0 | 100.0 | – | 0.0 |
| V | 177 (20.9%) | 0.97 | 95.2 | 92.4 | 81.6 | 98.2 | 12.6 | 0.1 | |
| Advanced fibrosis (≥F3) | |||||||||
| Combo 1 | T | 221 (59.7%) | 0.98 | 94.7 | 95.5 | 96.9 | 92.4 | 21.1 | 0.1 |
| V | 177 (52.0%) | 0.97 | 92.4 | 95.3 | 95.5 | 92.1 | 19.6 | 0.1 | |
| Combo 2 | T | 221 (59.7%) | 0.98 | 95.8 | 93.9 | 94.2 | 95.5 | 15.6 | 0.05 |
| V | 177 (52.0%) | 0.97 | 96.2 | 88.3 | 93.6 | 93.0 | 8.3 | 0.0 | |
| Combo 3 | T | 221 (59.7%) | 1.00 | 96.9 | 99.1 | 99.2 | 96.5 | 106.5 | 0.0 |
| V | 177 (52.0%) | 0.97 | 93.8 | 92.2 | 94.8 | 90.8 | 12.0 | 0.1 | |
| Significance fibrosis (≥F2) | |||||||||
| Combo 1 | T | 221 (87.3%) | 1.00 | 100.0 | 100.0 | 100.0 | 100.0 | – | 0.0 |
| V | 177 (79.1%) | 0.83 | 84.3 | 70.3 | 91.5 | 54.2 | 2.8 | 0.2 | |
| Combo 2 | T | 221 (87.3%) | 1.00 | 100.0 | 100.0 | 100.0 | 100.0 | – | 0.0 |
| V | 177 (79.1%) | 0.86 | 80.8 | 80.0 | 97.6 | 29.3 | 4.0 | 0.2 | |
| Combo 3 | T | 221 (87.3%) | 1.00 | 100.0 | 100.0 | 100.0 | 100.0 | – | 0.0 |
| V | 177 (79.1%) | 0.82 | 74.1 | 77.8 | 96.4 | 27.5 | 3.3 | 0.3 | |
Statistical quantifications were demonstrated with 95% CI, when applicable.
AUCs obtained by three different combinations of hospitals were statistically compared with each other in each classification and each cohort (*P<0.05; **P<0.01; ***P<0.001).
AUC, area under the receiver operating characteristic curve; Combo, combination of hospitals for training and validation cohorts; DLRE, deep learning Radiomics of elastography; LR+, positive diagnostic likelihood ratio; LR−, negative diagnostic likelihood ratio; n, number of patients; NPV, negative predictive value; P, prevalence; PPV, positive predictive value; T, training cohort; V, validation cohort.