| Literature DB >> 33005419 |
Xiao-Jie Lu1, Xiao-Jun Yang2, Jing-Yu Sun1, Xin Zhang3, Zhao-Xin Yuan4,5, Xiu-Hui Li6.
Abstract
BACKGROUND: China is a highly endemic area of chronic hepatitis B (CHB). The accuracy of existed noninvasive biomarkers including TE, APRI and FIB-4 for staging fibrosis is not high enough in Chinese cohort.Entities:
Keywords: HBV; Liver fibrosis; Machine learning; Noninvasive diagnosis
Year: 2020 PMID: 33005419 PMCID: PMC7520974 DOI: 10.1186/s40364-020-00215-2
Source DB: PubMed Journal: Biomark Res ISSN: 2050-7771
Fig. 1Flow diagram of the study population and reasons for exclusion. CHB, chronic HBV; HCC, hepatocellular carcinoma; HDV, hepatitis D virus
Baseline characteristics of the study population in training set (Huai’an, Jilin and Anhui) and in validation sets (Beijing)
| Variables | Huai’an ( | Jilin ( | Anhui ( | Beijing ( | |
|---|---|---|---|---|---|
| Male sex, n | 158 (63%) | 168 (57%) | 257 (63%) | 203 (61%) | 0.274 |
| Median age (years) (IQR) | 42 (32–49) | 42 (33–50) | 38 (29–48) | 38 (30–47) | 0.006 |
| Median BMI (kg/m2) (IQR) | 23.7 (22.5–25.1) | 23.7 (21.6–26.0) | 22.8 (20.4–25.4) | 23.7 (21.0–26.1) | 0.014 |
| Median fasting LSM value (kPa) | 10.4 (6.6–15.3) | 7.9 (5.9–11.8) | 5.7 (4.2–7.5) | 7.8 (5.6–12.4) | < 0.001 |
| Median ALT (IU/L) (IQR) | 42 (27–67) | 39 (24–65) | 49 (26–84.5) | 46.2 (25.2–79.4) | 0.046 |
| Median AST (IU/L) (IQR) | 35 (27–55) | 32 (25–50) | 34 (23–51) | 33 (25–52) | 0.108 |
| Median GGT (IU/L) (IQR) | 46 (24–98) | 37 (21–79) | 31.5 (21–73) | 32 (19–60) | < 0.001 |
| Median total bilirubin (IU/L) (IQR) | 15.8 (11.9–21.0) | 14.4 (11.6–21.2) | 17.7 (13.0–23.1) | 14.3 (11.4–20) | 0.654 |
| Median platelet counts (109/L) (IQR) | 165 (122–206) | 179 (140–215) | 161.5 (125–204) | 186 (148–225) | 0.148 |
| Median APRI (IQR) | 0.64 (0.41–1.35) | 0.51 (0.33–0.84) | 0.55 (0.34–0.92) | 0.50 (0.30–0.76) | < 0.001 |
| Median FIB-4 (IQR) | 1.61 (0.99–2.87) | 1.22 (0.82–1.95) | 1.17 (0.74–2.04) | 1.07 (0.73–1.69) | < 0.001 |
| Ultrasound size of spleen (mm2) (IQR) | 4410 (3866–5280) | 3780 (2976–4710) | 3468 (3120–3813) | 3620 (2890–4560) | < 0.001 |
| Ultrasound diameter of spleen vein (mm) (IQR) | 6 (5.2–7) | 7 (6.7–8) | 9 (8–10) | 7 (6–8) | < 0.001 |
| Ultrasound diameter of portal vein (mm) (IQR) | 11 (10–12) | 12 (11–12) | 11 (10–11) | 11 (10–12) | < 0.001 |
| Ultrasound velocity of portal vein (m/s) (IQR) | 0.16 (0.14–0.19) | 0.11 (0.10–0.12) | Not reported | Not reported | |
| Median size of liver biopsy (mm) (IQR) | 15 (12–16) | 16 (12–19) | 15 (12–20) | 16 (13–18) | < 0.001 |
| Metavir fibrosis stage (F0/F1/F2/F3/F4) | 21 (8.3%)/37 (14.7%)/46 (18.3%)/51 (20.2%)/97 (38.5%) | 32 (10.8%)/77 (25.9%)/84 (28.3%)/44 (14.8%)/60 (20.2%) | 10 (2.5%)/144 (35.3%)/129 (31.6%)/66 (16.2%)/59 (14.5%) | 26 (7.9%)/127 (38.3%)/73 (22%)/32 (9.6%)/74 (22.3%) | < 0.001 |
| Metavir activity grade (A0/A1/A2/A3) | 50 (19.8%)/106 (42.1%)/72 (28.6%)/24 (9.5%) | 190 (64.0%)/54 (18.2%)/52 (17.5%)/1 (0.3%) | 86 (21.1%)/254 (62.2%)/61 (15.0%)/7 (1.7%) | 13 (3.9%)/154 (46.4%)/114 (34.3%)/51 (15.4%) | < 0.001 |
ALT alanine transaminase, APRI (AST)-to-platelet ratio index, AST aspartate transaminase, BMI body mass index, GGT gamma-glutamyl transpeptidase, LSM liver stiffness measurement
Selection for orginal variables associated with the presence of fibrosis stage in the training set
| Variables | Spearman correlation analysis | Combined multivariate analysis | ||
|---|---|---|---|---|
| Significant fibrosis vs none | Cirrhosis vs F0–3 | |||
| Age (years) | 0.223 | < 0.001 | ||
| Male sex, n (%) | −0.021 | 0.630 | ||
| BMI | −0.013 | 0.770 | ||
| ALT (IU/L) | 0.030 | 0.489 | ||
| AST (IU/L) | 0.171 | < 0.001 | √ | √ |
| GGT (IU/L) | 0.187 | < 0.001 | √ | √ |
| Total bilirubin (IU/L) | 0.110 | 0.010 | ||
| Platelet count (109/L) | −0.356 | < 0.001 | √ | √ |
| WBC (109/L) | −0.257 | < 0.001 | ||
| PT (s) | 0.321 | < 0.001 | √ | √ |
| ALP (IU/L) | 0.094 | 0.027 | ||
| Albumin (g/L) | −0.144 | 0.001 | ||
| Cholesterol (mmol/L) | −0.071 | 0.099 | ||
| INR | 0.356 | < 0.001 | ||
| PIIINP (ng/ml) | 0.330 | < 0.001 | √ | √ |
| Type IV collagen (ng/ml) | 0.478 | < 0.001 | √ | √ |
| Laminin (ng/ml) | 0.465 | < 0.001 | √ | √ |
| HA (ng/ml) | 0.444 | < 0.001 | √ | √ |
| Ultrasound size of spleen (mm2) | 0.223 | < 0.001 | ||
| Ultrasound diameter of spleen vein (mm) | 0.097 | 0.024 | ||
| Ultrasound diameter of portal vein (mm) | 0.138 | 0.001 | √ | √ |
| Ultrasound velocity of portal vein (m/s) | 0.179 | < 0.001 | ||
| Fibroscan results (kPa) | 0.767 | < 0.001 | √ | √ |
ALT alanine transaminase, ALP alkaline phosphatase, AST aspartate transaminase, BMI body mass index, GGT gamma-glutamyl transpeptidase, HA hyaluronic acid, INR international normalized ratio, PIIINP type III procollagen aminoterminal peptide, PT prothrombin time, WBC white blood cell
Fig. 2Feature selection by using a parametric method, the least absolute shrinkage and selection operator (LASSO) regression. a Significant fibrosis feature selection of tuning parameter (λ) in the LASSO model used 10-fold cross-validation via minimum criteria. The AUC curve was plotted versus log(λ). Dotted vertical lines were drawn at the optimal values by using the minimum criteria and the 1 standard error of the minimum criteria (the 1 – standard error criteria). The optimal log(λ) of − 3.96 was chosen. b Cirrhosis feature selection and the optimal log(λ) of − 4.83 was chosen. c LASSO coefficient profiles of the 18 initially selected features. A vertical line was plotted at the optimal λ value, which resulted in 9 features with nonzero coefficients. d LASSO coefficient profiles of the 16 initially selected features. A vertical line was plotted at the optimal λ value, which resulted in 9 features with nonzero coefficients
Fig. 3The performances of the prediction models including FibroBox, TE, APRI and FIB-4 for significant fibrosis and cirrhosis in the Anhui cohort (a) and Beijing corhort (b) are assessed by the area under a receiver operating characteristic (ROC) curve
Diagnostic performance of FibroBox, TE, APRI and FIB-4 in the validation cohorts (Anhui and Beijing)
| Validation cohorts | Anhui ( | Beijing ( | ||
|---|---|---|---|---|
| F0–1 vs F2–4 | F0–3 vs F4 | F0–1 vs F2–4 | F0–3 vs F4 | |
| AUROC (95% CI) | 0.88 (0.84 to 0.92) | 0.87 (0.82 to 0.92) | 0.87 (0.83 to 0.91) | 0.90 (0.85 to 0.94) |
| Cut-off values | 0.38 | 0.56 | 0.38 | 0.56 |
| Sensitivity/specificity (%) | 80/82 | 78/81 | 75/88 | 72/90 |
| Correctly classified (%) | 81 | 81 | 82 | 88 |
| PPV/NPV (%) | 89/71 | 51/94 | 84/81 | 49/96 |
| Positive/negative LR | 4.5/0.2 | 4.1/0.3 | 6.2/0.3 | 7.2/0.3 |
| AUROC (95% CI) | 0.84 (0.79 to 0.88) | 0.84 (0.78 to 0.90) | 0.82 (0.77 to 0.87) | 0.89 (0.85 to 0.94) |
| Cut-off values | 7.8 | 11.3 | 7.8 | 11.3 |
| Sensitivity/specificity (%) | 77/82 | 75/85 | 77/84 | 95/69 |
| Correctly classified (%) | 79 | 83 | 81 | 72 |
| PPV/NPV (%) | 88/67 | 55/93 | 79/82 | 29/99 |
| Positive/negative LR | 4.4/0.3 | 4.8/0.3 | 4.7/0.3 | 3.0/0.1 |
| AUROC (95% CI) | 0.66 (0.60 to 0.73) | 0.72 (0.65 to 0.79) | 0.70 (0.65 to 0.76) | 0.75 (0.67 to 0.82) |
| Cut-off values | 0.50 | 0.50 | 0.43 | 0.62 |
| Sensitivity/specificity (%) | 64/70 | 81/56 | 75/58 | 69/69 |
| Correctly classified (%) | 67 | 61 | 66 | 69 |
| PPV/NPV (%) | 79/54 | 32/92 | 59/74 | 23/94 |
| Positive/negative LR | 2.2/0.5 | 1.8/0.3 | 1.8/0.4 | 2.2/0.4 |
| AUROC (95% CI) | 0.68 (0.62 to 0.74) | 0.79 (0.72 to 0.86) | 0.67 (0.61 to 0.73) | 0.70 (0.62 to 0.79) |
| Cut-off values | 1.71 | 1.62 | 1.20 | 1.20 |
| Sensitivity/specificity (%) | 44/87 | 75/75 | 58/71 | 71/62 |
| Correctly classified (%) | 60 | 75 | 65 | 63 |
| PPV/NPV (%) | 85/47 | 43/92 | 62/67 | 20/94 |
| Positive/negative LR | 3.4/0.6 | 3.0/0.3 | 2.0/0.6 | 1.9/0.5 |
| FibroBox and TE | < 0.001 | 0.058 | < 0.001 | 0.863 |
| FibroBox and APRI | < 0.001 | < 0.001 | < 0.001 | < 0.001 |
| FibroBox and FIB-4 | < 0.001 | 0.015 | < 0.001 | < 0.001 |
| TE and APRI | < 0.001 | 0.007 | < 0.001 | < 0.001 |
| TE and FIB-4 | < 0.001 | 0.264 | < 0.001 | < 0.001 |
| APRI and FIB-4 | 0.575 | 0.009 | 0.264 | 0.211 |
APRI AST-to-platelet ratio index, AST aspartate transaminase, AUROC area under the receiver operating characteristic curve, LR likelihood ratio, NPV negative predictive value, PPV positive predictive value, TE transient elastography
Fig. 4Decision curve analysis (DCA) of the prediction models including FibroBox, TE, APRI and FIB-4 for significant fibrosis and cirrhosis in the Anhui cohort (a) and Beijing corhort (b)