| Literature DB >> 35190650 |
Victoria Blanes-Vidal1,2, Katrine P Lindvig3,4, Maja Thiele3,4, Esmaeil S Nadimi5,6, Aleksander Krag3,4.
Abstract
For years, hepatologists have been seeking non-invasive methods able to detect significant liver fibrosis. However, no previous algorithm using routine blood markers has proven to be clinically appropriate in primary care. We present a novel approach based on artificial intelligence, able to predict significant liver fibrosis in low-prevalence populations using routinely available patient data. We built six ensemble learning models (LiverAID) with different complexities using a prospective screening cohort of 3352 asymptomatic subjects. 463 patients were at a significant risk that justified performing a liver biopsy. Using an unseen hold-out dataset, we conducted a head-to-head comparison with conventional methods: standard blood-based indices (FIB-4, Forns and APRI) and transient elastography (TE). LiverAID models appropriately identified patients with significant liver stiffness (> 8 kPa) (AUC of 0.86, 0.89, 0.91, 0.92, 0.92 and 0.94, and NPV ≥ 0.98), and had a significantly superior discriminative ability (p < 0.01) than conventional blood-based indices (AUC = 0.60-0.76). Compared to TE, LiverAID models showed a good ability to rule out significant biopsy-assessed fibrosis stages. Given the ready availability of the required data and the relatively high performance, our artificial intelligence-based models are valuable screening tools that could be used clinically for early identification of patients with asymptomatic chronic liver diseases in primary care.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35190650 PMCID: PMC8861108 DOI: 10.1038/s41598-022-06998-8
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Patients demographic information, clinical data and indirect serum markers used as input parameters in each of the LiverAID models.
Figure 2Data splitting including repeated random subsampling approach and final evaluation in the hold-out dataset.
– Subjects characteristics.
| Characteristic | Units or levels | Summary data (n = 3460) |
|---|---|---|
| LSM1 | kPa | 4.6 ± 2.1 |
| Liver stiffness status based on LSM2 | significant/not significant | 403/2949 |
| Sex | male/female | 1584/1768 |
| Age | years | 57 ± 13 |
| Weight | kg | 83 ± 25 |
| Alcohol consumption in the last 3 months | units/week | 6 ± 15 |
| BMI | 27.3 ± 7 | |
| METS points | 0/1/2/3/4/5 | 336/753/824/608/448/383 |
| Diabetes | no/yes | 3023/329 |
| Mid-upper arm circumference | cm | 30 ± 5 |
| ALT | U/L | 26 ± 16 |
| AST | U/L | 25 ± 11 |
| ALP | g/L | 69 ± 28 |
| GGT | U/L | 29 ± 36 |
| ALB | g/L | 45 ± 4 |
| INR | 1.0 ± 0.1 | |
| Bilirubine | µmol/L | 8 ± 6 |
| Platelets | 10^9/L | 242 ± 75 |
| Hb | mmol/L | 8.8 ± 1.0 |
| MCV | fL | 90 ± 5 |
| Leukocytes | 10^9/L | 6.0 ± 2.3 |
| CRP | mg/L | 1.5 ± 2.7 |
| Ferritin | µg/L | 145 ± 173 |
| Sodium | mmol/L | 140 ± 3 |
| Cholesterol | mmol/L | 5.0 ± 1.4 |
| Triglycerides | mmol/L | 1.1 ± 0.8 |
| Elevated triglycerides | no/yes | 2208/1144 |
| HDL | mmol/L | 1.5 ± 0.6 |
| Low HDL cholesterol | no/yes | 2397/955 |
| HbA1c | mmol/mol | 36 ± 6 |
| GLC | mmol/L | 5.7 ± 0.8 |
| GLCmean | mmol/L | 6.1 ± 0.8 |
| IgA | g/L | 2.2 ± 1.4 |
| IgG | g/L | 10.1 ± 2.9 |
| IgM | g/L | 0.86 ± 0.64 |
| MELD | 6 ± 1 | |
| Biopsy-assessed fibrosis stage | F0/F1/F2/F3/F4 | 41/168/139/50/65 |
All summary data are medians ± interquartile range or counts (%).
1LSM = Liver stiffness measurement (median of 10 valid transient elastography measurements).
2Number of subjects whose LSM is > 8 kPa (corresponding to “significant liver stiffness”) and subjects whose LSM is ≤ 8 kPa (corresponding to “not significant liver stiffness”).
ALB, Albumin; ALP, Alkaline phosphatase; GGT, Gamma-Glutamyltransferase; INR, International Normalised Ratio; ALT, Alanine aminotransferase; AST, Aspartate aminotransferase; CRP, C-reactive protein; GLC, Fasting glucose; GLCmean, Mean glucose calculated from HbA1C; Hb, Hemoglobine; HbA1c, Hemoglobin A1c; HDL, HDL cholesterol; MCV, Mean corpuscular volume; MELD, Model of End-stage Liver Disease.
A total of 3352 patients had a valid liver stiffness measurement and 463 patients also underwent a liver biopsy investigation. All demographic and serum variables included in the analysis had ≤ 15% missing values. Missing values in these variables were handled by creating imputations (replacement values) for these multivariate missing data.
- Diagnostic performance measures evaluated using the hold-out (completely unseen) dataset.
| Type of method/model | Prediction of significant liver stiffness defined as measured liver stiffness (LSM) > 8 kPa (N = 335) | Prediction of significant liver fibrosis defined as Kleiner biopsy stage (F2 to F4) (N = 55) | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Method/model | AUC | Accuracy | Sensitivity | Specificity | PPV | NPV | Accuracy | Sensitivity | Specificity | PPV | NPV | |
| 1. Cut-off values for the standard blood-based indices1 | FIB-4 | – | 0.6 | 0.71 | 0.58 | 0.25 | 0.91 | 0.55 | 0.11 | 1.00 | 1.00 | 0.52 |
| Forns | – | 0.17 | 0.93 | 0.03 | 0.15 | 0.73 | 0.71 | 0.82 | 0.59 | 0.68 | 0.76 | |
| APRI | – | 0.85 | 0.49 | 0.91 | 0.52 | 0.9 | 0.62 | 0.92 | 0.31 | 0.57 | 0.80 | |
| 2. Logistic regression using standard blood indices as predictors | FIB-4 | 0.7 | 0.71 | 0.64 | 0.73 | 0.31 | 0.91 | 0.63 | 0.27 | 1.00 | 1.00 | 0.58 |
| Forns | 0.6 | 0.86 | 0.27 | 0.96 | 0.57 | 0.88 | 0.69 | 0.57 | 0.81 | 0.76 | 0.65 | |
| APRI | 0.74 | 0.77 | 0.6 | 0.8 | 0.36 | 0.91 | 0.69 | 0.75 | 0.63 | 0.68 | 0.71 | |
| FIB-4 + Forns + APRI | 0.76 | 0.74 | 0.66 | 0.76 | 0.33 | 0.92 | 0.63 | 0.87 | 0.40 | 0.59 | 0.75 | |
| 3. Ensemble learning models | LiverAID XXS | 0.86 | 0.52 | 0.95 | 0.44 | 0.22 | 0.98 | 0.65 | 0.96 | 0.33 | 0.60 | 0.90 |
| LiverAID XS | 0.89 | 0.56 | 0.95 | 0.49 | 0.24 | 0.98 | 0.56 | 0.96 | 0.16 | 0.54 | 0.79 | |
| LiverAID S | 0.91 | 0.68 | 0.92 | 0.64 | 0.3 | 0.98 | 0.61 | 0.96 | 0.26 | 0.57 | 0.85 | |
| LiverAID M | 0.92 | 0.69 | 0.9 | 0.66 | 0.31 | 0.98 | 0.61 | 0.96 | 0.26 | 0.57 | 0.85 | |
| LiverAID L | 0.92 | 0.69 | 0.92 | 0.65 | 0.31 | 0.98 | 0.61 | 0.98 | 0.22 | 0.57 | 0.88 | |
| LiverAID 4XL | 0.94 | 0.74 | 0.94 | 0.71 | 0.35 | 0.99 | 0.62 | 0.97 | 0.25 | 0.57 | 0.89 | |
| 4. Transient elastography | LSM | – | – | – | – | – | – | 0.84 | 0.93 | 0.74 | 0.79 | 0.91 |
1Cut-off values of FIB-4 = 1.25, Forns = 4.1 and APRI = 0.5 were used.
- Diagnostic performance measures for the prediction of significant liver stiffness defined as measured liver stiffness (LSM) > 8 kPa, evaluated using the hold-out (completely unseen) dataset, for each subpopulation: subjects at risk of NAFLD, subjects at risk of alcohol-related liver disease (ALD), and subjects randomly selected from the general population.
| Method/model | Accuracy | Sensitivity | Specificity | PPV | NPV | |
|---|---|---|---|---|---|---|
| 1. Cut-off values for the standard blood-based indices1 | FIB-4 | 0.82 | 0.00 | 1.00 | – | 0.82 |
| Forns | 0.22 | 0.91 | 0.06 | 0.18 | 0.75 | |
| APRI | 0.86 | 0.48 | 0.95 | 0.67 | 0.89 | |
| 2. Logistic regression using standard blood indices as predictors | FIB-4 | 0.74 | 0.52 | 0.78 | 0.35 | 0.88 |
| Forns | 0.83 | 0.23 | 0.97 | 0.63 | 0.85 | |
| APRI | 0.76 | 0.48 | 0.83 | 0.38 | 0.88 | |
| FIB-4 + Forns + APRI | 0.76 | 0.64 | 0.78 | 0.39 | 0.91 | |
| 3. Ensemble learning models | LiverAID XXS | 0.61 | 0.96 | 0.54 | 0.30 | 0.98 |
| LiverAID XS | 0.53 | 0.93 | 0.45 | 0.26 | 0.97 | |
| LiverAID S | 0.63 | 0.94 | 0.56 | 0.31 | 0.98 | |
| LiverAID M | 0.63 | 0.94 | 0.56 | 0.31 | 0.98 | |
| LiverAID L | 0.63 | 0.93 | 0.57 | 0.31 | 0.98 | |
| LiverAID 4XL | 0.68 | 0.92 | 0.62 | 0.34 | 0.98 | |
| 1. Cut-off values for the standard blood-based indices1 | FIB-4 | 0.82 | 0.15 | 1.00 | 1.00 | 0.81 |
| Forns | 0.18 | 0.94 | 0.01 | 0.18 | 0.50 | |
| APRI | 0.79 | 0.55 | 0.85 | 0.50 | 0.88 | |
| 2. Logistic regression using standard blood indices as predictors | FIB-4 | 0.67 | 0.75 | 0.65 | 0.37 | 0.91 |
| Forns | 0.84 | 0.39 | 0.94 | 0.58 | 0.87 | |
| APRI | 0.69 | 0.70 | 0.69 | 0.38 | 0.89 | |
| FIB-4 + Forns + APRI | 0.67 | 0.67 | 0.67 | 0.33 | 0.89 | |
| 3. Ensemble learning models | LiverAID XXS | 0.59 | 0.90 | 0.52 | 0.30 | 0.96 |
| LiverAID XS | 0.51 | 0.99 | 0.40 | 0.27 | 0.99 | |
| LiverAID S | 0.64 | 0.90 | 0.58 | 0.33 | 0.96 | |
| LiverAID M | 0.64 | 0.90 | 0.58 | 0.33 | 0.96 | |
| LiverAID L | 0.64 | 0.93 | 0.57 | 0.33 | 0.98 | |
| LiverAID 4XL | 0.68 | 0.99 | 0.60 | 0.36 | 1.00 | |
| 1. Cut-off values for the standard blood-based indices1 | FIB-4 | 0.94 | 0.00 | 1.00 | - | 0.94 |
| Forns | 0.07 | 1.00 | 0.01 | 0.05 | 1.00 | |
| APRI | 0.90 | 0.25 | 0.94 | 0.20 | 0.96 | |
| 2. Logistic regression using standard blood indices as predictors | FIB-4 | 0.74 | 0.75 | 0.74 | 0.14 | 0.98 |
| Forns | 0.93 | 0.00 | 0.99 | 0.00 | 0.95 | |
| APRI | 0.88 | 0.75 | 0.88 | 0.27 | 0.98 | |
| FIB-4 + Forns + APRI | 0.79 | 0.68 | 0.79 | 0.22 | 0.96 | |
| 3. Ensemble learning models | LiverAID XXS | 0.84 | 0.75 | 0.84 | 0.19 | 0.99 |
| LiverAID XS | 0.68 | 0.90 | 0.67 | 0.12 | 0.99 | |
| LiverAID S | 0.87 | 0.75 | 0.88 | 0.23 | 0.99 | |
| LiverAID M | 0.87 | 0.75 | 0.88 | 0.23 | 0.99 | |
| LiverAID L | 0.86 | 0.75 | 0.87 | 0.22 | 0.99 | |
| LiverAID 4XL | 0.95 | 0.75 | 0.96 | 0.47 | 0.99 | |
Figure 3Receiver operating characteristic (ROC) curves for the prediction of clinically significant liver stiffness (> 8 kPa) in the hold-out dataset, repetition 1 to 5 respectively (n = 335) of LiverAID models (LiverAID XXS, XS, S, M, L, 4XL) and logistic regressions using standard blood-based indices of liver fibrosis as predictors (univariate FIB-4, Forns, APRI and multivariate FIB-4 + Forns + APRI).
- P-values for the comparison of AUC between LiverAID models and standard blood-based indices in predicting significant liver stiffness (LSM > 8 kPa).
| FIB-4 | Forns | APRI | FIB-4 + Forns + APRI | LiverAIDXXS | LiverAIDXS | LiverAIDS | LiverAIDM | LiverAIDL | LiverAID 4XL | |
|---|---|---|---|---|---|---|---|---|---|---|
| AUC | 0.70 | 0.60 | 0.74 | 0.76 | 0.86 | 0.89 | 0.91 | 0.92 | 0.92 | 0.94 |
| FIB-4 | – | |||||||||
| Forns | 0.987 | – | ||||||||
| APRI | 0.867 | 0.998 | – | |||||||
| FIB-4 + Forns + APRI | 0.021 | 0.003 | 0.173 | – | ||||||
| LiverAID XXS | 0.000 | 0.000 | 0.000–0.001 | 0.002–0.008 | – | |||||
| LiverAID XS | 0.000 | 0.000 | 0.000–0.001 | 0.001–0.008 | 0.065–0.430 | – | ||||
| LiverAID S | 0.000 | 0.000 | 0.000 | 0.000–0.001 | 0.005–0.031 | 0.000–0.102 | – | |||
| LiverAID M | 0.000 | 0.000 | 0.000 | 0.000 | 0.001–0.016 | 0.003–0.132 | 0.300–0.485 | – | ||
| LiverAID L | 0.000 | 0.000 | 0.000 | 0.000 | 0.004–0.060 | 0.001–0.067 | 0.187–0.482 | 0.172–0.440 | – | |
| LiverAID 4XL | 0.000 | 0.000 | 0.000 | 0.000 | 0.002–0.006 | 0.001–0.041 | 0.044–0.219 | 0.049–0.126 | 0.014–0.157 | – |
Ranges indicate the minimum and maximum P-values from the repeated models (repetitions 1 to 5). The results for each repetition can be found in Appendix.