| Literature DB >> 35198928 |
Grace Lai-Hung Wong1,2,3, Vicki Wing-Ki Hui1,2, Qingxiong Tan4, Jingwen Xu4, Hye Won Lee5, Terry Cheuk-Fung Yip1,2,3, Baoyao Yang4, Yee-Kit Tse1,2, Chong Yin4, Fei Lyu4, Jimmy Che-To Lai1,2,3, Grace Chung-Yan Lui2, Henry Lik-Yuen Chan1,6, Pong-Chi Yuen4, Vincent Wai-Sun Wong1,2,3.
Abstract
BACKGROUND & AIMS: Accurate hepatocellular carcinoma (HCC) risk prediction facilitates appropriate surveillance strategy and reduces cancer mortality. We aimed to derive and validate novel machine learning models to predict HCC in a territory-wide cohort of patients with chronic viral hepatitis (CVH) using data from the Hospital Authority Data Collaboration Lab (HADCL).Entities:
Keywords: ALT, alanine aminotransferase; APRI, aspartate transaminase-to-platelet ratio index; AUROC, area under the receiver operating characteristic curve; Antiviral treatment; CDARS, Clinical Data Analysis and Reporting System; CHB, chronic hepatitis B; CHC, chronic hepatitis C; CI, confidence intervals; CVH, chronic viral hepatitis; Cirrhosis; DM, diabetes mellitus; HADCL, Hospital Authority Data Collaboration Lab; HBV, hepatitis B virus; HBeAg, hepatitis B e antigen; HBsAg, hepatitis B surface antigen; HCC, hepatocellular carcinoma; ICD-9-CM, International Classification of Diseases, Ninth Revision Clinical Modification; Liver cancer; Mortality; NA, nucleos(t)ide analogue; RS, ridge score; WHO, World Health Organization; World Health Organization; aHR, adjusted hazard ratio; anti-HCV, antibody to hepatitis C virus
Year: 2022 PMID: 35198928 PMCID: PMC8844233 DOI: 10.1016/j.jhepr.2022.100441
Source DB: PubMed Journal: JHEP Rep ISSN: 2589-5559
Fig 1Selection of patients with CHB in the final analysis.
CHB, chronic hepatitis B; CHC, chronic hepatitis C.
Baseline clinical characteristics of patients with CHV first diagnosed at different periods from 2000 to 2018.
| Chronic hepatitis B (N = 126,890) | Chronic hepatitis C (N = 16,811) | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Period | 2000–2004 | 2005–2009 | 2010–2013 | 2014–2018 | 2000–2004 | 2005–2009 | 2010–2013 | 2014–2018 | ||
| No. of patients | n = 19,060 | n = 29,809 | n = 37,011 | n = 41,010 | n = 5,362 | n = 3,694 | n = 3,279 | n = 4,476 | ||
| Male sex (n, %) | 12,175 (63.88) | 18,746 (62.89) | 22,425 (60.59) | 24,521 (59.79) | <0.001 | 3,343 (62.3) | 2,616 (70.8) | 2,268 (69.2) | 3,089 (69.0) | <0.001 |
| Age (years) | 48.33 (15.49) | 51.31 (14.43) | 54.00 (14.17) | 58.12 (14.24) | <0.001 | 51.4 (17.4) | 54.7 (15.6) | 56.5 (15.5) | 57.0 (14.7) | <0.001 |
| Platelet (×109/L) | 200.38 (99.19) | 211.88 (90.63) | 211.09 (83.18) | 214.67 (96.33) | <0.001 | 209.0 (105.9) | 211.2 (104.3) | 210.9 (98.3) | 218.7 (100.0) | <0.001 |
| Prothrombin time (s) | 12.68 (3.67) | 12.01 (3.20) | 12.06 (7.49) | 12.47 (3.36) | <0.001 | 12.4 (3.7) | 12.3 (5.4) | 12.6 (7.7) | 12.5 (3.6) | 0.341 |
| Albumin (g/L) | 38.26 (6.57) | 40.21 (6.09) | 40.27 (5.88) | 39.40 (6.20) | 0.034 | 36.2 (6.6) | 36.9 (6.7) | 37.7 (6.5) | 38.1 (6.2) | <0.001 |
| Total bilirubin (μmol/L) | 11.30 (8.00–17.46) | 12.00 (8.00–17.00) | 11.60 (8.00–16.00) | 11.00 (8.00–16.00) | <0.001 | 10.0 (7.00–16.0) | 11.4 (8.00–17.0) | 11.0 (8.00–16.0) | 11.0 (8.00–16.0) | <0.001 |
| ALT (IU/L) | 38.00 (23.00–71.00) | 33.00 (21.00–59.00) | 31.00 (20.00–51.00) | 28.00 (18.00–48.00) | <0.001 | 33.00 (18.00–67.00) | 38.00 (21.00–71.00) | 36.00 (21.00–65.00) | 36.90 (22.00–65.90) | <0.001 |
| AST (IU/L) | 40.00 (26.00–71.00) | 32.00 (23.00–50.00) | 30.00 (22.00–46.00) | 29.00 (21.00–46.00) | <0.001 | 42.00 (25.00–72.00) | 42.00 (27.00–71.00) | 40.00 (26.00–67.20) | 39.00 (26.00–65.00) | 0.187 |
| APRI | 1.90 (6.28) | 1.02 (3.45) | 0.93 (2.64) | 1.10 (6.12) | <0.001 | 1.5 (5.3) | 1.3 (2.7) | 1.2 (2.0) | 1.2 (4.0) | 0.057 |
| Forns index | 6.05 (2.70) | 5.83 (2.34) | 5.95 (2.21) | 6.32 (2.34) | <0.001 | 7.3 (2.4) | 7.1 (2.4) | 7.0 (2.5) | 6.6 (2.3) | <0.001 |
| FIB-4 | 0.75 (1.98) | 0.55 (1.93) | 0.62 (2.05) | 0.84 (4.53) | <0.001 | 0.6 (1.3) | 0.7 (1.7) | 0.8 (1.7) | 0.8 (4.3) | 0.085 |
| AFP (μmol/L) | 4.40 (3.–10.00) | 3.59 (2.–6.69) | 3.02 (2.–5.35) | 3.01 (2.–5.15) | <0.001 | 6.2 (3.–15.1) | 5.3 (3.–13.0) | 4.5 (3.–9.0) | 4.1 (3.–7.6) | <0.001 |
| Positive HBeAg (n, %) | 647 (35.63) | 1,949 (29.19) | 2,953 (22.75) | 2,236 (18.25) | <0.001 | |||||
| Missing (%) | 17,244 (90.47) | 23,131 (77.60) | 24,031 (64.93) | 28,756 (70.12) | ||||||
| HBV DNA (IU/L) | 5.26 (1.13) | 1.60 (2.49) | 0.50 (1.20) | 0.15 (0.49) | <0.001 | |||||
| Missing (%) | 18,970 (99.53) | 29,751 (99.81) | 36,605 (98.90) | 39,910 (97.32) | ||||||
| Advanced liver disease | ||||||||||
| APRI ≥2 | 552 (19.59) | 1,017 (9.87) | 948 (7.92) | 1,307 (8.99) | <0.001 | 174 (16.2) | 172 (13.7) | 168 (14.6) | 131 (9.6) | <0.001 |
| FIB-4 ≥3.25 | 98 (3.48) | 176 (1.71) | 246 (2.05) | 447 (3.08) | <0.001 | 21 (2.0) | 40 (3.2) | 44 (3.8) | 39 (2.9) | 0.071 |
| Forns index ≥8.4 | 163 (22.42) | 609 (14.82) | 725 (13.63) | 1,211 (16.58) | <0.001 | 50 (35.0) | 101 (27.9) | 114 (29.1) | 99 (18.6) | <0.001 |
| Comorbidities | ||||||||||
| Diabetes mellitus | 2,856 (14.98) | 5,326 (17.87) | 7,985 (21.57) | 11,244 (27.42) | <0.001 | 926 (17.3) | 739 (20.0) | 687 (21.0) | 983 (22.0) | <0.001 |
| Hypertension | 4,180 (21.93) | 9,566 (32.09) | 14,000 (37.83) | 18,744 (45.71) | <0.001 | 1,240 (23.1) | 1,320 (35.7) | 1,248 (38.1) | 1,776 (39.7) | <0.001 |
| Cardiovascular disease | 2,667 (13.99) | 4,600 (15.43) | 7,468 (20.18) | 10,020 (24.43) | <0.001 | 922 (17.2) | 811 (22.0) | 795 (24.2) | 1,081 (24.2) | <0.001 |
| Malignancy | <0.001 | |||||||||
| Colorectal cancer | 147 (0.77) | 473 (1.59) | 661 (1.79) | 1,082 (2.64) | <0.001 | 26 (0.5) | 28 (0.8) | 22 (0.7) | 35 (0.8) | 0.24 |
| Lung cancers | 149 (0.78) | 469 (1.57) | 635 (1.72) | 1,101 (2.68) | <0.001 | 28 (0.5) | 45 (1.2) | 58 (1.8) | 52 (1.2) | <0.001 |
| Urinary/renal malignancies | 36 (0.19) | 99 (0.33) | 141 (0.38) | 209 (0.51) | <0.001 | 10 (0.2) | 16 (0.4) | 12 (0.4) | 16 (0.4) | 0.18 |
| Cervical cancer (female only) | 12 (0.06) | 66 (0.22) | 66 (0.18) | 134 (0.33) | <0.001 | 3 (01) | 5 (0.1) | 0 (0.0) | 8 (0.2) | n.a. |
| Breast cancer | 141 (0.74) | 504 (1.69) | 595 (1.61) | 897 (2.19) | <0.001 | 9 (0.2) | 10 (0.3) | 12 (0.4) | 19 (0.4) | 0.101 |
| Lymphoma | 199 (1.04) | 310 (1.04) | 618 (1.67) | 1,583 (3.86) | <0.001 | 8 (0.1) | 19 (0.5) | 12 (0.4) | 12 (0.3) | 0.017 |
| Chronic kidney disease | 475 (2.49) | 550 (1.85) | 812 (2.19) | 1,149 (2.80) | <0.001 | 199 (3.7) | 139 (3.8) | 84 (2.6) | 113 (2.5) | 0.001 |
Descriptive statistics were calculated after subtraction of missing data from denominator. Total bilirubin, ALT, AST, alpha-foetoprotein, APRI, and FIB-4 are expressed in median (IQR), whereas other continuous variables are expressed in mean ± SD. Statistical tests involved: chi-square test, Fisher’s exact test, Student’s t test, Mann-Whitney test, one-way ANOVA, Kruskal-Wallis.
AFP, alpha-fetoprotein; ALT, alanine aminotransferase APRI, aspartate aminotransferase-to-platelet ratio index; AST, aspartate aminotransferase; CHV, chronic viral hepatitis; FIB-4, Fibrosis-4; ICD-9, International Classification of Diseases, Ninth Revision.
Data were log-transformed before missing value imputation was performed. Values of p were also calculated based on log-transformed values.
Percentages were computed based on non-missing values.
Comorbidities were all defined based on ICD-9 diagnosis codes.
Cox regression model for the factors associated with various clinical outcomes in patients with complete data for at least 1 of the serum fibrosis formulae.
| Parameters | Chronic hepatitis B | Chronic hepatitis C | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Univariate | Multivariable | Univariate | Multivariable | |||||||||
| HR | 95% CI | aHR | 95% CI | HR | 95% CI | aHR | 95% CI | |||||
| Male sex | 2.68 | (2.49–2.89) | <0.001 | 2.23 | (1.98–2.50) | <0.001 | 0.98 | (0.82–1.17) | 0.819 | 1.70 | (1.54–1.87) | <0.001 |
| Age (years) | 1.02 | (1.01–1.02) | <0.001 | 1.02 | (1.02–1.03) | <0.001 | 1.03 | (1.03–1.04) | <0.001 | 1.04 | (1.04–1.05) | <0.001 |
| Albumin (g/L) | 0.95 | (0.95–0.96) | <0.001 | 0.98 | (0.97–0.98) | <0.001 | 0.99 | (0.98–1.00) | 0.117 | 0.95 | (0.95–0.96) | <0.001 |
| ALT (>ULN) | 2.59 | (2.44–2.75) | <0.001 | 1.77 | (1.61–1.94) | <0.001 | 2.83 | (2.37–3.38) | <0.001 | 0.89 | (0.82–0.98) | 0.015 |
| Positive HBeAg | 1.42 | (1.28–1.57) | <0.001 | 1.46 | (1.31–1.62) | <0.001 | – | – | – | – | – | – |
| Antiviral treatment | 1.79 | (1.69–1.90) | <0.001 | 1.68 | (1.53–1.84) | <0.001 | 0.05 | (0.01–0.18) | <0.001 | 1.79 | (1.51–2.11) | <0.001 |
| Advanced liver fibrosis | 3.22 | (3.02–3.42) | <0.001 | 1.41 | (1.25–1.58) | <0.001 | 2.90 | (2.45–3.43) | <0.001 | 1.34 | (1.19–1.50) | <0.001 |
| Male sex | 1.57 | (1.48–1.67) | <0.001 | 1.32 | (1.20–1.46) | <0.001 | 0.76 | (0.66–0.87) | <0.001 | 0.87 | (0.76–1.01) | 0.067 |
| Age (years) | 1.02 | (1.02–1.02) | <0.001 | 1.01 | (1.01–1.02) | <0.001 | 1.02 | (1.01–1.02) | <0.001 | 1.00 | (1.00–1.01) | 0.066 |
| Albumin (g/L) | 0.92 | (0.92–0.92) | <0.001 | 0.95 | (0.94–0.95) | <0.001 | 0.95 | (0.94–0.96) | <0.001 | 0.96 | (0.95–0.97) | <0.001 |
| ALT (>ULN) | 1.94 | (1.83–2.04) | <0.001 | 1.28 | (1.18–1.40) | <0.001 | 2.12 | (1.85–2.44) | <0.001 | 1.50 | (1.30–1.74) | <0.001 |
| Positive HBeAg | 1.17 | (1.06–1.29) | 0.002 | 1.17 | (1.06–1.30) | 0.002 | – | – | – | – | – | – |
| Antiviral treatment | 1.18 | (1.11–1.25) | <0.001 | 1.01 | (0.93–1.11) | 0.758 | 0.04 | (0.01–0.13) | <0.001 | 0.06 | (0.02–0.19) | <0.001 |
| Advanced liver fibrosis | 4.34 | (4.10–4.59) | <0.001 | 1.25 | (1.13–1.40) | <0.001 | 4.01 | (3.51–4.59) | <0.001 | 1.56 | (1.32–1.85) | <0.001 |
| Male sex | 1.47 | (1.42–1.53) | <0.001 | 1.54 | (1.43–1.66) | <0.001 | 1.22 | (1.11–1.33) | <0.001 | 1.70 | (1.54–1.87) | <0.001 |
| Age (years) | 1.05 | (1.05–1.05) | <0.001 | 1.04 | (1.04–1.04) | <0.001 | 1.04 | (1.04–1.05) | <0.001 | 1.04 | (1.04–1.05) | <0.001 |
| Albumin (g/L) | 0.91 | (0.91–0.91) | <0.001 | 0.93 | (0.93–0.94) | <0.001 | 0.94 | (0.94–0.95) | <0.001 | 0.95 | (0.95–0.96) | <0.001 |
| ALT (>ULN) | 1.01 | (0.97–1.05) | 0.693 | 0.93 | (0.87–1.00) | 0.037 | 0.90 | (0.82–0.97) | 0.011 | 0.89 | (0.82–0.98) | 0.015 |
| Positive HBeAg | 0.76 | (0.70–0.83) | <0.001 | 0.92 | (0.85–1.01) | 0.069 | – | – | – | – | – | – |
| Antiviral treatment | 3.00 | (2.89–3.12) | <0.001 | 2.32 | (2.17–2.48) | <0.001 | 1.22 | (1.04–1.43) | 0.017 | 1.79 | (1.51–2.11) | <0.001 |
| Advanced liver fibrosis | 2.45 | (2.35–2.56) | <0.001 | 1.41 | (1.29–1.53) | <0.001 | 1.80 | (1.63–1.98) | <0.001 | 1.34 | (1.19–1.50) | <0.001 |
aHR, adjusted hazard ratio; ALT, alanine aminotransferase; HCC, hepatocellular carcinoma; HR, hazard ratio; ULN, upper limit of normal.
Parameters used to develop the machine learning models.
| Parameters | All | Selected mode 1 | Selected mode 2 |
|---|---|---|---|
| Male sex | ✓ | ✓ | ✓ |
| Age | ✓ | ✓ | ✓ |
| Platelet | ✓ | ✓ | ✓ |
| Albumin | ✓ | ✓ | ✓ |
| Total bilirubin | ✓ | ✓ | ✓ |
| ALT | ✓ | ✓ | ✓ |
| AST | ✓ | ||
| Alpha-foetoprotein | ✓ | ||
| International normalized ratio | ✓ | ||
| Creatinine | ✓ | ||
| Gamma glutamyl transferase | ✓ | ||
| Total cholesterol | ✓ | ||
| HbA1c | ✓ | ||
| Fasting glucose | ✓ | ||
| HBV DNA | ✓ | ||
| Positive HBeAg | ✓ | ||
| Cirrhosis | ✓ | ✓ | ✓ |
| Cardiovascular disease | ✓ | ✓ | |
| Colorectal cancer | ✓ | ✓ | |
| Lung cancers | ✓ | ✓ | |
| Urinary/renal malignancies | ✓ | ✓ | |
| Cervical cancer | ✓ | ✓ | |
| Breast cancer | ✓ | ✓ | |
| Lymphoma | ✓ | ✓ | |
| Chronic kidney disease | ✓ | ✓ | ✓ |
| Osteopenia | ✓ | ✓ | |
| Osteoporosis | ✓ | ✓ | |
| Diabetes mellitus | ✓ | ✓ | ✓ |
| Hypertension | ✓ | ✓ | ✓ |
| Anticoagulants | ✓ | ✓ | |
| ACEI/ARB | ✓ | ✓ | ✓ |
| Antiplatelet agents | ✓ | ✓ | ✓ |
| Beta blockers | ✓ | ✓ | ✓ |
| Histamine-2 receptor antagonist | ✓ | ✓ | |
| Insulin | ✓ | ✓ | ✓ |
| Immunosuppressant | ✓ | ✓ | |
| Loop diuretics | ✓ | ✓ | |
| Metformin | ✓ | ✓ | ✓ |
| NSAID | ✓ | ✓ | |
| Other lipid-lowering agents | ✓ | ✓ | ✓ |
| Other oral hypoglycaemic agents | ✓ | ✓ | ✓ |
| Proton pump inhibitor | ✓ | ✓ | ✓ |
| Potassium sparing diuretics | ✓ | ✓ | |
| Statins | ✓ | ✓ | ✓ |
| Sulphonylurea | ✓ | ✓ | ✓ |
| Thiazides | ✓ | ✓ |
ACEI, angiotensin-converting-enzyme inhibitor; ALT, alanine aminotransferase; ARB, angiotensin receptor blocker; AST, aspartate aminotransferase; CTP, Child–Turcotte–Pugh; HbA1c, haemoglobin A1c.
AUROC and the 95% CI of the machine learning models in training and validation cohorts to HCC.
| Machine learning model | Training cohort | Validation cohort | ||||
|---|---|---|---|---|---|---|
| 20 selected parameters | 36 selected parameters | All parameters | 20 selected parameters | 36 selected parameters | All parameters | |
| Logistic regression | 0.814 ± 0.006 | 0.829 ± 0.006 | 0.825 ± 0.006 | 0.818 ± 0.009 | 0.832 ± 0.009 | 0.829 ± 0.009 |
| Ridge regression | 0.817 ± 0.005 | 0.839 ± 0.005 | 0.842 ± 0.005 | 0.821 ± 0.009 | 0.840 ± 0.009 | 0.844 ± 0.009 |
| AdaBoost | 0.822 ± 0.006 | 0.828 ± 0.006 | 0.828 ± 0.006 | 0.824 ± 0.009 | 0.833 ± 0.009 | 0.832 ± 0.009 |
| Decision tree | 0.877 ± 0.005 | 0.884 ± 0.005 | 0.800 ± 0.005 | 0.802 ± 0.010 | 0.819 ± 0.010 | 0.818 ± 0.010 |
| Random forest | 0.987 ± 0.003 | 0.991 ± 0.003 | 0.992 ± 0.003 | 0.807 ± 0.010 | 0.821 ± 0.010 | 0.821 ± 0.010 |
AUROC, area under the receiver operating characteristic curve; HCC, hepatocellular carcinoma.
AUROC of the 5 machine learning algorithms were overall difference in the training cohort, p <0.05.
AUROC of the 5 machine learning algorithms were overall difference in the validation cohort, p <0.05.
AUROC higher than decision tree in the validation cohort, p <0.05.
AUROC higher than logistic regression and AdaBoost in both cohorts, p <0.05.
Accuracy of the machine learning models using selected parameters in diagnosing HCC in the training and validation cohorts.
| Machine learning algorithm | Dual cut-offs | n (%) (<lower cut-off /≥ upper cut-off) | Sensitivity (%) (95% CI) | Specificity (%) (95% CI) | PPV (%) (95% CI) | NPV (%) (95% CI) |
|---|---|---|---|---|---|---|
| Logistic regression | 0.18 | 43,951 (50.6) | 0.90 (0.89–0.91) | 0.54 (0.542–0.545) | 0.143 (0.141–0.146) | 0.985 (0.983–0.985) |
| 0.29 | 11,527 (13.3) | 0.52 (0.51–0.53) | 0.90 (0.898–0.902) | 0.307 (0.300–0.315) | 0.956 (0.955–0.958) | |
| Ridge regression | 0.07 | 48,341 (55.7) | 0.90 (0.89–0.91) | 0.596 (0.593–0.599) | 0.160 (0.156–0.164) | 0.986 (0.985–0.987) |
| 0.15 | 11,506 (13.3) | 0.52(0.51–0.53) | 0.900 (0.898–0.902) | 0.307 (0.300–0.315) | 0.956 (0.955–0.958) | |
| AdaBoost | 0.42 | 43,298 (49.9) | 0.91 (0.90–0.92) | 0.533 (0.529–0.536) | 0.142 (0.140–0.145) | 0.985 (0.984–0.986) |
| 0.45 | 10,363 (11.9) | 0.48 (0.46–0.49) | 0.911 (0.909–0.913) | 0.313 (0.308–0.320) | 0.953 (0.952–0.954) | |
| Decision tree | 0.04 | 48,765 (56.2) | 0.92 (0.92–0.93) | 0.603 (0.599–0.606) | 0.166 (0.163–0.170) | 0.990 (0.989–0.991) |
| 0.17 | 12,029 (13.9) | 0.63 (0.62–0.64) | 0.903 (0.902–0.905) | 0.356 (0.349–0.363) | 0.966 (0.965–0.967) | |
| Random forest | 0.45 | 71,804 (82.7) | 0.90 (0.90–0.91) | 0.998 (0.997–0.998) | 0.976 (0.973–0.979) | 0.992 (0.991–0.992) |
| 0.10 | 12,074 (13.9) | 0.97 (0.96–0.97) | 0.932 (0.930–0.933) | 0.547 (0.539–0.557) | 0.997 (0.997–0.998) | |
| Logistic regression | 0.18 | 19,448 (52.3) | 0.90 (0.89–0.91) | 0.568 (0.553–0.565) | 0.146 (0.142–0.151) | 0.985 (0.984–0.987) |
| 0.29 | 4,930 (13.3) | 0.52 (0.50–0.54) | 0.900 (0.896–0.902) | 0.304 (0.293–0.317) | 0.957 (0.955–0.959) | |
| Ridge regression | 0.07 | 20,816 (56.0) | 0.90 (0.89–0.91) | 0.598 (0.593–0.603) | 0.158 (0.152–0.164) | 0.986 (0.985–0.988) |
| 0.15 | 4,932 (13.3) | 0.52 (0.50–0.54) | 0.900 (0.897–0.903) | 0.304 (0.291–0.317) | 0.957 (0.955–0.960) | |
| AdaBoost | 0.42 | 18,725 (50.3) | 0.91 (0.90–0.92) | 0.538 (0.532–0.543) | 0.142 (0.137–0.146) | 0.987 (0.985–0.988) |
| 0.45 | 4,377 (11.8) | 0.47 (0.45–0.49) | 0.912 (0.909–0.914) | 0.310 (0.297–0.323) | 0.954 (0.952–0.956) | |
| Decision tree | 0.02 | 17,689 (47.6) | 0.90 (0.89–0.91) | 0.507 (0.501–0.511) | 0.133 (0.129–0.137) | 0.983 (0.982–0.985) |
| 0.17 | 4,987 (13.4) | 0.54 (0.52–0.56) | 0.900 (0.897–0.904) | 0.312 (0.302–0.330) | 0.959 (0.957–0.961) | |
| Random forest | 0.01 | 17,561 (47.2) | 0.90 (0.89–0.91) | 0.503 (0.496–0.508) | 0.132 (0.127–0.137) | 0.984 (0.982–0.986) |
| 0.20 | 4,561 (12.3) | 0.52 (0.50–0.53) | 0.910 (0.907–0.913) | 0.326 (0.312–0.341) | 0.957 (0.955–0.959) | |
In the training cohort, dual cut-offs were selected to achieve >90% sensitivity and specificity.
HCC, hepatocellular carcinoma; NPV, negative predictive value; PPV, positive predictive value.
Accuracy of HCC-RS compared with existing HCC risk scores in diagnosing HCC in the validation (n = 37,202) cohorts.
| Risk scores | AUROC | Dual cut-offs | n (%) (<lower cut-off /≥upper cut-off) | Sensitivity (%) (95% CI) | Specificity (%) (95% CI) | PPV (%) (95% CI) | NPV (%) (95% CI) |
|---|---|---|---|---|---|---|---|
| HCC-RS | 0.840 | 0.07 | 20,816 (56.0) | 90.0 (89.0–0.91) | 59.8 (59.3-60.3) | 15.8 (15.2-16.4) | 98.6 (98.5-98.8) |
| 0.15 | 4,932 (13.3) | 52.2 (50.3–54.0) | 90.0 (89.7-90.3) | 30.4 (29.1-31.7) | 95.7 (95.5-96.0) | ||
| CU-HCC score | 0.672 | <5 | 27,083 (72.8) | 46.4 (28.6–64.3) | 74.0 (69.9–78.4) | 10.3 (6.4–14.3) | 95.6 (94.2–97.1) |
| ≥20 | 7,812 (21.0) | 32.1 (14.3–50.0) | 79.7 (75.9–83.6) | 9.1 (4.5–14.0) | 94.8 (93.6–96.2) | ||
| GAG-HCC score | 0.745 | <80 | 25,781 (69.3) | 64.3 (46.4–82.1) | 71.5 (67.2–75.6) | 12.3 (8.8–15.9) | 97.0 (95.5–98.4) |
| ≥101 | 2,939 (7.9) | 28.6 (14.3–46.4) | 93.4 (91.1–95.6) | 21.1 (10.5–33.3) | 95.5 (94.5–96.6) | ||
| REACH-B score | 0.671 | <8 | 18,601 (50.0) | 72.7 (54.6–90.9) | 52.8 (45.4–59.7) | 16.2 (12.1–20.0) | 94.1 (89.8–97.9) |
| ≥14 | 558 (1.5) | 4.5 (0–13.5) | 98.9 (97.4–100) | 33.3 (0–100) | 89.2 (88.7–90.2) | ||
| PAGE-B score | 0.748 | <10 | 10,193 (27.4) | 95.7 (94.9–96.5) | 29.4 (28.9–30.0) | 10.7 (10.6–10.9) | 98.7 (98.5–99.0) |
| ≥13 | 17,969 (48.3) | 81.1 (79.4–82.7) | 54.6 (54.0–55.3) | 13.7 (13.4–14.0) | 97.0 (96.8–97.3) | ||
| REAL-B score | 0.712 | <4 | 6,548 (17.6) | 96.0 (95.2–96.9) | 19.2 (18.5–19.8) | 12.0 (11.9–12.2) | 97.7 (97.2–98.2) |
| ≥8 | 4,278 (11.5) | 27.0 (25.0–29.1) | 90.3 (89.8–90.7) | 24.2 (22.6–25.8) | 91.5 (91.3–91.7) |
AUROC, area under the receiver operating characteristic curve; HCC, hepatocellular carcinoma; HCC-RS, HCC ridge score; NPV, negative predictive value; PPV, positive predictive value.