| Literature DB >> 32395478 |
Markus Bo Schoenberg1, Julian Nikolaus Bucher1, Dominik Koch1, Nikolaus Börner1, Sebastian Hesse2, Enrico Narciso De Toni3, Max Seidensticker4, Martin Kurt Angele1, Christoph Klein2, Alexandr V Bazhin1, Jens Werner1, Markus Otto Guba1,3.
Abstract
BACKGROUND: Due to organ shortage, liver transplantation (LT) in hepatocellular carcinoma (HCC) patients can only be offered subsidiary to other curative treatments, including liver resection (LR). We aimed at developing and validating a machine-learning algorithm (ML) to predict which patients are sufficiently treated by LR.Entities:
Keywords: Hepatectomy; artificial intelligence; clinical oncology; hepatocellular carcinoma (HCC); machine learning (ML)
Year: 2020 PMID: 32395478 PMCID: PMC7210189 DOI: 10.21037/atm.2020.04.16
Source DB: PubMed Journal: Ann Transl Med ISSN: 2305-5839
Figure 1Workflow for the development of the random Forest model. AUROC, area under the receiver operator curve.
Figure 2Kaplan Meier Analysis of (A) overall survival and (B) disease free survival of the entire cohort.
Study data of the study cohort
| Characteristic | Study cohort, n=180 | Training data, n=127 | Test data, n=53 | Training |
|---|---|---|---|---|
| Demographics | ||||
| Age at operation in years, mean ± SD | 65.74±12.75 | 66.39±12.16 | 64.19±14.06 | 0.323 |
| Sex, n (%) | 0.254 | |||
| Male | 139 (77.22) | 101 (79.52) | 38 (71.70) | |
| Female | 41 (22.78) | 26 (20.47) | 15 (28.30) | |
| Underlying liver disease | ||||
| Cirrhosis, n (%) | 74 (41.11) | 54 (42.52) | 20 (37.74) | 0.552 |
| Child-Turcotte-Pugh A | 68 (37.78%) | 49 (38.58%) | 19 (35.19) | 0.551 |
| Child-Turcotte-Pugh B | 6 (3.33%) | 5 (3.94%) | 1 (1.89) | |
| Cause of cirrhosis, n (%) | 0.569 | |||
| Hepatitis C | 34 (18.89) | 24 (18.90) | 10 (18.87) | |
| Hepatitis B | 24 (13.33) | 19 (14.96) | 5 (9.43) | |
| Alcohol | 27 (15.00) | 22 (17.32) | 5 (9.43) | |
| Radiographic features | ||||
| No. of tumors at baseline, mean ± SD | 1.44±0.87 | 1.48±0.91 | 1.36±0.79 | 0.368 |
| Initial largest tumor diameter in mm, mean ± SD | 68.76±40.71 | 67.71±39.99 | 71.28±42.67 | 0.603 |
| Milan-Criteria, n (%) | 0.495 | |||
| Inside | 68 (37.78) | 50 (39.37) | 18 (33.96) | |
| Outside | 112 (62.22) | 77 (60.63) | 35 (66.04) | |
| Laboratory values | ||||
| ⍺-Fetoprotein prior Resection in ng/mL, median (IQR) | 13.7 (102.5) | 12.9 (96.8) | 21.1 (100.2) | 0.581 |
| Bilirubin mg/dL, mean ± SD | 0.996±2.23 | 1.07±2.63 | 0.83±0.52 | 0.342 |
| Albumin g/L, mean ± SD | 42.02±5.81 | 42.00±6.05 | 42.09±5.24 | 0.914 |
| ALT U/L, mean ± SD | 55.69±54.10 | 52.69±42.94 | 62.87±74.38 | 0.354 |
| AST U/L, mean ± SD | 64.83±98.65 | 57.43±40.06 | 82.55±170.75 | 0.294 |
| aPTT in seconds, mean ± SD | 28.20±5.93 | 28.06±4.82 | 28.55±8.05 | 0.680 |
| INR, mean ± SD | 1.05±0.097 | 1.05±0.097 | 1.05±0.099 | 0.996 |
| Creatinine mg/dL, mean ± SD | 1.02±0.25 | 1.03±0.23 | 1.00±0.30 | 0.487 |
| CRP mg/L, mean ± SD | 12.86±22.07 | 12.52±21.77 | 13.68±22.96 | 0.754 |
| Leukocytes 106/L, mean ± SD | 7056±2290 | 7140±2451 | 6856±1856 | 0.399 |
| Platelets 106/L, mean ± SD | 220.1±107 | 220.8±112 | 218.7±93 | 0.897 |
Training and test data is compared. bili, Bilirubin, mg/dL; alb, Albumin, g/L; AST, aspartate transferase, U/L, ALT, alanine transferase, U/L; afp, alpha Fetoprotein, ng/mL; aPTT, s; INR, international normalized ratio; crea, Creatinine, mg/dL; CRP, C-reactive protein, mg/L; WBC, Leukocytes, 106/L; plt, Platelets, 106/L; SD, standard deviation.
Results from multivariate analysis of disease-free survival for 90 days after stepwise selection of variables
| Univariate analysis | Multivariate analysis | ||||||
|---|---|---|---|---|---|---|---|
| HR | Confidence interval | P value | HR | Confidence interval | P value | ||
| Age at operation | 1.034 | 0.989–1.081 | 0.139 | 1.036 | 0.980–1.095 | 0.212 | |
| Creatinine | 6.297 | 1.922–20.62 | 0.002 | 5.662 | 1.041–30.798 | 0.045 | |
| HBV infection | 2.118 | 0.781–5.744 | 0.140 | 3.748 | 1.190–11.802 | 0.024 | |
| Kings-Score | 1.001 | 1.000–1.002 | 0.011 | 1.000 | 0.999–1.001 | 0.790 | |
| Portal hypertension | 3.571 | 1.055–12.09 | 0.041 | 2.352 | 0.513–10.776 | 0.271 | |
HBV, (Hepatitis B Virus).
Results from multivariate analysis of disease-free survival for 24 months after stepwise selection of variables
| Univariate analysis | Multivariate analysis | ||||||
|---|---|---|---|---|---|---|---|
| HR | Confidence interval | P value | HR | Confidence interval | P value | ||
| AFP (>21.5 ng/mL) | 1.050 | 1.001–1.121 | <0.001 | 1.020 | 1.001–1.081 | 0.022 | |
| Albumin | 0.964 | 0.933–0.996 | 0.028 | 0.952 | 0.897–1.011 | 0.109 | |
| CTP Score | 0.671 | 0.421–1.070 | 0.094 | 0.676 | 0.378–1.209 | 0.187 | |
| mGPS | 0.418 | 0.254–0.690 | <0.001 | 0.5823 | 0.317–1.069 | 0.081 | |
AFP, alpha Fetoprotein; ng/mL; CTP, child-turcotte-pugh score.
Results from multivariate analysis for the entire follow-up period after stepwise selection of variables
| Univariate analysis | Multivariate analysis | ||||||
|---|---|---|---|---|---|---|---|
| HR | Confidence interval | P value | HR | Confidence interval | P value | ||
| AFP (<21.5 ng/mL) | 0.672 | 0.460–0.983 | 0.041 | 0.653 | 0.431–0.989 | 0.044 | |
| AST (<41.5 U/L) | 1.003 | 1.001–1.004 | 0.003 | 0.651 | 0.432–0.980 | 0.039 | |
| BCLC >A | 2.050 | 1.328–3.165 | 0.002 | 1.769 | 1.122–2.786 | 0.014 | |
| C-Reactive Protein | 1.01 | 1.003–1.017 | 0.008 | 0.999 | 0.987–1.010 | 0.873 | |
| MELD | 1.123 | 1.020–1.237 | 0.018 | 0.560 | 0.370–0.847 | 0.006 | |
| mGPS =0 | 0.444 | 0.286–0.687 | <0.001 | 0.460 | 0.245–0.867 | 0.016 | |
| Largest tumor in mm | 1.007 | 1.003–1.011 | <0.001 | 1.002 | 0.997–1.007 | 0.472 | |
AFP, alpha Fetoprotein, ng/mL; AST, aspartate transferase, U/L; mGPS, modified Glascow Prognostic Scale; MELD, model of endstage liver disease; BCLC, Barcelona clinic liver cancer.
Results from univariate analysis of variables that were not predictive for the entire follow-up period
| Characteristic | Univariate analysis | ||
|---|---|---|---|
| HR | Confidence interval | P value | |
| Age at operation | 0.99 | 0.984–1.014 | 0.850 |
| ALT | 1.001 | 0.998–1.004 | 0.434 |
| Bilirubin | 1.031 | 0.966–1.100 | 0.417 |
| Creatinine | 1.634 | 0.729–3.666 | 0.233 |
| CTP score | 0.942 | 0.334–2.658 | 0.631 |
| Alcoholic liver disease | 0.684 | 0.382–1.224 | 0.180 |
| Extend of liver resection | 1.264 | 0.860–1.856 | 0.233 |
| Gender | 0.875 | 0.559–1.371 | 0.565 |
| HBV Iinfection | 1.345 | 0.778–2.325 | 0.306 |
| HCV Iinfection | 0.813 | 0.484–1.366 | 0.423 |
| International normalized ratio | 3.469 | 0.477–25.23 | 0.224 |
| Number of lesions | 1.033 | 0.839–1.271 | 0.764 |
| Platelets | 1.001 | 0.999–1.003 | 0.287 |
| Portal hypertension | 1.11 | 0.440–2.785 | 0.832 |
ALT, alanine transferase, U/L; CTP Score, child-turcotte-pugh score; HBV, hepatitis B virus; HCV, hepatitis C virus.
Figure 3ROC curves of independently predictive continuous variables. (A) ROC curve of AFP; (B) ROC curve of AST; (C) ROC curve of Model of Endstage Liver Disease. ROC, receiver operating characteristic; AFP, alpha-Fetoprotein; AST, aspartate transferase; AUC, area under the curve.
Figure 4Accuracy dependent on the number of variables based on recursive feature elimination.
Figure 5“Receiver Operating Curve” of test data prediction based on the developed Random Forest model.
Figure 6“Receiver Operating Curve” of early DFS prediction based on the developed Random Forest model.
Figure 7Survival analysis of DFS within 2 years of follow-up after defining “Low-Risk” and “High-Risk” patients.
Figure 8Survival Analysis of DFS within 2 years of follow-up after defining “Low-Risk” and “High-Risk” patients divided by minor and major resection. Low risk (LR) Minor vs. Major Resection: P=0.2. High risk (HR) Minor vs. Major Resection: P=0.8.
Figure 9Decision curve analysis (DCA). Red dashed line indicates the net benefit of the Random Forest (RF) Model across a range of threshold probabilities. The horizontal dashed black line represents the assumptions that no patient will be treated. The solid grey line represents the assumption that all patients will be treated.