Teerha Piratvisuth1, Tawesak Tanwandee2, Satawat Thongsawat3, Wattana Sukeepaisarnjaroen4, Juan Ignacio Esteban5,6, Marta Bes6,7, Bruno Köhler8,9, Ying He10, Magdalena Swiatek-de Lange10, David Morgenstern11, Henry Lik-Yuen Chan12. 1. NKC Institute of Gastroenterology and Hepatology, Songklanagarind Hospital, Prince of Songkla University, Hat Yai, Thailand. 2. Division of Gastroenterology, Faculty of Medicine, Siriraj Hospital, Mahidol University, Bangkok, Thailand. 3. Department of Internal Medicine, Maharaj Nakorn Chiang Mai Hospital, Chiang Mai University, Chiang Mai, Thailand. 4. Faculty of Medicine, Srinagarind Hospital, Khon Kaen University, Khon Kaen, Thailand. 5. Liver Unit, Hospital Universitari Vall d'Hebron, Barcelona, Spain. 6. Centro de Investigación Biomédica en red de Enfermedades Hepáticas y Digestivas, Insituto de Salud Carlos III, Madrid, Spain. 7. Transfusion Safety Laboratory, Banc de Sang i Teixits, Barcelona, Spain. 8. Department of Medical Oncology, National Center for Tumor Diseases, University Hospital Heidelberg, Heidelberg, Germany. 9. Liver Cancer Center Heidelberg, Heidelberg, Germany. 10. Roche Diagnostics GmbH, Penzberg, Germany. 11. Roche Diagnostics Operations, Indianapolis, IN, USA. 12. Department of Medicine and Therapeutics, Chinese University of Hong Kong, Hong Kong.
Abstract
Hepatocellular carcinoma (HCC), the sixth most common cancer worldwide, has an incidence rate equal to mortality. Over 80% of HCC cases occur within a high-risk population, mainly patients with both cirrhosis and chronic hepatitis B or C. With a 5-year survival rate ranging from <16% for advanced HCC to >90% for early stage HCC, there is a high medical need for the early detection of HCC. In this study, we systematically evaluated biomarkers mentioned in international guidelines and peer-reviewed literature for HCC surveillance and diagnosis with the aim of identifying combinations that display high sensitivity and specificity for early stage HCC. Fifty biomarkers were measured in the first sample panel, panel A (n = 110), and subjected to univariate analysis. Of these, 35 biomarkers (38 assays) from panel A and an additional 13 biomarkers from the literature were prioritized for subsequent multivariate evaluation with lasso regression and exhaustive search of two- to four-biomarker combinations (panel B). Panel B included 1,081 samples from patients with HCC (n = 308) or with chronic liver diseases (n = 740). Among all patients, 61.0% had hepatitis B, 32.9% had hepatitis C, and 60.5% had cirrhosis; 40.6% of patients with HCC had early stage cancer. Protein induced by vitamin K absence-II (PIVKA-II; also known as des-gamma-carboxy prothrombin [DCP]) and alpha-fetoprotein (AFP) demonstrated the best clinical performance, both individually and in combination, and the addition of a third biomarker (Lens culinaris agglutinin-reactive fraction of AFP [AFP-L3], cartilage oligomeric matrix protein [COMP], insulin-like growth factor-binding protein 3 [IGFBP3], or matrix metalloproteinase 3 [MMP3]) further increased sensitivity for the detection of both early stage and all-stage HCC. The addition of age and sex to the three-biomarker panel resulted in an improved diagnostic performance. Conclusion: The combination of AFP and PIVKA-II, with either IGFBP3, COMP or MMP3, plus age and sex, demonstrated the best performance for the detection of early- and all-stage HCC. These novel panels performed similar to that of the GALAD score (sex [gender], age, plus serum levels of AFP, AFP-L3 and DCP [PIVKA-II]), a promising screening tool developed for HCC detection.
Hepatocellular carcinoma (HCC), the sixth most common cancer worldwide, has an incidence rate equal to mortality. Over 80% of HCC cases occur within a high-risk population, mainly patients with both cirrhosis and chronic hepatitis B or C. With a 5-year survival rate ranging from <16% for advanced HCC to >90% for early stage HCC, there is a high medical need for the early detection of HCC. In this study, we systematically evaluated biomarkers mentioned in international guidelines and peer-reviewed literature for HCC surveillance and diagnosis with the aim of identifying combinations that display high sensitivity and specificity for early stage HCC. Fifty biomarkers were measured in the first sample panel, panel A (n = 110), and subjected to univariate analysis. Of these, 35 biomarkers (38 assays) from panel A and an additional 13 biomarkers from the literature were prioritized for subsequent multivariate evaluation with lasso regression and exhaustive search of two- to four-biomarker combinations (panel B). Panel B included 1,081 samples from patients with HCC (n = 308) or with chronic liver diseases (n = 740). Among all patients, 61.0% had hepatitis B, 32.9% had hepatitis C, and 60.5% had cirrhosis; 40.6% of patients with HCC had early stage cancer. Protein induced by vitamin K absence-II (PIVKA-II; also known as des-gamma-carboxy prothrombin [DCP]) and alpha-fetoprotein (AFP) demonstrated the best clinical performance, both individually and in combination, and the addition of a third biomarker (Lens culinaris agglutinin-reactive fraction of AFP [AFP-L3], cartilage oligomeric matrix protein [COMP], insulin-like growth factor-binding protein 3 [IGFBP3], or matrix metalloproteinase 3 [MMP3]) further increased sensitivity for the detection of both early stage and all-stage HCC. The addition of age and sex to the three-biomarker panel resulted in an improved diagnostic performance. Conclusion: The combination of AFP and PIVKA-II, with either IGFBP3, COMP or MMP3, plus age and sex, demonstrated the best performance for the detection of early- and all-stage HCC. These novel panels performed similar to that of the GALAD score (sex [gender], age, plus serum levels of AFP, AFP-L3 and DCP [PIVKA-II]), a promising screening tool developed for HCC detection.
American Association for the Study of Liver Diseasesalpha‐fetoproteinLens culinaris agglutinin‐reactive fraction of alpha‐fetoproteinalcoholic steatohepatitisarea under the curveBarcelona Clinic Liver Cancercartilage oligomeric matrix proteindes‐gamma‐carboxyprothrombin (also known as PIVKA‐II)European Association for the Study of the Liversex (gender), age, Lens culinaris agglutinin‐reactive fraction of alpha‐fetoprotein, alpha‐fetoprotein, des‐gamma‐carboxy prothrombinhepatitis B virushepatocellular carcinomahepatitis C virushuman epididymis protein 4insulin‐like growth factor‐binding protein 3Japanese Society of Hepatologymatrix metalloproteinase 3nonalcoholic steatohepatitisprotein induced by vitamin K absence‐II (also known as DCP)receiver operating curveultrasonographyHepatocellular carcinoma (HCC) is the sixth most common cancer worldwide and the third leading cause of cancer‐related mortality (~800,000 deaths/year).(
,
,
,
) HCC usually develops in patients with cirrhosis and is triggered by continuous cycles of inflammation and repair in the liver, which may be caused by chronic viral infection (hepatitis B virus [HBV], hepatitis C virus [HCV]), or alcoholic or nonalcoholic steatohepatitis (ASH or NASH).(
,
,
,
,
) HCC is typically diagnosed at advanced stages, leaving limited treatment options and resulting in a dismal 5‐year survival rate of <16%; however, early detection and prompt treatment significantly increases survival.(
) The average overall survival ranges from 80 months for early stage HCC (stage A; single nodule or three nodules <3 cm) to 15 months for stage C (portal invasion, nodal invasion, and metastases); patients with stage D (severe liver damage and performance status >2) have a dismal overall survival of approximately 4 months.(
,
) Current guidelines for HCC management recommend surveillance of high‐risk patients every 6 months using ultrasonography (USG); however, its effectiveness in the detection of early stage HCC is limited by operator skill and expertise and patient characteristics.(
,
,
,
,
,
) USG displays 51% sensitivity and 91% specificity for early stage HCC diagnosis; magnetic resonance imaging (MRI) and computed tomography (CT) display higher sensitivity (83.7% and 62.5%, respectively) and similar specificity (89.1% and 87.5%, respectively), but both are more costly with exposure to radiation and/or contrast agents and are therefore infrequently employed for HCC surveillance.(
,
,
,
,
) The measurement of the tumor biomarker alpha‐fetoprotein (AFP) has a reported sensitivity of 41%‐65% and specificity of 80%‐90% for the detection of HCC at any stage (20 ng/mL cutoff), but its use remains controversial.(
) AFP is included in the recommendations by the Japanese Society of Hepatology (JSH), Asia‐Pacific Association for the Study of the Liver, and American Association for the Study of Liver Diseases (AASLD) but not the European Association for the study of the Liver (EASL).(
) Two other biomarkers, des‐gamma‐carboxyprothrombin (DCP; also known as protein induced by vitamin K absence II [PIVKA‐II]), and Lens culinaris agglutinin‐reactive fraction of AFP (AFP‐L3), while routinely used for HCC screening in Japan, are not widely used outside Japan and are not included in other guidelines. However, both have been approved as risk markers of HCC by the US Food and Drug Administration in the United States and have a European Conformity marking in Europe.(
,
) The addition of age and sex (gender) to biomarkers AFP, AFP‐L3%, and DCP (also known as PIVKA‐II) in a diagnostic algorithm, the GALAD score, has resulted in a specificity of 81.6%‐93.3% and sensitivity of 80.2%‐85.6%, depending on where the analysis was undertaken, for the detection of early stage HCC. This is superior to any biomarker used alone.(
,
) Despite these promising developments, novel serologic tests have not yet been incorporated into the guidelines for the surveillance of patients at high risk.(
,
,
) The development of a serologic biomarker panel that can be applied to populations with diverse liver disease etiologies and complement the limitations of USG to improve the detection of early stage HCC is essential. The aim of this study was to systematically and comprehensively evaluate the sensitivity and specificity of a curated collection of biomarkers in a large sample cohort and to determine the biomarkers and biomarker combinations best suited for detection of early stage HCC.
Patients and Methods
Biomarker Selection
Biomarker candidates were included in this study if they were mentioned in the following international guidelines: AASLD, National Comprehensive Cancer Network, European Society for Medical Oncology–European Society of Digestive Oncology, EASL–European Organization for Research and Treatment of Cancer, British Society of Gastroenterology, National Institute for Health and Care Excellence, JSH, Liver Cancer in China guidelines, and Liver Cancer Association–National Cancer Center Korea Practice guidelines.(
,
,
,
,
,
,
,
) An additional PubMed literature search was performed using the term “HCC diagnosis, screening, and surveillance biomarkers.” All biomarker candidates were then evaluated in identical sample panels, permitting head‐to‐head comparison between biomarkers, and a multivariate analysis of all potential biomarker combinations was undertaken.
Study Cohort
The study recruited patients at high risk for developing HCC, diagnosed with either cirrhotic liver disease (independent of etiology), noncirrhotic NASH, chronic HBV infection, chronic HCV infection, and liver fibrosis (stage ≥F3)/cirrhosis or lesions suspicious for HCC based on imaging or untreated HCC. Patients were ineligible for the study if they were <18 years old, were receiving cancer treatment, had a history of other malignancy (other than liver cancer), had received immunomodulatory treatment within the last 6 months, or did not have sonography information within <4 weeks. Before the start of any cancer‐related therapy, a single blood draw was taken and each patient’s diagnosis was documented. Six samples from patients who took anticoagulants (warfarin/phenprocoumon) were excluded from PIVKA‐II analysis because the PIVKA‐II level was shown to significantly increase after warfarin administration.(
)Recruited patients were split into two cohorts, panels A and B. Biomarkers were assessed in a step‐wise approach, allowing head‐to‐head comparison of single biomarker performance in the first cohort (panel A) and then a multivariate analysis of possible biomarker combinations in the second cohort (panel B).
Sample Preparation, Storage, and Measurement
Blood serum and ethylene diamine tetraacetic acid‐plasma samples were collected before surgery, percutaneous ethanol injection, chemotherapy, or radiotherapy. Samples were stored at −70°C until analysis; repeated freeze–thaw cycles were avoided. A combination of two radiologic methods (USG, CT, or MRI) was used to confirm HCC diagnosis in cirrhotic liver. Biopsy and histopathologic analysis was mandatory in noncirrhotic liver. Biomarkers were analyzed using mouse microsomal triglyceride transfer protein enzyme‐linked immunosorbent assay, integrated mutation profiling of actionable cancer targets (IMPACT),(
) cobas e 601, cobas c 311, Fujifilm Micro Total Analysis System Wako, or Lumipulse immunoassays, according to the instructions of the manufacturers (Supporting Tables S1 and S2).
Data Analyses
Univariate receiver operating curve (ROC) analysis was used in panel A to assess the area under the curve (AUC) of each biomarker for distinguishing all‐stage HCC versus benign liver disease controls (Fig. 1). Biomarkers were prioritized and selected based on the clinical performance of AUC ≥ 0.7.
FIG. 1
Biomarker analysis strategy. Abbreviations: AUC, area under the curve; HCC, hepatocellular carcinoma.
Biomarker analysis strategy. Abbreviations: AUC, area under the curve; HCC, hepatocellular carcinoma.Selected biomarkers were analyzed with multivariate models in the second cohort (panel B). The aim of the multivariate analysis was to identify the best performing panel of biomarkers that could separate early stage HCC or all‐stage HCC from benign controls with high accuracy. Multivariate analyses were performed using two methods: 1) lasso regression (no fixed panel size) and 2) exhaustive search with logistic regression (fixed panel size of two to four biomarkers). In lasso regression, the number of selected biomarkers was determined by the hyperparameter “lambda” that controlled the number of features for each model. The lasso regression optimized the best model by maximizing the diagnostic accuracy while minimizing the number of biomarkers. In the exhaustive search, logistic regression models based on all possible combinations of two to four biomarkers were evaluated and compared. All possible two‐biomarker combinations were analyzed among the 41 biomarkers. Next, one or two possible biomarkers were added to the best performing two‐biomarker combination, PIVKA‐II + AFP, to search for three‐ and four‐biomarker combinations, e.g., PIVKA‐II + AFP + X and PIVKA‐II + AFP + X + Y. Estimation of diagnostic accuracy was done by a nested cross‐validation, with the data split randomly into training (3/4) and test (1/4) sets. The optimal model was selected using an inner cross‐validation of the training set and tested on the remaining data set. This procedure was repeated 100 times. The test data in the outer loop were used to evaluate the clinical performance of the optimal model. The nested cross‐validation procedure provided an estimate of the stability and a robust estimate of the performance (to avoid overfitting) for each model (Fig. 1). The performance of AUC, sensitivity at 90% specificity, and specificity at 90% sensitivity were reported for each model for detecting early stage and all‐stage HCC. The acceptance criteria (target performance) for the detection of early stage HCC (Barcelona Clinic Liver Cancer [BCLC] stages 0 and A) were preset at 90% sensitivity with specificity >70% or at 90% specificity with sensitivity >70%; this was also used as a reference for all‐stage HCC. The best performing biomarker combinations identified in the panel plus age and sex were then compared with the GALAD algorithm.In addition, the combination of the selected biomarkers using AND/OR rules was performed and compared with the models of the same biomarkers, using logistic regression from the multivariate analyses. In the AND rule analysis, the overall three‐biomarker combination was considered positive if all three biomarkers were positive and negative if any one biomarker was negative. In the OR rule analysis, the overall three‐biomarker combination was considered positive if any of the three biomarkers were positive and negative if all three biomarkers were negative. The cutoffs used in this analysis were prespecified.
Results
Blood samples were obtained from 1,322 patients between 2014 and 2016 at seven centers in four countries (China, Germany, Spain, and Thailand) (Fig. 2).
FIG. 2
Study design and sample selection. Highlighted boxes show which samples were included in the analysis. *No measurements as a sample were taken on visit 2. †One sample from two visits and only the sample from visit one was measured. ‡Five samples were excluded due to a suspicious diagnosis at visit one or a second replicate diagnosis at visit two. #Patients had CCC, mixed HCC/CCC, or variable HCC diagnosis and were excluded from the analysis. Abbreviation: CCC, cholangiocellular carcinoma; HCC, hepatocellular carcinoma.
Study design and sample selection. Highlighted boxes show which samples were included in the analysis. *No measurements as a sample were taken on visit 2. †One sample from two visits and only the sample from visit one was measured. ‡Five samples were excluded due to a suspicious diagnosis at visit one or a second replicate diagnosis at visit two. #Patients had CCC, mixed HCC/CCC, or variable HCC diagnosis and were excluded from the analysis. Abbreviation: CCC, cholangiocellular carcinoma; HCC, hepatocellular carcinoma.In panel A, 50 biomarkers were measured in 110 samples from 110 patients; 72.7% were men, 31.8% had cirrhosis, 17.3% and 16.4% had chronic HBV and HCV infection, respectively, 2.7% had ASH, and 0.9% had NASH (Table 1; Fig. 3). Of all biomarkers evaluated in panel A, 35 biomarkers had AUC ≥ 0.7 and were prioritized for multivariate evaluation. With the addition of two further assays for PIVKA‐II and one assay for carboxylesterase 1, 38 assays were selected; a further 13 biomarkers (detected with mainly commercially available, in vitro, diagnostic assays) were included without panel A data (Supporting Tables S1 and S2). In total, 51 biomarker assays were measured in panel B. Biomarkers were excluded for multivariate analysis if the same biomarkers were replicated on Fujifilm Micro Total Analysis System Wako, Lumipulse, or IMPACT platforms.
TABLE 1
Summary of patient demographic for samples analyzed in Panel A and Panel B
Demographics
Panel A
Control (n = 59)
HCC (n = 51)
Mean age, years
54.4
61
Gender, n (%)
Female
25 (42.4)
5 (9.8)
Male
34 (57.6)
46 (90.2)
Cirrhosis, n (%)
0 (0.0)
35 (68.6)
Panel B
Control (n = 740)
HCC (n = 308)
Mean age, years
55.5
60.8
Gender, n (%)
Female
323 (43.7)
65 (21.1)
Male
417 (56.4)
243 (78.9)
Race, n (%)
African black
2 (0.3)
0 (0.0)
Asian
638 (86.2)
248 (80.5)
Caucasian/white
92 (12.4)
55 (17.9)
Mixed
7 (0.9)
3 (1.0)
Other
1 (0.1)
2 (0.6)
Cirrhosis, n (%)
395 (53.4)
239 (77.6)
ASH, alcoholic steatohepatitis; HBV, hepatitis B virus; HCC, hepatocellular carcinoma; HCV, hepatitis C virus; n.a., not applicable; NASH, non‐alcoholic steatohepatitis.
FIG. 3
Baseline disease characteristics. (A,B) Percentage of patients included in panel A. (C,D) Percentage of patients included in panel B. *Other comprised adenoma, focal nodular hyperplasia, and hemangioma. Abbreviations: ASH, alcoholic steatohepatitis; BCLC, Barcelona clinic liver cancer; HCC, hepatocellular carcinoma; NASH, non‐alcoholic steatohepatitis.
Summary of patient demographic for samples analyzed in Panel A and Panel BASH, alcoholic steatohepatitis; HBV, hepatitis B virus; HCC, hepatocellular carcinoma; HCV, hepatitis C virus; n.a., not applicable; NASH, non‐alcoholic steatohepatitis.Baseline disease characteristics. (A,B) Percentage of patients included in panel A. (C,D) Percentage of patients included in panel B. *Other comprised adenoma, focal nodular hyperplasia, and hemangioma. Abbreviations: ASH, alcoholic steatohepatitis; BCLC, Barcelona clinic liver cancer; HCC, hepatocellular carcinoma; NASH, non‐alcoholic steatohepatitis.Panel B analysis was performed on 1,089 samples from 1,084 patients and, after excluding samples with suspicious diagnosis and questionable serum/plasma description, the final sample population consisted of 1,081 samples (740 controls, 308 HCC, and 33 with other diagnoses) (Fig. 2). Among controls and patients with HCC, 61% had HBV (63.9% controls and 53.9% patients with HCC), 32.9% had HCV (33% controls and 32.8% patients with HCC); 60.5% had cirrhosis (53.4% controls and 77.6% patients with HCC); 5.3% had noncirrhotic NASH (6.5% controls and 2.3% patients with HCC), and 9.5% had ASH (5.1% controls and 20.1% patients with HCC) (Table 1; Fig. 3). Controls were comprised of patients with chronic liver disease, such as cirrhotic liver disease, noncirrhotic NASH, ASH, and HBV or HCV. Among patients with HCC, 40.6% had BCLC stage 0 or A, 25.0% stage B, 26.6% stage C, and 3.9% stage D; stage information was missing for 3.9% of patients (Table 1; Fig. 3).Although the univariate ROC analysis showed that no single biomarker fulfilled clinical requirements for the detection of early stage or all‐stage HCC, PIVKA‐II and AFP displayed the best individual clinical performance for early stage HCC versus controls, with AUC of 79.0% (PIVKA‐II) and 83.4% (AFP) (Fig. 4). The 20 best performing biomarkers from panel B are shown in Table 2.
FIG. 4
ROC curves showing univariate analyses of early stage HCC versus controls with chronic liver diseases of PIVKA‐II, AFP, MMP3, IGFBP3, and COMP. Abbreviation: CI, confidence interval. Abbreviations: AFP, alpha‐fetoprotein; AUC, area under the curve; COMP, cartilage oligomeric matrix protein; HCC, hepatocellular carcinoma; IGFBP3, Insulin‐like growth factor‐binding protein 3; MMP3, matrix metalloproteinase‐3; PIVKA‐II, protein induced by vitamin k absence‐II; ROC, receiver operating characteristics.
TABLE 2
Top 20 individual biomarkers that showed best performance in the univariate analysis for detection of early stage HCC (panel B)
Biomarker
AUC %
HCC sensitivity %*
AFP cobas e 601
83.4
53.6
PIVKA‐II cobas e 601
79.0
56.0
GPC3‐N
78.2
36.8
AFP‐L3% Fujifilm Micro Total Analysis System Wako
77.8
47.2
AFP‐L3 cobas e 601
76.5
48.8
HE4
76.5
34.4
IGFBP3
74.0
30.4
HGF
72.5
30.4
MMP2
72.4
26.4
TIMP1
71.1
30.4
Ang2
71.0
28.8
GGT2
70.9
28.0
COMP
70.2
36.0
GP73
70.1
31.2
sAxl
70.1
26.4
Cyfra 21‐1
69.7
22.4
DKK1
68.9
21.6
CEA
68.8
25.6
IGF1
68.7
22.0
IL6
68.3
28.0
CA 19‐9
68.1
32.0
Abbreviations: Ang2, angiopoietin‐2; CA 19‐9, cancer antigen 19‐9; CEA, carcinoembryonic antigen; Cyfra 21‐1, cytokeratin 19 fragments; DKK1, dickkopf‐related protein 1; GGT2, gamma‐glutamyltransferase‐2; GP73, Golgi membrane protein 1; GPC3, glypican‐3; HGF, hepatocyte growth factor; IGF1, insulin‐like growth factor 1; IL6, interleukin‐6; sAxl, soluble Axl; TIMP1, tissue inhibitor of metalloproteinase‐1.
Sensitivity of biomarkers was assessed at 90% specificity.
ROC curves showing univariate analyses of early stage HCC versus controls with chronic liver diseases of PIVKA‐II, AFP, MMP3, IGFBP3, and COMP. Abbreviation: CI, confidence interval. Abbreviations: AFP, alpha‐fetoprotein; AUC, area under the curve; COMP, cartilage oligomeric matrix protein; HCC, hepatocellular carcinoma; IGFBP3, Insulin‐like growth factor‐binding protein 3; MMP3, matrix metalloproteinase‐3; PIVKA‐II, protein induced by vitamin k absence‐II; ROC, receiver operating characteristics.Top 20 individual biomarkers that showed best performance in the univariate analysis for detection of early stage HCC (panel B)Abbreviations: Ang2, angiopoietin‐2; CA 19‐9, cancer antigen 19‐9; CEA, carcinoembryonic antigen; Cyfra 21‐1, cytokeratin 19 fragments; DKK1, dickkopf‐related protein 1; GGT2, gamma‐glutamyltransferase‐2; GP73, Golgi membrane protein 1; GPC3, glypican‐3; HGF, hepatocyte growth factor; IGF1, insulin‐like growth factor 1; IL6, interleukin‐6; sAxl, soluble Axl; TIMP1, tissue inhibitor of metalloproteinase‐1.Sensitivity of biomarkers was assessed at 90% specificity.Multivariate analysis using lasso regression was performed using 41 biomarkers to identify parsimonious models that met the target performance of 70% sensitivity at 90% specificity or 70% specificity at 90% sensitivity. Lasso regression with no fixed panel size (number of variables in the model not specified) identified PIVKA‐II and AFP as the best combination of biomarkers for the identification of both all‐stage and early stage HCC versus controls (Supporting Table S3; Supporting Fig. S1). The addition of either gamma‐glutamyltransferase 2, human epididymis protein 4 (HE4), osteoprotegerin (OPG), or insulin‐like growth factor binding protein 3 (IGFBP3) to PIVKA‐II + AFP for all‐stage HCC (Supporting Fig. S1A) and HE4, cartilage oligomeric matrix protein (COMP), carcinoembryonic antigen, matrix metalloproteinase 3 (MMP3), or OPG to PIVKA‐II + AFP for early stage HCC (Supporting Fig. S1B) enhanced diagnostic value. No further improvement in clinical performance was observed with models containing more than three biomarkers. We then undertook an exhaustive search of all combinations of two to four biomarkers (approximately 1,600 combinations) to separate early stage HCC from controls (125 vs. 740) and all‐stage HCC from controls (308 vs. 740). The exhaustive search determined that only PIVKA‐II + AFP met all criteria for the identification of all‐stage HCC versus controls, with no enhanced detection noted on the addition of a third biomarker (Supporting Table S4). However, for early stage HCC versus controls, none of the two‐biomarker panels met the acceptance criteria and only the three‐biomarker panels, composed of PIVKA‐II + AFP with either MMP3, IGFBP3, or COMP, met the acceptance criteria (Table 3). Notably, however, the third‐selected biomarker (IGFBP3, COMP, or MMP3) was not consistently selected in the multivariate analyses (Supporting Table S4).
TABLE 3
Clinical performance of the biomarker combinations showing 90% specificity/sensitivity in the multivariate analysis, with and without age and gender, for all‐ and early‐stage HCC
All‐stage HCC (n = 308) vs controls (n = 734)*
Early‐stage HCC (n = 125) vs controls (n = 734)*
AUC, %
Sensitivity at 90% specificity, %
Specificity at 90% sensitivity, %
AUC, %
Sensitivity at 90% specificity, %
Specificity at 90% sensitivity, %
Biomarker Combination Without Age and Gender (Sex)
AFP+PIVKA‐II+MMP3
94.4
83.8
84.7
89.1
65.6
72.9
AFP+PIVKA‐II+IGFBP3
94.3
85.4
85.8
88.5
68.0
74.3
AFP+PVKA‐II+COMP
94.4
84.1
85.6
89.0
67.2
74.1
GALAD (cobas e 601 by Roche)
94.1
84.4
82.3
88.1
68.0
69.1
GALAD (Fujifilm Micro Total Analysis System WakoTM)
94.5
81.8
84.4
89.8
64.8
77.1
Biomarker Combination With Age and Gender (Sex)
AFP+PIVKA‐II+MMP3
95.3
84.4
85.9
90.7
67.2
75.1
AFP+PIVKA‐II+IGFBP3
95.3
87.0
85.8
90.6
71.2
77.4
AFP+PVKA‐II+COMP
95.4
85.7
85.5
90.9
72.8
80.2
GALAD (cobas e 601 by Roche)
95.2
86.0
84.1
90.5
71.2
77.7
GALAD (Fujifilm Micro Total Analysis System WakoTM)
94.5
81.8
84.4
89.8
64.8
77.1
AFP, alpha‐fetoprotein; AFP‐L3, Lens culinaris agglutinin‐reactive fraction of AFP; AUC, area under the curve; COMP, cartilage oligomeric matrix protein; GALAD, Gender, age, AFP‐L3, AFP, and PIVKA‐II; HCC, hepatocellular carcinoma; HE4, epididymal protein 4; IGFBP3, Insulin‐like growth factor‐binding protein 3; IL6, interleukin 6; MMP3, matrix metalloproteinase‐3; PIVKA‐II, protein induced by vitamin k absence‐II.
*Samples with missing measurements were excluded from the multivariate analysis.
Clinical performance of the biomarker combinations showing 90% specificity/sensitivity in the multivariate analysis, with and without age and gender, for all‐ and early‐stage HCCAFP, alpha‐fetoprotein; AFP‐L3, Lens culinaris agglutinin‐reactive fraction of AFP; AUC, area under the curve; COMP, cartilage oligomeric matrix protein; GALAD, Gender, age, AFP‐L3, AFP, and PIVKA‐II; HCC, hepatocellular carcinoma; HE4, epididymal protein 4; IGFBP3, Insulin‐like growth factor‐binding protein 3; IL6, interleukin 6; MMP3, matrix metalloproteinase‐3; PIVKA‐II, protein induced by vitamin k absence‐II.*Samples with missing measurements were excluded from the multivariate analysis.The combination of the selected biomarkers PIVKA‐II + AFP plus a third biomarker (either MMP3, IGFBP3, or COMP) using AND/OR rules did not show better clinical performance compared with the logistic regression models. None of the AND/OR combinations reached the target sensitivity and specificity (data not shown).The addition of age and sex to the three‐biomarker panels further enhanced diagnostic performance (Table 3). The HCC detection capability of the novel biomarker panel AFP + PIVKA‐II with either COMP, MMP3, or IGFBP3, plus age and sex, was compared with that of the GALAD score (−10.08 + 0.09 × Age + 1.67 × Sex + 2.34 × log10[AFP] + 0.04 × AFP‐L3% + 1.33 × log10[PIVKA‐II]), where Sex = 1 for men and 0 for women(
,
) (Fig. 5). The performance of the novel biomarker panels was similar to the GALAD score. The addition of age and sex to the novel three‐biomarker panels enhanced specificity at the high‐sensitivity region; similar results were obtained for two versions of the GALAD score, using assays from different manufacturers, and for the detection of all‐stage and early stage HCC (AUC > 95% and >90%, respectively) (Table 3; Supporting Table S5; Supporting Fig. S2).
FIG. 5
Results of combined biomarker panels plus age and gender (sex). (A) For all‐stage HCC. (B) For early stage HCC. aFujifilm, Fujifilm Micro Total Analysis System Wako. Abbreviations: AFP, alpha‐fetoprotein; AUC, area under the curve; COMP, cartilage oligomeric matrix protein; GALAD, Gender, age, AFP‐l3, AFP, and PIVKA‐II; IGFBP3, Insulin‐like growth factor‐binding protein 3; MMP3, matrix metalloproteinase‐3.
Results of combined biomarker panels plus age and gender (sex). (A) For all‐stage HCC. (B) For early stage HCC. aFujifilm, Fujifilm Micro Total Analysis System Wako. Abbreviations: AFP, alpha‐fetoprotein; AUC, area under the curve; COMP, cartilage oligomeric matrix protein; GALAD, Gender, age, AFP‐l3, AFP, and PIVKA‐II; IGFBP3, Insulin‐like growth factor‐binding protein 3; MMP3, matrix metalloproteinase‐3.
Discussion
In this study, we described the most comprehensive analysis to date of a large panel of serum biomarkers that have been proposed for surveillance of HCC. Through univariate and multivariate analysis, we identified candidate biomarker combinations that were compared with AFP alone and with the GALAD model.In summary, none of the evaluated individual biomarkers alone met the clinical requirements for the detection of early stage HCC with high sensitivity or high specificity. However, the combination of AFP + PIVKA‐II, along with a third biomarker (AFP‐L3, IGFBP3, COMP, or MMP3) plus age and sex demonstrated the best performance for the detection of early stage and all‐stage HCC. Further, when compared with the GALAD score, which has been shown to be superior to AFP for the detection of early stage and all‐stage HCC, the assessed three‐biomarker panels comprised of AFP + PIVKA‐II and either COMP, IGFBP‐3, or MMP3, plus age and sex, demonstrated similar sensitivity and specificity.(
,
,
) Importantly, the assessed three‐biomarker panels did not reach the requirements for high sensitivity without taking into account age and sex.The assessment of serum AFP for HCC surveillance among individuals at high risk is established in guidelines throughout Asia, but its use remains controversial elsewhere.(
,
,
,
) AFP has been inconsistently incorporated into guidelines. For example, it is recommended for surveillance in JSH guidelines but not in EASL or AASLD guidelines.(
) Although multiple studies have highlighted the value of AFP, either in combination with USG or other known biomarkers, in the detection of both early stage and all‐stage HCC, its utility is limited due to elevations in patients with chronic hepatitis (15%‐58%) and liver cirrhosis (11%‐47%) and due to the fact that levels are not elevated in up to 50% of patients with HCC.(
) These limitations have driven the search for novel biomarkers that could improve early stage HCC surveillance.Although many promising candidates have been evaluated, none have yet been adopted into clinical practice.(
,
) The assessment of these biomarkers is hampered by a lack of head‐to‐head data in large well‐characterized cohorts, with most studies including only a small number of biomarkers. We therefore developed a panel of biomarkers intended to be as comprehensive as possible by curating a list of those described in HCC guidelines and in peer‐reviewed literature at the time the study was initiated. We undertook a systematic assessment of their univariate performance and then applied multivariate analysis followed by an exhaustive search to identify the best performing combinations from this large panel.Our data show that none of the tested biomarkers, including previously extensively studied AFP, PIVKA‐II, glypican‐3, dickkopf‐related protein 1, and Golgi membrane protein 1, had sufficient sensitivity as single markers for routine use in HCC surveillance. The sensitivity for early stage HCC of the two best performing individual markers, AFP and PIVKA‐II, increased from 53.6% and 56.0%, respectively, to ~70% when used in combination. This is in accordance with studies reporting individual AFP and PIVKA‐II sensitivities of 41%‐65% and 28%‐89%, respectively, compared with a sensitivity of 78.3% when used in combination for the detection of all‐stage HCC.(
,
,
,
)Additionally, our comprehensive analysis has identified COMP, IGFBP3, and MMP3 as biomarkers that further enhance the clinical performance of AFP + PIVKA‐II, when used as a three‐biomarker panel, for the detection of both early stage and all‐stage HCC. Taken together, our data and the results from previous studies emphasize the need for additional HCC‐specific biomarkers that could further enhance the diagnostic value of AFP.The GALAD score has shown great potential sensitivity for the detection of early stage HCC and may be particularly well suited to the surveillance of patients with NASH in whom the performance of USG is limited.(
,
,
,
) In this study, several biomarker combinations containing AFP + PIVKA‐II, with the addition of either COMP, IGFBP3, or MMP3, plus age and sex, displayed similar sensitivities and specificities to those of the GALAD score for the detection of early stage and all‐stage HCC. The success of both the three‐biomarker panels analyzed and the GALAD model relies on taking into account specific risk factors, such as male sex and advanced age, when determining the HCC risk for patients with chronic liver diseases.(
) GALAD also allows the cutoff to be adjusted to customize the sensitivity and specificity of the biomarker panels according to local patient demographics and prevalence of HCC, for which substantial regional differences have been described.(
,
,
) All three novel three‐biomarker panels fulfilled the acceptance criteria and performed at a similar level as the GALAD score for early stage HCC detection; however, the third biomarker in each novel panel was inconsistently selected in the exhaustive search process and therefore require independent confirmation. Importantly, the performance of the novel three‐biomarker panels should be viewed cautiously as the performance was based on nested cross‐validation from a single study, which may overestimate performance, whereas the clinical performance of the GALAD algorithm has been validated in several independent studies.(
,
,
,
,
) Limitations of this study are the patient population, which was comprised mainly of patients with liver disease due to chronic hepatitis, which could have influenced the biomarkers selected in the multivariate analysis. However, multiple studies assessing GALAD in a greater proportion of patients with chronic liver disease with nonviral etiology have yielded similar results.(
,
,
,
) Details on etiology were also missing from a number of patients in panel A. Additionally, the majority of included patients with HCC identified as Asian and few women were enrolled; however, this is reflective of the higher incidence of HCC in both Asian and male populations.(
) In order to validate the clinical performance of the biomarker combinations, this study had a phase 2 case‐control design and included patients with varying baseline and disease characteristics, such as sex, age, liver disease etiology, and the presence of cirrhosis. Ongoing phase 3 studies are expected to provide additional evidence in support of biomarker combinations in longitudinal studies.(
,
) Another limitation of the current study was using different assay formats that may have detected different analyte pools. However, in this study, the clinical performance of AFP, PIVKA‐II, and AFP‐L3 across all the assay formats was shown to be comparable.In conclusion, we assessed the largest number of biomarkers for HCC surveillance to date and identified combinations of serum biomarkers that may be of use in the early diagnosis of patients at risk. These assays are rapid, easy to perform, noninvasive, and less dependent on operator skill than USG. Prospective studies to validate such biomarker panels are urgently needed. Currently, there are several large ongoing studies collecting samples for this purpose, including the Hepatocellular Carcinoma Early Detection Strategy study(
) and the Texas Hepatocellular Carcinoma Consortium Cohort study.(
) While a single‐center case‐control study found that the addition of USG provided only minimal improvement in the performance of the GALAD score,(
) USG results were scored categorically (either positive or negative) when numerical/continuous variables, e.g., number and size of nodules, may have been more informative. Larger, prospective, multicenter studies are required to formally assess this combinational approach, ideally using real‐world screening cohorts. Thus, the potential for further improving detection rates for early stage HCC by combining biomarker panels with individual modalities, such as USG, may be of interest for future research.Supplementary MaterialClick here for additional data file.
Authors: Sarah Berhane; Hidenori Toyoda; Toshifumi Tada; Takashi Kumada; Chiaki Kagebayashi; Shinji Satomura; Nora Schweitzer; Arndt Vogel; Michael P Manns; Julia Benckert; Thomas Berg; Maria Ebker; Jan Best; Alexander Dechêne; Guido Gerken; Joerg F Schlaak; Arndt Weinmann; Marcus A Wörns; Peter Galle; Winnie Yeo; Frankie Mo; Stephen L Chan; Helen Reeves; Trevor Cox; Philip Johnson Journal: Clin Gastroenterol Hepatol Date: 2016-01-13 Impact factor: 11.382
Authors: Jan Best; Lars P Bechmann; Jan-Peter Sowa; Svenja Sydor; Alexander Dechêne; Kristina Pflanz; Sotiria Bedreli; Clemens Schotten; Andreas Geier; Thomas Berg; Janett Fischer; Arndt Vogel; Heike Bantel; Arndt Weinmann; Jörn M Schattenberg; Yvonne Huber; Henning Wege; Johann von Felden; Kornelius Schulze; Dominik Bettinger; Robert Thimme; Friedrich Sinner; Kerstin Schütte; Karl Heinz Weiss; Hidenori Toyoda; Satoshi Yasuda; Takashi Kumada; Sarah Berhane; Marc Wichert; Dominik Heider; Guido Gerken; Philip Johnson; Ali Canbay Journal: Clin Gastroenterol Hepatol Date: 2019-11-08 Impact factor: 11.382
Authors: Ju Dong Yang; Benyam D Addissie; Kristin C Mara; William S Harmsen; Jianliang Dai; Ning Zhang; Nicha Wongjarupong; Hawa M Ali; Hamdi A Ali; Fatima A Hassan; Sravanthi Lavu; Jessica L Cvinar; Nasra H Giama; Catherine D Moser; Katsuyuki Miyabe; Loretta K Allotey; Alicia Algeciras-Schimnich; J Paul Theobald; Melissa M Ward; Mindie H Nguyen; Alex S Befeler; K Rajender Reddy; Myron Schwartz; Denise M Harnois; Hiroyuki Yamada; Sudhir Srivastava; Jo Ann Rinaudo; Gregory J Gores; Ziding Feng; Jorge A Marrero; Lewis R Roberts Journal: Cancer Epidemiol Biomarkers Prev Date: 2018-11-21 Impact factor: 4.090
Authors: Boris J B Beudeker; Zwier M A Groothuismink; Annemiek A van der Eijk; Jose D Debes; Andre Boonstra Journal: Cancers (Basel) Date: 2022-10-07 Impact factor: 6.575