| Literature DB >> 34725972 |
Silvia Liu1, Michael A Nalesnik1, Aatur Singhi1, Michelle A Wood-Trageser1, Parmjeet Randhawa1, Bao-Guo Ren1, Abhinav Humar2, Peng Liu3, Yan-Ping Yu1, George C Tseng3, George Michalopoulos1, Jian-Hua Luo1.
Abstract
Hepatocellular carcinoma (HCC) is one of the most lethal human cancers. Liver transplantation has been an effective approach to treat liver cancer. However, significant numbers of patients with HCC experience cancer recurrence, and the selection of suitable candidates for liver transplant remains a challenge. We developed a model to predict the likelihood of HCC recurrence after liver transplantation based on transcriptome and whole-exome sequencing analyses. We used a training cohort and a subsequent testing cohort based on liver transplantation performed before or after the first half of 2012. We found that the combination of transcriptome and mutation pathway analyses using a random forest machine learning correctly predicted HCC recurrence in 86.8% of the training set. The same algorithm yielded a correct prediction of HCC recurrence of 76.9% in the testing set. When the cohorts were combined, the prediction rate reached 84.4% in the leave-one-out cross-validation analysis. When the transcriptome analysis was combined with Milan criteria using the k-top scoring pairs (k-TSP) method, the testing cohort prediction rate improved to 80.8%, whereas the training cohort and the combined cohort prediction rates were 79% and 84.4%, respectively. Application of the transcriptome/mutation pathways RF model on eight tumor nodules from 3 patients with HCC yielded 8/8 consistency, suggesting a robust prediction despite the heterogeneity of HCC.Entities:
Mesh:
Year: 2021 PMID: 34725972 PMCID: PMC8948579 DOI: 10.1002/hep4.1846
Source DB: PubMed Journal: Hepatol Commun ISSN: 2471-254X
Clinical Features of the HCC Cohort
| Clinical Features | Category/Statistical Measurements | Training | Validation |
|---|---|---|---|
| Number of samples | Count | 38 | 26 |
| Latest recurrence status | Nonrecurrent | 23 | 18 |
| Recurrent | 15 | 8 | |
| Milan score | In | 16 (42.1%) | 20 (76.9%) |
| Out | 22 (57.9%) | 6 (23.1%) | |
| Tx age | Mean ± SD | 56.2 ± 8.9 | 61.9 ± 5.7 |
| Tx type | Orthotopic | 37 | 16 |
| Living related | 1 | 3 | |
| Living nonrelated | 0 | 7 | |
| Underlying disease | HBV | 5 (13.2%) | 0 (0.0%) |
| HCV | 17 (44.7%) | 9 (34.6%) | |
| Nodular regenerative hyperplasia | 1 (2.6%) | 0 (0.0%) | |
| Hemochromatosis | 2 (5.3%) | 0 (0.0%) | |
| NASH | 5 (13.2%) | 8 (30.8%) | |
| EtOH | 8 (21.1%) | 10 (38.5%) | |
| A1AT | 1 (2.6%) | 0 (0.0%) | |
| PBC | 1 (2.6%) | 1 (3.8%) | |
| NRH | 0 (0.0%) | 4 (15.4%) | |
| Orig. number of tumors | 1 | 10 | 9 |
| 2 | 8 | 6 | |
| 3 | 4 | 3 | |
| 4+ | 16 | 8 | |
| Orig. tumor sizes (cm) | [min, max] | [0.2, 21.0] | [0.3, 6.0] |
| Alive status at last follow‐up | Alive | 13 | 17 |
| Dead | 24 | 9 | |
| Unknown | 1 | 0 | |
| Pretransplant Rx | Y | 18 | 17 |
| N | 10 | 6 | |
| Unknown | 10 | 3 | |
| PreTx Rx type | RFA | 6 (15.8%) | 9 (34.6%) |
| Resection | 2 (5.3%) | 4 (15.4%) | |
| TACE | 13 (34.2%) | 14 (53.8%) | |
| Sorafenib | 0 (0.0%) | 5 (19.2%) | |
| None | 10 (26.3%) | 6 (23.1%) | |
| Immunosuppression | Tacrolimus | 18 (47.4%) | 25 (96.2%) |
| Mycophenolate | 11 (28.9%) | 25 (96.2%) | |
| Cyclosporine | 5 (13.2%) | 3 (11.5%) | |
| Everolimus | 3 (7.9%) | 14 (53.8%) | |
| Azathioprine | 0 | 4 (15.4%) | |
| mTOR inhibitor | Y | 3 | 13 |
| N | 28 | 12 | |
| Unknown | 7 | 1 | |
| Highest AFP level | [min, median, max] | [4, 40, 34,818] | [2.1, 34.05, 22,256] |
| HCC differentiation | Poor | 6 | 4 |
| Moderate | 21 | 17 | |
| Well | 11 | 5 | |
| Microvascular invasion | Yes | 23 | 14 |
| No | 13 | 12 | |
| Unknown | 2 | 0 | |
| Macrovascular invasion | Yes | 5 | 2 |
| No | 32 | 24 | |
| Unknown | 1 | 0 |
Abbreviations: A1AT, alpha‐1 antitrypsin deficiency; AFP, alpha‐fetoprotein; EtOH, ethanol; HBV, hepatitis B virus; HCV, hepatitis C virus; mTOR, mammalian target of rapamycin; N, no; NASH, nonalcoholic steatohepatitis; NRH, nodular regenerative hyperplasia; PBC, primary biliary cholangitis; RFA, radiofrequency ablation; Rx, prescription; TACE, transarterial chemoembolization; Tx, transplant; and Y , yes.
Clinical Features of Samples Collected From HCC Transplant Patients
| Cohort | Sample | Surgical Year | Recur Status | Milan | Months to Recur | Follow‐up (Months) |
|---|---|---|---|---|---|---|
| Training | Training 1 | 1988 | Non‐Recur | Out | NA | 72 |
| Training | Training 2 | 2009 | Non‐Recur | In | NA | 134 |
| Training | Training 3 | 2007 | Non‐Recur | Out | NA | 121.5 |
| Training | Training 4 | 2008 | Non‐Recur | In | NA | 136.9 |
| Training | Training 5 | 2008 | Non‐Recur | In | NA | 76.7 |
| Training | Training 6 | 2008 | Non‐Recur | Out | NA | 157.3 |
| Training | Training 7 | 2008 | Non‐Recur | Out | NA | 103 |
| Training | Training 8 | 2008 | Non‐Recur | In | NA | 104.2 |
| Training | Training 9 | 2008 | Non‐Recur | In | NA | 121.1 |
| Training | Training 10 | 2008 | Non‐Recur | In | NA | 113.1 |
| Training | Training 11 | 2009 | Non‐Recur | Out | NA | 110.4 |
| Training | Training 12 | 2009 | Non‐Recur | In | NA | 92.1 |
| Training | Training 13 | 2009 | Non‐Recur | In | NA | 127.8 |
| Training | Training 14 | 2009 | Non‐Recur | Out | NA | 116.2 |
| Training | Training 15 | 2009 | Non‐Recur | In | NA | 134.3 |
| Training | Training 16 | 2009 | Non‐Recur | In | NA | 92.2 |
| Training | Training 17 | 2009 | Non‐Recur | In | NA | 73.5 |
| Training | Training 18 | 2009 | Non‐Recur | In | NA | 137.2 |
| Training | Training 19 | 2009 | Non‐Recur | In | NA | 129.2 |
| Training | Training 20 | 2009 | Non‐Recur | In | NA | 145.8 |
| Training | Training 21 | 2009 | Non‐Recur | Out | NA | 142.3 |
| Training | Training 22 | 2012 | Non‐Recur | In | NA | 101.8 |
| Training | Training 23 | 1991 | Non‐Recur | Out | NA | 298.8 |
| Training | Training 24 | 1988 | Recur | Out | 26.3 | 31.1 |
| Training | Training 25 | 1989 | Recur | Out | 25.2 | 25.2 |
| Training | Training 26 | 1989 | Recur | Out | 5.8 | 5.8 |
| Training | Training 27 | 1990 | Recur | Out | 27.7 | 47.4 |
| Training | Training 28 | 1991 | Recur | Out | 9.1 | 9.1 |
| Training | Training 29 | 1992 | Recur | Out | 19.6 | 21.3 |
| Training | Training 30 | 2004 | Recur | Out | 25.5 | 29.1 |
| Training | Training 31 | 2007 | Recur | Out | 27.1 | 80.5 |
| Training | Training 32 | 2007 | Recur | Out | 10.9 | 13.6 |
| Training | Training 33 | 2007 | Recur | Out | 5.2 | 16.7 |
| Training | Training 34 | 2008 | Recur | Out | 15.2 | 79.1 |
| Training | Training 35 | 2012 | Recur | In | 35.4 | 43.2 |
| Training | Training 36 | 1988 | Recur | Out | 12.5 | 15.2 |
| Training | Training 37 | 1989 | Recur | Out | 6.6 | 6.6 |
| Training | Training 38 | 1990 | Recur | Out | 15.6 | 33.5 |
| Validation | Testing 1 | 2012 | Non‐Recur | In | NA | 55.6 |
| Validation | Testing 2 | 2015 | Non‐Recur | In | NA | 53.3 |
| Validation | Testing 3 | 2015 | Non‐Recur | In | NA | 49.7 |
| Validation | Testing 4 | 2015 | Non‐Recur | In | NA | 61 |
| Validation | Testing 5 | 2015 | Non‐Recur | In | NA | 61 |
| Validation | Testing 6 | 2014 | Non‐Recur | Out | NA | 48.7 |
| Validation | Testing 7 | 2015 | Non‐Recur | In | NA | 61 |
| Validation | Testing 8 | 2016 | Non‐Recur | In | NA | 56 |
| Validation | Testing 9 | 2016 | Non‐Recur | In | NA | 57.8 |
| Validation | Testing 10 | 2016 | Non‐Recur | In | NA | 37.9 |
| Validation | Testing 11 | 2016 | Non‐Recur | Out | NA | 37.3 |
| Validation | Testing 12 | 2016 | Non‐Recur | In | NA | 48.6 |
| Validation | Testing 13 | 2016 | Non‐Recur | In | NA | 57.4 |
| Validation | Testing 14 | 2016 | Non‐Recur | In | NA | 49.3 |
| Validation | Testing 15 | 2015 | Non‐Recur | In | NA | 65.8 |
| Validation | Testing 16 | 2016 | Non‐Recur | In | NA | 36.2 |
| Validation | Testing 17 | 2016 | Non‐Recur | In | NA | 51.5 |
| Validation | Testing 18 | 2016 | Non‐Recur | In | NA | 50.4 |
| Validation | Testing 19 | 2013 | Recur | In | 35.7 | 62.5 |
| Validation | Testing 20 | 2016 | Recur | Out | 7.5 | 38.6 |
| Validation | Testing 21 | 2016 | Recur | Out | 7.5 | 38.6 |
| Validation | Testing 22 | 2016 | Recur | Out | 7.5 | 38.6 |
| Validation | Testing 23 | 2016 | Recur | Out | 7.5 | 38.6 |
| Validation | Testing 24 | 2016 | Recur | In | 6.7 | 18.6 |
| Validation | Testing 25 | 2016 | Recur | In | 6.7 | 18.6 |
| Validation | Testing 26 | 2019 | Recur | In | 7.7 | 20.9 |
Abbreviations: NA, not available; and Recur, recurrence.
FIG. 1Flow chart of procedures for training and validation of genome prediction model. The procedure starts with the identification of cancer samples by the year of liver transplant surgery using 2012 as the demarcation. All samples before the first half of 2012 were used in the training set, whereas samples after the second half of 2012 were used as the testing set. The cancer areas and benign tissues from the non‐liver organ of the paraffin block were needle‐cored and used as “cancer” and “normal” tissues, respectively. All clinical information was blind to the researchers before the prediction.
FIG. 2ROC analysis of genome prediction model. (A) Training set ROC based on top 500 differentially expressed genes between recurrence and nonrecurrence samples from the transcriptome sequencing using LOOCV strategy with RF method. (B) Testing set ROC based on the algorithm determined in the training set of (A). (C) Training set ROC based on transcriptome and exome sequencing results using RF method. (D) Testing set ROC based on the algorithm determined in the training set of (C). (E) ROC of pooled training and testing cohorts based on transcriptome sequencing using LOOCV strategy with RF method. (F) ROC of pooled training and testing cohorts based on transcriptome and exome sequencings using LOOCV strategy with RF method. Abbreviation: CV, cross validation.
FIG. 3Kaplan‐Meier analysis of genome prediction model. (A) Training set Kaplan‐Meier analysis based on 500 differentially expressed genes from the transcriptome sequencing using LOOCV strategy with RF method. (B) Testing set Kaplan‐Meier analysis based on the algorithm determined in the training set of (A). (C) Training set Kaplan‐Meier analysis based on transcriptome and exome‐sequencing results using RF method. (D) Testing set Kaplan‐Meier analysis based on the algorithm determined in the training set of (C). (E) Kaplan‐Meier analysis of pooled training and testing cohorts based on transcriptome sequencing using LOOCV strategy with RF method. (F) Kaplan‐Meier analysis of pooled training and testing cohorts based on transcriptome and exome sequencings using LOOCV strategy with RF method.
FIG. 4ROC analysis of Milan criteria with the genome prediction model. (A) ROC analysis based on Milan criteria in the training set. (B) ROC analysis based on Milan criteria in the testing set. (C) ROC analysis based on Milan criteria in the combined training and testing sets. (D) ROC analysis of the training set based on Milan/transcriptome k‐TSP prediction model using LOOCV. (E) ROC analysis of the testing set based on Milan/transcriptome k‐TSP prediction algorithm determined in (D). (F) ROC analysis of the combined training and testing sets based on Milan/transcriptome k‐TSP prediction model using LOOCV.
FIG. 5Kaplan‐Meier analysis of Milan criteria with the genome prediction model. (A) Kaplan‐Meier analysis based on Milan criteria in the training set (A), the testing set (B), and the combined training and testing sets (C). (D) Kaplan‐Meier analysis of the training set based on Milan/transcriptome k‐TSP prediction model using LOOCV. (E) Kaplan‐Meier analysis of the testing set based on Milan/transcriptome k‐TSP prediction algorithm determined in (D). (F) Kaplan‐Meier analysis of the combined training and testing sets based on Milan/transcriptome k‐TSP prediction model using LOOCV.
Multiple Cancer Nodule Predictions From Patients With HCC
| Patient | Recur Status | Milan | RF Probability Score* | Prediction Status | Months to Recur |
|---|---|---|---|---|---|
| #V19A | Recur | Out | 0.6275 | Recur | 7.5 |
| #V19B | Recur | Out | 0.7909 | Recur | 7.5 |
| #V19C | Recur | Out | 0.8400 | Recur | 7.5 |
| #V19D | Recur | Out | 0.7534 | Recur | 7.5 |
| #V7A | Non‐Recur | In | 0.1235 | Non‐Recur | NA |
| #V7B | Non‐Recur | In | 0.0120 | Non‐Recur | NA |
| #V21A | Recur | In | 0.6690 | Recur | 6.7 |
| #V21B | Recur | In | 0.9183 | Recur | 6.7 |
Score > 0.5 = likely recurrence, and score <0.5 = likely nonrecurrence.
FIG. 6Transcriptomic alteration related to recurrence and mutation pathways of HCC samples. (A) Hierarchical clustering of HCC samples based on top 500 differential expression genes between nonrecurrence and recurrence HCC samples. (B) Heat map of five signaling pathways based on the differential mutation numbers in the pathways between nonrecurrence and recurrence samples. (C) Gene‐expression alterations and connections based on GO analysis.