| Literature DB >> 34841291 |
Tuyana Boldanova1,2, Geoffrey Fucile3, Jan Vosshenrich4, Aleksei Suslov2, Caner Ercan5, Mairene Coto-Llerena5, Luigi M Terracciano5,6,7, Christoph J Zech4, Daniel T Boll4, Stefan Wieland2, Markus H Heim1,2.
Abstract
Although transarterial chemoembolization (TACE) is the most widely used treatment for intermediate-stage, unresectable hepatocellular carcinoma (HCC), it is only effective in a subset of patients. In this study, we combine clinical, radiological, and genomics data in supervised machine-learning models toward the development of a clinically applicable predictive classifier of response to TACE in HCC patients. Our study consists of a discovery cohort of 33 tumors through which we identify predictive biomarkers, which are confirmed in a validation cohort. We find that radiological assessment of tumor area and several transcriptomic signatures, primarily the expression of FAM111B and HPRT1, are most predictive of response to TACE. Logistic regression decision support models consisting of tumor area and RNA-seq gene expression estimates for FAM111B and HPRT1 yield a predictive accuracy of ∼90%. Reverse transcription droplet digital PCR (RT-ddPCR) confirms these genes in combination with tumor area as a predictive classifier for response to TACE.Entities:
Keywords: biomarker; hepatocellular carcinoma; liver cancer; locoregional treatment; transarterial chemoembolisation
Mesh:
Year: 2021 PMID: 34841291 PMCID: PMC8606904 DOI: 10.1016/j.xcrm.2021.100444
Source DB: PubMed Journal: Cell Rep Med ISSN: 2666-3791
Figure 1Feature importance in predicting response to TACE
(A) Boxplots showing distribution of tumor area for responder and non-responder tumors (∗∗∗p ≤ 0.001 for Student’s t test).
(B) Feature importance scores from a random forest model involving clinical and radiological measurements in prediction of complete response to TACE at 3 months. The boxplots indicate the distribution of scores for 500 bootstraps of training and testing sets using the discovery cohort. Area_log10, log10 of tumor area; Mpp, mean of positive pixels; AFP_pre, pre-treatment α-fetoprotein (AFP) measurement at the time of biopsy; AFP_post, post-treatment AFP measurement 1–3 months after TACE; num_nodules, number of TACEed nodules.
Figure 2Transcriptomic features are predictive of response to TACE
(A) Scatterplot of the first 2 principal components of the RNA-seq gene expression matrix for significantly differentially expressed genes between responder and non-responder tumors. Expression values were normalized using the variance stabilizing transformation implemented in DESeq2.
(B) Boxplot showing the distribution of projections along the first principal component for responder and non-responder tumors (∗∗∗p ≤ 0.001 for Student’s t test).
(C) Heatmap showing hierarchically clustered RNA-seq gene expression vectors for differentially expressed genes between responder and non-responder tumors. The responder tumor labels are prefixed with “R_.” Expression values were normalized using the variance stabilizing transformation implemented in DESeq2 and scaled before hierarchical clustering using Pearson distance and Ward D2 agglomeration implemented in R version 4.0.3 using the heatmap.2 function from the gplots package.
Figure 3Dotplot showing a subset of the most significantly enriched Reactome pathways among genes differentially expressed between responder and non-responder tumors
p values were adjusted using false discovery rate control. The count indicates the number of genes within the pathway that are differentially expressed.
Figure 4Transcriptomic and radiological features accurately predict response to TACE
(A) Violin plots showing the distribution of accuracy, sensitivity, and specificity scores for logistic regression (LR) models using tumor area at baseline, and RNA-seq gene expression measures of FAM111B and HPRT1. The gene expression measures were normalized using the variance stabilizing transformation implemented in DESeq2, followed by scaling for use in the LR models. The violin plots indicate the distribution of scores for 500 bootstraps of training and testing sets of the discovery cohort.
(B) Violin plots showing the distribution of accuracy, sensitivity, and specificity scores for LR models using tumor area at baseline and RT-ddPCR gene expression measures of FAM111B and HPRT1. The gene expression measures were calculated as the average of 2 technical replicates followed by scaling for use in the LR models. The violin plots indicate the distribution of scores for 500 bootstraps of training and testing sets of the discovery cohort.
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| Polyclonal rabbit anti-human FAM111B Antibody | Invitrogen | Cat#PA5-58474; RRID: |
| Monoclonal rabbit anti-human HPRT antibody | abcam | Cat#ab109021; RRID: |
| Tumor and Non-Tumor liver biopsy tissue | University Hospital Basel | This paper |
| ZR-Duet DNA and RNA MiniPrep Plus kit | Zymo Research, Irvine, CA | Cat#D7003 |
| SureSelectXT Clinical Research Exome | Agilent Technologies | Cat#5190-7339 |
| SureSelect Human All Exon V6+COSMIC | Agilent Technologies | Cat#5190-9308 |
| TruSeq Stranded Total RNA Library Prep Kit with Ribo-Zero Gold | Illumina | Cat#20020599 |
| QX200 ddPCR EvaGreen Supermix | Bio-Rad | Cat#1864034 |
| QX200 Droplet Generation Oil for EvaGreen | Bio-Rad | Cat#1864005 |
| DG8 Cartridges for QX200/QX100 Droplet Generator | Bio-Rad | Cat#1864008 |
| DG8 Gaskets for QX200/QX100 Droplet Generator | Bio-Rad | Cat#1863009 |
| PCR Plate Heat Seal, foil, pierceable | Bio-Rad | Cat#1814040 |
| ddPCR 96-Well Plates | Bio-Rad | Cat#12001925 |
| MultiScribe Reverse Transcriptase | Invitrogen | Cat#4311235 |
| Random hexamers | Roche | Cat#11034731001 |
| RNase Inhibitor | Applied Biosystems | Cat# N8080119 |
| 10 mM dNTSs | Promega | Cat# U1515 |
| RNA-sequencing data (41 tumors and 37 adjacent non-tumor tissue and 15 liver biopsies with normal histology) | This paper | European Genome-phenome Archive: EGAS00001005558 |
| Whole-exome sequencing data (122 tumors and 115 adjacent non-tumor liver tissues) | Ng et al., unpublished data | European Genome-phenome Archive: EGAS00001005073 |
| FAM111B – forward, 5′-CTGGCATAAGA | This paper | N/A |
| FAM111B – reverse, 5′-GGGCTGAGTAG | This paper | N/A |
| HPRT1 – forward, 5′-ACATTGTAGCCCT | This paper | N/A |
| HPRT1 – reverse, 5′-AATCCAGCAGGT | This paper | N/A |
| mint Lesion™ 3.0 software | Mint Medical GmbH, Heidelberg, Germany | |
| GraphPad Prism 9.2.0 | GraphPad Software, San Diego | |
| STAR 2.7.1a | Dobin et al., 2013 | N/A |
| R 4.1.0 | N/A | |
| DESeq2 1.32.0 | N/A | |
| scikit-learn 0.24.1 | N/A | |
| PRETACE Python code | This paper | |