| Literature DB >> 36033263 |
Jincheng Cao1,2, Yanni Xu1, Xiaodi Liu1,2, Yan Cai3, Baoming Luo1.
Abstract
Aims: Hepatocellular carcinoma (HCC) remains a major tumoral burden globally, and its heterogeneity encumbers prognostic prediction. The lymphangiogenesis-related long non-coding RNAs (lrlncRNAs) reported to be implicated in immune response regulation show potential importance in predicting the prognostic and therapeutic outcome. Hence, this study aims to establish a lrlncRNA pairs-based signature not requiring specific expression levels of transcripts, which displays promising clinical practicality and satisfactory predictive capability. Main methods: Transcriptomic and clinical information of the Liver Hepatocellular Carcinoma (LIHC) project retrieved from the TCGA portal were used to find differently expressed lrlncRNA (DElrlncRNA) via analysis performed between lymphangiogenesis-related genes (lr-genes) and lncRNAs(lrlncRNA), and to ultimately construct the signature based on lrlncRNA pairs screened out via Lasso and Cox regression analyses. Akaike information criterion (AIC) values were computed to find the cut-off point optimum for high-risk and low-risk group allocation. The signature then underwent trials in terms of its predictive value for survival, clinicopathological features, immune cells infiltration in tumoral microenvironment, selected checkpoint biomarkers and chemosensitivity. Key findings: A novel lymphangiogenesis-related lncRNA pair signature was established using nine lrlncRNA pairs identified and significantly related to overall survival, clinicopathological features, immune cells infiltration and susceptibility to chemotherapy. Moreover, the signature efficacy was verified in acknowledged clinicopathological subgroups and partially validated by qRT-PCR assay in various human HCC cell lines. Significance: The novel lrlncRNA-pairs based signature was shown to effectively and independently estimate HCC prognosis and help screen patients suitable for anti-tumor immunotherapy and chemotherapy.Entities:
Keywords: Chemotherapy; Hepatocellular carcinoma; Immunocheckpoint; Lymphangiogenesis-related long noncoding RNAs; Prognostic signature; Tumoral infiltration of immune cells
Year: 2022 PMID: 36033263 PMCID: PMC9403397 DOI: 10.1016/j.heliyon.2022.e10215
Source DB: PubMed Journal: Heliyon ISSN: 2405-8440
Lymphangiogenesis-related genes list.
| Gene Symbol | Description | Category | Gifts | GC Id | Relevance score |
|---|---|---|---|---|---|
| VEGFC | Vascular Endothelial Growth Factor C | Protein Coding | 46 | GC04M176683 | 16.69894028 |
| FLT4 | Fms Related Receptor Tyrosine Kinase 4 | Protein Coding | 50 | GC05M180607 | 16.69141579 |
| VEGFD | Vascular Endothelial Growth Factor D | Protein Coding | 33 | GC0XM015345 | 13.53529739 |
| CALCRL | Calcitonin Receptor Like Receptor | Protein Coding | 44 | GC02M187341 | 8.252692223 |
| PDPN | Podoplanin | Protein Coding | 39 | GC01P013583 | 8.003945351 |
| VEGFA | Vascular Endothelial Growth Factor A | Protein Coding | 47 | GC06P043770 | 7.842282772 |
| LYVE1 | Lymphatic Vessel Endothelial Hyaluronan Receptor 1 | Protein Coding | 41 | GC11M010753 | 7.152224541 |
| KDR | Kinase Insert Domain Receptor | Protein Coding | 52 | GC04M055078 | 7.094721794 |
| PROX1 | Prospero Homeobox 1 | Protein Coding | 42 | GC01P213983 | 6.581964493 |
| PTPN14 | Protein Tyrosine Phosphatase Non-Receptor Type 14 | Protein Coding | 43 | GC01M214348 | 6.412934303 |
| SOX18 | SRY-Box Transcription Factor 18 | Protein Coding | 39 | GC20M064047 | 4.160199642 |
| PTGS2 | Prostaglandin-Endoperoxide Synthase 2 | Protein Coding | 48 | GC01M186640 | 4.110999107 |
| FLT1 | Fms Related Receptor Tyrosine Kinase 1 | Protein Coding | 49 | GC13M028300 | 3.471794605 |
| CCBE1 | Collagen And Calcium Binding EGF Domains 1 | Protein Coding | 40 | GC18M059430 | 3.466086864 |
| ANGPT2 | Angiopoietin 2 | Protein Coding | 44 | GC08M006499 | 3.383462429 |
| FOXC2 | Forkhead Box C2 | Protein Coding | 44 | GC16P086574 | 3.310676575 |
| NRP2 | Neuropilin 2 | Protein Coding | 43 | GC02P205681 | 3.150856733 |
| FGF2 | Fibroblast Growth Factor 2 | Protein Coding | 46 | GC04P122826 | 2.654223204 |
| PGF | Placental Growth Factor | Protein Coding | 42 | GC14M074941 | 2.635710478 |
| TGFB1 | Transforming Growth Factor Beta 1 | Protein Coding | 50 | GC19M041301 | 2.578747272 |
| CCR7 | C–C Motif Chemokine Receptor 7 | Protein Coding | 44 | GC17M040556 | 2.554449558 |
| HIF1A | Hypoxia Inducible Factor 1 Subunit Alpha | Protein Coding | 46 | GC14P061695 | 2.478032827 |
| HMGB1 | High Mobility Group Box 1 | Protein Coding | 44 | GC13M030456 | 2.41782546 |
| CXCR4 | C-X-C Motif Chemokine Receptor 4 | Protein Coding | 51 | GC02M136114 | 2.304786205 |
| POSTN | Periostin | Protein Coding | 42 | GC13M037562 | 2.292275667 |
| CCL21 | C–C Motif Chemokine Ligand 21 | Protein Coding | 41 | GC09M034709 | 2.277864933 |
| VASH1 | Vasohibin 1 | Protein Coding | 36 | GC14P076761 | 2.217816591 |
| SHH | Sonic Hedgehog Signaling Molecule | Protein Coding | 48 | GC07M155799 | 2.203608274 |
| STAT3 | Signal Transducer And Activator Of Transcription 3 | Protein Coding | 51 | GC17M042313 | 2.19051075 |
| ANGPT1 | Angiopoietin 1 | Protein Coding | 45 | GC08M107246 | 2.184707403 |
| NRP1 | Neuropilin 1 | Protein Coding | 45 | GC10M033177 | 2.166389465 |
| ERBB2 | Erb-B2 Receptor Tyrosine Kinase 2 | Protein Coding | 52 | GC17P039687 | 2.150751829 |
| NR2F2 | Nuclear Receptor Subfamily 2 Group F Member 2 | Protein Coding | 48 | GC15P096325 | 2.107103348 |
| CXCL12 | C-X-C Motif Chemokine Ligand 12 | Protein Coding | 44 | GC10M044294 | 2.088619471 |
| VEGFB | Vascular Endothelial Growth Factor B | Protein Coding | 43 | GC11P064234 | 2.052349567 |
| HPSE | Heparanase | Protein Coding | 44 | GC04M083292 | 2.029333591 |
| TEK | TEK Receptor Tyrosine Kinase | Protein Coding | 49 | GC09P027109 | 2.020673752 |
| HGF | Hepatocyte Growth Factor | Protein Coding | 50 | GC07M081699 | 2.000943422 |
| MET | MET Proto-Oncogene, Receptor Tyrosine Kinase | Protein Coding | 52 | GC07P116672 | 1.986150503 |
| CEACAM1 | CEA Cell Adhesion Molecule 1 | Protein Coding | 41 | GC19M042507 | 1.980372667 |
| NFKB1 | Nuclear Factor Kappa B Subunit 1 | Protein Coding | 51 | GC04P102501 | 1.93708396 |
| NOS2 | Nitric Oxide Synthase 2 | Protein Coding | 49 | GC17M027756 | 1.935756326 |
| S1PR1 | Sphingosine-1-Phosphate Receptor 1 | Protein Coding | 44 | GC01P101236 | 1.911855936 |
| FOXC1 | Forkhead Box C1 | Protein Coding | 42 | GC06P001610 | 1.900332332 |
| PDGFB | Platelet Derived Growth Factor Subunit B | Protein Coding | 48 | GC22M051474 | 1.89337194 |
| CLEC14A | C-Type Lectin Domain Containing 14A | Protein Coding | 33 | GC14M038254 | 1.842555404 |
| SMAD4 | SMAD Family Member 4 | Protein Coding | 49 | GC18P051028 | 1.834085941 |
| IL17A | Interleukin 17A | Protein Coding | 41 | GC06P052186 | 1.829650402 |
| ITGA4 | Integrin Subunit Alpha 4 | Protein Coding | 48 | GC02P181456 | 1.812556744 |
| IL7R | Interleukin 7 Receptor | Protein Coding | 45 | GC05P035852 | 1.789921403 |
| TNF | Tumor Necrosis Factor | Protein Coding | 49 | GC06P061170 | 1.782137632 |
| SIX1 | SIX Homeobox 1 | Protein Coding | 43 | GC14M060643 | 1.782137632 |
| MMP9 | Matrix Metallopeptidase 9 | Protein Coding | 52 | GC20P046008 | 1.76113379 |
| SMARCA4 | SWI/SNF Related, Matrix Associated, Actin Dependent Regulator Of Chromatin, Subfamily A, Member 4 | Protein Coding | 48 | GC19P010932 | 1.741836548 |
| IL6 | Interleukin 6 | Protein Coding | 48 | GC07P022725 | 1.740508795 |
| MIR27B | MicroRNA 27b | RNA Gene | 21 | GC09P095097 | 1.720307231 |
| MIR9-1 | MicroRNA 9-1 | RNA Gene | 21 | GC01M156420 | 1.703069806 |
| PECAM1 | Platelet And Endothelial Cell Adhesion Molecule 1 | Protein Coding | 40 | GC17M064319 | 1.695909262 |
| MAPK14 | Mitogen-Activated Protein Kinase 14 | Protein Coding | 50 | GC06P061307 | 1.67373991 |
| MCAM | Melanoma Cell Adhesion Molecule | Protein Coding | 39 | GC11M119308 | 1.67373991 |
| TYMP | Thymidine Phosphorylase | Protein Coding | 45 | GC22M050525 | 1.671724916 |
| EDN1 | Endothelin 1 | Protein Coding | 46 | GC06P012256 | 1.657824516 |
| ITGB1 | Integrin Subunit Beta 1 | Protein Coding | 49 | GC10M032899 | 1.657404423 |
| PDGFA | Platelet Derived Growth Factor Subunit A | Protein Coding | 42 | GC07M000497 | 1.648259282 |
| IL24 | Interleukin 24 | Protein Coding | 41 | GC01P206897 | 1.591114283 |
| TIAM1 | TIAM Rac1 Associated GEF 1 | Protein Coding | 44 | GC21M031118 | 1.558186173 |
| ECM1 | Extracellular Matrix Protein 1 | Protein Coding | 43 | GC01P150508 | 1.558186173 |
| LIMS1 | LIM Zinc Finger Domain Containing 1 | Protein Coding | 40 | GC02P108534 | 1.558186173 |
| NES | Nestin | Protein Coding | 39 | GC01M156668 | 1.558186173 |
| CDKN2B-AS1 | CDKN2B Antisense RNA 1 | RNA Gene | 21 | GC09P021994 | 1.558186173 |
| ADM | Adrenomedullin | Protein Coding | 44 | GC11P010304 | 1.55810535 |
| ITGA9 | Integrin Subunit Alpha 9 | Protein Coding | 42 | GC03P037468 | 1.546670914 |
| MMP2 | Matrix Metallopeptidase 2 | Protein Coding | 52 | GC16P055390 | 1.536451578 |
| NFATC1 | Nuclear Factor Of Activated T Cells 1 | Protein Coding | 47 | GC18P079395 | 1.511538029 |
| TIE1 | Tyrosine Kinase With Immunoglobulin Like And EGF Like Domains 1 | Protein Coding | 42 | GC01P043300 | 1.511538029 |
QRT-PCR primer sequences.
| Species | Gene name | Primer sequence (5→3′) | |
|---|---|---|---|
| Homo sapiens | AC068506.1 | Forward | TCCCATCTCCCACTATTC |
| Reverse | AAGGCACATACAAGAAAGC | ||
| Homo sapiens | LENG8−AS1 | Forward | AGCACGGACTCTGATACAA |
| Reverse | TCAGCCAGTTCTCCCTAAT | ||
| Homo sapiens | AC006042.1 | Forward | TACTTTTACCCTTGAGCA |
| Reverse | GAACATCTACAATGAGCC | ||
| Homo sapiens | AL355488.1 | Forward | AGCACCTTGGTTCTGATGT |
| Reverse | CCTGGCTATGGCACTTACT | ||
| Homo sapiens | ACTB | Forward | CATGTACGTTGCTATCCAGGC |
| Reverse | CTCCTTAATGTCACGCACGAT | ||
Figure 1Flow chart of the study.
Figure 2DElrlncRNAs identification; Identifying differently expressed lymphangiogenesis-related lncRNAs (DElrlncRNAs) using the LIHC dataset from TCGA portal, as shown in the heatmap (A) and the volcano plot (B) (C) GO terms indicated the selected genes were lymphangiogenesis-related.
Figure 3Establishment of prognosis signature (A) 30 lncRNA pairs were analyzed by LASSO regression, in which lncRNA pairs were eventually removed from the model as the penalty (lambda) increased (B) The tuning parameter selection of the LASSO analysis, in which 16 lncRNA pairs were left (−4 < lambda.min<−3.7) (C) The univariate Cox regression analysis of the 9 significant DElrlncRNA pairs used to construct the signature (D) The forest map of the 9 DElrlncRNA pairs screened out by the Cox proportional hazard model, which was also used to select pairs for qRT-PCR validation.
Figure 4Biofunctions, pathways and predictivity of the signature (A)LncRNA and mRNA coexpression network of the signature (B) Biological Functions identified using GO analysis in the signature (C) Pathways associated with the signature as found out by KEGG analysis (D) The optimal cut-off point was calculated to allocate patients into two different risk groups using the AIC value (E) The AUCs for 1-, 2- and 3-year ROC curves were 0.825, 0.764, and 0.741, respectively (F) The risk score predicted with best efficiency comparing with other clinical features for 1-year survival.
Figure 5Signature survival prediction Risk scores (A) and survival status (B) of every case were shown and compared (C) The Kaplan-Meier plot showing the significantly slimmer chance of survival for the high-risk group. ∗p < 0.05, ∗∗p < 0.01, ∗∗∗p < 0.001, ∗∗∗∗p < 0.0001, also applicable for the following figures.
Figure 6Survival analysis for subgroups of gender (A, B), age (C, D), grade (E, F), stage (G, H), N stage (I, J), M stage (K, L) and T stage (M–O). P-values in all subgroups indicated statistical significance.
Figure 7Association of the signature with clinicopathological features (A)The heatmap showed that grade, clinical stage and T stage were significantly related to the risk score (B–H) Boxplots using Wilcoxon signed-rank tests agreed that T stage (B), stage (C) and grade (D) significantly correlated with the risk score, while M stage (E), age (F) and gender (G)did not, with the exception of N stage (H). Univariate (I) and multivariate (J) Cox regression analyses discerning independently prognostic predictors.
Figure 8Using nomogram to predict patients' survival (A)The nomogram model using the risk score and clinical stage to predict 1-/2-/3-year survival rates of LIHC patients (B–D) Calibration graphs indicating the predicted survival rates of 1- (B), 2- (C) and 3-year (D) nomogram model were comparable to the actual ones.
Figure 9Association of the signature with tumoral infiltration of immune cells (A) The correlation of the risk score with many types of tumor-infiltrating cells (B) High risk correlated with higher tumoral infiltration of macrophages (B), Th2 cells (C), myeloid dendritic cells (D), Treg cells (E) and neutrophils (F) and lower infiltration of CD8+ naïve cell (G), hematopoietic stem cells (H), endothelial cells (I) and central memory T cells (J).
Figure 10Association of the signature with immunocheckpoint genes; Expression of HAVCR2 (A), CD47(B) and CD276(C) was significantly higher in the high-risk group, while difference in LAG3 (D), PDCD1 (E) and CTLA4 (F) expression displayed no statistical significance between the groups.
Figure 11Association of the signature with chemosensitivity; IC50 of gemcitabine (A), doxorubicin (B), mitomycin C (C) and sorafenib (D) were significantly lower in the high-risk group.
Figure 12Verification of expression ratios in lymphangiogenesis-related lncRNA pairs; qRT-PCR results for the expression ratios of AC006042.1| AL355488.1 (A) and AC068506.1| LENG8-AS1 (B).