| Literature DB >> 35372518 |
Lin-Kun Zhong1, Xing-Yan Deng2, Fei Shen3, Wen-Song Cai3, Jian-Hua Feng3, Xiao-Xiong Gan3, Shan Jiang4, Chi-Zhuai Liu1, Ming-Guang Zhang1, Jiang-Wei Deng1, Bing-Xing Zheng1, Xiao-Zhang Xie1, Li-Qing Ning1, Hui Huang1, Shan-Shan Chen5, Jian-Hang Miao1, Bo Xu2.
Abstract
The accurate determination of the risk of cancer recurrence is a critical unmet need in managing thyroid cancer (TC). Although numerous studies have successfully demonstrated the use of high throughput molecular diagnostics in TC prediction, it has not been successfully applied in routine clinical use, particularly in Chinese patients. In our study, we objective to screen for characteristic genes specific to PTC and establish an accurate model for diagnosis and prognostic evaluation of PTC. We screen the differentially expressed genes by Python 3.6 in The Cancer Genome Atlas (TCGA) database. We discovered a three-gene signature Gap junction protein beta 4 (GJB4), Ripply transcriptional repressor 3 (RIPPLY3), and Adrenoceptor alpha 1B (ADRA1B) that had a statistically significant difference. Then we used Gene Expression Omnibus (GEO) database to establish a diagnostic and prognostic model to verify the three-gene signature. For experimental validation, immunohistochemistry in tissue microarrays showed that thyroid samples' proteins expressed by this three-gene are differentially expressed. Our protocol discovered a robust three-gene signature that can distinguish prognosis, which will have daily clinical application.Entities:
Keywords: COX analysis; PTC; SVM diagnostic model; accurate diagnosis; prognostic evaluation
Year: 2022 PMID: 35372518 PMCID: PMC8966665 DOI: 10.3389/fmolb.2022.807931
Source DB: PubMed Journal: Front Mol Biosci ISSN: 2296-889X
FIGURE 1Flowchart of PTC prognostic signatures generation and validation procedures.
Univariate Cox regression analysis results.
| Gene symbol | Beta | HR (95% CI) |
|
|---|---|---|---|
| GJB4 | −0.057 | 0.94 (0.91–0.98) | 0.0066 |
| ADRA1B | −0.021 | 0.98 (0.96–0.99) | 0.0067 |
| RIPPLY3 | −0.11 | 0.9 (0.81–0.99) | 0.0360 |
FIGURE 2Heatmap of significantly differentially expressed genes. Each row represents a separate gene, each column represents a separate sample, a gradient from green to red indicates a low to high level of expression, and the samples are clustered from two types of tissue: normal tissue (green) and cancer tissue (red).
The top 10 upregulated and downregulated DE mRNA genes.
| Type | Genes | LogFC | p value |
|---|---|---|---|
| Up-regulated | ARHGAP36 | 8.894666584 | <0.001 |
| DMBX1 | 8.212911341 | <0.001 | |
| SLC18A3 | 8.071324334 | <0.001 | |
| TRY6 | 7.77283886 | <0.001 | |
| TMPRSS6 | 7.625107474 | <0.001 | |
| PRSS1 | 7.59111566 | <0.001 | |
| MMP13 | 7.567785968 | <0.001 | |
| KLK6 | 7.468232832 | <0.001 | |
| LOC400794 | 7.390954657 | <0.001 | |
| GABRB2 | 7.299817152 | <0.001 | |
| Down-regulated | KCNA1 | −4.139432844 | <0.001 |
| TFF3 | −3.811991634 | <0.001 | |
| LRP1B | −3.692660909 | <0.001 | |
| RELN | −3.629457676 | <0.001 | |
| IPCEF1 | −3.521246594 | <0.001 | |
| ZNF804B | −3.519727733 | <0.001 | |
| CNTN5 | −3.507769597 | <0.001 | |
| AGR3 | −3.492012695 | <0.001 | |
| VIT | −3.43067668 | <0.001 | |
| FAM180B | −3.414101394 | <0.001 |
DE,diferentially expressed;FC,fold change.
Survival analysis sample.
| Data set | High risk | Low risk | High risk for OS | Low risk for OS |
|
|---|---|---|---|---|---|
| Training set | 156 | 156 | 156 | 124 | 0.038 |
| Test set | 72 | 63 | 52 | 63 | 0.017 |
| All data | 228 | 219 | 176 | 219 | 0.002 |
Differential expression information of characteristic genes.
| Gene symbol | mRNA description | logFC |
| UP DOWN |
|---|---|---|---|---|
| GJB4 | gap junction protein beta 4 | 4.0572 | <0.001 | UP |
| ADRA1B | adrenoceptor alpha 1B | 2.4496 | <0.001 | UP |
| RIPPLY3 | ripply transcriptional repressor 3 | 2.1379 | <0.001 | UP |
FC,fold change.
FIGURE 3Kaplan-Meier curves for the low- and high-risk groups separated by the Risk-Score of the 3-gene signature in the TCGA PTC data. The blue line represents the patients with low risk and the others represent patients with high risk. Significant differences in overall survival between the two groups were analyzed by log-rank test. (A) Kaplan-Meier curves for training data survival; (B)Kaplan-Meier curves for test data; (C) Kaplan-Meier curves for all data.
FIGURE 4Receiver operating characteristic curves (ROC) for the prognosis models. (A) ROC fited based on training data; (B) ROC fited based on test data; (C) ROC fited based on all TCGA PTC data.
FIGURE 5The immunohistochemical results of GJB4, RIPPLY3, and ADRA1B characteristic gene tissue chips indicated that the expression in tumor tissues was significantly higher than that in adjacent tissues.
FIGURE 6The GO enrichment analyses. (A). Biological process; (B). Cellular component; (C). Molecular function.
FIGURE 7The PPI networks of DE mRNA.