| Literature DB >> 34422655 |
Yong Zhu1,2, Shaoqiu Chen3,4, Zhougui Ling3,5, Andrew Winnicki3, Lilly Xu3, Shaun Xu3, Ting Gong3, Bin Jiang1, Gang Huang6, Youping Deng3.
Abstract
Colorectal cancer often presents as a highly variable disease with myriad forms that are at times difficult to detect in early screenings with sufficient accuracy, for which novel diagnostic methods are an attractive and valuable area of improvement. To improve colorectal cancer diagnosis and prognosis, new biomarkers that can be assembled into a diagnostic panel must be identified, and tRNA-derived small RNAs (tsRNAs) are a particularly interesting and increasingly visible new class of molecules to examine. In this study, small RNA-seq data were profiled for the expression of 104 human tsRNAs in tumor tissue and adjacent normal tissue samples, and a diagnostic model was built based on four differentially expressed tsRNAs: tRF-22-WB86Q3P92, tRF-22-WE8SPOX52, tRF-22-WE8S68L52, tRF-18-8R1546D2. Furthermore, the diagnostic model was validated by two independent validation datasets (AUC was 0.97 and 0.99), and a LASSO model was applied to develop a seven-tsRNA-based risk score model for colorectal cancer prognosis. Finally, a tsRNA-mRNA interaction network was established according to potential mRNA targets predicted by bioinformatic methods. In conclusion, the results suggest that abnormal expression of tsRNA in colorectal cancer may have a functional effect on tumor action and moreover, that some of the tsRNAs identified in this study with diagnostic and prognostic potential could be of clinical significance.Entities:
Keywords: colorectal cancer; diagnosis; prognosis; random forest; tRNA-derived small RNAs
Year: 2021 PMID: 34422655 PMCID: PMC8371552 DOI: 10.3389/fonc.2021.701440
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 5.738
Demographic and clinical characteristic of patients with Colorectal cancer in discovery Cohort set and two independent validation Cohort.
| Cohort | SRP107326 | TCGA | SRP183064 |
|---|---|---|---|
|
| |||
| Adjacent Normal | 104 | 11 | 6 |
| Primary Tumor | 104 | 603 | 6 |
|
| 62(55.7-68.2) | 67.0(57.0-65.8) | |
|
| |||
| Male, count (%) | 59(56.7) | 315(52.2) | |
|
| |||
| Asian | 104 | 13 | 6 |
| White | 308 | ||
| Black or African American | 71 | ||
| American Indian or Alaska Native | 1 | ||
| Not Reported | 210 | ||
|
| |||
| Radiation Therapy | Not Reported | 307 | Not Reported |
| Pharmaceutical Therapy | 296 | ||
|
| |||
| Stage I | 18 | 104 | |
| Stage II | 29 | 180 | |
| Stage III | 42 | 172 | |
| Stage IV | 13 | 89 | |
| Not Reported | 24 | 6 | |
|
| |||
| Adenocarcinoma | 501 | ||
| other | 68 | ||
|
| |||
| Death | Not Reported | 478 | Not Reported |
| Alive | 125 | ||
|
| 638.0(370.5-1091.0) |
Figure 1tsRNA diagnostic panel. (A) Volcano plot of differentially expressed tsRNAs between normal and cancer. (B) Random forest RF mean decrease of accuracy and mean decrease Gini score rank. (C) ROC curve of the diagnostic prediction model with tsRNA markers in the discovery data and two independent validation data sets. (D–F) Four model selected tsRNA expression in discovery dataset and two independent validation data sets.
Figure 2tsRNA prognostic panel. (A) prognostic marker selection: Univariant-Cox and LASSO-Cox were applied to a cohort with survival data to identify seven markers’ final determination. L1 norm is LASSO coefficient profiles of the 8 tsRNAs. A vertical line is drawn at the optimal value by 1 s.e. criteria and results in six non-zero coefficients. Five tsRNAs were selected in the LASSO Cox regression model. (B) Kaplan-Meier survival curve showing overall survival (OS) in patients without clinical characteristic in high risk and low risk score; (C) Kaplan-Meier survival curve showing overall survival (OS) in patients with clinical characteristic in high risk and low risk score; (D) Multivariate Cox regression analysis of the tsRNA-based prognostic model with OS. Event; *p < 0.05; ***p < 0.001.
Figure 3Survival analysis at different cancer stages. Prognostic model in (A) Stage I; (B), Stage II; (C), Stage III and (D) Stage IV.
Figure 4The tsRNA/mRNA network analysis. (A) Four diagnostic model selected tsRNAs and their predicted target mRNAs. (B) Seven prognostic model selected tsRNAs and their predicted target mRNAs.
Figure 5Function enrichment analysis. (A) biological process; (B) cellular component; (C), molecular function; STRING-db enrichment in (D) Pfam; (E) KEGG; (F) interPro enrichment.