| Literature DB >> 31616639 |
Joseph Dhahbi1, Yury O Nunez Lopez2, Augusto Schneider3, Berta Victoria4, Tatiana Saccon3,4, Krish Bharat1, Thaddeus McClatchey1, Hani Atamna1, Wojciech Scierski5, Pawel Golusinski6,7,8, Wojciech Golusinski8, Michal M Masternak4,7,8.
Abstract
Oral squamous cell carcinoma (OSCC) is the most common type of head and neck cancer and, as indicated by The Oral Cancer Foundation, kills at an alarming rate of roughly one person per hour. With this study, we aimed at better understanding disease mechanisms and identifying minimally invasive disease biomarkers by profiling novel small non-coding RNAs (specifically, tRNA halves and YRNA fragments) in both serum and tumor tissue from humans. Small RNA-Sequencing identified multiple 5' tRNA halves and 5' YRNA fragments that displayed significant differential expression levels in circulation and/or tumor tissue, as compared to control counterparts. In addition, by implementing a modification of weighted gene coexpression network analysis, we identified an upregulated genetic module comprised of 5' tRNA halves and miRNAs (miRNAs were described in previous study using the same samples) with significant association with the cancer trait. By consequently implementing miRNA-overtargeting network analysis, the biological function of the module (and by "guilt by association," the function of the 5' tRNA-Val-CAC-2-1 half) was found to involve the transcriptional targeting of specific genes involved in the negative regulation of the G1/S transition of the mitotic cell cycle. These findings suggest that 5' tRNA-Val-CAC-2-1 half (reduced in serum of OSCC patients and elevated in the tumor tissue) could potentially serve as an OSCC circulating biomarker and/or target for novel anticancer therapies. To our knowledge, this is the first time that the specific molecular function of a 5'-tRNA half is specifically pinpointed in OSCC.Entities:
Keywords: 5′ YRNA fragments; 5′ tRNA halves; OSCC; WGCNA; coexpression network; microRNA; oral cancer; small RNA-Seq
Year: 2019 PMID: 31616639 PMCID: PMC6775249 DOI: 10.3389/fonc.2019.00959
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 6.244
Figure 1Small non-coding RNA-Seq analysis in OSCC. (A–D) Multi-dimensional scaling (MDS) analysis of the expression levels of 5′ tRNA-halves and 5′ YRNA fragments. The MDS analysis generates distances that represent the biological coefficient of variation (BCV) between samples. MDS plots show serum expression differences of 5′ tRNA-halves between patients with and without OSCC (A) and between tumor and adjacent normal tissues (B). Also, MDS plots show serum expression differences of 5′ YRNA fragments between patients with and without OSCC (C) and between tumor and adjacent normal tissues (D). Dimension 1 of the MDS plot depicts the cancer effect on the expression levels of 5′ tRNA-halves or 5′ YRNA fragments, while dimension 2 represents the homogeneity between biological replicates. Samples N1–N5 represent Normal while samples C1–C5 represent Cancer. (E,F) Differential expression of a 5′ tRNA-half derived from the tRNA-Val-CAC-2-1 gene in serum from patients with and without OSCC (E), and in solid OSCC tumor relatively to adjacent normal tissue (F). The UCSC genome browser screenshots illustrate the alignment of reads to the tRNA-Val-CAC-2-1 gene. (E) The alignment (number of reads, y-axis) shows that the numbers of reads mapping to the 5′ end of tRNA-Val-CAC-2-1 gene are significantly lower in serum from patients with (red) than without (blue) OSCC. (F) In contrast, the alignment shows that the numbers of reads mapping to the 5′ end of tRNA-Val-CAC-2-1 gene are significantly higher in solid OSCC (red) relatively to adjacent normal tissue (blue). Shown at the bottom is the tRNA-Val-CAC-2-1 gene annotation from the tRNA genes track “Transfer RNA Genes Identified with tRNAscan-SE” associated with the human GRCh38/hg38 genome.
Twenty-two 5′ tRNA halves significantly (FDR < 5%) decreased in serum of patients with OSCC.
| tRNA-Arg-TCT-1-1 | chr1:93847572-93847657 + | 3191 | −6.1 | <0.001 | <0.001 | 73 | 3.6 | 0.004 | 12 |
| tRNA-Glu-TTC-2-1 | chr13:44917926-44917998 – | 1774 | −6.6 | <0.001 | <0.001 | 600 | 1.4 | 0.388 | 77 |
| tRNA-Glu-TTC-2-2 | chr15:26082233-26082305 – | 1780 | −6.8 | <0.001 | <0.001 | 591 | 1.4 | 0.373 | 77 |
| tRNA-His-GTG-1-5 | chr6:27158126-27158198 + | 1810 | −6.7 | <0.001 | <0.001 | 1,374 | 1.7 | 0.124 | 65 |
| tRNA-Lys-CTT-1-1 | chr14:58239894-58239967 – | 2449 | −5.1 | <0.001 | <0.001 | 3,226 | −1.0 | 0.968 | 97 |
| tRNA-Val-AAC-1-1 | chr3:169772229-169772302 + | 33416 | −5.7 | <0.001 | 0.001 | 54,730 | 1.4 | 0.013 | 23 |
| tRNA-Val-AAC-1-2 | chr5:181164153-181164226 + | 34680 | −5.7 | <0.001 | 0.001 | 56,639 | 1.4 | 0.013 | 23 |
| tRNA-Val-AAC-1-3 | chr5:181169609-181169682 + | 34102 | −5.6 | <0.001 | 0.001 | 55,723 | 1.4 | 0.013 | 23 |
| tRNA-Val-AAC-1-4 | chr5:181218269-181218342 - | 9117 | −5.8 | <0.001 | <0.001 | 14,817 | 1.5 | 0.064 | 53 |
| tRNA-Val-AAC-1-5 | chr6:27753399-27753472 – | 8871 | −5.7 | <0.001 | <0.001 | 14,333 | 1.4 | 0.074 | 53 |
| tRNA-Val-AAC-3-1 | chr6:27650927-27651000 – | 8897 | −5.9 | <0.001 | <0.001 | 14,112 | 1.4 | 0.074 | 53 |
| tRNA-Val-AAC-4-1 | chr6:27681105-27681178 – | 8658 | −6.0 | <0.001 | <0.001 | 14,021 | 1.4 | 0.073 | 53 |
| tRNA-Val-CAC-1-1 | chr1:161399699-161399772 – | 9451 | −5.5 | <0.001 | 0.001 | 15,785 | 1.4 | 0.107 | 61 |
| tRNA-Val-CAC-1-2 | chr5:181097069-181097142 + | 34761 | −5.4 | <0.001 | 0.001 | 59,247 | 1.4 | 0.029 | 33 |
| tRNA-Val-CAC-1-3 | chr5:181102252-181102325 – | 9028 | −5.5 | <0.001 | 0.001 | 15,380 | 1.4 | 0.105 | 61 |
| tRNA-Val-CAC-1-4 | chr5:181173649-181173722 + | 34900 | −5.5 | <0.001 | 0.001 | 59,839 | 1.4 | 0.028 | 33 |
| tRNA-Val-CAC-1-5 | chr5:181222394-181222467 – | 9298 | −5.4 | <0.001 | 0.001 | 15,538 | 1.4 | 0.097 | 59 |
| tRNA-Val-CAC-1-6 | chr6:26538053-26538126 + | 34603 | −5.5 | <0.001 | 0.001 | 58,857 | 1.4 | 0.029 | 33 |
| tRNA-Val-CAC-4-1 | chr1:143803993-143804066 – | 9212 | −5.6 | <0.001 | <0.001 | 15,155 | 1.4 | 0.106 | 61 |
| tRNA-Val-CAC-5-1 | chr1:121020728-121020801 – | 8863 | −5.6 | <0.001 | 0.001 | 14,921 | 1.4 | 0.107 | 61 |
| tRNA-Val-CAC-chr1-93 | chr1:149712551-149712624 – | 9465 | −5.5 | <0.001 | 0.001 | 15,726 | 1.4 | 0.098 | 59 |
| chr6:27280269-27280342 – | 17293 | −7.6 | <0.001 | <0.001 | 39,336 | 2.0 | <0.001 | 1 | |
tRNA gene name from Genomic tRNA Database (.
Genomic coordinates of tRNA genes in the human GRCh38/hg38 genome.
Average tRNA read counts-per-million (CPM) computed over all libraries from serum or tissue taking into account the estimated dispersions and the libraries sizes. It represents a measure of the overall expression level of the tRNA fragments.
Fold change, P-value and FDR (<5%) for differential abundance computed by EdgeR.
tRNA half that changed expression in both tumor tissues and serum.
Respective tumor tissue data presented for comparison.
Six 5′ tRNA halves significantly (FDR < 5%) increased in tumor relatively to healthy adjacent tissue in patients with OSCC.
| tRNA-Gly-CCC-1-2 | chr1:16861920-16861991 + | 39,401 | 2.1 | <0.001 | 0.716 | 118,338 | 1.2 | 0.584 | 68 |
| tRNA-Gly-CCC-chr1-137 | chr1:16545938-16546009 – | 12,631 | 2.1 | 0.001 | 3.875 | 37,450 | 1.1 | 0.705 | 79 |
| tRNA-Gly-GCC-2-5 | chr16:70789506-70789577 + | 46,189 | 1.8 | <0.001 | 1.989 | 120,217 | 1.2 | 0.586 | 68 |
| tRNA-Gly-GCC-2-6 | chr17:8125745-8125816 + | 46,245 | 1.8 | <0.001 | 1.989 | 117,350 | 1.2 | 0.594 | 69 |
| tRNA-Gly-GCC-5-1 | chr16:70788693-70788764 + | 45,777 | 1.8 | <0.001 | 1.989 | 119,488 | 1.2 | 0.582 | 68 |
| chr6:27280269-27280342 – | 39,336 | 2.0 | <0.001 | 0.777 | 17,293 | −7.6 | <0.001 | 0 | |
tRNA gene name from Genomic tRNA Database (.
Genomic coordinates of tRNA genes in the human GRCh38/hg38 genome.
Average tRNA read counts-per-million (CPM) computed over all libraries from serum or tissue taking into account the estimated dispersions and the libraries sizes. It represents a measure of the overall expression level of the tRNA fragments.
Fold change, P-value and FDR (<5%) for differential abundance computed by EdgeR.
tRNA half that changed expression in both tumor tissues and serum.
Respective serum data presented for comparison.
5′ YRNA fragments significantly changed in serum of patients with OSCC.
| RNY4-201 | chr7:148963314-148963410 + | 333105 | −2.7 | <0.001 | 0.002% |
| Y_RNA.295-201 | chr3:157153547-157153640 + | 331916 | −2.7 | <0.001 | 0.003% |
| RNY4P10-201 | chr6:33199600-33199696 + | 327747 | −2.6 | <0.001 | 0.003% |
| RNY4P7-201 | chr2:127798902-127798998 – | 151705 | −2.8 | <0.001 | 0.001% |
YRNA gene name from the UCSC GENCODE v24 track.
Genomic coordinates of the YRNA genes in the human GRCh38/hg38 genome.
Average tRNA read counts-per-million computed over all libraries and taking into account the estimated dispersions and the libraries sizes. It represents a measure of the overall expression level of the tRNA fragments.
Fold change, P-value and FDR (<5%) for differential abundance computed by EdgeR.
Figure 2Identifying cancer-relevant modules by weighted gene coexpression network analysis (WGCNA). The network was created from the weighted correlation matrix generated by the WGCNA package in the R environment. First, an adjacency matrix is calculated, then the topological overlap (TO) to hierarchically cluster genes into coexpression modules (see section Materials and Methods). Final module assignments were made based on module membership. (A) Cluster dendrogram groups genetic features into distinct modules. The y-axis represents a dissimilarity distance (1–TO). Dynamic tree cutting was used to determine modules, by dividing the dendrogram at significant branch points (identifying cancer-relevant modules). (B) Correlations between the module eigengenes (representative module expression pattern) and the cancer tratit. Green and Red modules display strong, highly significant correlations to the cancer trait (P-values between parenthesis, scale bar represents the range of correlation coefficients). (C) Barplot of the average gene significance (GS) for each detected module, equivalent to the average correlation between the module genetic features and the cancer trait. (D) Barplot of the average –log P-value of GS. Two modules: Green and Red have a mean GS P < 0.05 (–log10(P) > 1.3). Color-coding of the plots' bars represent module names, unless specified as for the scale bar in (B).
Figure 3Characteristic coexpression patterns (module eigengenes), gene significance (GS) and module membership for genetic features in OSCC-relevant modules. (A,B) Plots of the module eigengenes (representative coexpression pattern) for the cancer-relevant Green and Red modules (C,D). Scatter plots of correlations between gene significance (GS) and module membership for cancer-relevant Green and Red modules. Color-coding is equivalent to module names. Non-miRNA genetic features are labeled with black fonts in the scatter plots. Oval: most relevant 5′ tRNA half.
Figure 4Discovering molecular functions for the cancer-associated Green coexpression module. (A) Network of genes that are significantly overtargeted by the top 10 miRNAs in the Green module. Blue circles represent overtargeted genes, which are potentially downregulated in tumor tissue (validation of gene expression subset in next figure). Purple rectangles represent the top 10 miRNAs with highest membership in the Green module and high gene significance to the cancer trait (these miRNAs are upregulated in the OSSC tissue). (B) KEGG pathway enrichment among the Green module-overtargeted genes shown in (A). Companion table for the enrichment analysis is presented in Supplementary Table 3.
Figure 5Confirming cell cycle-related functions for the cancer-associated Green coexpression module. (A) Enrichment for gene ontology (GO, biological processes) annotations among the Green module-overtargeted genes shown in Figure 4A. Companion table for the enrichment analysis is presented in Supplementary Table 4. (B–E) Validation of gene expression changes in tumor tissue (compared to adjacent normal/healthy tissue) by quantitative real time PCR (qPCR). Relative quantification was implemented using the 2−ΔΔCt method for calculation of fold changes in gene expression. Expression of housekeeping gene β2-microglobulin was used as endogenous reference for normalization. Statistical analysis using the Student's t-test was implemented on the log transformed fold change data.