| Literature DB >> 31703248 |
Mario Pérez Sayáns1, Cintia Micaela Chamorro Petronacci1, Alejandro Ismael Lorenzo Pouso1, Elena Padín Iruegas2, Andrés Blanco Carrión1, José Manuel Suárez Peñaranda3, Abel García García1.
Abstract
The aim of this present study was to comprehensively describe somatic DNA alterations and transcriptional alterations in the last extension of the HNSCC subsets in TCGA, encompassing a total of 528 tumours. In order to achieve this goal, transcriptional analysis, functional enrichment assays, survival analysis, somatic copy number alteration analysis and somatic alteration analysis were carried out. A total of 3491 deregulated genes were found in HNSCC patients, and the functional analysis carried out determined that tissue development and cell differentiation were the most relevant signalling pathways in upregulated and downregulated genes, respectively. Somatic copy number alteration analysis showed a "top five" altered HNSCC genes: CDKN2A (deleted in 32.03% of patients), CDKN2B (deleted in 28.34% of patients), PPFIA1 (amplified in 26.02% of patients), FADD (amplified in 25.63% of patients) and ANO1 (amplified in 25.44% of patients). Somatic mutations analysis revealed TP53 mutation in 72% of the tumour samples followed by TTN (39%), FAT1 (23%) and MUC16 (19%). Another interesting result is the mutual exclusivity pattern that was discovered between the TP53 and PIK3CA mutations, and the co-occurrence of CDKN2A with the TP53 and FAT1 alterations. On analysis to relate differential expression genes and somatic copy number alterations, some genes were overexpressed and amplified, for example, FOXL2, but other deleted genes also showed overexpression, such as CDKN2A. Survival analysis revealed that overexpression of some oncogenes, such as EGFR, CDK6 or CDK4 were associated with poorer prognosis tumours. These new findings help us to develop new therapies and programs for the prevention of HNSCC.Entities:
Keywords: DNA copy number variation; differentially expressed genes; genomics; head and neck neoplasms; oncogenes
Year: 2019 PMID: 31703248 PMCID: PMC6912350 DOI: 10.3390/jcm8111896
Source DB: PubMed Journal: J Clin Med ISSN: 2077-0383 Impact factor: 4.241
Figure 1mRNA expression for each gene was computed into the Cancer Genome Atlas (TCGA)-head and neck squamous cell carcinomas (HNSCCs) tumour and normal samples. (A) Differentially expressed genes z-score heat map comparing the mRNA levels for tumours and normal samples. Hierarchical clustering shows the associations between the samples which share similar alteration patterns. This figure also shows information which makes an association between the human papillomavirus (HPV) infection status and the resection sites. (B) Heatmap of z-score for relevant genes involved in HNSCC pathogenesis that are differentially expressed between the normal and tumour samples. Clinical information has also been added to better differentiate altered patterns according to the HPV infection status or resection site.
Figure 2The functional annotation analysis shows the association of the genes with several biological processes. (A) Top differentially expressed genes both up- or down-regulated were annotated functionally in GO terms derived from the biological process. Each GO term is represented as a node and links show the gene sharing level between two nodes. Each colour represents one GO term as indicated in the legend. (B) GO terms are segregated depending on when they are more likely to be affected by up- or down-regulated genes. The heat map indicates the significant level of each term annotation depending on the number of genes contained from the input list.
20 most relevant differentially expressed genes (DEGs) exclusively in oral cavity tumour samples compared with normal tissue.
| Gene | LogFC | |
|---|---|---|
|
| −18,712,923 | 4.87 × 10−29 |
|
| −16,587,232 | 1.47 × 10−22 |
|
| −16,583,309 | 2.57 × 10−22 |
|
| −14,657,516 | 1.75 × 10−17 |
|
| −13,584,484 | 5.26 × 10−15 |
|
| 175,035,565 | 1.07 × 10−14 |
|
| −13,347,877 | 1.67 × 10−14 |
|
| −13,085,358 | 6.18 × 10−14 |
|
| −13,145,417 | 7.70 × 10−14 |
|
| −12,952,743 | 1.06 × 10−13 |
|
| −12,528,259 | 7.59 × 10−13 |
|
| −12,484,188 | 9.27 × 10−13 |
|
| −12,205,336 | 3.31 × 10−12 |
|
| −1,218,887 | 3.51 × 10−12 |
|
| −12,226,496 | 4.06 × 10−12 |
|
| −12,319 | 4.48 × 10−12 |
|
| −12,128,568 | 4.66 × 10−12 |
|
| −12,117,518 | 4.85 × 10−12 |
|
| −12,054,309 | 6.57 × 10−12 |
|
| 152,505,375 | 1.05 × 10−11 |
20 most relevant DEGs exclusively in oropharynx tumour samples compared with normal tissue.
| Gene | LogFC | |
|---|---|---|
|
| −33,881,326 | 7.40 × 10−43 |
|
| 372,051,812 | 1.22 × 10−42 |
|
| 355,120,316 | 2.02 × 10−40 |
|
| −33,411,619 | 1.71 × 10−38 |
|
| 321,467,402 | 1.30 × 10−34 |
|
| 310,447,612 | 1.07 × 10−32 |
|
| 309,645,231 | 1.68 × 10−32 |
|
| 30,654,531 | 3.96 × 10−32 |
|
| 286,859,034 | 7.55 × 10−29 |
|
| −26,945,379 | 7.58 × 10−29 |
|
| 281,706,716 | 5.90 × 10−28 |
|
| 274,622,975 | 8.29 × 10−27 |
|
| 261,626,827 | 8.18 × 10−25 |
|
| 259,473,711 | 1.98 × 10−24 |
|
| 258,820,947 | 2.17 × 10−24 |
|
| −23,997,095 | 3.88 × 10−23 |
|
| −23,590,997 | 3.91 × 10−23 |
|
| 250,444,482 | 4.37 × 10−23 |
|
| 240,992,451 | 1.26 × 10−21 |
|
| −22,640,948 | 1.43 × 10−21 |
20 most relevant DEGs exclusively in hypopharynx tumour samples compared with normal tissue.
| Gene | logFC | |
|---|---|---|
|
| 868,622,546 | 1.65 × 10−121 |
|
| 244,008,193 | 1.00 × 10−24 |
|
| 238,001,899 | 7.97 × 10−24 |
|
| 214,408,262 | 3.48 × 10−20 |
|
| −16,847,413 | 1.01 × 10−19 |
|
| 210,854,558 | 1.23 × 10−19 |
|
| 207,400,768 | 2.97 × 10−19 |
|
| 179,379,726 | 3.00 × 10−15 |
|
| 177,738,587 | 8.93 × 10−15 |
|
| 173,946,254 | 1.45 × 10−14 |
|
| 172,587,192 | 3.47 × 10−14 |
|
| −14,096,238 | 5.27 × 10−14 |
|
| 170,462,856 | 5.27 × 10−14 |
|
| 16,819,588 | 9.36 × 10−14 |
|
| 164,630,763 | 2.77 × 10−13 |
|
| 163,016,037 | 4.34 × 10−13 |
|
| 161,613,625 | 7.17 × 10−13 |
|
| 159,875,222 | 9.05 × 10−13 |
|
| 159,391,324 | 1.28 × 10−12 |
|
| 156,667,856 | 2.28 × 10−12 |
20 most relevant DEGs exclusively in lip tumour samples compared with normal tissue.
| Gene | logFC | |
|---|---|---|
|
| −20,994,236 | 4.39 × 10−23 |
|
| −19,759,467 | 4.32 × 10−20 |
|
| −17,718,926 | 1.14 × 10−16 |
|
| −1,733,074 | 2.39 × 10−16 |
|
| 171,507,607 | 8.43 × 10−13 |
|
| −14,775,272 | 4.09 × 10−12 |
|
| −14,416,166 | 1.03 × 10−11 |
|
| −14,089,033 | 5.52 × 10−11 |
|
| −13,914,239 | 6.44 × 10−11 |
|
| −13,655,624 | 1.54 × 10−10 |
|
| −13,635,425 | 1.86 × 10−10 |
|
| −13,532,494 | 1.99 × 10−10 |
|
| −1,329,676 | 3.54 × 10−10 |
|
| −13,386,114 | 4.65 × 10−10 |
|
| −13,154,093 | 5.58 × 10−10 |
|
| −13,117,555 | 5.83 × 10−10 |
|
| −13,151,428 | 5.99 × 10−10 |
|
| 14,316,611 | 1.22 × 10−09 |
|
| −12,854,897 | 1.52 × 10−09 |
|
| 142,602,231 | 1.54 × 10−09 |
Survival analysis of DEGs in HNSCC. K-M p va. > 0 means high expression with poorer prognosis and <0 means lower expression with poorer prognosis. ONC: oncogene, TSG: tumour suppressor gene.
| Gene | K-M P.VAL | Cosmic Condition | FDR | |
|---|---|---|---|---|
|
| 0.02114593 | ONC | 3.41 × 10−10 | 1.97 × 10−09 |
|
| −0.0004892 | ONC | 1.21 × 10−06 | 4.90 × 10−06 |
|
| −0.0034022 | ONC | 1.76 × 10−16 | 1.54 × 10−15 |
|
| 0.00832232 | ONC | 1.35 × 10−07 | 6.11 × 10−07 |
|
| 0.01344407 | ONC | 6.01 × 10−12 | 3.96 × 10−11 |
|
| 0.01370546 | ONC | 1.43 × 10−09 | 7.86 × 10−09 |
|
| 0.00467991 | ONC | 1.33 × 10−08 | 6.68 × 10−08 |
|
| 0.00523221 | TSG | 3.26 × 10−07 | 1.41 × 10−06 |
|
| 0.02339692 | ONC | 5.87 × 10−17 | 5.34 × 10−16 |
|
| 0.04179603 | ONC | 6.87 × 10−26 | 9.67 × 10−25 |
|
| 0.00962668 | ONC | 4.44 × 10−07 | 1.89 × 10−06 |
|
| −0.0229118 | ONC | 1.18 × 10−24 | 1.57 × 10−23 |
|
| 0.00936317 | ONC | 7.48 × 10−67 | 2.54 × 10−65 |
|
| 0.00695101 | ONC | 1.21 × 10−17 | 1.15 × 10−16 |
|
| 0.00020864 | TSG | 3.69 × 10−29 | 5.73 × 10−28 |
|
| −0.0117944 | ONC | 4.94 × 10−09 | 2.57 × 10−08 |
|
| −0.0272594 | ONC | 4.74 × 10−07 | 2.01 × 10−06 |
|
| 0.0032238 | ONC | 3.99 × 10−07 | 1.71 × 10−06 |
|
| −0.0263913 | ONC | 4.14 × 10−11 | 2.57 × 10−10 |
|
| 0.04810527 | ONC | 1.01 × 10−08 | 5.10 × 10−08 |
|
| −0.0336722 | TSG | 7.65 × 10−07 | 3.17 × 10−06 |
|
| 0.032848 | ONC | 2.20 × 10−35 | 4.02 × 10−34 |
|
| 0.01805839 | ONC | 7.85 × 10−11 | 4.76 × 10−10 |
|
| −0.0433087 | TSG | 5.12 × 10−21 | 5.82 × 10−20 |
|
| −0.0164834 | ONC | 1.68 × 10−14 | 1.32 × 10−13 |
DEGs in HPV (+) and HPV (−) tumour samples.
| Gene | logFC | |
|---|---|---|
|
| 743,557,827 | 0 |
|
| 685,090,585 | 0 |
|
| 646,035,753 | 0 |
|
| 632,737,642 | 0 |
|
| 586,115,531 | 3.19 × 10−305 |
|
| 571,577,721 | 2.08 × 10−287 |
|
| 574,001,163 | 6.07 × 10−287 |
|
| 540,074,496 | 2.27 × 10−257 |
|
| 518,927,943 | 2.98 × 10−239 |
|
| 51,354,914 | 1.50 × 10−232 |
|
| 507,100,644 | 1.63 × 10−226 |
|
| 496,943,561 | 4.06 × 10−219 |
|
| 483,751,729 | 5.47 × 10−205 |
|
| 477,878,379 | 4.33 × 10−200 |
|
| 475,032,624 | 5.33 × 10−199 |
|
| 477,610,842 | 1.63 × 10−196 |
|
| 476,235,561 | 5.24 × 10−196 |
|
| 455,517,399 | 5.03 × 10−180 |
|
| 446,391,442 | 2.64 × 10−175 |
|
| 442,147,634 | 7.60 × 10−172 |
Figure 3The GISTIC algorithm results reveal alterations of different genes at copy number level. (A) The chromosomal location of these alterations is represented as peaks in which the height over the χ-axis represents the significant q-value calculated and the width on the Υ-axis is associated with the genomic extension affected by this change. (B) The Top COSMIC cancer gene census list altered in the TCGA-HNSCC cohort are represented in the oncoplot. Blue is associated with a deep deletion in a patient of a particular gene whereas red represents a deep amplification. The chromosome location is provided as well as the percentage (%) of patients affected with each gene’s somatic copy number alteration (SCNA) in the cohort.
COSMIC DEGs HNSCC also with a copy number changed in more than 10% of the cohort. DEL: deleted, AMP: amplified.
| Gene | logFC | FDR | Patients% | Condition | |
|---|---|---|---|---|---|
|
| −5.46 | 2.00 × 10−194 | 2.05 × 10−192 | 198,058,252 | AMP |
|
| 4.15 | 1.08 × 10−25 | 8.69 × 10−25 | 32,038,835 | DEL |
|
| 4.36 | 1.64 × 10−27 | 1.44 × 10−26 | 100,970,874 | AMP |
|
| 9.34 | 7.97 × 10−68 | 2.40 × 10−66 | 244,660,194 | AMP |
|
| 7.42 | 6.14 × 10−52 | 1.19 × 10−50 | 250,485,437 | AMP |
|
| 4.56 | 8.80 × 10−29 | 8.41 × 10−28 | 100,970,874 | AMP |
Figure 4The MuTect2 and VarScan variant caller results are combined in order to facilitate the process of finding every possible alteration in the TCGA-HNSCC cohort. (A) Variants show the differences in abundancy across the cohort with missense being the largest represented group. (B) Nucleotide changes in single nucleotide variants (SNVs) show the differences in frequency, and the cases in which citosines are substituted by guanosines in the coding DNA strand are more likely to occur. (C) Oncoplot showing the top 10 most altered genes. The colours are associated with one class of variant and the percentage (%) of patients affected is shown on the graph. (D) Mutation mutual exclusivity matrix heatmap representation. The green colour is associated with a positive correlation whereas red indicates mutual exclusivity. p-value < 0.05 *.