| Literature DB >> 30041465 |
Alexandra Iulia Irimie1, Cornelia Braicu2, Roxana Cojocneanu3, Lorand Magdo4, Anca Onaciu5, Cristina Ciocan6, Nikolay Mehterov7,8, Diana Dudea9, Smaranda Buduru10, Ioana Berindan-Neagoe11,12,13.
Abstract
Smoking is a well-known behavior that has an important negative impact on human health, and is considered to be a significant factor related to the development and progression of head and neck squamous cell carcinomas (HNSCCs). Use of high-dimensional datasets to discern novel HNSCC driver genes related to smoking represents an important challenge. The Cancer Genome Atlas (TCGA) analysis was performed in three co-existing groups of HNSCC in order to assess whether gene expression landscape is affected by tobacco smoking, having quit, or non-smoking status. We identified a set of differentially expressed genes that discriminate between smokers and non-smokers or based on human papilloma virus (HPV)16 status, or the co-occurrence of these two exposome components in HNSCC. Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways classification shows that most of the genes are specific to cellular metabolism, emphasizing metabolic detoxification pathways, metabolism of chemical carcinogenesis, or drug metabolism. In the case of HPV16-positive patients it has been demonstrated that the altered genes are related to cellular adhesion and inflammation. The correlation between smoking and the survival rate was not statistically significant. This emphasizes the importance of the complex environmental exposure and genetic factors in order to establish prevention assays and personalized care system for HNSCC, with the potential for being extended to other cancer types.Entities:
Keywords: TGCA data; gene expression data; head and neck squamous cell carcinomas; smoking; survival rate
Mesh:
Year: 2018 PMID: 30041465 PMCID: PMC6069101 DOI: 10.3390/ijerph15071558
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Succinct presentation of the TCGA characteristics for patients diagnosed with HNSCCs used for gene expression analysis. Head and neck squamous cell carcinomas: HNSCCs, T: tumor, N: node, M0 no metastasis, Mx: metastases presence; HPV: human papilloma virus; TCGA: The Cancer Genome Atlas.
| Clinical Parameters | Patients ( | Females/Males | |
|---|---|---|---|
| Sex | Females | 134 | |
| Males | 374 | ||
| Undeclared | 11 | ||
| Age | Median, range | 61, 19–90 | |
| Median, range males | 59, 19–88 | ||
| Median, range females | 64.5, 24–90 | ||
| Undeclared | 11 | ||
| Clinical stage | 1 | 20 | 9/11 |
| 2 | 98 | 33/65 | |
| 3 | 101 | 26/75 | |
| 4 | 275 | 64/211 | |
| Unknown | 25 | ||
| Clinical TNM | T1N0M0 | 18 | 7/11 |
| T1N1M0 | 5 | 2/3 | |
| T1N2M0 | 7 | 1/6 | |
| T1N2M1 | 1 | -/1 | |
| T1NxM0 | 2 | 2/- | |
| T1N0M0 | 96 | 33/63 | |
| T1N0Mx | 1 | -/1 | |
| T2N1M0 | 12 | 3/9 | |
| T2N1Mx | 1 | 1/- | |
| T2N2M0 | 33 | 6/27 | |
| T2N2M1 | 1 | -/1 | |
| T2N2Mx | 3 | -/3 | |
| T1N3M0 | 1 | -/1 | |
| T2NxM0 | 1 | -/1 | |
| T3N0M0 | 58 | 15/43 | |
| T3N0Mx | 1 | 1/- | |
| T2N1M0 | 20 | 4/16 | |
| T3N1M1 | 1 | 1/- | |
| T3N2M0 | 46 | 8/38 | |
| T3N2M1 | 1 | 1/- | |
| T3N2Mx | 1 | -/1 | |
| T3N3M0 | 1 | -/1 | |
| T3NxM0 | 2 | -/2 | |
| T4N0M0 | 66 | 17/49 | |
| T4N0Mx | 1 | 1/- | |
| T4N1M0 | 39 | 13/26 | |
| T4N2M0 | 62 | 14/48 | |
| T4N2Mx | 1 | -/1 | |
| T4N3M0 | 7 | 2/5 | |
| T4NxM0 | 3 | -/3 | |
| TxN1M0 | 1 | -/1 | |
| TxN2M0 | 1 | -/1 | |
| TxNxMx | 9 | -/9 | |
| Unknown | 16 | ||
| Smoking history | Smoker | 174 | 35/139 |
| Reformed smoker <15 years | 134 | 26/108 | |
| Reformed smoker >15 years | 72 | 19/53 | |
| Reformed, unknown years | 2 | 0/2 | |
| Lifelong non-smoker | 114 | 49/65 | |
| Unknown | 23 | ||
| Anatomic neoplasm subdivision | Alveolar ridge | 18 | |
| Base of tongue | 27 | ||
| Buccal mucosa | 20 | ||
| Floor of mouth | 60 | ||
| Hard palate | 7 | ||
| Hypopharynx | 9 | ||
| Larynx | 114 | ||
| Lip | 3 | ||
| Oral cavity | 71 | ||
| Oral tongue | 127 | ||
| Oropharynx | 10 | ||
| Tonsil | 42 | ||
| Unknown | 11 | ||
| HPV | Positive | 72 | |
| Negative | 37 | ||
| Unknown | 410 |
Figure 1The interconnected genes with specific miRNAs using miRnet [22] involved in focal adhesion, extracellular matrix (ECM)–receptor interaction, or pathways in cancer interconnected with targeting microRNAs (miRNAs).
Genes with altered expression levels, based on a fold change (FC) ± 2, p-value ≤ 0.05 for smoking versus never smoking HNSCC patients.
| Gene | FC (abs) | Regulation | Gene | FC (abs) | Regulation | ||
|---|---|---|---|---|---|---|---|
|
| −2.59737 | 0.000603 | Down |
| 2.39461 | 0.000111 | Up |
|
| −2.35924 | 1.45 × 10−6 | Down |
| 2.394392 | 6.22 × 10−6 | Up |
|
| −2.31761 | 0.002097 | Down |
| 2.393242 | 3.2 × 10−6 | Up |
|
| −2.31495 | 0.000321 | Down |
| 2.390684 | 0.000256 | Up |
|
| −2.10091 | 0.046549 | Down |
| 2.380863 | 0.012372 | Up |
|
| −2.08283 | 0.000138 | Down |
| 2.373807 | 8 × 10−6 | Up |
|
| −2.05906 | 8.82 × 10−5 | Down |
| 2.372254 | 6.52 × 10−7 | Up |
|
| −2.02315 | 0.002341 | Down |
| 2.358872 | 0.00092 | Up |
|
| −2.01307 | 0.002746 | Down |
| 2.355639 | 6.22 × 10−6 | Up |
|
| 5.224654 | 8 × 10−6 | Up |
| 2.345015 | 0.000181 | Up |
|
| 4.924049 | 7.27 × 10−5 | Up |
| 2.343389 | 0.000117 | Up |
|
| 4.505707 | 3.3 × 10−7 | Up |
| 2.319508 | 1.31 × 10−6 | Up |
|
| 4.336869 | 4.15 × 10−5 | Up |
| 2.314619 | 3.29 × 10−5 | Up |
|
| 4.296636 | 4.68 × 10−7 | Up |
| 2.31125 | 3.18 × 10−5 | Up |
|
| 4.269958 | 5.3 × 10−13 | Up |
| 2.29202 | 0.000356 | Up |
|
| 3.985658 | 0.001123 | Up |
| 2.28985 | 0.00011 | Up |
|
| 3.985451 | 3.93 × 10−5 | Up |
| 2.2842 | 0.001556 | Up |
|
| 3.913692 | 2.69 × 10−15 | Up |
| 2.257771 | 0.002582 | Up |
|
| 3.748438 | 0.000214 | Up |
| 2.25193 | 0.000222 | Up |
|
| 3.573662 | 0.001207 | Up |
| 2.249305 | 2.06 × 10−5 | Up |
|
| 3.508977 | 3.02 × 10−8 | Up |
| 2.237741 | 6.45 × 10−5 | Up |
|
| 3.41544 | 1.21 × 10−5 | Up |
| 2.234898 | 0.000377 | Up |
|
| 3.289014 | 0.000277 | Up |
| 2.210954 | 0.000992 | Up |
|
| 3.252181 | 3.95 × 10−6 | Up |
| 2.206128 | 0.000183 | Up |
|
| 3.241253 | 2.07 × 10−5 | Up |
| 2.206114 | 0.002395 | Up |
|
| 3.186291 | 1.24 × 10−5 | Up |
| 2.200458 | 8.32 × 10−6 | Up |
|
| 3.18047 | 1.38 × 10−5 | Up |
| 2.194891 | 8.95 × 10−6 | Up |
|
| 3.156294 | 2.95 × 10−6 | Up |
| 2.181679 | 4.15 × 10−6 | Up |
|
| 3.142512 | 6.04 × 10−6 | Up |
| 2.18105 | 4.06 × 10−5 | Up |
|
| 3.102893 | 0.000721 | Up |
| 2.179797 | 0.009085 | Up |
|
| 3.077278 | 7.92 × 10−12 | Up |
| 2.177274 | 0.000323 | Up |
|
| 3.011847 | 0.002582 | Up |
| 2.175127 | 0.000402 | Up |
|
| 3.00819 | 1.43 × 10−5 | Up |
| 2.163271 | 0.00378 | Up |
|
| 2.849531 | 2.55 × 10−5 | Up |
| 2.150982 | 8.66 × 10−5 | Up |
|
| 2.812564 | 0.001704 | Up |
| 2.142351 | 0.004176 | Up |
|
| 2.792954 | 6.04 × 10−6 | Up |
| 2.138528 | 0.001642 | Up |
|
| 2.761805 | 0.001147 | Up |
| 2.131033 | 0.000887 | Up |
|
| 2.756387 | 3.74 × 10−5 | Up |
| 2.128482 | 0.000382 | Up |
|
| 2.731352 | 5.73 × 10−7 | Up |
| 2.119921 | 0.003053 | Up |
|
| 2.721629 | 2.09 × 10−7 | Up |
| 2.118758 | 0.001164 | Up |
|
| 2.67193 | 2.95 × 10−6 | Up |
| 2.117995 | 2.03 × 10−5 | Up |
|
| 2.666479 | 3.29 × 10−5 | Up |
| 2.117211 | 0.003146 | Up |
|
| 2.652387 | 0.004096 | Up |
| 2.112852 | 0.00034 | Up |
|
| 2.639082 | 3.2 × 10−6 | Up |
| 2.112379 | 0.001737 | Up |
|
| 2.635306 | 5.71 × 10−5 | Up |
| 2.108137 | 2.2 × 10−5 | Up |
|
| 2.629508 | 0.001642 | Up |
| 2.085114 | 5.8 × 10−6 | Up |
|
| 2.613751 | 2.12 × 10−6 | Up |
| 2.079458 | 4.68 × 10−7 | Up |
|
| 2.558155 | 0.0013 | Up |
| 2.07414 | 0.012696 | Up |
|
| 2.545223 | 0.002413 | Up |
| 2.068965 | 0.000399 | Up |
|
| 2.540783 | 7.2 × 10−6 | Up |
| 2.057724 | 8.81 × 10−5 | Up |
|
| 2.533913 | 0.00193 | Up |
| 2.055357 | 1.57 × 10−6 | Up |
|
| 2.530633 | 0.00011 | Up |
| 2.041222 | 0.001179 | Up |
|
| 2.504015 | 2.95 × 10−6 | Up |
| 2.033392 | 2.12 × 10−5 | Up |
|
| 2.491234 | 0.000516 | Up |
| 2.032663 | 0.001283 | Up |
|
| 2.479601 | 4.03 × 10−7 | Up |
| 2.023424 | 0.020511 | Up |
|
| 2.437306 | 0.001219 | Up |
| 2.018196 | 0.00048 | Up |
|
| 2.434416 | 7.96 × 10−6 | Up |
| 2.011134 | 0.000245 | Up |
|
| 2.41423 | 6.04 × 10−6 | Up |
| 2.010622 | 4.12 × 10−5 | Up |
|
| 2.396283 | 8.83 × 10−5 | Up |
| 2.009657 | 1.24 × 10−5 | Up |
|
| 2.007499 | 4.31 × 10−5 | Up |
Figure 2Venn diagram used for overlapping the altered gene expression pattern in the case of the three studied groups. Common and specific gene expression signatures for the three groups of HNSCC (head and neck squamous cell carcinomas) patients: non-smokers, ex-smokers, and smokers. (A) For the case of overexpressed genes, 20 common genes from non-smokers vs. smokers and ex-smokers vs. smokers, integrated as network using String, version 10.5) [19]; (B) The case of downregulated genes; (C) Heat maps for the expression level for the three HNSCC patient groups (current smokers, quitters, non-smokers), in dark blue being presented the downregulated genes and in red those overexpressed genes, generated using Gene Spring version 13.0.
Figure 3Gene network generated using Sting program; (A) Network of the interconnected genes; (B) KEGG (Kyoto Encyclopedia of Genes and Genomes) classification based on the altered signature in smoking versus never-smoking HNSCC patients.
Gene ontology (GO) classification based on the gene expression signature in smoking versus never-smoking HNSCC patients using the PantherDB online tool [21].
| Ontology Function | Type | No. Molecules | Percent (%) |
|---|---|---|---|
| Molecular function | binding (GO:0005488) | 33 | 36.7% |
| catalytic activity (GO:0003824) | 30 | 33.3% | |
| transporter activity (GO:0005215) | 14 | 15.6% | |
| receptor activity (GO:0004872) | 4 | 4.4% | |
| signal transducer activity (GO:0004871) | 4 | 4.4% | |
| structural molecule activity (GO:0005198) | 3 | 3.3% | |
| translation regulator activity (GO:0045182) | 1 | 1.1% | |
| antioxidant activity (GO:0016209) | 1 | 1.1% | |
| Biological process | cellular process (GO:0009987) | 48 | 28.1% |
| metabolic process (GO:0008152) | 32 | 18.7% | |
| biological regulation (GO:0065007) | 23 | 13.5% | |
| developmental process (GO:0032502) | 16 | 9.4% | |
| response to stimulus (GO:0050896) | 15 | 8.8% | |
| multicellular organismal process (GO:0032501) | 13 | 7.6% | |
| localization (GO:0051179) | 8 | 4.7% | |
| cellular component organization or biogenesis (GO:0071840) | 7 | 4.1% | |
| biological adhesion (GO:0022610) | 4 | 2.3% | |
| locomotion (GO:0040011) | 3 | 1.8% | |
| immune system process (GO:0002376) | 1 | 0.6% | |
| reproduction (GO:0000003) | 1 | 0.6% | |
| Protein class | transporter (PC00227) | 10 | 13.5% |
| hydrolase (PC00121) | 9 | 12.2% | |
| oxidoreductase (PC00176) | 8 | 10.8% | |
| transcription factor (PC00218) | 8 | 10.8% | |
| nucleic acid binding (PC00171) | 7 | 9.5% | |
| signaling molecule (PC00207) | 6 | 8.1% | |
| transferase (PC00220) | 5 | 6.8% | |
| enzyme modulator (PC00095) | 5 | 6.8% | |
| receptor (PC00197) | 3 | 4.1% | |
| extracellular matrix protein (PC00102) | 2 | 2.7% | |
| cytoskeletal protein (PC00085) | 2 | 2.7% | |
| transfer/carrier protein (PC00219) | 2 | 2.7% | |
| cell junction protein (PC00070) | 2 | 2.7% | |
| lyase (PC00144) | 1 | 1.4% | |
| calcium-binding protein (PC00060) | 1 | 1.4% | |
| defense/immunity protein (PC00090) | 1 | 1.4% | |
| membrane traffic protein (PC00150) | 1 | 1.4% | |
| isomerase (PC00135) | 1 | 1.4% |
Figure 4Venn diagram overlapping the (A) overexpressed and (B) downregulated genes based on a specific analysis of current smokers versus never smokers with HNSCC stage 1, 2, 3, and 4; (C) String Network for the case of downregulated genes in current smokers versus never smokers in HNSCC stage 1, where the genes involved in cell cycle regulation are highlighted in red.
Gene ontology (GO) classification based on the gene expression signature in smoking versus never smoking for stage 1 HNSCC patients using the PantherDB online tool, showing the maximum effect of smoking and the minimum effect of tumor environment.
| Ontology Function | Type | No. Molecules | Percent (%) |
|---|---|---|---|
| Molecular function | binding (GO:0005488) | 1433 | 37.5% |
| catalytic activity (GO:0003824) | 1262 | 33.0% | |
| transporter activity (GO:0005215) | 429 | 11.2% | |
| receptor activity (GO:0004872) | 255 | 6.7% | |
| signal transducer activity (GO:0004871) | 231 | 6.0% | |
| structural molecule activity (GO:0005198) | 157 | 4.1% | |
| antioxidant activity (GO:0016209) | 23 | 0.6% | |
| translation regulator activity (GO:0045182) | 17 | 0.4% | |
| channel regulator activity (GO:0016247) | 13 | 0.3% | |
| Biological process | cellular process (GO:0009987) | 2479 | 29.5% |
| metabolic process (GO:0008152) | 1589 | 18.9% | |
| biological regulation (GO:0065007) | 911 | 10.9% | |
| response to stimulus (GO:0050896) | 696 | 8.3% | |
| localization (GO:0051179) | 583 | 6.9% | |
| cellular component organization or biogenesis (GO:0071840) | 581 | 6.9% | |
| developmental process (GO:0032502) | 534 | 6.4% | |
| multicellular organismal process (GO:0032501) | 523 | 6.2% | |
| immune system process (GO:0002376) | 146 | 1.7% | |
| locomotion (GO:0040011) | 120 | 1.4% | |
| biological adhesion (GO:0022610) | 110 | 1.3% | |
| reproduction (GO:0000003) | 83 | 1.0% | |
| rhythmic process (GO:0048511) | 32 | 0.4% | |
| cell killing (GO:0001906) | 6 | 0.1% | |
| Protein class | nucleic acid binding (PC00171) | 565 | 15.2% |
| transcription factor (PC00218) | 504 | 13.5% | |
| hydrolase (PC00121) | 410 | 11.0% | |
| receptor (PC00197) | 355 | 9.5% | |
| transporter (PC00227) | 269 | 7.2% | |
| signaling molecule (PC00207) | 251 | 6.7% | |
| transferase (PC00220) | 240 | 6.4% | |
| cytoskeletal protein (PC00085) | 192 | 5.2% | |
| enzyme modulator (PC00095) | 179 | 4.8% | |
| oxidoreductase (PC00176) | 149 | 4.0% | |
| extracellular matrix protein (PC00102) | 84 | 2.3% | |
| membrane traffic protein (PC00150) | 83 | 2.2% | |
| ligase (PC00142) | 79 | 2.1% | |
| calcium-binding protein (PC00060) | 72 | 1.9% | |
| structural protein (PC00211) | 63 | 1.7% | |
| isomerase (PC00135) | 42 | 1.1% | |
| lyase (PC00144) | 36 | 1.0% | |
| defense/immunity protein (PC00090) | 35 | 0.9% | |
| cell adhesion molecule (PC00069) | 31 | 0.8% | |
| cell junction protein (PC00070) | 31 | 0.8% | |
| chaperone (PC00072) | 24 | 0.6% | |
| transfer/carrier protein (PC00219) | 19 | 0.5% | |
| transmembrane receptor regulatory/adaptor protein (PC00226) | 11 | 0.3% |
Figure 5The interconnected genes with specific miRNAs using miRnet [22] involved in focal adhesion, ECM–receptor interaction or gap junction.
Figure 6Gene expression signature based on HPV16 status with/without correlation with smoking in HNSCC patients. (A) Heat maps representing the expression level in the HNSCC patient group based on smoking and HPV status. For nonsmoking and HPV16-negative patients (Smoking− HPV−, n = 32), we had Smoking+ HPV− (n = 11), Smoking+ HPV+ (n = 11), Smoking− HPV+ (n = 11), in dark blue being presented the downregulated genes and in red those overexpressed genes, generated using Gene Spring version 13.0. (B) Venn diagram showing the differential signature in the case of the overexpressed genes highlighting the main altered pathways as displayed by KEGG classification. (C) Venn diagram to emphasize that the differential signature in the case of the overexpressed genes underlines the main altered pathways as obtained from String Network and KEGG (Kyoto Encyclopedia of Genes and Genomes) classification, with red dots showing the genes involved in cytokine–cytokine receptor interaction and blue dots the cell adhesion molecules.
Gene ontology (GO) classification based on the gene expression signature in HPV16-positive versus HPV16-negative patients using the PantherDB online tool [21].
| Ontology Function | Type | No. Molecules | Percent (%) |
|---|---|---|---|
| Molecular function | binding (GO:0005488) | 6900 | 41.1% |
| catalytic activity (GO:0003824) | 5156 | 30.7% | |
| transporter activity (GO:0005215) | 1474 | 8.8% | |
| receptor activity (GO:0004872) | 1450 | 8.6% | |
| signal transducer activity (GO:0004871) | 895 | 5.3% | |
| structural molecule activity (GO:0005198) | 826 | 4.9% | |
| translation regulator activity (GO:0045182) | 53 | 0.3% | |
| antioxidant activity (GO:0016209) | 23 | 0.1% | |
| binding (GO:0005488) | 6900 | 41.1% | |
| Biological process | cellular process (GO:0009987) | 10968 | 28.1% |
| metabolic process (GO:0008152) | 6589 | 16.9% | |
| biological regulation (GO:0065007) | 4065 | 10.4% | |
| response to stimulus (GO:0050896) | 3467 | 8.9% | |
| developmental process (GO:0032502) | 3319 | 8.5% | |
| multicellular organismal process (GO:0032501) | 2954 | 7.6% | |
| cellular component organization or biogenesis (GO:0071840) | 2626 | 6.7% | |
| localization (GO:0051179) | 2125 | 5.4% | |
| immune system process (GO:0002376) | 975 | 2.5% | |
| biological adhesion (GO:0022610) | 898 | 2.3% | |
| locomotion (GO:0040011) | 668 | 1.7% | |
| reproduction (GO:0000003) | 289 | 0.7% | |
| rhythmic process (GO:0048511) | 28 | 0.1% | |
| growth (GO:0040007) | 27 | 0.1% | |
| Protein class | hydrolase (PC00121) | 2235 | 13.2% |
| nucleic acid binding (PC00171) | 1676 | 9.9% | |
| signaling molecule (PC00207) | 1598 | 9.5% | |
| transcription factor (PC00218) | 1551 | 9.2% | |
| enzyme modulator (PC00095) | 1462 | 8.7% | |
| receptor (PC00197) | 1324 | 7.8% | |
| cytoskeletal protein (PC00085) | 987 | 5.9% | |
| transferase (PC00220) | 987 | 5.9% | |
| transporter (PC00227) | 977 | 5.8% | |
| oxidoreductase (PC00176) | 667 | 4.0% | |
| extracellular matrix protein (PC00102) | 632 | 3.7% | |
| cell adhesion molecule (PC00069) | 560 | 3.3% | |
| calcium-binding protein (PC00060) | 362 | 2.1% | |
| membrane traffic protein (PC00150) | 331 | 2.0% | |
| cell junction protein (PC00070) | 307 | 1.8% | |
| defense/immunity protein (PC00090) | 278 | 1.6% | |
| ligase (PC00142) | 185 | 1.1% | |
| structural protein (PC00211) | 173 | 1.0% | |
| chaperone (PC00072) | 157 | 0.9% | |
| transmembrane receptor regulatory/adaptor protein (PC00226) | 147 | 0.9% | |
| lyase (PC00144) | 134 | 0.8% | |
| isomerase (PC00135) | 66 | 0.4% | |
| transfer/carrier protein (PC00219) | 63 | 0.4% |