Literature DB >> 31188922

Mutational landscape of head and neck squamous cell carcinomas in a South Asian population.

Kulsoom Ghias1, Sadiq S Rehmani2, Safina A Razzak3, Sarosh Madhani4, M Kamran Azim5, Rashida Ahmed3, Mumtaz J Khan6.   

Abstract

Head and neck squamous cell carcinoma (HNSCC) is the sixth most common cancer type globally and contributes significantly to burden of disease in South Asia. In Pakistan, HNSCC is among the most commonly diagnosed cancer in males and females. The increasing regional burden of HNSCC along with a unique set of risk factors merited a deeper investigation of the disease at the genomic level. Whole exome sequencing of HNSCC samples and matched normal genomic DNA analysis (n=7) was performed. Significant somatic single nucleotide variants (SNVs) were identified and pathway analysis performed to determine frequently affected signaling pathways. We identified significant, novel recurrent mutations in ASNS (asparagine synthetase) that may affect substrate binding, and variants in driver genes including TP53, PIK3CA, FGFR2, ARID2, MLL3, MYC and ALK. Using the IntOGen platform, we identified MAP kinase, cell cycle, actin cytoskeleton regulation, PI3K-Akt signaling and other pathways in cancer as affected in the samples. This data is the first of its kind from the Pakistani population. The results of this study can guide a better mechanistic understanding of HNSCC in the population, ultimately contributing new, rational therapeutic targets for the treatment of the disease.

Entities:  

Year:  2019        PMID: 31188922      PMCID: PMC6905448          DOI: 10.1590/1678-4685-GMB-2018-0005

Source DB:  PubMed          Journal:  Genet Mol Biol        ISSN: 1415-4757            Impact factor:   1.771


Introduction

Head and neck squamous cell carcinomas (HNSCC), which include tumours of the oral cavity, oropharynx, hypopharynx and larynx, are the sixth most common cancer worldwide with a global incidence of ~600,000 cases (Jemal , 2011; Hayat ; Murar and Forastiere, 2008; Ferlay ). In Pakistan, a developing country in South Asia, HNSCC is among the most commonly diagnosed cancers in both males and females (Bhurgri ; Masood ). The major risk factors for HNSCC include tobacco use, alcohol consumption, and human papilloma virus (HPV) infection (Leemans ). HPV-negative disease accounts for ~80% of the HNSCC cases (Leemans ). Unlike developed countries, the incidence of HPV-negative disease has steadily increased in developing countries (Leemans ). The increased incidence in both males and females in Pakistan can be attributed to the prevalence of traditional risk factor such as smoking. The use of smokeless tobacco, betel nut, gutka (a preparation of crushed areca nut, tobacco, slaked lime and other flavorings) and betel quid or paan (a preparation of betel leaf, areca nut and occasionally tobacco) along with its related products are additional risk factors in this part of the world (Gupta and Johnson, 2014; Khan ; Li ). HNSCC is associated with considerable disease-related mortality and treatment-related morbidity (Forastiere ) and is a major public health concern for Pakistan (Bhurgri ; Bhurgri 2004, 2005; Warnakulasuriya 2009; Bray ) and worldwide. Despite the advances in all the major treatments for HNSCC including surgery, radiotherapy and chemotherapy, the mortality rate is ~50% (Laramore ; Leemans ). The existing literature focuses primarily on HNSCC in North American and European populations. There is a dearth of information specific for the South Asian population. The unique set of population-specific risk factors, germline variability and molecular heterogeneity of HNSCC demands a thorough molecular profiling of these tumours in this population in order to understand tumour progression, and identify actionable targets for therapy, leading to improved patient care. The aim of the study described here was to identify the global genetic aberrations underlying HPV-negative HNSCC in the South Asian (Pakistani) population.

Materials and Methods

Ethics approval and consent to participate

The Aga Khan University Ethics Review Committee approved the procedures used in collecting and processing of participant material and information (reference #: 1003-Sur/ERC-08). Written informed consent to participate was obtained from all subjects.

Sample collection

Fresh tumour tissue and matched blood were obtained from treatment-naïve patients undergoing surgical resection of HNSCC primary tumour at the Aga Khan University Hospital in Karachi, Pakistan. Patients with confirmed histological diagnosis of HNSCC were included in this study. At the time of resection, fresh tumour tissue away from the necrotic core measuring at least 0.5 cm2 was collected and stored in RNAlater® solution (Thermo Fisher Scientific) at -80 °C till further processing. Formalin-fixed tumour tissue samples were assessed by a histopathologist for tumour content and cellularity based on hematoxylin and eosin (H&E) staining. Seven tumour samples negative for HPV with at least 70% cancer cells and 1 μg (50 ng/μl) of extracted DNA (both tumour as well as genomic DNA) were utilized for whole exome sequencing.

DNA extraction

Genomic and tumour DNA was extracted in-house using TRIzol® Reagent (Invitrogen, USA) according to manufacturer’s instructions. Tumour DNA was extracted from at least 50 mg of tissue and genomic DNA was extracted from 3-5 mL of peripheral blood samples obtained before patients underwent surgical procedure. DNA yield and quality was assessed both in-house using a NanoDrop 2000 spectrophotometer (Thermo Scientific, USA) and by Macrogen Inc. (Seoul, South Korea) using PicoGreen® Assay (Invitrogen, USA).

Tumour HPV status

Formalin-fixed paraffin embedded (FFPE) tumour blocks were retrieved and DNA was extracted for assessing HPV status. PCR detection was performed using two sets of general HPV primers (GP5/GP6) (Baay ; Khan ). Additionally, HPV in situ hybridization (ISH) was performed on FFPE blocks using GenPoint assay according to the manufacturer’s instructions (Dako, Denmark). Dako assay can detect HPV-DNA from 13 high-risk genotypes.

Whole Exome Sequencing (WES)

WES was performed by Macrogen Inc. (Seoul, South Korea). 1-2 μg of tumour and genomic DNA was fragmented by nebulization. DNA libraries were prepared from each sample using TruSeq DNA Sample Prep Kit using the manufacturer’s protocol (Illumina, USA). Unique molecular indices were used for each sample. Exome enrichment was performed using the TruSeq Exome Enrichment kit (Illumina, USA). Paired-end sequencing was performed on Illumina HiSeq 2000 instrument. Each read was of 100 bp size.

Availability of data and materials

The data sets supporting the results of this article are included within the article and its supplementary files. The raw sequencing data of those patients that consented to deposition of data in a public database (4 out of 7 total) have been deposited in NCBI’s Sequence Read Archive and are accessible through accession number SRP083063.

Data analysis

Paired-end sequence reads from Illumina were mapped against UCSC Human Genome (hg) 19 using BWA (Li and Durbin 2009). Local realignment was performed using Genome Analysis Tool Kit (GATK) to improve mapping quality (McKenna ). Single nucleotide variants (SNVs) were identified in both somatic and germline DNA using MuTect (high-confidence mode) with default settings. Somatic variants were defined as those SNVs which were only identified in the somatic DNA and not seen in germline DNA. Variants marked REJECT were excluded from downstream analysis. Tumour mutational burden was calculated as previously described by others (Chalmers ). All mutations were annotated and prioritized using Variant Effect Predictor (VEP) and ANNOVAR. Further characterization of SNVs into missense, nonsense, frameshift, stop loss and stop gain variants was done using wANNOVAR, SNPEFF, SIFT and Polyphen. All somatic missense mutations were analysed for their likely tumourigenic impact based on CHASM (Cancer-specific High-throughput Annotation of Somatic Mutations) (Carter ; Wong ) and the IntOGen-mutations platform (Gonzalez-Perez ). A cut-off score threshold of ≤ 0.2 for FDR with a p-value of ≤ 0.05 was applied. The annotation ranked the SNVs for somatic driver mutations for specific cancer tissue types, predicted protein functional impact, allele frequencies from the 1000 Genomes Project and ESP6500 populations, and previous cancer association of the gene harbouring the variants. CHASM training set is composed of a positive class of driver mutations from the COSMIC database and VEST training set comprising a positive class of disease mutations from the Human Gene Mutation Database 66 and a negative class of variants detected in the ESP6500 population and 1000 Genomes Project cohort with an allele frequency of >1%. SNPeff (Cingolani ) and CHASM were used to identify stop-gain, start-loss and splice site variants in nonsynonymous coding region. Those SNVs identified by both tools were selected as significant. Mutations in non-coding regions were annotated using CADD and a cut-off threshold score of ≥15 with p<10–5 applied to predict benign and deleterious variants (Kircher ). Pathway analysis was carried out using the IntOGen-Mutations platform (Gonzalez-Perez ) and significantly (p≤0.05) affected pathways in the cohort and genes within identified.

ASNS protein modeling

The homology model of human asparagine synthetase was constructed using crystal structural coordinates of the enzyme from Escherichia coli (PDB id 1CT9). The Modeller program (Fiser and Sali, 2003) was used to build the asparagine synthetase model.

Results

Clinical characteristics and HPV-status of HNSCC patients

Primary tumour samples from 7 treatment-naïve HNSCC patients (Figure S1), along with their matched genomic DNA, were used for this study. The detailed demographics and clinical characteristics of these patients are provided in Table 1. The samples were taken from five male and two female patients, who had an average age at diagnosis of 54 years (SD = 13.24). Two patients reported a family history of cancer; one patient had a personal history of smoking (110 pack years), two of oral tobacco use, one of alcohol and oral tobacco use, and four reported use of betel nut/quid. All samples were negative for human papilloma virus (Figure 1).
Table 1

Clinical characteristics of HNSCC patients. Data that is unavailable is indicated with a dash (-).

Sample IDGenderAge at diagnosisFamily history of cancer (type)Smoking historyOral tobacco useBetel nut/quid useAlcohol useTNMStageTumour site
NM-02M67Yes (brain)YesNo-NopT4N0M0IVLeft buccal mucosa
(110 pack years)
NM-08M35NoNoYesYesNopT3N0M0IIIRight buccal mucosa
NM-11M57NoNoNoNoNopT1N0M0IRight tongue
NM-13F40NoNoYesYesNopT1N1M0IIILeft tongue
M-11M71Yes (-)NoYesYesYespT4N2bM0IVRight pyriform fossa
M-12F49NoNoNoYesNoT2N2bM0IVLower mandible alveolus
M-14M56NoNoNoNoNopT3N1M0IIIRight tongue
Figure 1

Human papilloma virus (HPV) detection. PCR (left) for HPV detection using GP5/GP6 primers (expected product ~150bp). HPV in situ hybridization (right) using GenPoint in a representative HPV-negative HNSCC sample at a magnification of 40 x 10X; inset at magnification of 4 x 10X shows control HPV-positive nuclei stained brown.

Summary of exome capture and sequencing results

Paired-end whole exome sequencing (WES) of all seven HNSCC samples and matched genomic DNA was performed on Illumina HiSeq 2000 platform. Each read was of 100 bp size. Additional details of the sequencing, including coverage and depth, are summarized in Table S1. Whole exome sequencing revealed a total of 3,959 single nucleotide variants across all 7 HNSCC samples, of which 2,547 are novel (Figure 2; Table 2, left panel). Nonsynonymous mutation rates ranged from 2.11 to 5.02 mutations per megabase (mean = 3.07) (Table 2, right panel). Several mutations recurred in more than one sample in both coding (Figure 3; Table 3) and non-coding regions (Table S2). Nonsense and splice site variants were also identified in all samples (Table S3).
Figure 2

Mutational landscape of HNSCC tumours. Left panel: Number of mutations (known and novel) in HNSCC patients Middle panel: significant somatic nucleotide variants (synonymous, nonsynonymous missense) Right panel: Rate of synonymous, nonsynonymous and other (3’ UTR, 3’ flank, 5’ UTR, 5’ flank, intron, splice site) mutations expressed in mutations per megabase of covered target sequence.

Table 2

Number of somatic single nucleotide variants (SNVs) in HNSCC patients; total and (novel).

Sample IDNM-02NM-08NM-11NM-13M-11M-12M-14
Nonsynonymous
Missense151 (106)91 (55)145 (111)122 (90)221 (61)101 (67)95 (65)
Nonsense14 (11)4 (4)7 (6)1 (1)5 (4)4 (4)7 (5)
Synonymous85 (49)35 (14)91 (61)69 (35)196 (30)93 (45)52 (20)
3’ UTR199 (179)131 (117)176 (155)156 (145)477 (138)206 (193)200 (185)
3’ Flank34 (32)23 (20)52 (49)24 (23)78 (25)35 (31)40 (37)
5’ UTR22 (21)14 (10)15 (10)14 (10)32 (11)17 (15)17 (15)
5’ Flank8 (4)6 (5)24 (18)9 (8)23 (8)9 (6)5 (5)
Intron33 (32)23 (17)74 (56)35 (33)79 (20)29 (26)34 (30)
Splice site4 (3)2 (1)2 (2)2 (2)1 (1)3 (3)3 (2)
Total (novel) 550 (437) 329 (243) 586 (468) 432 (347) 1112 (298) 497 (390) 453 (364)
Figure 3

Somatic coding single nucleotide variants (SNV) found in ≥ 2 HNSCC patients and dbSNP database. The variant allele frequency (VAF) on the x-axis indicates the proportion of reads with the variant allele within individual samples.

Table 3

Somatic coding single nucleotide variants (SNVs) found in ≥2 HNSCC patients. NS: nonsynonymous; S: synonymous. The variant allele frequency (VAF) indicates the proportion of reads with the variant allele within individual samples.

GeneChrPositionBase changeAmino acid changeVariant typeVariant Allele Frequency (VAF)FreqCOSMIC IDrsID
NM02NM08NM11NM13M11M12M14
CLMN chr1495669509A>GL726PNS0.1280.118-0.145-0.0790.1115/7COSM1293528-
CHEK2 chr2229091840T>CK152ENS0.158---0.1-0.2113/7COSM42871rs142470496
SFTPA1 chr1081373600G>AG160SNS0.179---0.195--2/7-rs368889920
DRD5 chr49784542A>CT297PNS-0.429----0.52/7COSM1431796rs2227851
NT5C3A chr733054388T>CD283GNS0.3-----0.3332/7COSM222478rs79747830
KRTAP13-3 chr2131797833C>TR133KNS--0.167---0.0892/7--
PAK2 chr3196509577C>GQ101HNS-0.8-0.086---2/7COSM1422035rs201465227
ST6GALNAC4 chr9130674582G>A-S0.294--0.10.2220.2860.4365/7-rs148599736
PPA1 chr1071969413A>G-S0.1180.143-0.1090.108-0.165/7--
KRT83 chr1252709871G>A-S0.173--0.1360.1470.1760.3755/7--
MYLK chr3123419183G>A-S0.1040.17-0.122-0.0670.0645/7-rs58176285
OR8I2 chr1155861593G>A-S0.3930.221-0.239-0.2180.2615/7--
HIST1H2BL chr627775319G>A-S0.078--0.0770.086-0.1324/7-rs141178835
ST6GALNAC4 chr9130674558C>T-S-0.3-0.1820.2790.4884/7--
PPA1 chr1071969401T>C-S0.1110.143-0.1060.125--4/7-rs150430650
KRT83 chr1252709724A>G-S0.1010.097--0.0920.093-4/7--
KRT83 chr1252709895G>A-S0.24---0.1920.2450.4294/7--
OR1M1 chr199204157T>C-S0.129--0.0710.1760.231-4/7--
OR1M1 chr199204184G>A-S0.325--0.1130.0890.296-4/7--
HHIPL2 chr1222715425A>G-S0.244--0.17-0.170.1114/7--
MYLK chr3123419189C>T-S0.1320.163-0.111--0.0654/7--
ASNS chr797498451C>G-S0.16---0.2190.096-3/7-rs76996735
IFITM1 chr11315009C>T-S0.32---0.3160.093-3/7-rs3197137
KRT83 chr1252709883T>C-S---0.1610.167-0.3753/7-rs2248473
OR7D4 chr199324989C>T-S0.088---0.0890.057-3/7-rs111293642
OR7D4 chr199324995C>T-S0.068---0.0720.05-3/7-rs201732443
CHEK2 chr2229091841G>A-S0.167---0.1-0.2223/7-rs146546850
OR10G7 chr11123908827T>C-S0.17--0.074-0.069-3/7--
KRT86 chr1252699041G>A-S---0.141-0.1270.1143/7--
KRT86 chr1252699545G>A-S---0.103-0.1270.193/7-rs374471358
KRT83 chr1252710279T>C-S0.118----0.1370.2673/7-rs202206430
KRTAP4-8 chr1739254154C>T-S0.167--0.173-0.102-3/7--
DHX40 chr1757663568A>G-S--0.1040.065--0.0623/7-rs2697395
LCE1E chr1152759892A>T-S-0.25--0.263--2/7--
FOLH1 chr1149204790A>G-S0.292---0.139--2/7-rs76509850
KRT81 chr1252681089G>A-S---0.1670.214--2/7--
KRT81 chr1252681092C>T-S---0.1670.214--2/7--
KRT83 chr1252710790T>C-S0.216----0.25-2/7-rs143202217
KRT83 chr1252714757T>C-S---0.082-0.227-2/7--
KRT83 chr1252710798T>C-S0.235----0.268-2/7--
KRT83 chr1252713122C>T-S---0.179-0.113-2/7--
SEH1L chr1812955467T>C-S----0.0860.128-2/7--
KRTAP10-7 chr2146020536T>C-S0.143---0.118--2/7-rs512211
KRTAP10-7 chr2146020542T>C-S0.13---0.118--2/7-rs512214
HIST2H2AC chr1149858563C>T-S-----0.0960.0932/7--
HIST2H2AC chr1149858593C>T-S-----0.1250.0712/7--
KIAA1549 chr7138601891A>T-S0.063----0.098-2/7--
GIMAP5 chr7150439323A>G-S-----0.1150.2782/7--
KRTAP5-11 chr1171293458G>A-S---0.093-0.095-2/7--
OR10G8 chr11123901199G>A-S---0.118-0.074-2/7--
OR10G8 chr11123901211G>A-S---0.136-0.12-2/7--
DUOX1 chr1545433188T>C-S0.071----0.081-2/7--
KRTAP4-11 chr1739274373G>A-S-----0.0490.1112/7--
KRTAP4-12 chr1739280045G>A-S---0.14-0.145-2/7--
KRTAP10-4 chr2145994676A>G-S0.103----0.065-2/7--
HIST2H2AB chr1149859383C>T-S---0.118--0.0712/7--
DRD5 chr49784550G>A-S0.375-----0.52/7-rs2227844
KRTAP5-5 chr111651760C>T-S0.11-----0.0772/7-rs183750160
DPY19L2 chr1263964600G>A-S---0.103--0.1052/7--
SETD8 chr12123875311C>T-S0.385--0.171---2/7-rs74356260
POTEE chr2132021452C>A-S-0.176-0.094---2/7--
CBWD1 chr9163985A>G-S-0.0650.062----2/7--
ZNF814 chr1958385762C>G-S--0.2630.353---2/7-rs199732634

Mutational landscape in HNSCC patients

On average, 227 coding mutations were identified per tumour, 39% of which are synonymous. The majority of the mutations identified were nonsynonymous missense mutations and mutations in the 3’ UTR region (Table 2). Filtering for driver and other significant variants using CHASM revealed alterations in genes that have been implicated in HNSCC or other cancers (Figure 2, middle panel; Table 4). Driver missense mutations in FGFR2 (Fibroblast Growth Factor Receptor 2), SETBP1 (SET Binding Protein 1), PIK3CA (Phosphatidylinositol-4,5-Bisphosphate 3-Kinase Catalytic Subunit Alpha), IGF2BP3 (Insulin Like Growth Factor 2 MRNA Binding Protein 3), TP53 (Tumour Protein P53), PTPN11 (Protein Tyrosine Phosphatase, Non-Receptor Type 11) and NF2 (Neurofibromin 2) were identified. Significant missense mutations were also identified in ASNS (Asparagine Synthetase (Glutamine-Hydrolyzing)) in four of the seven samples. Other genes that exhibited recurrent mutations included the CLMN (Calmin) gene (5/7), CHEK2 (Checkpoint Kinase 2) (3/7), and DRD5 (Dopamine Receptor D5) and PAK2 (P21 (RAC1) Activated Kinase 2) (2/7) (Table 3). These recurrent mutation sites have not been reported as hotspots in previous HNSCC sequencing studies.
Table 4

Somatic single nucleotide variants (SNVs) in HNSCC patients in coding regions. NS MS: nonsynonymous missense; S: synonymous. Driver missense variants are in bold text and synonymous variants in possible driver genes are marked with an asterisk (*).The variant allele frequency (VAF) indicates the proportion of reads with the variant allele within individual samples. The minor allele frequency (MAF) signifies prevalence of the known variants in the global population as per the ExAc dataset.

Sample IDGeneChrPositionVariant typeBase changeAmino acid changeVariant allele frequencyMinor allele frequencyrsIDCOSMIC ID
NM-02 FGFR2 chr10123256128NS MSG>TP595H0.273---
SETBP1 chr1842530740NS MSG>TG479C0.13---
ASNS chr797498395NS MSG>AA25V0.225---
chr797498404NS MSA>GM22T0.243---
KIF21B chr1200954042NS MSG-TR1250S0.143---
UBA7 chr349848502NS MSG>TP382H0.132---
KDM3B chr5137735569NS MSG>TA1023S0.158---
EXOSC1 chr1099196233NS MSG>TA186D0.3---
DENND5A chr119166573NS MSC>AV1031F0.273---
DGKZ chr1146394214NS MSG>TG541V0.5--
FOLH1 chr1149186320NS MSG>CN459K0.2270.00003rs201724751-
chr1149204779NS MSC>TR281H0.30.0351rs116795343-
FAT3 chr1192087697NS MSG>TG807C0.211---
SLC7A7 chr1423282391NS MSG>TL73M1---
CEP128 chr1481244269NS MSA>TL778Q0.174---
EML2 chr1946130008NS MSC>AW433C0.3---
CHRNA4 chr2061981122NS MSC>AK547N0.375---
GART chr2134889834NS MSC>TR595Q0.20.0003rs202015633-
CHEK2 chr2229091840NS MST>CK416E0.1580.0259rs74751600-
ARID2* chr1246245344SG>TS1146S0.333---
NM-08 ASNS chr797498378NS MSC>TA31T0.167---
ZFHX4 chr877618158NS MSG>TG612V0.231--COSM73358
CKAP5 chr1146819413NS MSC>GC427S0.12---
SF1 chr1164537028NS MSC>AR303L0.097---
HECTD4 chr12112669460NS MSC>GK1885N0.158---
FBN1 chr1548744840NS MSC>TA1822T0.2730.00003rs777539060-
HELZ chr1765144830NS MSG>TL826I0.2---
NM-11 RALGPS2 chr1178855145NS MSC>TT361M0.125---
DYNC1I2 chr2172584439NS MSC>AP369T0.125---
NFE2L2 chr2178098966NS MSC>AD27Y0.188---
POSTN chr1338154051NS MSG>TP536Q0.111---
INO80 chr1541377611NS MSG>AR277C0.158---
CDH16 chr1666949240NS MSG>TP156T0.364---
PRPSAP2 chr1718785908NS MST>CL147S0.098---
NM-13 ASNS chr797498378NS MSC>TA31T0.125---
ASNS chr797498395NS MSG>AA25V0.105---
GIGYF2 chr2233684687NS MSC>TR862C0.1380.00002rs561616045-
CBLB chr3105464767NS MSG>TP280H0.097---
LAP3 chr417598708NS MSC>AA343D0.214---
TRIM7 chr5180625732NS MSG>AL316F0.156---
ABCB8 chr7150733032NS MSG>AA331T0.2270.00004rs777741819-
ESRP1 chr895674755NS MSG>CV206L0.079---
ADCY6 chr1249176793NS MSC>AR142L0.375---
NFATC4 chr1424843541NS MSC>TS581L0.25--COSM3793625
EML5 chr1489124732NS MSC>AG1226W0.143---
ANKFY1 chr174086708NS MSG>TA688E0.176---
UQCRFS1 chr1929698630NS MSC>AC217F0.25---
ITSN1 chr2135237479NS MSG>TM1305I0.667---
ABCB7 chrX74332770NS MSC>GC96S0.111---
HDX chrX83730396NS MSG>CR4G0.231---
M-11 PIK3CA chr3178936091NS MSG>AE545K0.2780.000008rs104886003COSM763
IGF2BP3 chr723353160NS MSA>GI503T0.2110.0040rs79900450-
TP53 chr177577106NS MSG>CP278A0.647--COSM10814
SLC8A1 chr240656504NS MSC>TG306D0.234---
MITF chr370008494NS MSC>AQ362K0.333---
VEPH1 chr3157034861NS MSA>GL622P0.139---
chr3157099046NS MSC>GL342F0.208---
GNGT1 chr793536114NS MST>CV19A0.133---
ARHGEF10 chr81824752NS MSA>GD232G0.214---
NEBL chr1021098782NS MST>AD855V0.339---
NAALAD2 chr1189891404NS MSA>CL296F0.375---
SMG8 chr1757290439NS MSA>TH752L0.25--
RBM39 chr2034302295NS MSC>AC303F0.15---
M-12 ASNS chr797498378NS MSC>TA31T0.214---
GRK7 chr3141499490NS MSA>CY296S0.273---
TIPARP chr3156413805NS MSC>AP413Q0.121---
RGS3 chr9116346401NS MSC>AS903R1---
SPTBN2 chr1166472616NS MSC>AG711C1---
UBE4A chr11118253450NS MSC>AA726E0.15---
TP53 chr177577538NS MSC>TR248Q0.2860.00006rs11540652COSM10662
CDC27 chr1745229257NS MST>CT335A0.1670.00002rs199890121-
HELZ chr1765163619NS MSC>AC575F0.667---
ALK* chr229474099SC>AG692G1---
M-14 PTPN11 chr12112892407NS MST>GS189A0.1670.0027rs79068130-
NF2 chr2230090766NS MSG>TR588L1---
MLL3* chr7151962176ST>AP377P0.0840.4554rs62478356COSM4162022
MYC* chr8128750817SC>AT118T0.5---
Synonymous variants in previously identified driver genes ARID2 (AT-Rich Interaction Domain 2), ALK (Anaplastic Lymphoma Receptor Tyrosine Kinase), MLL3 [Myeloid/Lymphoid Or Mixed-Lineage Leukemia 3, also known as KMT2C (Lysine Methyltransferase 2C)] and MYC (V-Myc Avian Myelocytomatosis Viral Oncogene Homolog), were also identified (Figure 2, middle panel; Table 4). The ASNS gene was found to have a synonymous mutation in three samples, and recurrent synonymous mutations were also observed in CHEK2 and DRD5 genes (Table 3). Splice site variants in FCGR2A (Fc Fragment of IgG, Low Affinity IIa, Receptor (CD32)) and two genes involved in eukaryotic translation initiation [EIF4B (Eukaryotic Translation Initiation Factor 4B) and EIF4A3 (Eukaryotic Translation Initiation Factor 4A3)] were seen in two of the seven samples (Table S3). Significant non-coding mutations were filtered using CADD (Table S4). In the 3’UTR region, mutations in IGF1R (Insulin Like Growth Factor 1 Receptor) and ERBB4 (Erb-B2 Receptor Tyrosine Kinase 4) were identified as significant. Another eukaryotic translation initiation factor, EIF2B4 (Eukaryotic Translation Initiation Factor 2B Subunit Delta), exhibited significant splice site variance. IntOGen pathway analysis revealed that the MAP kinase pathway was the most significantly affected pathway in all samples tested. In addition, cell cycle, actin cytoskeleton regulation, PI3K-Akt signaling and other pathways in cancer were among those significantly enriched for exomic alterations in all samples (Table 5). Genes with driver mutations implicated in multiple pathways included FGFR2, PIK3CA, and TP53. Significant mutations in the pathway genes were all deleterious with respect to protein function as predicted by SIFT and PolyPhen.
Table 5

Significantly involved pathways (p ≤ 0.05) identified by IntOGen-Mutations platform. Driver mutations in each pathway are in bold text and marked with an asterisk (*).

Pathway IDKEGG annotationTotal genes in pathwayNumber of genes affectedPathway genes with significant/ driver (*) mutations
hsa04010MAPK signaling pathway25746 FGFR2*
PIK3CA*
TP53*
hsa04110Cell cycle12435 TP53*
CHEK2
CDC27
hsa05166HTLV-1 infection26060 PIK3CA*
TP53*
CHEK2
CDC27
ADCY6
NFATC4
hsa05200Pathways in cancer32671 FGFR2*
PIK3CA*
TP53*
CBLB
ADCY6
ADCY6
GNGT1
MITF
hsa04810Regulation of actin cytoskeleton21346 FGFR2*
PIK3CA*
hsa04151PI3K-Akt signaling pathway33880 FGFR2
PIK3CA*
TP53*
GNGT1

Asparagine synthetase protein modeling

The ASNS gene codes for asparagine synthetase, which catalyzes the formation of asparagine from glutamine, aspartate and ATP. Protein modeling of the effect of the three novel, recurrent mutations in ASNS identified in this cohort revealed that the mutated amino acids (p.A13T, p.A25V and p.M22T) are located in the vicinity (within 10 Å distance) of the glutamine binding pocket (Figure 4).
Figure 4

Homology model of the N-terminal domain of human asparagine synthetase (ASNS) complexed with glutamine (Gln). Amino acid changes due to nonsynonymous mutations in ASNS are indicated.

Discussion

This is the first study reported in the literature to describe the mutational landscape of Pakistani HNSCC patients. We performed exome sequencing of a small set of HPV-negative HNSCC patients from Pakistan. We identified a total of ~4000 somatic variants (novel and known). Previous studies have reported greater number of mutations in HPV-negative as compared to HPV-positive HNSCC tumours (Riaz ; Beck and Golemis, 2016). As a comparison, Stransky on average found 130 coding mutations per tumour (25% synonymous), while in the current cohort an average of 227 coding mutations per tumour (39% synonymous) were identified. Several variants were found in more than one sample and in genes that have been previously identified to play a role in HNSCC carcinogenesis. Next generation sequencing studies in other populations have identified mutations in the tumour suppressor gene TP53, which is associated with smoking-related disease, and the oncogene PIK3CA, at a mutation rate of 40-60% and 6-8%, respectively (Agrawal ; Stransky ; Loyo ). The TCGA study, with the largest cohort to date, reported a TP53 mutation rate of 72% and PIK3CA mutation rate of 18-21% (TCGA 2015). Mutations in TP53 gene were detected in two of the seven cases in the current study, and in PIK3CA in one patient. In a comparative genomic analysis of HPV-positive and HPV-negative tumours, the former showed mutations in FGFR2 and MLL3, among others. The mutational spectrum in HPV-negative tumours closely resembled lung and esophageal squamous cell carcinomas, with mutations identified in genes including TP53, MLL2/3, NOTCH1, PIK3CA and DDR2 (Seiwert ). The HPV-negative cohort in the current study exhibited a nonsense variant (p.Y223X) in DDR2 in a single sample. A different nonsense mutation (p.R709X) and missense mutations (p.I474M; p.I724M) have been previously identified exclusively in HNSCC recurrences (Hedberg ). DDR2 and FGFR2, which was identified as having a potential missense driver mutation in one sample in the current study, are both genes that code for receptor tyrosine kinases and are potentially targetable for therapeutics. In addition, an SNV was identified in MLL3 in a sample that also exhibited an SNV in the driver gene MYC. MLL genes encode histone lysine methyltransferases that are involved in chromatin remodeling. Recurrent mutations in MLL genes have been identified in several other cancers, including lung squamous cell carcinoma, and been associated with poor clinical outcomes (Morin ; Grasso ; Jones ; Kim ; Seiwert ). The oncogene MYC is most often altered in HPV-negative HNSCC tumours (TCGA 2015). Additionally, we discovered recurrent significant missense mutations in ASNS (asparagine synthetase) gene in 4 out of 7 samples. These SNVs in ASNS have not previously been reported in the literature as significant in HNSCC pathogenesis. The ASNS gene codes for a ubiquitously expressed, ATP-dependent enzyme that converts aspartate and glutamine to asparagine and glutamate (Balasubramanian ). The protein folds into two distinct domains, where the N-terminal domain contains two layers of antiparallel beta-sheets. The active site responsible for the binding and hydrolysis of glutamine is situated between these layers and important, evolutionarily conserved side chains involved in glutamine binding within the substrate binding pocket include Arg 49, Asn 74, Glu 76, and Asp 98 (Van Heeke and Schuster, 1989). While the amino acids mutated as a result of the novel and recurrent mutations in ASNS identified in this cohort are not part of the glutamine binding pocket, protein modeling revealed their proximity to the region. Therefore, these mutations may affect glutamine binding during catalysis. Elevated levels of ASNS play a role in drug resistance in acute lymphoblastic leukemias and have been implicated in solid tumour adaptation to nutrient deprivation and hypoxia (Balasubramanian ). ASNS expression has also been shown to be an independent factor affecting survival in hepatocellular carcinoma and low ASNS levels are correlated with poorer surgical outcomes (Zhang ). In HNSCC, deregulation of miR-183-5p and its target gene ASNS has been documented in a radiochemotherapy cell culture model of primary HNSCC cells and is a potential prognostic marker for radiochemotherapy outcome (Summerer ). Two recent reports have further elucidated the role of ASNS in carcinogenesis. One showed that ASNS expression in primary tumours is correlated with metastatic relapse and bioavailability of asparagine regulates metastatic potential and progression in breast cancer cells, potentially by affecting the epithelial-to-mesenchymal transition (Knott ). ASNS was also identified as a key target of the KRAS-ATF4 axis in non-small-cell lung cancer. Oncogenic KRAS regulates amino acid homeostasis and cellular response to nutrient stress via the ATF4 target ASNS, which subsequently contributes to inhibition of apoptosis and increase in proliferation of cancer cells (Gwinn ). While KRAS mutations are uncommon in HNSCC, particularly as compared to HRAS (Rothenberg and Ellisen, 2012), mutations in ASNS could effectively have the same functional consequences. Given the role of ASNS in cellular stress and the unfolded protein response, it is an intriguing target for further study in HNSCC pathogenesis. The current analysis also revealed significant low-frequency driver mutations in SETBP1, IGF2BP3, PTPN11 and NF2. SETBP1 was identified in a patient who also had a driver mutation in FGFR2. SETBP1 encodes a nuclear protein and its overexpression results in inhibition of the tumour-suppressor PP2A serine-threonine phosphatase activity (Cristobal ). Mutations in SETBP1 resulting in overexpression or gain of function have been documented previously in hematological malignancies (Ciccone ). An IGF2BP3 mutation was found in a sample that also had driver mutations in PIK3CA and TP53. The protein product of IGF2BP3 is an RNA-binding factor that promotes cancer invasion by binding to transcripts that encode proteins, such as CD44, for functions related to cell migration, proliferation and adhesion (Ennajdaoui ). IGF2BP3 mutations and copy number variations have been reported previously in HNSCC (Lin ; Clauditz ; Jimenez ), and its role in cell invasiveness and metastasis in several other cancers has been documented in the literature (Schaeffer ; Lin ; Taniuchi ; Hsu ; Shantha Kumara ; Belharazem ; de Lint ; Ennajdaoui ). Mutations in PTPN11 and NF2 genes were found in the same sample. The protein encoded by the proto-oncogene PTPN11 is a cytoplasmic tyrosine phosphatase, which is widely expressed in most tissues and known to play a regulatory role in normal hematopoiesis, and in mitogenic activation, metabolic control, transcription regulation, and cell migration signaling pathways (Chan and Feng, 2007). Somatic PTPN11 mutations have been detected in juvenile myelomonocytic leukemia, myelodysplastic syndromes and acute myeloid leukemia (Tartaglia ; Chan and Feng, 2007). While PTPN11 mutations have not been reported previously in HNSCC, this gene has been identified as a target of the tumour-suppressive microRNA miR-489. Knockdown of PTPN11 in HNSCC cell lines resulted in the inhibition of cell proliferation (Kikkawa ). Neurofibromatosis type 2 (NF2) is a tumour suppressor gene on chromosome 22q12 that encodes for merlin, a membrane-cytoskeleton scaffolding protein that inhibits key signaling pathways crucial to cell proliferation, such as the PI3K pathway. Somatic NF2 mutations have been reported in a number of different cancers (Schroeder ). In HNSCC, chromosome 22q is a frequent site of allele loss. Merlin and the cytoplasmic tail of CD44, which is regulated at the transcript level by IGF2BP3 gene product as mentioned above, create a molecular switch complex that is responsible for either cell growth or proliferation (Morrison ). In addition to non-synonymous mutations, synonymous mutations are known to frequently act as driver mutations in cancers (Supek ). We identified SNVs in MLL3, ARID2 and ALK. Mutations in MLL and ARID gene families have been previously documented for HNSCC (India Project Team of the International Cancer Genome Consortium, 2013; Martin ). The ALK gene encodes yet another receptor tyrosine kinase, which has been found to be aberrantly expressed in several tumours, including anaplastic large cell lymphomas (Chiarle ; Salaverria ), neuroblastoma (Lasorsa ; Theruvath ; Ueda ) and non-small cell lung cancer (Soda ; Quere ). A study of gingivo-buccal oral squamous cell carcinoma (OSCC-GB), an HNSCC clinical sub-type, in the Indian population revealed frequently altered genes that are specific to OSCC-GB and others that are also affected in HNSCC (India Project Team of the International Cancer Genome Consortium, 2013). Altered genes that are common between the OSCC-GB study and the current study in the Pakistani population are ARID2 and TP53. MLL family member MLL4 was also identified as a frequently altered gene specific to OSCC-GB. Other genes identified in the study in the Indian population, such as CASP8, HRAS and NOTCH1, are also altered in HNSCC in other populations (albeit at different frequencies and with varying significance) (Agrawal ; Stransky ; Seiwert ; The Cancer Genome Atylas Network, 2015; Al-Hebshi ), but were not identified in this study. The small sample size is a limitation of this study, which may explain low frequency of commonly mutated genes and why some of the commonly occurring HNSCC mutations such as NOTCH1 and HRAS were not identified in this small cohort. However, given limited resources, it was deemed important to establish preliminary data prior to a larger scale study. The approach of using a smaller discovery cohort followed by validation of identified mutations in a larger cohort has been proposed and taken by others and reported in the literature (Bacchetti ; Nichols ; Romero Arenas ; Hedberg ). It is also possible that given the heterogeneous nature of this disease and unique set of risk factors compared to Western countries, the predominant driver gene mutations may vary among populations. Previous studies in East and South Asian populations with oral squamous cell carcinoma have highlighted that the pattern of genetic mutations is significantly different from tumour profiles in other studies largely conducted in Caucasian populations (Vettore ; Su ). Population-based differences in mutational profile have also been documented for other cancer types. In lung cancers, several studies have highlighted the geographic variations in genes such as EGFR and LKBI between Asian (Chinese, Japanese, Korean) and Caucasian populations (Koivunen ; Mitsudomi, 2014; Li ). This is the first report describing the mutational spectrum of Pakistani HNSCC patients. In addition to reporting known HNSCC mutations, we have identified novel, recurrent mutations in ASNS and other genes in the Pakistani population. It has been well established that a complex interplay of genetic and environmental factors results in varying risk of cancer development and treatment outcomes across different ethnicities and geographic regions (Ma ). Such diversity among different populations can be explained by the type and frequency of variations in both germline and somatic genomes (Wang and Wheeler, 2014). Therefore, this study is an important step towards gaining a better mechanistic understanding of the complex nature of HNSCC. Future studies will be undertaken to confirm and validate the findings from this study in a larger cohort. Additionally, functional analysis of mutations and correlation with clinical outcomes will be performed.
  87 in total

Review 1.  Mechanisms of Invasion in Head and Neck Cancer.

Authors:  Lizandra Jimenez; Sangeeta K Jayakar; Thomas J Ow; Jeffrey E Segall
Journal:  Arch Pathol Lab Med       Date:  2015-06-05       Impact factor: 5.534

2.  The N-terminal cysteine of human asparagine synthetase is essential for glutamine-dependent activity.

Authors:  G Van Heeke; S M Schuster
Journal:  J Biol Chem       Date:  1989-11-25       Impact factor: 5.157

3.  Cancer of the oral cavity - trends in Karachi South (1995-2002).

Authors:  Yasmin Bhurgri
Journal:  Asian Pac J Cancer Prev       Date:  2005 Jan-Mar

4.  Adjuvant chemotherapy for resectable squamous cell carcinomas of the head and neck: report on Intergroup Study 0034.

Authors:  G E Laramore; C B Scott; M al-Sarraf; R E Haselow; T J Ervin; R Wheeler; J R Jacobs; D E Schuller; R A Gahbauer; J G Schwade
Journal:  Int J Radiat Oncol Biol Phys       Date:  1992       Impact factor: 7.038

5.  Karachi Cancer Registry Data--implications for the National Cancer Control Program of Pakistan.

Authors:  Yasmin Bhurgri
Journal:  Asian Pac J Cancer Prev       Date:  2004 Jan-Mar

Review 6.  Lessons learned from next-generation sequencing in head and neck cancer.

Authors:  Myriam Loyo; Ryan J Li; Chetan Bettegowda; Curtis R Pickering; Mitchell J Frederick; Jeffrey N Myers; Nishant Agrawal
Journal:  Head Neck       Date:  2012-08-21       Impact factor: 3.147

Review 7.  NF2/merlin in hereditary neurofibromatosis 2 versus cancer: biologic mechanisms and clinical associations.

Authors:  Rebecca Dunbar Schroeder; Laura S Angelo; Razelle Kurzrock
Journal:  Oncotarget       Date:  2014-01-15

8.  Comprehensive genomic characterization of head and neck squamous cell carcinomas.

Authors: 
Journal:  Nature       Date:  2015-01-29       Impact factor: 49.962

9.  Fast and accurate short read alignment with Burrows-Wheeler transform.

Authors:  Heng Li; Richard Durbin
Journal:  Bioinformatics       Date:  2009-05-18       Impact factor: 6.937

10.  Asparagine synthetase is an independent predictor of surgical survival and a potential therapeutic target in hepatocellular carcinoma.

Authors:  B Zhang; L-W Dong; Y-X Tan; J Zhang; Y-F Pan; C Yang; M-H Li; Z-W Ding; L-J Liu; T-Y Jiang; J-H Yang; H-Y Wang
Journal:  Br J Cancer       Date:  2013-06-13       Impact factor: 7.640

View more
  1 in total

Review 1.  Predictive Biomarkers for Response and Toxicity of Induction Chemotherapy in Head and Neck Cancers.

Authors:  Łukasz Boguszewicz
Journal:  Front Oncol       Date:  2022-07-06       Impact factor: 5.738

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.