Literature DB >> 27749845

Genome-wide association analyses identify new susceptibility loci for oral cavity and pharyngeal cancer.

Corina Lesseur1, Brenda Diergaarde2,3, Andrew F Olshan4,5, Victor Wünsch-Filho6, Andrew R Ness7,8, Geoffrey Liu9, Martin Lacko10, José Eluf-Neto11, Silvia Franceschi1, Pagona Lagiou12, Gary J Macfarlane13, Lorenzo Richiardi14, Stefania Boccia15, Jerry Polesel16, Kristina Kjaerheim17, David Zaridze18, Mattias Johansson1, Ana M Menezes19, Maria Paula Curado20, Max Robinson21, Wolfgang Ahrens22, Cristina Canova23, Ariana Znaor1,24, Xavier Castellsagué25, David I Conway26,27, Ivana Holcátová28, Dana Mates29, Marta Vilensky30, Claire M Healy31, Neonila Szeszenia-Dąbrowska32, Eleonóra Fabiánová33, Jolanta Lissowska34, Jennifer R Grandis35,36, Mark C Weissler37, Eloiza H Tajara38, Fabio D Nunes39, Marcos B de Carvalho40, Steve Thomas8, Rayjean J Hung41, Wilbert H M Peters42, Rolando Herrero1, Gabriella Cadoni43, H Bas Bueno-de-Mesquita44,45,46, Annika Steffen47, Antonio Agudo48, Oxana Shangina18, Xiangjun Xiao49, Valérie Gaborieau1, Amélie Chabrier1, Devasena Anantharaman1, Paolo Boffetta50, Christopher I Amos49, James D McKay1, Paul Brennan1.   

Abstract

We conducted a genome-wide association study of oral cavity and pharyngeal cancer in 6,034 cases and 6,585 controls from Europe, North America and South America. We detected eight significantly associated loci (P < 5 × 10-8), seven of which are new for these cancer sites. Oral and pharyngeal cancers combined were associated with loci at 6p21.32 (rs3828805, HLA-DQB1), 10q26.13 (rs201982221, LHPP) and 11p15.4 (rs1453414, OR52N2-TRIM5). Oral cancer was associated with two new regions, 2p23.3 (rs6547741, GPN1) and 9q34.12 (rs928674, LAMC3), and with known cancer-related loci-9p21.3 (rs8181047, CDKN2B-AS1) and 5p15.33 (rs10462706, CLPTM1L). Oropharyngeal cancer associations were limited to the human leukocyte antigen (HLA) region, and classical HLA allele imputation showed a protective association with the class II haplotype HLA-DRB1*1301-HLA-DQA1*0103-HLA-DQB1*0603 (odds ratio (OR) = 0.59, P = 2.7 × 10-9). Stratified analyses on a subgroup of oropharyngeal cases with information available on human papillomavirus (HPV) status indicated that this association was considerably stronger in HPV-positive (OR = 0.23, P = 1.6 × 10-6) than in HPV-negative (OR = 0.75, P = 0.16) cancers.

Entities:  

Mesh:

Substances:

Year:  2016        PMID: 27749845      PMCID: PMC5131845          DOI: 10.1038/ng.3685

Source DB:  PubMed          Journal:  Nat Genet        ISSN: 1061-4036            Impact factor:   38.330


Cancers of the oral cavity (OC) and oropharynx (OPC) are predominantly caused by tobacco and alcohol use, although oral infection with HPV, particularly HPV16, is an increasingly important cause of OPC1, especially in the US and northern Europe1,2. The proportion of HPV-related OPCs varies widely and is estimated to be approximately 60% in the US, 30% in Europe and lower in South America2–5. Genetic factors have also been implicated in OC and OPC susceptibility, especially polymorphisms within alcohol-related genes including alcohol-dehydrogenase 1B (ADH1B) and ADH76,7. In order to identify additional susceptibility loci, 13,107 individuals from 12 epidemiological studies (Supplementary Table 1) were genotyped using the Illumina OncoArray and after stringent quality-control steps (Supplementary Table 2, Online Methods) 6,034 cases and 6,585 cancer-free controls remained for analyses (Table 1). We next performed genome-wide imputation using the Haplotype Reference Consortium panel8 and obtained approximately 7 million high-quality imputed variants (Supplementary Fig. 1). Given the ethnic diversity of our study, we evaluated associations within continent (Europe, North and South America) using multivariate unconditional logistic regressions under a log-additive genetic model adjusted for age, sex and regional eigenvectors. Results by continent were combined using fixed-effect meta-analyses to derive associations for overall OC and pharynx cancer (oral, oropharynx, hypopharynx and overlapping cancers; n = 6,034), as well as site-specific OC (n = 2,990) and OPC (n = 2,641). Although, several ethnic groups are present in the study, supervised ancestry analyses indicated that >90% of participants were predominantly of European (>70%CEU) ancestry, although some population admixture was observed in South America (Supplementary Table 3).
Table 1

Epidemiological and clinical characteristics of cases and controls.

CasesControls
N%N%
Total 6,0346,585
Tumor site
Oral cavity2,99049.6
Oropharynx2,64143.8
Hypopharynx3055.1
Overlapping731.2
Other250.4
Geographic Region
Europe2,49941.42,92844.5
North America2,54942.22,52238.3
South America98616.31,13517.2
Sex
Male4,52775.024,32565.68
Female1,50724.982,26034.32
Age, years
=<501,31521.81,35520.6
50-<602,00633.21,95429.7
60-<701,74829.01,98330.1
>=7096416.01,29319.6
Unknown10.02
Smoking Status
Never1,05717.52,50838.1
Former1,79229.72,26334.4
Current2,62343.51,46622.3
Unknown5629.33485.3
Drinking Status
Never82013.61,19918.2
Ever4,84080.24,84073.5
Unknown3746.25468.3
GWA meta-analyses of overall and site-specific cancers identified 9 regions at genome-wide significance (P < 5x10–8) (Fig.1). Quantile-quantile (Q-Q) plots of observed and expected P-values showed moderate genomic inflation (λ) for the 3 meta-analyses (λ range = 1.04–1.06, Supplementary Fig. 2–3). Since λ increases with sample size, we scaled it to 1000 cases and controls resulting in ameliorated inflation (λ1000 range = 1.009–1.01)9. Overall OC and pharynx cancer were associated with rs79767424 (5p14.3), rs1229984 (4q23), rs201982221 (10q26.13), rs1453414 (11p15.4) and 123 SNPs at 6p21.32 (Supplementary Table 4). Twenty-six variants were associated (P < 5x10–8) with OC (Supplementary Table 5), 4 of which mapped to 2p23.3, 1 to 4q23, 3 to 9q34.12, 13 to 5p15.33 and 5 to 9p21.3. For OPC, novel significant variants were located at 6p21.32 (62 SNPs, Supplementary Table 6). Suggestive susceptibility variants (P < 5x10–7, Supplementary Tables 7–9) were associated with OC at 4 additional loci: 6p21.33, 6p21.32, 15q21.2, 15q26.2 and, with OPC at 2q36.1. Other genomic locations outside the HLA region showed promising associations (P < 5x10–6) with OPC (Supplementary Table 10). For susceptibility loci at P < 5x10–8, functional annotation of regulatory features with ENCODE and eQTL information, if available, are summarized in Supplementary Table 11 and 12. Given the geographical heterogeneity of our population, we performed sensitivity analyses after excluding individuals with <70% CEU ancestry and these showed similar results (Supplementary Table 13). To validate array genotypes and imputed dosages, we directly genotyped by a different platform (TaqMan) at least one variant within each locus (P = 5x10–7) in a subset of approximately 700 individuals. Concordance between genotyped/imputed genotypes and TaqMan results was >97% for all regions with the exception of rs2398180, an imputed variant which had a concordance of 94% (Supplementary Table 14-15). For 2 rare variants, rs201982221 (10q26.13) and rs7976742 (5p14.3), TaqMan assays could not be designed and we used Sanger sequencing for validation (Supplementary Table 16). We were able to validate the rs201982221 deletion (Online Methods), but rs7976742 did not validate (Online Methods). The lead variant at each validated loci (P < 5x10–8) for overall and site-specific analyses are shown in Table 2. Results stratified by geographical region, smoking and alcohol status are displayed in Figure 2 (oral and pharynx cancer combined) and Figure 3 (oral cancer).
Figure 1

Genome-wide associations meta-analyses results.

The red line represents P = 5x10–8. The y-axis represents the –log10 P-values. (a) Overall OC and pharyngeal cancer 6,034 cases and 6585 controls (b) Oral cancer analysis with 2,990 cases and 6,585 controls (c) Oropharyngeal cancer analysis with 2,641 cases and 6,585 controls. Loci with GWA significant SNPs and technically validated are tagged with genomic location.

Table 2

Lead genome-wide significant SNP for each validated locus from the regional meta-analyses of oral and pharyngeal cancers combined and analysis of each cancer separatelya and pharynx cancer combined, as well as OCa and OPCa separately.

RegionSNPchr:posbGeneEA/OAcInfo (Rsq)dAFe case/controlOR P P het
Oral and pharyngeal cancer
4q23rs12299844:100239319 ADH1B A/GGeno0.03/0.060.562.29x10–150.002
6p21.32rs38288056:32636120 HLA-DQB1 C/T0.880.75/0.721.283.35x10–130.007
10q26.13rs20198222110:126157446 LHPP D/IGeno0.03/0.021.671.58x10–90.50
11p15.4rs145341411:5829084 OR52N2/TRIM5 C/AGeno0.23/0.201.194.78x10–80.55
Oral cancer
2p23.3rs65477412:27855924 GPN1 A/G0.980.50/0.540.833.97x10–80.34
4q23rs12299844:100239319 ADH1B A/GGeno0.03/0.060.571.09x10–90.02
5p15.33rs104627065:1343794 CLPTM1L T/C0.970.12/0.150.745.54x10–100.84
9p21.3rs81810479:22064465 CDKN2B-AS1 A/GGeno0.29/0.241.243.80x10–90.37
9q34.12rs9286749:133952024 LAMC3 G/A0.890.14/0.121.332.09x10–80.88
Oropharyngeal cancer
4q23rs12299844:100239319 ADH1B A/GGeno0.02/0.060.558.53x10–90.05
6p21.32rs38288056:32636120 HLA-DQB1 C/T0.880.75/0.721.372.21x10–120.07

OC=oral cancer, OPC=oropharyngeal cancer;

SNP position according to NCBI genome build 37 (Hg19);

EA=Effect allele; OA=other allele;

Geno=genotyped, SNP, INFO, R2 is the average across imputation batches;

AF=allele frequency of the effect allele

Figure 2

Forest plots of odds ratios for the lead SNP at each genome-wide significant loci in the overall oral and pharyngeal cancer meta-analysis.

(a)4q23, rs1229984 (b)10q26.13, rs20198222 (c)11p15.4 rs1453414. (d)6p21.32 rs3828805. EAF = effect allele frequency in 6585 controls. Effect allele in square brackets. OC = oral cancer; OPC = oropharynx cancer.

Figure 3

Forest plots of odds ratios for the lead SNP at each genome-wide significant loci in the oral cancer (OC) meta-analysis.

(a)2p23.3 rs6547741 (b)5p15.33, rs10462706 (c)9p21.3, rs8181047 (d)9q34.12, rs928674. EAF = effect allele frequency in 6585 controls. Effect allele in square brackets. Overall = oral and pharyngeal cancer; OPC = oropharynx cancer.

The rs1229984 (4q23, ADH1B) association has been previously reported as a susceptibility locus for OC and OPC6, and similar to previous findings this variant showed heterogeneity by region, smoking and alcohol drinking status10,11 (Fig. 2a). Three other 4q23 SNPs reached P < 5x10–8, although conditional analyses indicated these are not independent signals (Supplementary Table 17). The rs1573496 (ADH7) variant reported to be strongly associated with OC and OPC in the previous upper aerodigestive tract cancer GWAS7 was only moderately associated here (Supplementary Table 18). In the overall OC and pharynx cancer analysis, we identified rs201982221 at 10q26.13 (OR = 1.67, P = 1.58x10–9), that was also separately associated with OC (OR = 1.71, P = 1.04x10–7) and OPC (OR = 1.70, P = 7.9x10–7) (Fig. 2b). rs201982221 is located within the LHPP gene in a region with reported regulatory features (Supplementary Table 11). However, it is a rare intronic deletion in an area of low linkage disequilibrium (LD) (Supplementary Fig. 4), and thus warrants further validation in a different population. rs1453414, the lead signal at 11p15.4 (Supplementary Table 19), is an intronic variant that showed a borderline association in the overall (OR = 1.19, P = 4.78x10–8) and site-specific analyses [OC (OR = 1.19, P = 1.65x10–5) and OPC (OR = 1.22, P = 4.26x10–6)] (Fig. 2c, Supplementary Fig. 5). rs1453414 is upstream of OR52N2, an olfactory receptor, and within TRIM5, an E3-ubiquitin ligase, and is an eQTL for these genes in brain tissue12 (Supplementary Table 12). At 2p23.3, 4 SNPs showed evidence (P < 5x10–8) for an association with OC, and in conditional analyses did not appear to be independent (Supplementary Table 20). These signals map to a high LD area that includes C2orf16, ZNF512, CCDC121 and GPN1 (Supplementary Fig. 6). The lead SNP, rs6547741, was associated with OC but not with OPC, and maps to an intron of GPN1, a GTPase involved in RNA polymerase II transport and DNA repair13. Associations between rs6547741 and OC were homogenous across other stratified analyses by region, sex, smoking and drinking status (Fig. 3a). Variation within 5p15.33 was also exclusively associated with OC (OPC, rs10462706, P = 0.47). The top signal, rs10462706, was associated with decreased OC risk (OR = 0.74, P = 5.54x10–10) and is in low LD (r2 = 0.15, Supplementary Fig. 7) with the second strongest signal rs467095. These two variants are 7kb apart and map to intron 13 of CLPTM1L and in stratified analysis showed stronger effects in never smokers (Phet = 0.07 and Phet = 0.0028, respectively) and never drinkers (Phet = 0.01 and Phet = 0.0025, respectively) (Fig. 3b, Supplementary Fig. 8). Conditional analyses showed that these SNPs are not completely independent (Supplementary Table 21). TERT and CLPTM1L encode the telomerase reverse transcriptase (TERT) and the cleft-lip and palate-associated transmembrane 1-like protein (CLPTM1L), respectively. Notably, rs467095 is an esophageal TERT eQTL14 (Supplementary Table 12) and is in high LD with rs401681 (OR = 1.18, P = 2.1x10–7, r2 = 0.94) a widely studied SNP associated with risk of several cancers including: lung15,16, bladder, prostate, cervical, melanoma17, basal cell18, esophageal19, pancreatic20 and nasopharyngeal cancer21. Multiple 5p15.33 variants have been reported to independently influence cancer risk in both an increasing and decreasing fashion. Interestingly, rs401681[A] was associated with an increased OC risk similar to previous melanoma associations, and in an opposite direction to previous lung cancer results17. Several variants within the CDKN2A–CDKN2B locus (9p21.3) were found to be associated with OC. The lead SNP, rs8181047, is an intronic variant within the CDKN2B1 antisense RNA 1 (CDKN2B-AS1) (Fig. 3c). rs8181047 is in LD (r2range = 0.6-0.8) with 4 other 9p21.3 variants strongly associated with OC (Supplementary Fig. 9) that in conditional analyses did not show independent associations (Supplementary Table 22). The CDKN2A–CDKN2B locus contains genes involved in cell-cycle regulation and senescence and has been associated with multiple malignancies including melanoma22, glioma23, basal cell18, breast24, lung25, nasopharyngeal26 and esophageal cancer27. Notably, CDKN2A is frequently mutated in HPV-negative head and neck cancers28. The OC associated variants at 9q34.12 mapped to an intron of LAMC3, a laminin involved in cortical development29. rs928674, the peak signal, showed consistent effects across strata and a weaker association with OPC (P = 0.003) (Fig. 3d). rs928674 is in high LD with 3 other robustly associated 9q34.12 SNPs (r2range = 0.82–0.96, Supplemental Fig. 10, Supplementary Table 23) and is an esophageal mucosa cis-eQTL for a downstream gene AIF1L (Allograft Inflammatory Factor 1-Like)14. The most prominent finding in the overall and OPC meta-analyses was a large association signal at 6p21.32 within the HLA class II region. The lead variant in both analyses, rs3828805, maps 1.7kb 5' of HLA-DQB1 (Fig. 2d and Supplementary Fig. 11) and similar to other 6p21.32 variants (Supplementary Table 4 and 6), showed heterogeneity by geographical region (Phet = 0.007) with no effect in South America (P = 0.62). Association analyses of 6p21.32 variants (P < 5x10–8) conditioned on rs3828805 did not reveal multiple independent signals (Supplementary Table 24), suggesting a common haplotype. To further investigate HLA associations, we imputed classical alleles in 11,436 individuals (>70% Caucasian ancestry) (Online Methods). Three classical HLA alleles DRB1*1301, DQA1*0103 and DQB1*0603 reached P < 5x10-8 in the overall analysis and were also strongly associated with OPC (Supplementary Table 25). These alleles are in high LD (r2 > 0.9) and are part of the HLA class II haplotype, DRB1*1301-DQA1*0103-DQB1*0603, which is common in Europeans and previously reported to be associated with decreased cervical cancer risk30. DRB1*1301-DQA1*0103-DQB1*0603 was strongly associated with reduced OPC risk (OR = 0.59, P = 2.7x10-9) and more weakly with OC risk (OR = 0.75, P = 1.7x10–4). Further conditional analysis on this haplotype and 6p21.32 variants did not reveal evidence of additional independent effects (Supplementary Table 26). Given the importance of HPV infection in the etiology of cervical and oropharyngeal cancer31, we conducted post-hoc analyses to examine the effect of DRB1*1301-DQA1*0103-DQB1*0603 in a subset of 576 cases with available HPV-status and 3,662 controls. DRB1*1301-DQA1*0103-DQB1*0603 was associated with a strong reduced risk of HPV-positive OPC (OR = 0.23, P = 1x10-6, n = 336) with no significant association in 240 OPC HPV-negative cases (OR = 0.75, P = 0.16) (Table 3). These results indicate that the class II HLA region is implicated in at least two HPV-driven cancers, namely HPV-positive OPC and cervical cancer. The lack of an association between 6p21.3 SNPs and OPC risk in South America could relate to previous findings that less than 10% of OPC are HPV-positive in this region4,5. Moreover, a weaker association with OC could be due to a smaller proportion of these cases being HPV-positive, as well as possibly some misclassified OPC cases, especially for base of the tongue tumors. Further evaluation of the extent and specificity of this HLA effect in HPV-associated cancers is important given the strength of the observed association. This may help elucidate why some individuals are at higher risk of HPV-positive OPC after HPV infection and may also have implications for cancer immunotherapies targeting the HLA class II antigen presentation pathway32.
Table 3

Associations of DRB1*1301-DQA1*0103-DQB1*0603 haplotype in individuals with >70% Caucasian ancestry

Meta-analysisc
haplotypea case/controlcase/controlHFb caseHFb controlORPORP
Oral and pharynx cancer 0.683.32x10–10
Europe207/4222,497/2,9280.040.070.604.04x10–8
North America207/2762,342/2,3290.040.060.741.68x10–3
South America74/101613/7270.060.070.860.35
Oral cancer 0.751.72x10–4
Europe106/4221,231/2,9280.040.070.601.52x10–5
North America128/2761,135/2,3290.060.060.924.67x10–1
South America41/101351/7270.060.070.800.26
Oropharynx cancer 0.592.73x10–9
Europe84/4221,098/2,9280.040.070.572.69x10–5
North America72/2761,119/2,3290.030.060.523.49x10–6
South America31/101216/7270.070.071.050.81
Oropharynx cancer by HPV status d
HPV-positive11/505336/3,6860.010.070.231.6x10–6
HPV-negative25/505240/3,6860.050.070.750.16

Number of copies of the haplotype in cases and controls

Haplotype frequency calculated as total number of copies of haplotype in the population (haplotype copies/2n).

Fixed-effects meta-analysis of regional associations adjusted for age, sex and eigenvectors.

HPV-status available in a subset of cases from ARCAGE, EPIC, CHANCE and Pittsburgh studies

In summary, we identified seven oral and pharyngeal cancer susceptibility loci including a strong HLA signal narrowed to a class II haplotype. Future replication of these findings in an independent population is warranted as well as fine-mapping and functional studies necessary to establish the biological framework underneath these associations.

Online Methods

Study population and genotyping

The study population comprised 6,034 cases and 6,585 controls derived from 12 epidemiological studies, the majority of case-control design and part of the International Head and Neck Cancer Epidemiology Consortium (INHANCE). Additionally, cases and controls from a European cohort study (EPIC) and cases from a United Kingdom case-series (HN5000) were also included. Characteristics and references for each study are summarized in Supplementary Table 1. Informed consent was obtained for all participants and studies were approved by respective Institutions Review Boards. Cancer cases comprised the following ICD codes: oral cavity (C02.0-C02.9, C03.0-C03.9, C04.0-C04.9, C05.0-C06.9) oropharynx (C01.9, C02.4, C09.0-C10.9), hypopharynx (C13.0-C13.9), overlapping (C14 and combination of other sites) and 25 oral or pharyngeal cases with unknown ICD code (other). Genomic DNA isolated from blood or buccal cells was genotyped at the Center for Inherited Disease Research (CIDR) with a novel genotyping tool, the Illumina OncoArray custom designed for cancer studies by the OncoArray Consortium33 part of the Genetic Associations and Mechanisms in Oncology (GAME-ON) Network. The majority of the samples were genotyped as part of the oral and pharynx cancer OncoArray, with the exception of 2,476 shared controls (1,453 from the EPIC study and 1,023 from the Toronto study) that were genotyped at CIDR but as part of the Lung OncoArray. Genotype calls were made by the Dartmouth team in GenomeStudio software (Illumina, Inc.) using a standardized cluster file for OncoArray studies. Cluster plots for top SNPs for individuals genotyped as part the oral and pharynx cancer OncoArray are shown in Supplementary Figure 12.

GWAS quality control

We used PLINK 1.934 to conduct systematic quality-control (QC) steps on genotypes calls. An initial filtering step on the complete dataset excluded samples with genotyping rate <80% and SNPs with call rate <80%, then we excluded samples and SNPs <95% call rates. During the individuals QC, we removed samples with unsolved genetic and reported sex discrepancies and individuals with outlying autosomic heterozygosity rate (+/– 4 standard deviations (SD)). Identity-by-descent (IBD) analysis performed on the LD pruned dataset (r2 < 0.1) identified; 103 expected experimental duplicate-pairs (IBD > 0.9), from these we excluded the sample with lower genotyping rate, and 74 unexpected duplicate-pairs that were excluded. Additionally, we identified 44 unexpected relatives pairs (IBD > 0.3) and excluded one sample from each pair, prioritizing cases over controls and for pairs with same status we excluded according to genotyping rate. After the initial SNP filtering calling rate and exclusion of duplicated and zeroed probes, 513,311 probes remained for analysis. Next and given the heterogeneity of the study population, we divided the dataset by geographical region (Europe, North America and South America) and excluded SNPs within each region with deviation of Hardy-Weinberg Equilibrium (HWE) in controls (P < 1x10–7). To account for potential population stratification within geographical area, we performed principal components analysis (PCA) in EIGENSTRAT35 using approximately 10,000 common markers in low LD (r2 < 0.004, MAF > 0.05), subsequently we excluded population outliers (n = 139) and derived regional eigenvectors to adjust regional GWAS analyses (Supplementary Fig. 13). PCA analyses on each individual study are displayed in Supplementary Fig. 14. To inquire about ancestry within regions, we used STRUCTURE 2.3.436 under a supervised approach with HapMap samples to determine individual CEU, YRI and CHB/JPT ancestries. All coordinates refer to genome build HG19/GRCh37.

Imputation

Imputation of unknown genetic variation was performed using the Michigan Imputation Server37 (https://imputationserver.sph.umich.edu/), a free service for largescale population studies. We used SHAPEIT38 for prephasing, Minimac339 for imputation and the first release of the Haplotype Reference Consortium panel8 (HRC) a large collection of human haplotypes (n = 64,976) that combines data from multiple initiatives including the 1000 genomes project. For imputation we used a set of high-quality SNPs; MAF > 0.01, call rate >98%, we imputed in randomized batches of approximately 3000 individuals and analyzed imputation quality statistics together, SNPs with MAF < 0.01 or R2 < 0.3 in any of the batches were excluded before associations analysis. Thus, the final set of 7,099,472 imputed variants used in association analysis had very high quality: 86% of the variants with MAF ≥ 0.05 had R2 > 0.9 and 52% of the less common variants (MAF < 0.05) had R2 > 0.9 (Supplementary Figure 1).

GWAS and meta-analyses

We undertook a by geographic region GWAS and meta-analysis approach to evaluate the relation between SNPs and overall OC and pharynx cancer, and site-specific OC and OPC risk; in total we tested we evaluated 7,574,753 SNPs (genotyped and imputed variants). By region GWAS were performed in PLINK and R glm function for genotype dosages, using multivariable unconditional logistic regression assuming a log-additive genetic or dosage model with age, sex and principal components as covariates. Next, association statistics were included in a fixed-effects meta-analysis performed in PLINK34. The P-value for heterogeneity was calculated using Cochran’s Q. Conditional analyses within associated regions were performed in R (glm) and meta-analysis of regional results were done in PLINK.

HLA imputation and haplotype analyses

Using SNP2HLA40 and the Type I Diabetes Genetics Consortium reference panel of 5,225 individuals of European descent, we imputed HLA classical alleles and aminoacids in 11,436 study participants with >70% CEU ancestry. Individuals were randomized into 12 groups approximately of 1000 each and quality imputation scores from beagle examined across groups. Markers with R2 >= 0.7 in all groups and average R2 > 0.9 (8,286 makers of 8,961 included in the panel) were carried on for analyses and 98% of these had R2 > 0.95. We then performed regional association analyses of binary markers followed by meta-analysis of imputed binary markers (SNPs, classical alleles and aminoacids) as previously described using PLINK and R. DRB1*1301-DQA1*0103-DQB1*0603 haplotype information was extracted from beagle files of phased best-guess genotypes and used to estimate ORs and 95%CIs from multivariable unconditional logistic regression models with each phenotype. Haplotype and HPV stratified analyses in a subset of cases with available data, was performed with unconditional logistic regression with haplotype as predictor and oropharyngeal cancer HPV positive or negative as outcomes. The measure of HPV positivity was derived by HPV16E6 serology in the ARCAGE and EPIC studies, tumor HPV16 DNA and p16 overexpression in the CHANCE study and for the Pittsburgh cases a combination of tumor HPV in situ hybridization and p16 immunohistochemistry.

Technical validation genotyping

In order to confirm array genotypes and imputed dosages, we genotyped at least one SNP within each associated (P < 5x10–7) genomic region using TaqMan assays (Thermo Fisher Scientific) in a subset of approximately 900 individuals from ARCAGE, IARC Latin America, EPIC, and IARC oral cancer studies (Supplementary Table 14). Taqman genotyping included standard positive and negative controls; assay numbers and cycling conditions are described in Supplementary Table 15. For 10 of the 12 SNPs genotyped by TaqMan there was no evident departure from Hardy-Weinberg equilibrium (P > 0.05). However, two variants rs467095 and rs3731239 were slightly out of HWE only in samples from the EPIC study (P < 0.01), which might relate to the small sample size genotyped in the technical validation. The rare variants at 5p14.3 (rs79767424) and 10q26.13 (rs201982221, Supplementary Figure 15) could not be genotyped with TaqMan assays, thus for these, validation was performed with Sanger sequencing on the on a small subset of samples available within IARC studies; rs201982221 (15 wild-type, 15 heterozygous and 2 homozygous) and for rs79767424 (15 wild-type and 15 heterozygous). Primer sequences and conditions for PCR are described in Supplementary Table 16; amplification products were sequenced on the Applied Biosystems 3730 DNA Analyzer at Biofidal, France.

Bioinformatic annotation and analyses

To explore possible functional implications of variants within associated regions, we used an internet tool HaploReg v4.141 that includes annotations from the Encyclopedia of DNA elements (ENCODE). Functional annotation for variants reaching P < 5 x 10-8 is summarized Supplemental Table 11. We also retrieved eQTL information from several sources including the Genotype-Tissue Expression Project (GTEx)14 and HaploReg41. Manhattan and quantile-quantile plots were generated in R using the package qqman42, and forest plots were generated using the metafor R library43. Regional association plots were generated using LocusZoom44 with LD and recombination rates data from the 1000 Genomes Project November 2014 release. To search for additional, pairwise LD information we used the online tool LD link45.
  43 in total

1.  Inference of population structure using multilocus genotype data: linked loci and correlated allele frequencies.

Authors:  Daniel Falush; Matthew Stephens; Jonathan K Pritchard
Journal:  Genetics       Date:  2003-08       Impact factor: 4.562

2.  A linear complexity phasing method for thousands of genomes.

Authors:  Olivier Delaneau; Jonathan Marchini; Jean-François Zagury
Journal:  Nat Methods       Date:  2011-12-04       Impact factor: 28.547

3.  Principal components analysis corrects for stratification in genome-wide association studies.

Authors:  Alkes L Price; Nick J Patterson; Robert M Plenge; Michael E Weinblatt; Nancy A Shadick; David Reich
Journal:  Nat Genet       Date:  2006-07-23       Impact factor: 38.330

Review 4.  Human papillomavirus types in head and neck squamous cell carcinomas worldwide: a systematic review.

Authors:  Aimee R Kreimer; Gary M Clifford; Peter Boyle; Silvia Franceschi
Journal:  Cancer Epidemiol Biomarkers Prev       Date:  2005-02       Impact factor: 4.254

5.  Evidence for an important role of alcohol- and aldehyde-metabolizing genes in cancers of the upper aerodigestive tract.

Authors:  Mia Hashibe; Paolo Boffetta; David Zaridze; Oxana Shangina; Neonila Szeszenia-Dabrowska; Dana Mates; Vladimir Janout; Eleonóra Fabiánová; Vladimir Bencko; Norman Moullan; Amelie Chabrier; Rayjean Hung; Janet Hall; Federico Canzian; Paul Brennan
Journal:  Cancer Epidemiol Biomarkers Prev       Date:  2006-04       Impact factor: 4.254

6.  Next-generation genotype imputation service and methods.

Authors:  Sayantan Das; Lukas Forer; Sebastian Schönherr; Carlo Sidore; Adam E Locke; Alan Kwong; Scott I Vrieze; Emily Y Chew; Shawn Levy; Matt McGue; David Schlessinger; Dwight Stambolian; Po-Ru Loh; William G Iacono; Anand Swaroop; Laura J Scott; Francesco Cucca; Florian Kronenberg; Michael Boehnke; Gonçalo R Abecasis; Christian Fuchsberger
Journal:  Nat Genet       Date:  2016-08-29       Impact factor: 38.330

7.  Human papillomavirus (HPV) 16 and the prognosis of head and neck cancer in a geographical region with a low prevalence of HPV infection.

Authors:  Rossana Verónica Mendoza López; José Eduardo Levi; José Eluf-Neto; Rosalina Jorge Koifman; Sergio Koifman; Maria Paula Curado; Pedro Michaluart-Junior; David Livingstone Alves Figueiredo; Fabiano Pinto Saggioro; Marcos Brasilino de Carvalho; Luiz Paulo Kowalski; Márcio Abrahão; Francisco de Góis-Filho; Eloiza Helena Tajara; Tim Waterboer; Paolo Boffetta; Paul Brennan; Victor Wünsch-Filho
Journal:  Cancer Causes Control       Date:  2014-01-29       Impact factor: 2.506

8.  A genome-wide association study identifies pancreatic cancer susceptibility loci on chromosomes 13q22.1, 1q32.1 and 5p15.33.

Authors:  Gloria M Petersen; Laufey Amundadottir; Charles S Fuchs; Peter Kraft; Rachael Z Stolzenberg-Solomon; Kevin B Jacobs; Alan A Arslan; H Bas Bueno-de-Mesquita; Steven Gallinger; Myron Gross; Kathy Helzlsouer; Elizabeth A Holly; Eric J Jacobs; Alison P Klein; Andrea LaCroix; Donghui Li; Margaret T Mandelson; Sara H Olson; Harvey A Risch; Wei Zheng; Demetrius Albanes; William R Bamlet; Christine D Berg; Marie-Christine Boutron-Ruault; Julie E Buring; Paige M Bracci; Federico Canzian; Sandra Clipp; Michelle Cotterchio; Mariza de Andrade; Eric J Duell; J Michael Gaziano; Edward L Giovannucci; Michael Goggins; Göran Hallmans; Susan E Hankinson; Manal Hassan; Barbara Howard; David J Hunter; Amy Hutchinson; Mazda Jenab; Rudolf Kaaks; Charles Kooperberg; Vittorio Krogh; Robert C Kurtz; Shannon M Lynch; Robert R McWilliams; Julie B Mendelsohn; Dominique S Michaud; Hemang Parikh; Alpa V Patel; Petra H M Peeters; Aleksandar Rajkovic; Elio Riboli; Laudina Rodriguez; Daniela Seminara; Xiao-Ou Shu; Gilles Thomas; Anne Tjønneland; Geoffrey S Tobias; Dimitrios Trichopoulos; Stephen K Van Den Eeden; Jarmo Virtamo; Jean Wactawski-Wende; Zhaoming Wang; Brian M Wolpin; Herbert Yu; Kai Yu; Anne Zeleniuch-Jacquotte; Joseph F Fraumeni; Robert N Hoover; Patricia Hartge; Stephen J Chanock
Journal:  Nat Genet       Date:  2010-01-24       Impact factor: 38.330

9.  HaploReg: a resource for exploring chromatin states, conservation, and regulatory motif alterations within sets of genetically linked variants.

Authors:  Lucas D Ward; Manolis Kellis
Journal:  Nucleic Acids Res       Date:  2011-11-07       Impact factor: 16.971

10.  Imputing amino acid polymorphisms in human leukocyte antigens.

Authors:  Xiaoming Jia; Buhm Han; Suna Onengut-Gumuscu; Wei-Min Chen; Patrick J Concannon; Stephen S Rich; Soumya Raychaudhuri; Paul I W de Bakker
Journal:  PLoS One       Date:  2013-06-06       Impact factor: 3.240

View more
  64 in total

Review 1.  Genome-Wide Association Studies of Cancer in Diverse Populations.

Authors:  Sungshim L Park; Iona Cheng; Christopher A Haiman
Journal:  Cancer Epidemiol Biomarkers Prev       Date:  2017-06-21       Impact factor: 4.254

2.  MicroRNA-Related Genetic Variants Associated with Survival of Head and Neck Squamous Cell Carcinoma.

Authors:  Owen M Wilkins; Alexander J Titus; Lucas A Salas; Jiang Gui; Melissa Eliot; Rondi A Butler; Erich M Sturgis; Guojun Li; Karl T Kelsey; Brock C Christensen
Journal:  Cancer Epidemiol Biomarkers Prev       Date:  2018-06-07       Impact factor: 4.254

Review 3.  Integration of microbiology, molecular pathology, and epidemiology: a new paradigm to explore the pathogenesis of microbiome-driven neoplasms.

Authors:  Tsuyoshi Hamada; Jonathan A Nowak; Danny A Milner; Mingyang Song; Shuji Ogino
Journal:  J Pathol       Date:  2019-02-20       Impact factor: 7.996

Review 4.  Therapeutic Implications of the Genetic Landscape of Head and Neck Cancer.

Authors:  Janice Cho; Daniel E Johnson; Jennifer R Grandis
Journal:  Semin Radiat Oncol       Date:  2018-01       Impact factor: 5.934

5.  HLA and KIR Associations of Cervical Neoplasia.

Authors:  Xiao Bao; Aimee L Hanson; Margaret M Madeleine; Sophia S Wang; Stephen M Schwartz; Felicity Newell; Ulrika Pettersson-Kymmer; Kari Hemminki; Sven Tiews; Winfried Steinberg; Janet S Rader; Felipe Castro; Mahboobeh Safaeian; Eduardo L Franco; François Coutlée; Claes Ohlsson; Adrian Cortes; Mhairi Marshall; Pamela Mukhopadhyay; Katie Cremin; Lisa G Johnson; Suzanne M Garland; Sepehr N Tabrizi; Nicolas Wentzensen; Freddy Sitas; Cornelia Trimble; Julian Little; Maggie Cruickshank; Ian H Frazer; Allan Hildesheim; Matthew A Brown; Emma L Duncan; Ying Pu Sun; Paul J Leo
Journal:  J Infect Dis       Date:  2018-11-05       Impact factor: 5.226

6.  A Genome-Wide Association Study Identifies Two Novel Susceptible Regions for Squamous Cell Carcinoma of the Head and Neck.

Authors:  Hongliang Liu; Jian Wang; Robert Yu; Sanjay Shete; Erich M Sturgis; Guojun Li; Kristina R Dahlstrom; Zhensheng Liu; Christopher I Amos; Qingyi Wei
Journal:  Cancer Res       Date:  2020-04-10       Impact factor: 12.701

7.  The Era of the Genome and Dental Medicine.

Authors:  K Divaris
Journal:  J Dent Res       Date:  2019-08       Impact factor: 6.116

8.  Mendelian Randomization and mediation analysis of leukocyte telomere length and risk of lung and head and neck cancers.

Authors:  Linda Kachuri; Olli Saarela; Stig Egil Bojesen; George Davey Smith; Geoffrey Liu; Maria Teresa Landi; Neil E Caporaso; David C Christiani; Mattias Johansson; Salvatore Panico; Kim Overvad; Antonia Trichopoulou; Paolo Vineis; Ghislaine Scelo; David Zaridze; Xifeng Wu; Demetrius Albanes; Brenda Diergaarde; Pagona Lagiou; Gary J Macfarlane; Melinda C Aldrich; Adonina Tardón; Gad Rennert; Andrew F Olshan; Mark C Weissler; Chu Chen; Gary E Goodman; Jennifer A Doherty; Andrew R Ness; Heike Bickeböller; H-Erich Wichmann; Angela Risch; John K Field; M Dawn Teare; Lambertus A Kiemeney; Erik H F M van der Heijden; June C Carroll; Aage Haugen; Shanbeh Zienolddiny; Vidar Skaug; Victor Wünsch-Filho; Eloiza H Tajara; Raquel Ayoub Moysés; Fabio Daumas Nunes; Stephen Lam; Jose Eluf-Neto; Martin Lacko; Wilbert H M Peters; Loïc Le Marchand; Eric J Duell; Angeline S Andrew; Silvia Franceschi; Matthew B Schabath; Jonas Manjer; Susanne Arnold; Philip Lazarus; Anush Mukeriya; Beata Swiatkowska; Vladimir Janout; Ivana Holcatova; Jelena Stojsic; Dana Mates; Jolanta Lissowska; Stefania Boccia; Corina Lesseur; Xuchen Zong; James D McKay; Paul Brennan; Christopher I Amos; Rayjean J Hung
Journal:  Int J Epidemiol       Date:  2019-06-01       Impact factor: 7.196

9.  LHPP inhibits hepatocellular carcinoma cell growth and metastasis.

Authors:  Lijuan Liao; Deyu Duan; Yanfeng Liu; Liang Chen
Journal:  Cell Cycle       Date:  2020-06-24       Impact factor: 4.534

10.  Large-Scale Genome-Wide Association Study of East Asians Identifies Loci Associated With Risk for Colorectal Cancer.

Authors:  Yingchang Lu; Sun-Seog Kweon; Chizu Tanikawa; Wei-Hua Jia; Yong-Bing Xiang; Qiuyin Cai; Chenjie Zeng; Stephanie L Schmit; Aesun Shin; Keitaro Matsuo; Sun Ha Jee; Dong-Hyun Kim; Jeongseon Kim; Wanqing Wen; Jiajun Shi; Xingyi Guo; Bingshan Li; Nan Wang; Ben Zhang; Xinxiang Li; Min-Ho Shin; Hong-Lan Li; Zefang Ren; Jae Hwan Oh; Isao Oze; Yoon-Ok Ahn; Keum Ji Jung; David V Conti; Fredrick R Schumacher; Gad Rennert; Mark A Jenkins; Peter T Campbell; Michael Hoffmeister; Graham Casey; Stephen B Gruber; Jing Gao; Yu-Tang Gao; Zhi-Zhong Pan; Yoichiro Kamatani; Yi-Xin Zeng; Xiao-Ou Shu; Jirong Long; Koichi Matsuda; Wei Zheng
Journal:  Gastroenterology       Date:  2018-12-06       Impact factor: 22.682

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.