| Literature DB >> 30254314 |
Aida Ferreiro-Iglesias1, Corina Lesseur1, James McKay1, Rayjean J Hung2, Younghun Han3, Xuchen Zong2, David Christiani4, Mattias Johansson1, Xiangjun Xiao3, Yafang Li3, David C Qian3, Xuemei Ji3, Geoffrey Liu2, Neil Caporaso5, Ghislaine Scelo1, David Zaridze6, Anush Mukeriya6, Milica Kontic7, Simona Ognjanovic8, Jolanta Lissowska9, Małgorzata Szołkowska10, Beata Swiatkowska11, Vladimir Janout12, Ivana Holcatova13, Ciprian Bolca14, Milan Savic15, Miodrag Ognjanovic8, Stig Egil Bojesen16,17,18, Xifeng Wu19, Demetrios Albanes5, Melinda C Aldrich20, Adonina Tardon21, Ana Fernandez-Somoano21, Guillermo Fernandez-Tardon21, Loic Le Marchand22, Gadi Rennert23, Chu Chen24, Jennifer Doherty24,25, Gary Goodman26, Heike Bickeböller27, H-Erich Wichmann28,29,30, Angela Risch31,32,33, Albert Rosenberger27, Hongbing Shen34, Juncheng Dai34, John K Field35, Michael Davies35, Penella Woll36, M Dawn Teare37, Lambertus A Kiemeney38, Erik H F M van der Heijden38, Jian-Min Yuan39, Yun-Chul Hong40, Aage Haugen41, Shanbeh Zienolddiny41, Stephen Lam42, Ming-Sound Tsao43, Mikael Johansson44, Kjell Grankvist45, Matthew B Schabath46, Angeline Andrew3, Eric Duell47, Olle Melander48,49, Hans Brunnström50, Philip Lazarus51, Susanne Arnold52, Stacey Slone52, Jinyoung Byun3, Ahsan Kamal3, Dakai Zhu3, Maria Teresa Landi5, Christopher I Amos3, Paul Brennan53.
Abstract
Lung cancer has several genetic associations identified within the major histocompatibility complex (MHC); although the basis for these associations remains elusive. Here, we analyze MHC genetic variation among 26,044 lung cancer patients and 20,836 controls densely genotyped across the MHC, using the Illumina Illumina OncoArray or Illumina 660W SNP microarray. We impute sequence variation in classical HLA genes, fine-map MHC associations for lung cancer risk with major histologies and compare results between ethnicities. Independent and novel associations within HLA genes are identified in Europeans including amino acids in the HLA-B*0801 peptide binding groove and an independent HLA-DQB1*06 loci group. In Asians, associations are driven by two independent HLA allele sets that both increase risk in HLA-DQB1*0401 and HLA-DRB1*0701; the latter better represented by the amino acid Ala-104. These results implicate several HLA-tumor peptide interactions as the major MHC factor modulating lung cancer susceptibility.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30254314 PMCID: PMC6156406 DOI: 10.1038/s41467-018-05890-2
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Demographic characteristics of the participating studies after quality control filters
| European ancestry | Asian ancestry | |||
|---|---|---|---|---|
| Case no. (%) | Control no. (%) | Case no. (%) | Control no. (%) | |
| OncoArray studies- passed QC | 18,924 | 15,439 | 2324 | 1656 |
| Age | ||||
| ≤50 | 2098 (11.1) | 2084 (13.5) | 242 (10.4) | 154 (9.3) |
| >50 | 16,801 (88.8) | 13,306 (86.2) | 2080 (89.5) | 1502 (90.7) |
| Missing | 25 (0.1) | 49 (0.3) | 2 (0.1) | 0 (0) |
| Sex | ||||
| Male | 11,685 (61.7) | 9240 (59.8) | 1578 (67.9) | 1070 (64.6) |
| Female | 7236 (38.2) | 6196 (40.1) | 746 (32.1) | 586 (35.4) |
| Missing | 3 (0.02) | 3 (0.02) | 0 (0) | 0 (0) |
| Smoking status | ||||
| Never | 1830 (9.7) | 4870 (31.5) | 815 (35.0) | 668 (40.3) |
| Ever | 16,682 (88.2) | 10,219 (66.2) | 1503 (64.7) | 983 (59.4) |
| Current | 9459 (50.0) | 4285 (27.8) | 986 (42.4) | 739 (44.6) |
| Missing | 412 (2.2) | 350 (2.3) | 6 (0.3) | 5(0.3) |
| Histology | ||||
| Adenocarcinoma | 7088 (37.5) | — | 1192 (51.3) | — |
| Squamous cell carcinoma | 4581 (24.2) | — | 641 (27.6) | — |
| Small cell carcinoma | 190 (10.5) | — | 94 (4.0) | — |
| Othersa | 5265 (27.8) | 397(17.1) | ||
aLarge Cell, Bronchioloalveolar Carcinoma, Non-Small Cell Carcinoma, Carcinoids, Others or Missing
Fig. 1European regional association plots of variants in the MHC region and lung cancer overall and major histologies (a–d); plots of stepwise conditional analyses for squamous cell carcinoma (d.1–d.3). Each panel on the left shows the association plot for each unconditioned analysis a lung cancer overall, b adenocarcinoma, c small cell, d squamous cell carcinoma. The association for each locus used for conditioning is shown in green in each panel (d.1) unconditioned, (d.2) conditioned on HLA-B*0801, (d.3) conditioned on HLA-B*0801 and HLA-DQB1*06. Detailed association results in Table 2 and Supplementary Fig. 1. Circles represent –log10 (P values) for each binary marker using the imputed allelic dosage (between 0 and 2). The dashed black horizontal lines represent the study-wide significant threshold of P = 6 × 10−6. The physical positions of HLA genes on chromosome 6 are shown at the bottom. The color of the circles indicates the type of marker; light blue—SNPs outside HLA genes, green—classical HLA alleles and red—amino acid polymorphisms of the HLA genes; deep blue—SNP within HLA genes)
Top associations of the HLA alleles with squamous cell carcinoma of European ancestry
| Locus | HLA variant | Frequency | Unconditional analysisa | Model including AH8.1 + | |||
|---|---|---|---|---|---|---|---|
| Controlsc | Casesc | OR (95% CI) | OR (95% CI) | ||||
| AH 8.1 |
| 0.06 | 0.07 | 1.30 (1.18-1.42) | 4.78 × 10−08 | 1.24 (1.14–1.37) | 2.13 × 10−06 |
|
|
| 0.15 | 0.17 | 1.18 (1.10-1.25) | 9.36 × 10−07 | ||
|
|
| 0.15 | 0.17 | 1.17 (1.10-1.25) | 5.53 × 10−07 | ||
|
|
| 0.10 | 0.12 | 1.25 (1.16-1.34) | 9.01 × 10−09 | ||
|
|
| 0.11 | 0.14 | 1.25 (1.16-1.34) | 6.38 × 10−10 | ||
|
|
| 0.20 | 0.29 | 1.18 (1.12-1.24) | 1.33 × 10−09 | ||
|
|
| 0.11 | 0.14 | 1.25 (1.17-1.34) | 4.45 × 10−10 | ||
|
|
| 0.24 | 0.21 | 0.85 (0.80-0.90) | 3.05 × 10−08 | 0.86 (0.78-0.91) | 9.96 × 10−08 |
|
| 0.01 | 0.01 | 0.87 (0.67-1.12) | 0.3 | |||
|
| 0.12 | 0.10 | 0.90 (0.83-0.97) | 0.007 | |||
|
| 0.07 | 0.06 | 0.84 (0.76-0.93) | 5.06 × 10−04 | |||
|
| 0.04 | 0.03 | 0.86 (0.75-0.99) | 0.03 | |||
|
| 0.009 | 0.007 | 0.80 (0.61-1.06) | 0.1 | |||
AH 8.1 ancestral haplotype 8.1, HLA human leucocyte antigen, OR odds ratio, 95% CI confidence interval
aObtained from multivariate unconditional logistic regression assuming an additive genetic model with sex and principal components as covariates
bClassical two-digit allele accounting for the four digit alleles found (*0601,*0602,*0603,*0604,*0609)
c Number of samples included in the analysis: 4,581 cases and 15,439 controls
The study-wide significant threshold was P = 6 × 10−6 (Bonferroni correction)
Fig. 2Asian regional association plots of variants in the MHC region and lung cancer overall and major histologies (a–c); plots of stepwise conditional analyses for lung adenocarcinoma (b. 1–b. 4). Each panel on the left shows the association plot for each analysis a lung cancer overall, b adenocarcinoma, c squamous cell carcinoma. The association for each locus used for conditioning is shown in green in each panel on the right (b. 1) unconditioned, (b. 2) conditioned on HLA-DQB1*0401, (b. 3) conditioned on HLA-DQB1*0401 and HLA-DRB1*0701, (b. 4) conditioned on HLA-DQB1*0401, HLA-DRB1*0701 and rs2256919 (SNP within HLA-A). Detailed association results in Table 3 and Supplementary Figure 1. Circles represent –log10 (P values) for each binary marker using the imputed allelic dosage (between 0 and 2). The dashed black horizontal lines represent the study-wide significant threshold of P = 6 × 10−6. The physical positions of HLA genes on chromosome 6 are shown at the bottom. The color of the circles indicates the type of marker; light blue—SNPs, green—classical HLA alleles, and red—amino acid polymorphisms of the HLA genes
Top associations of the HLA alleles with adenocarcinoma of Asian ancestry
| Locus | Variant | Frequency | Unconditional analysisa | Model including | |||
|---|---|---|---|---|---|---|---|
| Controlsb | Casesb | OR (95% CI) | OR (95%CI) | ||||
| HLA-DQB1 |
| 0.06 | 0.09 | 1.67 (1.35-2.05) | 1.59 × 10−06 | 1.73 (1.41-2.14) | 2.85 × 10−07 |
| HLA-DRB1 |
| 0.05 | 0.09 | 1.62 (1.31-2.01) | 5.48 × 10−06 | 1.63 (1.32-2.03) | 5.34 × 10−06 |
| HLA_A (intronic) | rs2256919 | 0.44 | 0.38 | 0.75 (0.67-0.83) | 1.75 × 10−07 | 0.76 (0.68-0.85) | 8.92 × 10−07 |
HLA human leucocyte antigen, OR odds ratio, 95% CI confidence interval
a Obtained from multivariate unconditional logistic regression assuming an additive genetic model with sex and principal components as covariates
b Number of samples included in the analysis: 1192 cases and 1656 controls
The study-wide significant threshold was P = 6 × 10−6 (Bonferroni correction)
Fig. 3Three-dimensional ribbon models for the HLA-B (a) and HLA-DR (b) proteins. These structures are based on Protein Data Bank entries 2bvp and 3pdo, respectively, with a direct view of the peptide-binding groove. Key amino acid positions identified by unconditioned association analyses (squamous cell carcinoma of European ancestry) are highlighted. This figure was prepared using UCSF Chimera (see URL)
HLA alleles and amino acids contained in the best models obtained for SCC in Europeans judged by the BIC criterion
| Model | Locus | OR (95%CI) | BIC | BIC difi | |
|---|---|---|---|---|---|
| A | |||||
| AH8.1 + HLA-DQB1 *06 | 21605 | +14 | |||
| AH8.1 | 1.24 (1.14-1.37) | 2.13 × 10−6 | |||
| HLA-DQB1 *06 | 0.86 (0.78-0.91) | 9.96 × 10−8 | |||
| B | |||||
| B-Thr163 + DRB1-Arg74 + HLA-DQB1 *06 | 21594 | + 3 | |||
| B(163) | 1.13 (1.07-1.20) | 1.68 × 10−5 | |||
| DRB1(74) | 1.11 (1.03-1.20) | 8.87 × 10−3 | |||
| HLA-DQB1 *06 | 0.87 (0.82-0.92) | 1.15 × 10−6 | |||
| C | |||||
| B-Thr163 + HLA-DQB1 *06 | 21591 | 0 | |||
| B(163) | 1.17 (1.11-1.23) | 1.83 × 10−9 | |||
| HLA-DQB1 *06 | 0.86 (0.81-0.91) | 8.47 × 10−8 | |||
| D | |||||
| DRB1-Arg74 + HLA-DQB1 *06 | 21603 | +12 | |||
| DRB1(74) | 1.19 (1.11-1.28) | 6.59 × 10−7 | |||
| HLA-DQB1 *06 | 0.87 (0.82-0.91) | 8.10 × 10−7 |
BIC Bayesian information criterion, HLA human leukocyte antigen
c Number of samples included in the analyses: 4581 cases and 15,439 controls
i models having their BIC within: + 1–2 of the minimum have substantial support; + 4–7 of the minimum have considerably less support; > 10 above the minimum have either essentially no support
Meta-analyses of top associated HLA variants with adenocarcinoma of Asian ancestry
| Locus | Variant | Study | N | Frequency | Unconditional analysis | Cochran’s Q | |
|---|---|---|---|---|---|---|---|
| Controls/Cases | Controls/Cases | OR (95%CI) | P value | Phet | |||
| HLA-DQB1 |
| Oncoarraya | 1656 / 1192 | 0.06 / 0.09 | 1.67 (1.35-2.05) | 1.59 × 10−06 | |
| Lan et al.b | 3741 / 3469 | 0.06 / 0.07 | 1.22 (1.07-1.40) | 4.69 × 10−04 | |||
| Combinedc | 5129 / 4576 | 0.06 / 0.08 | 1.34 (1.19-1.51) | 4.88 × 10−07 | 0.02 | ||
| HLA-DRB1 |
| Oncoarraya | 1656 / 1192 | 0.05 / 0.09 | 1.62 (1.31-2.01) | 5.48 × 10−06 | |
| Lan et al. b | 3741 / 3469 | 0.06 / 0.07 | 1.10 (0.96-1.26) | 0.15 | |||
| Combinedc | 5129 / 4576 | 0.06 / 0.08 | 1.23 (1.10-1.38) | 3.66 × 10−04 | 0.003 | ||
|
| Oncoarraya | 1656 / 1192 | 0.36 / 0.43 | 1.34 (1.20-1.50) | 5.48 × 10−06 | ||
| Lan et al. b | 3741 / 3469 | 0.35 / 0.39 | 1.16 (1.08-1.24) | 5.11 × 10−06 | |||
| Combinedc | 5129 / 4576 | 0.35 / 0.40 | 1.20 (1.14-1.28) | 5.21 × 10−10 | 0.03 | ||
| HLA_A (intronic) | rs2256919 | Oncoarraya | 1656 / 1192 | 0.44 / 0.38 | 0.75 (0.67-0.83) | 1.75 × 10−07 | |
| Lan et al. b | 3741 / 3469 | 0.43 / 0.42 | 0.96 (0.90-1.03) | 0.26 | |||
| Combinedc | 5129 / 4576 | 0.43 / 0.41 | 0.89 (0.84-0.95) | 2.29 × 10−04 | 0.001 | ||
HLA, human leucocyte antigen; OR, odds ratio; 95%CI, confidence interval
a Obtained from multivariate unconditional logistic regression assuming an additive genetic model with sex and principal components as covariates
b Obtained from multivariate unconditional logistic regression assuming an additive genetic model with principal components as covariates
c For combined analyses we excluded 353 never-smoking women (85 cases and 268 controls) from Oncoarray’s studies (Seoul and NJLCS) to avoid a possible overlap of samples.
d amino acids in tigh LD (r2 = 0.99) with Glu-98 and Gln-10, part of HLA-DRB1*0701 and HLA-DRB1*0401 protein sequence and with the rs2395185 (reported by Lan et al.)
The study-wide significant threshold was P = 6 × 10−6 (Bonferroni correction)