| Literature DB >> 32157176 |
Jadwiga Buchwald1, Meghan J Chenoweth2, Teemu Palviainen1, Gu Zhu3, Christian Benner1, Scott Gordon3, Tellervo Korhonen1, Samuli Ripatti1,4,5, Pamela A F Madden6, Terho Lehtimäki7,8, Olli T Raitakari9,10,11, Veikko Salomaa12, Richard J Rose13, Tony P George14,15, Caryn Lerman16, Matti Pirinen1,4,17, Nicholas G Martin3, Jaakko Kaprio1,4, Anu Loukola1,18, Rachel F Tyndale19,20,21.
Abstract
Smoking behaviors, including amount smoked, smoking cessation, and tobacco-related diseases, are altered by the rate of nicotine clearance. Nicotine clearance can be estimated using the nicotine metabolite ratio (NMR) (ratio of 3'hydroxycotinine/cotinine), but only in current smokers. Advancing the genomics of this highly heritable biomarker of CYP2A6, the main metabolic enzyme for nicotine, will also enable investigation of never and former smokers. We performed the largest genome-wide association study (GWAS) to date of the NMR in European ancestry current smokers (n = 5185), found 1255 genome-wide significant variants, and replicated the chromosome 19 locus. Fine-mapping of chromosome 19 revealed 13 putatively causal variants, with nine of these being highly putatively causal and mapping to CYP2A6, MAP3K10, ADCK4, and CYP2B6. We also identified a putatively causal variant on chromosome 4 mapping to TMPRSS11E and demonstrated an association between TMPRSS11E variation and a UGT2B17 activity phenotype. Together the 14 putatively causal SNPs explained ~38% of NMR variation, a substantial increase from the ~20 to 30% previously explained. Our additional GWASs of nicotine intake biomarkers showed that cotinine and smoking intensity (cotinine/cigarettes per day (CPD)) shared chromosome 19 and chromosome 4 loci with the NMR, and that cotinine and a more accurate biomarker, cotinine + 3'hydroxycotinine, shared a chromosome 15 locus near CHRNA5 with CPD and Pack-Years (i.e., cumulative exposure). Understanding the genetic factors influencing smoking-related traits facilitates epidemiological studies of smoking and disease, as well as assists in optimizing smoking cessation support, which in turn will reduce the enormous personal and societal costs associated with smoking.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32157176 PMCID: PMC7483250 DOI: 10.1038/s41380-020-0702-z
Source DB: PubMed Journal: Mol Psychiatry ISSN: 1359-4184 Impact factor: 13.437
Figure 1.Chromosomal location and annotation for all genome-wide significant SNPs for all six phenotypes.
Annotations of genome-wide significant (GWS) SNPs in the main analytic models of the (A) NMR, (B) COT+3HC, (C) COT, (D) COT/CPD, (E) CPD, and (F) Pack-Years are shown. Annotations were performed using NCBI’s dbSNP (build 37). The GWS SNPs were found within or 5’ or 3’ (i.e. within 90kb of the gene) of the genes indicated. The only intergenic GWS SNP located >90kb from a gene was the chromosome 9 COT+3HC SNP (located 194kb 5’ of SMARCA1). For clarity, the NMR plot (A) excludes genes with <4 GWS SNPs: PSMC4, HNRNPUL1, LGALS14, ZNF780B, CNTD2, PLD3, CYP2S1, CEACAM5, RABAC1, LEUTX, ZNF546, TTC9B, PRX, SERTAD3, AXL, CCDC97, BCKDHA, CEACAM6, ARHGEF1, GRIK5, POU2F2, ERF, PSG3. Note: the number of genome-wide significant SNPs found is influenced by linkage disequilibrium patterns and the size of the gene.
Number of total and putatively causal GWS SNPs, top SNP, and heritability estimate for each association locus for each phenotype
| PHENOTYPE | CHR | # GWS SNPs | # PUTATIVELY CAUSAL SNPS [ | REGIONAL HERITABILITY ESTIMATE (%)[ | TOP SNP | BP POSITION [ | MAJOR/MINOR | MAF | LOCATION | eQTL[ | meQTL[ |
|---|---|---|---|---|---|---|---|---|---|---|---|
|
| 4 | 48 | 1 | 1.8 | rs34638591 | 69359223 | C/T | 0.43 |
| None | |
|
| 19 | 1207 | 13 | 36.4 | rs56113850 | 41353107 | C/T | 0.45 |
| Cis, Trans | |
|
| |||||||||||
|
| 4 | 315 | 1 | 0.9 | rs10000284 | 69669183 | T/C | 0.08 | None | None | |
|
| 9 | 1 | 1 | 0.7 | rs12684930 | 1821338 | G/C | 0.08 | None | None | |
|
| 15 | 309 | 1 | 2.5 | rs2036527 | 78851615 | G/A | 0.35 |
| Cis | |
|
| |||||||||||
|
| 4 | 286 | 1 | 0.8 | rs294775 | 69680933 | T/C | 0.08 | None | None | |
|
| 15 | 307 | 1 | 2.4 | rs2036527 | 78851615 | G/A | 0.35 |
| Cis | |
|
| 19 | 5 | 1 | 0.8 | rs2316205 | 41346768 | C/T | 0.49 |
| Cis | |
|
| |||||||||||
|
| 4 | 281 | 1 | 0.2 | rs294778 | 69682555 | A/G | 0.08 | None | None | |
|
| 19 | 8 | 1 | 1.6 | rs56113850 | 41353107 | C/T | 0.45 |
| Cis, Trans | |
|
| |||||||||||
|
| 1 | 1 | 1 | 0.5 | rs860873 | 95387208 | G/A | 0.47 |
| None | |
|
| 15 | 84 | 1 | 0.6 | rs72740955 | 78849779 | C/T | 0.35 |
| Cis | |
|
| |||||||||||
|
| 5 | 2 | 1 | 0.7 | rs2337033 | 167392011 | A/G | 0.35 | None | None | |
|
| 15 | 27 | 1 | 0.5 | rs72740955 | 78849779 | C/T | 0.35 |
| Cis | |
The results shown are for the main model that adjusted for population substructure, age, sex, BMI, alcohol use, and birth year. CHR, chromosome; GWS, genome-wide significant; SNP, single-nucleotide polymorphism; BP, base pair; MAJOR, the more common allele; MINOR, the less common allele; MAF, minor allele frequency; eQTL, expression quantitative loci; meQTL, methylation quantitative trait loci; NMR, the nicotine metabolite ratio; COT, cotinine; 3HC, 3-hydroxycotinine; CPD, cigarettes per day
Number of SNPs in the FINEMAP top configuration of putatively causal SNPs
Obtained with FINEMAP (full results in Table S5)
Base pair positions reported according to GRCh37 (hg19)
Full eQTL information is available in Table S5
Full meQTL information is available in Table S6
Figure 2.Associations in the meta-GWAS of the Nicotine Metabolite Ratio (NMR) were found in chromosome 4 and chromosome 19.
(A) Manhattan plot of the meta-GWAS of the NMR highlighting two association loci. The dashed line represents the genome-wide significance threshold of P<5×10-8. (B) The chromosome 4 association locus: the top signal comprised three intronic SNPs in TMPRSS11E (rs34638591 (C > T), rs36103218 (T > C), and rs34103191 (G > A)) with identical P-values. FINEMAP and GCTA analyses revealed one putatively causal and conditionally independent chromosome 4 signal for the NMR. (C) The chromosome 19 association locus: the top SNP was rs56113850 (C > T) found in intron 4 of CYP2A6. (D) FINEMAP and GCTA analyses revealed several putatively causal and conditionally independent chromosome 19 signals for the NMR. In plots (B) and (C), the light purple areas represent the genome-wide significant regions, and the linkage disequilibrium values are based on the 1000-Genomes reference panel. The top SNP has been indicated with a purple diamond, the red line represents the genome-wide significance threshold of P=5×10−8, and the blue line represents the suggestive threshold of P=1×10-5. Plots reflect the main analytic model. Base pair positions are provided according to GRCh37 (hg19).
FINEMAP top configuration of causal SNPs for the NMR on chromosome 19
| SNP | BP | MAJOR/MINOR | MAF | SNP PROB | BETA | SE | P-VALUE | LOCATION | NCBI dbSNP FUNCTIONAL ANNOTATION | eQTL | meQTL |
|---|---|---|---|---|---|---|---|---|---|---|---|
|
| |||||||||||
|
| 41338761 | G/A | 0.14 | 1 | −0.42 | 0.05 | 1.48e-15 | NA | None | Cis | |
|
| 41355195 | C/G | 0.37 | 1 | −0.37 | 0.04 | 2.05e-25 |
| intron variant |
| Cis |
|
| 41356379 | A/C | 0.12 | 0.9995 | −0.81 | 0.04 | 9.46e-94 | NA (CYP2A6*9) |
| None | |
|
| 41523016 | T/C | 0.4 | 0.9621 | 0.42 | 0.07 | 1.28e-10 |
| downstream variant 500B,utr variant 3’ | None | Cis |
|
| 41354533 | A/T | 0.02 | 0.9042 | −0.9 | 0.08 | 5.81e-29 |
| missense ( | None | None |
|
| 41204624 | C/T | 0.01 | 0.8302 | −0.61 | 0.11 | 2.17e-08 |
| intron variant | None | None |
|
| 41524087 | G/A | 0.4 | 0.8293 | −0.41 | 0.07 | 4.99e-10 |
| utr variant 3’ | None | Cis |
|
| 41362293 | C/T | 0.15 | 0.5804 | 0.39 | 0.05 | 5.54e-15 | NA | None | None | |
|
| 40685483 | C/T | 0.01 | 0.5311 | −0.5 | 0.1 | 3.59e-07 | NA | None | Trans | |
|
| 41909719 | G/A | 0.02 | 0.3935 | −0.57 | 0.1 | 3.66e-09 |
| intron variant | None | None |
|
| 41437426 | C/G | 0.24 | 0.3430 | −0.25 | 0.03 | 2.05e-14 |
| intron variant |
| Cis |
|
| 41296117 | C/G | 0.28 | 0.3296 | 0.19 | 0.03 | 7.05e-11 |
| intron variant |
| Cis |
|
| 41344101 | T/C | 0.03 | 0.2402 | 0.83 | 0.09 | 2.73e-20 | NA |
| Cis, Trans | |
The results are from the main model. The most probable configuration consisted of 13 SNPs (depicted in the table) and their heritability estimate was 36.7% (95% CI: 32.9–41.0%). All betas listed are from the joint model including the 13 top configuration SNPs, and they are reported for the minor allele. SNP PROB indicates the posterior probability of being a causal SNP. FINEMAP gave a regional heritability estimate of 36.4% (95% CI: 32.6–40.4%) for the main model results and suggested that there are 9–14 causal SNPs within the region. SNP, single-nucleotide polymorphism; BP, base pair; MAJOR, the more common allele; MINOR, the less common allele; MAF, minor allele frequency; SE, standard error; eQTL, expression quantitative loci; meQTL, methylation quantitative trait loci; Cis, within 1Mb of the position of the SNP; Trans, > 1Mb away from the position of the SNP
Figure 3.Number and overlap of genome-wide significant SNP associations with the six phenotypes.
(A) Upset plot depicting exact numbers of overlapping SNP associations for all main model results (1 885 GWS SNPs across the six phenotypes). (B) The best fit proportional Venn diagram highlighting the proportions of GWS SNP associations and relationships between phenotypes for all main model results. The Venn diagram was drawn using the venneuler R package (59). It is approximative with the stress factor equaling 0.008 (<0.01 threshold advised), which indicates a good fit, but unlike the figure suggests, the NMR GWS SNPs do not overlap with the COT+3HC GWS SNPs. (C) Upset plots depicting overlaps of the main model results by chromosome. For chromosomes 1 and 9, one GWS SNP occurred for CPD and COT+3HC, respectively. For chromosome 5, two GWS SNPs occurred for Pack-Years.