Literature DB >> 32157176

Genome-wide association meta-analysis of nicotine metabolism and cigarette consumption measures in smokers of European descent.

Jadwiga Buchwald1, Meghan J Chenoweth2, Teemu Palviainen1, Gu Zhu3, Christian Benner1, Scott Gordon3, Tellervo Korhonen1, Samuli Ripatti1,4,5, Pamela A F Madden6, Terho Lehtimäki7,8, Olli T Raitakari9,10,11, Veikko Salomaa12, Richard J Rose13, Tony P George14,15, Caryn Lerman16, Matti Pirinen1,4,17, Nicholas G Martin3, Jaakko Kaprio1,4, Anu Loukola1,18, Rachel F Tyndale19,20,21.   

Abstract

Smoking behaviors, including amount smoked, smoking cessation, and tobacco-related diseases, are altered by the rate of nicotine clearance. Nicotine clearance can be estimated using the nicotine metabolite ratio (NMR) (ratio of 3'hydroxycotinine/cotinine), but only in current smokers. Advancing the genomics of this highly heritable biomarker of CYP2A6, the main metabolic enzyme for nicotine, will also enable investigation of never and former smokers. We performed the largest genome-wide association study (GWAS) to date of the NMR in European ancestry current smokers (n = 5185), found 1255 genome-wide significant variants, and replicated the chromosome 19 locus. Fine-mapping of chromosome 19 revealed 13 putatively causal variants, with nine of these being highly putatively causal and mapping to CYP2A6, MAP3K10, ADCK4, and CYP2B6. We also identified a putatively causal variant on chromosome 4 mapping to TMPRSS11E and demonstrated an association between TMPRSS11E variation and a UGT2B17 activity phenotype. Together the 14 putatively causal SNPs explained ~38% of NMR variation, a substantial increase from the ~20 to 30% previously explained. Our additional GWASs of nicotine intake biomarkers showed that cotinine and smoking intensity (cotinine/cigarettes per day (CPD)) shared chromosome 19 and chromosome 4 loci with the NMR, and that cotinine and a more accurate biomarker, cotinine + 3'hydroxycotinine, shared a chromosome 15 locus near CHRNA5 with CPD and Pack-Years (i.e., cumulative exposure). Understanding the genetic factors influencing smoking-related traits facilitates epidemiological studies of smoking and disease, as well as assists in optimizing smoking cessation support, which in turn will reduce the enormous personal and societal costs associated with smoking.
© 2020. The Author(s), under exclusive licence to Springer Nature Limited.

Entities:  

Mesh:

Substances:

Year:  2020        PMID: 32157176      PMCID: PMC7483250          DOI: 10.1038/s41380-020-0702-z

Source DB:  PubMed          Journal:  Mol Psychiatry        ISSN: 1359-4184            Impact factor:   13.437


Introduction

Cigarette smoking persists in part due to the reinforcing properties of nicotine, the major psychoactive compound in cigarettes (1). The Nicotine Metabolite Ratio (NMR) (2), the ratio of major nicotine metabolites (3’hydroxycotinine (3HC)/cotinine (COT)), is heritable (h2~80% (3)) and is highly correlated (r=0.83) with the rate of nicotine metabolic clearance (2) and thus associates with numerous smoking behaviors. Higher NMR is associated with greater cigarette consumption and lower cessation (reviewed in (4)). Genetically variable CYP2A6 metabolically inactivates nicotine to COT; COT is then metabolized to 3HC exclusively by CYP2A6 (5, 6). A higher NMR indicates faster nicotine inactivation and CYP2A6 activity (2, 7–9). CYP2A6 also metabolically activates tobacco-specific nitrosamines (10); thus, the association between the NMR and lung cancer (11) may be influenced by both altered smoking quantity (i.e. carcinogen intake) and procarcinogen activation. Further, CYP2A6 can metabolize many other drugs (e.g. tegafur and letrozole) (12, 13). The NMR prospectively predicts smoking cessation outcomes; smokers with higher NMR had higher quit rates on varenicline versus nicotine patch, whereas quit rates were similar in those with lower NMR (14). Additional tobacco products such as pipe, cigars, and smokeless tobacco contain nicotine, as do newer products (e.g. electronic cigarettes) (15). The NMR influences nicotine intake across many product types, including commercial smokeless tobacco and iqmik (16). Virtually all of the genome-wide significant (GWS) SNPs in genome-wide association studies (GWASs) of the NMR (3, 17–19) reside in the chromosome 19 region that contains CYP2A6. Here we assessed five cohorts from Australia (20), Finland (3), and North America (14) to conduct the largest NMR meta-GWAS to date in 5185 European ancestry current smokers. We also ran GWASs of biomarkers of nicotine intake, COT+3HC and COT alone; COT+3HC is more accurate than COT due to the accumulation of COT in slow metabolizers (21). To directly investigate genetic similarities and differences between objective and self-reported nicotine intake phenotypes, we also performed GWASs for two self-reported measures, cigarettes smoked per day (CPD) and Pack-Years (i.e. cumulative exposure). Finally, we performed a GWAS of a biomarker of smoking intensity, COT/CPD, using COT given its use as a common nicotine intake biomarker. To our knowledge, these represent the first GWASs of COT+3HC and COT/CPD, and the first within-dataset comparison of the NMR and these numerous smoking biomarkers. This work will enhance NMR genomics for studies that include non-, former- and intermittent-smokers (as NMR can only be measured in current, regular smokers (7)), for example in tobacco-related disease risk assessments (22), as well as in precision medicine approaches for smoking cessation (14, 23) and for cancer due to the role of CYP2A6 in chemotherapeutic (e.g. letrozole, tegafur) drug metabolism (12, 13).

Materials and Methods

Study Samples and Phenotypes

European current smokers (n=5185) with cotinine levels ≥10 ng/ml were studied (see Supplementary information and Tables S1 and S2 for cohort details; cohort data is available from lead investigators). A power curve is shown in Figure S1. Individuals with cotinine <10 ng/ml, suggestive of non-daily smoking (24) and thus unstable NMR measurements, were excluded. The six phenotypes are shown in Figure S2. COT and 3HC were quantified from blood samples using LC-MS/MS or GC-MS (for FINRISK); these previously validated approaches yielded highly concordant results (25). Self-report variables were acquired from surveys. We set all CPD and Pack-Years values of zero to missing and replaced 3HC values below the limit of detection (LOD) of 1 ng/ml with (26). Phenotypes were rank-transformed using the ‘rntransform’ function in the R package ‘GenABEL’(27) to follow the standard normal distribution N(0,1).

GWAS Analyses

Contributing sites performed genotyping, post-genotyping quality control, imputation, and GWAS analyses (Table S2). Cohort-specific quality control of GWAS summary results was performed at the University of Helsinki. The variant inclusion criteria were: MAF>1%, imputation info score (Rsq)>0.7, call rate>0.9, and Hardy-Weinberg Equilibrium P>1×10-6. Base pair positions are reported according to GRCh37 (hg19). GWAS analyses of the six phenotypes were performed using linear mixed models with Rvtests (http://zhanxw.github.io/rvtests/) or GEMMA (28). The continuous scale of the effect allele dose (0–2) was used to account for imputation uncertainties. A relatedness matrix, calculated from all variants with MAF >5% using the Balding-Nichols method in the vcf2kinship package in RVTESTS, accounted for the entire spectrum of genetic relatedness from familial relationships to more distant population structure. Additionally, population stratification was accounted for prior to genotype imputation by restricting the analyses to European ancestry participants (details by study cohort in Table S2). The genomic inflation factor (λ) was calculated with the function ‘estlambda’ from the R package ‘GenABEL’ (27) (Table S3); together with an LD score regression intercept test using LD Hub (29), we saw no evidence of significant inflation (Table S4). QQ-plots and Manhattan plots were acquired with the R package ‘qqman’ (30). Covariates for the base analytic model included sex, age, and BMI (in kg/m2) (31), while the main analytic model further controlled for alcohol use (grams/week) (due to high co-use and associations with NMR (31–33)) and birth-year (due to trends in smoking attitudes, social acceptance, regulatory bans, and variation in recruitment years). Because the NMR influences CPD (31), and CPD may influence the NMR (via nicotine/tobacco inhibition of CYP2A6 (34)), we ran additional models for NMR controlling for CPD, and for CPD controlling for NMR. We also ran a model for COT and COT+3HC, controlling for CPD. Missing covariate values (all <10%) were recoded to the median value. Reported results reflect the main analytic model unless otherwise stated; alternative models are found in the supplementary data.

GWAS Meta-Analysis, Fine-mapping, and Annotation

Model- and phenotype-specific meta-analyses were performed using GWAMA (35), using fixed effects models, applying study-specific genomic control correction to adjust for remaining population stratification, and using a GWS association threshold of P<5×10-8. An association locus was defined as the region extending 2.5Mb in both directions (none were >2.5Mb) of the GWS SNP with the smallest P-value. Regional plots were created with LocusZoom (36); LD data were from 1000-Genomes. We additionally performed the largest COT GWAS (n=8885) by meta-analysing our base model results (as most similar) with those from Ware et al. (37) (n=4548) using non-overlapping samples. To identify putatively causal SNPs in each GWS region, we performed a shotgun stochastic search with FINEMAP v1.2 (38) and a step-wise conditional regression with GCTA v1.91.3beta (39) (see Supplementary Information). For each locus, we calculated the heritability estimate of the causal variants using FINEMAP (see Benner et al. 2018; bioRxiv https://doi.org/10.1101/318618). We report the heritability estimate of the causal variants for each locus as calculated by FINEMAP (Benner et al. 2018; bioRxiv https://doi.org/10.1101/318618). Variant annotation included assessment of genomic location, functional consequence, and association with RNA expression levels and methylation pattern (see Supplementary Information).

Code availability

All scripts used for statistical analyses are available upon request.

Results

Altogether 1885 GWS (P<5×10−8) SNPs and six association loci were found on chromosomes 1, 4, 5, 9, 15, and 19 across the six phenotypes (Figure 1, Table 1, Table S5, Table S6). Using a leave-one-out approach, illustrated in cohort-specific plots, the results were largely consistent across the five cohorts (Figure S3 and S4). Correlations between the phenotypes are found in Table S7.
Figure 1.

Chromosomal location and annotation for all genome-wide significant SNPs for all six phenotypes.

Annotations of genome-wide significant (GWS) SNPs in the main analytic models of the (A) NMR, (B) COT+3HC, (C) COT, (D) COT/CPD, (E) CPD, and (F) Pack-Years are shown. Annotations were performed using NCBI’s dbSNP (build 37). The GWS SNPs were found within or 5’ or 3’ (i.e. within 90kb of the gene) of the genes indicated. The only intergenic GWS SNP located >90kb from a gene was the chromosome 9 COT+3HC SNP (located 194kb 5’ of SMARCA1). For clarity, the NMR plot (A) excludes genes with <4 GWS SNPs: PSMC4, HNRNPUL1, LGALS14, ZNF780B, CNTD2, PLD3, CYP2S1, CEACAM5, RABAC1, LEUTX, ZNF546, TTC9B, PRX, SERTAD3, AXL, CCDC97, BCKDHA, CEACAM6, ARHGEF1, GRIK5, POU2F2, ERF, PSG3. Note: the number of genome-wide significant SNPs found is influenced by linkage disequilibrium patterns and the size of the gene.

Table 1.

Number of total and putatively causal GWS SNPs, top SNP, and heritability estimate for each association locus for each phenotype

PHENOTYPECHR# GWS SNPs# PUTATIVELY CAUSAL SNPS [1]REGIONAL HERITABILITY ESTIMATE (%)[2]TOP SNPBP POSITION [3]MAJOR/MINORMAFLOCATIONeQTL[4]meQTL[5]
NMR 44811.8rs34638591rs36103218rs34103191693592236935925369359280C/TT/CG/A0.43TMPRSS11E (intron) UGT2B15, UGT2B17, UGT2B29P,RP11–1267H10.2, RP11–1267H10.1 None
NMR 1912071336.4rs5611385041353107C/T0.45CYP2A6 (intron) EGLN2, CYP2A7, CYP2T2P Cis, Trans

COT+3HC 431510.9rs1000028469669183T/C0.08UGT2B10 (12.5kb 5’)NoneNone
COT+3HC 9110.7rs126849301821338G/C0.08SMARCA2 (194kb 5’)NoneNone
COT+3HC 1530912.5rs203652778851615G/A0.35CHRNA5 (6.2kb 5’) CHRNA3, CHRNA5, CHRNB4,IREB2, PSMA4, RP11–160C18.2,RP11–650L12.2 Cis

COT 428610.8rs29477569680933T/C0.08UGT2B10 (0.8kb 5’)NoneNone
COT 1530712.4rs203652778851615G/A0.35CHRNA5 (6.2kb 5’) CHRNA3, CHRNA5, CHRNB4, IREB2, PSMA4, RP11–160C18.2, RP11–650L12.2 Cis
COT 19510.8rs231620541346768C/T0.49CYP2A6 (2.7kb 3’) EGLN2, CYP2A7 Cis

COT/CPD 428110.2rs29477869682555A/G0.08UGT2B10 (intron)NoneNone
COT/CPD 19811.6rs5611385041353107C/T0.45CYP2A6 (intron) EGLN2, CYP2A7, CYP2T2P Cis, Trans

CPD 1110.5rs86087395387208G/A0.47CNN3 (intron) RP11–86H7.7, RP4–639F20.1, RWDD3 None
CPD 158410.6rs7274095578849779C/T0.35CHRNA5 (8.1kb 5’) CHRNA3, CHRNA5, IREB2, PSMA4,RP11–160C18.2, RP11–650L12.2 Cis

PACK-YEARS 5210.7rs2337033167392011A/G0.35TENM2 (intron)NoneNone
PACK-YEARS 152710.5rs7274095578849779C/T0.35CHRNA5 (8.1kb 5’) CHRNA3, CHRNA5, IREB2, PSMA4,RP11–160C18.2, RP11–650L12.2 Cis

The results shown are for the main model that adjusted for population substructure, age, sex, BMI, alcohol use, and birth year. CHR, chromosome; GWS, genome-wide significant; SNP, single-nucleotide polymorphism; BP, base pair; MAJOR, the more common allele; MINOR, the less common allele; MAF, minor allele frequency; eQTL, expression quantitative loci; meQTL, methylation quantitative trait loci; NMR, the nicotine metabolite ratio; COT, cotinine; 3HC, 3-hydroxycotinine; CPD, cigarettes per day

Number of SNPs in the FINEMAP top configuration of putatively causal SNPs

Obtained with FINEMAP (full results in Table S5)

Base pair positions reported according to GRCh37 (hg19)

Full eQTL information is available in Table S5

Full meQTL information is available in Table S6

Associations for the NMR were found on chromosomes 4 and 19

Two association loci on chromosomes 4 and 19 (Figure 2) were found for the NMR, explaining 38.2% of variation (Table 1). We replicated the top SNP, rs56113850, located on chromosome 19 in an intron of CYP2A6 (Figure 2, Table 1) (3, 18, 19). The chromosome 19 locus explained 36.4% of NMR variation (Table 1) (vs. ~20–30% previously explained (3, 17, 19)) and consisted of several SNPs residing in previously unreported genes (Table S8). The chromosome 4 locus for the NMR was novel, with most of the GWS SNPs mapping to TMPRSS11E, a transmembrane serine protease (Figure 2), explaining 1.8% of NMR variation (Table 1). The smallest P-value was obtained for three SNPs in an intron of TMPRSS11E (Table 1, Table S5), in perfect linkage disequilibrium (LD) in Europeans from the 1000 Genomes Project. Controlling for CPD did not substantially alter the number of GWS SNPs (Table S5).
Figure 2.

Associations in the meta-GWAS of the Nicotine Metabolite Ratio (NMR) were found in chromosome 4 and chromosome 19.

(A) Manhattan plot of the meta-GWAS of the NMR highlighting two association loci. The dashed line represents the genome-wide significance threshold of P<5×10-8. (B) The chromosome 4 association locus: the top signal comprised three intronic SNPs in TMPRSS11E (rs34638591 (C > T), rs36103218 (T > C), and rs34103191 (G > A)) with identical P-values. FINEMAP and GCTA analyses revealed one putatively causal and conditionally independent chromosome 4 signal for the NMR. (C) The chromosome 19 association locus: the top SNP was rs56113850 (C > T) found in intron 4 of CYP2A6. (D) FINEMAP and GCTA analyses revealed several putatively causal and conditionally independent chromosome 19 signals for the NMR. In plots (B) and (C), the light purple areas represent the genome-wide significant regions, and the linkage disequilibrium values are based on the 1000-Genomes reference panel. The top SNP has been indicated with a purple diamond, the red line represents the genome-wide significance threshold of P=5×10−8, and the blue line represents the suggestive threshold of P=1×10-5. Plots reflect the main analytic model. Base pair positions are provided according to GRCh37 (hg19).

FINEMAP analyses of chromosome 19 in cohorts 1–2 showed that nine out of 13 putatively causal SNPs were highly likely causal (probability>0.5) (Table 2, Table S9). Of these nine, two are known functional CYP2A6 variants (rs1801272/CYP2A6*2, rs28399433/CYP2A6*9; www.pharmvar.org), three were in/near to CYP2A6 (including rs113288603), two were in CYP2B6, one was in ADCK4, and one was near MAP3K10 (Table 2). Notably, only rs113288603 was among the conditionally independent SNPs in prior GWASs that used stepwise conditional analyses (3, 17–19). We also performed stepwise conditional analysis with GCTA using the meta-GWAS results, yielding six conditionally independent SNPs (Table S9); only one overlapped with the FINEMAP SNPs (Figure 2, Table S10). The GCTA-configuration includes by default the top SNP (i.e. rs56113850), which was not found with FINEMAP. Our further comparisons suggested that GCTA may not be appropriate (see Supplementary Information). For chromosome 4, FINEMAP and GCTA analyses highlighted only one SNP as being causal or conditionally independent (Table S9). Neither method could distinguish the causal or conditionally independent SNP among the highly correlated SNPs in the area.
Table 2.

FINEMAP top configuration of causal SNPs for the NMR on chromosome 19

SNPBPMAJOR/MINORMAFSNP PROBBETASEP-VALUELOCATIONNCBI dbSNP FUNCTIONAL ANNOTATIONeQTLmeQTL

RS11667982 41338761G/A0.141−0.420.051.48e-15CYP2A6 (10.7kb 3’)NANoneCis
RS7250713 41355195C/G0.371−0.370.042.05e-25 CYP2A6 intron variant EGLN2, CYP2A7, CYP2G1P Cis
RS28399433 41356379A/C0.120.9995−0.810.049.46e-94CYP2A6 (0.02kb 5’)NA (CYP2A6*9) CYP2A7, CYP2G1P, C19orf54, ADCK4 None
RS3181842 41523016T/C0.40.96210.420.071.28e-10 CYP2B6 downstream variant 500B,utr variant 3’NoneCis
RS1801272 41354533A/T0.020.9042−0.90.085.81e-29 CYP2A6 missense (CYP2A6*2)NoneNone
RS148856862 41204624C/T0.010.8302−0.610.112.17e-08 ADCK4 intron variantNoneNone
RS707265 41524087G/A0.40.8293−0.410.074.99e-10 CYP2B6 utr variant 3’NoneCis
RS113288603 41362293C/T0.150.58040.390.055.54e-15CYP2A6 (5.9kb 5’)NANoneNone
RS12972927 40685483C/T0.010.5311−0.50.13.59e-07MAP3K10 (12.0kb 5’)NANoneTrans
RS11672345 41909719G/A0.020.3935−0.570.13.66e-09 BCKDHA intron variantNoneNone
RS7248187 41437426C/G0.240.3430−0.250.032.05e-14 CYP2B7P1 intron variant CYP2B7P1 Cis
RS2604911 41296117C/G0.280.32960.190.037.05e-11 MIA-RAB4B/RAB4B-EGLN2/RAB4B intron variant ADCK4 Cis
RS11672809 41344101T/C0.030.24020.830.092.73e-20CYP2A6 (5.3kb 3’)NA CYP2A6 Cis, Trans

The results are from the main model. The most probable configuration consisted of 13 SNPs (depicted in the table) and their heritability estimate was 36.7% (95% CI: 32.9–41.0%). All betas listed are from the joint model including the 13 top configuration SNPs, and they are reported for the minor allele. SNP PROB indicates the posterior probability of being a causal SNP. FINEMAP gave a regional heritability estimate of 36.4% (95% CI: 32.6–40.4%) for the main model results and suggested that there are 9–14 causal SNPs within the region. SNP, single-nucleotide polymorphism; BP, base pair; MAJOR, the more common allele; MINOR, the less common allele; MAF, minor allele frequency; SE, standard error; eQTL, expression quantitative loci; meQTL, methylation quantitative trait loci; Cis, within 1Mb of the position of the SNP; Trans, > 1Mb away from the position of the SNP

Associations for COT+3HC and COT, biomarkers of nicotine intake, were found on chromosomes 4, 9, 15, and 19

For COT+3HC, three association loci on chromosomes 4, 9, and 15 (Figure S5, Figure S6) were found, explaining 4.1% of variation (Table 1). The chromosome 4 GWS SNPs mapped to seven UDP-glucuronosyltransferase (UGT) genes (Figure 1). The top chromosome 4 SNP was rs10000284, located ~13kb upstream of UGT2B10, encoding the UGT2B10 enzyme involved in nicotine and cotinine glucuronidation (40) (Table 1, Table S5, Figure S6). One GWS SNP was detected on chromosome 9 (Table 1, Table S5, Figure S6), located ~194kb upstream of SMARCA2 which putatively regulates transcription via chromatin remodelling (41). On chromosome 15, the GWS SNPs mapped to IREB2, ADAMTS7, CHRNB4, CHRNA3, CHRNA5, HYKK, and PSMA4 (Figure 1). The top SNP on chromosome 15 was rs2036527, located ~6kb upstream of CHRNA5 (Table S5, Figure S6) encoding the α5 nicotinic acetylcholine receptor (nAChR) subunit. FINEMAP and GCTA analyses revealed only the top SNP from chromosomes 4, 9, and 15 to be putatively causal and conditionally independent (Table S9). The meta-GWAS of COT revealed association loci on chromosomes 4, 15 and 19 (Figure S5, Figure S6); the GWS SNPs explained 4.0% of variation (Table 1). The chromosome 4 and 15 genes were shared with COT+3HC; the chromosome 19 signal mapped to CYP2A6 (Figure 1). The chromosome 9 COT+3HC GWS SNP (rs12684930) was nearly GWS (P=8.1×10−8) for COT. FINEMAP and GCTA analyses of COT revealed the top SNP from chromosome 4, 15, and 19 to be putatively causal and conditionally independent (Table S9). After meta-analysing our COT results with those from Ware et al. (37) (total n= 8885), signals on chromosomes 4, 15, and 19 remained, whereas the chromosome 9 signal disappeared (Figure S7). The original Ware et al. COT GWAS detected variants on chromosomes 4 and 15, but not on 19 (37). Controlling for CPD reduced the number of GWS SNPs found on chromosome 15 in the COT+3HC and COT GWASs (Table S5).

Associations for COT/CPD, a measure of smoking intensity, were found on chromosomes 4 and 19

Two association loci on chromosome 4 and 19 (Figure S5, Figure S6) were identified for COT/CPD, explaining 1.7% of variation (Table 1). The chromosome 4 SNPs mapped to five UGTs (Figure 1). The top SNP on chromosome 4 was rs294778, located in an intron of UGT2B10 (Table 1, Table S5, Figure S6). The GWS SNPs on chromosome 19 mapped to CYP2A6 and CYP2A7; the top SNP was rs56113850 (Figure 1, Table 1, Table S5, Figure S6) as for the NMR. FINEMAP and GCTA analyses revealed the top SNP from chromosomes 4 and 19 to be putatively causal and conditionally independent (Table S9). As for the NMR, rs56113850 is unlikely to causally influence COT/CPD; rs56113850 did not remain significant for COT/CPD after conditioning on the 13 SNPs pinpointed by FINEMAP for the NMR (Table S10).

Associations for CPD and Pack-Years, self-reported measures of nicotine intake, were found on chromosomes 1, 5, and 15

For CPD, three association loci on chromosomes 1, 5, and 15 (Figure S5, Figure S6) explained 1.1% of variation (Table 1). There were single GWS SNP associations on chromosomes 1 (rs860873, located in an intron of CNN3) and 5 (rs2337033, located in an intron of TENM2), significant only in the base model (Table S5). The top SNP on chromosome 15 was rs72740955, located ~8.1kb upstream of CHRNA5 (Table S5); rs72740955 is in high LD (r2=0.9 and D’=0.97, 1000 Genomes) with the functional CHRNA5 variant rs16969968, which was associated with COT, COT+3HC and CPD (Table S6), replicating prior associations with smoking quantity (42). Our FINEMAP and GCTA analyses revealed only the top SNP from chromosomes 1, 5, and 15 to be putatively causal and conditionally independent (Table S9). For Pack-Years, two association loci on chromosomes 5 and 15 (Figure S5, Figure S6) explained 1.2% of variation (Table 1). A single GWS SNP was detected on chromosome 5 (rs2337033, mapping to an intron of TENM2) (Table 1, Table S5, Figure S6) which was also GWS for CPD in the base model (Table S6). As for CPD, the top SNP on chromosome 15 was rs72740955, located ~8.1kb upstream of CHRNA5 (Table S5, Figure S6). FINEMAP and GCTA analyses revealed only the top SNP from chromosomes 5 and 15 to be putatively causal and conditionally independent (Table S9).

Many of the GWS SNPs were eQTL and/or meQTL

Many of the GWS SNPs were known expression quantitative trait loci (eQTL) (Table S11, Figure S8) and/or known methylation quantitative trait loci (meQTL) (Table S12, Figure S9). For instance, the T (vs. C) allele of the intronic TMPRSS11E rs36103218 SNP, associated with lower NMR, was associated with higher UGT2B17 expression in the liver in GTEx and higher UGT2B17 (glucuronidates 3HC (43)) activity in the PNAT2 cohort (Figure S10). The top chromosome 15 SNPs, rs2036527 (for COT+3HC and COT) and rs72740955 (for CPD and Pack-Years) were associated with altered expression of the α5 nAChR subunit in brain in GTEx (Figure S11).

Summary of genetic similarities and differences across the nicotine metabolism and smoking phenotypes

There was substantial overlap between the GWS SNPs and association loci found for the NMR and COT+3HC, COT, and COT/CPD (Figure 3, Table 1). Although none of the chromosome 4 SNPs were shared, they were found in the same region containing TMPRSS11E and UGT2 genes (Figure 3, Figure S6). In contrast to chromosome 4, all chromosome 19 SNPs that were GWS for COT and/or COT/CPD were shared with the NMR and mapped near to CYP2A6 (Figure 3, Table S6). Compared to the NMR, far fewer overall chromosome 19 SNPs (and within a narrower region) were observed for COT and COT/CPD (Table 1, Table S5). For COT+3HC, there were no GWS SNPs in chromosome 19. Comparing the NMR to the self-reported measures of nicotine intake, there was no overlap: neither of the two significant chromosomes for the NMR (4 and 19) were shared with CPD (1 and 15) or Pack-Years (5 and 15) (Table 1). We were unable to quantify genetic similarity using LD-score regression (LDSC) (44) across all phenotypes, because SNP-heritability estimates from LDSC were highly inaccurate (Table S4); LDSC likely requires a larger sample size and/or higher polygenicity for robust results.
Figure 3.

Number and overlap of genome-wide significant SNP associations with the six phenotypes.

(A) Upset plot depicting exact numbers of overlapping SNP associations for all main model results (1 885 GWS SNPs across the six phenotypes). (B) The best fit proportional Venn diagram highlighting the proportions of GWS SNP associations and relationships between phenotypes for all main model results. The Venn diagram was drawn using the venneuler R package (59). It is approximative with the stress factor equaling 0.008 (<0.01 threshold advised), which indicates a good fit, but unlike the figure suggests, the NMR GWS SNPs do not overlap with the COT+3HC GWS SNPs. (C) Upset plots depicting overlaps of the main model results by chromosome. For chromosomes 1 and 9, one GWS SNP occurred for CPD and COT+3HC, respectively. For chromosome 5, two GWS SNPs occurred for Pack-Years.

Discussion

The associated loci captured 38% of NMR variation, 4% of variation in nicotine intake measured by objective biomarkers (COT+3HC, COT), 2% of variation in smoking intensity (COT/CPD), and 1% of variation in self-reported nicotine intake (CPD, Pack-Years). We confirmed the chromosome 19 association locus found in prior NMR GWASs by us (3, 17) and others (18, 19). The number of NMR GWS SNPs was essentially unchanged after additionally controlling for CPD (Table S5), suggesting altered cigarette consumption does not impact the observed genomic influences on the NMR. The top SNP from the meta-GWAS, rs56113850, was not identified in the FINEMAP top configuration and our additional analyses suggested that rs56113850 is not a causal variant itself but tags multiple causal variants, consistent with its high correlation with four SNPs pinpointed by FINEMAP (Figure S12). We also identified an entirely novel chromosome 4 signal for the NMR mapping to TMPRSS11E, adjacent to UGT2B17. According to the GWAS Catalog (https://www.ebi.ac.uk/gwas/; accessed September 24, 2019), TMPRSS11E is associated with hematological and blood lipid traits (45–47). Altered UGT2B17 expression and activity (Figure S10) is a possible mechanism underpinning the association with NMR. Prior work in African American smokers, however, showed that the UGT2B17 gene deletion did not alter NMR (48, 49), suggesting there may be inter-ethnic variation in the relationship between UGT2B17 activity and the NMR. Understanding NMR genomics permits extensions to epidemiological studies that include non-, former- and intermittent-users of tobacco products where the NMR itself cannot be reliably measured (7). The NMR is associated with tobacco-related disease (11), thus the assessment of NMR genomics in epidemiological studies could greatly improve our mechanistic understanding of disease risk pathways, and improve prevention efforts. A greater understanding of the contribution of CYP2A6 variation to NMR genomics could also advance precision medicine approaches in nicotine dependence; recently a CYP2A6 weighted genetic risk score was shown to replicate NMR-stratified cessation treatment findings (23). Moreover, NMR genomic information may be particularly useful in tailoring chemotherapy, as CYP2A6 metabolizes the chemotherapeutic agents letrozole and tegafur (12, 13), and other CYP2A6 substrate drug treatments (reviewed in 50, 51). This study was the first GWAS of COT+3HC, an improved biomarker of nicotine intake compared to COT (21). While the chromosome 9 SNP (rs12684930, located ~194kb 5’ from SMARCA2) associated with COT+3HC was not previously associated with smoking-related traits, SMARCA2 has been linked in the GWAS Catalog to ADHD, bipolar disorder, MDD, and schizophrenia which are also associated with smoking (52–54). Variation in the chromosome 15 CHRNA5-CHRNA3-CHRNB4 cluster, which was significant in our analyses of COT+3HC, COT, CPD, and Pack-Years, has been robustly associated with nicotine phenotypes (37, 55, 56). The minor alleles of the top chromosome 15 SNPs for the objective nicotine biomarkers (i.e. rs2036527) and self-reported intake measures (i.e. rs72740955) were associated with higher intake and lower α5-nAChR expression, consistent with α5-nAChR knockout mice showing greater nicotine intake (57). We also found GWS SNPs in ADAMTS7 (for COT+3HC and COT) and IREB2 (for COT+3HC, COT, and CPD). A gene- and pathway-based analysis highlighted ADAMTS7 and IREB2 among the top genes associated with CPD (58), and the recent GWAS of CPD by the GSCAN consortium identified an intronic ADAMTS7 SNP as conditionally independent (42); additionally these genes are associated with tobacco-related diseases in the GWAS Catalog. The lack of extensive genomic overlap between phenotypes may be due to poor capture of intake phenotype heritability (1–4%) versus NMR heritability (~38%) and low power for our CPD analysis. The GSCAN consortium’s CPD GWAS (>330 000 smokers) (42) identified rs56113850 among the conditionally independent SNPs, which was the top SNP for both the NMR and COT/CPD in our study: the rs56113850 T allele was associated with higher COT/CPD (beta=0.16) but lower NMR (beta=−0.69), likely due to slower removal of COT, the denominator in NMR (21). In support of this, a secondary analysis in PNAT2 showed no apparent impact of the rs56113850 T allele on increasing (COT+3HC)/CPD (Figure S13). Comparing between intake phenotypes, COT+3HC (vs. COT) was more genetically similar to CPD. In contrast, COT (vs. COT+3HC) was more genetically similar to the NMR, likely due to the impact of CYP2A6 activity (i.e. the NMR) on COT formation and metabolism (21). Strengths of our study include the large sample size for biomarker analyses (e.g. NMR) and the use of biochemical intake measures. We were well-powered to detect variants that explain at least 0.76%, 1.0%, or 1.5% (power >80%, >96%, and >99%, respectively) of phenotypic variance. Fine-mapping of the association loci using a stochastic search allowed for any possible configuration of causal SNPs; this was valuable for the chromosome 19 NMR association locus which was not suited for stepwise conditional analyses. A limitation was that data from only two cohorts were available for the fine-mapping analysis. In addition, all study participants were of European descent, thus limiting the generalizability of our results to other populations. Studies in larger samples are needed to detect more variants with phenotypic impacts below 0.76%. In conclusion, we identified over 1200 SNPs associated with the NMR, capturing ~38% of phenotype variation. Not surprisingly, the chromosome 19 locus, containing CYP2A6, comprised the majority of the GWS SNPs and explained ~36% of phenotype variation, a substantial increase from the ~20–30% variance previously accounted for (3, 17, 19). We also identified several novel loci influencing the NMR and nicotine exposure phenotypes, including a novel chromosome 4 region mapping to TMPRSS11E and several UGT2 genes. A greater understanding of the genomic influences on nicotine metabolism and tobacco exposure phenotypes may improve smoking cessation treatments and our understanding of tobacco-related disease risk.
  50 in total

1.  Nicotine metabolite ratio as an index of cytochrome P450 2A6 metabolic activity.

Authors:  Delia Dempsey; Piotr Tutka; Peyton Jacob; Faith Allen; Kerri Schoedel; Rachel F Tyndale; Neal L Benowitz
Journal:  Clin Pharmacol Ther       Date:  2004-07       Impact factor: 6.875

2.  Role of human cytochrome P4502A6 in C-oxidation of nicotine.

Authors:  M Nakajima; T Yamamoto; K Nunoya; T Yokoi; K Nagashima; K Inoue; Y Funae; N Shimada; T Kamataki; Y Kuroiwa
Journal:  Drug Metab Dispos       Date:  1996-11       Impact factor: 3.922

3.  Within-subject variation of the salivary 3HC/COT ratio in regular daily smokers: prospects for estimating CYP2A6 enzyme activity in large-scale surveys of nicotine metabolic rate.

Authors:  Rod A Lea; Stuart Dickson; Neal L Benowitz
Journal:  J Anal Toxicol       Date:  2006 Jul-Aug       Impact factor: 3.367

4.  Characterization of CYP2A6 involved in 3'-hydroxylation of cotinine in human liver microsomes.

Authors:  M Nakajima; T Yamamoto; K Nunoya; T Yokoi; K Nagashima; K Inoue; Y Funae; N Shimada; T Kamataki; Y Kuroiwa
Journal:  J Pharmacol Exp Ther       Date:  1996-05       Impact factor: 4.030

Review 5.  Nicotine addiction.

Authors:  Neal L Benowitz
Journal:  N Engl J Med       Date:  2010-06-17       Impact factor: 91.245

Review 6.  Pharmacogenetic Optimization of Smoking Cessation Treatment.

Authors:  Meghan J Chenoweth; Rachel F Tyndale
Journal:  Trends Pharmacol Sci       Date:  2016-10-03       Impact factor: 14.819

Review 7.  CYP2A6 polymorphisms and risk for tobacco-related cancers.

Authors:  Ana Rossini; Tatiana de Almeida Simão; Rodolpho Mattos Albano; Luis Felipe Ribeiro Pinto
Journal:  Pharmacogenomics       Date:  2008-11       Impact factor: 2.533

8.  Stability of the nicotine metabolite ratio in ad libitum and reducing smokers.

Authors:  Marc E Mooney; Zhong-Ze Li; Sharon E Murphy; Paul R Pentel; Chap Le; Dorothy K Hatsukami
Journal:  Cancer Epidemiol Biomarkers Prev       Date:  2008-06       Impact factor: 4.254

9.  Association of CYP2A6 activity with lung cancer incidence in smokers: The multiethnic cohort study.

Authors:  Sungshim L Park; Sharon E Murphy; Lynne R Wilkens; Daniel O Stram; Stephen S Hecht; Loïc Le Marchand
Journal:  PLoS One       Date:  2017-05-25       Impact factor: 3.240

10.  A Genome-Wide Association Study of a Biomarker of Nicotine Metabolism.

Authors:  Anu Loukola; Jadwiga Buchwald; Richa Gupta; Teemu Palviainen; Jenni Hällfors; Emmi Tikkanen; Tellervo Korhonen; Miina Ollikainen; Antti-Pekka Sarin; Samuli Ripatti; Terho Lehtimäki; Olli Raitakari; Veikko Salomaa; Richard J Rose; Rachel F Tyndale; Jaakko Kaprio
Journal:  PLoS Genet       Date:  2015-09-25       Impact factor: 5.917

View more
  14 in total

1.  Accuracy and applications of sequencing and genotyping approaches for CYP2A6 and homologous genes.

Authors:  Alec W R Langlois; Ahmed El-Boraie; Koya Fukunaga; Taisei Mushiroda; Michiaki Kubo; Caryn Lerman; Jo Knight; Steven E Scherer; Meghan J Chenoweth; Rachel F Tyndale
Journal:  Pharmacogenet Genomics       Date:  2022-02-21       Impact factor: 2.000

2.  The Promise of Polygenic Risk Prediction in Smoking Cessation: Evidence From Two Treatment Trials.

Authors:  Michael Bray; Yoonhoo Chang; Timothy B Baker; Douglas Jorenby; Robert M Carney; Louis Fox; Giang Pham; Faith Stoneking; Nina Smock; Christopher I Amos; Laura Bierut; Li-Shiun Chen
Journal:  Nicotine Tob Res       Date:  2022-10-17       Impact factor: 5.825

3.  Dissecting the genetic overlap of smoking behaviors, lung cancer, and chronic obstructive pulmonary disease: A focus on nicotinic receptors and nicotine metabolizing enzyme.

Authors:  Michael J Bray; Li-Shiun Chen; Louis Fox; Dana B Hancock; Robert C Culverhouse; Sarah M Hartz; Eric O Johnson; Mengzhen Liu; James D McKay; Nancy L Saccone; John E Hokanson; Scott I Vrieze; Rachel F Tyndale; Timothy B Baker; Laura J Bierut
Journal:  Genet Epidemiol       Date:  2020-08-16       Impact factor: 2.135

Review 4.  Pharmacogenetics factors influencing smoking cessation success; the importance of nicotine metabolism.

Authors:  Yadira X Perez-Paramo; Philip Lazarus
Journal:  Expert Opin Drug Metab Toxicol       Date:  2020-12-29       Impact factor: 4.481

5.  Clarifying the Genetic Influences on Nicotine Dependence and Quantity of Use in Cigarette Smokers.

Authors:  Brad Verhulst; Shaunna L Clark; Jingchun Chen; Hermine H Maes; Xiangning Chen; Michael C Neale
Journal:  Behav Genet       Date:  2021-04-21       Impact factor: 2.965

6.  Transferability of Ancestry-Specific and Cross-Ancestry CYP2A6 Activity Genetic Risk Scores in African and European Populations.

Authors:  Ahmed El-Boraie; Meghan J Chenoweth; Jennie G Pouget; Neal L Benowitz; Koya Fukunaga; Taisei Mushiroda; Michiaki Kubo; Nicole L Nollen; Lisa Sanderson Cox; Caryn Lerman; Jo Knight; Rachel F Tyndale
Journal:  Clin Pharmacol Ther       Date:  2021-01-01       Impact factor: 6.903

7.  Effect of race and glucuronidation rates on the relationship between nicotine metabolite ratio and nicotine clearance.

Authors:  Evangelia Liakoni; Rachel F Tyndale; Peyton Jacob; Delia A Dempsey; Newton Addo; Neal L Benowitz
Journal:  Pharmacogenet Genomics       Date:  2021-07-01       Impact factor: 2.000

8.  Functional characterization of novel rare CYP2A6 variants and potential implications for clinical outcomes.

Authors:  Ahmed El-Boraie; Julie-Anne Tanner; Andy Z X Zhu; Katrina G Claw; Bhagwat Prasad; Erin G Schuetz; Kenneth E Thummel; Koya Fukunaga; Taisei Mushiroda; Michiaki Kubo; Neal L Benowitz; Caryn Lerman; Rachel F Tyndale
Journal:  Clin Transl Sci       Date:  2021-09-25       Impact factor: 4.689

9.  Meta-Analysis of Transcriptome-Wide Association Studies across 13 Brain Tissues Identified Novel Clusters of Genes Associated with Nicotine Addiction.

Authors:  Zhenyao Ye; Chen Mo; Hongjie Ke; Qi Yan; Chixiang Chen; Peter Kochunov; L Elliot Hong; Braxton D Mitchell; Shuo Chen; Tianzhou Ma
Journal:  Genes (Basel)       Date:  2021-12-23       Impact factor: 4.141

10.  Expanding the genetic architecture of nicotine dependence and its shared genetics with multiple traits.

Authors:  Bryan C Quach; Michael J Bray; Nathan C Gaddis; Mengzhen Liu; Teemu Palviainen; Camelia C Minica; Stephanie Zellers; Richard Sherva; Fazil Aliev; Michael Nothnagel; Kendra A Young; Jesse A Marks; Hannah Young; Megan U Carnes; Yuelong Guo; Alex Waldrop; Nancy Y A Sey; Maria T Landi; Daniel W McNeil; Dmitriy Drichel; Lindsay A Farrer; Christina A Markunas; Jacqueline M Vink; Jouke-Jan Hottenga; William G Iacono; Henry R Kranzler; Nancy L Saccone; Michael C Neale; Pamela Madden; Marcella Rietschel; Mary L Marazita; Matthew McGue; Hyejung Won; Georg Winterer; Richard Grucza; Danielle M Dick; Joel Gelernter; Neil E Caporaso; Timothy B Baker; Dorret I Boomsma; Jaakko Kaprio; John E Hokanson; Scott Vrieze; Laura J Bierut; Eric O Johnson; Dana B Hancock
Journal:  Nat Commun       Date:  2020-11-03       Impact factor: 17.694

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.