Literature DB >> 27113016

Genome-Wide Association of the Laboratory-Based Nicotine Metabolite Ratio in Three Ancestries.

James W Baurley1, Christopher K Edlund1, Carissa I Pardamean1, David V Conti1, Ruth Krasnow2, Harold S Javitz2, Hyman Hops3, Gary E Swan4, Neal L Benowitz5, Andrew W Bergen1.   

Abstract

INTRODUCTION: Metabolic enzyme variation and other patient and environmental characteristics influence smoking behaviors, treatment success, and risk of related disease. Population-specific variation in metabolic genes contributes to challenges in developing and optimizing pharmacogenetic interventions. We applied a custom genome-wide genotyping array for addiction research (Smokescreen), to three laboratory-based studies of nicotine metabolism with oral or venous administration of labeled nicotine and cotinine, to model nicotine metabolism in multiple populations. The trans-3'-hydroxycotinine/cotinine ratio, the nicotine metabolite ratio (NMR), was the nicotine metabolism measure analyzed.
METHODS: Three hundred twelve individuals of self-identified European, African, and Asian American ancestry were genotyped and included in ancestry-specific genome-wide association scans (GWAS) and a meta-GWAS analysis of the NMR. We modeled natural-log transformed NMR with covariates: principal components of genetic ancestry, age, sex, body mass index, and smoking status.
RESULTS: African and Asian American NMRs were statistically significantly (P values ≤ 5E-5) lower than European American NMRs. Meta-GWAS analysis identified 36 genome-wide significant variants over a 43 kilobase pair region at CYP2A6 with minimum P = 2.46E-18 at rs12459249, proximal to CYP2A6. Additional minima were located in intron 4 (rs56113850, P = 6.61E-18) and in the CYP2A6-CYP2A7 intergenic region (rs34226463, P = 1.45E-12). Most (34/36) genome-wide significant variants suggested reduced CYP2A6 activity; functional mechanisms were identified and tested in knowledge-bases. Conditional analysis resulted in intergenic variants of possible interest (P values < 5E-5).
CONCLUSIONS: This meta-GWAS of the NMR identifies CYP2A6 variants, replicates the top-ranked single nucleotide polymorphism from a recent Finnish meta-GWAS of the NMR, identifies functional mechanisms, and provides pan-continental population biomarkers for nicotine metabolism. IMPLICATIONS: This multiple ancestry meta-GWAS of the laboratory study-based NMR provides novel evidence and replication for genome-wide association of CYP2A6 single nucleotide and insertion-deletion polymorphisms. We identify three regions of genome-wide significance: proximal, intronic, and distal to CYP2A6. We replicate the top-ranking single nucleotide polymorphism from a recent GWAS of the NMR in Finnish smokers, identify a functional mechanism for this intronic variant from in silico analyses of RNA-seq data that is consistent with CYP2A6 expression measured in postmortem lung and liver, and provide additional support for the intergenic region between CYP2A6 and CYP2A7.
© The Author 2016. Published by Oxford University Press on behalf of the Society for Research on Nicotine and Tobacco.

Entities:  

Mesh:

Substances:

Year:  2016        PMID: 27113016      PMCID: PMC4978985          DOI: 10.1093/ntr/ntw117

Source DB:  PubMed          Journal:  Nicotine Tob Res        ISSN: 1462-2203            Impact factor:   4.244


Introduction

The overall smoking rate in developed countries has significantly declined over the last 50 years.[1] However, tobacco-attributable disease remains the largest potentially modifiable cause of mortality,[2] with an annual (2004–2009) United States mortality between 380000 and 480000.[1,3] In 2012, more than 42 million adults in the United States were current cigarette smokers.[4] Lung cancer, the classic tobacco-attributable disease, continues to be the single largest cause of death due to cancer in the United States, and to cancer among males in the world.[5] The greatest incidence and mortality rates in the United States have been observed among African American males.[6,7] Overall smoking prevalence among Asian Americans is lower than other racial groups,[8] but reflects a complex relationship with native country differences in smoking rates by gender and acculturation.[9] Although lung cancer rates are falling in some groups of Asian Americans, they are rising in others.[10] Reduction of tobacco product use and associated morbidity and mortality will require enhanced tobacco control strategies, personalization of tobacco dependence treatment, and improved access to lung cancer treatment. Nicotine metabolism influences cigarette consumption,[11] tobacco exposures,[12,13] tobacco attributable disease, for example, lung cancer risk,[14,15] and response to smoking cessation treatments.[16,17] Nicotine metabolism[18] is primarily regulated by aldehyde oxidase (AO) and cytochrome oxidase 2A6 (CYP2A6), for the successive oxidation of nicotine to cotinine, by CYP2A6, for the oxidation of cotinine to trans-3′-hydroxycotinine, and by uridine diphosphate glycosyltransferase 2 family, members B10 (UGT2B10) for the N-glucuronidation of nicotine and cotinine, and B17 (UGT2B17),[19] for the O-glucuronidation of trans-3′-hydroxycotinine. Variation in nicotine metabolism activity is influenced by multiple patient characteristics,[20,21] including genetic variation in nicotine metabolism pathway genes.[18,22] Development of biomarkers to assess nicotine pharmacokinetics and pharmacodynamics may help optimize (personalize) smoking cessation treatment.[23,24] The ratio of trans-3′-hydroxycotinine/cotinine, termed the nicotine metabolite ratio (NMR), reflecting enzymatic activity of the major pathway for cotinine metabolism, has been shown to be a useful nicotine metabolism biomarker, due to its high correlation with the clearance of nicotine,[11] heritability (0.67 [95% CI 0.56–0.76][25] and 0.81 [95%CI 0.70–0.88][26]), stability, and measurement reliability.[27,28] Recent modeling of CYP2A6 variants in European American individuals and laboratory study derived metabolite ratios has demonstrated that such modeling can account for nearly half of NMR variance,[29] and that this metabolic activity metric can predict smoking cessation.[16] Motivated by the need for a more comprehensive model of nicotine metabolism, we present the first meta-genome-wide association scan (GWAS) of the laboratory study-based NMR data from multiple ancestries. We use several knowledge-bases to identify potential functional mechanisms for top associations, and compare our results with recent literature. This meta-GWAS provides evidence that specific polymorphisms can predict nicotine metabolism in multiple continental populations and suggests further analyses to characterize these findings in diverse ancestries. This work supports development of biomarkers for use in clinical and population-based studies of nicotine metabolism,[30] effects on tobacco product use,[31] related disease,[1] and cessation.[32]

Methods

Human Subjects

Institutional Review Board (IRB) approval for each study, and informed written consent from each participant, was obtained by the Principal Investigators of each study. IRB approval for these analyses was obtained from the Committee on Human Research at the University of California, San Francisco and the Human Subjects Committee at SRI International.

Study Design

For our GWAS, we selected data and DNA samples from three existing laboratory studies of nicotine metabolism. These studies were based on clinical administration of deuterium-substituted nicotine and cotinine followed by body fluid collection and analysis of nicotine metabolites via established gas chromatography/tandem mass spectroscopy methods.[11] We selected DNA samples from unrelated African-American, Asian-American, and European-American ancestry individuals (N = 326 total) for genotyping on the Smokescreen Genotyping Array.[33] For all individuals in this analysis, the NMR is defined as the trans-3′-hydroxycotinine to cotinine ratio obtained from the 6h, postadministration biospecimen (blood or saliva). Pharmacokinetics in Twins (“PKTWIN”)[34]: European-American participants were recruited from the Northern California Twin Registry to investigate heritable components of nicotine metabolism. Participants consented to 30 min venous administration of deuterium-labeled nicotine and cotinine, followed by an 8 h hospital stay for blood and urine sample collections. Pharmacogenetic Study of Nicotine Metabolism (“588”)[11]: European-, African-, and Asian-American smokers and nonsmokers of both sexes were recruited from the San Francisco Bay Area through multi-media advertisements for a nicotine metabolism study. Participants consented to morning oral administration of labeled nicotine and cotinine. The following biospecimens were collected: saliva up to 60h after dosing; blood up to 480min; and urine up to 8h. SMOking in FAMilies (“SMOFAM”)[35]: Individuals from 61 pedigrees with at least three ever-smokers per pedigree were recruited from the Pacific Northwest to assess the relations between genetic factors, environmental factors, and tobacco use. Participants completed a clinical study of nicotine metabolism and consented to oral administration of a fixed dose of deuterium-labeled nicotine and cotinine (non-nicotine users were given unlabeled cotinine) at home monitored by a nurse, followed by salivary sample collection at multiple timepoints as well as a blood sample for DNA analysis.

Sample Genotyping

DNA samples were genotyped on the Smokescreen Genotyping Array at RUCDR Infinite Biologics (Piscataway, NJ). The Smokescreen Genotyping Array is a custom, genome-wide array for research on smoking behavior, addiction, pharmacological treatment, and related disease.[33] It contains 646247 single nucleotide polymorphisms (SNPs) and indels for discovery and characterization studies. Genes (N = 1014) relevant to addiction and smoking-related phenotypes were identified by literature search, expert nomination, and biological knowledge-bases. The array contains dense coverage of common variation (MAF ≥ 0.05) for these genes (±20 kilo basepairs [kbp], 255862 markers) in African (specifically Yoruba in Ibadan, Nigeria, 97.5% at r 2 ≥ 0.80, 95.2% at r 2 ≥ 0.9), East Asian (98.1% at r 2 ≥ 0.80, 97.0% at r 2 ≥ 0.9), and European (98.1% at r 2 ≥ 0.80, 97.0% at r 2 ≥ 0.9) populations. About 17632 rare exonic coding variants are also included for these genes. The array contains nearly complete coverage of the chr19q13.2 nicotine-metabolizing enzyme genes, CYP2A6 (612 markers) and CYP2B6 (1628 markers). Additional markers are included for tagging of common SNPs over the entire region (chr19:41285047–41574301), including EGLN2, CYP2A7, CYP2G1P, and CYP2B7P1.

Genotyping Quality Control and Imputation

Genotypes and sample-level quality statistics were generated from raw CEL files using the Affymetrix Power Tools software suite (v. 1.16.0)[36] and Smokescreen library and annotation files (v. r2). In brief, samples with a Dish QC value <0.82, stage I genotyping call rate <97%, scan failures, positive controls added by the genotyping lab, samples with discordant genotyped versus reported sex, monozygotic twins, siblings, and replicate samples with the lowest call rate were removed. The Affymetrix SNPolisher software (v. 1.5.0) was used to determine the best probeset for each marker and categorize markers based on quality. Markers with their best probeset categorized as CallRateBelowThreshold (N = 5987), Hemizygous (N = 181), Other (N = 49 999), Off-target variant (N = 1866), or which failed previous validation by the manufacturer and were categorized as MonoHighResolution (N = 7918) were removed. All markers overlapping the 1000 Genomes Project reference set (Phase 3, version 5 downloaded from the Minimac3 website[37]), were matched to the forward strand. Study genotypes were pre-phased using MACH (v1.0),[38] using 200 states, 20 rounds, and all other default settings. Markers having a concordance <95% (N = 22554) in previously Smokescreen-genotyped 1000 Genomes samples (N = 61) were excluded from the inference set before imputation. Minimac3 (v1.0.7) was used to impute markers in the 1000 Genomes Project reference set in 5 million basepair (Mbp) chunks for chromosomes 1–22, using a 250 kbp buffer for each chunk and all other default settings.

Statistical Analysis

Sample characteristics were summarized by study and ancestry. Chi square tests (for categorical variables) and F-tests (for continuous variables) were used to test for differences in groups. A significance level of 0.05 was used for these tests. Linear regression was used to model the relationship between the natural log transformed NMR and imputation estimated allelic dosages, adjusting for age, ancestry, gender, body mass index, and smoking status. We incorporated covariates available to us across the three studies, based on prior knowledge[18] and practice,[25] and recently validated in a large (N = 1672) sample of individuals screened for a clinical trial.[21] Ancestry was estimated using principal-components analysis of 5545 ancestry informative markers.[39] Genome-wide association scans were performed separately for self-reported European, African, and Asian American individuals, then combined in an inverse-variance based meta-analysis.[40] All regression analysis was performed in [R].[41] We excluded variants from reporting when the estimate of the squared correlation between imputed and true genotypes was less than 0.3 (cutoff recommended in the MACH software documentation) or the variant was rare (observed allele frequency < 0.01) in one or more of the study groups. P-values were evaluated for deviations from the expected distribution using quantile-quantile (QQ) plots and summarized in genome-wide association plots using the [R] qqman package.[42] All variant-NMR associations with meta-GWAS P-values <5.0E-5 were reported. A P <5.0E-8 was considered genome-wide significant (GWS). Additional model goodness of fit and conditional analyses were performed for top associations. After initial review, we performed a post hoc smoking status × SNP interaction analysis of the CYP2A6 region. Extraction of epigenetic annotation from the Roadmap Epigenomics Consortium[43] was performed for four top-ranked SNPs using HaploReg version 4.1[44] and the Epigenome Browser.[45] Splicing Based Analysis of Variants (SPANR),[46] a bioinformatics resource using RNA-seq data from the Human Body Map Project (GSE30611, 16 tissues from 16 individuals), was used to estimate effects of one intronic variant on CYP2A6 exon 5 splicing. The Genotype-Tissue Expression (GTEx) project[47] (Version 4, build 200) was queried to investigate the relationship between three top-ranked SNPs and CYP2A6 expression in two relevant tissues; we report effect sizes and P-values obtained from GTEx and apply a Bonferroni correction to GTEx results. Chromosome coordinates refer to the hg19 assembly. Quanto[48] was used to estimate sample sizes for future GWAS studies.

Results

Of 326 study participants genotyped, three samples failed genotyping; four samples were unexpectedly related (full siblings or monozygotic twins) or replicates; and four had differences in chromosomal and clinical genders. Three European American ancestry individuals had missing clinical variables and were excluded. Forty-nine African, 51 Asian, and 212 European American ancestry individuals, comprising 312 individuals total, were included in statistical analyses (Table 1). The 588 study was the source of all African American and Asian American individuals while European American individuals originated from all three laboratory studies. The African American group had the highest mean age. A majority of the PKTWIN and SMOFAM samples were female. SMOFAM participants had the highest mean body mass index. The highest proportion of current smokers were found among European American individuals from the 588 study. We observed significantly lower NMRs in African American and Asian American individuals than in European American individuals (P = 5.0E-5 and P = 2.4E-6, respectively). In this study, we did not detect a statistically significant difference between African American and Asian American NMRs (P = 0.58).
Table 1.

Characteristics of Individuals from Nicotine Metabolism Laboratory Studies (N = 312)

Study588PKTWINa SMOFAMa P
AncestryAfricanAsianEuropeanEuropeanEuropean
N 4951679550
Age, M (SD)38.3 (11.3)28.3 (7.41)34.3 (12.2)36.9 (11.8)28.1 (1.38)<.0001
Sex (% F)38.749.044.874.758.0<.0001
BMI, M (SD)26.5 (4.20)23.8 (2.87)24.9 (3.71)25.1 (4.13)28.6 (7.28)<.0001
Smoking (%)53.129.456.718.946.0<.0001
NMR−1.79 (0.50)−1.84 (0.56)−1.42 (0.47)−1.47 (0.57)−1.46 (0.45)0.0076

588 and SMOFAM = oral administration of labeled nicotine and cotinine and collection and analysis of saliva (SMOFAM, only saliva) and blood (plasma); PKTWIN = venous administration of labeled nicotine and cotinine and analysis of blood (plasma); Natural log transformed nicotine metabolite ratio (NMR) values are reported.

aUnrelated individuals only.

Characteristics of Individuals from Nicotine Metabolism Laboratory Studies (N = 312) 588 and SMOFAM = oral administration of labeled nicotine and cotinine and collection and analysis of saliva (SMOFAM, only saliva) and blood (plasma); PKTWIN = venous administration of labeled nicotine and cotinine and analysis of blood (plasma); Natural log transformed nicotine metabolite ratio (NMR) values are reported. aUnrelated individuals only. After filtering rare and poorly imputed genetic variants, 5891179 variants remained in the analyses. Quantile-quantile plots of each genome-wide scan and the meta-analysis indicated no inflation of the observed P-values over expected (Supplementary Figure 1); genomic inflation factors were 0.996, 0.993, 0.998, and 1.039 for European, African and Asian American GWAS analyses, and meta-GWAS analysis, respectively. Meta-GWAS analysis of the natural log transformed, laboratory study-based NMR identified a single region of 43 kbp with 36 GWS results (34 are SNPs, and 2 are insertion–deletion polymorphisms) at the chr19q13.2 CYP2A6 locus (Figure 1 and Supplementary Table 1). While most GWS findings in the meta-GWAS were with common variants, 5, 1, and 5 SNPs of 36 GWS findings had minor allele frequencies between 0.01 and 0.05, in African, Asian, and European American samples, respectively. Regional associations in the CYP2A6-CYP2A7 region are presented in Figure 1 with select details presented in Table 2. A region in the 3′ flanking region of the transcript, ~9.5 kbp proximal to CYP2A6, and a region in intervening sequence 4 (IVS4), have the most statistically significant meta-GWAS results. The 3′ flank top SNP (rs12459249, chr19:41339896), and the IVS4 top SNP (rs56113850, chr19:41353107) exhibit P values 2.46E-18 and 6.61E-18. A third region intergenic between CYP2A6 and CYP2A7 (a distance of 25 kbp) contained two local P value minima, ~7 and ~18 kbp distal of CYP2A6 (rs7247098, chr19:41363098 and rs34226463, chr19:41374558), with P values 1.68E-11 and 1.45E-12. There is substantial linkage disequilibrium between the three regions in East Asian (EAS) and European (EUR) 1000 Genomes Project populations (all r 2 > 0.51) with more modest linkage disequilibrium observed in African (AFR) 1000 Genomes Project populations (all r 2 < 0.40) (Supplementary Figures 3–5). Conditional analyses adjusting for rs12459249 (smallest P-value in the meta-GWAS) resulted in no additional GWS findings; rs7247903 (GWS in the original meta-GWAS) in the CYP2A6-CYP2A7 intergenic region had the smallest P-value (P = 1.22E-6) (Table 2 and Supplementary Figure 3). Linkage disequilibrium values between the post-conditional top ranked SNP rs7247903 and the top ranked proximal, intronic, and intergenic meta-GWAS findings are low in 1000 Genomes Project populations (all r 2 < 0.14), suggesting that this variant represents a second intergenic signal. Of the five SNPs cited above, rs7247098 and rs34226463 were imputed, and rs12459249, rs56113850 and rs7247903 were genotyped. No results at the GWS (P < 5E-8) or reporting (P < 5E-5) thresholds were observed for a smoking status × SNP interaction analysis of the CYP2A6 region.
Figure 1.

Region of GWS meta-analysis results plus flanking 20 kbp by ancestry and overall.

Table 2.

Selected chr19q13.2 Ancestry-Specific GWAS and Meta-GWAS Analysis Results

GWASRSIDrs12459249rs4001926rs56113850rs7247098rs7247903a rs7247903b rs34226463
Coor41339896413480624135310741363098413724754137247541374558
African AmericanAltAF0.310.860.490.480.160.160.60
B −0.58−0.03−0.40−0.38−0.30−0.28−0.43
SE0.110.160.110.160.130.100.17
P 5.69E-06c 8.62E-015.22E-041.93E-022.72E-028.91E-031.81E-02
R 2 0.510.170.390.280.270.590.28
Asian AmericanAltAF0.520.560.580.380.180.180.57
B −0.40−0.38−0.36−0.27−0.38−0.19−0.37
SE0.090.080.090.120.120.130.11
P 1.12E-046.51E-05d 1.62E-042.86E-023.69E-031.61E-011.20E-03
R 2 0.510.520.500.380.430.530.46
European AmericanAltAF0.320.490.320.340.050.050.46
B −0.31−0.25−0.33−0.32−0.49−0.37−0.31
SE0.050.050.050.050.100.090.05
P 1.02E-08 6.96E-07 3.81E-10 e 1.67E-08 5.87E-071.30E-04 4.14E-08
R 2 0.290.260.310.290.260.340.28
Meta B −0.37−0.26−0.34−0.32−0.42−0.30−0.33
SE0.040.040.040.050.070.060.05
P 2.46E-18 f 9.94E-11 6.61E-18 1.68E-11 2.66E-10 1.22E-06g 1.45E-12

RSID = SNP identifier; Coor = chr19 basepair location; AltAF = alternative allele frequency; B = beta; SE = standard error. R 2 = proportion of natural log NMR variance. P = GWAS natural log NMR model or meta-analysis inverse variance P value. Bold values were to indicate the SNPs with P values < 5E-8, the GWS threshold.

aGWAS and meta-GWAS analysis results.

bGWAS and meta-GWAS analysis results, adjusting for rs12459249.

cTop ranked African American SNP.

dTop ranked Asian American SNP.

eTop ranked European American SNP.

fTop ranked meta-GWAS analysis result.

gTop ranked meta-GWAS analysis result, adjusting for rs1245249.

Region of GWS meta-analysis results plus flanking 20 kbp by ancestry and overall. Selected chr19q13.2 Ancestry-Specific GWAS and Meta-GWAS Analysis Results RSID = SNP identifier; Coor = chr19 basepair location; AltAF = alternative allele frequency; B = beta; SE = standard error. R 2 = proportion of natural log NMR variance. P = GWAS natural log NMR model or meta-analysis inverse variance P value. Bold values were to indicate the SNPs with P values < 5E-8, the GWS threshold. aGWAS and meta-GWAS analysis results. bGWAS and meta-GWAS analysis results, adjusting for rs12459249. cTop ranked African American SNP. dTop ranked Asian American SNP. eTop ranked European American SNP. fTop ranked meta-GWAS analysis result. gTop ranked meta-GWAS analysis result, adjusting for rs1245249. Ancestry-specific GWS results were only found for the largest ancestry group, covering 35 kbp, extending from the 3′ to the 5′ flanks of the CYP2A6 transcript. The top SNP in the European American ancestry group GWAS was rs56113850 (B = −0.31, SE = 0.05, P = 3.81E-10), found 151bp 5′ of exon 5; in the African and Asian American samples, this SNP had Β and P values of −0.40 and 5.2E-4 and of −0.36 and 1.6E-4, respectively (Table 2). The top CYP2A6 SNP in the African American GWAS was rs12459249 (Β = −0.58, SE = 0.11, P = 5.69E-6), found ~9.5 kbp proximal to CYP2A6; in the Asian American and European American samples, this SNP had Β and P values of −0.40 and 1.1E-4 and of −0.31 and 1.0E-8, respectively. The top CYP2A6 SNP in the Asian American GWAS was rs4001926 (Β = −0.38, SE = 0.09, P = 6.5E-5), found ~1.3 kbp proximal to CYP2A6; this SNP had no evidence of association in African Americans, but did have evidence of association in European Americans (Table 2). HaploReg identified epigenetic annotations from lung and liver tissues for the top-ranked SNPs (Supplementary Table 2). Observed epigenetic annotations reported by the EpiGenomics Consortium included H3K4me1 marks at rs12459249 and at rs56113850 in liver, H3K4me1 histone marks at rs56113850 in lung, and H3K27ac and H3K9ac histone marks at rs56113850 in liver. Histone methylation marks identified the edge of an enhancer at rs12459249 in liver, and weak transcription at rs56113850 in lung and liver. Using SPANR, the C (alternate) and T alleles of rs56113850 predict inclusion of exon 5 in 11% and in 46.5% of CYP2A6 transcripts, respectively. GTEx eQTL analyses identified statistically significant reductions of CYP2A6 expression in postmortem lung (N = 278) and liver (N = 97) (Table 3). For rs12459249, effect sizes were similar in both tissues, but not statistically significant. For rs56113850, effects were stronger and statistically significant (P values < 0.0005) in both tissues. Neither intergenic GWS SNP was available in the GTEx eQTL database, however, rs4001921 (203 base pairs distal, with meta-GWAS P = 8.1E-11 and in LD with rs7247098, see Supplementary Table 1) exhibited similar effects in both tissues as rs56113850, but with statistical significance only in lung.
Table 3.

Genome-Wide Significant Single Nucleotide Polymorphisms and Association with CYP2A6 Expressiona

SNPrs12459249rs56113850rs4001921b
GWS locusproximalintronicIntergenic
Lung effect−0.15−0.28−0.24
Lung P 0.0370.000480.0019
Liver effect−0.14−0.23−0.22
Liver P 0.0910.000380.016

aFrom GTEx.

b r 2 = 0.92, 0.98, and 0.91 with rs7247098 in AFR, EAS, and EUR 1000 Genome Project populations.

Genome-Wide Significant Single Nucleotide Polymorphisms and Association with CYP2A6 Expressiona aFrom GTEx. b r 2 = 0.92, 0.98, and 0.91 with rs7247098 in AFR, EAS, and EUR 1000 Genome Project populations.

Discussion

We have identified a region of GWS association with the laboratory-based NMR among individuals from three continental ancestries, that is, with SNPs proximal (rs12459249), intronic (rs56113850), and distal (rs7247098 and rs34226463) to CYP2A6 with P values < 1.0E-17 and < 1.0E-10, respectively. The distal intergenic region shows evidence of independent association after conditioning on rs12459249. This study replicates the top ranked finding in a meta-GWAS of the NMR in a Finnish population,[26] that is, rs56113850, which is second ranked in our meta-GWAS and top ranked in our European American sample. Based on databases derived from molecular analyses of postmortem human tissues, we propose that reduced transcription explains the reduction in the NMR associated with the intronic variant rs56113850. The significance and effect size of the top ranked CYP2A6 SNPs in our meta-GWAS of the laboratory-based NMR demonstrates the power of performing analyses with metabolite ratios.[49] This meta-GWAS significantly improved upon the candidate gene-based analyses of the laboratory-based NMR previously performed using samples from two of the three cohorts,[50] identifying SNPs of greater effect size and significance in a smaller sample of unrelated individuals from three cohorts. This is due to the much greater density of markers for drug metabolizing enzyme genes on the Smokescreen array than on the earlier DMET Plus array.[51] The Smokescreen array was designed to capture rare and common variants found in the 1000 Genomes Project and Exome Sequencing Project for the CYP2A6-CYP2B6 region (chr19:41285047–41574301). Another strength is that we were able to use three independent sources of epigenetic and transcript data to propose functional mechanisms. In particular, GWS replication and mechanistic data unambiguously supports use of rs56113850 in future European ancestry nicotine and tobacco research. Sample size was a limitation for the African American and Asian American NMR GWAS analyses. Linkage disequilibrium patterns differ among ancestries (Table 2 and Supplementary Figures 3–5), influencing power to detect effects on the NMR in a meta-GWAS. Here, we nominate SNPs that may serve as candidates for NMR research in three ancestries (we indicated in Table 2 the most significant SNP in each ancestry-specific GWAS). Larger studies are required before these candidate SNPs can be considered as defined markers for tobacco research in African American and Asian American populations. A minor limitation of this research is that we lacked information on all possible covariates influencing the NMR[18] in all three studies. Thus, we did not include alcohol consumption, information on menopausal status, use of estrogen-based prophylactic hormones, or hormone replacement therapy; with one post hoc exception, we did not incorporate covariate interactions into the model.[21] Observing GWS at classical functional alleles will require larger sample sizes than available in this analysis.[52]

CYP2A6 Region SNPs Previously Identified at GWS With Nicotine Metabolism and Related Phenotypes

The recent meta-GWAS of the NMR in three Finnish cohorts totaling 1518 smokers,[26] identified hundreds of GWS SNPs in a 4.2 Mbp region on chr19q13.2. The top ranked SNP in that study, rs56113850 (β = −0.65, P = 5.8E-86), explained 0.14–0.23 of NMR variance. In a drug metabolizing enzyme and transporter candidate gene study, rs4803381, located in the CYP2A6 promoter (c.−1013), and rs1137115, located in exon 1 (c.51) of CYP2A6, were associated with the NMR at GWS in treatment-seeking smokers (β = −0.280, P = 1.25E-21, N = 633 individuals, and β = −0.240, P = 7.30E-14, N = 614 individuals), explaining 0.13 and 0.09 of NMR variance.[50] The CYP2A6 locus has been associated in meta-GWAS analyses with reduced cigarette consumption at rs4105144, found ~2.7 kbp distal with P = 2.2E-12 in 83317 European smokers, with an effect size of 0.39 cigarettes/day.[53] CYP2A6 SNPs rs8102683 (intergenic, ~7 kbp distal and tagging a large copy number polymorphism), and rs11878603 (proximal to CYP2A6), were identified with P values 4.3E-26 and 9.7E-30, and accounted for 2% of the variance in cigarette consumption in 17158 Japanese smokers.[54] A meta-GWAS in 9614 non-Hispanic Whites and African Americans identified association of rs56113850 with moderate centrilobular emphysema (overall P = 1.3E-9, non-Hispanic White P = 1.2E-6, African American P = 2.1E-4), with an allele effect size reduction of 2% in both groups.[55]

Future Analyses of the NMR and Related Phenotypes

Analytic validity,[28] clinical validity,[17] and clinical utility[56] of the NMR have been recently established. Our analysis and that of Loukola et al. find the same GWS variant in European ancestry studies with modest sample sizes. We consider what sample sizes are needed to identify GWS for future analyses of the NMR or related phenotypes. The only ancestry sample that yielded GWS signals in this study was the European American sample of 212 individuals. Based on univariate power analyses, ancestry-specific NMR distributions (Table 1) and effect sizes (Table 2) of top ranked SNPs in the African American (rs12459249) and Asian American (rs4001926) GWAS analyses, modest sample sizes (<100 participants) will be sufficient to observe GWS association with the NMR at these SNPs. Based on univariate power analyses, prior effect sizes of CYP2A6 SNPs associated with cigarette consumption at GWS in European (rs4105144) and Japanese (rs8102683) smokers, and recent cigarette consumption in the European Union[57] and in Japan,[54] sample sizes of ~31500 and ~1900 smokers would be required to observe GWS association with cigarette consumption of these CYP2A6 variants in European and Japanese populations.

Implications for Exposure, Attributable Disease, and Cessation Research

Genetics has an overarching influence across components of nicotine dependence.[58] The CYP2A6 gene product plays a major role in the nicotine metabolism pathway,[59,60] which influences nicotine intake.[22,61] Nicotine binds to nicotinic cholinergic receptors (nAChRs), triggering neurotransmitter release, which, over time, leads to nicotine dependence. The development of nicotine dependence is influenced by variants at cholinergic receptor genes[62] and CYP2A6,[63] among other genes.[53,64-67] Given the global nature of the tobacco epidemic and efforts to personalize prevention and treatment, additional ancestries in the United States and in other countries need to be included in tobacco biomarker genomic analyses to discover and characterize variants at CYP2A6 and elsewhere in the genome for use in nicotine and tobacco research. Analyses beyond single SNP models, that is, integrating haplotypic, functional genomic, and clinical data, will improve our understanding of these genes’ influence on nicotine metabolism and smoking behaviors and can help characterize tobacco product exposures and risks for smoking-attributable diseases[14,68] and co-morbidities,[69] as well as development of personalized therapies in multiple populations.[70]

Supplementary Material

Supplementary Tables 1 and 2 and Figures 1–6 can be found online at http://www.ntr.oxfordjournals.org

Funding

This work was supported by the National Institute on Drug Abuse, National Institutes of Health, United States Department of Health and Human Services (SMOKESCREEN: GENETIC SCREENING TOOL FOR TOBACCO DEPENDENCE & TREATMENT [HHS 271201300004C-0-0-1 to JWB] and DMET Genes, Nicotine Metabolism and Prospective Abstinence [DA033813 to AWB]). Computing was supported by an Amazon Web Services Research Grant to JWB. Previous funding that supported collection, analysis, and curation of the three datasets included in this analysis was received from the National Institute on Drug Abuse: Pharmacokinetics and Pharmacodynamics of Nicotine (DA002277 to NLB), Young Adult Substance Use—Predictors and Consequences (DA003706 to HH), Pharmacokinetics of Nicotine in Twins (DA011170 to GES), Pharmacogenetics of Nicotine Addiction Treatment Consortium (DA020830 to NLB, Rachel F. Tyndale, and Caryn Lerman); and from the Tobacco-Related Disease Research Program of the University of California: Nicotine Metabolism in Families (7PT-2004 to NLB).

Declaration of Interests

JWB reports ownership of BioRealm LLC and intention to commercialize the Smokescreen Genotyping Array as a potential competing interest. CKE reports employment by BioRealm LLC and a financial interest in the Smokescreen Genotyping Array as potential competing interests. CIP reports employment by BioRealm LLC as a potential competing interest. DVC, RK, HSJ, HH, GES, and NLB report no potential competing interests. HSJ and NLB disclose service as paid expert witnesses in litigation against tobacco companies. NLB discloses consultation with pharmaceutical companies that market smoking cessation medications. AWB reports past employment at SRI International as a potential competing interest.
  64 in total

1.  Principal components analysis corrects for stratification in genome-wide association studies.

Authors:  Alkes L Price; Nick J Patterson; Robert M Plenge; Michael E Weinblatt; Nancy A Shadick; David Reich
Journal:  Nat Genet       Date:  2006-07-23       Impact factor: 38.330

2.  Race, gender, and nicotine metabolism in adolescent smokers.

Authors:  Mark L Rubinstein; Saul Shiffman; Michelle A Rait; Neal L Benowitz
Journal:  Nicotine Tob Res       Date:  2012-12-13       Impact factor: 4.244

3.  Genome-wide association study of nicotine dependence in American populations: identification of novel risk loci in both African-Americans and European-Americans.

Authors:  Joel Gelernter; Henry R Kranzler; Richard Sherva; Laura Almasy; Aryeh I Herman; Ryan Koesterer; Hongyu Zhao; Lindsay A Farrer
Journal:  Biol Psychiatry       Date:  2014-09-16       Impact factor: 13.382

4.  The contribution of common CYP2A6 alleles to variation in nicotine metabolism among European-Americans.

Authors:  Joseph Bloom; Anthony L Hinrichs; Jen C Wang; Linda B von Weymarn; Evan D Kharasch; Laura J Bierut; Alison Goate; Sharon E Murphy
Journal:  Pharmacogenet Genomics       Date:  2011-07       Impact factor: 2.089

5.  Multiple independent loci at chromosome 15q25.1 affect smoking quantity: a meta-analysis and comparison with lung cancer and COPD.

Authors:  Nancy L Saccone; Robert C Culverhouse; Tae-Hwi Schwantes-An; Dale S Cannon; Xiangning Chen; Sven Cichon; Ina Giegling; Shizhong Han; Younghun Han; Kaisu Keskitalo-Vuokko; Xiangyang Kong; Maria Teresa Landi; Jennie Z Ma; Susan E Short; Sarah H Stephens; Victoria L Stevens; Lingwei Sun; Yufei Wang; Angela S Wenzlaff; Steven H Aggen; Naomi Breslau; Peter Broderick; Nilanjan Chatterjee; Jingchun Chen; Andrew C Heath; Markku Heliövaara; Nicole R Hoft; David J Hunter; Majken K Jensen; Nicholas G Martin; Grant W Montgomery; Tianhua Niu; Thomas J Payne; Leena Peltonen; Michele L Pergadia; John P Rice; Richard Sherva; Margaret R Spitz; Juzhong Sun; Jen C Wang; Robert B Weiss; William Wheeler; Stephanie H Witt; Bao-Zhu Yang; Neil E Caporaso; Marissa A Ehringer; Tim Eisen; Susan M Gapstur; Joel Gelernter; Richard Houlston; Jaakko Kaprio; Kenneth S Kendler; Peter Kraft; Mark F Leppert; Ming D Li; Pamela A F Madden; Markus M Nöthen; Sreekumar Pillai; Marcella Rietschel; Dan Rujescu; Ann Schwartz; Christopher I Amos; Laura J Bierut
Journal:  PLoS Genet       Date:  2010-08-05       Impact factor: 5.917

6.  Using a cessation-related outcome index to assess California's cessation progress at the population level.

Authors:  Hao Tang; Erin Abramsohn; Hye-Youn Park; David W Cowling; Wael K Al-Delaimy
Journal:  Tob Control       Date:  2010-04       Impact factor: 7.552

7.  Drug Metabolizing Enzyme and Transporter Gene Variation, Nicotine Metabolism, Prospective Abstinence, and Cigarette Consumption.

Authors:  Andrew W Bergen; Martha Michel; Denise Nishita; Ruth Krasnow; Harold S Javitz; Karen N Conneely; Christina N Lessov-Schlaggar; Hyman Hops; Andy Z X Zhu; James W Baurley; Jennifer B McClure; Sharon M Hall; Timothy B Baker; David V Conti; Neal L Benowitz; Caryn Lerman; Rachel F Tyndale; Gary E Swan
Journal:  PLoS One       Date:  2015-07-01       Impact factor: 3.240

8.  Smokescreen: a targeted genotyping array for addiction research.

Authors:  James W Baurley; Christopher K Edlund; Carissa I Pardamean; David V Conti; Andrew W Bergen
Journal:  BMC Genomics       Date:  2016-02-27       Impact factor: 3.969

9.  Haplotypes with copy number and single nucleotide polymorphisms in CYP2A6 locus are associated with smoking quantity in a Japanese population.

Authors:  Natsuhiko Kumasaka; Masayuki Aoki; Yukinori Okada; Atsushi Takahashi; Kouichi Ozaki; Taisei Mushiroda; Tomomitsu Hirota; Mayumi Tamari; Toshihiro Tanaka; Yusuke Nakamura; Naoyuki Kamatani; Michiaki Kubo
Journal:  PLoS One       Date:  2012-09-25       Impact factor: 3.240

10.  Integrative analysis of 111 reference human epigenomes.

Authors:  Anshul Kundaje; Wouter Meuleman; Jason Ernst; Misha Bilenky; Angela Yen; Alireza Heravi-Moussavi; Pouya Kheradpour; Zhizhuo Zhang; Jianrong Wang; Michael J Ziller; Viren Amin; John W Whitaker; Matthew D Schultz; Lucas D Ward; Abhishek Sarkar; Gerald Quon; Richard S Sandstrom; Matthew L Eaton; Yi-Chieh Wu; Andreas R Pfenning; Xinchen Wang; Melina Claussnitzer; Yaping Liu; Cristian Coarfa; R Alan Harris; Noam Shoresh; Charles B Epstein; Elizabeta Gjoneska; Danny Leung; Wei Xie; R David Hawkins; Ryan Lister; Chibo Hong; Philippe Gascard; Andrew J Mungall; Richard Moore; Eric Chuah; Angela Tam; Theresa K Canfield; R Scott Hansen; Rajinder Kaul; Peter J Sabo; Mukul S Bansal; Annaick Carles; Jesse R Dixon; Kai-How Farh; Soheil Feizi; Rosa Karlic; Ah-Ram Kim; Ashwinikumar Kulkarni; Daofeng Li; Rebecca Lowdon; GiNell Elliott; Tim R Mercer; Shane J Neph; Vitor Onuchic; Paz Polak; Nisha Rajagopal; Pradipta Ray; Richard C Sallari; Kyle T Siebenthall; Nicholas A Sinnott-Armstrong; Michael Stevens; Robert E Thurman; Jie Wu; Bo Zhang; Xin Zhou; Arthur E Beaudet; Laurie A Boyer; Philip L De Jager; Peggy J Farnham; Susan J Fisher; David Haussler; Steven J M Jones; Wei Li; Marco A Marra; Michael T McManus; Shamil Sunyaev; James A Thomson; Thea D Tlsty; Li-Huei Tsai; Wei Wang; Robert A Waterland; Michael Q Zhang; Lisa H Chadwick; Bradley E Bernstein; Joseph F Costello; Joseph R Ecker; Martin Hirst; Alexander Meissner; Aleksandar Milosavljevic; Bing Ren; John A Stamatoyannopoulos; Ting Wang; Manolis Kellis
Journal:  Nature       Date:  2015-02-19       Impact factor: 69.504

View more
  20 in total

1.  Evaluation of a weighted genetic risk score for the prediction of biomarkers of CYP2A6 activity.

Authors:  Ahmed El-Boraie; Taraneh Taghavi; Meghan J Chenoweth; Koya Fukunaga; Taisei Mushiroda; Michiaki Kubo; Caryn Lerman; Nicole L Nollen; Neal L Benowitz; Rachel F Tyndale
Journal:  Addict Biol       Date:  2019-02-27       Impact factor: 4.280

2.  CYP2A6 genetic polymorphisms and biomarkers of tobacco smoke constituents in relation to risk of lung cancer in the Singapore Chinese Health Study.

Authors:  Jian-Min Yuan; Heather H Nelson; Steven G Carmella; Renwei Wang; Jacquelyn Kuriger-Laber; Aizhen Jin; Jennifer Adams-Haduch; Stephen S Hecht; Woon-Puay Koh; Sharon E Murphy
Journal:  Carcinogenesis       Date:  2017-04-01       Impact factor: 4.944

3.  Novel CYP2A6 diplotypes identified through next-generation sequencing are associated with in-vitro and in-vivo nicotine metabolism.

Authors:  Julie-Anne Tanner; Andy Z Zhu; Katrina G Claw; Bhagwat Prasad; Viktoriya Korchina; Jianhong Hu; HarshaVardhan Doddapaneni; Donna M Muzny; Erin G Schuetz; Caryn Lerman; Kenneth E Thummel; Steven E Scherer; Rachel F Tyndale
Journal:  Pharmacogenet Genomics       Date:  2018-01       Impact factor: 2.089

4.  Use of polygenic risk scores of nicotine metabolism in predicting smoking behaviors.

Authors:  Li-Shiun Chen; Sarah M Hartz; Timothy B Baker; Yinjiao Ma; Nancy L Saccone; Laura J Bierut
Journal:  Pharmacogenomics       Date:  2018-11-16       Impact factor: 2.533

5.  Genome-wide association study of a nicotine metabolism biomarker in African American smokers: impact of chromosome 19 genetic influences.

Authors:  Meghan J Chenoweth; Jennifer J Ware; Andy Z X Zhu; Christopher B Cole; Lisa Sanderson Cox; Nikki Nollen; Jasjit S Ahluwalia; Neal L Benowitz; Robert A Schnoll; Larry W Hawk; Paul M Cinciripini; Tony P George; Caryn Lerman; Joanne Knight; Rachel F Tyndale
Journal:  Addiction       Date:  2017-11-02       Impact factor: 6.526

Review 6.  Human Genetics of Addiction: New Insights and Future Directions.

Authors:  Dana B Hancock; Christina A Markunas; Laura J Bierut; Eric O Johnson
Journal:  Curr Psychiatry Rep       Date:  2018-03-05       Impact factor: 5.285

7.  Common and Rare Variants Genetic Association Analysis of Cigarettes per Day Among Ever-Smokers in Chronic Obstructive Pulmonary Disease Cases and Controls.

Authors:  Sharon M Lutz; Brittni Frederiksen; Ferdouse Begum; Merry-Lynn N McDonald; Michael H Cho; Brian D Hobbs; Margaret M Parker; Dawn L DeMeo; Craig P Hersh; Marissa A Ehringer; Kendra Young; Lai Jiang; Marilyn G Foreman; Greg L Kinney; Barry J Make; David A Lomas; Per Bakke; Amund Gulsvik; James D Crapo; Edwin K Silverman; Terri H Beaty; John E Hokanson
Journal:  Nicotine Tob Res       Date:  2019-05-21       Impact factor: 4.244

Review 8.  Biosignature Discovery for Substance Use Disorders Using Statistical Learning.

Authors:  James W Baurley; Christopher S McMahan; Carolyn M Ervin; Bens Pardamean; Andrew W Bergen
Journal:  Trends Mol Med       Date:  2018-02-04       Impact factor: 11.951

Review 9.  The Value of Biosamples in Smoking Cessation Trials: A Review of Genetic, Metabolomic, and Epigenetic Findings.

Authors:  Nancy L Saccone; James W Baurley; Andrew W Bergen; Sean P David; Hannah R Elliott; Marilyn G Foreman; Jaakko Kaprio; Thomas M Piasecki; Caroline L Relton; Laurie Zawertailo; Laura J Bierut; Rachel F Tyndale; Li-Shiun Chen
Journal:  Nicotine Tob Res       Date:  2018-03-06       Impact factor: 4.244

Review 10.  Pharmacogenetics factors influencing smoking cessation success; the importance of nicotine metabolism.

Authors:  Yadira X Perez-Paramo; Philip Lazarus
Journal:  Expert Opin Drug Metab Toxicol       Date:  2020-12-29       Impact factor: 4.481

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.