| Literature DB >> 34476898 |
Ahmed El-Boraie1,2, Julie-Anne Tanner3, Andy Z X Zhu4, Katrina G Claw5, Bhagwat Prasad6, Erin G Schuetz7, Kenneth E Thummel8, Koya Fukunaga9, Taisei Mushiroda9, Michiaki Kubo9, Neal L Benowitz10, Caryn Lerman11, Rachel F Tyndale1,2,12.
Abstract
CYP2A6 activity, phenotyped by the nicotine metabolite ratio (NMR), is a predictor of several smoking behaviors, including cessation and smoking-related disease risk. The heritability of the NMR is 60-80%, yet weighted genetic risk scores (wGRSs) based on common variants explain only 30-35%. Rare variants (minor allele frequency <1%) are hypothesized to explain some of this missing heritability. We present two targeted sequencing studies where rare protein-coding variants are functionally characterized in vivo, in silico, and in vitro to examine this hypothesis. In a smoking cessation trial, 1687 individuals were sequenced; characterization measures included the in vivo NMR, in vitro protein expression, and metabolic activity measured from recombinant proteins. In a human liver bank, 312 human liver samples were sequenced; measures included RNA expression, protein expression, and metabolic activity from extracted liver tissue. In total, 38 of 47 rare coding variants identified were novel; characterizations ranged from gain-of-function to loss-of-function. On a population level, the portion of NMR variation explained by the rare coding variants was small (~1%). However, upon incorporation, the accuracy of the wGRS was improved for individuals with rare protein-coding variants (i.e., the residuals were reduced), and approximately one-third of these individuals (12/39) were re-assigned from normal to slow metabolizer status. Rare coding variants can alter an individual's CYP2A6 activity; their integration into wGRSs through precise functional characterization is necessary to accurately assess clinical outcomes and achieve precision medicine for all. Investigation into noncoding variants is warranted to further explain the missing heritability in the NMR.Entities:
Mesh:
Substances:
Year: 2021 PMID: 34476898 PMCID: PMC8742641 DOI: 10.1111/cts.13135
Source DB: PubMed Journal: Clin Transl Sci ISSN: 1752-8054 Impact factor: 4.689
Rare CYP2A6 variants identified in study one
| Amino acid change | rsID | hg19 Position | hg19 REF | hg19 ALT | Transcript change | Frequency count (ancestry) | Novel or known | In silico prediction |
|---|---|---|---|---|---|---|---|---|
| I61F | rs200554095 | 41355885 | T | A | c.181A>T | 1 (AFR) | Novel | 0 = Neutral |
| R128L | rs4986891 | 41354629 | C | A | c.383G>T | 4 (AFR) |
| 1 = Deleterious |
| S131A | rs59552350 | 41354621 | A | C | c.391T>G | 4 (AFR) |
| 1 = Deleterious |
| R190L | rs571335587 | 41354209 | C | A | c.569G>T | 1 (AFR) | Novel | 0.8 = Deleterious |
| Q239K | rs138978736 | 41352896 | G | T | c.715C>A | 4 (AFR) | Novel | 0 = Neutral |
| P264T | rs761666827 | 41352821 | G | T | c.790C>A | 1 (AFR) | Novel | 0.8 = Deleterious |
| R265Q | rs140471703 | 41352817 | C | T | c.794G>A | 3 (AFR) |
| 1 = Deleterious |
| M275I | rs111869995 | 41352786 | C | A | c.825G>T | 1 (AFR) | Novel | 0.6 = Deleterious |
| E330D | rs137904044 | 41351370 | C | A | c.990G>T | 1 (AFR) | Novel | 0.8 = Deleterious |
| R333X | rs61605570 | 41351363 | T | A | c.997A>T | 3 (AFR) | Novel | 1 = Deleterious |
| M368I | rs772964366 | 41351256 | C | G | c.1104G>C | 1(AFR) | Novel | 0.2 = Neutral |
| T378I | rs114558780 | 41351227 | G | A | c.1133C>T | 3 (AFR) | Novel | 0.4 = Neutral |
| E419K | rs768416963 | 41350584 | C | T | c.1255G>A | 1 (AFR) | Novel | 0 = Neutral |
| I434V | rs1302192284 | 41350539 | T | C | c.1300A>G | 2 (AFR) | Novel | 0 = Neutral |
| R64C | rs374515279 | 41355875 | G | A | c.191G>A | 2 (1 AFR, 1 EUR) | Novel | 0 = Neutral |
| P35L | rs377713545 | 41356228 | G | A | c.104C>T | 1 (EUR) | Novel | 1 = Deleterious |
| R64H | rs374515279 | 41355875 | C | T | c.191G>A | 1 (EUR) | Novel | 0.4 = Neutral |
| E97K | rs145308399 | 41355777 | C | T | c.289G>A | 8 (EUR) | Novel | 0.6 = Deleterious |
| G115D | rs758479488 | 41354668 | C | T | c.344G>A | 1 (EUR) | Novel | 1 = Deleterious |
| V140G | rs777098658 | 41354593 | A | C | c.419T>G | 1 (EUR) | Novel | 0.6 = Deleterious |
| V140A | rs777098658 | 41354593 | A | G | c.419T>C | 1 (EUR) | Novel | 0.6 = Deleterious |
| D158E | rs60605885 | 41354538 | G | C | c.474C>G | 3 (EUR) |
| 0 = Neutral |
| L160I | rs60563539 | 41354534 | G | T | c.478C>A | 3 (EUR) |
| 0 = Neutral |
| K194E | rs199916117 | 41354198 | T | C | c.580A>G | 1 (EUR) |
| 0 = Neutral |
| R257C | rs145157460 | 41352842 | G | C | c.769C>T | 1 (EUR) | Novel | 0.4 = Neutral |
| R257G | rs145157460 | 41352842 | G | A | c.769C>G | 2 (EUR) | Novel | 0 = Neutral |
| E279Q | rs58261757 | 41351999 | C | G | c.835G>C | 1 (EUR) | Novel | 0.2 = Neutral |
| I300T | rs148693084 | 41351935 | A | G | c.899T>C | 2 (EUR) | Novel | 0 = Neutral |
| R311C | rs58571639 | 41351903 | G | A | c.931C>T | 1 (EUR+AFR) | Novel | 0.8 = Deleterious |
| YGFL312‐315L | Unavailable | 41351891 | AAGAAGCCAT | A | c.944del | 1 (EUR) | Novel | 1 = Deleterious |
| V323M | rs1303839356 | 41351867 | C | T | c.967G>A | 1 (EUR) | Novel | 0.4 = Neutral |
| R339Q | rs150247689 | 41351344 | C | T | c.1016G>A | 1 (EUR) | Novel | 0.8 = Deleterious |
| M352T | rs143841823 | 41351305 | A | G | c.1055T>C | 3 (EUR) | Novel | 0.2 = Neutral |
| F362S | rs778019189 | 41351275 | A | G | c.1085T>C | 1 (EUR) | Novel | 1 = Deleterious |
| S465T | rs746173331 | 41349793 | A | T | c.1393T>A | 1 (EUR) | Novel | 0 = Neutral |
| I471T | rs5031016 | 41349774 | A | G | c.1412T>C | 1 (EUR) |
| 0.6 = Deleterious |
| G479V | rs5031017 | 41349750 | C | A | c.1436G>T | 2 (EUR) |
| 1 = Deleterious |
In silico prediction based off framework using several prediction tools that are optimized for ADME or pharmacogenes. Novel is defined as not previously functionally characterized or part of a known CYP2A6 * allele. X: Premature stop codon.
Abbreviations: ADME, absorption, distribution, metabolism, and excretion; AFR, African ancestry; ALT, alternate allele on the positive strand of the hg19 reference genome build; EUR, European ancestry; REF, reference allele.
Participant self‐reported race is multi‐racial.
FIGURE 1Beeswarm plots of the rare coding variants plotted individually, and grouped, with their NMR values in the (a) AFR (N = 541) and (b) EUR (N = 1026) populations. Reference: Individuals without a CYP2A6 rare variant and without any common functional CYP2A6 * alleles. X: Premature stop Codon. Statistical analyses were based off the SKAT test statistic for continuous traits comparing the Rare Grouped and Reference groups. $$: p < 0.001 $$$: p < 0.0001. AFR, African‐ancestry; EUR, European‐ancestry; NMR, nicotine metabolite ratio
FIGURE 2(a) Western blot loading approximately equal POR amounts for each novel CYP2A6 rare variant identified in Smoking Cessation Trial. Sample loading was based on sample dilution curves (Figure S4), adjusting to achieve similar POR amounts between constructs (i.e., more sample was loaded for variant constructs with low expression), and within the respective POR and 2A6 linear ranges. (b) Bar graph displaying the CYP2A6/POR protein expression ratios for each variant construct. kDa, kilodalton; WT, wildtype expressed in E. coli; DEL, CYP2A6 excised from bicistronic construct with POR intact; C.E., Commercially Expressed CYP2A6 (CAT #456254; Corning); X, Premature stop Codon
Detailed V max, K m, V max/K m (Turnover), and percentage of WT (Turnover Rates) for all novel rare variants identified in the Smoking Cessation Trial
| Variant construct |
|
|
| % WT (turnover rate) |
|---|---|---|---|---|
| WT | 6.0 | 48.0 | 125 | 100 |
|
| 3.3 | 42.2 | 79 | 64 |
| P35L | 6.0 | 134.0 | 45 | 36 |
| I61F | 8.7 | 63.5 | 137 | 110 |
| R64H | 5.6 | 60.6 | 93 | 74 |
| R64C | 3.7 | 39.8 | 93 | 75 |
| E97K | 7.8 | 82.9 | 95 | 76 |
| Q239K | 12.7 | 67.2 | 189 | 152 |
| V140A | 5.7 | 35.7 | 160 | 128 |
| V140G | 3.5 | 34.9 | 100 | 80 |
| R190L | 2.9 | 153.4 | 19 | 15 |
| R257C | 5.0 | 124.3 | 40 | 32 |
| R257G | 5.4 | 152.7 | 35 | 28 |
| P264T | 3.1 | 87.4 | 36 | 29 |
| M275I | 7.3 | 68.6 | 106 | 85 |
| E279Q | 4.6 | 62.2 | 74 | 60 |
| I300T | 7.2 | 185.3 | 39 | 31 |
| R311C | 2.8 | 107.1 | 26 | 21 |
| YGFL312‐315L | 1.0 | 112.0 | 9 | 7 |
| V323M | 1.1 | 130.2 | 8 | 7 |
| E330D | 8.9 | 97.1 | 91 | 73 |
| R333X | 0.5 | 375.9 | 1 | 1 |
| R339Q | 8.2 | 41.0 | 201 | 161 |
| M352T | 11.2 | 55.5 | 201 | 161 |
| F362S | 3.4 | 108.5 | 31 | 25 |
| M368I | 10.3 | 74.1 | 140 | 112 |
| T378I | 4.9 | 111.3 | 44 | 35 |
| E419K | 3.9 | 75.5 | 52 | 42 |
| I434V | 3.5 | 83.0 | 42 | 34 |
Abbreviations: COT, cotinine; K m, kinetic metabolite; WT, wildtype; V max, maximum value; X, premature stop Codon.
Summary of in vitro assessments, in vivo associations, and variant construct functional assignments in AFR (N = 541) and EUR (N = 1026) populations
| Variant construct | CYP2A6/POR protein |
| Average NMR (Individual NMR) | Aggregate functional assignment (4‐level) | Aggregate functional assignment (3‐level) | Level of evidence | ||||
|---|---|---|---|---|---|---|---|---|---|---|
| AFR | ||||||||||
| Reference | WT | 1.08 | N | 125 | N | 0.351 | N | N | N | — |
| V365M ( | 0.56 | D | 79 | D | 0.235 | D | D | D | — | |
| Novel rare | R64C | 0.22 | L | 93 | D | 0.192 | L | L | D | 3 |
| R190L | 0.09 | L | 19 | L | 0.173 | L | L | D | 3 | |
| R333X | 0.01 | L | 1 | L | 0.108 (0.069, 0.123, 0.133) | L | L | D | 3 | |
| E419K | 0.61 | D | 52 | L | 0.154 | L | L | D | 3 | |
| I434V | 0.09 | L | 42 | L | 0.158 (0.13, 0.186) | L | L | D | 3 | |
| M275I | 0.40 | D | 106 | D | 0.257 | D | D | D | 3 | |
| E330D | 0.01 | L | 92 | D | 0.250 | D | D | D | 3 | |
| T378I | 0.56 | D | 44 | L | 0.239 (0.211, 0.242, 0.264) | D | D | D | 3 | |
| I61F | 0.60 | D | 137 | N | 0.192 | L | D | D | 2 | |
| M368I | 0.57 | D | 140 | N | 0.231 | D | D | D | 2 | |
| Q239K | 1.01 | N | 189 | G | 0.394 (0.319, 0.353, 0.509) | N | N | N | 3 | |
| P264T | 1.01 | N | 36 | L | 0.509 | G | N | N | 2 | |
| Known rare | F118L, R128L, S131A ( | 0 | L | 0% | L | 0.245 (0.065, 0.128, 0.298, 0.489) | D | L | D | 3 |
| R265Q ( | 0 | L | 5% | L | 0.16 (0.104, 0.179, 0.197) | L | L | D | 3 | |
| EUR | ||||||||||
| Reference | WT | 1.08 | N | 125 | N | 0.458 | N | N | N | — |
| Novel rare | R64C | 0.22 | L | 93 | D | 0.139 | L | L | D | 3 |
| G115D | 0.06 | L | 9 | L | 0.132 | L | L | D | 3 | |
| R257S | 0.43 | D | 35 | L | 0.139 (0.08, 0.197) | L | L | D | 3 | |
| I300T | 0.22 | L | 39 | L | 0.262 (0.069, 0.455) | D | L | D | 3 | |
| YGFL312—315L | 0.00 | L | 9 | L | 0.207 | D | L | D | 3 | |
| V323M | 0.00 | L | 8 | L | 0.265 | D | L | D | 3 | |
| F362S | 0.12 | L | 32 | L | 0.168 | L | L | D | 3 | |
| R64H | 0.73 | D | 93 | D | 0.206 | D | D | D | 3 | |
| E97K | 0.35 | L | 95 | D | 0.29975 (0.127, 0.162, 0.18, 0.191, 0.192, 0.263, 0.529, 0.754) | D | D | D | 3 | |
| V140G | 0.55 | D | 100 | D | 0.339 | D | D | D | 3 | |
| V140A | 0.54 | D | 160 | 0.265 | D | D | D | 2 | ||
| M352T | 0.90 | N | 201 | G | 0.372 (0.134, 0.315, 0.668) | N | N | N | 3 | |
| R339Q | 0.76 | D | 201 | G | 0.291 | D | N | N | 2 | |
| P35L | 0.30 | L | 45 | L | 0.873 | G | I.E. | I.E. | 1 | |
| R257C | 0.65 | D | 40 | L | 0.464 | N | I.E. | I.E. | 1 | |
| E279Q | 0.65 | D | 74 | D | 0.395 | N | I.E. | I.E. | 1 | |
| R311C | 0.00 | L | 26 | L | 0.359 | N | I.E. | I.E. | 1 | |
| S465T | 0.95 | N | 189 | G | 0.298 | D | I.E. | I.E. | 1 | |
| Known rare | L160I, D158E ( | Unknown | 66% | D | 0.223 (0.153, 0.293) | D | D | D | 2 | |
| K194E ( | Unknown | 219% | G | 0.168 | L | I.E. | I.E. | 1 | ||
| I471T ( | Unknown | 5% | L | 0.119 | L | L | D | 2 | ||
| G479V ( | Unknown | 0% | L | 0.549 (0.172, 0.925) | G | I.E. | I.E. | 1 | ||
CYP2A6/POR: ratio of CYP2A6 protein expression relative to POR.
V max/K m: measure of catalytic efficiency based on in vitro metabolism to the nicotine substrate. X: Premature stop Codon. L: loss‐of‐function, D: decrease‐of‐function, N: neutral‐function G: gain‐of‐function. Aggregate Functional Assignment (4‐level): overall functional assignment of variants split into four groups: L, D, N, and G. Aggregate Functional Assignment (3‐level): overall functional assignment of variants split into three groups: D, N, and G. I.E.: inconsistent evidence between in vitro and in vivo parameters. Cut points for definitions were based on reference constructs and were as follows. CYP2A6/POR: L: 0–0.4, D: 0.4–0.8, N: 0.8–1.1, G: >1.1. V max/K m: L: 0–55, D: 55–110, N:110–160, G: >160. Average NMR: L: 0–0.2, D: 0.2–0.35, N: 0.35–0.5, G: >0.5.
Abbreviations: AFR, African ancestry; EUR, European ancestry; K m, kinetic metabolite; NMR, nicotine metabolite ratio; V max, maximum value; WT, wildtype.
Only individuals that were heterozygote for the allele were considered (i.e. CYP2A6 *1/*17, and not *17/*17).
All coding variants in the haplotype were required to be considered part of the known * allele.
Reported in Mwenifumbo et al.
Reported in Hosono et al.
Reported in Piliguian et al.
FIGURE 3Respective (a) AFR wGRS (N = 504) and (b) EUR wGRS (N = 933), individuals with rare coding variants highlighted. Original wGRSs were based exclusively on common variants. (c, d) Updated wGRSs incorporating the functional rare coding variants weighted collectively as a decrease‐of‐function in the AFR and EUR population, respectively. Pie charts represent the proportion of individuals classified as slow or normal metabolizers by the wGRS before (a), (b), and after (c), (d) incorporating rare variant data. wGRS cut points for metabolizer status determination have been previously described (slow wGRS <2.089; normal wGRS ≥2.089 in AFR ; slow wGRS <2.14; normal wGRS ≥2.14 in EUR ). AFR, African‐ancestry; EUR, European‐ancestry; NMR, nicotine metabolite ratio; wGRS, weighted Genetic Risk Score
FIGURE 4Association of rare CYP2A6 coding variants with (a) CYP2A6 mRNA levels (FPKM values), (b) CYP2A6 protein levels (pmol/mg microsomal protein), and (c) CYP2A6 enzyme activity (cotinine formation from nicotine, nmol/min/mg microsomal protein) in the human liver bank (N = 312). Reference: Excluding those with a CYP2A6 rare variant and/or common functional CYP2A6 * alleles. S1: Variant also identified in Smoking Cessation Trial. Statistical analyses were based off the SKAT test statistic for continuous traits comparing the Rare Grouped and Reference groups. $: p < 0.05. (d) Summary of the assessments in the human liver bank. Enzyme activity, cotinine formation from nicotine in vitro; L, loss‐of‐function; D, decrease‐of‐function; N, neutral‐function; G, gain‐of‐function. Aggregate Functional Assignment (4‐level): overall functional assignment of variants split into four groups: L, D, N, and G. Aggregate Functional Assignment (3‐level): overall functional assignment of variants split into three groups: D, N, and G. I.E.: inconsistent evidence between expression and kinetic parameters. Cut points for definitions were based on reference variants and were as follows. Defining parameters: mRNA: L: 0–250, D: 251–375, N: 376–600, G: greater than 600. Protein: L: 0–15, D: 16–20, N: 21–30, G: greater than 30. Enzyme Activity: L: 0–0.08, D: 0.09–0.10, N: 0.11–0.15, G: greater than 0.15