Literature DB >> 34476898

Functional characterization of novel rare CYP2A6 variants and potential implications for clinical outcomes.

Ahmed El-Boraie1,2, Julie-Anne Tanner3, Andy Z X Zhu4, Katrina G Claw5, Bhagwat Prasad6, Erin G Schuetz7, Kenneth E Thummel8, Koya Fukunaga9, Taisei Mushiroda9, Michiaki Kubo9, Neal L Benowitz10, Caryn Lerman11, Rachel F Tyndale1,2,12.   

Abstract

CYP2A6 activity, phenotyped by the nicotine metabolite ratio (NMR), is a predictor of several smoking behaviors, including cessation and smoking-related disease risk. The heritability of the NMR is 60-80%, yet weighted genetic risk scores (wGRSs) based on common variants explain only 30-35%. Rare variants (minor allele frequency <1%) are hypothesized to explain some of this missing heritability. We present two targeted sequencing studies where rare protein-coding variants are functionally characterized in vivo, in silico, and in vitro to examine this hypothesis. In a smoking cessation trial, 1687 individuals were sequenced; characterization measures included the in vivo NMR, in vitro protein expression, and metabolic activity measured from recombinant proteins. In a human liver bank, 312 human liver samples were sequenced; measures included RNA expression, protein expression, and metabolic activity from extracted liver tissue. In total, 38 of 47 rare coding variants identified were novel; characterizations ranged from gain-of-function to loss-of-function. On a population level, the portion of NMR variation explained by the rare coding variants was small (~1%). However, upon incorporation, the accuracy of the wGRS was improved for individuals with rare protein-coding variants (i.e., the residuals were reduced), and approximately one-third of these individuals (12/39) were re-assigned from normal to slow metabolizer status. Rare coding variants can alter an individual's CYP2A6 activity; their integration into wGRSs through precise functional characterization is necessary to accurately assess clinical outcomes and achieve precision medicine for all. Investigation into noncoding variants is warranted to further explain the missing heritability in the NMR.
© 2021 The Authors. Clinical and Translational Science published by Wiley Periodicals LLC on behalf of American Society for Clinical Pharmacology and Therapeutics.

Entities:  

Mesh:

Substances:

Year:  2021        PMID: 34476898      PMCID: PMC8742641          DOI: 10.1111/cts.13135

Source DB:  PubMed          Journal:  Clin Transl Sci        ISSN: 1752-8054            Impact factor:   4.689


WHAT IS THE CURRENT KNOWLEDGE ON THE TOPIC? Common CYP2A6 variants (minor allele frequency >1%) explain 30–35% of the variation in CYP2A6 activity, despite high heritability estimates (60–80%) in the CYP2A6 activity biomarker measure. One hypothesis is that rare coding variants (minor allele frequency <1%) may explain a portion of the missing heritability from pharmacogenes, including CYP2A6. WHAT QUESTION DID THIS STUDY ADDRESS? What is the relative contribution of rare coding variants in explaining variation in CYP2A6 activity? How necessary is the incorporation of rare coding variants in predicting individual metabolic status, and consequent tailoring of treatment? WHAT DOES THIS STUDY ADD TO OUR KNOWLEDGE? Rare coding variants may explain only a small fraction of the variation on a population level; however, their role may be important on an individual level, altering the predicted metabolic status in a third of the individuals with these rare coding variants. HOW MIGHT THIS CHANGE CLINICAL PHARMACOLOGY OR TRANSLATIONAL SCIENCE? Evaluating rare coding variants in pharmacogenes, such as CYP2A6, will be valuable in enhancing the investigation of CYP2A6’s influence on tobacco addiction and disease pathogenesis, by providing a more accurate reflection of the phenotypic metabolic status through improved genetic assessments.

INTRODUCTION

Nicotine is responsible for tobacco’s addictive properties, and the rate of metabolic inactivation is associated with several smoking behaviors and cessation outcomes. Nicotine is predominately metabolized by the cytochrome P450 2A6 (CYP2A6) enzyme to form inactive cotinine (COT). COT is further metabolized to 3‐hydroxycotinine (3HC), exclusively by CYP2A6 ; the 3HC/COT ratio, called the nicotine metabolite ratio (NMR), is an index of CYP2A6 activity and a proxy for nicotine clearance. Variation in the NMR is associated with several smoking phenotypes including acquisition, quantity, , topography, dependence, , and smoking‐related disease risk, , and is a biomarker for personalizing smoking cessation treatment. , , , For example, in the Pharmacogenetics of Nicotine Addiction and Treatment 2 (PNAT2) smoking cessation trial, normal metabolizers (i.e., higher NMR) quit more on varenicline versus nicotine patch, whereas slow metabolizers (i.e., lower NMR) benefited more from the nicotine patch (equal quit rate, but lower incidence of side effects on the patch). The NMR requires COT to be at steady‐state, and thus it can only be reliably measured in regular smokers. This limits the assessment of the impact of NMR for CYP2A6 substrates as well as for tobacco‐related disease consequences (lung cancer, chronic obstructive pulmonary disease [COPD], and diabetes) , in intermittent‐, non‐, and former‐smokers. Heritability estimates for the NMR range between 60% and 80%. , Ninety‐six percent of NMR genomewide association studies (GWASs) hits are within or close to CYP2A6. Weighted genetic risk scores (wGRSs) based on common variants (minor allele frequency [MAF] >1%) explain 30–35% of the variation in the NMR phenotype. , Although substantial, this shortfall is not uncommon; many heritable phenotypes have only a portion of variation accounted for by identified genetic variants (e.g., human height ). Several researchers have hypothesized that rare variants (MAF < 1%), which are typically not assessed in GWAS genotyping arrays, contribute to this missing heritability, including for pharmacogenes; CYP2A6 rare variants could account for up to 40% of the functional variability. Nevertheless, there are several obstacles in understanding the role of rare variants on CYP function. (1) The high nucleotide sequence similarity/homology within CYP subfamily genes creates potential for sequencing errors: sequencing CYP2A6 is particularly difficult due to its high nucleotide similarity (94%) with the CYP2A7 pseudogene, making it challenging to verify variants from public sequencing databases. (2) The lack of rare variant functional characterization: often functional assignments are heavily reliant on in silico predictions, which are particularly inaccurate for CYP genes. Recently, a combined assortment of in silico prediction tools, weighted specifically for CYPs, has been put forth. In this paper, we present two studies to identify, functionally characterize, and assess the impact of rare CYP2A6 coding variants. We use two targeted sequencing approaches designed for accurate CYP2A6 sequencing, in conjunction with variant confirmation through orthogonal genotyping methods. Further, we integrate several in silico, in vivo, and in vitro approaches to characterize variants. Because statistical power is limited when studying rare coding variants individually using solely in vivo measures, we pair our assessments with in vitro functional assays to confirm variant effects on enzyme activity. Our group and others have made use of in vitro cDNA expression systems to assist in interpreting the effect of CYP protein‐coding variants. , , We assess the extent to which rare coding variants account for heritable variation, and if their incorporation leads to improved prediction of CYP2A6 metabolizer status. Furthermore, we present another set of CYP2A6 measures assessed in a human liver bank, including RNA quantification, protein abundance, and substrate metabolic activity assessed using human liver microsomes (HLMs).

METHODS

CYP2A6 read alignment simulation

A simulation was performed, aligning simulated reference CYP2A6 sequencing reads of varying lengths and insert sizes to assess public whole‐genome sequencing accuracy, where similar analyses have been described for the CYP2D6 gene. The read alignment simulation suggests there are several exons in which CYP2A6 reads could misalign, specifically exons 5 and 9 (Figure S1). Further details are described in the Supplementary Material. The following studies were approved by institutional review boards at all participating sites. Flowcharts of the different analyses and cohorts are summarized in (Figure S2).

Smoking cessation trial

Study population

There were 1684 treatment‐seeking smokers, including 541 of self‐reported African ancestry (AFR) and 1026 of self‐reported European ancestry (EUR), were screened at baseline as part of the PNAT2 (NCT01314001) smoking cessation trial. The remaining 117 subjects consisted of small numbers of other racial/ethnic populations; thus we focused on individuals of AFR and EUR ancestry.

Sequencing

Targeted deep exon sequencing was performed through the Illumina MiSeq sequencing system on the CYP2A6 gene. All exons were amplified through multiplex polymerase chain reaction (PCR) using gene‐specific primers, where amplicon sizes ranged from 260 to 490 bp; where the difficult‐to‐sequence exon 9 was captured through a nested PCR approach. Paired‐end sequencing reads were mapped to the Hg19 reference genome using BWA‐mem (version 0.7.15). Variants were called using GATK’s haplotype caller pipeline (version 3.3). Validation of the sequencing protocol was assessed in a sample of 120 polymerase chain reaction Japanese individuals (not part of the trial) to ensure targeted sequencing accuracy of CYP2A6 compared to Sanger sequencing, details of this validation can be found in Supplementary Table S1. All coding variants not yet described as a CYP2A6 * allele (pharmvar.org) were considered novel (i.e., whereas variants may be catalogued in other public sequencing databases), they have not previously been functionally characterized. Furthermore, all novel variants identified were confirmed through Sanger sequencing, the gold‐standard for validating next‐generation sequencing variant calls.

In vivo measure quantification

Free (unconjugated) concentrations of COT and 3HC, used to calculate the NMR, were measured from whole blood, where samples were collected at intake while individuals were smoking ad libitum.

In silico prediction

To designate in silico assignments, an optimized pharmacogenomic/absorption, distribution, metabolism, and excretion (ADME) prediction framework, published elsewhere, was utilized. Details are described in the Supplementary Material.

In vitro functional characterization

Details on the construction, expression, measure of protein quantity, and enzyme activity for the CYP2A6 variant constructs has been described elsewhere , and is described in full in the Supplementary Material. In vitro metabolism of nicotine to cotinine was used to determine CYP2A6 activity for each construct, as described elsewhere , and in the Supplementary Material.

Variant functional assignments

Assignments of loss‐of‐function, decrease‐of‐function, neutral, and gain‐of‐function were determined relative to the reference constructs CYP2A6 wildtype (WT) and CYP2A6*17 (known decrease‐of‐function variant). An aggregate functional assignment based on the three measures was made, and the level of evidence was assigned based on agreement between measures (3‐most confident, 1‐least confident). In cases where the in vitro and in vivo measures were in disagreement, the construct would be designated “inconsistent evidence” (IE) and assigned a level of evidence of 1. Together, two versions of aggregate functional assignments were created, a four‐level assignment distinguishing between loss and decrease‐of‐function, and a simplified three‐level assignment, which merged these two groups into a single decrease‐of‐function group. In vitro assignments for known rare coding variants were based on previous publications. , ,

Human liver bank

There were 312 human liver tissue samples, including 298 EUR individuals from two liver banks (smoking status unknown), (1) the St. Jude Liver Resource at the St. Jude Children’s Research Hospital (Memphis, TN) and (2) the University of Washington Human Liver Bank (Seattle, WA). The remaining 14 subjects consisted of small numbers of other racial/ethnic populations.

Sequencing and in vitro measures

CYP2A6 DNA sequencing, mRNA and protein quantification, and enzyme activity assays have been described previously, , and is briefly redescribed in the Supplementary Materials. All novel variants identified were confirmed through RNA sequencing using a different set of primers and library construction than those in the DNA sequencing.

Statistical analyses

Rare variant analyses

Rare variant association testing using the SKAT method was performed in BioBin version 2.3.0, where variants were binned based on an MAF cutoff of less than 1% and restricted to comparing CYP2A6 coding variants to the reference groups.

Reference groups

Reference groups were defined without a CYP2A6 rare variant and without one of the common CYP2A6 * alleles that are associated with the NMR (and included in their respective wGRSs). For AFR individuals, excluded common alleles were CYP2A6 *1X2, *4, *9, *12, *17, *20, *25, *26, *27, and *35. For EUR individuals, excluded common alleles were CYP2A6 *2, *4, *9, and *12.

Updated wGRSs

CYP2A6 activity wGRSs have been described for both AFR and EUR populations, based on 11 and seven common variants, respectively. For the wGRS analyses, ancestry was determined using principal components analysis, restricting to 504 AFR and 933 EUR individuals from the PNAT2 trial. To incorporate rare coding variants, effect sizes for the rare coding variants group were calculated through SNPTEST (version 2.5.2), as described previously ; remaining statistical analyses were performed in R (version 3.6.0) or RStudio (version 1.1.463). Effect sizes were computed either from the novel and previously known rare coding variants grouped together, or the novel rare coding variants alone. The variant weight was estimated by multiplying the standard deviation of NMR (0.181 and 0.205 in the AFR and EUR groups, respectively) by the effect size (i.e., beta) of the grouped rare variant alleles. The NMR was not normally distributed (by the Shapiro‐Wilk test) and was therefore log‐transformed, which best represents the nicotine clearance rate. Linear regression assessed log‐transformed NMR (log‐NMR) variation accounted for by the wGRS models, and residuals were calculated for individual participants from the wGRS line of best fit (i.e., distance between the wGRS model and outlier data points). A paired t‐test was used to assess the improvement in residual distribution after incorporating rare coding variants into the wGRS.

RESULTS

Identification of rare protein‐coding variants in CYP2A6 among 541 of African ancestry and 1026 individuals of European in the PNAT2 trial

There were 37 rare coding variants identified across 70 individuals (Table 1) through targeted CYP2A6 exon sequencing. Of the 37, eight were known, whereas 29 were novel and have not been previously identified as part of a CYP2A6 * allele or functionally characterized (pharmvar.org). All novel variants would be defined as rare (MAF <1%) based on the sequenced sample and based on public sequencing databases (e.g., gnomAD). Some of the novel variants were found concurrently with other novel variants or other common variants (Table S2). There were 15 rare coding variants identified in 31 of 541 AFR individuals, and 23 rare coding variants in 39 of 1026 EUR individuals (one of these variants overlapped between the AFR and EUR groups). There was a higher frequency of rare coding variants overall in AFRs (31/541; 5.7%) versus EURs (39/1026; 3.8%).
TABLE 1

Rare CYP2A6 variants identified in study one

Amino acid changersIDhg19 Positionhg19 REFhg19 ALTTranscript changeFrequency count (ancestry)Novel or knownIn silico prediction
I61Frs20055409541355885TAc.181A>T1 (AFR)Novel0 = Neutral
R128Lrs498689141354629CAc.383G>T4 (AFR) CYP2A6 *26 1 = Deleterious
S131Ars5955235041354621ACc.391T>G4 (AFR) CYP2A6 *26 1 = Deleterious
R190Lrs57133558741354209CAc.569G>T1 (AFR)Novel0.8 = Deleterious
Q239Krs13897873641352896GTc.715C>A4 (AFR)Novel0 = Neutral
P264Trs76166682741352821GTc.790C>A1 (AFR)Novel0.8 = Deleterious
R265Qrs14047170341352817CTc.794G>A3 (AFR) CYP2A6 *41 1 = Deleterious
M275Irs11186999541352786CAc.825G>T1 (AFR)Novel0.6 = Deleterious
E330Drs13790404441351370CAc.990G>T1 (AFR)Novel0.8 = Deleterious
R333Xrs6160557041351363TAc.997A>T3 (AFR)Novel1 = Deleterious
M368Irs77296436641351256CGc.1104G>C1(AFR)Novel0.2 = Neutral
T378Irs11455878041351227GAc.1133C>T3 (AFR)Novel0.4 = Neutral
E419Krs76841696341350584CTc.1255G>A1 (AFR)Novel0 = Neutral
I434Vrs130219228441350539TCc.1300A>G2 (AFR)Novel0 = Neutral
R64Crs37451527941355875GAc.191G>A2 (1 AFR, 1 EUR)Novel0 = Neutral
P35Lrs37771354541356228GAc.104C>T1 (EUR)Novel1 = Deleterious
R64Hrs37451527941355875CTc.191G>A1 (EUR)Novel0.4 = Neutral
E97Krs14530839941355777CTc.289G>A8 (EUR)Novel0.6 = Deleterious
G115Drs75847948841354668CTc.344G>A1 (EUR)Novel1 = Deleterious
V140Grs77709865841354593ACc.419T>G1 (EUR)Novel0.6 = Deleterious
V140Ars77709865841354593AGc.419T>C1 (EUR)Novel0.6 = Deleterious
D158Ers6060588541354538GCc.474C>G3 (EUR) CYP2A6 *22 0 = Neutral
L160Irs6056353941354534GTc.478C>A3 (EUR) CYP2A6 *22 0 = Neutral
K194Ers19991611741354198TCc.580A>G1 (EUR) CYP2A6 *15 0 = Neutral
R257Crs14515746041352842GCc.769C>T1 (EUR)Novel0.4 = Neutral
R257Grs14515746041352842GAc.769C>G2 (EUR)Novel0 = Neutral
E279Qrs5826175741351999CGc.835G>C1 (EUR)Novel0.2 = Neutral
I300Trs14869308441351935AGc.899T>C2 (EUR)Novel0 = Neutral
R311Crs5857163941351903GAc.931C>T1 (EUR+AFR) a Novel0.8 = Deleterious
YGFL312‐315LUnavailable41351891AAGAAGCCATAc.944del1 (EUR)Novel1 = Deleterious
V323Mrs130383935641351867CTc.967G>A1 (EUR)Novel0.4 = Neutral
R339Qrs15024768941351344CTc.1016G>A1 (EUR)Novel0.8 = Deleterious
M352Trs14384182341351305AGc.1055T>C3 (EUR)Novel0.2 = Neutral
F362Srs77801918941351275AGc.1085T>C1 (EUR)Novel1 = Deleterious
S465Trs74617333141349793ATc.1393T>A1 (EUR)Novel0 = Neutral
I471Trs503101641349774AGc.1412T>C1 (EUR) CYP2A6 *7/*19 0.6 = Deleterious
G479Vrs503101741349750CAc.1436G>T2 (EUR) CYP2A6 *5 1 = Deleterious

In silico prediction based off framework using several prediction tools that are optimized for ADME or pharmacogenes. Novel is defined as not previously functionally characterized or part of a known CYP2A6 * allele. X: Premature stop codon.

Abbreviations: ADME, absorption, distribution, metabolism, and excretion; AFR, African ancestry; ALT, alternate allele on the positive strand of the hg19 reference genome build; EUR, European ancestry; REF, reference allele.

Participant self‐reported race is multi‐racial.

Rare CYP2A6 variants identified in study one In silico prediction based off framework using several prediction tools that are optimized for ADME or pharmacogenes. Novel is defined as not previously functionally characterized or part of a known CYP2A6 * allele. X: Premature stop codon. Abbreviations: ADME, absorption, distribution, metabolism, and excretion; AFR, African ancestry; ALT, alternate allele on the positive strand of the hg19 reference genome build; EUR, European ancestry; REF, reference allele. Participant self‐reported race is multi‐racial.

In vivo associations

When grouped, rare coding variants were associated with decreased NMR relative to the reference group (those without rare coding variants and without common functionally relevant CYP2A6 * alleles; Figure 1). However, relative to the reference group, some variants appeared to have neutral or gain‐of‐function relationships with the NMR; these were more common in the EUR smoker group than in the AFR smoker group.
FIGURE 1

Beeswarm plots of the rare coding variants plotted individually, and grouped, with their NMR values in the (a) AFR (N = 541) and (b) EUR (N = 1026) populations. Reference: Individuals without a CYP2A6 rare variant and without any common functional CYP2A6 * alleles. X: Premature stop Codon. Statistical analyses were based off the SKAT test statistic for continuous traits comparing the Rare Grouped and Reference groups. $$: p < 0.001 $$$: p < 0.0001. AFR, African‐ancestry; EUR, European‐ancestry; NMR, nicotine metabolite ratio

Beeswarm plots of the rare coding variants plotted individually, and grouped, with their NMR values in the (a) AFR (N = 541) and (b) EUR (N = 1026) populations. Reference: Individuals without a CYP2A6 rare variant and without any common functional CYP2A6 * alleles. X: Premature stop Codon. Statistical analyses were based off the SKAT test statistic for continuous traits comparing the Rare Grouped and Reference groups. $$: p < 0.001 $$$: p < 0.0001. AFR, African‐ancestry; EUR, European‐ancestry; NMR, nicotine metabolite ratio

In silico predictions

The in silico‐based predictions using the optimized ADME‐prediction framework to assess variant outcomes were not able to distinguish the unique in vivo NMR associations between variants (Figure S3), which prompted further testing using in vitro approaches.

In vitro functional characterization

All 29 novel rare coding variants were introduced into respective CYP2A6‐POR bicistronic constructs and proteins were expressed. A WT, CYP2A6*17 construct (a common decrease‐of‐function CYP2A6 variant), and a construct with CYP2A6 excised (CYP‐DEL) were expressed concurrently with the novel variants and used as reference points. , , For immunoblotting, quality control checks were performed to confirm antibody specificity and protein identification in the WT and CYP‐DEL constructs, and a commercially expressed CYP2A6 protein source against pooled HLM where CYP2A6 and POR is expressed; further details are explained in Supplementary Figure S4. CYP2A6 protein levels from the constructs were determined using a standard curve of commercially expressed CYP2A6 protein (250–1000 fmol; Figure S5). The ratio of CYP2A6 protein levels to POR protein levels was determined by loading approximately equivalent POR (internal plasmid expression control; Figure 2a). The sample dilution curves (Figure S5) were used to determine an equivalent amount of POR. The ratio of CYP2A6 to POR was used to evaluate changes in CYP2A6 expression for each variant relative to the WT construct (Figure 2b). The in vitro (i.e., 2A6/POR protein expression and Vmax/Km catalytic efficiency corrected for CYP2A6 expression level) and in vivo (i.e., NMR) measures are summarized in (Tables 2 and 3).
FIGURE 2

(a) Western blot loading approximately equal POR amounts for each novel CYP2A6 rare variant identified in Smoking Cessation Trial. Sample loading was based on sample dilution curves (Figure S4), adjusting to achieve similar POR amounts between constructs (i.e., more sample was loaded for variant constructs with low expression), and within the respective POR and 2A6 linear ranges. (b) Bar graph displaying the CYP2A6/POR protein expression ratios for each variant construct. kDa, kilodalton; WT, wildtype expressed in E. coli; DEL, CYP2A6 excised from bicistronic construct with POR intact; C.E., Commercially Expressed CYP2A6 (CAT #456254; Corning); X, Premature stop Codon

TABLE 2

Detailed V max, K m, V max/K m (Turnover), and percentage of WT (Turnover Rates) for all novel rare variants identified in the Smoking Cessation Trial

Variant construct V max (pmol COT/min/pmol 2A6) K m (μM) V max/K m (nl/min/pmol 2A6)% WT (turnover rate)
WT6.048.0125100
V365M ( CYP2A6 *17)3.342.27964
P35L6.0134.04536
I61F8.763.5137110
R64H5.660.69374
R64C3.739.89375
E97K7.882.99576
Q239K12.767.2189152
V140A5.735.7160128
V140G3.534.910080
R190L2.9153.41915
R257C5.0124.34032
R257G5.4152.73528
P264T3.187.43629
M275I7.368.610685
E279Q4.662.27460
I300T7.2185.33931
R311C2.8107.12621
YGFL312‐315L1.0112.097
V323M1.1130.287
E330D8.997.19173
R333X0.5375.911
R339Q8.241.0201161
M352T11.255.5201161
F362S3.4108.53125
M368I10.374.1140112
T378I4.9111.34435
E419K3.975.55242
I434V3.583.04234

Abbreviations: COT, cotinine; K m, kinetic metabolite; WT, wildtype; V max, maximum value; X, premature stop Codon.

TABLE 3

Summary of in vitro assessments, in vivo associations, and variant construct functional assignments in AFR (N = 541) and EUR (N = 1026) populations

Variant constructCYP2A6/POR protein V max/K m (nl/min/pmol 2A6)Average NMR (Individual NMR) a Aggregate functional assignment (4‐level)Aggregate functional assignment (3‐level)Level of evidence
AFR
ReferenceWT1.08N125N0.351NNN
V365M (CYP2A6 *17)0.56D79D0.235DDD
Novel rareR64C0.22L93D0.192LLD3
R190L0.09L19L0.173LLD3
R333X0.01L1L0.108 (0.069, 0.123, 0.133)LLD3
E419K0.61D52L0.154LLD3
I434V0.09L42L0.158 (0.13, 0.186)LLD3
M275I0.40D106D0.257DDD3
E330D0.01L92D0.250DDD3
T378I0.56D44L0.239 (0.211, 0.242, 0.264)DDD3
I61F0.60D137N0.192LDD2
M368I0.57D140N0.231DDD2
Q239K1.01N189G0.394 (0.319, 0.353, 0.509)NNN3
P264T1.01N36L0.509GNN2
Known rareF118L, R128L, S131A (CYP2A6 *26) a 0 c L0% d L0.245 (0.065, 0.128, 0.298, 0.489)DLD3
R265Q (CYP2A6 *41)0 e L5% d L0.16 (0.104, 0.179, 0.197)LLD3
EUR
ReferenceWT1.08N125N0.458NNN
Novel rareR64C0.22L93D0.139LLD3
G115D0.06L9L0.132LLD3
R257S0.43D35L0.139 (0.08, 0.197)LLD3
I300T0.22L39L0.262 (0.069, 0.455)DLD3
YGFL312—315L0.00L9L0.207DLD3
V323M0.00L8L0.265DLD3
F362S0.12L32L0.168LLD3
R64H0.73D93D0.206DDD3
E97K0.35L95D0.29975 (0.127, 0.162, 0.18, 0.191, 0.192, 0.263, 0.529, 0.754)DDD3
V140G0.55D100D0.339DDD3
V140A0.54D1600.265DDD2
M352T0.90N201G0.372 (0.134, 0.315, 0.668)NNN3
R339Q0.76D201G0.291DNN2
P35L0.30L45L0.873GI.E.I.E.1
R257C0.65D40L0.464NI.E.I.E.1
E279Q0.65D74D0.395NI.E.I.E.1
R311C0.00L26L0.359NI.E.I.E.1
S465T0.95N189G0.298DI.E.I.E.1
Known rareL160I, D158E (CYP2A6 *22) a Unknown66% d D0.223 (0.153, 0.293)DDD2
K194E (CYP2A6 *15)Unknown219% d G0.168LI.E.I.E.1
I471T (CYP2A6 *7)Unknown5% d L0.119LLD2
G479V (CYP2A6 *5)Unknown0% d L0.549 (0.172, 0.925)GI.E.I.E.1

CYP2A6/POR: ratio of CYP2A6 protein expression relative to POR.

V max/K m: measure of catalytic efficiency based on in vitro metabolism to the nicotine substrate. X: Premature stop Codon. L: loss‐of‐function, D: decrease‐of‐function, N: neutral‐function G: gain‐of‐function. Aggregate Functional Assignment (4‐level): overall functional assignment of variants split into four groups: L, D, N, and G. Aggregate Functional Assignment (3‐level): overall functional assignment of variants split into three groups: D, N, and G. I.E.: inconsistent evidence between in vitro and in vivo parameters. Cut points for definitions were based on reference constructs and were as follows. CYP2A6/POR: L: 0–0.4, D: 0.4–0.8, N: 0.8–1.1, G: >1.1. V max/K m: L: 0–55, D: 55–110, N:110–160, G: >160. Average NMR: L: 0–0.2, D: 0.2–0.35, N: 0.35–0.5, G: >0.5.

Abbreviations: AFR, African ancestry; EUR, European ancestry; K m, kinetic metabolite; NMR, nicotine metabolite ratio; V max, maximum value; WT, wildtype.

Only individuals that were heterozygote for the allele were considered (i.e. CYP2A6 *1/*17, and not *17/*17).

All coding variants in the haplotype were required to be considered part of the known * allele.

Reported in Mwenifumbo et al.

Reported in Hosono et al.

Reported in Piliguian et al.

(a) Western blot loading approximately equal POR amounts for each novel CYP2A6 rare variant identified in Smoking Cessation Trial. Sample loading was based on sample dilution curves (Figure S4), adjusting to achieve similar POR amounts between constructs (i.e., more sample was loaded for variant constructs with low expression), and within the respective POR and 2A6 linear ranges. (b) Bar graph displaying the CYP2A6/POR protein expression ratios for each variant construct. kDa, kilodalton; WT, wildtype expressed in E. coli; DEL, CYP2A6 excised from bicistronic construct with POR intact; C.E., Commercially Expressed CYP2A6 (CAT #456254; Corning); X, Premature stop Codon Detailed V max, K m, V max/K m (Turnover), and percentage of WT (Turnover Rates) for all novel rare variants identified in the Smoking Cessation Trial Abbreviations: COT, cotinine; K m, kinetic metabolite; WT, wildtype; V max, maximum value; X, premature stop Codon. Summary of in vitro assessments, in vivo associations, and variant construct functional assignments in AFR (N = 541) and EUR (N = 1026) populations CYP2A6/POR: ratio of CYP2A6 protein expression relative to POR. V max/K m: measure of catalytic efficiency based on in vitro metabolism to the nicotine substrate. X: Premature stop Codon. L: loss‐of‐function, D: decrease‐of‐function, N: neutral‐function G: gain‐of‐function. Aggregate Functional Assignment (4‐level): overall functional assignment of variants split into four groups: L, D, N, and G. Aggregate Functional Assignment (3‐level): overall functional assignment of variants split into three groups: D, N, and G. I.E.: inconsistent evidence between in vitro and in vivo parameters. Cut points for definitions were based on reference constructs and were as follows. CYP2A6/POR: L: 0–0.4, D: 0.4–0.8, N: 0.8–1.1, G: >1.1. V max/K m: L: 0–55, D: 55–110, N:110–160, G: >160. Average NMR: L: 0–0.2, D: 0.2–0.35, N: 0.35–0.5, G: >0.5. Abbreviations: AFR, African ancestry; EUR, European ancestry; K m, kinetic metabolite; NMR, nicotine metabolite ratio; V max, maximum value; WT, wildtype. Only individuals that were heterozygote for the allele were considered (i.e. CYP2A6 *1/*17, and not *17/*17). All coding variants in the haplotype were required to be considered part of the known * allele. Reported in Mwenifumbo et al. Reported in Hosono et al. Reported in Piliguian et al.

Variant integration into CYP2A6 wGRSs

Most individuals with a functionally important rare variant (variants with a level of evidence of 2 or 3; Table 3) were outliers from the lines of best fit in the original wGRS to log‐NMR correlations, as assessed by their residual value (Figure 3a,b; i.e., distance between the wGRS line of best fit and data points, to measure of how well the line fits for individuals sequenced with rare coding variants). In the AFR population, the average residual value of individuals with a functionally relevant rare variant was 0.238, compared to the general AFR population with an average residual of 0.188. In other words, the wGRS (based on common variants) is a poorer fit for those with rare coding variants than for everyone else. Likewise, in the EUR population, these values were 0.213 and 0.140, respectively. This suggests that the wGRS score assigned for those with rare coding variants was less reflective of their log‐NMR (i.e., weaker metabolizer status prediction).
FIGURE 3

Respective (a) AFR wGRS (N = 504) and (b) EUR wGRS (N = 933), individuals with rare coding variants highlighted. Original wGRSs were based exclusively on common variants. (c, d) Updated wGRSs incorporating the functional rare coding variants weighted collectively as a decrease‐of‐function in the AFR and EUR population, respectively. Pie charts represent the proportion of individuals classified as slow or normal metabolizers by the wGRS before (a), (b), and after (c), (d) incorporating rare variant data. wGRS cut points for metabolizer status determination have been previously described (slow wGRS <2.089; normal wGRS ≥2.089 in AFR ; slow wGRS <2.14; normal wGRS ≥2.14 in EUR ). AFR, African‐ancestry; EUR, European‐ancestry; NMR, nicotine metabolite ratio; wGRS, weighted Genetic Risk Score

Respective (a) AFR wGRS (N = 504) and (b) EUR wGRS (N = 933), individuals with rare coding variants highlighted. Original wGRSs were based exclusively on common variants. (c, d) Updated wGRSs incorporating the functional rare coding variants weighted collectively as a decrease‐of‐function in the AFR and EUR population, respectively. Pie charts represent the proportion of individuals classified as slow or normal metabolizers by the wGRS before (a), (b), and after (c), (d) incorporating rare variant data. wGRS cut points for metabolizer status determination have been previously described (slow wGRS <2.089; normal wGRS ≥2.089 in AFR ; slow wGRS <2.14; normal wGRS ≥2.14 in EUR ). AFR, African‐ancestry; EUR, European‐ancestry; NMR, nicotine metabolite ratio; wGRS, weighted Genetic Risk Score The rare coding variants were then weighted as a collective decrease‐of‐function group (based on the 3‐level functional assignment) and incorporated into the previously reported CYP2A6 wGRSs. Individuals without rare coding variants, or with rare coding variants deemed to have inconsistent evidence (Table 3), retained their original wGRS values. The overall beta evaluated for the decrease of function rare variant group in the AFR population was −0.695 (the consequent variant weight was −0.126), and this beta was consistent when incorporating the list of known rare coding variants. Likewise, in the EUR population, the overall beta was −0.819 (variant weight −0.168). There was only a minor improvement in the overall variance captured, as expected, given the small number of people with functionally important rare coding variants. In AFR smokers, the variance of NMR captured increased from R 2 = 30.7% (Figure 3a) to 31.9% (Figure 3c), whereas in EUR smokers, the variance increased from R 2 = 33.8% (Figure 3b) to 34.8% (Figure 3d). Similar findings were observed when incorporating known demographic covariates of the NMR (sex, age, and body mass index [BMI]) in both AFR smokers (R 2 = 33.5–34.6%) and EUR smokers (R 2 = 37.6–38.6%). However, when examining the specific individuals with functionally important rare coding variants in the AFR population (N = 18), the average residual value for individuals with a rare variant decreased from 0.238 to 0.158. Likewise, in the EUR population (N = 21), the average residual value for individuals with a rare variant decreased from 0.213 to 0.179. Together, there was a significant improvement in the residuals for those with rare coding variants (p < 0.001 based on a paired t‐test). In the AFR group, eight individuals were reclassified from normal to slow metabolizer status after incorporating the rare coding variants (Figure 3c). In the EUR group, four individuals were reclassified from normal to slow metabolizer status after incorporating the rare coding variants (Figure 3d). For these 12 individuals, this re‐classification based on their improved wGRS predicted metabolizer status (based on a 2.089 AFR and 2.14 EUR cutoff point) , lead to improved concordance with their NMR‐based metabolizer status (based on a 0.31 cutoff point).

Identification and characterization of rare protein‐coding variants in CYP2A6 among 312 individuals in a human liver bank

Ten rare coding variants (one known and nine novel) were identified among 17 individuals of the 312 total individuals sequenced (Table S3). When grouped, the rare coding variants were not associated with overall changes in CYP2A6 mRNA or protein (Figure 4a,b). However, they were overall associated with a decrease in enzyme activity (Figure 4c), suggesting some variants may alter transcription or translation/stability and resulting activity, whereas most coding variants directly altered intrinsic metabolic activity. Like in the Smoking Cessation Trial, the variants demonstrated a spectrum of functional assignments. Two of the variants identified in the Smoking Cessation Trial were also identified in the Human Liver Bank (E97K and R311C); the effects on CYP2A6 activity were consistent for E97K between the two studies. The 311C variant was associated with higher mRNA and protein expression in the human liver bank, neutral in vivo activity (NMR) in the smoking cessation trial and decreased intrinsic in vitro activity (based on the constructs); overall, this suggests that R311C is associated with higher expression, but lower intrinsic enzyme activity, leading to neutral in vivo activity. A summary of the functional characterizations of the rare coding variants identified in the Human Liver Bank are in (Figure 4d).
FIGURE 4

Association of rare CYP2A6 coding variants with (a) CYP2A6 mRNA levels (FPKM values), (b) CYP2A6 protein levels (pmol/mg microsomal protein), and (c) CYP2A6 enzyme activity (cotinine formation from nicotine, nmol/min/mg microsomal protein) in the human liver bank (N = 312). Reference: Excluding those with a CYP2A6 rare variant and/or common functional CYP2A6 * alleles. S1: Variant also identified in Smoking Cessation Trial. Statistical analyses were based off the SKAT test statistic for continuous traits comparing the Rare Grouped and Reference groups. $: p < 0.05. (d) Summary of the assessments in the human liver bank. Enzyme activity, cotinine formation from nicotine in vitro; L, loss‐of‐function; D, decrease‐of‐function; N, neutral‐function; G, gain‐of‐function. Aggregate Functional Assignment (4‐level): overall functional assignment of variants split into four groups: L, D, N, and G. Aggregate Functional Assignment (3‐level): overall functional assignment of variants split into three groups: D, N, and G. I.E.: inconsistent evidence between expression and kinetic parameters. Cut points for definitions were based on reference variants and were as follows. Defining parameters: mRNA: L: 0–250, D: 251–375, N: 376–600, G: greater than 600. Protein: L: 0–15, D: 16–20, N: 21–30, G: greater than 30. Enzyme Activity: L: 0–0.08, D: 0.09–0.10, N: 0.11–0.15, G: greater than 0.15

Association of rare CYP2A6 coding variants with (a) CYP2A6 mRNA levels (FPKM values), (b) CYP2A6 protein levels (pmol/mg microsomal protein), and (c) CYP2A6 enzyme activity (cotinine formation from nicotine, nmol/min/mg microsomal protein) in the human liver bank (N = 312). Reference: Excluding those with a CYP2A6 rare variant and/or common functional CYP2A6 * alleles. S1: Variant also identified in Smoking Cessation Trial. Statistical analyses were based off the SKAT test statistic for continuous traits comparing the Rare Grouped and Reference groups. $: p < 0.05. (d) Summary of the assessments in the human liver bank. Enzyme activity, cotinine formation from nicotine in vitro; L, loss‐of‐function; D, decrease‐of‐function; N, neutral‐function; G, gain‐of‐function. Aggregate Functional Assignment (4‐level): overall functional assignment of variants split into four groups: L, D, N, and G. Aggregate Functional Assignment (3‐level): overall functional assignment of variants split into three groups: D, N, and G. I.E.: inconsistent evidence between expression and kinetic parameters. Cut points for definitions were based on reference variants and were as follows. Defining parameters: mRNA: L: 0–250, D: 251–375, N: 376–600, G: greater than 600. Protein: L: 0–15, D: 16–20, N: 21–30, G: greater than 30. Enzyme Activity: L: 0–0.08, D: 0.09–0.10, N: 0.11–0.15, G: greater than 0.15

DISCUSSION

There were 47 rare coding variants identified among 87 individuals of 1996 sequenced in the two studies, 38 of which were novel and have not been previously functionally characterized. These rare coding variants were collectively associated with decreased enzyme activity, as observed in vivo by the NMR (Figure 1) and in vitro through metabolic assays using cDNA expressed enzymes (Tables 2 and 3) and HLM (Figure 4c). Most of these coding rare variants impose effects on metabolic activity (Tables 2 and 3, and Figure 4); some mediate these effects through changes in mRNA/protein expression (Figure 4), but most altered intrinsic enzyme properties, perhaps through alterations to the enzyme’s access channel or affecting substrate binding. Most rare coding variants reported in CYP genes, including previously for CYP2A6, are associated with decreased CYP enzyme activity , ; exploration of noncoding rare variants may yield more neutral or gain of function effects. The region containing the stretch of amino acid residues 300–330 may play an important function in the protein folding and stability of CYP2A6, as indicated by a consistent reduction of CYP2A6 protein from variant constructs in this region (Figure 2), and the high conservation of this region according to in silico predictions (Table 1). At a population level, the impact of rare coding variants appears to be small, represented by a minor increase in the variance (R2) captured in the NMR (Figure 3c,d; i.e., ~ 1% of variance was explained by the rare coding variants in the AFR and EUR samples). Even when controlling for nongenetic factors (sex, age, and BMI), the variance captured by the rare variants was still 1%, suggesting that this fraction of variation is directly explained by the variants and is not skewed by nongenetic factors. However, on an individual level, the inclusion of rare coding variants led to a shift for these individuals from predicted normal to predicted slow metabolizer status (Figure 3c,d), which was more in line with their NMR metabolizer status (i.e., 12 of 39 individuals sequenced with rare coding variants were re‐assigned to slow metabolizer status when incorporating rare coding variants into their wGRS). In the context of smoking cessation treatment, based on the findings from the PNAT2 clinical trial, to derive the greatest quitting rates with the fewest incidences of side effects, treatment recommendations for 12 of 39 individuals would shift from varenicline to the nicotine patch. The importance of integrating rare coding variants for these individuals was also observed by the significant reduction in the residual values from the wGRS line of best fit (i.e., the wGRSs now worked more accurately for those with rare coding variants). The 1% of variation captured falls short of the 40% estimate previously suggested to explain functional variation in CYP2A6. Three potential elements may have contributed to the high estimate. The first is the use of public sequencing databases. As demonstrated by the read alignment simulation, there is a potential for sequencing errors if the sequencing protocol does not meet rigorous criteria in terms of read length or selection of probes that prevent off‐target sequencing (Figure S1); caution is necessary when assessing whole‐genome sequencing databases. The second element is the dependence on in silico functional predictions. In silico variant predictions, even when aggregating a collection of approaches, were ineffective in distinguishing relationships with the NMR (Figure S3). The in vitro characterizations of variant effects on CYP2A6 activity were more consistent with in vivo measures of CYP2A6 activity (as demonstrated here and previously , , ). Third, future sequencing and analyses should explore the role of noncoding rare coding variants, such as those in regulatory elements including promotor regions which can affect the expression of CYP2A6. Another source of CYP2A6 functional variability may be explained by rare genetic variants in other genes that affect the regulation or activity of CYP2A6, for example, the gene encoding POR, which is responsible for the electron transfer between NADPH and CYP enzymes. In the Smoking Cessation Trial, a greater number of rare coding variants per capita was identified in the AFR population than in the EUR population (Table 1), as noted in other studies. This reflects the greater genetic diversity found in AFR populations in general, and is consistent with the on average lower NMR in AFR compared to EUR, partially explained by the greater frequency of variants in AFR individuals. An important role of rare coding variants in mediating smoking behaviors has also been shown for the genes encoding nicotine acetylcholine receptor alpha subunits (CHRNA). Rare coding variants identified in CHRNA4 were associated with nicotine dependence measures and smoking‐related disease risk. As the NMR is also associated with several smoking behaviors, including nicotine dependence measures and disease risk, we predict functionally relevant CYP2A6 rare coding variants would yield similar results. Indeed, in a previous study involving deep sequencing the CYP2A6 gene, there was an association between smoking amount and rare CYP2A6 variants predicted to be deleterious based on in silico predictions ; these analyses could be strengthened through the integration of in vitro functional characterization, as presented in this study. Due to our focus on coding rare coding variants, there was only a small number of participants with functionally important rare coding variants in our study, thus we were underpowered to study associations with clinical smoking phenotypes, such as smoking quantity or cessation; inclusion of functional noncoding rare coding variants would provide greater power to explore clinical associations. For our in vitro functional characterization in the Smoking Cessation Trial, we used an E. coli expression system. Whereas a major advantage of this expression system is a high protein yield that can be used for structural or metabolic assays, a disadvantage are the differences in transcriptional/translational/post‐translational mechanisms in bacterial versus human cells, which could influence effects on enzyme protein levels. Despite this limitation, this system has been shown here and elsewhere to reflect in vivo CYP2A6 activity. The use of RNA‐seq to confirm rare coding variants identified in the Human Liver Bank is not as reliable as Sanger sequencing, as was done to confirm variants in the Smoking Cessation Trial; however, unique primer sets were used between the DNA and RNA sequencing, making it less likely for the two orthogonal approaches to result in identical variant calling errors. Due to the cDNA nature of our in vitro functional assay, we focused our assessments on coding variants; however, it is possible that rare coding variants within regulatory elements and splice sites could contribute to the functional variability in CYP2A6 and the NMR. Future studies should consider the implications of these noncoding rare coding variants. Finally, for our in vitro assessments all variants were characterized in individual constructs, however, as displayed in Table S2, it is possible that variant combinations could lead to alternative effects on CYP2A6 activity that were not explicitly tested for. However, the rare variants were most often found in the same individuals as CYP2A6*9, a variant in the 5′ regulatory element (TATA box) that affects expression which could not be explicitly tested in our E. coli (prokaryote) expression system. Furthermore, the integration of the genetic risk scores in Figure 3 accounts for circumstances where individuals also have other functional variants. Considering there are 494 amino acids in the CYP2A6 protein with thousands of potential coding variant possibilities, expanding the catalogue of functionally characterized variants will be necessary. One approach would be through the techniques presented here; another approach would be to implement a deep mutational scanning approach. Yeast‐based activity assays to test thousands of variants are being piloted in CYP2C9, although the reflection of these assays to gold standard in vitro metabolic assays is still undetermined. Alternatively, current in silico prediction techniques could undergo further refinement with the growing knowledge of variant effects. In conclusion, we demonstrate that incorporation of functionally relevant rare coding variants can be important for the determination of individual metabolizer status. Incorporation of these rare coding variants led to more accurate CYP2A6 activity grouping (normal vs. slow) for individuals genotyped with rare coding variants. Improved CYP2A6 groupings can in turn provide better assessments of the impact of CYP2A6 on outcomes of interest, including smoking behaviors, disease risk assessment, as well as improving tailored selection of smoking cessation treatment. Currently, the high costs and poor accuracy of sequencing may prevent integration of rare coding variants, but as the accuracy, frequency, and appeal of sequencing increases, to achieve precision medicine for all, integration of these variants should be considered.

CONFLICTS OF INTEREST

R.F.T. has consulted for Quinn Emanuel and Ethismos on unrelated topics. N.L.B. has been a consultant to Pfizer and Achieve Life Sciences, companies that market or are developing smoking cessation medications, and has been an expert witness in litigation against tobacco companies. All other authors declared no competing interests for this work.

AUTHOR CONTRIBUTIONS

A.E. and R.F.T. wrote the manuscript. A.E., A.Z.X.Z., K.F., T.M., M.K., and R.F.T. designed the research. A.E., J.T., K.G.C., B.P., and K.F. performed the research. A.E. analyzed the data. E.G.S., K.E.T., T.M., M.K., N.L.B., C.L., and R.F.T. contributed reagents/analytical tools. Supplementary Material Click here for additional data file.
  41 in total

1.  The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data.

Authors:  Aaron McKenna; Matthew Hanna; Eric Banks; Andrey Sivachenko; Kristian Cibulskis; Andrew Kernytsky; Kiran Garimella; David Altshuler; Stacey Gabriel; Mark Daly; Mark A DePristo
Journal:  Genome Res       Date:  2010-07-19       Impact factor: 9.043

2.  Role of human cytochrome P4502A6 in C-oxidation of nicotine.

Authors:  M Nakajima; T Yamamoto; K Nunoya; T Yokoi; K Nagashima; K Inoue; Y Funae; N Shimada; T Kamataki; Y Kuroiwa
Journal:  Drug Metab Dispos       Date:  1996-11       Impact factor: 3.922

Review 3.  African genetic diversity: implications for human demographic history, modern human origins, and complex disease mapping.

Authors:  Michael C Campbell; Sarah A Tishkoff
Journal:  Annu Rev Genomics Hum Genet       Date:  2008       Impact factor: 8.929

4.  Evaluation of a weighted genetic risk score for the prediction of biomarkers of CYP2A6 activity.

Authors:  Ahmed El-Boraie; Taraneh Taghavi; Meghan J Chenoweth; Koya Fukunaga; Taisei Mushiroda; Michiaki Kubo; Caryn Lerman; Nicole L Nollen; Neal L Benowitz; Rachel F Tyndale
Journal:  Addict Biol       Date:  2019-02-27       Impact factor: 4.280

5.  Functional Characterization of 34 CYP2A6 Allelic Variants by Assessment of Nicotine C-Oxidation and Coumarin 7-Hydroxylation Activities.

Authors:  Hiroki Hosono; Masaki Kumondai; Masamitsu Maekawa; Hiroaki Yamaguchi; Nariyasu Mano; Akifumi Oda; Noriyasu Hirasawa; Masahiro Hiratsuka
Journal:  Drug Metab Dispos       Date:  2016-12-14       Impact factor: 3.922

6.  Predictors of Variation in CYP2A6 mRNA, Protein, and Enzyme Activity in a Human Liver Bank: Influence of Genetic and Nongenetic Factors.

Authors:  Julie-Anne Tanner; Bhagwat Prasad; Katrina G Claw; Patricia Stapleton; Amarjit Chaudhry; Erin G Schuetz; Kenneth E Thummel; Rachel F Tyndale
Journal:  J Pharmacol Exp Ther       Date:  2016-11-04       Impact factor: 4.030

7.  Quantifying rare, deleterious variation in 12 human cytochrome P450 drug-metabolism genes in a large-scale exome dataset.

Authors:  Adam S Gordon; Holly K Tabor; Andrew D Johnson; Beverly M Snively; Themistocles L Assimes; Paul L Auer; John P A Ioannidis; Ulrike Peters; Jennifer G Robinson; Lara E Sucheston; Danxin Wang; Nona Sotoodehnia; Jerome I Rotter; Bruce M Psaty; Rebecca D Jackson; David M Herrington; Christopher J O'Donnell; Alexander P Reiner; Stephen S Rich; Mark J Rieder; Michael J Bamshad; Deborah A Nickerson
Journal:  Hum Mol Genet       Date:  2013-11-26       Impact factor: 6.150

8.  Heritability of adult body height: a comparative study of twin cohorts in eight countries.

Authors:  Karri Silventoinen; Sampo Sammalisto; Markus Perola; Dorret I Boomsma; Belinda K Cornes; Chayna Davis; Leo Dunkel; Marlies De Lange; Jennifer R Harris; Jacob V B Hjelmborg; Michelle Luciano; Nicholas G Martin; Jakob Mortensen; Lorenza Nisticò; Nancy L Pedersen; Axel Skytthe; Tim D Spector; Maria Antonietta Stazi; Gonneke Willemsen; Jaakko Kaprio
Journal:  Twin Res       Date:  2003-10

9.  Novel features and enhancements in BioBin, a tool for the biologically inspired binning and association analysis of rare variants.

Authors:  Anna O Basile; Marta Byrska-Bishop; John Wallace; Alexander T Frase; Marylyn D Ritchie
Journal:  Bioinformatics       Date:  2018-02-01       Impact factor: 6.937

10.  Fast and accurate short read alignment with Burrows-Wheeler transform.

Authors:  Heng Li; Richard Durbin
Journal:  Bioinformatics       Date:  2009-05-18       Impact factor: 6.937

View more
  3 in total

1.  The genetic landscape of major drug metabolizing cytochrome P450 genes-an updated analysis of population-scale sequencing data.

Authors:  Yitian Zhou; Volker M Lauschke
Journal:  Pharmacogenomics J       Date:  2022-09-06       Impact factor: 3.245

2.  Accuracy and applications of sequencing and genotyping approaches for CYP2A6 and homologous genes.

Authors:  Alec W R Langlois; Ahmed El-Boraie; Koya Fukunaga; Taisei Mushiroda; Michiaki Kubo; Caryn Lerman; Jo Knight; Steven E Scherer; Meghan J Chenoweth; Rachel F Tyndale
Journal:  Pharmacogenet Genomics       Date:  2022-02-21       Impact factor: 2.000

3.  Functional characterization of novel rare CYP2A6 variants and potential implications for clinical outcomes.

Authors:  Ahmed El-Boraie; Julie-Anne Tanner; Andy Z X Zhu; Katrina G Claw; Bhagwat Prasad; Erin G Schuetz; Kenneth E Thummel; Koya Fukunaga; Taisei Mushiroda; Michiaki Kubo; Neal L Benowitz; Caryn Lerman; Rachel F Tyndale
Journal:  Clin Transl Sci       Date:  2021-09-25       Impact factor: 4.689

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.