Literature DB >> 32004414

CYP2C9 and CYP2C19: Deep Mutational Scanning and Functional Characterization of Genomic Missense Variants.

Lingxin Zhang1, Vivekananda Sarangi2, Irene Moon1, Jia Yu1, Duan Liu1, Sandhya Devarajan1, Joel M Reid1, Krishna R Kalari2, Liewei Wang1, Richard Weinshilboum1.   

Abstract

Single nucleotide variants in the open reading frames (ORFs) of pharmacogenes are important causes of interindividual variability in drug response. The functional characterization of variants of unknown significance within ORFs remains a major challenge for pharmacogenomics. Deep mutational scanning (DMS) is a high-throughput technique that makes it possible to analyze the functional effect of hundreds of variants in a parallel and scalable fashion. We adapted a "landing pad" DMS system to study the function of missense variants in the ORFs of cytochrome P450 family 2 subfamily C member 9 (CYP2C9) and cytochrome P450 family 2 subfamily C member 19 (CYP2C19). We studied 230 observed missense variants in the CYP2C9 and CYP2C19 ORFs and found that 19 of 109 CYP2C9 and 36 of 121 CYP2C19 variants displayed less than ~ 25% of the wild-type protein expression, a level that may have clinical relevance. Our results support DMS as an efficient method for the identification of damaging ORF variants that might have potential clinical pharmacogenomic application.
© 2020 Mayo Clinic. Clinical and Translational Science published by Wiley Periodicals, Inc. on behalf of the American Society for Clinical Pharmacology and Therapeutics.

Entities:  

Mesh:

Substances:

Year:  2020        PMID: 32004414      PMCID: PMC7359949          DOI: 10.1111/cts.12758

Source DB:  PubMed          Journal:  Clin Transl Sci        ISSN: 1752-8054            Impact factor:   4.689


WHAT IS THE CURRENT KNOWLEDGE ON THE TOPIC? ☑ With the increasing application of next generation sequencing (NGS) to known “pharmacogenes,” large numbers of open reading frame (ORF) “variants of unknown significance” (VUS) are being identified. However, the functional implications of those VUS remains unclear. WHAT QUESTION DID THIS STUDY ADDRESS? ☑ This study was designed to determine whether the application of DMS might make it possible to determine the functional effects of VUS that have been observed in the ORFs of cytochrome P450 family 2 subfamily C member 9 (CYP2C9) and cytochrome P4540 family 2 subfamily C member 19 (CYP2C19). WHAT DOES THIS STUDY ADD TO OUR KNOWLEDGE? ☑ The results of the study demonstrate that deep mutational scanning (DMS) can be used to determine the functional implications of ORF VUS in CYP2C9 and CYP2C19. HOW MIGHT THIS CHANGE CLINICAL PHARMACOLOGY OR TRANSLATIONAL SCIENCE? ☑ The application of DMS to additional pharmacogenes would potentially expand the accuracy and clinical utility of the application of NGS to pharmacogenomics. Genetic polymorphisms in or near pharmacogenes are a major cause of individual variation in drug response phenotypes.1 Cytochrome P450 family 2 subfamily C member 9 (CYP2C9) and cytochrome P450 family 2 subfamily C member 19 (CYP2C19) are genes that encode important cytochrome P450 enzymes that catalyze the phase I biotransformation of a variety of therapeutic drugs, including antiplatelet agents, selective serotonin reuptake inhibitors, and proton pump inhibitors.2, 3, 4, 5 Several years ago, the Mayo Clinic launched the RIGHT 1K study, in which next generation DNA sequencing (NGS) was performed with DNA from 1013 Mayo Biobank participants to identify variants in 84 pharmacogenes, including CYP2C9 and CYP2C19.1, 6 However, many of the polymorphisms observed in those patients were variants of unknown significance (VUS).1, 7, 8 In a recent publication, we functionally characterized six novel nonsynonymous open reading frame (ORF) variants in the CYP2C9 gene and seven nonsynonymous ORF variants in the CYP2C19 gene observed in DNA from participants in the Right 1K study.9 Conventional methods for the characterization of individual sequence variants “one‐at‐a‐time” are reliable, but they are also time‐consuming, labor‐intense, and not easily scalable. As a result, only a limited number of variants can practically be investigated in that fashion. To help address this challenge, predictive algorithms, such as Polyphen‐2, SIFT, and PROVEAN, among others, represent efforts to help identify deleterious variants, but their reliability is variable and inadequate for clinical application.10, 11, 12 Laboratory‐based assays are still required to reliably interpret the impact of ORF missense variants on protein function. As a result, a significant gap remains in the functional interpretation of the large number of missense VUS being discovered as DNA sequencing is applied ever more broadly. Deep mutational scanning (DMS) methods provide a platform with which a large number of missense variants can be interrogated in parallel.13, 14 For example, Fowler et al. developed a DMS assay for all possible variants in clinically important genes, such as BRCA1, TPMT, and PTEN, linking genotypes to functionally determined phenotypes.15, 16 Specifically, engineered “landing pad” HEK 293T cells were used as a platform to integrate pooled variant libraries, resulting in one variant per cell for further functional assessment.15 DMS includes the creation of variant libraries for a gene, selecting the library for function (e.g., resistance to drug or fluorescence markers of protein quantity) and—finally—high‐throughput DNA sequencing of variants to link them with the “activity” assayed in a functional test. In our previous “one‐at‐a‐time” functional genomic study, we identified a series of missense variants in the ORFs of CYP2C9 and CYP2C19 that resulted in decreased protein expression as a result of proteasome‐mediated degradation, presumably due to an alteration in protein folding. In the present study, fluorescence‐activated cell sorting (FACS) was used as a selection mechanism for cells expressing variant sequences (i.e., the cells were sorted according to the abundance of reporter protein expression). We should point out that saturation mutagenesis can also be used to create a “saturation” library, including all possible mutations for an ORF sequence in a single reaction, and that approach has been applied in some previous DMS studies.15 However, many of the variants created are unlikely to occur clinically. Therefore, we applied a different mutagenesis approach, nicking mutagenesis, to generate libraries that contained human genomic variation known to be present in the general population. Generation of variants by nicking mutagenesis is not exhaustive, but it is much more efficient than the generation of mutants one at a time by site‐directed mutagenesis.17 In the present study, we have used DMS to analyze the functional implications of missense variants that have been observed in the ORFs of CYP2C9 and CYP2C19, ORFs that were fused to green fluorescent protein (GFP). Recombinase was used to integrate the variant libraries into landing pad cells, one per cell. Multiplexed functional selection performed with FACS was then used to separate the cells into different “bins” based on fluorescence readout at the single cell level. Amplicon sequencing of DNA in each bin, followed by computational analysis of the frequency of variants appearing in each bin, was used to determine levels of protein expression. To identify potentially severely damaging variants for these two important CYP pharmacogenes, we analyzed 230 observed missense variants in ORFs present in a publicly available database consisting of exome sequencing  data for 60,000 general population subjects. We included genetic variants with allele frequencies (> 0.00001) observed by the Exome Aggregation Consortium (ExAC; http://exac.broadinstitute.org/) as well as novel ORF VUSs from the Mayo Right 1K and Right 10K projects.6, 9, 18 The RIGHT 1K samples served as a control because we had already studied them individually.9 We found that 19 of 109 CYP2C9 and 36 of the 121 CYP2C19 missense variants that were studied displayed less than ~ 25% of the wild‐type (WT) protein expression, a level that might have clinical relevance.9 We also compared variant calling by the DMS method with the predictions of computational algorithms and, finally, we validated serverely damaging variants by the use of Western blot analysis. Our findings suggest that DMS can be an efficient method for the high‐throughput identification of low protein abundance ORF variants that might have potential clinical implications.

Methods

Generation of landing pad HEK 293T cells

The landing pad construct and the promoter‐less cassette for CYP ORFs was created by Gibson assembly (Supplemental Text). TALENs were used to create double stranded breaks at the AAVS1 site and homology‐directed repair was used to introduce landing pad constructs into HEK 293T cells.15 The landing pad construct expressed doxycycline inducible blue fluorescent protein (BFP), which was used as a selection marker for landing pad insertion. Thirty percent of HEK 293T cells display hypotriploid karyotypes.19 The assay requires that a single variant be integrated per cell. Single cells were sorted into each well of a 96‐well plate for cloning. To identify which cells contained a single landing pad, we used Bxb1 recombinase mediated integration of a 1:1 ratio mixture of GFP and mCherry promotor‐less plasmids and analyzed fluorescence by flow cytometry. Clones having the lowest percentage of both GFP and mCherry were most likely to be candidate for further assay. The attB‐mCherry and attB‐GFP plasmids were transfected 24 hours after Bxb1 recombinase transfection, and BFP induction by doxycycline. After 5 days, candidate clones were trypsinized, washed with phosphate‐buffered saline, and were fixed in 4% formaldehyde at 4°C for 10 minutes. The cells were analyzed by flow cytometer FACS CantoX (BD Biosciences, San Jose, CA) and by the use of FACSDiva version 8.0 software and Flowjo software version 10 (BD Biosciences). The FACS CantoX instrument utilizes colinear 405, 488, and 561 nm lasers plus forward and side angle light scatter. Fluorescence images of variants were visualized using fluorescence microscopy (EVOS FLoid Cell Imaging Station; Life Technologies, Waltham, MA).

Nicking mutagenesis for variant library preparation

Nicking mutagenesis methods were modified from Whitehead et al. to construct variant libraries for ORFs containing CYP2C9 and CYP2C19 missense variants.17 Nicking mutagenesis uses Nt.BbvCI and Nb.BbvCI sites and exonuclease cleavage to degrade WT template DNA. Nt.BbvCI enzyme with nicking endonuclease and exonuclease digestion of WT template DNA was used to form single stranded DNA. At the annealing step, five oligos carrying the variant of interest were annealed separately to single strand templates and then pooled with five reactions in one pot, using high fidelity DNA polymerase and Taq ligase to close the double strand. The second template strand was degraded by Nb.BbvCI endonuclease cleavage and exonuclease digestion, and a new second strand was synthesized with a common primer on a cassette plasmid backbone. Phosphorylated oligos for CYP2C9 and CYP2C19 variants were purchased from IDT (Coralville, IA). Sanger sequencing was used to validate the sequences of variant clones. Detailed protocols are provided in the Supplemental Text.

Fluorescence‐activated cell sorting

Library cells were washed, trypsinized, and resuspended in phosphate‐buffered saline containing 5% fetal bovine serum. Cells were then sorted on an FACSAria with 407, 488, and 532 nm lasers (BD Biosciences) into four bins, and the cells were collected in culture medium. BFP−/mCherry+ cells containing CYP2C9 or CYP2C19 variants were flow sorted and grown for 5 days. BFP‐/mCherry+ cells were sorted again to determine the protein expression of CYP2C9/CYP2C19 variants based on their GFP/mCherry ratios. Gates were set based on GFP/mCherry ratios for cells integrating known CYP2C9/CYP2C19 variants and WT proteins as gating references. Four gates were set to dissect the pooled libraries into four different bins based on GFP/mCherry ratios. Data were analyzed by FACSDiva version 8.0.1 software.

Variant calling

Variant frequencies were calculated by high‐throughput sequencing of the DNA collected in each sorted bin. Genomic DNA was extracted from sorted cells using DNA extraction kits (Qiagen) and amplicons were sequenced as decribed in the Supplemental Text. Fastq files were aligned with respective CYP2C9 and CYP2C19 reference sequences using BWA mem aligner version 0.7.15. Samtools mpileup version 1.5 was used with a custom python script for single nucleotide variation calling. VarScan pileup2indel version 2.3.9 was used for calling INDELS. A base quality score cutoff of 20 and a mapping quality score cutoff of 20 were applied for both single nucleotide variation and INDEL calling. Custom scripts were used to summarize the data and add allele frequencies at all positions in the reference sequence.Variant counts in each bin were tabulated and each variant’s frequency in each bin was calculated. The effects of variants were obtained based on the frequency of variants in each bin.

Results

Generation of landing pad cell lines

The landing pad construct was generated and a promotor‐less cassette was constructed with ORFs for CYP2C9 and CYP2C19. The ORFs were fused to GFP as an indicator of protein expression and mCherry, which was independently expressed as a transfection control. The DMS assay requires that only a single transgenetic cassette be integrated per cell, which could be guaranted by having only a single landing pad per cell (Figure 1 a). To generate a cell line that integrated only a single copy of the landing pad, the landing pad construct was inserted into HEK 293T cells and cells were selected by BFP. Single cells positive for BFP were sorted into each well of a 96‐well plate for cloning. We observed different levels of BFP expression among different landing pad clones, probably because HEK 293T cells have been reported to have hypotriploid karyotypes.19 To identify a candidate clone with only a single landing pad, we selected > 30 clones with low BFP expression levels. We then tested those clones by using bxb1 recombinase to integrate a mixture of 1:1 ratio GFP only and mCherry only promotor‐less plasmids (plasmid sequences are shown in the Supplemental Text) into landing pad clones, followed by flow cytometry analysis. If a candidate clone contained only a single landing pad, it would integrate either GFP or mCherry, but not both. For clone 20, quadrant Q2 for the flow cytometry showed a negligible percentage of cells with both GFP and mCherry expression (Figure 1 b). In addition, no overlay of green and red was observed in the fluroscence image for clone 20, once again indicating that clone 20 had only a single landing pad (Figure 1 c). As a result, clone 20 was selected for use in the following experiments. This system allowed us to monitor variant protein expression in a high‐throughput fashion.
Figure 1

Generation of HEK 293T landing pad cells. (a) Plasmid maps of the landing pad construct and the promoter‐less cassette for CYP open reading frames that were fused to green fluorescent protein (GFP) and engineered for the simultaneous expression of IRES‐mCherry. (b) Flow cytometry results for landing pad HEK 293T clone 20 that had a low percentage of both GFP and mCherry, compatible with the conclusion that clone 20 contains a single landing pad. (c) Merged fluorescence photos showing that mCherry and GFP did not superimpose for clone 20. The landing pad clone 20 integrated either GFP or mCherry, which indicated that clone 20 contained a single landing pad. BFP, blue fluorescent protein; CMV, human cytomegalovirus promotor; IRES, internal ribosome entry site.

Generation of HEK 293T landing pad cells. (a) Plasmid maps of the landing pad construct and the promoter‐less cassette for CYP open reading frames that were fused to green fluorescent protein (GFP) and engineered for the simultaneous expression of IRES‐mCherry. (b) Flow cytometry results for landing pad HEK 293T clone 20 that had a low percentage of both GFP and mCherry, compatible with the conclusion that clone 20 contains a single landing pad. (c) Merged fluorescence photos showing that mCherry and GFP did not superimpose for clone 20. The landing pad clone 20 integrated either GFP or mCherry, which indicated that clone 20 contained a single landing pad. BFP, blue fluorescent protein; CMV, human cytomegalovirus promotor; IRES, internal ribosome entry site.

Effect of CYP2C9 and CYP2C19 variants on protein levels

In our previous study, CYP2C9 218C>T, CYP2C9 343A>C, CYP2C19*3 (636G>A), and CYP2C19 815A>G affected final protein quantity, resulting in varying levels of decreased protein expression.9 In the present study, the reporter consisted of a promoter‐less cassette containing the C‐terminus of the CYP2C9 or CYP2C19 ORFs fused to GFP. That construct was expressed once it was integrated into landing pad cells to ensure one variant per cell and the landing pad BFP was disrupted. Flow cytometry analysis of BFP‐/mCherry+ cells showed that known “damaging” variants expressed significantly reduced levels of GFP and lower GFP/mCherry ratios, indicating that those cells expressed less protein. Mean GFP/mCherry ratios of known damaging variants were compared with those of WT proteins. The value for CYP2C9 218C>T was 28.4%; CYP2C9 343A>C was 48.7%; CYP2C19*3 (636G>A) was 17.6%; and CYP2C19 815A>G was 61.2% of the mean WT GFP/mCherry ratios, percentages that were roughly identical with Western blot results that we had previously reported (Figure 2 a–c).9 Next, we created constructs for nonsynonymous variants with allele frequencies > 0.00001 as reported by the Exome Aggregation Consortium and by the Mayo Clinic Right 1K study and created variant libraries for CYP2C9 and CYP2C19. Pooled variant libraries for both genes were integrated into landing pad cells. Using already known damaging variants for CYP2C9 and CYP2C19 as well as WT constructs as references for FACS gating, the cells were sorted into different “bins” based on GFP/mCherry ratios (Figure 2 d). Variants of CYP2C9/CYP2C19 with the lowest GFP/mCherry ratios (< 25% protein expression) were sorted into bin1 and were classified as severely damaging. WT‐like variants were sorted into bin4. The gating for four‐way sorting of CYP2C9/CYP2C19 pooled variants libraries is shown in Figure 3 a,b. Pools of cells in each bin were collected and were then used as input material for DNA sequencing. We used NGS to monitor the frequencies of variants in each bin. Variant frequencies (F v) appearing in each bin were then used to determine protein abundance scores. Abundance scores for each CYP2C9 and CYP2C19 variant were calculated by use of the following equation:
Figure 2

Flow cytometry of cytochrome P450 family 2 subfamily C member 9 (CYP2C9) and cytochrome P450 family 2 subfamily C member 19 (CYP2C19) constructs with known variants. (a,b) Flow cytometry analysis of blue fluorescent protein (BFP)‐/mCherrry+ cells that integrated wild‐type or known damaging variants, CYP2C9 218C>T and CYP2C9 343A>C, CYP2C19*3 and CYP2C19 815A>G. Note that for both wild‐type (WT) allozymes, most of the cells eluted toward higher green fluorescent protein (GFP)/mCherry ratios, while allozymes containing damaging variants eluted at significantly lower GFP/mCherry ratios than did the cells containing the WT. Mean GFP/mCherry ratios for those variants were consistent with Western blot results obtained during our previous study.9 (c) Cells transfected with constructs expressing known severely damaging variants (CYP2C9 218C>T and CYP2C19*3) eluted at low GFP/mCherry ratios, whereas other variants eluted between the two extremes (CYP2C9 343A>C and CYP2C19 815A>G), compatible with their being “damaging” variants. Four “bins” were established based on WT CYP2C9 and CYP2C19 and known damaging CYP2C9 and CYP2C19 variants, as shown diagramatically in (d).

Figure 3

Fluorescence‐activated cell sorting‐sorting of pooled cytochrome P450 family 2 subfamily C member 9 (CYP2C9) and cytochrome P450 family 2 subfamily C member 19 (CYP2C19) variant libraries. Blue fluorescent protein−/mCherrry+ cells integrating CYP2C9 and CYP2C19 pooled variant libraries were sorted into four bins based on their green fluorescent protein (GFP)/mCherry ratios. Gates were set based on wild‐type CYP2C9 and CYP2C19 and known damaging CYP2C9 and CYP2C19 variants. Pools of sorted cells in each bin were collect and were used as input material for subsequent amplicon DNA sequencing.

Flow cytometry of cytochrome P450 family 2 subfamily C member 9 (CYP2C9) and cytochrome P450 family 2 subfamily C member 19 (CYP2C19) constructs with known variants. (a,b) Flow cytometry analysis of blue fluorescent protein (BFP)‐/mCherrry+ cells that integrated wild‐type or known damaging variants, CYP2C9 218C>T and CYP2C9 343A>C, CYP2C19*3 and CYP2C19 815A>G. Note that for both wild‐type (WT) allozymes, most of the cells eluted toward higher green fluorescent protein (GFP)/mCherry ratios, while allozymes containing damaging variants eluted at significantly lower GFP/mCherry ratios than did the cells containing the WT. Mean GFP/mCherry ratios for those variants were consistent with Western blot results obtained during our previous study.9 (c) Cells transfected with constructs expressing known severely damaging variants (CYP2C9 218C>T and CYP2C19*3) eluted at low GFP/mCherry ratios, whereas other variants eluted between the two extremes (CYP2C9 343A>C and CYP2C19 815A>G), compatible with their being “damaging” variants. Four “bins” were established based on WT CYP2C9 and CYP2C19 and known damaging CYP2C9 and CYP2C19 variants, as shown diagramatically in (d). Fluorescence‐activated cell sorting‐sorting of pooled cytochrome P450 family 2 subfamily C member 9 (CYP2C9) and cytochrome P450 family 2 subfamily C member 19 (CYP2C19) variant libraries. Blue fluorescent protein−/mCherrry+ cells integrating CYP2C9 and CYP2C19 pooled variant libraries were sorted into four bins based on their green fluorescent protein (GFP)/mCherry ratios. Gates were set based on wild‐type CYP2C9 and CYP2C19 and known damaging CYP2C9 and CYP2C19 variants. Pools of sorted cells in each bin were collect and were used as input material for subsequent amplicon DNA sequencing. For each experiment, an “abundance score” for each variant studied was obtained by multiplying the variant frequency by weighted values (0.25–1) with bin1 assigned 0.25 and bin4 with 1.0.16 The final abundance score for each variant was calculated by averaging mean abundance scores across replicate assays. Variants were scored in at least three experiments, as shown graphically in Figure 4 and Figure . Variants were classified as “severely damaging,” “damaging,” or “tolerated,” with thresholds chosen on the basis of abuandance scores for known CYP2C9/CYP2C19 variants.9 Using Western blot results and corresponding enzyme activities from our previous publication as a reference,9 CYP2C9 variants with abundance scores below 0.578 (CYP2C9 1076T>C), were classified as “severely damaging,” displaying ~ 25% protein abundance as compared with WT, whereas those with abundance scores above that threshold but lower than 0.670 (CYP2C9 343A>C) were classified as “damaging” displaying ~ 50% of the protein abundance of the WT allozyme. Variants with scores above this threshold were considered “tolerated.” CYP2C19 variants with abundance scores below 0.597 (CYP2C19 1349C>A), were classified as “severely damaging,” whereas those with abundance scores above this threshold, but lower than 0.635 (CYP2C19 557G>A), were classified as “damaging.” Variants with scores above this threshold were considered to be “tolerated.” We found that 19 of 109 CYP2C9 and 36 of 121 CYP2C19 missense variants displayed less than ~ 25% of WT protein expression.
Figure 4

Protein abundance scores for 109 cytochrome P450 family 2 subfamily C member 9 (CYP2C9) and 121 cytochrome P450 family 2 subfamily C member 19 (CYP2C19) variants. (a) Abundance score values for CYP2C9 and CYP2C19 variants. Variants having abundance scores lower than that for CYP2C9 (1076T>C) were classified as severely damaging, whereas variants having abundance scores lower than that for CYP2C9 (343A>C) were classified as damaging variants. The results shown are averages for four replicates. (b) Mean abundance scores for CYP2C19 variants are shown in the histogram. Variants having abundance scores lower than that for CYP2C19 (1349C>A) were classified as severely damaging, whereas those with abundance scores lower than that for CYP2C19 (557G>A) were classified as damaging variants. The results shown are averages abudance scores for four replicates. SD values are listed in Tables and .

Protein abundance scores for 109 cytochrome P450 family 2 subfamily C member 9 (CYP2C9) and 121 cytochrome P450 family 2 subfamily C member 19 (CYP2C19) variants. (a) Abundance score values for CYP2C9 and CYP2C19 variants. Variants having abundance scores lower than that for CYP2C9 (1076T>C) were classified as severely damaging, whereas variants having abundance scores lower than that for CYP2C9 (343A>C) were classified as damaging variants. The results shown are averages for four replicates. (b) Mean abundance scores for CYP2C19 variants are shown in the histogram. Variants having abundance scores lower than that for CYP2C19 (1349C>A) were classified as severely damaging, whereas those with abundance scores lower than that for CYP2C19 (557G>A) were classified as damaging variants. The results shown are averages abudance scores for four replicates. SD values are listed in Tables and . The variant calling results obtained by the use of DMS for variants from the ExAC study that had allele frequencies > 0.00001 and variants from the Mayo RIGHT 10K study are listed in Tables 1 and 2. The DMS results were also compared with predictions obtained using SIFT, Provean, and Polyphen2, and those results are listed in Tables , and , for CYP2C9 and CYP2C19, respectively. For variants that resulted in dramatically reduced protein expression levels, CYP2C9*11, CYP2C9*21, and another 12 CYP2C9 variants, as well as CYP2C19*8, CYP2C19*22, and another 26 CYP2C19 variants, the results were in good agreement among all three of these predictive algorithms. However, five CYP2C9 variants and seven CYP2C19 variants displayed < 25% of WT protein expression, whereas SIFT predicted that they were “tolerated.” In summary, we found that the three commonly applied algorithms, which we tested on our 230 variants, disagreed among themselves 30.4% of the time, and they disagreed with our DMS assays, for at least one algorithm, 54.7% of the time. We also searched PharmVar, a repository for pharmacogene variation that records functionally validated variants. For CYP2C9*21 and CYP2C19*2B, *8 and *22, our results were consistent with those reported in PharmVar.20 However, PharmVar currently lists several allozymes for both CYP2C9 and CYP2C19, which we found to be severely damaging as having only limited or moderate evidence. As a result, our results provide additional information with regard to the functional implications of these variants. Finally, 15 CYP2C9 and 30 CYP2C19 variants that we found to be severely damaging had not previously been reported or were reported to have uncertain function in PharmVar.20
Table 1

Protein abuduance scores of CYP2C9 variants from ExAC browser and Mayo Right 10K study

Exact cDNAExact amino acidRSIDCommon allele nameAllele frequencyRight 10K (variant prevalence)DMSPharmVar
WTHeterzygousHomozygousFunctional studyAbundance score
c.752A>Gp.His251Argrs2256871CYP2C9*90.0067151007950Damaging0.653893822No record
c.449G>Ap.Arg150Hisrs7900194CYP2C9*80.0052421007410Damaging0.617433538Uncertain function
c.1003C>Tp.Arg335Trprs28371685CYP2C9*110.00378410033510Severely Damaging0.549201141possibly decreased
c.374G>Ap.Arg125Hisrs72558189CYP2C9*140.0029531008130Damaging0.627008929No record
c.1465C>Tp.Pro489Serrs9332239CYP2C9*120.00191210007770Damaging0.583353141No record
c.1080C>Gp.Asp360Glurs28371686CYP2C9*50.001163   Damaging0.629538267Moderate evidence
c.801C>Tp.Phe267hPhers149158426 0.000855   Damaging0.597268637No record
c.835C>Ap.Pro279Thrrs182132442CYP2C9*290.00047021008040TOLERATED0.672275959possibly decreased
c.1324G>Ap.Gly442Serrs368545396 0.000428610072120Damaging0.603761357unknown function
c.14T>Cp.Val5Alars138957855 0.00030481007770TOLERATED0.688673248No record
c.394C>Tp.Arg132Trprs199523631CYP2C9*450.00028041007860Damaging0.639522141unknown function
c.1264A>Gp.Ser422Glyrs776769484 0.00024771008220Damaging0.627667871No record
c.895A>Gp.Thr299Alars72558192CYP2C9*160.0002473   TOLERATED0.673073074No record
c.520C>Gp.Pro174Alars199539783 0.00016491007950Damaging0.651742423No record
c.269T>Cp.Leu90Prors72558187CYP2C9 *130.0001401   Damaging0.616189783possibly decreased
c.431G>Ap.Arg144Hisrs141489852 0.0001401   Damaging0.651784299No record
c.1084C> Gp.Leu362Val. 0.000132   Damaging0.586379329No record
c.1088C>Tp.Pro363Leurs150663116 0.00013191008220Damaging0.581270896possibly decreased
c.449G>Tp.Arg150Leu  0.00013191007490TOLERATED0.689018975No record
c.290G>Cp.Arg97Thr. 0.0001236   Damaging0.608683972No record
c.515G>Cp.Cys172Serrs147617899 0.00011551007950TOLERATED0.748597212No record
c.1034T>Cp.Met345Thr. 0.0001154   Damaging0.601859628No record
c.1341A>Cp.Leu447Phers59485260 0.00010721007950Damaging0.609375No record
c.343A>Cp.Ser115Argrs771237265 0.0001011008220Damaging0.670161008No record
c.1095C>Ap.Ser365Argrs139532088 0.00009894   Damaging0.578622159Uncertain function
c.980T>Cp.Ile327Thrrs57505750CYP2C9*310.0000907   Damaging0.611262013No record
c.89C>Tp.Pro30Leurs142240658CYP2C9*210.000090671008220Severely Damaging0.483953225Uncertain function
c.448C>Tp.Arg150Cysrs17847037CYP2C9*480.00009066   TOLERATED0.710010892No record
c.688C>Gp.His230Asp. 0.0000837   Damaging0.65625unknown function
c.389C>Tp.Thr130Metrs200965026CYP2C9*440.000082481008040Damaging0.639755882No record
c.517G>Ap.Ala173Thrrs373758696 0.000082471008040Severely Damaging0.548579579No record
c.395G>Ap.Arg132Glnrs200183364CYP2C9*330.000082461008310TOLERATED0.711347628No record
c.1362G>Cp.Gln454His.CYP2C9*190.00008243   Damaging0.645897332No record
c.680C>Tp.Pro227Leu. 0.0000756   TOLERATED0.705389146possibly decreased
c.1439C>Tp.Pro480Leurs530950257 0.000074171008220Severely Damaging0.479818446possibly decreased
c.556C>Tp.Arg186Cysrs150435881 0.00006602   Damaging0.592060606possibly decreased
c.1166T>Cp.Ile389Thr. 0.00006597   Damaging0.631849327No record
c.1421A>Gp.Asn474Serrs141011391 0.000065931008220Damaging0.647619048No record
c.373C>Tp.Arg125Cys. 0.00005775   TOLERATED0.677726163uncertain function
c.1187A>Gp.His396Argrs187793133 0.00005772   TOLERATED0.700935174No record
c.1370A>Gp.Asn457Ser.CYP2C9*610.0000577   Severely Damaging0.543305795uncertain function
c.263T>Cp.Ile88Thrrs139656048 0.00005768   TOLERATED0.676032437No record
c.218C>Tp.Pro73Leurs762081829 0.00005691008310Severely Damaging0.571077996No record
c.595G>Ap.Glu199Lys. 0.00004956   Damaging0.651252305No record
c.371G>Ap.Arg124Glnrs12414460CYP2C9*420.0000495   Damaging0.656819836uncertain function
c.458T>Cp.Val153Alars373993395 0.000049451008130Damaging0.58402074uncertain function
c.719T>Cp.Met240Thr. 0.00004163   Damaging0.640173483No record
c.619A>Tp.Ile207Phe. 0.00004134   TOLERATED0.674486125No record
c.371G>Tp.Arg124Leurs12414460 0.00004125   Severely Damaging0.566767249uncertain function
c.539C>Tp.Ser180Phers200149294 0.000041251008310Severely Damaging0.568764708uncertain function
c.164C>Tp.Thr55Ilers771905380 0.000041241008220TOLERATED0.701629687No record
c.1004G>Ap.Arg335Gln.CYP2C9*340.00004122   TOLERATED0.695204529No record
c.709G>Ap.Val237Ile. 0.00003334   Damaging0.634745896No record
c.624G>Cp.Leu208Phe. 0.00003308   TOLERATED0.681854013No record
c.538T>Cp.Ser180Pro. 0.000033   Severely Damaging0.529048923No record
c.541A>Gp.Ile181Val. 0.000033   TOLERATED0.689873007No record
c.1297C>Tp.Arg433Trp. 0.00003299   Severely Damaging0.564814815No record
c.386T>Ap.Met129Lysrs750097042 0.000032991008310Damaging0.643411363No record
c.122A>Tp.Asn41Ile. 0.00003298   Severely Damaging0.458665155No record
c.1024A>Gp.Arg342Gly. 0.00003298   Severely Damaging0.575108971normal function
c.1036C>Tp.Pro346Ser. 0.000032981008130Damaging0.654218921No record
c.1429G>Ap.Ala477Thr.CYP2C9*300.00003297   Severely Damaging0.558712121uncertain function
c.632C>Tp.Pro211Leu. 0.00002482   TOLERATED0.692533067No record
c.629G>Ap.Ser210Asn. 0.00002481   Damaging0.644966918No record
c.600G>Tp.Lys200Asnrs766903671 0.000024781008220Damaging0.663003421No record
c.370C>Tp.Arg124Trp.CYP2C9*430.00002475   TOLERATED0.737650497No record
c.1153A>Tp.Thr385Ser. 0.00002474   Severely Damaging0.509235183No record
c.863A>Gp.Glu288Gly. 0.00002474   Damaging0.601528578No record
c.1135T>Cp.Tyr379His. 0.00002474   Damaging0.66603031No record
c.433G>Tp.Val145Phe. 0.00002473   Severely Damaging0.460289634No record
c.1022A>Gp.Asp341Gly. 0.00002473   Severely Damaging0.470510412No record
c.704A>Gp.Lys235Arg. 0.00001668   TOLERATED0.68009768uncertain function
c.791T>Gp.Ile264Serrs761895497 0.000016561008220Damaging0.599331078uncertain function
c.773A>Gp.Asn258Ser. 0.00001656   TOLERATED0.684042216No record
c.638T>Cp.Ile213Thr. 0.00001655   Damaging0.648500178No record
c.637A>Cp.Ile213Leu. 0.00001655   TOLERATED0.676605037No record
c.618G>Tp.Lys206Asn. 0.00001654   TOLERATED0.690691147No record
c.356A>Cp.Lys119Thr.CYP2C9*410.00001651   Damaging0.601033475decreased function
c.1076T>Cp.Ile359Thrrs56165452CYP2C9*40.0000165   Severely Damaging0.578003356possibly decreased
c.389C>Ap.Thr130Lys. 0.0000165   Damaging0.668748634unknown function
c.820G>Ap.Glu274Lys. 0.0000165   TOLERATED0.687495665No record
c.1080C>Ap.Asp360Glurs28371686 0.00001649   Damaging0.605882567No record
c.137G>Ap.Gly46Asp. 0.00001649   Damaging0.611297817No record
c.109C>Tp.Pro37Ser. 0.00001649   Damaging0.62487462No record
c.1045G>Ap.Asp349Asn. 0.00001649   Damaging0.648954995No record
c.908G>Tp.Ser303Ile. 0.00001649   Damaging0.653594136No record
c.986G>Ap.Arg329His. 0.00001649   Damaging0.66037359unknown function
c.1136A>Gp.Tyr379Cysrs773704286 0.000016491008310Damaging0.665754704No record
c.1214A>Tp.Glu405Val. 0.00001649   TOLERATED0.675922464No record
c.146A>Gp.Asp49Gly.CYP2C9*370.00001649   TOLERATED0.678200758uncertain function
c.422T>Cp.Ile141Thrrs148615754 0.00001649   TOLERATED0.681659253No record
c.1159A>Gp.Ile387Valrs764211126CYP2C9*560.000016491008310TOLERATED0.688530952No record
c.968T>Cp.Val323Ala. 0.00001649   TOLERATED0.700837059No record
c.989T>Cp.Val330Ala. 0.00001649   TOLERATED0.727107621No record
c.257C>Ap.Ala86Asp. 0.00001648   Severely Damaging0.513086248No record
c.38G>Ap.Cys13Tyr. 0.00001648   Damaging0.590756013uncertain function
c.445G>Ap.Ala149Thr.CYP2C9*460.00001648   Damaging0.604943716No record
c.312A>Tp.Glu104Asp. 0.00001648   Damaging0.638095238No record
c.271G>Ap.Gly91Arg. 0.00001648   Damaging0.651222774No record
c.296T>Ap.Ile99Asn. 0.00001648   Damaging0.651222774No record
c.1415T>Cp.Val472Ala. 0.00001648   Damaging0.668443691No record
c.247G>Ap.Val83Met. 0.00001648   Damaging0.669561409uncertain function
c.460G>Ap.Glu154Lys. 0.000016481008310TOLERATED0.688904038decreased function
c.296T>Cp.Ile99Thr. 0.00001648   TOLERATED0.6911887No record
c.8C>Ap.Ser3Tyr. 0.00001648   TOLERATED0.694461693No record
c.791T>Cp.Ile264Thrrs761895497 0.00001091008220TOLERATED0.683929331unknown function
c._del707p.Asn236Thrfs*5. 0.000004099   Severely Damaging0.25No record
c.709G>Cp.Val237Leu. Not recorded   Damaging0.623703953No record
c.229C>Tp.Leu77Met. Not recorded   TOLERATED0.72516812No record

DMS, deep mutational scanning; ExAC, Exome Aggregation Consortium; RSID, Reference SNP ID number; WT, wild‐type.

Table 2

Protein abuduance scores of CYP2C19 variants from ExAC browser and Mayo Right 10K study

Exact cDNAExact Amino acidRSIDCommon allele nameAllele FrequencyRight 10K (variant prevalence)DMSPharmVar
WTHeterzygousHomozygousFunctional studyAbundance score
c.991G>AV331Irs3758581 0.062428802124933Damaging0.599924823Nomal function
c.276G>CE92Drs17878459CYP2C19*2B0.023694805986TOLERATED0.713318402No record
c.518C>TA173Vrs61311738 0.0058181008040TOLERATED0.643519769No record
c.481G>CA161Prs181297724 0.004786   Severely damaging0.475593771No function
c.449G>AR150Hrs58973490CYP2C19*110.0026529985981TOLERATED0.69271491normal function
c.55A>CI19Lrs17882687CYP2C19*150.0021271007950Damaging0.623042396normal function
c.358T>CW120Rrs41291556CYP2C19*80.0015410045390Severely damaging0.47978285No function
c.1228C>TR410Crs17879685CYP2C19*130.0015171008220TOLERATED0.676655086Normal function
c.431G>AR144Hrs17884712CYP2C19*90.0010791008040Damaging0.621114695decreased function
c.365A>CE122Ars17885179 0.00096371008220TOLERATED0.753787879No record
c.784G>AD262N. 0.0008586   Damaging0.614038591No record
c.448C>TR150Crs142974781 0.00048591008310Damaging0.62953438No record
c.680C>TP227Lrs6413438CYP2C19*100.0004481007950Damaging0.624712679decreased function
c.985C>TR329Crs59734894 0.00038721007860Severely damaging0.554502254No record
c.394C>TR132Wrs149590953 0.00038711008130Damaging0.635242943No record
c.241G>AE81Krs149072229 0.000387110068160TOLERATED0.667726879No record
c.374G>AR125Hrs141774245 0.000296510062210Damaging0.615675954No record
c.337G>AV113Irs145119820 0.00027181007950TOLERATED0.683885811No record
c.1295A>TK432Irs146991374 0.000264   Damaging0.609742945No record
c.217C>TR73Crs145328984CYP2C19*300.00025531008210Damaging0.623329263uncertain function
c.1078G>AD360Nrs144036596 0.0002471   TOLERATED0.693150526None
c.395G>AR132Qrs72552267CYP2C19*60.00023891008220TOLERATED0.718026114No function
c.164C>GT55Srs572853437 0.000132   TOLERATED0.718673467No record
c.557G>CR186Prs140278421CYP2C19*220.00012361008310Severely damaging0.560050139No function
c.1004G>AR335Qrs118203757CYP2C19*240.00012361008010Damaging0.601312932no function
c.557G>AR186H  0.000108   Damaging0.635198497No record
c.831C>AN277K. 9.888E‐051008030TOLERATED0.681062109No record
c.1048G>AA350Trs201509150 9.884E‐05   Severely damaging0.536022841No record
c.1127T>AF376Y. 9.884E‐051008310TOLERATED0.711485306No record
c.25C>GL9V. 0.0000908   TOLERATED0.731784659No record
c.1007G>TS336Irs143833145 9.061E‐05   TOLERATED0.637387252No record
c.1036C>TP346S. 0.0000906   Severely damaging0.580346079No record
c.556C>TR186Crs183701923 8.242E‐05   Severely damaging0.564313994No record
c.389C>TT130Mrs150152656 8.237E‐051008220TOLERATED0.684802678No record
c.527A>GN176Srs57700608 7.417E‐0510074100TOLERATED0.671875No record
c.1075A>GI359V. 7.414E‐05   Damaging0.603645833No record
c.440A>GE147Grs147453531 7.413E‐051008130TOLERATED0.643455877No record
c.781C>TR261W. 6.605E‐05   Severely damaging0.501301905No record
c.836A>CQ279Prs61526399 6.591E‐05   Severely damaging0.570907901No record
c.1001A>TN334I. 0.0000659   TOLERATED0.648759438No record
c.221T>CM74T. 6.589E‐05   Severely damaging0.58131891No record
c.1324C>TR442Crs192154563CYP2C19*165.772E‐05   Damaging0.62128858decreased function
c.1021G>CD341Hrs770829708 5.766E‐051008130TOLERATED0.656054948No record
c.85C>TL29F. 4.946E‐05   TOLERATED0.692408379No record
c.837G>TQ279H. 4.944E‐05   TOLERATED0.678912244No record
c.1003C>TR335Wrs368758960 4.942E‐051008310Severely damaging0.497372294No record
c.1034T>AM345Krs201132803 4.942E‐051008010Severely damaging0.497916398No record
c.371G>AR124Qrs200346442 4.942E‐05   Damaging0.602183853No record
c.925G>AA309T. 4.942E‐05   TOLERATED0.636880409No record
c.218G>AR73H. 4.119E‐05   Severely damaging0.476206097No record
c.1072T>CY358H. 4.119E‐05   TOLERATED0.648167478No record
c.370C>TR124W. 4.118E‐05   Severely damaging0.505962416No record
c.726T>AS242R. 3.311E‐05   TOLERATED0.702093879No record
c.801C>AF267Lrs377674118 3.303E‐051008310Severely damaging0.537597325No record
c.1171C>GL391V. 3.298E‐05   Severely damaging0.569318436No record
c.1205C>TP402L. 3.298E‐05   TOLERATED0.642305732No record
c.1216A>GM406Vrs144056033 3.298E‐051008220TOLERATED0.699731714No record
c.1060G>AE354K. 3.295E‐05   Severely damaging0.475659325No record
c.896C>GT299R. 3.295E‐05   Severely damaging0.531790263No record
c.305T>CL102P. 3.295E‐05   Severely damaging0.574479079No record
c.905C>TT302Irs58259047 3.295E‐05   Damaging0.602678118No record
c.1120G>AV374Irs113934938CYP2C19*283.295E‐05   Damaging0.627074235normal function
c.961G>TA321S. 3.295E‐05   TOLERATED0.639949178No record
c.928C>TL310F. 3.295E‐051008220TOLERATED0.679301841No record
c.907A>GS303G. 3.295E‐05   TOLERATED0.717662336No record
c.648C>GC216W. 0.0000274   TOLERATED0.656662248No record
c.671A>TD224V. 2.561E‐05   Severely damaging0.53699972No record
c.753C>GH251Qrs148247410 2.478E‐05   Severely damaging0.563668153No function
c.769A>TI257F. 2.477E‐05   TOLERATED0.681306328No record
c.631C>AP211T. 2.476E‐05   TOLERATED0.659306032No record
c.629C>AT210N. 2.476E‐05   TOLERATED0.684317643No record
c.1316G>TG439V. 2.474E‐05   TOLERATED0.665474123No record
c.1371C>AN457K. 2.474E‐05   TOLERATED0.682756843No record
c.1414G>AV472I. 2.473E‐05   TOLERATED0.670673077No record
c.60G>CW20C. 2.473E‐05   TOLERATED0.672724033No record
c.562G>AD188Nrs370803989CYP2C19*352.473E‐051008040TOLERATED0.677997182uncertain function
c.82A>GK28E. 2.473E‐05   TOLERATED0.680943598No record
c.185G>CG62A. 2.472E‐05   Severely damaging0.508683191No record
c.190G>AV64Mrs150045105 2.472E‐05   TOLERATED0.650769501No record
c.848C>TT283I. 2.472E‐05   TOLERATED0.65615054No record
c.1037C>TP346L. 2.471E‐05   Severely damaging0.502953907No record
c.1080C>AD360E. 2.471E‐05   Severely damaging0.549265027No record
c.1078G>CD360Hrs144036596 2.471E‐051008130Severely damaging0.567361111No record
c.373C>TR125Crs200150287 2.471E‐051008310TOLERATED0.669159602No record
c.850A>GI284V. 2.471E‐05   TOLERATED0.685908298No record
c.430C>TR144C. 2.471E‐05   TOLERATED0.688887841No record
c.721G>AE241K. 1.657E‐05   Damaging0.619158302No record
c.728A>GD243G. 1.655E‐05   TOLERATED0.696277128No record
c.1288G>CA430P. 1.652E‐05   Severely damaging0.545128622No record
c.813G>AM271I. 1.652E‐05   TOLERATED0.663920987No record
c.169C>GL57V. 1.651E‐05   Damaging0.634196461No record
c.1465C>TP489S. 0.0000165   Severely damaging0.577527563No record
c.1373T>GL458Rrs761587034 1.649E‐051008310Severely damaging0.51802352No record
c.1349C>AT450Nrs141690375 1.649E‐05   Severely damaging0.597025262No record
c.1439C>TP480Lrs779501712 1.649E‐051008310Damaging0.626171339No record
c.1325G>AR442Hrs138112316 1.649E‐05   TOLERATED0.646762738No record
c.65A>GQ22Rrs144928727 1.649E‐05   TOLERATED0.656465743No record
c.1398C>AD466E. 1.649E‐05   TOLERATED0.661470643No record
c.593T>CM198Trs186489608 1.649E‐05   TOLERATED0.667063402No record
c.1330G>CE444Q. 1.649E‐05   TOLERATED0.694466074No record
c.1213G>CE405Q. 1.649E‐05   TOLERATED0.711027827No record
c.518C>AA173Drs61311738 1.648E‐051008040Severely damaging0.499614976No record
c.1076T>AI359N. 1.648E‐05   Severely damaging0.522206232No record
c.197C>GT66S. 1.648E‐05   Damaging0.626403544No record
c.862G>AV288I. 1.648E‐05   TOLERATED0.673350956No record
c.537C>GC179W. 1.648E‐05   TOLERATED0.694502569No record
c.865A>GI289V. 1.648E‐05   TOLERATED0.714496776No record
c.445G>AA149T. 1.647E‐05   Severely damaging0.521050873No record
c.905C>GT302Rrs58259047 1.647E‐05   Severely damaging0.521776385No record
c.331G>AG111R. 1.647E‐05   Severely damaging0.541454278No record
c.1021G>AD341N. 1.647E‐05   Severely damaging0.571470548No function
c.409G>CG137R. 1.647E‐05   Severely damaging0.571889977No record
c.1013G>TC338F. 1.647E‐05   Damaging0.609881164No record
c.217C>AR73Srs145328984 1.647E‐051008210TOLERATED0.651405999No record
c.410G>AG137E. 1.647E‐05   TOLERATED0.653990969No record
c.326G>CG109A. 1.647E‐05   TOLERATED0.658559311No record
c.347A>GN116S. 1.647E‐05   TOLERATED0.666187952No record
c.271G>CG91Rrs118203756CYP2C19*231.647E‐05   TOLERATED0.668239425uncertain function
c.1002C>AN334Krs563052490 1.647E‐051008130TOLERATED0.687674269No record
c.1112C>GT371Srs568155950 1.647E‐051008310TOLERATED0.715706169No record
c.578A>GQ193R  4.06E‐06   TOLERATED0.658206476No record

ExAC, Exome Aggregation Consortium; RSID, Reference SNP ID number; WT, wild‐type.

Protein abuduance scores of CYP2C9 variants from ExAC browser and Mayo Right 10K study DMS, deep mutational scanning; ExAC, Exome Aggregation Consortium; RSID, Reference SNP ID number; WT, wild‐type. Protein abuduance scores of CYP2C19 variants from ExAC browser and Mayo Right 10K study ExAC, Exome Aggregation Consortium; RSID, Reference SNP ID number; WT, wild‐type.

Validation of severely damaging variants of CYP2C9 and CYP2C19

In silico predictions were not always consistent with our DMS results, as described in the preceeding paragraph. That fact emphasizes the need for the validation of variant classification using DMS or other functional assays. Although the efficiency of calling damaging variants using DMS exceeded the throughput of classical mutagenesis methods, we still needed to confirm the accuracy of calling for the variants that we studied. Therefore, we validated our variant protein expression calls by the use of Western blot analyses. As shown in Figure 5 a,b, our newly identified severely damaging variants for CYP2C9 and CYP2C19 displayed significantly decreased protein expression (< 25% protein expression with the exception of CYP2C9 371G>T which had ~ 50% protein expression) compared with the WT protein—shown at the far right in each of the panels. The binning patterns for variant frequencies for selected allozyme are shown in Figures 5 c,d.
Figure 5

Western blot validation of cytochrome P450 family 2 subfamily C member 9 (CYP2C9) and cytochrome P450 family 2 subfamily C member 19 (CYP2C19) allozymes identified as containing severely damaging variants. (a,b) The protein expression of CYP2C9 and CYP2C19 in blue fluorescent protein−/mCherry+ cells integrating severely damaging variants for were validated by Western blot analysis. The mCherry and β‐actin were used as loading controls. Each image includes a control lane for wild‐type (WT) CYP2C9 or CYP2C19. (c,d) Variant frequencies for three representative “severely” damaging variants for CYP2C9 and CYP2C19 showing their distributions in each of the four bins.

Western blot validation of cytochrome P450 family 2 subfamily C member 9 (CYP2C9) and cytochrome P450 family 2 subfamily C member 19 (CYP2C19) allozymes identified as containing severely damaging variants. (a,b) The protein expression of CYP2C9 and CYP2C19 in blue fluorescent protein−/mCherry+ cells integrating severely damaging variants for were validated by Western blot analysis. The mCherry and β‐actin were used as loading controls. Each image includes a control lane for wild‐type (WT) CYP2C9 or CYP2C19. (c,d) Variant frequencies for three representative “severely” damaging variants for CYP2C9 and CYP2C19 showing their distributions in each of the four bins.

Discussion

The functional characterization of ORF missense variants in clinically important pharmacogenes remains a major challenge for pharmacogenomics. In a recent study, we identified and functionally characterized six novel nonsynonymous ORF variants in CYP2C9 and seven in CYP2C19 based on Mayo Right 1K data.9 We found that the enzyme activities of those variants generally correlated well with protein expression levels.9 Missense variants in CYP2C9/CYP2C19 may alter protein folding, leading to decreased protein expression as a result of accelerated proteasome‐mediated degradation, a major factor responsible for decreased enzymatic activity in pharmacogenomics.9, 21, 22, 23 The loss of function of allozymes containing nonsynonymous CYP2C9/CYP2C19 ORF single nucleotide polymorphisms due to decreased protein expression made it possible to analyze their function by fluorescence reporter assays.9 Because of the very large number of missense variants in ORFs, it is practically difficult to link the genotypes of these variants to their functional phenotypes using “one‐at‐a‐time” expression systems. Fowler and colleagues developed DMS to analyze variants for all possible amino acid alterations in several genes. Saturation mutagenesis was also used for DMS in several previous studies.9, 24, 25 That approach has advantages for use in pre‐emptive pharmacogenomics and makes it possible to interpet variant function based on protein structrual mapping.16, 26 Degenerate codons were used to generate the saturation libraries but some variants of interest may be missed due to codon bias, with up to 30–50% of possible variants missing from the final data sets.16 CYP2C9 and CYP2C19 each have ORFs that are 1.6 kb in length, so it would be difficult to generate saturation mutation libraries. As a result, we chose to apply a modification of the nicking mutagenesis method developed by Whitehead et al.17 to create focused variant libraries for missense variants that had reported to occur in humans for use in our study. Specifically, we analyzed 230 nonsynonymous ORF variants for CYP2C9 and CYP2C19 from the ExAC study that had minor allele frequencies > 0.00001. All of those variants had been shown to occur in humans and were not so rare as to be “private.” FACS sorting was used to separate variants of differing protein expression levels, all of which were subsequently analyzed by NGS to make it possible to calculate the frequency of each of the variants studied in each of our four FACS bins (see Figures 2 and 3). X‐ray crystal structures have been determined for CYP2C9 and CYP2C19.27, 28 Six substrate recognition sites (SRS) have been identified in CYP2C enzyme sequences: amino acids 96–117 (SRS1), 198–205 (SRS2), 233–240 (SRS3), 286–304 (SRS4), 359–369 (SRS5), and 470–477 (SRS6).29 Nineteen of the 75 CYP2C9 and 9 of the 58 CYP2C19 damaging variants listed in Tables 1 and 2, which displayed reduced protein expression of at least 50% protein, mapped to known SRS sites, so they may also influence substrate binding. Other damaging variants listed in Tables 1 and 2 fall outside of those sites but may influence activity due to the disruption of active sites, although they have no influence on protein abundance. For example, we have reported that CYP2C9 709G>C and CYP2C19 65A>G displayed significantly reduced enzyme activity, but their protein levels were similar to that of that the WT.9 In silico predictions have been widely applied to predict variation in protein structure and function. Our previous work and that of others supports the importance of the application of additional, functional methods to validate results obtained by using predictive algorithms.30, 31 Therefore, we compared variant calling by use of DMS with the predictions of computational algorithms, and differences were found between our results and those of prediction algorithms, as listed in Tables and . Those differences may be due to either the accuracy of the prediction algorithms or to underlying molecular mechanisms. For example, CYP2C9 709G>C and CYP2C19 65A>G and CYP2C19*13 have WT‐like protein expression but loss of enzyme activity.9, 32 We also determined whether variants identified by DMS were truly damaging by the use of Western blot analyses, as shown in Figure 5. A limitation of DMS based on fluorescence is that some genes have been found to not be amenable to this assay, because damaging variants for those genes had similar fluorescent intensities as did WT proteins.16 We found that DMS seems to be most sensitive for screening for severely damaging variants, which displayed clear fluorescence separation from WT‐like variants. However, it required careful interpretation of intermediate‐fluorescing variants. The validation of functional studies for variants characterized in this fashion will be essential if we are to incorporate these results into clinical decision making and electronic health records. To validate the most severely damaging variants that we identified by DMS, we used Western blot assays as our standard functional assay for validation, even though those studies were laborious and time‐consuming—but still necessary at this time. Protein expression is obviously an important aspect of functional genomics—but only one aspect. Additional functional validation based on enzyme activities for different CYP substrates should be performed in the future to further extend the functional characterization of the variant allozymes that we have studied. The regulation of CYP activity is a complex process involving multiple mechanisms, which include transcripton regulation by nuclear receptors, such as the pregnane X receptor, the constitutive androstane receptor, the glucocorticoid receptor and by members of the TSPYL gene family.33, 34, 35, 36 DMS, as we have used it, has the limitation of not addressing upstream DNA sequence variants, such as CYP2C19*17 that results in an increase in transcriptional activity.37, 38, 39 High‐throughput methodology for studying variants outside of ORFs will obviously be required for the interpretation of CYP variants that map outside of protein coding sequences. As a result, DMS is not a “final answer” but rather represents a significant step forward in our efforts to link genomic variation to variation in drug response phenotypes. In summary, we have identified and validated 15 CYP2C9 and 30 CYP2C19 severely damaging variants that had not previously been reported in PharmVar.20 Those variants are potentially clinically actionable. Functional studies of those variants showed decreased protein expression, which could result in decreased drug metabolism. Our results add information that may help to improve the accuracy of current prediction algorithms and they may also provide test data sets for machine‐learning methods that might “learn” to predict the effects of ORF missense variants. The Mayo Clinic recently expanded the RIGHT 1K study to include an even larger cohort, the RIGHT 10K study that includes an additional 10,085 DNA samples with sequencing data for 77 pharmacogenes.6 Single nucleotide polymorphisms from both RIGHT 1K and RIGHT 10K studies were included in our analyses. This same DMS methodology can now be implemented to study other important pharmacogenes and preemptive NGS Mayo RIGHT 10K data as ever larger numbers of ORF missense variants are identified.

Funding

This study was funded by National Institutes of Health (NIH) grants U19 GM61388 (The Pharmacogenomics Research Network), R01 GM28157, R01 GM125633, and T32 GM08685, and by the Mayo Clinic Center for Individualized Medicine.

Conflict of Interest

Both Drs.Weinshilboum and Wang are co‐founders of and stockholders in OneOme, LLC. All other authors declared no competing interests for this work.

Author Contributions

L.Z., V.S., J.Y., D.L., and R.W. wrote the manuscript. L.Z., V.S., J.Y., L.W., and R.W. designed the research. L.Z., I.M., S.D., and J.R. performed the research. L.Z., V.S., and K.K. analyzed the data. All authors have given final approval of the manuscript for submission. Figure S1. Click here for additional data file. Table S1. Click here for additional data file. Table S2. Click here for additional data file. Table S3. Click here for additional data file. Table S4. Click here for additional data file. Table S5. Click here for additional data file. Table S6. Click here for additional data file. Supplemental Text. Click here for additional data file.
  38 in total

1.  SIFT missense predictions for genomes.

Authors:  Robert Vaser; Swarnaseetha Adusumalli; Sim Ngak Leng; Mile Sikic; Pauline C Ng
Journal:  Nat Protoc       Date:  2015-12-03       Impact factor: 13.491

Review 2.  HEK293 in cell biology and cancer research: phenotype, karyotype, tumorigenicity, and stress-induced genome-phenotype evolution.

Authors:  A A Stepanenko; V V Dmitrenko
Journal:  Gene       Date:  2015-05-27       Impact factor: 3.688

Review 3.  Clinical Pharmacogenetics Implementation Consortium (CPIC) Guideline for CYP2D6 and CYP2C19 Genotypes and Dosing of Selective Serotonin Reuptake Inhibitors.

Authors:  J K Hicks; J R Bishop; K Sangkuhl; D J Müller; Y Ji; S G Leckband; J S Leeder; R L Graham; D L Chiulli; A LLerena; T C Skaar; S A Scott; J C Stingl; T E Klein; K E Caudle; A Gaedigk
Journal:  Clin Pharmacol Ther       Date:  2015-06-29       Impact factor: 6.875

4.  Structural characterization of human cytochrome P450 2C19: active site differences between P450s 2C8, 2C9, and 2C19.

Authors:  R Leila Reynald; Stefaan Sansen; C David Stout; Eric F Johnson
Journal:  J Biol Chem       Date:  2012-11-01       Impact factor: 5.157

5.  Effects of CYP2C9*3 and CYP2C9*13 on Diclofenac Metabolism and Inhibition-based Drug-Drug Interactions.

Authors:  Jing Zi; Duan Liu; Pingping Ma; He Huang; Juanli Zhu; Dongqing Wei; Jin Yang; Chao Chen
Journal:  Drug Metab Pharmacokinet       Date:  2010       Impact factor: 3.614

Review 6.  Clinical Pharmacogenetics Implementation Consortium guidelines for cytochrome P450-2C19 (CYP2C19) genotype and clopidogrel therapy.

Authors:  S A Scott; K Sangkuhl; E E Gardner; C M Stein; J-S Hulot; J A Johnson; D M Roden; T E Klein; A R Shuldiner
Journal:  Clin Pharmacol Ther       Date:  2011-06-29       Impact factor: 6.875

7.  Aggresome formation and pharmacogenetics: sulfotransferase 1A3 as a model system.

Authors:  Liewei Wang; Vivien C Yee; Richard M Weinshilboum
Journal:  Biochem Biophys Res Commun       Date:  2004-12-10       Impact factor: 3.575

8.  Measuring the activity of protein variants on a large scale using deep mutational scanning.

Authors:  Douglas M Fowler; Jason J Stephany; Stanley Fields
Journal:  Nat Protoc       Date:  2014-08-28       Impact factor: 13.491

Review 9.  Pharmacogenomics: Precision Medicine and Drug Response.

Authors:  Richard M Weinshilboum; Liewei Wang
Journal:  Mayo Clin Proc       Date:  2017-11-01       Impact factor: 7.616

10.  Cytochrome P450 2C19 Poor Metabolizer Phenotype in Treatment Resistant Depression: Treatment and Diagnostic Implications.

Authors:  Marin Veldic; Ahmed T Ahmed; Caren J Blacker; Jennifer R Geske; Joanna M Biernacka; Kristin L Borreggine; Katherine M Moore; Miguel L Prieto; Jennifer L Vande Voort; Paul E Croarkin; Astrid A Hoberg; Simon Kung; Renato D Alarcon; Nicola Keeth; Balwinder Singh; William V Bobo; Mark A Frye
Journal:  Front Pharmacol       Date:  2019-02-19       Impact factor: 5.810

View more
  7 in total

Review 1.  Keeping pace with CYP2D6 haplotype discovery: innovative methods to assign function.

Authors:  Karen E Brown; Jack W Staples; Erica L Woodahl
Journal:  Pharmacogenomics       Date:  2022-01-27       Impact factor: 2.533

2.  Mapping the miRNA-mRNA Interactome in Human Hepatocytes and Identification of Functional mirSNPs in Pharmacogenes.

Authors:  Nicholas R Powell; Harrison Zhao; Joseph Ipe; Yunlong Liu; Todd C Skaar
Journal:  Clin Pharmacol Ther       Date:  2021-08-13       Impact factor: 6.903

3.  Implementation of preemptive DNA sequence-based pharmacogenomics testing across a large academic medical center: The Mayo-Baylor RIGHT 10K Study.

Authors:  Liewei Wang; Steven E Scherer; Suzette J Bielinski; Donna M Muzny; Leila A Jones; John Logan Black; Ann M Moyer; Jyothsna Giri; Richard R Sharp; Eric T Matey; Jessica A Wright; Lance J Oyen; Wayne T Nicholson; Mathieu Wiepert; Terri Sullard; Timothy B Curry; Carolyn R Rohrer Vitek; Tammy M McAllister; Jennifer L St Sauver; Pedro J Caraballo; Konstantinos N Lazaridis; Eric Venner; Xiang Qin; Jianhong Hu; Christie L Kovar; Viktoriya Korchina; Kimberly Walker; HarshaVardhan Doddapaneni; Tsung-Jung Wu; Ritika Raj; Shawn Denson; Wen Liu; Gauthami Chandanavelli; Lan Zhang; Qiaoyan Wang; Divya Kalra; Mary Beth Karow; Kimberley J Harris; Hugues Sicotte; Sandra E Peterson; Amy E Barthel; Brenda E Moore; Jennifer M Skierka; Michelle L Kluge; Katrina E Kotzer; Karen Kloke; Jessica M Vander Pol; Heather Marker; Joseph A Sutton; Adrijana Kekic; Ashley Ebenhoh; Dennis M Bierle; Michael J Schuh; Christopher Grilli; Sara Erickson; Audrey Umbreit; Leah Ward; Sheena Crosby; Eric A Nelson; Sharon Levey; Michelle Elliott; Steve G Peters; Naveen Pereira; Mark Frye; Fadi Shamoun; Matthew P Goetz; Iftikhar J Kullo; Robert Wermers; Jan A Anderson; Christine M Formea; Razan M El Melik; John D Zeuli; Joseph R Herges; Carrie A Krieger; Robert W Hoel; Jodi L Taraba; Scott R St Thomas; Imad Absah; Matthew E Bernard; Stephanie R Fink; Andrea Gossard; Pamela L Grubbs; Therese M Jacobson; Paul Takahashi; Sharon C Zehe; Susan Buckles; Michelle Bumgardner; Colette Gallagher; Kelliann Fee-Schroeder; Nichole R Nicholas; Melody L Powers; Ahmed K Ragab; Darcy M Richardson; Anthony Stai; Jaymi Wilson; Joel E Pacyna; Janet E Olson; Erica J Sutton; Annika T Beck; Caroline Horrow; Krishna R Kalari; Nicholas B Larson; Hongfang Liu; Liwei Wang; Guilherme S Lopes; Bijan J Borah; Robert R Freimuth; Ye Zhu; Debra J Jacobson; Matthew A Hathcock; Sebastian M Armasu; Michaela E McGree; Ruoxiang Jiang; Tyler H Koep; Jason L Ross; Matthew G Hilden; Kathleen Bosse; Bronwyn Ramey; Isabelle Searcy; Eric Boerwinkle; Richard A Gibbs; Richard M Weinshilboum
Journal:  Genet Med       Date:  2022-03-21       Impact factor: 8.864

4.  Targeted Genotyping in Clinical Pharmacogenomics: What Is Missing?

Authors:  Jaime L Lopes; Kimberley Harris; Mary Beth Karow; Sandra E Peterson; Michelle L Kluge; Katrina E Kotzer; Guilherme S Lopes; Nicholas B Larson; Suzette J Bielinski; Steven E Scherer; Liewei Wang; Richard M Weinshilboum; John L Black; Ann M Moyer
Journal:  J Mol Diagn       Date:  2022-01-15       Impact factor: 5.341

5.  Evaluation of CYP2C19 Gene Polymorphisms in Patients with Acid Peptic Disorders Treated with Esomeprazole.

Authors:  Lorena Díaz-Ordóñez; Diana Ramírez-Montaño; Estephania Candelo; Carolina González-Restrepo; Sebastián Silva-Peña; Carlos Arturo Rojas; Mario Sepulveda Copete; Hector Raul Echavarria; Harry Pachajoa
Journal:  Pharmgenomics Pers Med       Date:  2021-04-29

6.  SLCO1B1: Application and Limitations of Deep Mutational Scanning for Genomic Missense Variant Function.

Authors:  Lingxin Zhang; Vivekananda Sarangi; Ming-Fen Ho; Irene Moon; Krishna R Kalari; Liewei Wang; Richard M Weinshilboum
Journal:  Drug Metab Dispos       Date:  2021-03-03       Impact factor: 3.922

7.  Massively parallel characterization of CYP2C9 variant enzyme activity and abundance.

Authors:  Clara J Amorosi; Melissa A Chiasson; Matthew G McDonald; Lai Hong Wong; Katherine A Sitko; Gabriel Boyle; John P Kowalski; Allan E Rettie; Douglas M Fowler; Maitreya J Dunham
Journal:  Am J Hum Genet       Date:  2021-07-26       Impact factor: 11.025

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.