Literature DB >> 35089958

Identification of pharmacogenetic variants from large scale next generation sequencing data in the Saudi population.

Ewa Goljan1,2, Mohammed Abouelhoda3, Mohamed M ElKalioby2,3, Amjad Jabaan2, Nada Alghithi2, Brian F Meyer1,2, Dorota Monies1,2.   

Abstract

It is well documented that drug responses are related to Absorption, Distribution, Metabolism, and Excretion (ADME) characteristics of individual patients. Several studies have identified genetic variability in pharmacogenes, that are either directly responsible for or are associated with ADME, giving rise to individualized treatments. Our objective was to provide a comprehensive overview of pharmacogenetic variation in the Saudi population. We mined next generation sequencing (NGS) data from 11,889 unrelated Saudi nationals, to determine the presence and frequencies of known functional SNP variants in 8 clinically relevant pharmacogenes (CYP2C9, CYP2C19, CYP3A5, CYP4F2, VKORC1, DPYD, TPMT and NUDT15), recommended by the Clinical Pharmacogenetics Implementation Consortium (CPIC), and collectively identified 82 such star alleles. Functionally significant pharmacogenetic variants were prevalent especially in CYP genes (excluding CYP3A5), with 10-44.4% of variants predicted to be inactive or to have decreased activity. In CYP3A5, inactive alleles (87.5%) were the most common. Only 1.8%, 0.7% and 0.7% of NUDT15, TPMT and DPYD variants respectively, were predicted to affect gene activity. In contrast, VKORC1 was found functionally, to be highly polymorphic with 53.7% of Saudi individuals harboring variants predicted to result in decreased activity and 31.3% having variants leading to increased metabolic activity. Furthermore, among the 8 pharmacogenes studied, we detected six rare variants with an aggregated frequency of 1.1%, that among several other ethnicities, were uniquely found in Saudi population. Similarly, within our cohort, the 8 pharmacogenes yielded forty-six novel variants predicted to be deleterious. Based upon our findings, 99.2% of individuals from the Saudi population carry at least one actionable pharmacogenetic variant.

Entities:  

Mesh:

Year:  2022        PMID: 35089958      PMCID: PMC8797234          DOI: 10.1371/journal.pone.0263137

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Pharmacogenomics (PGx) studies genetic variations in an individual’s drug metabolizing enzymes, associating these with adverse drug events or the level of drug response [1]. Drug efficacy and toxicity may be predicted from the genetic background of individuals, particularly in respect of the Cytochrome P450 (CYP) family of liver enzymes [2, 3]. These enzymes catalyze the conversion of substances that are metabolized by our bodies, including pharmaceuticals [4]. Overall, the efficacy of a drug is related to its Absorption, Distribution, Metabolism and Excretion (ADME) [3, 5]. The efficacy of a drug may also be associated with drug target polymorphisms. Drug targets can include receptors, enzymes and membrane transporters [2]. CYPs are responsible for deactivation of many drugs through direct metabolic activity or via facilitation of excretion and thus play a central role in ADME related efficacy. CYPs are also important in enzymatic conversion of some drugs from their native to bioactive forms [6]. These differences in drug metabolism highlight the current trend towards individualized pharmacotherapy, such that the right drug is delivered at the right dose to the right patient. A standard dose of a given drug is not always safe, effective or economical in an individual patient [7, 8]. The high incidence of adverse drug events (ADEs) represents a heavy burden for the US health care system. Almost 7 million emergency department visits are related to ADEs each year with an estimated cost of $3.5 billion annually [9]. Large scale genomic studies provide opportunities to associate drug responses with individual pharmacogenetic profiles. Such knowledge may improve drug efficacy, result in better outcomes, and in some instances, prevents life-threatening adverse drug events. Dosing no longer needs to be based on the average drug responses of a patient population but can be personalized, taking into consideration individual pharmacogenomic and environmental variation. There are well-established drug-gene interactions, that include but are not limited to, clopidogrel (CYP2C19), warfarin (CYP2C9, VKORC1 and CYP4F2), thiopurines (TPMT, NUDT15), tacrolimus (CYP3A5) and fluorouracil (DPYD) [10-14]. These medications are commonly used globally with Saudi Arabia being no exception [9, 15]. Protocols targeting use of the right drug at the right dose in the right person, based on genomic data to personalize treatment, have already been clinically implemented successfully in other countries, e.g. the RIGHT protocol in the US and U-PGx project in Austria, Spain, Great Britain, Greece, Italy, Netherlands and Slovenia [16-18]. The allelic frequencies of genes encoding drug-metabolizing enzymes and their phenotypic consequences may vary considerably between ethnic groups. The impact of these allelic variants has been well studied in Caucasians and some other ethnicities, yet poorly in Arabs [5, 19]. This study expands pharmacogenetic knowledge of the Arab population. Description of the allelic spectrum of pharmacogenes, both known and novel variants, their frequency, and phenotypic designation in Saudi nationals, will provide a basis for better clinical management in this population. During the last decade technological advances have enabled comprehensive mapping of human pharmacogenes [20, 21]. Next Generation Sequencing (NGS) and High Performance Computing (HPC) are two technologies that have enhanced this field [22, 23]. The mining of variants using sequence data from population-based genome programs, provide an opportunity to characterize the pharmacogenomic profiles of each of these groups. Here we describe our findings from the Saudi population.

Results

Mining of NGS data from a total of 11,889 (1,928 PGx gene panels and of 9,961 exomes) unrelated individuals was used to impute allele and haplotype frequencies. We analyzed frequencies of 82 haplotypes distributed across 8 pharmacogenes (Table 1). Nineteen CYP2C9 variants (*2, *3, *5, *6, *7, *8, *9, *11, *12, *14, *24, *32, *33, *36, *39, *43, *44, *45, *60) were identified that jointly accounted for 21.1% of all CYP2C9 alleles in Saudi Arabs, however, only CYP2C9*2 with a minor allele frequency (MAF) of 13.4% and CYP2C9*3 (MAF = 5.3%) were relatively common. Fifteen variant alleles (*2, *3, *4A, *6, *8, *9, *12, *13, *15, *16, *17, *24, *28, *30, *34) were found in CYP2C19 of which *17 and *2 were the most common: 25.9% and 9.6%, respectively. A splice site variant (rs7767746) that is the core allele for CYP3A5*3 was present in 84.7% of the population. Three other alleles (*6, *7 and *8) in CYP3A5 showed MAFs from <0.1% to 2.4%. In CYP4F2, 44.4% of Saudi individuals harbor a *3 allele, the remaining population being wild type. We detected four VKORC1 alleles; the most common was VKORC1*2 (MAF = 53.7%) followed by rs7294, 3730G>A (MAF = 29.2%). Two other VKORC1 variants: 106G>T (rs61742245) and 196G>A (rs72547529) were less commonly observed with MAFs of 2.1% and <0.1%, respectively. Genetic polymorphisms in DPYD and TPMT were rare in the Saudi population. We identified eight variants (rs67376798, rs3918290, rs1801266, rs115232898, rs112766203.1, rs72549304, rs146356975, rs56038477) for DPYD and ten star alleles for TPMT although the overall MAFs for both these were low, 0.7% (DPYD) and 0.9% (TPMT). Two alleles (*3 and *5) were identified in NUDT15. The *3 allele was present with a MAF of 1.8%; *5 being much less common (MAF<0.1%), in the population.
Table 1

Frequencies and functional status of pharmacogenetic alleles in Saudi population.

GeneAlleleCore variantVariant typeFunctional StatusAllele Frequency, SA (%)
CYP2C9 *
*1 NoneNormal78.9
*2 rs1799853Missense (R144C)Decreased13.4
*3 rs1057910Missense (I359L)Inactive5.3
*4 rs56165452Missense (I359T)Decreased0
*5 rs28371686Missense (D360E)Decreased0.2
*6 rs9332131FrameshiftInactive0.1
*7 rs67807361Missense (L19I)Uncertain function<0.1
*8 rs7900194Missense (R150H)Decreased0.5
*9 rs2256871Missense (H251R)Normal0.7
*11 rs28371685Missense (R335W)Decreased0.6
*12 rs9332239Missense (P489S)Decreased<0.1
*13 rs72558187Missense (L90P)Inactive0
*14 rs72558189Missense (R125H)Decreased<0.1
*24 rs749060448Missense (E354K)Inactive<0.1
*32 rs868182778Missense (V490F)Uncertain function<0.1
*33 rs200183364Missense (R132Q)Inactive0.3
*36 rs114071557Start lostUncertain function<0.1
*39 rs762239445Missense(R124W)Inactive<0.1
*43 rs767576260Missense (R124W)Inactive<0.1
*44 rs200965026Missense (T130M))Decreased<0.1
*45 rs199523631Missense (R132W)Inactive<0.1
*60 rs767284820Missense (L467P)Uncertain function<0.1
CYP2C19 *
*1 NoneNormal63.3
*2 rs4244285Splicing defectInactive9.6
*3 rs4986893Stop-gain (W212X)Inactive0.1
*4A rs28399504Start lostInactive<0.1
*4B rs28399504, rs12248560Start lost, RegulatoryInactive0
*5 rs56337013Missense (R433W)Inactive0
*6 rs72552267Missense (R132Q)Inactive<0.1
*7 rs72558186Splicing defectInactive0
*8 rs41291556Missense (W120R)Inactive0.1
*9 rs17884712Missense (R144H)Decreased0.2
*10 rs6413438Missense (P227L)Decreased0
*12 rs55640102Stop-lost (X491C)Uncertain function<0.1
*13 rs17879685Missense (R410C)Normal0.4
*15 rs17882687Missense (I19L)Normal0.4
*16 rs192154563Missense (R442C)Decreased<0.1
*17 rs12248560RegulatoryIncreased25.9
*24 rs118203757Missense (R335Q)Inactive<0.1
*28 rs113934938Missense (V374I)Normal<0.1
*30 rs145328984Missense (R73C)Uncertain function0.1
*34 rs367543002, rs367543003Missense (P3S, F4L)Uncertain function<0.1
CYP3A5 *
*1 NoneNormal12.5
*2 rs28365083Missense (T398N)Uncertain function0
*3 rs776746Splicing defectInactive84.5
*6 rs10264272Splicing defectInactive2.4
*7 rs41303343FrameshiftInactive0.4
*8 rs55817950Missense (R28C)Uncertain function<0.1
CYP4F2 *
*1 NoneNormal55.6
*3 rs2108622Missense (V433M)Decreased function44.4
VKORC1 *
Wild-typeNoneNormal15.08
1173C>T (*2)rs9934438RegulatoryDecreased expression53.7
3730G>Ars7294UTRIncreased29.2
85G>Trs104894539Missense(V29L)Increased0
106G>Trs61742245Missense (D36Y)Increased2.1
172A>Grs104894541Missense (R58G)Increased0
196G>Ars72547529Missense (V66M)Increased<0.1
292C>Grs72547528Missense (R98W)Increased0
383T>Grs104894542Missense (L128R)Increased0
DPYD * / **
*1 NoneNormal99.82
2846A>Trs67376798Missense (D949V)Inactive<0.1
*2A rs3918290Splicing defectInactive0.1
*8 rs1801266Missense(R235W)Inactive<0.1
557A>Grs115232898Missense (Y186C)Decreased function0.1
2279C>Trs112766203.1Missense (T760I)Decreased function<0.1
1475C>Trs72549304Missense (S492L)Inactive<0.1
868A>Grs146356975Missense (K290E)Decreased function<0.1
1236G>A (HapB3)rs56038477Synonymous (E412 =)Decreased function0.5
TPMT **
*1 NoneNormal99.1
*2 rs1800462Missense (A80P)Inactive<0.1
*3A rs1800460, rs1142345Missense (A154T, Y240C)Inactive0.3
*3B rs1800460Missense (A154T)Inactive<0.1
*3C rs1142345Missense (Y240C)Inactive0.4
*6 rs75543815Missense (Y180F)Uncertain function0
*8 rs56161402Missense (R215H)Uncertain function0.2
*12 rs200220210Missense (S125L)Uncertain function<0.1
*24 rs6921269Missense (Q179H)Uncertain function0.1
*25 rs377085266Missense(C212R)Uncertain function<0.1
*34 rs111901354Missense (R82W)Uncertain function<0.1
NUDT15 *
*1 NoneNormal98.5
*3 rs116855232Missense (R139C)Inactive1.8
*5 rs186364861Missense (V18I)Uncertain function<0.1

Functional status of star alleles was defined according to:

* the Pharmacogene Variation Consortium (https://www.pharmvar.org) and the Clinical Pharmacogenetics Implementation Consortium guidelines (https://cpicpgx.org/guidlines/).

**-literature [24].

Functional status of star alleles was defined according to: * the Pharmacogene Variation Consortium (https://www.pharmvar.org) and the Clinical Pharmacogenetics Implementation Consortium guidelines (https://cpicpgx.org/guidlines/). **-literature [24]. Functional consequences predicted for PGx alleles in the Saudi population were found predominantly in CYP genes. In CYP3A5 we found the highest number (87.5%) of inactive alleles as a result of the frequently observed intronic splice site CYP3A5 *3 variant. CYP4F2 showed decreased function alleles in 44.4% of individuals, whereas, in two other CYP genes (CYP2C9 and CYP2C19) reduced function alleles (inactive or decreased) were less common, being 20.6% and 10.1%, respectively. In other prominent PGx genes, allele function was much more conserved, with only 1.8%, 0.7% and 0.7% of NUDT15, TMPT and DPYD variants predicted to affect activity, respectively. In contrast, functionally, VKORC1 was highly polymorphic with 53.7% of Saudi individuals harboring variants predicted to result in decreased activity, whereas 31.3% carry variants leading to increased metabolic activity (Table 1 and S1 Table, Fig 1).
Fig 1

Combined functional consequences of genetic variations in pharmacogenes, within the Saudi population.

Based on genotypic data and predicted functional consequences of variant alleles we defined genotype-to-phenotype correlations (Fig 2). The phenotyping algorithms were derived from CPIC guidelines which were available only for CYP2C19, CYP2C9, CYP3A5, TPMT, NUDT15, and DPYD. Extensive metabolizer (EM) was the most frequent category for DPYD (98.7%), TPMT (97.8%) and NUDT15 (95.6%). EM status was also the highest (64.5% and 38.3%) for CYP2C9 and CYP2C19 although a significant number of remaining individuals (35.4% and 61.6%) are predicted to carry an altered drug metabolizer status for these two genes. CYP3A5 non-expressers (poor metabolizer, PM) represented 77.8% of the population.
Fig 2

Percentage of the predicted CYP2C19 (A), CYP3A5 (B), DPYD (C), CYP2C9 (D), TPMT (E), NUDT15 (F) metabolizer groups.

EM, extensive metabolizer; IM, intermediate metabolizer; PM, poor metabolizer; UM, ultrarapid metabolizer; UNF, uncertain function.

Percentage of the predicted CYP2C19 (A), CYP3A5 (B), DPYD (C), CYP2C9 (D), TPMT (E), NUDT15 (F) metabolizer groups.

EM, extensive metabolizer; IM, intermediate metabolizer; PM, poor metabolizer; UM, ultrarapid metabolizer; UNF, uncertain function. The percentage of Saudi individuals who carry actionable PGx variant(s) is summarized in Fig 3. Of the 1928 Saudi individuals (genotyped using the PGx gene panel), 99.2% carry at least one actionable PGx allele, with a maximum of 5 detected in 1.1% of the population.
Fig 3

Percentage of Saudi subjects carrying actionable variants in zero to five pharmacogenomics genes.

Of all 62 previously reported, predicted to be pathogenic (based on a two-fold scoring) rare variants (MAF<1%), four (1 stop-gain, 2 frameshift and 2 missense variants) with an aggregated frequency of 0.67% were uniquely observed in Saudi individuals when compared with other populations (European, Finish, Hispanic, African, South Asian, East Asian, Ashkenazi Jews and Arabs). Two missense variants were only present in Arabs (Table 2 and S2 Table). Next, we identified 46 novel sequence alterations in seven of the eight PGx genes studied. They included 5 stop-gain, 5 splice site, 1 frameshift, and 35 missense variants with an ADME score of ≥84%. DPYD revealed the largest number (n = 19) of novel alterations, the most frequent being DPYD:p.Ile971Thr, having a MAF of 0.00055 (Table 3 and S3 Table).
Table 2

Rare pharmacogenetic variants present in Saudi population in comparison to other populations.

Minor allele frequency (%)
GeneRSIDVariant typeSAAFRAMRASJEASFINNFESASGMEQARBKaviar
CYP2C9 rs771127798Frameshift0.07600000<0.01000<0.01
CYP2C9 rs200985348Stop-gained0.017000000000<0.01
CYP3A5 rs1267703650Missense (I442T)0.1350<0.0100000000
CYP4F2 rs780094643Missense(G417V)0.13000000<0.0100.10<0.01
CYP4F2 rs763539865Frameshift0.421000000<0.0100<0.01
DPYD rs568132506Missense(P86L)0.32400.0090000.006<0.010.050.30.007

A complete summary of rare variants in Saudi population is presented in S2 Table. SA, Saudi Arabia; AFR, Africans; AMR, Latin/Admixed Americans; ASJ, Ashkenazi Jews; EAS, East Asians; FIN, European Finish; NFE, Non-Finish European; SAS, South Asians; GME, Greater Middle East Variome; QARB, Qatari Arabs; RSID, reference SNP cluster ID.

Table 3

Novel pharmacogenomics variants in Saudi population.

GeneVariantVariant typeMinor allele frequency, SA (%)
CYP2C19 NM_000769.4:c.914C>A:p.Thr305Asnmissense0.00420557
CYP2C19 NM_000769.4:c.332-1G>Asplice acceptor0.00841114
CYP2C19 NM_000769.4:c.482-2A>Gsplice acceptor0.00841114
CYP2C19 NM_000769.4:c.1034T>C:p.Met345Thrmissense0.01682227
CYP2C19 NM_000769.4:c.1071A>C:p.Arg357Sermissense0.00841114
CYP2C9 NM_000771.4:c.1023C>G:p.Asp341Glumissense0.00420557
CYP2C9 NM_000771.4:c.893G>C:p.Gly298Alamissense0.01261670
CYP2C9 NM_000771.4:c.961+1G>Asplice donor0.00841114
CYP2C9 NM_000771.4:c.1061A>G:p.Glu354Glymissense0.00420557
CYP2C9 NM_000771.4:c.1198G>T:p.Glu400Terstop gained0.01682227
CYP2C9 NM_000771.4:c.1243G>T:p.Glu415Terstop gained0.00420557
CYP3A5 NM_000777.5:c.1205A>G:p.His402Argmissense0.00420557
CYP3A5 NM_000777.5:c.1120G>A:p.Glu374Lysmissense0.00420557
CYP3A5 NM_000777.5:c.1067T>C:p.Leu356Promissense0.00420557
CYP3A5 NM_000777.5:c.1063T>C:p.Tyr355Hismissense0.00420557
CYP3A5 NM_000777.5:c.957T>A:p.Tyr319Terstop gained0.01682227
CYP3A5 NM_000777.5:c.931A>G:p.Ser311Glymissense0.00420557
CYP3A5 NM_000777.5:c.409T>C:p.Phe137Leumissense0.00420557
CYP3A5 NM_000777.5:c.219-2A>Gsplice acceptor0.00420557
CYP4F2 NM_001082.5:c.1288A>T:p.Asn430Tyrmissense0.02102784
CYP4F2 NM_001082.5:c.1231G>C:p.Gly411Argmissense0.00420557
CYP4F2 NM_001082.5:c.985G>A:p.Gly329Sermissense0.00420557
CYP4F2 NM_001082.5:c.889G>T:p.Asp297Tyrmissense0.00841114
DPYD NM_000110.4:c.2836delG:p.Ala946LeufsTer2frameshift0.00841114
DPYD NM_000110.4:c.1526C>G:p.Ser509Terstop gained0.00420557
DPYD NM_000110.4:c.958+1G>Asplice donor0.00841114
DPYD NM_000110.4:c.390T>A:p.Cys130Terstop gained0.00420557
DPYD NM_000110.4:c.2912T>C:p.Ile971Thrmissense0.05467239
DPYD NM_000110.4:c.2310C>G:p.Ile770Metmissense0.00420557
DPYD NM_000110.4:c.2137A>C:p.Asn713Hismissense0.00420557
DPYD NM_000110.4:c.2083T>G:p.Cys695Glymissense0.00420557
DPYD NM_000110.4:c.1804C>A:p.Pro602Thrmissense0.00420557
DPYD NM_000110.4:c.1679T>C:p.Ile560Thrmissense0.00420557
DPYD NM_000110.4:c.1657C>T:p.Pro553Sermissense0.00420557
DPYD NM_000110.4:c.1591G>A:p.Val531Metmissense0.00420557
DPYD NM_000110.4:c.1405A>G:p.Met469Valmissense0.00420557
DPYD NM_000110.4:c.1309G>A:p.Ala437Thrmissense0.00841114
DPYD NM_000110.4:c.1076T>C:p.Val359Alamissense0.00420557
DPYD NM_000110.4:c.574C>T:p.Leu192Phemissense0.00420557
DPYD NM_000110.4:c.431C>T:p.Ala144Valmissense0.01682227
DPYD NM_000110.4:c.217C>T:p.Leu73Phemissense0.00420557
DPYD NM_000110.4:c.194C>A:p.Thr65Lysmissense0.00420557
TPMT NM_000367.5:c.581G>A:p.Gly194Aspmissense0.00841114
TPMT NM_000367.5:c.454A>G:p.Arg152Glymissense0.01682227
TPMT NM_000367.5:c.202C>A:p.Pro68Thrmissense0.00420557
VKORC1 NM_024006.6:c.404G>A:p.Cys135Tyrmissense0.01261670
A complete summary of rare variants in Saudi population is presented in S2 Table. SA, Saudi Arabia; AFR, Africans; AMR, Latin/Admixed Americans; ASJ, Ashkenazi Jews; EAS, East Asians; FIN, European Finish; NFE, Non-Finish European; SAS, South Asians; GME, Greater Middle East Variome; QARB, Qatari Arabs; RSID, reference SNP cluster ID.

Discussion

Inter-individual differences in drug efficacy drive current trends towards personalized pharmacotherapy targeting delivery of the right drug, at the right dose to the right patient. A standard dose of a given drug is not always safe, effective or economical in an individual patient [7, 8]. Mining of large-scale NGS data is a very powerful tool for cataloging the range and frequency of genetic variation in populations [25]. We used whole exome and PGx gene panel NGS data to estimate pharmacogenetic diversity in the Saudi population, thus far poorly recorded in current databases, compared to many other ethnic groups. Our analysis provides the most comprehensive overview of PGx variability (predicted to be clinically relevant), of 8 phase I and phase II enzymes, in the Saudi population, published to date. We found that 61.6% of the Saudi cohort carry actionable CYP2C19 alleles, which may be associated with an increased risk of major adverse cardiovascular events during antiplatelet therapy with clopidogrel. In this instance ADEs range from stent thrombosis in poor and intermediate metabolizers, to bleeding risk in rapid and ultrarapid metabolizers. This drug was prescribed to several thousand patients who were treated at King Faisal Specialist Hospital and Research Centre, Riyadh, Saudi Arabia (KFSHR&RC) last year alone. Similar to European, African and Ashkenazi populations, CYP2C19*17 was the most frequent allele. CYP2C19*30 was unique to Arabs, with CYP2C19*13 and CYP2C19*15 detected in Saudi individuals, observed only in Africans (*13 and *15) and Jews (*15). Actionable CYP2C9 alleles associated with metabolism of warfarin were identified in 35.4% of Saudis. Furthermore, the CYP4F2*3 and VKORC1*2 variants responsible for increased [26] and decreased warfarin activity [12] respectively, were strongly represented in our study population. CYP4F2 acts as an important counterpart to VKORC1 in limiting excessive accumulation of vitamin K [27, 28]. Inappropriate warfarin dosing underlies one of the most frequently reported adverse events, acute haemorrhages being one of the most common emergency visits in the US [29]. At KFSH&RC alone, warfarin is prescribed for several thousand patients every year. According to updated CPIC guidelines, genotypes of CYP2C9, VKORC1 and CYP4F2, should be considered together to estimate therapeutic warfarin dosing. One of the key factors strongly considered in dosing algorithms include ethnicity and population related genetic information. The majority of PGx data underpinning these guidelines arises from European, African American and East Asian ancestry [12]. Very little is known about pharmacogenetics in Arabs. Our study shows that the frequency of CYP2C9*2, *3, and VKORC1*2 in the Saudi population is similar to that of Europeans[25]. Other CYP2C9 variants common in Africans and present in Europeans (e.g. CYP2C9*5, *6, *8, and *11), that should be considered in warfarin dosing algorithms due to associated bleeding risk, show low occurrence in the Saudi population. Based on our findings, and subject to clinical validation, dosing recommendations for warfarin in Saudi patients should follow those for non-African ancestry, as recommended in CPIC guidelines [22]. However, studying the impact of a significantly higher frequency, of the functionally inactive CYP2C9*33 allele, on warfarin dosing in the Saudi population is strongly indicated. The vast majority of the Saudi population carries the CYP3A5*3 variant that results in a truncated mRNA with loss of protein expression [30]. Frequency of the *3 allele varies extremely across human populations and is correlated with distance from the equator. Equatorial populations may experience shortage of water and a sodium retaining phenotype in hot climates [31]. Our findings show frequency of this allele in the Saudi population, to be similar to that in six other populations (Ashkenazi, European, American, Finish and both Asians) [24, 25]. This gene catalyzes the metabolism of tacrolimus, a mainstay immunosuppressant. Patients with the CYP3A5*3 allele require the standard dose of this medication [13]. At KFSH&RC alone ~4000 patients received tacrolimus last year, and 22.2% of these may be normal metabolizers (2.6%) or intermediate metabolizers (19.6.%), requiring an increased tacrolimus dose to achieve a successful outcome. Clinical validation of this would be required, particularly given relatedness of donors and recipients in a consanguineous population, where histoincompatibility may be less than observed elsewhere. Genetic variation in TPMT and NUDT15 are strongly linked to the risk for adverse reactions, to thiopurines commonly used for treatment of malignant and non-malignant conditions [11]. The “normal” starting doses are generally high based on clinical trials which are enriched in wild-type individuals. Full doses are tailored for normal metabolizers and may cause acute toxicity in intermediate and poor metabolizers [32]. Thiopurine tolerance is highly correlated with genetic ancestry [33]. The functionally inactive TPMT*3A allele is much less common in Saudi individuals relative to American, European and Ashkenazi populations [24, 25]. CPIC guidelines recommend a customized dose of thiopurines in compound intermediate metabolizers (intermediate metabolizers in both TMPT and NUDT15 [11]. We identified 0.03% (n = 3) compound intermediate metabolizers in Saudi population. Genetic variation in DPYD is a strong predictor of adverse risk related to use of the chemotherapeutic agent fluorouracil, commonly used in the treatment of various malignancies. Many cases have been reported of severe toxicities or even lethal outcome due to the DPYD poor or null metabolizer phenotype [34]. In our study we identified 1.3% of Saudi individuals who carry either a functionally normal allele plus one null or one functionally decreased allele, and would be predicted to be intermediate metabolizers. Reduced doses of fluorouracil may be indicated for these individuals [10]. More importantly our study detected in the Saudi population, the DPYD rare pathogenic mutation (c.257C>T) that may be responsible for severe toxicity in heterozygous patients or lethality in homozygous cancer patients treated with fluoropyrimidines [35]. We found this variant to be significantly enriched in the Saudi population with approximately 1 in every 333 individuals heterozygous for this allele. This DPYD allele is also present in the Qatari population (0.3%) whereas it is very rare in other populations, with frequencies (relative to the Saudi population) <36-fold in Americans, <52-fold in Europeans, <99-fold in South Asians, and was absent in other compared populations (Table 2 and S2 Table). Given the high rate of consanguinity (~60%) in Saudi Arabia, we can expect relative to outbred populations, a higher incidence of homozygotes for the DPYD (c.257C>T) mutation. Consanguinity increases the probability of a mate to be a carrier of the same recessive allele [36]. Thus, genotyping DPYD in the Saudi population may have greater clinical relevance. In most of the pharmacogenes screened we observed alleles shared with other Arabs [19, 24, 37], and some unique to the Saudi population. Amongst those shared with other Arabs, some were observed at significantly (p<0.05) different frequencies (S1 and S2 Tables). Large-scale NGS data mining enables discovery of novel and rare pharmacogenetic alterations [3]. They are often population specific alleles and are not incorporated within current pharmacogenomic assays. Our study shows that such variants are present in the Saudi population, with computational algorithms predicting their functional significance in multiple instances. They may significantly add to knowledge of potentially actionable variants in ADME genes within the Saudi population and should be further investigated. Novel variants require experimental validation to test their functional effects in drug response [38]. Our study highlights the value of mining large NGS databases as a powerful tool, to improve knowledge of genomic variation within ADME genes, and stimulate their further investigation and eventual implementation in clinical practice. The data we present from one of the larger Middle Eastern countries, provides the most comprehensive overview of pharmacogenetic variants in Arabs, who to date are underrepresented in international genomic databases. We believe it will have both immediate and near-term clinical implications, expanding the application of pharmacogenetics and the practice of precision or individualized medicine in Arab patients.

Study limitations

The clinical impact of variants identified by this study remain in question as information from relevant clinical trials are limited. While PGx variants are predicted to be actionable in other populations, one cannot assume that these variants will ultimately have the same impact in the Saudi population without clinical verification. Another limitation of our study is the technical constraints of exome sequencing; non-coding regions and loci with high genomic complexity are poorly, or not covered at all. Structural changes and copy number variations which may be relevant are not reliably identified by whole exome or gene panel sequencing. Thus, we were not able to call star alleles with whole gene deletions, duplications or hybrids that are common in the assignment of CYP2D6 alleles. Accordingly, we did not include CYP2D6 in our analysis. Furthermore, actionable variants located in non-coding regions CYP2C19 rs12248560, CYP3A5 rs776746, VKORC1 rs9934438, VKORC1 rs7294, DPYD rs67376798 were not covered by whole exome sequencing, our data for these being exclusively obtained from the PGx custom gene panel only.

Methods

Manuscript was based on access to fully anonymized data from Saudi Human Genome Project for which waiver of consents was granted by the IRB of King Faisal Specialist Hospital and Research Center. The dataset used for mining of pharmacogenomic variants comprised 9,961 exomes and 1,928 PGx custom gene panels (genes are listed in S4 Table), from unrelated Arab individuals sequenced by the Saudi Human Genome Program (SHGP) between 2015 and 2019, as part of a comprehensive investigation of rare diseases in the Saudi population [39, 40]. We studied eight genes for which the Clinical Pharmacogenetics Implementation Consortium (CPIC) guidelines are curated (https://cpicpgx.org/guidelines/) and present on FDA labels (https://www.fda.gov.Drugs/ScienceReseach/ucm572698). CYP star allele assignment and their clinical function was derived from Pharmacogene Variation Consortium (https://www.pharmvar.org/genes/) and CPIC allele functional tables. Metabolizer types were inferred based on CPIC guidelines and the Pharmacogenomics Knowledgebase (PharmGKB) https://www.pharmgkb.org/ and they were defined as follows: ultrarapid metabolizer (UM), intermediate metabolizer (IM), extensive/normal metabolizer (EM), poor metabolizer (PM), rapid metabolizer (RM), IM to EM and PM to EM. Our method for Star allele calling was based upon using the Stargazer algorithm (v.1.0.8). This algorithm performs statistical haplotype phasing using Beagle [41] with reference samples from the 1000 Genomes Project [42]. The Beagle method is based on localized haplotype-cluster model, which is an empirical linkage disequilibrium model that can take the local structure in the data into consideration. The Beagle algorithm is accurate and runs fast due to the use of an EM-based algorithm that literately fits the best model to the data. Afterwards, the phased haplotypes computed by Beagle are then matched to publicly available star allele information, mostly in the PharmVar database (https://www.pharmvar.org) and PharmGKB (https://www.pharmgkb.org/). Finally, Stargazer reports the star allele findings in a tabular format along with prediction of the related metabolizer information. Frequencies of intronic and UTR variants were covered only by the PGx panel and their frequency was calculated based on the cohort of 1928 individuals. Variants with MAF <1% were defined as rare and genetic alterations with frequencies that exceeded the observed frequencies in other populations (European, Finish, Hispanic, African, South Asian, East Asian, Ashkenazi Jews and Arabs) by >20-fold were considered as being “Saudi-specific”. A Chi-square test was used to determine the statistical difference for allele frequencies between different populations. A p-value less than 0.05 was considered significant. Next, we classified alleles as novel if they were not observed in: 1000 Genomes (phase3), gnomAD (v.3.1.1), Exac (v.0.3) and Kaviar (v.160204). Functional consequence of PGx rare Saudi-specific and novel variants was predicted using a two-fold approach. Any variants with a high IMPACT rating, such as frameshift indels or stop loss variants were considered to be deleterious [43, 44]. We then applied the ADME-optimized framework that is an ensemble of deleteriousness prediction methods for predicting deleteriousness in pharmacogenes. We used 18 prediction algorithms to compute the ADME scores including CADD, SIFT, PolyPhen, LRT (likelihood ratio test), MutationAssessor, FATHMM, FATHMM-MKL, PROVEAN, VEST3, DANN, MetSVM, MetaLR, GERP++, SiPhy, PhyloP-vertebrate, PhyloP-mammalian, PhastCons-vertebrate, and PhastCons-mammalian. ADME scores larger than 84% were considered to affect pharmacogene functionality [45]. We used phenotypes generated from Stargazer for CYP2C19, CYP2C9, CYP3A5, DPYD, NUDT15 and TPMT to determine the percentage of individuals predicted to have actionable PGx variants. For VKORC1 (rs9934438) and CYP4F2*3 (rs2108622), individuals carrying heterozygous (CT) or homozygous (TT) and heterozygous (GA) or homozygous (AA), respectively were considered to have an actionable variant in those genes.

Actionable PGx variants identified in the Saudi population.

(XLSX) Click here for additional data file.

List of rare PGx variants.

(XLSX) Click here for additional data file.

List of novel PGx variants in the Saudi population.

(XLSX) Click here for additional data file.

List of genes in the custom PGx gene panel.

(XLSX) Click here for additional data file.

Classification thresholds and prediction algorithms for novel PGx variants.

(XLSX) Click here for additional data file.
  45 in total

Review 1.  Drug target pharmacogenomics: an overview.

Authors:  J A Johnson
Journal:  Am J Pharmacogenomics       Date:  2001

2.  Pharmacogenomics, ancestry and clinical decision making for global populations.

Authors:  E Ramos; A Doumatey; A G Elkahloun; D Shriner; H Huang; G Chen; J Zhou; H McLeod; A Adeyemo; C N Rotimi
Journal:  Pharmacogenomics J       Date:  2013-07-09       Impact factor: 3.550

3.  Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering.

Authors:  Sharon R Browning; Brian L Browning
Journal:  Am J Hum Genet       Date:  2007-09-21       Impact factor: 11.025

4.  Comprehensive overview of the pharmacogenetic diversity in Ashkenazi Jews.

Authors:  Yitian Zhou; Volker M Lauschke
Journal:  J Med Genet       Date:  2018-07-03       Impact factor: 6.318

5.  Clinical Pharmacogenetics Implementation Consortium (CPIC) Guidelines for CYP3A5 Genotype and Tacrolimus Dosing.

Authors:  K A Birdwell; B Decker; J M Barbarino; J F Peterson; C M Stein; W Sadee; D Wang; A A Vinks; Y He; J J Swen; J S Leeder; Rhn van Schaik; K E Thummel; T E Klein; K E Caudle; I A M MacPhee
Journal:  Clin Pharmacol Ther       Date:  2015-06-03       Impact factor: 6.875

6.  Uncommon dihydropyrimidine dehydrogenase mutations and toxicity by fluoropyrimidines: a lethal case with a new variant.

Authors:  Marzia Del Re; Erica Quaquarini; Federico Sottotetti; Angela Michelucci; Raffaella Palumbo; Paolo Simi; Romano Danesi; Antonio Bernardo
Journal:  Pharmacogenomics       Date:  2015-12-14       Impact factor: 2.533

7.  Clinical Pharmacogenetics Implementation Consortium guidelines for CYP2C19 genotype and clopidogrel therapy: 2013 update.

Authors:  S A Scott; K Sangkuhl; C M Stein; J-S Hulot; J L Mega; D M Roden; T E Klein; M S Sabatine; J A Johnson; A R Shuldiner
Journal:  Clin Pharmacol Ther       Date:  2013-05-22       Impact factor: 6.875

8.  Comprehensive gene panels provide advantages over clinical exome sequencing for Mendelian diseases.

Authors: 
Journal:  Genome Biol       Date:  2015-06-26       Impact factor: 13.583

9.  Cross-Comparison of Exome Analysis, Next-Generation Sequencing of Amplicons, and the iPLEX(®) ADME PGx Panel for Pharmacogenomic Profiling.

Authors:  Eng Wee Chua; Simone L Cree; Kim N T Ton; Klaus Lehnert; Phillip Shepherd; Nuala Helsby; Martin A Kennedy
Journal:  Front Pharmacol       Date:  2016-01-26       Impact factor: 5.810

10.  Ensembl 2021.

Authors:  Kevin L Howe; Premanand Achuthan; James Allen; Jamie Allen; Jorge Alvarez-Jarreta; M Ridwan Amode; Irina M Armean; Andrey G Azov; Ruth Bennett; Jyothish Bhai; Konstantinos Billis; Sanjay Boddu; Mehrnaz Charkhchi; Carla Cummins; Luca Da Rin Fioretto; Claire Davidson; Kamalkumar Dodiya; Bilal El Houdaigui; Reham Fatima; Astrid Gall; Carlos Garcia Giron; Tiago Grego; Cristina Guijarro-Clarke; Leanne Haggerty; Anmol Hemrom; Thibaut Hourlier; Osagie G Izuogu; Thomas Juettemann; Vinay Kaikala; Mike Kay; Ilias Lavidas; Tuan Le; Diana Lemos; Jose Gonzalez Martinez; José Carlos Marugán; Thomas Maurel; Aoife C McMahon; Shamika Mohanan; Benjamin Moore; Matthieu Muffato; Denye N Oheh; Dimitrios Paraschas; Anne Parker; Andrew Parton; Irina Prosovetskaia; Manoj P Sakthivel; Ahamed I Abdul Salam; Bianca M Schmitt; Helen Schuilenburg; Dan Sheppard; Emily Steed; Michal Szpak; Marek Szuba; Kieron Taylor; Anja Thormann; Glen Threadgold; Brandon Walts; Andrea Winterbottom; Marc Chakiachvili; Ameya Chaubal; Nishadi De Silva; Bethany Flint; Adam Frankish; Sarah E Hunt; Garth R IIsley; Nick Langridge; Jane E Loveland; Fergal J Martin; Jonathan M Mudge; Joanella Morales; Emily Perry; Magali Ruffier; John Tate; David Thybert; Stephen J Trevanion; Fiona Cunningham; Andrew D Yates; Daniel R Zerbino; Paul Flicek
Journal:  Nucleic Acids Res       Date:  2021-01-08       Impact factor: 16.971

View more
  2 in total

1.  Genetic Variations of the DPYD Gene and Its Relationship with Ancestry Proportions in Different Ecuadorian Trihybrid Populations.

Authors:  Camila Farinango; Jennifer Gallardo-Cóndor; Byron Freire-Paspuel; Rodrigo Flores-Espinoza; Gabriela Jaramillo-Koupermann; Andrés López-Cortés; Germán Burgos; Eduardo Tejera; Alejandro Cabrera-Andrade
Journal:  J Pers Med       Date:  2022-06-10

2.  Prevalence of exposure to pharmacogenetic drugs by the Saudis treated at the health care centers of the Ministry of National Guard.

Authors:  Mohammad A Alshabeeb; Mesnad Alyabsi; Bien Paras
Journal:  Saudi Pharm J       Date:  2022-06-22       Impact factor: 4.562

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.