Literature DB >> 32390825

The Identification of Admixture Patterns Could Refine Pharmacogenetic Counseling: Analysis of a Population-Based Sample in Mexico.

José Jaime Martínez-Magaña1, Alma Delia Genis-Mendoza1,2, Jorge Ameth Villatoro Velázquez3,4, Beatriz Camarena5, Raul Martín Del Campo Sanchez3,4, Clara Fleiz Bautista3,4, Marycarmen Bustos Gamiño3, Esbehidy Reséndiz3, Alejandro Aguilar5, María Elena Medina-Mora3,4, Humberto Nicolini1.   

Abstract

Pharmacogenetic analysis has generated translational data that could be applied to guide treatments according to individual genetic variations. However, pharmacogenetic counseling in some mestizo (admixed) populations may require tailoring to different patterns of admixture. The identification and clustering of individuals with related admixture patterns in such populations could help to refine the practice of pharmacogenetic counseling. This study identifies related groups in a highly admixed population-based sample from Mexico, and analyzes the differential distribution of actionable pharmacogenetic variants. A subsample of 1728 individuals from the Mexican Genomic Database for Addiction Research (MxGDAR/Encodat) was analyzed. Genotyping was performed with the commercial PsychArray BeadChip, genome-wide ancestry was estimated using EIGENSOFT, and model-based clustering was applied to defined admixture groups. Actionable pharmacogenetic variants were identified with a query to the Pharmacogenomics Knowledge Base (PharmGKB) database, and functional prediction using the Variant Effect Predictor (VEP). Allele frequencies were compared with chi-square tests and differentiation was estimated by FST. Seven admixture groups were identified in Mexico. Some, like Group 1, Group 4, and Group 5, were found exclusively in certain geographic areas. More than 90% of the individuals, in some groups (Group 1, Group 4 and Group 5) were found in the Central-East and Southeast region of the country. MTRR p.I49M, ABCG2 p.Q141K, CHRNA5 p.D398N, SLCO2B1 rs2851069 show a low degree of differentiation between admixture groups. ANKK1 p.G318R and p.H90R, had the lowest allele frequency of Group 1. The reduction in these alleles reduces the risk of toxicity from anticancer and antihypercholesterolemic drugs. Our analysis identified different admixture patterns and described how they could be used to refine the practice of pharmacogenetic counseling for this admixed population.
Copyright © 2020 Martínez-Magaña, Genis-Mendoza, Villatoro Velázquez, Camarena, Martín del Campo Sanchez, Fleiz Bautista, Bustos Gamiño, Reséndiz, Aguilar, Medina-Mora and Nicolini.

Entities:  

Keywords:  Mexican population-based sample; MxGDAR/Encodat; admixture; genomics; pharmacogenetics

Year:  2020        PMID: 32390825      PMCID: PMC7188951          DOI: 10.3389/fphar.2020.00324

Source DB:  PubMed          Journal:  Front Pharmacol        ISSN: 1663-9812            Impact factor:   5.810


Introduction

In recent years, pharmacogenetic (PGx) studies have generated substantial information that is useful in clinical settings (Relling and Evans, 2015; Van Der Wouden et al., 2016; Verbelen et al., 2017). PGx variation influences the efficacy and toxicity of drugs through the alteration of pharmacodynamic or pharmacokinetic processes. Pharmacogenetic studies have uncovered many relationships between drugs and specific genes, but not all of this information can be used to implement PGx-based treatment guidelines. Various initiatives have attempted to compile information and develop PGx evidence-based drug dosing guidelines, like the Pharmacogenomics Knowledge Base (PharmGKB; (Thorn et al., 2013)). The PharmGKB classifies pharmacogenes into four different evidence levels (Barbarino et al., 2018), where pharmacogenes at higher levels are termed “actionable”–that is, they can be used in treatment guidelines for PGx-based counseling. Most pharmacogenetic studies have been carried out on individuals with a low degree of admixture. Actionable allele frequencies are dependent on ancestry, and such differences must be taken into account in clinical counseling (Abdul Jalil et al., 2015; Mizzi et al., 2016; Goh et al., 2017). A recent analysis of PGx variation across 26 global populations using data from Phase 3 of the 1000 Genomes Project identified clusters of individuals by continent, with high degrees of differentiation even among continental populations (Wright et al., 2018). Researchers have expressed concerns about the assignment of ancestry in PGx analysis, principally because of its potential impact on the differentiation of genetic variants among continental populations (Zhang and Finkelstein, 2019). Mexico has a differential pattern of ancestry, made up primarily of three populations: Native American, European, and African (Wang et al., 2008; Moreno-Estrada et al., 2014). Its genomic ancestry has been divided into Native American (NA) and Mexican Mestizo (MM) classifications (Silva-Zolezzi et al., 2009). Differences have been reported between these two groups in actionable PGx variants, including greater allele frequency of VKORC1 (rs8050894), CYP2B6 (rs2279343), and CYP3A5 (rs776746) in the NA population, and of CYP2C19 (rs4244285), CYP2C9 (rs1799853, rs1057910), NAT2 (rs179930), SLCO1B1 (rs4149015), and APOE (rs7412) in the MM population (Gonzalez-Covarrubias et al., 2019). The MM population is the largest in Mexico, and the prevalence of admixture patterns in this population is high. Grouping all MM individuals together could thus hide differences in allele frequencies. These differences have not been estimated for the MM population based on the degree of admixture or the distribution of the genome-wide global ancestry. However, an analysis of the admixture patterns and clustering of individuals with similarities could improve pharmacogenetic counseling for individuals in this population. The aim of the present study is thus to analyze global admixture patterns in a population-based Mexican sample in order to assess their impact on actionable pharmacogenetic variants.

Methods

Participants

This study analyzed a subsample of the Mexican Genomic Database for Addiction Research (MxGDAR/Encodat), derived from the Mexican National Survey of Tobacco, Alcohol, and Drug use (Villatoro-Velázquez et al., 2017). All of the geographic regions where more than half of the population speaks a Native American (NA) language were excluded (Villarreal, 2014). The survey was carried out in two phases, with the sampling performed in the second phase. There were questionnaires in each phase: the first focused on sociodemographic, social, and interpersonal information, with a section on patterns of alcohol, tobacco, and drug use (Reséndiz-Escobar et al., 2018), and the second on screening for psychiatric symptomatology (Pato et al., 2013). In the second phase, a sample of buccal epithelial cells was also collected ( ). All of the protocols in this study were approved by the Research Ethics Committees of the Instituto Nacional de Psiquiatría Ramón de la Fuente Muñiz (Approval No. CEI/C/083/2015) and the Instituto Nacional de Medicina Genómica (Approval No. 01/2017/I). The aims of the study were explained to each participant, and each was informed that they could end their participation at any time. All participants provided written informed consent; assent for a minor participant was obtained both from the participant and from a parent or legal guardian.
Figure 1

Sampling scheme for the Mexican Genomic Database for Addiction Research (MxGDAR/Encodat).

Sampling scheme for the Mexican Genomic Database for Addiction Research (MxGDAR/Encodat).

DNA Extraction, Microarray Genotyping, and Quality Control

DNA was extracted using a commercial modified salting-out procedure (Qiagen, USA), according to the manufacturer’s instructions. DNA extraction quality and integrity were evaluated by analysis with a NanoDrop spectrophotometer (Thermofisher, USA) and 2% agarose gel. The MxGDAR/Encodat database included 7171 of the 13,228 buccal epithelial samples collected (54.21%) that met the following quality criteria: i) 230/260 and 260/230 ratios > 1.8, ii) concentration > 50 ng/µL, and iii) no signs of DNA degradation. The DNA extraction was divided evenly between the Laboratorio de Genómica de Enfermedades Psiquiátricas y Neurodegenerativas of the Instituto Nacional de Medicina Genómica and the Departamento de Farmacogenómica of the Instituto Nacional de Psiquiatría Ramón de la Fuente Muñiz. We found no difference in the quality of microarray samples between the laboratories; the insufficient quality of half of the samples could have been an effect of problems in sample collection procedures. In a household survey like the Encodat 2016, collecting biological samples is problematic because it cannot be done under the controlled conditions of a clinical environment (Gudiseva et al., 2016). Genotyping was performed with the commercial microarray PsychArray BeadChip (Illumina, USA), according to the manufacturer’s instructions. In this preliminary analysis we genotyped a total of 1728 samples, with a random sampling that included at least 15 individuals from each of the 32 states in Mexico. The fluorescent intensities were read with the iScan system (Illumina, USA) and converted to genotype calls with Genome Studio software (Illumina, USA). Genotyping was carried out at the Unidad de Alta Tecnología para Expresión y Microarreglos of the Instituto Nacional de Medicina Genómica. The quality control of the genotyped data was performed using Plink (Purcell et al., 2007). Single nucleotide polymorphisms (SNPs) were excluded if they did not meet the following criteria: i) no-call rate > 5%, ii) minor allele frequency (MAF) < 5%, iii) p-value < 1E-06 in a chi-square test for Hardy-Weinberg equilibrium, or iv) they were duplicates. Individuals were excluded if they had no call rate > 5%. To identify cryptic familial relationships, we performed an identity-by-state (IBS): all individuals with an IBS > 1.6 were marked and those with the lower genotyping rate were excluded. This quality control left 1657 individuals for analysis.

Global Ancestry Estimations and Model-Based Admixture Clustering

Global ancestry estimations were performed with a genome-wide approach. For this purpose, the SNPs of the 1657 individuals that remained after quality filters were then filtered for independence. The linkage disequilibrium (LD) pruning algorithm was implemented in Plink, with a window size of 50 kb, a step size of 5, and a variance inflation factor (VIF) of 2. LD pruning left a total of 104,726 SNPs for analysis. The database was then merged with the SNPs of the Human Genome Diversity Project (HGDP; (Cavalli-Sforza, 2005)), and the SNPs not present in either database were excluded, leaving a total of 25,562 SNPs for genome-wide global ancestry estimations. EIGENSOFT (Price et al., 2006), was used to calculate ten global ancestry components. In this estimation, 133 individuals with values greater than three standard deviations were excluded, leaving 1524 individuals for analysis. The 133 excluded individuals had admixture proportions of European ancestry greater than 96%, as estimated through a model-based analysis with ADMIXTURE software (Scheet and Stephens, 2006). We removed these individuals, because we also filtered the communities with higher NA ancestry during the sampling procedure, to not deviate the calculation of admixture patterns to the founders’ populations. The sociodemographic characteristics of the sample are shown in .
Table 1

Sociodemographic characteristics of the subsample.

Subsample MxGDAR/Encodat (n = 1525)
Age (years) (SD) 38.34 (14.46)
Gender
 Male652 (42.75%)
 Female873 (57.25%)
Marital Status
 Married645 (42.30%)
 Cohabiting289 (18.95%)
 Separated82 (5.38%)
 Divorced30 (1.98%)
 Widowed56 (3.67%)
 Single423 (27.74%)
Occupation
 Professional30 (1.96%)
 Teacher13 (0.85%)
 Company Director3 (0.19%)
 Small Business Owner85 (5.57%)
 Office Worker78 (5.11%)
 Skilled worker73 (4.79%)
 Unskilled worker227 (18.16%)
 Farm Worker 24 (1.57%)
 Farmer71 (4.66%)
 Unemployed68 (4.46%)
 Student115 (7.54%)
 Housewife519 (34.03%)
 Retired30 (1.98%)
 Other139 (9.04%)
Sociodemographic characteristics of the subsample. Next, a model-based clustering algorithm using mclust software (Scrucca et al., 2016) carried out Gaussian mixture modeling to identify admixture groups in the subsample of 1524 individuals. A geographic distribution of the groups was then determined for the following regions of Mexico: i) Northwest (Baja California, Baja California Sur, Nayarit, Sinaloa, and Sonora), ii) North (Coahuila, Chihuahua, Durango, Nuevo León, San Luis Potosí, Tamaulipas, and Zacatecas), iii) Central-West (Aguascalientes, Colima, Jalisco, and Michoacán), iv) Central-East (Mexico City, Guerrero, Hidalgo, Estado de México, Morelos, Puebla, Quéretaro, and Tlaxcala), and v) Southeast (Campeche, Chiapas, Oaxaca, Quintana Roo, Tabasco, Veracruz, and Yucatán) (Borges et al., 2018).

Annotation and Actionable Pharmacogenetic Variant Analysis in Admixture Subgroups

Variants in genes with actionable pharmacogenetic effects were then identified using the hg19 coordinates for those genes. Genes were included for evidence levels of 1 to 2, according to the classification of the PharmGKB (Barbarino et al., 2018), a searchable pharmacogenetic knowledge database that categorizes genes according to levels of evidence for alterations in their response to different drugs, and that provides pharmacogenetic treatment guidance where those levels are high. For genes with evidence levels of 1 or 2, PsychArray has a total of 7955 single-nucleotide variants (65.83% of the total reported in the 1000 Genomes Database), of which 5809 in our population (73.02%) had an MAF < 0.05, 185 (23.33%) were excluded for a low call rate, and 11 (0.14%) were excluded for Hardy-Weinberger disequilibrium ( contains the list of variants lost in these filtering steps). Some important pharmacogenetic variation was lost during these filtering processes, like those found in CYP2D6 (evidence level 1A). Of the twelve variants design on CYP2D6 that could be found in the PsychArray, 10 were lost in the MAF, and 2 in the call rate filtering. We extracted the variants in the genes with actionable pharmacogenetic effect with annotations in the Ensembl Variant Effect Predictor (McLaren et al., 2016), and extracted all the variants that were missense (classifying these as damaging if a damaging effect was predicted by both SIFT and Polyphen2, but benign if only one algorithm predicted a damaging effect), synonymous, or annotated with a regulatory region. For regulatory variants, we extracted those with an annotated regulatory region based in the ENCODE identifier. A manual search was performed in PharmGKB for the evidence level for the benign missense, synonymous, and regulatory variants; for annotated variants a search was performed in PubMed for reports of at least 10 associations with drug-variant relationships. The global allele frequency of each actionable pharmacogenetic missense variant in the MxGDAR/Encodat was compared with those identified in the Genome Aggregation Database (gnomAD; (Kobayashi et al., 2017)) using a chi-square test and a delta de MAF calculation (dMAF). The degree of differentiation of variants between the admixture subgroups was analyzed with Wright’s fixation index (FST; (Patterson et al., 2006)).

Results

Analysis of the Regional Distribution of Admixture Subgroups

An analysis of global ancestry found the distribution of European to Native American ancestry of individuals in the MxGDAR/Encodat ( ), and an analysis of admixture groups detected seven groups ( ). Groups 2 and 3 show the greatest frequency, with a total of 821 (53.87%) individuals in the two groups ( ). Groups 2, 3, and 7 have a heterogeneous distribution in different regions, with no local concentration. Groups 4 and 5, with a total of 270 (17.72%) individuals, are concentrated in the Southeast, with more than 90% of the individuals in each group in that region ( ). The 170 individuals in Group 1 (72.96%) are in the Central-East and Southeast, and the 160 (94.67%) in Group 6 are in the Northwest, North, and Central-West regions. Groups 1 and 4 are the most closely related to the Native American population ( ).
Figure 2

Admixture analysis of the MxGDAR/Encodat. Panels: (A) Principal component analysis with the Human Genome Diversity Project Database (HGDP) admixture reference; individuals from the MxGDAR/Encodat in red and individuals from the HGDP database in gray; (B) Plotting of the individuals from the MxGDAR/Encodat (those in red in panel A), color according to the admixture group identified by Gaussian model-based clustering; (C) Analysis of the geographical distribution of the identified admixture subgroups in each region. Each pie represents the percentages of admixture groups identified in that particular region.

Table 2

Distribution of admixture groups in Mexico by region.

Region1 Group 1Group 2Group 3Group 4Group 5Group 6Group 7Total
Northwest 7 (3.68)116 (30.77)27 (6.08)1 (0.64)1 (0.88)33 (19.53)5 (16.12)190 (12.47)
North 39 (8.82)114 (30.24)192 (43.24)0 (0.00)2 (1.77)89 (52.66)6 (19.35)442 (29.00)
Central-West 17 (7.30)97 (25.73)94 (21.17)0 (0.00)0 (0.00)38 (22.49)1 (3.23)247 (16.21)
Central-East 123 (52.79)38 (10.08)101 (22.75)1 (0.64)3 (2.65)8 (4.73)8 (25.81)282 (18.50)
Southeast 47 (20.17)12 (3.12)30 (6.76)155 (98.73)107 (94.69)1 (0.59)11 (35.48)363 (23.82)
Total233 (15.29)377 (24.74)444 (29.14)157 (10.30)113 (7.41)169 (11.09)31 (2.03)1524 (100.00)

1States included in each region: Northwest: Baja California, Baja California Sur, Nayarit, Sinaloa, and Sonora; North: Coahuila, Chihuahua, Durango, Nuevo León, San Luis Potosí, Tamaulipas, and Zacatecas; Central-West: Aguascalientes, Colima, Jalisco, and Michoacán; Central-East: Mexico City, Guerrero, Hidalgo, Estado de México, Morelos, Puebla, Quéretaro, and Tlaxcala; and Southeast: Campeche, Chiapas, Oaxaca, Quintana Roo, Tabasco, Veracruz, and Yucatán.

Admixture analysis of the MxGDAR/Encodat. Panels: (A) Principal component analysis with the Human Genome Diversity Project Database (HGDP) admixture reference; individuals from the MxGDAR/Encodat in red and individuals from the HGDP database in gray; (B) Plotting of the individuals from the MxGDAR/Encodat (those in red in panel A), color according to the admixture group identified by Gaussian model-based clustering; (C) Analysis of the geographical distribution of the identified admixture subgroups in each region. Each pie represents the percentages of admixture groups identified in that particular region. Distribution of admixture groups in Mexico by region. 1States included in each region: Northwest: Baja California, Baja California Sur, Nayarit, Sinaloa, and Sonora; North: Coahuila, Chihuahua, Durango, Nuevo León, San Luis Potosí, Tamaulipas, and Zacatecas; Central-West: Aguascalientes, Colima, Jalisco, and Michoacán; Central-East: Mexico City, Guerrero, Hidalgo, Estado de México, Morelos, Puebla, Quéretaro, and Tlaxcala; and Southeast: Campeche, Chiapas, Oaxaca, Quintana Roo, Tabasco, Veracruz, and Yucatán.

Analysis of Actionable Pharmacogenetic Missense Damaging Variants

Analysis of the actionable pharmacogenetic missense variants found a total of 32 variants in 31 genes, four of which have a PharmGkB evidence level of 1 ( ): SLCO1B1*5, NUDT15*3, and CYP4F2*3 with evidence level 1A, and XPC p.Q939K with level 1B. The difference in minor allele frequency (dMAF) in these variants between the individuals in the MxGDAR/Encodat and the gnomAD range from -0.27 to -0.36 and are statistically significant. The variant with the greatest difference, -0.36, was XPC p.Q939K. NUDT15*3 was the only variant with a greater allele frequency in the MxGDAR/Encodat than in the gnomAD, with a difference of 0.45.
Table 3

Missense damaging variants in pharmacogenes with pharmGkB evidence Level 1.

Evidence 1A
ChrbpGeneSNPAllelesSynonymousAmino acid changeMAF1 gMAF2 ΔMAF3 DrugEffect on Drug
1221331549 SLCO1B1 rs4149056T > C SLCO1B1*5V174A0.10600.1326-.0266SimvastatinHigher risk of myopathy
1348619855 NUDT15 rs116855232C > T NUDT15*3 R139C0.07330.02810.0452Purine-based compoundsHigher risk of leukopenia, neutropenia, or alopecia
1915990431 CYP4F2 rs2108622C > T CYP4F2*3 V433M0.19200.2735-.0815WarfarinHigher dose
Evidence 1B
Chr bp Gene SNP Alleles Synonymous Amino acid change MAF gMAF ΔMAF Drug Effect on Drug
314187449 XPC rs2228001G > TNRQ939K0.26010.6359-.3758Platinum-based compoundsDecreased risk for toxicity

1MAF, Minor allele frequency in the MxGDAR/Encodat. 2gMAF, Global minor allele frequency (MAF) reported by the gnomAD. 3ΔMAF, difference between MxDAR/Encodat individuals and gMAF.

Missense damaging variants in pharmacogenes with pharmGkB evidence Level 1. 1MAF, Minor allele frequency in the MxGDAR/Encodat. 2gMAF, Global minor allele frequency (MAF) reported by the gnomAD. 3ΔMAF, difference between MxDAR/Encodat individuals and gMAF. There were 28 variants at the PharmGkB evidence level 2 ( ), 13 of which were at level 2A (MTHFR p.A22V, FCGR2A p.H166R, UGT1A1*6, ADRB2 p.G71R, ABCB1*2, NAT2*6, CYP2C8*3, KCNJ11 p.K23E, GSTP1 p.I188V, VDR p.M51T, NQO1 p.P187S, APOE-E2, and COMT p.V158M), and 15 at level 2B (EPHX1 p.Y113H and p.H139R, UMPS p.G213A, ADD1 p.G460W, UGT2B15*2, MTRR p.I49M, OPRM1 p.N40D, SOD2 p.V16A, CHRNA5 p.D398N, GP1BA p.T161M, XRCC1 p.Q399R, ERCC1 p.Q506K, ITPA p.P32T, CBR3 p.V244M, and PNPLA3 p.I148M). There were differences in allele frequencies ranging from -0.45 to 0.18 between the MxGDAR/Encodat and the gnomAD. The NQO1 p.P187S variant showed the greatest positive difference in minor allele frequency (dMAF = 0.1835) in the MxGDAR/Encodat, and the XRCC1 p.Q399R the greatest negative difference (dMAF = -0.45). The variants MTHFR p.A22V, GSTP1 p.I188V, NQO1 p.P187S, UMPS p.G213A, ERCC1 p.Q506K, and PNPLA3 p.I148M showed the largest positive differences in the MXGDAR/Encodat, and NAT2*6, KCNJ11 p.K23E, VDR p.M51T, EPHX1 p.Y113H, UGT2B15*2, MTRR p.I49M, SOD2 p.V16A, CHRNA5 p.D398N, XRCC1 p.Q399R, and CBR3 p.V244M the largest negative differences.
Table 4

Missense damaging variants in pharmacogenes with pharmGkB evidence level of 2.

Evidence 2A
ChrPositionGeneSNPAllelesSynonymousAmino acid changeMAF1 gMAF2 ΔMAF3 DrugEffect on Drug
111856378 MTHFR rs1801133G > ANRA222V0.48540.30850.1769Methotrexate and platinum-based compundsHigher risk of toxicity and decreased response
161479745 FCGR2A rs1801274 A > G NA H166R 0.4810 0.4791 0.0019 Trastuzumab Decreased response4
2234669144 UGT1A1 rs4148323G > A UGT1A1*6 G71R0.03120.02210.0091IrinotecanIncreased risk of neutropenia
5148206440 ADRB2 rs1042713G > ANRG16R0.44530.42080.0245Salbutamol/SalmeterolDecreased response
787160561 ABCB1 rs2032582A > C ABCB1*2 S893A0.47650.5498-0.0733Atazanavir/simvastatin/ondansetron/sunitinibDecreased clearance
818258103 NAT2 rs1799930G > A NAT2*6 R197Q0.13940.2730-0.1336Ethambutol/isoniazid/pyrazinamide/rifampinIncreased risk of hepatoxicity
1096798749 CYP2C8 rs10509681T > C CYP2C8*3 K329R0.05620.0838-0.0276RosiglitazoneIncreased metabolism
1117409572 KCNJ11 rs5219T > CNRK23E0.38990.6477-0.2578Glibenclamide and sulfonamidesDecreased response
67352689 GSTP1 rs1695A > GNRI188V0.46350.34020.1233Cyclophosphamide, epirubicin, uracil-based compounds and platinum-based compoundsDecreased response and increased toxicity
1248272895 VDR rs2228570A > GNRM51T0.49880.6295-0.1307Peginterferon alfa-2b/ribavirinDecreased response
1669745145 NQO1 rs1800566G > ANRP187S0.43560.25210.1835Anthracyclines and uracil-based compoundsWorse outcome
194541207 APOE rs7412C > T APOE-E2 R176C0.04470.0612-0.0165AtorvastatinIncreased response
2219951271 COMT rs4680G > ANRV158M0.39240.4625-0.0701Nicotine/methadone/oxycodone/remifentanil/sufentanil/tramadolIncreased response
Evidence 2B
Chr bp Gene SNP Alleles Synonymous Amino acid change MAF gMAF ΔMAF Drug Effect on Drug
1226019633 EPHX1 rs1051740T > CNRY113H0.40590.32150.0844CarbamazepineHigher metabolism
226026406rs2234922A > GNRH139R0.07350.1870-0.1135CarbamazepineIncreased dose
3124456742 UMPS rs1801019G > CNRG213A0.34420.19460.1496Capecitabine/fluorouracil/leucovorin/tegafurIncreased toxicity
4 2906707 ADD1 rs4961 G > T NR G460W 0.1943 0.2027 -0.0084 Furosemide/spironolactone Decreased response
69536084 UGT2B15 rs1902023A > C UGT2B15*2 Y85D0.33290.5149-0.1820Lorazepam/oxazepamIncreased clearance
57870973 MTRR rs1801394A > GNRI49M0.19760.4678-0.2702MethotrexateGreater toxicicty and increased response
6154360797 OPRM1 rs1799971A > GNRN40D0.21810.18840.0297Naloxone/ethanol/alfentanil/buprenorphine/fentanyl/heroin/morphine/sufentanil/tramadolIncreased cortisol peak and Increased dose (opioids)
160113872 SOD2 rs4880A > GNRV16A0.34300.4823-0.1393CyclophosphamideDecreased survival
1578882925 CHRNA5 rs16969968G > ANRD398N0.11920.2655-0.1463NicotineIncreased risk of dependence
174836381 GP1BA rs6065C > TNRT161M0.15100.09800.0530AspirinDecreased risk of resistance
1944055726 XRCC1 rs25487T > CNRQ399R0.23420.6850-0.4508Platinum coumpoundsDecreased response
45912736 ERCC1 rs3212986C > ANRQ506K0.43900.28640.1526Platinum-based compoundsDecreased risk of nephrotoxicity
203193842 ITPA rs1127354C > ANRP32T0.02400.0750-0.0510Peginterferon alfa-2b/ribavirin/interferon alfa-2bDecreased risk of anemia
2137518706 CBR3 rs1056892G > ANRV244M0.25120.3669-0.1157AnthracyclinesDecreased risk of cardiac damage
2244324727 PNPLA3 rs738409C > GNRI148M0.41740.27770.1397Asparaginase/cyclophosphamide/daunorubicin/prednisolone/vincristineIncreased risk of hepatoxicity

1MAF, Minor allele frequency in the MxGDAR/Encodat. 2gMAF, Global minor allele frequency (MAF) reported by the gnomAD. 3ΔMAF, difference between MxDAR/Encodat individuals and gMAF. 4 In bold are present the non-significant variants (p-value < 0.05).

Missense damaging variants in pharmacogenes with pharmGkB evidence level of 2. 1MAF, Minor allele frequency in the MxGDAR/Encodat. 2gMAF, Global minor allele frequency (MAF) reported by the gnomAD. 3ΔMAF, difference between MxDAR/Encodat individuals and gMAF. 4 In bold are present the non-significant variants (p-value < 0.05).

Analysis of Actionable Pharmacogenetic Synonymous, Benign, and Regulatory Variants in the MxGDAR/Encodat

A total of 427 synonymous, benign, and regulatory variants were identified, only seven of which had been previously been associated with drug response ( ). Of these seven, there was one at evidence level 1A (CYP2C19*2), three at level 2A (ABCG2 p.Q141K, NAT2*7), and three at level 2B (GNB3 p.S274S, GP1BA p.T161M, NEDD4L p.Q8Q). The differences in minor allele frequency between the MxGDAR/Encodat and the gnomAD ranged from -0.1820 to 0.1387. The synonymous variant GNB3 p.S274S had the largest negative difference (dMAF = -0.1820), and the variant ABGCG2 p.Q141K the largest positive difference (dMAF = 0.1387).
Table 5

Synonymous, benign, and regulatory actionable pharmacogenetic variation in the MxGDAR/Encodat.

Evidence 1A
ChrbpGeneSNPAllelesSynonymousAmino acid changeMAF1 gMAF2 ΔMAF3 DrugEffect on Drug
1096541616 CYP2C19 rs4244285G > ACYP2C19*2p.P227P0.10610.2214-0.1153Amitriptiline/Esctialopram/Citalopram/Clomipramine/Sertraline/ClopidrogelPoor Metabolizer (Clinical CPIC Dosing Guideline)
Evidence 2A
489052323 ABCG2 rs2231142C > ANRp.Q141K0.25810.11940.1387RosuvastatinHiger plasma concentration
818258370 NAT2 rs1799931G > ANAT2*7p.G286E0.16470.07730.0874Ethambutol/isoniazid/pyrazinamide/rifampinIncreased risk of hepatoxicity
1631105353 VKORC1 rs17708472G > AVKORC1*4Promoter (ENSR00000085299)0.07980.0937-0.0139WarfarinHigher dose
Evidence 2B
126954875GNB3rs5443C > TNRp.S274S0.63740.8194-0.1820SildenafilReduction in positive erectile response
174836381GP1BArs6065C > TNRp.T161M0.15110.13160.0195AspirinIncreased aspirin resistance
1855816791NEDD4Lrs4149601G > ANRp.Q8Q0.13400.2762-0.1422Diuretics or hydrochlorothiazideDecreased response

1MAF, Minor allele frequency in the MxGDAR/Encodat. 2gMAF, Global minor allele frequency (MAF) reported by the gnomAD. 3ΔMAF, difference between MxDAR/Encodat individuals and gMAF.

Synonymous, benign, and regulatory actionable pharmacogenetic variation in the MxGDAR/Encodat. 1MAF, Minor allele frequency in the MxGDAR/Encodat. 2gMAF, Global minor allele frequency (MAF) reported by the gnomAD. 3ΔMAF, difference between MxDAR/Encodat individuals and gMAF.

Analysis of Actionable Pharmacogenetic Variants in the Admixture Subgroups

An analysis of differentiation of the identified actionable pharmacogenetic variants in the different admixture groups found a total of 105 PGx variants with a low degree of differentiation (FST > 0.01), of which 36 (34.29%) had a regulatory or synonymous function and 69 (65.71%) were missense variants ( ). Of the latter, 20 variants had a FST > 0.03. In Groups 1 and 4, both with regional distribution (one in the Central-East region, and the other in the Southeast; see and ),15 of the 20 variants had lower frequencies, and five higher frequencies.
Figure 3

Regulatory and missense variants by FST value between admixture groups.

Figure 4

Differences in minor allelic frequencies (MAF) of the PGx variants with FST > 0.03: (A) MAF differences of the damaging missense variants between admixture groups; (B) MAF differences of regulatory, benign or synonymous variants between admixture groups.

Regulatory and missense variants by FST value between admixture groups. Differences in minor allelic frequencies (MAF) of the PGx variants with FST > 0.03: (A) MAF differences of the damaging missense variants between admixture groups; (B) MAF differences of regulatory, benign or synonymous variants between admixture groups.

Discussion

The analysis of actionable pharmacogenetic variants in admixed populations has been centered on genes or groups recruited in specific regions (Bonifaz-Peña et al., 2014; Cuautle-Rodríguez et al., 2014; Gonzalez-Covarrubias et al., 2016; Gonzales-Covarrubias et al., 2017). The present study is one of the first to analyze the distribution of known actionable PGx variants in a population-based sample for all states of Mexico, and examining the possible contributions a consideration of admixture patterns could make to pharmacogenetic counseling. The analysis found that 50 actionable PGx variants (96.15%) were different in the MxGDAR/Encodat database than in the gnomAD database for the global population. The following sections will discuss the implications of these differences for the pharmacological parameters of response, toxicity, and dosage requirements, by PharmGKB evidence level.

Evidence Level 1

The XPC p.Q939K variant, with evidence level 1B, reduces toxicity to individuals treated with platinum-based compounds (Caronia et al., 2009; Sakano et al., 2010). It was more than 30% less prevalent in the Mexican population, which could mean increased risk for toxicity. The only variant with evidence level 1 that showed greater MAF in the MxGDAR/Encodat was NUDT15*3 (dMAF = 0.0452), a variant that may increase the risk of leukopenia or neutropenia in those individuals treated with purine compounds (Yang et al., 2014). Differences were also found in CYP2C19*2, which has treatment guidelines that depend on different drug-gene relationships, and which had a lower MAF, which could mean a reduced response to drugs like escitalopram, citalopram, and clopidogrel. The reduction of this allele has been previously reported in a sample from western Mexico (Favela-Mendoza et al., 2015).

Evidence Level 2

The variants with evidence level 2, ABCG2 p.Q141K, MTHFR p.A22V, GSTP1 p.I188V, NQO1 p.P187S, UMPS p.G213A, ERCC1 p.Q506K, and PNPLA3 p.I148M, showed higher allele frequencies in the Mexican population. These variants could affect the treatment of cancer patients. MTHFR p.A22V could have a greater degree of toxicity and decreased response in treatments with methotrexate and platinum-based compounds (Huang et al., 2008; López-Rodríguez et al., 2018), GSTP1 p.I188V could lessen the response to cyclophosphamide, epirubucin, uracil-based, and platinum-based compounds (Oliveira et al., 2010; Zhang et al., 2011), and NQO1 p.P187S could diminish the response to anthracyclines and uracil-based compounds (Fagerholm et al., 2008). PNPLA3 p.I148M could increase hepatotoxicity in individuals treated with asparaginase, ciclophosphamide, daunorubicin, prednisolone, or vincristine (Chambers et al., 2011; Gutierrez-Camino et al., 2017), while ERCC1 p.Q506K could decrease nephrotoxicity in patients treated with platinum-based compounds (Tzvetkov et al., 2011). UMPS p.G213A could increase the toxicity of the cancer drugs capacetabine and fluorouracil (Tsunoda et al., 2011) and of leucovirin, an agent used to reduce the effect of methotrexate treatment. ABCG2 p.Q141K increases the plasma concentration of rosuvastatin, which could reduce its effect in the Mexican population (Tomlinson et al., 2010). The variants NAT2*6, KCNJ11 p.K23E, VDR p.M51T, EPHX1 p.H139R, UGT2B15*2, MTRR p.I49M, SOD2 p.V16A, CHRNA5 p.D398N, XRCC1 p.Q399R, GNB3 p.S274S, NEDD4L p.Q8Q, and CBR3 p.V244M all had reduced MAF in the Mexican population. These variants could potentially affect a variety of drugs, including those for cancer, psychiatric illness, tuberculosis, viruses, and diabetes. The reduction in the frequency of the NAT2*6 allele could reduce hepatotoxicity in treatment with ethambutol, pyramizid, and rifampin (Kim et al., 2009; Ben Mahmoud et al., 2012), and KCNJ11 p.K23E could increase the response to glibenclamide and sulfonamides (Javorsky et al., 2012). The reduced MAF of VDR p.M51T could increase response rates to peginterferon and ribavirin (García-Martín et al., 2013; El-Derany et al., 2016). The reduction of EPHX1 p.H139R could result in lower dose requirements for the psychiatric drug carbamazepine (Kamiya et al., 2005; Hung et al., 2012; Puranik et al., 2013), and that of UGT2B15*2 could reduce the clearance of lorazepam and oxazepam (He et al., 2009; Bhatt et al., 2019). For oncology patients, MTRR p.I49M could reduce the toxicity of metrotrexate (López-Rodríguez et al., 2018), XRCC1 p.Q399R could increase the response to platinum-based compounds (Yin et al., 2012), and SOD2 p.V16A could increase survival in those treated with cyclophosphamide (Glynn et al., 2009). CBR3 p.V244M could increase the risk of cardiac damage from anthracyclines (Blanco et al., 2008; Gonzalez-Covarrubias et al., 2008). Interestingly, the variant CHRNA5 p.D398N could reduce the dependence on nicotine in the Mexican population (Johnson et al., 2010; Gelernter et al., 2015; Hancock et al., 2017), which could help to explain the lower rates of cigarette consumption reported for Latino populations (Pagano et al., 2018). Further studies are needed to explore the effect of this variant on nicotine dependence.

The Pharmacogenetic Counseling Based on these Variants Could Depend on the Admixture Pattern Found in different Geographical Regions of Mexico

The admixture of the Mexican population has been divided into two populations: Native Americans (NA) and Mexican Mestizos (MM) (Wang et al., 2008; Moreno-Estrada et al., 2014). There have been some attempts to identify how variants in the NA population affect pharmacology (Gonzalez-Covarrubias et al., 2016; Romero-Hidalgo et al., 2017; Gonzalez-Covarrubias et al., 2019), but in many analyses the entire MM population is grouped together, with no estimation of how admixture patterns within that group could affect the frequency of actionable pharmacogenetic variants or clinical counseling. In the present study, a global ancestry model-based clustering algorithm identified seven admixture groups, some of which had a particular distribution in specific geographic areas in Mexico. Groups 4 and 5 were found only in the Central-East and Southeast, and Group 1 in the Northwest, North, and Central-West regions. Groups 1 and 4 are the most closely related to the NA population. The mapping of geographically dependent admixture patterns has been analyzed in the African-American and Latino populations in the United States (Bryc et al., 2015). Bryc et al., found that specific admixture groups of these populations were more prevalent in some regions, and they also reported that self-identification (used mainly in epidemiological studies) did not correlate with individuals’ ancestry. The identification of admixture patterns could help to more accurately identify individual ancestry and aid with pharmacogenetic counseling and treatment decisions guided by genomics. The present study found 20 actionable Pgx variants with possible dependence on admixture patterns. The variants MTRR p.I49M and NAT2*6 are less frequent in individuals from Groups 1 and 4 (those with a greater degree of NA ancestry) and could predict a reduction in toxicity in cancer patients treated with methotrexate, reduce the risk of hepatotoxicity in tuberculosis patients, and the risk in its use for treatment of acute lymphoblastic leukemia (Gast et al., 2007; Huang et al., 2008), for which it is one of the main drugs used in Mexico (Medellin-Garibay et al., 2019). Another variant with a substantial difference in MAF between admixture groups was ABCG2 p.Q141K, which affects the response to rosuvastatin (Tomlinson et al., 2010), the most cost-effective option for dyslipidemia treatment in Mexico (Briseño and Mino-León, 2010). Although we were able to identify missense variants in pharmacogenes with possible differential actionable effects in the Mexican population, one limitation of this study is that the MxGDAR/Encodat database does not have information about drug responses or adverse reactions, which limits the conclusions that can be drawn about the specific effects on the Mexican population. Another limitation is our use of a genotyping platform with reduced coverage and a fixed number of variants. Microarray genotyping technology eliminates the possibility of finding new variants in these pharmacogenes, but this could be accomplished with other techniques, such as next-generation technologies (NGS). An analysis of the Mexican population recently performed utilizing NGS found novel variation for some pharmacogenes between NA and MM populations (Gonzalez-Covarrubias et al., 2019). However, that analysis considered the MM population as a single group: an analysis of admixture patterns within the MM population, as performed in the present study, could be applied using NGS to refine the analysis of the Mexican population, and genomic guided some public health decisions in Mexico.

Conclusions

A description of the pharmacogenetic variants that can be actionable in this representative subsample of the Mexican population will help to understand and reduce treatment disparities in persons of admixed genetic background with differing pharmacogenetic variants. Such studies can serve as guides to precision medicine in the Mexican population and in other populations with mixed genetic backgrounds.

Data Availability Statement

The datasets generated for this study can be found in the European Variation Archive (EVA).

Ethics Statement

The studies involving human participants were reviewed and approved by Instituto Nacional de Psiquiatría Ramón de la Fuente Muñiz and Instituto Nacional de Medicina Genómica. Written informed consent to participate in this study was provided by the participants’ legal guardian/next of kin.

Author Contributions

JM-M, AG-M, JV, and HN developed the analyses and wrote the first version of the manuscript. JM-M, RM and JV performed bioinformatics and statistical analyses. JV, RM, CF, MB, ER, and MM-M contributed to data collection. BC, AA, JM-M, and AG-M contributed to the genetic experiments. HN, MM-M and JV conceived, designed, and coordinated the project.

Funding

This study received funding from the Instituto Nacional de Medicina Genómica (Grant No. 23/2015/I), and from the Comisión Nacional de Ciencia y Tecnología (CONACyT) 2016 Fund for the Development of Scientific Projects to Address National Problems (Grant No. PN22296). The development of the surveys was funded by the Comisión Nacional Contra las Adicciones (CONADIC).

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
  4 in total

1.  Neuropsychiatric Genetics of Psychosis in the Mexican Population: A Genome-Wide Association Study Protocol for Schizophrenia, Schizoaffective, and Bipolar Disorder Patients and Controls.

Authors:  Beatriz Camarena; Elizabeth G Atkinson; Mark Baker; Claudia Becerra-Palars; Lori B Chibnik; Raúl Escamilla-Orozco; Joanna Jiménez-Pavón; Zan Koenig; Carla Márquez-Luna; Alicia R Martin; Ingrid Pamela Morales-Cedillo; Ana Maria Olivares; Hiram Ortega-Ortiz; Alejandra Monserrat Rodriguez-Ramírez; Ricardo Saracco-Alvarez; Rebecca E Basaldua; Brena F Sena; Karestan C Koenen
Journal:  Complex Psychiatry       Date:  2021-08-24

2.  Genome-wide association study of psychiatric and substance use comorbidity in Mexican individuals.

Authors:  José Jaime Martínez-Magaña; Alma Delia Genis-Mendoza; Jorge Ameth Villatoro Velázquez; Marycarmen Bustos-Gamiño; Isela Esther Juárez-Rojop; Carlos Alfonso Tovilla-Zarate; Emmanuel Sarmiento; Erasmo Saucedo; Oscar Rodríguez-Mayoral; Clara Fleiz-Bautista; Beatriz Camarena; Alejandro Aguilar; Thelma Beatriz Gonzalez-Castro; María Elena Medina-Mora; Humberto Nicolini
Journal:  Sci Rep       Date:  2021-03-24       Impact factor: 4.379

3.  Hispanic ethnicity and the rs4880 variant in SOD2 are associated with elevated liver enzymes and bilirubin levels in children receiving asparaginase-containing chemotherapy for acute lymphoblastic leukemia.

Authors:  Sharon Wu; Mengxi Wang; Amani Alqahtani; Mimi Lou; Wendy Stock; Deepa Bhojwani; Houda Alachkar
Journal:  Biomed Pharmacother       Date:  2022-04-29       Impact factor: 7.419

4.  Genome-Wide Analysis of Disordered Eating Behavior in the Mexican Population.

Authors:  José Jaime Martínez-Magaña; Sandra Hernandez; Ana Rosa Garcia; Valeria Cardoso-Barajas; Emmanuel Sarmiento; Beatriz Camarena; Alejandro Caballero; Laura Gonzalez; Jorge Ameth Villatoro-Velazquez; Maria Elena Medina-Mora; Marycarmen Bustos-Gamiño; Clara Fleiz-Bautista; Carlos Alfonso Tovilla-Zarate; Isela Esther Juárez-Rojop; Humberto Nicolini; Alma Delia Genis-Mendoza
Journal:  Nutrients       Date:  2022-01-17       Impact factor: 5.717

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.