Literature DB >> 32068798

Genome-Wide Selection Scan in an Arabian Peninsula Population Identifies a TNKS Haplotype Linked to Metabolic Traits and Hypertension.

Muthukrishnan Eaaswarkhanth1, Andre Luiz Campelo Dos Santos2,3, Omer Gokcumen2, Fahd Al-Mulla1, Thangavel Alphonse Thanaraj1.   

Abstract

Despite the extreme and varying environmental conditions prevalent in the Arabian Peninsula, it has experienced several waves of human migrations following the out-of-Africa diaspora. Eventually, the inhabitants of the peninsula region adapted to the hot and dry environment. The adaptation and natural selection that shaped the extant human populations of the Arabian Peninsula region have been scarcely studied. In an attempt to explore natural selection in the region, we analyzed 662,750 variants in 583 Kuwaiti individuals. We searched for regions in the genome that display signatures of positive selection in the Kuwaiti population using an integrative approach in a conservative manner. We highlight a haplotype overlapping TNKS that showed strong signals of positive selection based on the results of the multiple selection tests conducted (integrated Haplotype Score, Cross Population Extended Haplotype Homozygosity, Population Branch Statistics, and log-likelihood ratio scores). Notably, the TNKS haplotype under selection potentially conferred a fitness advantage to the Kuwaiti ancestors for surviving in the harsh environment while posing a major health risk to present-day Kuwaitis.
© The Author(s) 2020. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

Entities:  

Keywords:  zzm321990 TNKSzzm321990 ; Kuwait population; hypertension; metabolic traits; positive selection

Mesh:

Substances:

Year:  2020        PMID: 32068798      PMCID: PMC7093833          DOI: 10.1093/gbe/evaa033

Source DB:  PubMed          Journal:  Genome Biol Evol        ISSN: 1759-6653            Impact factor:   3.416


Introduction

Archaeological evidence suggests that the Arabian Peninsula played a key role during the dispersal of modern humans out-of-Africa (Cabrera et al. 2010; Rose and Petraglia 2010; Petraglia et al. 2019). Anatomically modern humans have inhabited the Arabian Peninsula since immediately after the out-of-Africa migration; therefore, the resident populations have a long and complex evolutionary history (Petraglia and Alsharekh 2003). Being at the crossroads between Africa and Eurasia, the Arabian Peninsula served as a point of interaction of human populations and trade across the region (Groucutt and Petraglia 2012). The resettlement of peoples and traders facilitated population admixture and increased genetic diversity. Several genetic studies offer insights on how the genetic ancestry, consanguinity, and admixture have structured the genetic diversity of the Arab populations. For example, uniparental genetic examinations demonstrated maternal (Abu-Amero et al. 2007; Rowold et al. 2007; Abu-Amero et al. 2008) and paternal (Abu-Amero et al. 2009; Triki-Fendri et al. 2016) genetic affinities and admixture events. Genome-wide characterization studies (Behar et al. 2010; Hunter-Zinck et al. 2010; Alsmadi et al. 2013) have elaborated the genetic structure and diversity within the peninsula as well as across the continents. In addition, a recent whole-exome-based study revealed the heterogeneous genetic structure of the Middle Eastern populations (Scott et al. 2016). The population of the State of Kuwait exemplifies the overall heterogeneity of Middle Eastern populations (Alsmadi et al. 2013, 2014). Kuwait is one of the seven countries located in the Arabian Peninsula on the coastal region of the Arabian Gulf. Also placed at the head of the Persian Gulf, Kuwait is bordered by Saudi Arabia and Iraq to the south and north, respectively. The ancestors of the extant Kuwaiti population were early settlers that migrated from Saudi Arabia (Alghanim 1998; Casey 2007). Until the discovery of oil, they mostly derived their livelihoods from fishing and merchant seafaring (Lienhardt 2001). The movement of populations from settlements in neighboring regions, particularly Saudi Arabia and Persia (Alenizi et al. 2008), in addition to the consequent admixture between populations and consanguinity (Yang et al. 2014), have potentially shaped the genetic diversity of the Kuwaiti population. Our previous studies revealed that the genetic structure of the Kuwait population is heterogeneous, comprising three distinct ancestral genetic backgrounds that could be linked roughly to contemporary Saudi Arabian, Persian, and Bedouin populations (Alsmadi et al. 2013; Alsmadi et al. 2014; Thareja et al. 2015). Paleoanthropological studies have recorded the dramatic environmental transformations and extreme climatic conditions in the Arabian Peninsula over time in addition to the subsequent human dispersal into the region (Groucutt and Petraglia 2012). The extreme and varying environmental conditions could have influenced natural selection and triggered adaptation to the hot and dry desert climates (Rose and Petraglia 2010). Additionally, the ramifications of adaptive trends reported for continental populations (e.g., lactose tolerance, skin color, resistance to blood pathogens, etc.) may have implications for the health of Arabian populations. Indeed, genome-wide selection scans have revealed positive selection for lactose tolerance, as well as skin and eye color similar to those in Europeans and malaria resistance as that in Africans (Fernandes et al. 2019). Such studies provide a robust framework for investigating the potential adaptive trends in specific populations in the Arabian Peninsula. However, such focused studies are scarce (Yang et al. 2014; Fernandes et al. 2019). For example, an earlier small-scale exploration of ancestry components suggested that genetic regions associated with olfactory pathways were under natural selection in Kuwaiti populations (Yang et al. 2014). However, the study was limited to a sample of <50 individuals. In the present study, we build on the findings of previous studies by increasing the sample size considerably in addition to applying multiple approaches that have become available recently to identify selection in a genome-wide manner in Kuwaiti populations (fig. 1).
. 1.

—Geographic location of the genotyped individuals and materials and methods applied. (A) All the genotyped individuals were residents of Kuwait, in the Arabian Peninsula. (B) Schematic diagram of the methods, techniques, and filtering actions applied to the genome-wide genotype data. (C) The thresholds applied to all the selection data to identify 385 single nucleotide polymorphisms specifically under positive selection in the Kuwait population.

—Geographic location of the genotyped individuals and materials and methods applied. (A) All the genotyped individuals were residents of Kuwait, in the Arabian Peninsula. (B) Schematic diagram of the methods, techniques, and filtering actions applied to the genome-wide genotype data. (C) The thresholds applied to all the selection data to identify 385 single nucleotide polymorphisms specifically under positive selection in the Kuwait population.

Materials and Methods

Study Samples

A total of 583 Kuwaiti individuals included in this study were selected from a larger cohort used in our previous studies (Alsmadi et al. 2013; Hebbar et al. 2017, 2018). All participants were recruited after obtaining written informed consent under protocols approved by the International Scientific Advisory Board and the Ethical Review Committee at Dasman Diabetes Institute, Kuwait. The participant recruitment, sample collection, and related procedures were conducted in accordance with the Declaration of Helsinki and detailed elsewhere (Alsmadi et al. 2013; Hebbar et al. 2017, 2018). To ensure that the study individuals are unrelated, we examined relatedness among them using PLINK (threshold PI_HAT > 0.125, i.e., up to third degree relatives) and randomly removed one sample per pair of related individuals. For replication analysis we used Saudi Arabia genotype data from a published study (Fernandes et al. 2019).

Genotyping and Quality Control

We genotyped 583 healthy unrelated Kuwaiti individuals residing in the State of Kuwait using Illumina HumanOmniExpress arrays for 730,525 single nucleotide polymorphisms (SNPs). The quality control (QC) checks and data filtering were executed in PLINK (Chang et al. 2015). The data set was filtered through standard QC filtering to include only autosomal SNPs, specifically those that had a genotyping success rate >90% and passed the Hardy–Weinberg Exact test with a P-value >0.001. In addition, we eliminated strand-based ambiguous A/T and G/C SNPs. After the above filtering steps, ∼662,750 SNPs remained to perform “within population” selection tests, integrated Haplotype Score (iHS) and log-likelihood ratio scores (LLRS) (fig. 1). We merged Kuwait data set with 1000 Genomes Project phase 3 data set (Auton et al. 2015) yielding a combined data set of 661,231 SNPs to conduct “among population” selection tests, Cross Population Extended Haplotype Homozygosity (XP-EHH) and Population Branch Statistics (PBS) (fig. 1). All the four selection test results were combined, keeping only the overlapping SNPs resulting in a total of 556,188 SNPs for further examination (fig. 1). To confirm whether genotyping and QC filtering criteria was adequate and that population relationships were as expected, we conducted principal component analysis (PCA) on the pruned Kuwaiti population data set combined with 1000 Genomes Project phase 3 data set (Auton et al. 2015), using smartpca program in the EIGENSOFT package version 6.1.4 (Patterson et al. 2006; Price et al. 2006). We pruned the data set by removing one SNP of any pair in strong pairwise linkage disequilibrium (LD) r2 > 0.4 within a window of 200 SNPs (sliding the window by 25 SNPs at a time) using “indep-pairwise” option in PLINK. The SNPs were phased using Beagle (Browning and Browning 2007).

Selection Analysis

Collectively, four different selection tests that measure deviations from expected linkage disequilibrium/homozygosity (iHS, XP-EHH) and population differentiation (PBS, LLRS) distributions across the genome were performed. Each of the tests has different statistical power to detect signatures of slightly different types of selection (e.g., complete vs. incomplete sweeps) and are sensitive to the time of selection (Cadzow et al. 2014). Therefore, the integration of multiple approaches facilitated the comprehensive interrogation of the genome for selection signatures and offered a list of candidate regions that could have been evolving under nonneutral evolutionary forces. We implemented iHS (Voight et al. 2006) and XP-EHH (Sabeti et al. 2007) tests available in selscan v1.2.0a (Szpiech and Hernandez 2014). The XP-EHH test was conducted by comparing the Kuwaiti (KWT) population with the Utah Residents with Northern and Western European Ancestry (CEU); Han Chinese from Beijing, China (CHB); and Yoruba from Ibadan, Nigeria (YRI), available from the 1000 Genomes Project phase 3 data set (Auton et al. 2015). We calculated PBS scores as described elsewhere (Yi et al. 2010). First, we computed SNP-wise FST values for all pairwise comparisons between populations—KWT, CEU, CHB, and YRI. The FST values were then used to calculate PBS scores in three ways: 1) between KWT and CEU using YRI as an outgroup; 2) between KWT and CHB using YRI as an outgroup, and 3) between KWT and YRI using CHB as an outgroup. The PBS scores were expected to pinpoint loci under selection exclusively in the Kuwaiti population. Finally, we computed LLRS scores for positive selection using Ohana (Cheng et al. 2019). This is a recently published selection detection framework that identifies signals of positive selection through population differentiation independent of self-reported ancestry or admixture correction to group individuals into populations.

Functional and PheWAS Trait Analysis

Gene functions were determined using the UCSC Genome Browser (http://genome.ucsc.edu/; last accessed May 2019) and literature searches. We performed Gene Ontology (GO) enrichment analysis using an online tool (http://geneontology.org/; last accessed May 2019) integrated with the PANTHER classification system (Mi et al. 2019). The association of the putatively selected SNPs with any phenotypic traits was detected using the PheWAS tool of the GeneATLAS database (Canela-Xandri et al. 2018). The gene expression data of the selected SNPs were obtained from the GTEx portal (https://gtexportal.org/home/; last accessed November 2019). The LD pattern of the positively selected seven SNPs was plotted using Haploview version 4.2 (Barrett et al. 2005). The haplotype network was created using PopART (Leigh and Bryant 2015).

Code Availability

All the codes related to analyses and generation of figures are available in this link.

Results

Principal component analysis

We performed PCA of Kuwaiti populations combined with the global populations from 1000 Genomes Project phase 3 data set to confirm the adequacy of genotype and QC filter metrics applied and to examine population relationships. The results were consistent with the expectations. The PCA scatter plot representing the first two principal components (supplementary fig. S1, Supplementary Material online) shows distinct clustering of Kuwaitis, Europeans, Africans, East Asians, and South Asians revealing within and among population relationships. Consistent with our previous studies (Alsmadi et al. 2013; John et al. 2018), the placement of Kuwaiti subgroups in the proximity of their putative ancestral populations (KuwaitiB next to African cluster; KuwaitiP and KuwaitiS subgroups nearer to South Asian and European clusters) was observed.

Integration of All Selection Tests Scores: 385 Common SNPs under Selection

We interrogated 556,188 SNPs with all the four selection test results (fig. 1, supplementary table S1, Supplementary Material online). According to previously published guidelines (Cardona et al. 2014), we subsequently identified 385 SNPs, which had XP-EHH, PBS, and LLRS values > 0, in addition to iHS values > 2 (fig. 1, supplementary table S2, Supplementary Material online). We surmised that the filtering would detect specific variants that are more likely to be true positive candidates for classical positive selection in the Kuwaiti population. As expected, the allele frequencies of the alternate alleles for the 385 SNPs were high in the Kuwaiti population in comparison with the allele frequencies for the same SNPs in the other global populations (supplementary fig. S2 and table S2, Supplementary Material online).

Window-Based Screening and Genomic Regions under Positive Selection

We then identified the genomic locations of the 385 SNPs (fig. 2). Specifically, we speculated that if indeed selective sweeps in the Kuwaiti population explain the deviations from neutral expectations in the 385 SNPs, then they are expected to cluster within a smaller number of haplotypes. Therefore, we checked for clustering of the SNPs across the human reference genome in 100-kb windows. We identified ∼220 windows of 100 kb in length with at least one SNP under putative selection. In addition, we observed multiple instances where adjacent 100-kb windows harbored putatively selected SNPs, which we subsequently merged into single regions (supplementary table S3, Supplementary Material online).
. 2.

—Regions in the autosomal DNA under positive selection in the Kuwait population. All the 385 SNPs under putative positive selection in the Kuwait population were assigned to 100-kb windows in the autosomal DNA. Each colored bar is a 25-Mb segment in the chromosome representing at least one 100-kb window, and their colors reflect the number of SNPs under positive selection.

—Regions in the autosomal DNA under positive selection in the Kuwait population. All the 385 SNPs under putative positive selection in the Kuwait population were assigned to 100-kb windows in the autosomal DNA. Each colored bar is a 25-Mb segment in the chromosome representing at least one 100-kb window, and their colors reflect the number of SNPs under positive selection. To understand the functional implications of the putatively selected regions, we identified all the genes (n = 379) inside the 220 100-kb windows and then performed a GO enrichment analysis using all the GO annotations for Homo sapiens as the comparison background. After Bonferroni correction (P < 0.05), the analysis returned results associated only with a single group of biological processes: Glycosaminoglycan biosynthetic process (aminoglycan biosynthetic process, aminoglycan metabolic process, glycosaminoglycan metabolic process) (supplementary table S4, Supplementary Material online). There were no significant results for molecular functions and cellular components.

TNKS Haplotype Related to Obesity, Hypertension, and Asthma

To investigate further the functional impact of putatively selected haplotypes, we conducted a more thorough manual investigation of the genomic regions containing five or more SNPs identified by our selection scan. Specifically, we checked if any variant that was in LD with the haplotypes had been reported in previous studies or in the UK Biobank (Sudlow et al. 2015; Bycroft et al. 2018) (supplementary table S5, Supplementary Material online). Based on the manual curation, we highlighted a ∼400-kb haplotype on chromosome 8 (chr8: 9.3–9.7 Mb), which harbors seven putatively selected SNPs (fig. 3). The selection scores for all the SNPs around this haplotype region are presented in the line plots (fig. 3). To reveal the haplotype structure of the locus, we first investigated whether the seven putatively selected SNPs were in LD with each other (fig. 4). Indeed, a single haplotype carried the alternate alleles for all the seven SNPs (1111111) (fig. 4, supplementary fig. S3, Supplementary Material online). We further observed that the haplotype exhibited the highest allele frequency in the Kuwaiti population when compared with other global populations (supplementary fig. S4, Supplementary Material online). In addition, within the Kuwaiti population, the subgroup of individuals with putative Saudi ancestry (Alsmadi et al. 2013) seemed to be driving up the frequency of the haplotype 1111111 (supplementary fig. S4, Supplementary Material online).
. 3.

—Putatively selected TNKS haplotype: chr8: 9.3–9.7 Mb. The values of all the selection scores for all the SNPs around the region are presented in the line plots. All the selection scores for the seven SNPs under positive selection in the region are indicated by gray bars, whereas their positions relative to TNKS are indicated by red small bars in the model at the bottom. The LD pattern for these seven SNPs is presented in figure 4.

. 4.

—Haplotype architecture around TNKS gene and associated phenotypic traits and gene expression level changes. (A) Haplotype clustermap for the Kuwait population in the chr8: 9.3–9.7 Mb region. The haplotype composed of the alternate alleles (black) of the seven SNPs under positive selection in the region is the most common even within putative subgroups, KWS—Kuwaitis with putative Saudi ancestry (green); KWP—Kuwaiti with putative Persian ancestry (red); B—Kuwaitis with putative Bedouin ancestry (gray). Each row represents one of the 583 genotyped individuals. (B) Linkage disequilibrium (LD) pattern of the positively selected seven SNPs. The upper panel shows the location of seven SNPs in TNKS and the lower panel presents the output of Haploview. Numbers in each square present the r2 value of a pairwise LD relationship between any two SNPs. (C) GeneATLAS PheWAS results for SNP rs13276086. In total, 17 phenotypic traits revealed an association above the threshold (–log10[P-value] > 8, red dashed line). (D) Violin plots of TNKS allele-specific eQTLs according to individual SNP genotypes in the spinal cord and the tibial nerve, in addition to the transformed fibroblasts, from the Genotype-Tissue Expression (GTEx) release V8 database. The teal region indicates the density distribution of the samples in each genotype. The white line in the box plot (black) shows the median value of the expression of each genotype.

—Putatively selected TNKS haplotype: chr8: 9.3–9.7 Mb. The values of all the selection scores for all the SNPs around the region are presented in the line plots. All the selection scores for the seven SNPs under positive selection in the region are indicated by gray bars, whereas their positions relative to TNKS are indicated by red small bars in the model at the bottom. The LD pattern for these seven SNPs is presented in figure 4. —Haplotype architecture around TNKS gene and associated phenotypic traits and gene expression level changes. (A) Haplotype clustermap for the Kuwait population in the chr8: 9.3–9.7 Mb region. The haplotype composed of the alternate alleles (black) of the seven SNPs under positive selection in the region is the most common even within putative subgroups, KWS—Kuwaitis with putative Saudi ancestry (green); KWP—Kuwaiti with putative Persian ancestry (red); B—Kuwaitis with putative Bedouin ancestry (gray). Each row represents one of the 583 genotyped individuals. (B) Linkage disequilibrium (LD) pattern of the positively selected seven SNPs. The upper panel shows the location of seven SNPs in TNKS and the lower panel presents the output of Haploview. Numbers in each square present the r2 value of a pairwise LD relationship between any two SNPs. (C) GeneATLAS PheWAS results for SNP rs13276086. In total, 17 phenotypic traits revealed an association above the threshold (–log10[P-value] > 8, red dashed line). (D) Violin plots of TNKS allele-specific eQTLs according to individual SNP genotypes in the spinal cord and the tibial nerve, in addition to the transformed fibroblasts, from the Genotype-Tissue Expression (GTEx) release V8 database. The teal region indicates the density distribution of the samples in each genotype. The white line in the box plot (black) shows the median value of the expression of each genotype. Subsequently, we investigated the potential functional impact of the haplotype. The overall 400-kb haplotype block encompasses a single gene, the poly-ADP-ribosyltransferase 1 (TNKS). Three out of the seven SNPs that revealed selection signatures were located upstream of the TNKS whereas the other four were within the introns of the gene (figs. 3 and 4). The variation in the gene has been associated with adiposity (Lindgren et al. 2009), type 2 diabetes (Xue et al. 2018) and asthma (Ober et al. 2000). Indeed, when we searched the UK Biobank database, we found that the specific SNPs within the putatively selected haplotype that we identified using our approach were associated with metabolic disorders; for example, body mass index and limb fat mass were associated with adiposity, and eosinophil percentage was associated with asthma (George 2005) (fig. 4, table 1, supplementary table S6, Supplementary Material online). Therefore, both studies published previously, and a reanalysis of UK Biobank data set revealed that the specific TNKS haplotype is potentially associated with both metabolic traits and asthma.
Table 1

Summary Table of the Seven SNPs within the Putatively Selected Haplotype and PheWAS Traits

CHRPOSaSNPREFALTAlternate Allele Frequency
PheWAS Traitb
KWTSAUAFRAMREASEURSASANHc
89394053rs6987057TG0.830.910.690.650.600.840.73Platelet distribution width, body mass index (BMI), red blood cell (erythrocyte) distribution width
89411808rs9644677GA0.790.870.570.790.580.750.660.62Hypertension
89412153rs9329203GA0.810.900.630.800.570.770.67Hypertension, platelet distribution width, standing height
89540693rs13273033GA0.830.920.720.820.670.780.700.69Hypertension, platelet distribution width, standing height
89555092rs4841196AG0.740.830.610.570.380.720.57Hypertension, platelet distribution width, body mass index (BMI), red blood cell (erythrocyte) distribution width, arm fat percentage (left), arm fat percentage (right), impedance of leg (left), impedance of leg (right)
89578982rs13276086TG0.720.830.550.540.460.670.57Hypertension, platelet distribution width, standing height, body mass index (BMI), red blood cell (erythrocyte) distribution width, arm fat percentage (left), arm fat percentage (right), impedance of leg (left), impedance of leg (right), arm fat mass (left), arm fat mass (right), cheese intake, eosinophil percentage, impedance of arm (right), impedance of whole body, leg fat mass (left), leg fat mass (right)
89601699rs12545912GT0.800.900.650.840.720.770.690.62Hypertension, platelet distribution width, standing height

Note.—KWT, Kuwaiti; SAU, Saudi Arabian; AFR, African; AMR, Admixed American; EAS, East Asian; EUR, European; SAS, South Asian; ANH, Ancient Humans.

The SNPs positions are based on the human reference genome build GRCh37/hg19.

Threshold: –log10 (P-value) > 8, PheWAS analysis provided through GeneATLAS; individual P-values are presented in supplementary table S6, Supplementary Material online.

ANH data from https://reich.hms.harvard.edu/downloadable-genotypes-present-day-and-ancient-dna-data-compiled-published-papers (last accessed November 2019).

Summary Table of the Seven SNPs within the Putatively Selected Haplotype and PheWAS Traits Note.—KWT, Kuwaiti; SAU, Saudi Arabian; AFR, African; AMR, Admixed American; EAS, East Asian; EUR, European; SAS, South Asian; ANH, Ancient Humans. The SNPs positions are based on the human reference genome build GRCh37/hg19. Threshold: –log10 (P-value) > 8, PheWAS analysis provided through GeneATLAS; individual P-values are presented in supplementary table S6, Supplementary Material online. ANH data from https://reich.hms.harvard.edu/downloadable-genotypes-present-day-and-ancient-dna-data-compiled-published-papers (last accessed November 2019). In addition to verifying previous associations, we observed that six out of seven putatively selected SNPs in the region were associated significantly with hypertension, as inferred from the UK Biobank’s GeneATLAS database (Canela-Xandri et al. 2018). To the best of our knowledge, this is the first report linking the region to hypertension, and considering hypertension is associated with high levels of adiposity (obesity) (Beevers et al. 2001), it further implicates the variation in TNKS in the pleiotropic effects on metabolism at the organismal level. In addition, expression data of TNKS revealed that depending on which of the seven SNPs of the haplotype a given individual carries, it is upregulated mainly in the spinal cord and tibial nerve, besides the transformed fibroblasts (fig. 4). According to existing data based on expression quantitative trait loci, TNKS is expressed globally and mainly in brain-related tissues. The spinal cord along with the brain structures and peripheral nervous system keep up the sympathetic nervous system activity, which is involved in metabolic disorders associated with hypertension (Tanaka and Itoh 2019). Accordingly, the upregulation of TNKS in spinal cord or tibial nerve is crucial in contributing to the metabolic processes. It is critical to note here that most association studies that link genetic to phenotypic variation have been conducted in people of European descent; therefore, they have a limited capacity to capture population-specific associations that are currently widespread comprehensively. However, considering the derived haplotype is extremely common in multiple populations, a more general impact could be deduced from already available databases.

Selection in Saudi Arabia: Overlap between Positively Selected Regions Linked to Metabolic Traits

In an effort to detect signals of positive selection in another population from the Arabian Peninsula applying our integrated conservative approach, we screened the same set of variants (662,750) genotyped from 96 Saudi Arabian individuals available from a previous study (Fernandes et al. 2019). We observed differential pattern of selection signals in Saudi Arabians (supplementary table S7, Supplementary Material online). We found that different regions of the genome showed signatures of selection between Kuwaiti and Saudi Arabian populations. However, of the top 14 genomic regions under selection in Saudi Arabians, four with few common SNPs overlap with Kuwaitis. It is interesting to note that the panel of gene regions that are positively selected in Saudi Arabians also have been associated with metabolic traits (Kathiresan et al. 2007; Paterson et al. 2010) and asthma (Ober et al. 2000). For example, adiponectin levels that are correlated with obesity risk (Qi et al. 2011) and type 2 diabetes (Meigs et al. 2007). This replication analysis indicates that the hypertension related observation is specific to Kuwaiti population.

Discussion

We identified gene regions that presented strong signals of positive selection in Kuwaiti populations using an integrative approach. Our decision to focus only on putatively selected SNPs that were identified based on multiple tests of selection made our approach highly conservative and, therefore, not prone to a high false-positive rate. For example, some of the top candidates identified using PBS (not exhibiting adequately high signals in other selection tests) cluster into a haplotype block that included LCT. The result was not surprising because lactose tolerance is one of the most extensively studied adaptive traits and it has been published elsewhere that LCT haplotypes were positively selected in the Middle Eastern populations (Enattah et al. 2008; Bayoumi et al. 2016; Liebert et al. 2017). With our approach, we identified regions that have similar functional relevance (e.g., metabolism) in Kuwaiti and Saudi Arabian populations. However, these regions do not overlap between these two populations indicating that the putative selective pressures in the region may have different ramifications in the genomes of these populations. The absence of overlapping selection signals can be attributed to the different outcomes of selection tests and an heterogeneous population structure. The extant Arabian Peninsula populations exemplify interregional genetic heterogeneity, which is evident in Saudi Arabian and Kuwaiti populations (Hunter-Zinck et al. 2010; Alsmadi et al. 2013; Tadmouri et al. 2014; Scott et al. 2016; Hajjej et al. 2018; Khubrani et al. 2018). Thus, it may not be surprising that not all four selection tests return similar results in Kuwaiti and Saudi Arabian populations. For example, using the same conservative thresholds for multiple selection tests, we were not able to replicate the selection signal for the TNKS haplotype in the Saudi Arabian population. Specifically, out of the seven SNPs for which we documented above-threshold signals for four different selection tests in the Kuwaiti population, four had lower iHS values than our threshold, and none of these SNPs showed increased LLRS values in the Saudi Arabian population. This result does not mean that the haplotype frequency and haplotype structure of this locus can be explained by neutrality alone in the Saudi Arabian population. On the contrary, all seven SNPs that we highlighted in the Kuwaiti population also showed above-threshold PBS and XP-EHH values in the Saudi Arabian population (supplementary table S8, Supplementary Material online). Moreover, the putatively selected TNKS haplotype (with all SNPs have the alternative allele) has a higher allele frequency in the Saudi Arabian population (78%) than in the Kuwaiti population (68%). In fact, it is the frequency of the Kuwaiti population subgroup with Saudi Arabian ancestry (76%) that has raised the overall occurrence of this all SNPs derived allele haplotype (1111111) in Kuwaitis (supplementary fig. S4, Supplementary Material online). Thus, there is reason to believe that this haplotype is also selected in the Saudi Arabian population. The iHS test may have reduced power because the allele is at or close to fixation (Voight et al. 2006), whereas the LLRS may have reduced power due to a different population structure in Saudi Arabia (Cheng et al. 2019). Therefore, the lack of overlap between different populations should be considered as an outcome of multiple selection tests with different sensitivities. Overall, our study establishes a robust framework for the generation of additional adaptive hypotheses for Arabian Peninsula populations. In the present study, we chose to highlight one 400-kb haplotype that was detected to be under positive selection based on multiple tests. The haplotype encompasses TNKS, and we verified its association with metabolic traits (table 1, supplementary table S6, Supplementary Material online). Indeed, we observed a considerable prevalence of the seven TNKS haplotype SNPs in modern continental populations and three SNPs in ancient humans (table 1). In addition, we found that the haplotype is associated with hypertension and an increase in TNKS expression in the nervous system. An earlier study identified a TNKS intronic variant, rs6994574_G > A with signals of recent positive selection in East Asian (CHB+JPT) and African (YRI) populations from HapMap Phase 2 database based on long-range haplotype (LRH) and iHS tests (supplementary table 9 in International HapMap Consortium et al. 2007). This SNP information is not available in the Kuwait and Saudi Arabia data sets, as it was not included in the Illumina HumanOmniExpress arrays used for genotyping in the current study. However, rs6994574 is in strong LD with one of the seven positively selected SNPs rs13276086 in East Asians (r2 = 1) and Africans (r2 = 0.8). It is noteworthy that rs13276086 has been associated with 17 phenotypic traits according to GeneATLAS PheWAS database (fig. 4, supplementary table S6, Supplementary Material online). Interestingly, the alternate alleles of both the SNPs, rs6994574_A and rs13276086_G have been significantly related to hypertension with an odds ratio of 0.97 (http://geneatlas.roslin.ed.ac.uk/phewas/; last accessed January 2020; Canela-Xandri et al. 2018). Another study examining telomere biology genes showed evidence of balancing selection in the TNKS region, consisting of 52 SNPs, by evaluating population differentiation (FST), genetic diversity, allele frequency, and LD in global populations from HGDP-CEPH, HapMap Phases 2, and 3 databases (Mirabello et al. 2012). Notably, those 52 TNKS SNPs include three of the seven SNPs under putative selection that we report in the present study (rs9644677, rs13273033, rs12545912). Yet another Tajima’s D statistic-based investigation in individuals of Pacific Rim ancestry showed a balancing selection of 48 SNPs in the TNKS region (Savage et al. 2005). The harsh desert climate of the Kuwait region could have driven the selection (Weder 2007; Young 2007). For example, a recent study speculated that natural selection for insulin resistance and the associated hypertension, in addition to increased activity in the sympathetic nervous system could have been beneficial in hunter-gatherer populations conferring a hemodynamic advantage (Lewis et al. 2019). The TNKS is indeed very likely to be pleiotropic. It is expressed widely across multiple organs, with especially abundant expression in the adult nervous system (Fagerberg et al. 2014). It is a poly-ADP-ribosyltransferase enzyme and its molecular function and protein partners in the cell are well understood (Kim 2018). Its activity is linked to major cellular processes including Wnt signaling pathway (Huang et al. 2009) and vesicle trafficking (Chi and Lodish 2000), as well as telomere length (Cook et al. 2002). However, little is known about the impact of haplotypic variation to TNKS function, and consequently the cellular and organismal phenotypes. There are multiple previous association study results that connect genetic variation within this locus with various phenotypes at a genome-wide significant P-values of <1.0E–08 (table 1, supplementary table S6, Supplementary Material online). In addition, the GWAS Catalog (https://www.ebi.ac.uk/gwas/genes/TNKS; last accessed November 2019) lists several TNKS variants associated with multiple disorders and traits, such as multiple myeloma, neuroticism, bipolar disorder, schizophrenia, blood pressure, obesity, and bone mineral density. Overall, it is plausible that the putatively adaptive haplotype has medically negative consequences that may fit a “thrifty” (Ayub et al. 2014) or “drifty” (Speakman 2008) gene scenario. However, additional work is needed to properly test this hypothesis. The pleiotropic effects of the TNKS haplotypes are consistent with the speculative phenotypes described in the present study. Notably, other gene regions under potential selection have also been associated with obesity and hypertension (supplementary table S5, Supplementary Material online). Therefore, it is plausible that the TNKS haplotype exemplifies a general trend in which a more rapid metabolism rate and hypertension have been selected in the Kuwaiti population, which increased the allele frequency of multiple haplotypes and conferred some degree of fitness advantage to ancestors of present-day Kuwaiti populations in the extremely dry and hot ecological environments. In modern Kuwait, however, the effect of the TNKS haplotype is potentially detrimental. Indeed, hypertension and obesity are prevalent in the Kuwaiti population, affecting a staggering 25.3% and 48.2% of the population, respectively (Al-Sejari 2018). The World Health Organization has estimated that the mortality rate in Kuwait due to noncommunicable diseases is ∼72% (https://www.who.int/nmh/publications/ncd-profiles-2018/en/; last accessed June 2019), which is alarmingly high. Such mortality and morbidity levels could be attributed largely to the drastic changes in the lifestyles and behaviors associated with westernization following oil discovery. Nevertheless, our results suggest that past adaptive trends have further predisposed Kuwaiti populations to the illnesses above at the genetic level. Overall, the mechanisms through which the TNKS haplotype conferred a fitness advantage and how the same haplotype predisposes the population to metabolic diseases remain fascinating areas that could be explored in future research. Click here for additional data file.
  63 in total

1.  Tankyrase is a golgi-associated mitogen-activated protein kinase substrate that interacts with IRAP in GLUT4 vesicles.

Authors:  N W Chi; H F Lodish
Journal:  J Biol Chem       Date:  2000-12-08       Impact factor: 5.157

2.  Thrifty genes for obesity, an attractive but flawed idea, and an alternative perspective: the 'drifty gene' hypothesis.

Authors:  J R Speakman
Journal:  Int J Obes (Lond)       Date:  2008-10-14       Impact factor: 5.095

3.  Tankyrase inhibition stabilizes axin and antagonizes Wnt signalling.

Authors:  Shih-Min A Huang; Yuji M Mishina; Shanming Liu; Atwood Cheung; Frank Stegmeier; Gregory A Michaud; Olga Charlat; Elizabeth Wiellette; Yue Zhang; Stephanie Wiessner; Marc Hild; Xiaoying Shi; Christopher J Wilson; Craig Mickanin; Vic Myer; Aleem Fazal; Ronald Tomlinson; Fabrizio Serluca; Wenlin Shao; Hong Cheng; Michael Shultz; Christina Rau; Markus Schirle; Judith Schlegl; Sonja Ghidelli; Stephen Fawell; Chris Lu; Daniel Curtis; Marc W Kirschner; Christoph Lengauer; Peter M Finan; John A Tallarico; Tewis Bouwmeester; Jeffery A Porter; Andreas Bauer; Feng Cong
Journal:  Nature       Date:  2009-09-16       Impact factor: 49.962

4.  Genome-wide detection and characterization of positive selection in human populations.

Authors:  Pardis C Sabeti; Patrick Varilly; Ben Fry; Jason Lohmueller; Elizabeth Hostetter; Chris Cotsapas; Xiaohui Xie; Elizabeth H Byrne; Steven A McCarroll; Rachelle Gaudet; Stephen F Schaffner; Eric S Lander; Kelly A Frazer; Dennis G Ballinger; David R Cox; David A Hinds; Laura L Stuve; Richard A Gibbs; John W Belmont; Andrew Boudreau; Paul Hardenbol; Suzanne M Leal; Shiran Pasternak; David A Wheeler; Thomas D Willis; Fuli Yu; Huanming Yang; Changqing Zeng; Yang Gao; Haoran Hu; Weitao Hu; Chaohua Li; Wei Lin; Siqi Liu; Hao Pan; Xiaoli Tang; Jian Wang; Wei Wang; Jun Yu; Bo Zhang; Qingrun Zhang; Hongbin Zhao; Hui Zhao; Jun Zhou; Stacey B Gabriel; Rachel Barry; Brendan Blumenstiel; Amy Camargo; Matthew Defelice; Maura Faggart; Mary Goyette; Supriya Gupta; Jamie Moore; Huy Nguyen; Robert C Onofrio; Melissa Parkin; Jessica Roy; Erich Stahl; Ellen Winchester; Liuda Ziaugra; David Altshuler; Yan Shen; Zhijian Yao; Wei Huang; Xun Chu; Yungang He; Li Jin; Yangfan Liu; Yayun Shen; Weiwei Sun; Haifeng Wang; Yi Wang; Ying Wang; Xiaoyan Xiong; Liang Xu; Mary M Y Waye; Stephen K W Tsui; Hong Xue; J Tze-Fei Wong; Luana M Galver; Jian-Bing Fan; Kevin Gunderson; Sarah S Murray; Arnold R Oliphant; Mark S Chee; Alexandre Montpetit; Fanny Chagnon; Vincent Ferretti; Martin Leboeuf; Jean-François Olivier; Michael S Phillips; Stéphanie Roumy; Clémentine Sallée; Andrei Verner; Thomas J Hudson; Pui-Yan Kwok; Dongmei Cai; Daniel C Koboldt; Raymond D Miller; Ludmila Pawlikowska; Patricia Taillon-Miller; Ming Xiao; Lap-Chee Tsui; William Mak; You Qiang Song; Paul K H Tam; Yusuke Nakamura; Takahisa Kawaguchi; Takuya Kitamoto; Takashi Morizono; Atsushi Nagashima; Yozo Ohnishi; Akihiro Sekine; Toshihiro Tanaka; Tatsuhiko Tsunoda; Panos Deloukas; Christine P Bird; Marcos Delgado; Emmanouil T Dermitzakis; Rhian Gwilliam; Sarah Hunt; Jonathan Morrison; Don Powell; Barbara E Stranger; Pamela Whittaker; David R Bentley; Mark J Daly; Paul I W de Bakker; Jeff Barrett; Yves R Chretien; Julian Maller; Steve McCarroll; Nick Patterson; Itsik Pe'er; Alkes Price; Shaun Purcell; Daniel J Richter; Pardis Sabeti; Richa Saxena; Stephen F Schaffner; Pak C Sham; Patrick Varilly; David Altshuler; Lincoln D Stein; Lalitha Krishnan; Albert Vernon Smith; Marcela K Tello-Ruiz; Gudmundur A Thorisson; Aravinda Chakravarti; Peter E Chen; David J Cutler; Carl S Kashuk; Shin Lin; Gonçalo R Abecasis; Weihua Guan; Yun Li; Heather M Munro; Zhaohui Steve Qin; Daryl J Thomas; Gilean McVean; Adam Auton; Leonardo Bottolo; Niall Cardin; Susana Eyheramendy; Colin Freeman; Jonathan Marchini; Simon Myers; Chris Spencer; Matthew Stephens; Peter Donnelly; Lon R Cardon; Geraldine Clarke; David M Evans; Andrew P Morris; Bruce S Weir; Tatsuhiko Tsunoda; Todd A Johnson; James C Mullikin; Stephen T Sherry; Michael Feolo; Andrew Skol; Houcan Zhang; Changqing Zeng; Hui Zhao; Ichiro Matsuda; Yoshimitsu Fukushima; Darryl R Macer; Eiko Suda; Charles N Rotimi; Clement A Adebamowo; Ike Ajayi; Toyin Aniagwu; Patricia A Marshall; Chibuzor Nkwodimmah; Charmaine D M Royal; Mark F Leppert; Missy Dixon; Andy Peiffer; Renzong Qiu; Alastair Kent; Kazuto Kato; Norio Niikawa; Isaac F Adewole; Bartha M Knoppers; Morris W Foster; Ellen Wright Clayton; Jessica Watkin; Richard A Gibbs; John W Belmont; Donna Muzny; Lynne Nazareth; Erica Sodergren; George M Weinstock; David A Wheeler; Imtaz Yakub; Stacey B Gabriel; Robert C Onofrio; Daniel J Richter; Liuda Ziaugra; Bruce W Birren; Mark J Daly; David Altshuler; Richard K Wilson; Lucinda L Fulton; Jane Rogers; John Burton; Nigel P Carter; Christopher M Clee; Mark Griffiths; Matthew C Jones; Kirsten McLay; Robert W Plumb; Mark T Ross; Sarah K Sims; David L Willey; Zhu Chen; Hua Han; Le Kang; Martin Godbout; John C Wallenburg; Paul L'Archevêque; Guy Bellemare; Koji Saeki; Hongguang Wang; Daochang An; Hongbo Fu; Qing Li; Zhen Wang; Renwu Wang; Arthur L Holden; Lisa D Brooks; Jean E McEwen; Mark S Guyer; Vivian Ota Wang; Jane L Peterson; Michael Shi; Jack Spiegel; Lawrence M Sung; Lynn F Zacharia; Francis S Collins; Karen Kennedy; Ruth Jamieson; John Stewart
Journal:  Nature       Date:  2007-10-18       Impact factor: 49.962

5.  Eurasian and African mitochondrial DNA influences in the Saudi Arabian population.

Authors:  Khaled K Abu-Amero; Ana M González; Jose M Larruga; Thomas M Bosley; Vicente M Cabrera
Journal:  BMC Evol Biol       Date:  2007-03-01       Impact factor: 3.260

6.  Second-generation PLINK: rising to the challenge of larger and richer datasets.

Authors:  Christopher C Chang; Carson C Chow; Laurent Cam Tellier; Shashaank Vattikuti; Shaun M Purcell; James J Lee
Journal:  Gigascience       Date:  2015-02-25       Impact factor: 6.524

7.  The genetic heterogeneity of Arab populations as inferred from HLA genes.

Authors:  Abdelhafidh Hajjej; Wassim Y Almawi; Antonio Arnaiz-Villena; Lasmar Hattab; Slama Hmida
Journal:  PLoS One       Date:  2018-03-09       Impact factor: 3.240

8.  Saudi Arabian Y-Chromosome diversity and its relationship with nearby regions.

Authors:  Khaled K Abu-Amero; Ali Hellani; Ana M González; Jose M Larruga; Vicente M Cabrera; Peter A Underhill
Journal:  BMC Genet       Date:  2009-09-22       Impact factor: 2.797

9.  Mitochondrial DNA structure in the Arabian Peninsula.

Authors:  Khaled K Abu-Amero; José M Larruga; Vicente M Cabrera; Ana M González
Journal:  BMC Evol Biol       Date:  2008-02-12       Impact factor: 3.260

10.  Sequence and analysis of a whole genome from Kuwaiti population subgroup of Persian ancestry.

Authors:  Gaurav Thareja; Sumi Elsa John; Prashantha Hebbar; Kazem Behbehani; Thangavel Alphonse Thanaraj; Osama Alsmadi
Journal:  BMC Genomics       Date:  2015-02-18       Impact factor: 3.969

View more
  6 in total

1.  Identification of candidate biomarkers and pathways associated with type 1 diabetes mellitus using bioinformatics analysis.

Authors:  Madhu Pujar; Basavaraj Vastrad; Satish Kavatagimath; Chanabasayya Vastrad; Shivakumar Kotturshetti
Journal:  Sci Rep       Date:  2022-06-01       Impact factor: 4.996

2.  Demographic history differences between Hispanics and Brazilians imprint haplotype features.

Authors:  Pedro Rodrigues Sousa da Cruz; Galina Ananina; Rodrigo Secolin; Vera Lúcia Gil-da-Silva-Lopes; Carmen Silvia Passos Lima; Paulo Henrique Condeixa de França; Amanda Donatti; Gustavo Jacob Lourenço; Tânia Kawasaki de Araujo; Milena Simioni; Iscia Lopes-Cendes; Fernando Ferreira Costa; Mônica Barbosa de Melo
Journal:  G3 (Bethesda)       Date:  2022-07-06       Impact factor: 3.542

Review 3.  Generalizability of GWA-Identified Genetic Risk Variants for Metabolic Traits to Populations from the Arabian Peninsula.

Authors:  Prashantha Hebbar; Mohamed Abu-Farha; Jehad Abubaker; Arshad Mohamed Channanath; Fahd Al-Mulla; Thangavel Alphonse Thanaraj
Journal:  Genes (Basel)       Date:  2021-10-18       Impact factor: 4.096

4.  Salivary Microbiome Diversity in Kuwaiti Adolescents with Varied Body Mass Index-A Pilot Study.

Authors:  Hend Alqaderi; Meganathan P Ramakodi; Rasheeba Nizam; Sindhu Jacob; Sriraman Devarajan; Muthukrishnan Eaaswarkhanth; Fahd Al-Mulla
Journal:  Microorganisms       Date:  2021-06-04

5.  Unraveling a fine-scale high genetic heterogeneity and recent continental connections of an Arabian Peninsula population.

Authors:  Muthukrishnan Eaaswarkhanth; Ajai K Pathak; Linda Ongaro; Francesco Montinaro; Prashantha Hebbar; Osama Alsmadi; Mait Metspalu; Fahd Al-Mulla; Thangavel Alphonse Thanaraj
Journal:  Eur J Hum Genet       Date:  2021-03-22       Impact factor: 4.246

6.  The genomic history of the Middle East.

Authors:  Mohamed A Almarri; Marc Haber; Reem A Lootah; Pille Hallast; Saeed Al Turki; Hilary C Martin; Yali Xue; Chris Tyler-Smith
Journal:  Cell       Date:  2021-08-04       Impact factor: 41.582

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.