Literature DB >> 28905132

Transethnic insight into the genetics of glycaemic traits: fine-mapping results from the Population Architecture using Genomics and Epidemiology (PAGE) consortium.

Stephanie A Bien1, James S Pankow2, Jeffrey Haessler3, Yinchang Lu4, Nathan Pankratz5, Rebecca R Rohde6, Alfred Tamuno7, Christopher S Carlson3, Fredrick R Schumacher8, Petra Bůžková9, Martha L Daviglus10, Unhee Lim11, Myriam Fornage12, Lindsay Fernandez-Rhodes6, Larissa Avilés-Santa13, Steven Buyske14,15, Myron D Gross5, Mariaelisa Graff6, Carmen R Isasi16, Lewis H Kuller17, JoAnn E Manson18, Tara C Matise14, Ross L Prentice3, Lynne R Wilkens11, Sachiko Yoneyama19,20, Ruth J F Loos7,21,22,23, Lucia A Hindorff24, Loic Le Marchand11, Kari E North6,25, Christopher A Haiman26, Ulrike Peters3, Charles Kooperberg3.   

Abstract

AIMS/HYPOTHESIS: Elevated levels of fasting glucose and fasting insulin in non-diabetic individuals are markers of dysregulation of glucose metabolism and are strong risk factors for type 2 diabetes. Genome-wide association studies have discovered over 50 SNPs associated with these traits. Most of these loci were discovered in European populations and have not been tested in a well-powered multi-ethnic study. We hypothesised that a large, ancestrally diverse, fine-mapping genetic study of glycaemic traits would identify novel and population-specific associations that were previously undetectable by European-centric studies.
METHODS: A multiethnic study of up to 26,760 unrelated individuals without diabetes, of predominantly Hispanic/Latino and African ancestries, were genotyped using the Metabochip. Transethnic meta-analysis of racial/ethnic-specific linear regression analyses were performed for fasting glucose and fasting insulin. We attempted to replicate 39 fasting glucose and 17 fasting insulin loci. Genetic fine-mapping was performed through sequential conditional analyses in 15 regions that included both the initially reported SNP association(s) and denser coverage of SNP markers. In addition, Metabochip-wide analyses were performed to discover novel fasting glucose and fasting insulin loci. The most significant SNP associations were further examined using bioinformatic functional annotation.
RESULTS: Previously reported SNP associations were significantly replicated (p ≤ 0.05) in 31/39 fasting glucose loci and 14/17 fasting insulin loci. Eleven glycaemic trait loci were refined to a smaller list of potentially causal variants through transethnic meta-analysis. Stepwise conditional analysis identified two loci with independent secondary signals (G6PC2-rs477224 and GCK-rs2908290), which had not previously been reported. Population-specific conditional analyses identified an independent signal in G6PC2 tagged by the rare variant rs77719485 in African ancestry. Further Metabochip-wide analysis uncovered one novel fasting insulin locus at SLC17A2-rs75862513. CONCLUSIONS/
INTERPRETATION: These findings suggest that while glycaemic trait loci often have generalisable effects across the studied populations, transethnic genetic studies help to prioritise likely functional SNPs, identify novel associations that may be population-specific and in turn have the potential to influence screening efforts or therapeutic discoveries. DATA AVAILABILITY: The summary statistics from each of the ancestry-specific and transethnic (combined ancestry) results can be found under the PAGE study on dbGaP here: https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000356.v1.p1.

Entities:  

Keywords:  Fine-mapping; Genetic; Glucose; Glycaemic; Insulin; Multiethnic; Page; Transethnic; Type 2 diabetes

Mesh:

Substances:

Year:  2017        PMID: 28905132      PMCID: PMC5918310          DOI: 10.1007/s00125-017-4405-1

Source DB:  PubMed          Journal:  Diabetologia        ISSN: 0012-186X            Impact factor:   10.122


Introduction

Type 2 diabetes is a growing epidemic that disproportionally burdens US minority populations [1]. Elevated levels of fasting glucose and fasting insulin in individuals without diabetes are markers of dysregulated glucose metabolism and are strong risk factors for type 2 diabetes [2]. Although twin and family studies provide heritability estimates of 10–50% for these traits [3, 4], family-based linkage studies have been largely unsuccessful in identifying specific contributing loci. Genome-wide association studies (GWAS) greatly accelerated the pace of discovery of genetic variants contributing to glycaemic traits. For example, the Meta-Analyses of Glucose and Insulin-related traits (MAGIC) consortium performed a large-scale investigation of glycaemic traits in individuals of European descent without diabetes and identified 24 fasting glucose loci and eight fasting insulin loci, three of which were associated with both traits [5, 6]. These findings have implicated genes and pathways known to be related to glucose metabolism (e.g. GCK/G6PC2 and glucose dephosphorylation), as well as novel pathways (e.g. MTNR1B and circadian rhythmicity). However, in some instances, the interpretation of GWAS findings has been challenging. For instance, many of the known loci are positioned in non-coding, putative regulatory regions of the genome, which in turn makes it difficult to identify the gene target(s). Additionally, the most significant variant is often not the causal variant but is a correlated variant in linkage disequilibrium with the functional variant(s). While early GWAS efforts were focused on populations of European descent, initial attempts to generalise GWAS findings to more diverse populations have had limited success [7-9]. Importantly, these studies tended to be small and only included the initial most significant GWAS variant (index SNP). However, it is critical that transethnic investigation of GWAS loci include both the index variant and all correlated variants, given that patterns of linkage disequilibrium vary by ancestry and the functional SNP(s) are rarely known. On average, European populations have more highly correlated SNPs and extended haplotypes in comparison with populations of African ancestry (AA). Hispanic/Latino (H/L) populations, on the other hand, are more admixed with highly variable contributions of African, European and New World ancestry. Due in part to reduction in linkage disequilibrium with neighbouring SNPs, transethnic studies can utilise these differences across and within admixed populations to localise causal variants, and discover novel population-specific associations that were undetectable in genetically homogeneous studies. Thus, transethnic studies may provide insight into the underlying biology of complex traits, which may differ among groups. The Metabochip was developed to fine-map GWAS loci for metabolic and cardiovascular traits, as well as replicate promising loci with suggestive, but not genome-wide, significant p values [10]. Among the 196,725 Metabochip variants selected for fine-mapping metabolic and cardiovascular-related loci, approximately 40,000 were selected for type 2 diabetes and related biomarkers. Among the 39 fasting glucose loci and 17 fasting insulin loci [5, 6] that were available for replication, 15 loci included not only the index SNP but also denser coverage of SNPs on the Metabochip that could be utilised for fine-mapping. Importantly, despite very large sample sizes, attempted Metabochip fine-mapping in a population of European descent generally did not yield stronger associations than the original GWAS index SNP and did not reduce the number of SNPs reaching similar levels of significance [11]. As such, this effort was unable to narrow in on functional candidate SNP(s). This study examined the association of Metabochip SNPs with fasting glucose and fasting insulin in a multiethnic study of up to 26,760 participants: 14,953 H/L, 10,380 AA, 998 Asian and Pacific Islander (ASN) and 429 American Indian/Alaskan Native (AI/AN) populations from the Population Architecture using Genetic Epidemiology (PAGE) consortium. Specifically, we carried out the following procedures: (1) tested the association of index SNPs previously reported for 39 fasting glucose and 17 fasting insulin loci from studies of individuals of European descent; (2) used transethnic meta-analysis to refine known glycaemic trait loci in 15 loci which were densely covered with SNPs on the Metabochip; (3) investigated remaining metabolic and cardiovascular trait loci on the Metabochip for association with these glycaemic traits and (4) performed bioinformatic functional annotation of the most significant (lead) SNPs to further prioritise likely causal variants.

Methods

Ethics statement

This study was performed in accordance with the tenets of the Declaration of Helsinki and approved by the Institutional Review Boards of each participating study. All study participants provided written informed consent.

Study population and trait measurement

The PAGE consortium was funded by the National Human Genome Research Institute (NHGRI) to investigate the epidemiological architecture of well-replicated genetic variants associated with human diseases or traits [12]. This analysis includes self-reported H/L, AA, ASN and AI/AN individuals without diabetes, aged 18 years or over, from the Multiethnic Cohort Study (MEC), the Women’s Health Initiative (WHI), Atherosclerosis Risk in Communities (ARIC), Coronary Artery Risk Development in Young Adults (CARDIA), the Hispanic Community Health Study/Study of Latinos (HCHS/SOL) and the Mount Sinai School of Medicine’s (MSSM) DNA biobank (BioMe). Further details about each cohort can be found in the electronic supplementary materials (ESM) Methods (study population and trait measurement section). Fasting glucose and fasting insulin concentrations were measured using standard assays, at laboratories specific to each PAGE site (ESM Table 1). Individuals self-reporting that they had ever been diagnosed with diabetes or taken diabetes medications or who had fasting blood glucose levels ≥ 6.99 mmol/l (≥ 126 mg/dl) were excluded from analyses. Individuals with BMI < 16.5 kg/m2 or BMI > 70 kg/m2 were also excluded on the assumption that these extremes could be attributable to data coding errors or underlying illness or could reflect a familial syndrome. Prior to analyses, each study removed race/ethnicity outliers using ancestry informative principal components. After exclusions, fasting glucose analyses consisted of 14,953 H/L, 10,380 AA, 998 ASN and 429 AI/AN individuals. Fasting insulin analyses involved fewer individuals: 12,895 H/L, 8361 AA, 998 ASN and 420 AI/AN. Fasting insulin was not available for BioMe. Race/ethnicity was self-reported. Descriptive characteristics of PAGE study participants by cohort can be found in ESM Table 2. While ASN and AI/AN were included for transethnic meta-analysis, population-specific analyses were underpowered due to small sample sizes. As such, ASN and AI/AN population-specific analyses were used as a comparison for consistency in the direction of effect.

Genotyping and quality control

Genotyping was performed using the Metabochip, the design of which has been described elsewhere [10]. In brief, the 200K Metabochip is designed to cost effectively analyse putative association signals identified through GWAS of many glucose- and insulin-related metabolic and cardiovascular traits and to fine-map established loci [10]. More than 122,000 SNPs were included to fine-map 257 GWAS loci for 23 traits [10]. Fine-mapping loci were defined as the GWAS index SNP and all correlated SNPs (r 2 ≥ 0.5) that were within 0.02 cM of the index and having a minor allele frequency (MAF) > 1% in at least one HapMap Phase I population. SNPs were excluded if the Illumina design score was < 0.5 or there were SNPs within 15 bp of the SNP of interest with MAF of > 2% among Europeans (CEU [HapMap Population Code for Utah residents (CEPH) with Northern and Western European ancestry]). Metobochip genotyping was performed for MEC, ARIC, CARDIA, HCHS/SOL and WHI [13] individuals. Standard quality control filters were applied for samples and SNPs, including missing rate and Hardy–Weinberg equilibrium (p < 1 × 10−7). A portion of WHI individuals of AA had both Metabochip and the Affymetrix 6.0 genotype data available from the SNP Health Association Resource (SHARe); this was used to impute Metabochip SNPs in the remaining SHARe participants with only Affymetrix 6.0 GWAS [8] and only dosages with imputation R 2 > 0.3 were included in the analyses. In BioMe, genotypes from the Illumina HumanOmniExpress array were imputed to 1000 Genome Phase I haplotype panels (March 2012) [14]. Metabochip SNPs with ‘proper info’ score ≥ 0.4 were included in the analysis. Principal components were determined within each study using the Eigensoft software [15]. We excluded SNPs with a minor allele count less than 5 within each study by racial/ethnic population. The sample success rate and concordance rate for duplicate pairs across all studies was ≥ 95% and ≥ 99%, respectively. Further genotyping and analytical characteristics of the participating studies are further summarised in ESM Methods (genotyping and quality control section) and ESM Table 1.

Replication and fine-mapping approach

The overall study design for replication, fine-mapping and discovery of novel loci is summarised in Fig. 1. For replication of known loci, unconditional association analyses were performed for previously reported index SNPs listed in ESM Table 3. A nominal significance level (α = 0.05) was used to define replication of a locus. Next, unconditional association analyses were performed for all SNPs in a locus by race/ethnicity and by transethnic meta-analysis. A locus-specific p value threshold was defined as 0.05 divided by the number of SNPs passing quality control in each region (ranging from α = 1.4 × 10−5 to α = 4.1 × 10−4, Table 1). Locus-specific significance was used to conservatively adjust for multiple testing, while also acknowledging that genetic variation is known to influence glycaemic traits in these regions. Linkage disequilibrium was calculated for PAGE H/L, AA and Asian samples with 500 kb sliding windows using PLINK [16]. Metabochip linkage disequilibrium and frequency information in Europeans was provided by the 1000 Genomes Phase 3 population. These linkage disequilibrium patterns were used to evaluate locus refinement. Additionally, LocusZoom plots [17] were used to graphically display the fine-mapping results and linkage disequilibrium for these plots used 1000 Genomes Phase I Super Populations (European ancestry [EUR], admixed American ancestry [AMR], African ancestry [AFR]). After identifying the most significant lead SNP in each region, we searched for additional independent association signals by including the lead SNP in the conditional model and then testing each of the remaining SNPs in a region. These conditional analyses were repeated, adding in the lead SNP and conditional lead SNP(s), until no SNP in the model had a conditional p value less than the locus-specific significance. Sequential conditional analyses were performed for each race/ethnicity and transethnic meta-analysis. Further details on our approach to locus refinement are provided in ESM Methods (replication and fine-mapping of known glycaemic trait loci section).
Fig. 1

PAGE Metabochip Study Design. Primary results presented were from models including BMI as a covariate. ESM Tables 5 and 6 include results from models without BMI as a covariate

Table 1

Characterisation of 15 fine-mapping genomic regions analysed for fasting glucose and fasting insulin

ChromosomeLocusBase pair range (GRCh37/hg19)No. of SNPs on MetabochipNo. of SNPsa αTrait
1q32.3 PROX1 214,124,818–214,167,5081531293.9 × 10−4 Glucose
2p23.3 GCKR 27,389,634–27,951,65810999665.2 × 10−5 Both
2q31.1 G6PC2 169,752,640–169,814,6552402112.4 × 10−4 Glucose
3q21.1 ADCY5 122,976,919–123,206,9199247866.2 × 10−5 Glucose
3q26.2 SLC2A2 170,532,111–170,769,1717176537.7 × 10−5 Glucose
7p21.2 DGKB 14,185,088–15,145,520389435551.4 × 10−5 Glucose
7p13 GCK 44,222,003–44,266,0771481224.1 × 10−4 Glucose
9p24.2 GLIS3 4,243,162–4,310,5584193851.3 × 10−4 Glucose
10q25.2 ADRA2A/TCF7L2 112,967,738–113,053,0394624241.2 × 10−4 Glucose
11p15.4 CRY2 45,706,162–46,162,82910829215.4 × 10−5 Glucose
11p11.2 MADD 46,921,641–48,091,303239220372.5 × 10−5 Glucose
11q12.2 FADS2 61,505,583–61,751,6247266437.8 × 10−5 Glucose
11q14.3 MTNR1B 92,667,047–92,725,3212141802.8 × 10−4 Glucose
12q23.2 IGF1 103,851,897–104,450,976130710594.7 × 10−5 Insulin
15q22.2 C2CD4A 62,099,182–62,520,10911439495.3 × 10−5 Glucose

α is the Bonferroni significance threshold (0.05/no. of SNPs passing quality control) used to define region-specific significance

aNo. of SNPs passing quality control in the transethnic meta-analysis

PAGE Metabochip Study Design. Primary results presented were from models including BMI as a covariate. ESM Tables 5 and 6 include results from models without BMI as a covariate Characterisation of 15 fine-mapping genomic regions analysed for fasting glucose and fasting insulin α is the Bonferroni significance threshold (0.05/no. of SNPs passing quality control) used to define region-specific significance aNo. of SNPs passing quality control in the transethnic meta-analysis

Discovery of novel loci

Metabochip-wide analyses were performed to identify novel associations with fasting glucose and fasting insulin. Statistical significance for the Metabochip-wide analysis was set at 0.05 divided by the number of Metabochip SNPs passing quality control (α = 2.7 × 10−7). Results were examined through qq plots and Manhattan plots for each model, highlighting known regions defined in ESM Table 4. Further details are provided in ESM Methods (strategy for selecting novel associations section).

Statistical analysis

First, in each study with unrelated individuals we performed race/ethnic-specific analyses for fasting glucose and natural log-transformed fasting insulin, excluding ancestry outliers and first-degree relatives. In HCHS/SOL, a weighted version of generalised estimation equations was used to account for unequal inclusion probabilities and complex family-based sampling designs [18]. Models adjusted for age, sex (except WHI), study site (as applicable), smoking status (current vs former/never), continuous BMI and ancestry principal components. Like previous studies [11], primary analyses adjusted for BMI because it is a major risk factor for type 2 diabetes and is correlated with glycaemic traits. For comparison, all models were also run without adjustment for BMI. Next, fixed-effect models with inverse-variance weighting were used to pool the study-specific SNP effect estimates and their standard errors by race/ethnicity as implemented in METAL [19]. Finally, summary statistics from METAL for H/L, AA, NA/AI and ASN were combined using inverse-variance weighted fixed effects meta-analysis in METAL. Q statistics and I 2 were used to evaluate heterogeneity across studies and race/ethnicity. Further details are provided in ESM Methods (statistical analysis section).

Functional annotation

Detailed information on the functional annotation methods and various datasets used is provided in ESM Methods (functional annotation section). In brief, it is expected that the lead SNPs are more likely to be functional or to be in stronger linkage disequilibrium with underlying functional variant(s). Therefore, lead SNPs and all correlated SNPs (r 2 > 0.2 in 1000 Genomes Phase 3 AFR/AMR populations) were annotated using publicly available functional datasets. Potential functional effects were assessed using PolyPhen2 [20] (http://genetics.bwh.harvard.edu/pph2/, accessed 24 August 2016) for non-synonymous variants, SPANR (http://tools.genes.toronto.edu/) [21] for variants near splice sites, TargetScan miRNA Regulatory Sites for 3′-UTR regions [22], ENCODE/NIH Roadmap data [23-25] and GTEx (https://www.gtexportal.org/home/) [26] to identify non-coding variants positioned in predicted regulatory elements.

Results

Demographics

We included a total of 26,760 participants (14,953 H/L, 10,380 AA, 998 ASN, and 429 AI/AN) in fasting glucose analyses. The sample sizes for fasting insulin analyses were slightly smaller, with a total of 22,674 participants (12,895 H/L, 8361 AA, 998 ASN and 420 AI/AN). The mean age across the five cohorts was 55 years for men and 59 years for women (range 18–93 years). Study-specific descriptive characteristics are shown in ESM Table 2. Particularly due to the inclusion of the WHI cohort, the proportion of women in the total study population was high, with the highest fraction observed in AA (82.6% for fasting glucose and 97.1% for fasting insulin). Glycaemic trait distributions were similar across studies and ethnicities, with average fasting glucose levels ranging from 4.7 ± 0.7 mmol/l to 5.5 ± 0.6 mmol/l and average fasting insulin levels ranging from 43.3 ± 23.6 pmol/l to 75.9 ± 38.8 pmol/l.

Generalisation of European glycaemic trait loci

We found that 31/39 (79.5%) fasting glucose loci and 14/17 (82.3%) fasting insulin loci had a p value smaller than 0.05. Index SNP associations were directionally consistent in our transethnic PAGE meta-analysis and only four SNPs had heterogeneity p values less than 0.05 (Table 2). The effect estimates (βs) of index SNPs in the transethnic meta-analysis were very similar to those published in Metabochip analysis of individuals of European descent (Pearson’s r 2 = 0.86, 95% CI 0.78, 0.91; p < 2.2 × 10−16; ESM Fig. 1). At three loci (WARS, GIPR and DPYSLS) we observed replication in only H/L and not the transethnic meta-analysis. Interestingly, while the sample sizes were much smaller for Asian individuals than for H/L and AA individuals, the transethnic meta-analysis of the PROX1 index (rs340874) was only nominally significant and directionally consistent in the Asian samples. In the remaining loci that did not replicate in transethnic meta-analysis or the race/ethnic-specific analyses, the effects were generally similar or at least in the same direction. Analyses without inclusion of BMI as a covariate were generally similar, with slightly lower significance at some loci. Full summary statistics for models with and without BMI covariate are reported in ESM Table 5 and ESM Table 6, respectively.
Table 2

Replication of European Metabochip index SNPs for 39 fasting glucose and 17 fasting insulin loci via transethnic meta-analysis

Locus/geneLead EURC/NC alleleCoded allele frequencyEffect β of coded allele (SE)Analyses with p < 0.05 p value TE Meta (Het.)
EURH/LAAASNTE MetaEURH/LAAASNTE Meta
Fasting glucose loci (NTE = 26,760, NEUR = 118,881)
 1q32.3PROX1 rs340874A/G0.480.600.820.610.67−0.015 (0.002)−0.004 (0.006)−0.009 (0.009)0.076 (0.027)−0.003 (0.005)ASN0.59 (0.02)
 2p23.3GCKR rs780094A/G0.390.350.190.520.30−0.029 (0.002)−0.033 (0.007)−0.016 (0.010)−0.051 (0.027)−0.029 (0.005)H/L, ASN, TE2 × 10−8 (0.2)
 2q31.1G6PC2 rs560887A/G0.300.170.070.030.14−0.075 (0.003)−0.086 (0.008)−0.063 (0.014)−0.065 (0.077)−0.079 (0.007)H/L, AA, TE1 × 10−29 (0.48)
 3q21.1ADCY5 rs11708067A/G0.790.750.840.960.780.024 (0.003)0.021 (0.007)0.052 (0.010)−0.254 (0.171)0.031 (0.006)H/L, AA, TE5 × 10−8 (0.02)
 3q26.2SLC2A2 rs1280A/G0.860.840.650.970.730.031 (0.003)0.052 (0.009)−0.001 (0.007)0.043 (0.082)0.021 (0.006)H/L, TE1 × 10−4 (2 × 10−5)
 7p21.2DGKB rs2191349A/C0.530.480.570.690.510.032 (0.002)0.023 (0.006)0.005 (0.009)0.003 (0.028)0.017 (0.005)H/L, TE8 × 10−4 (0.42)
 7p13GCK rs730497A/G0.160.200.180.180.200.061 (0.003)0.061 (0.008)0.056 (0.009)0.004 (0.034)0.057 (0.006)H/L, AA, TE3 × 10−22 (0.37)
 8q24.11SLC30A8 rs11558471A/G0.680.750.900.570.770.032 (0.002)0.018 (0.007)0.014 (0.012)−0.004 (0.026)0.017 (0.006)H/L, TE4 × 10−3 (0.22)
 9p24.2GLIS3 rs10814916A/C0.490.430.330.540.40−0.017 (0.002)−0.016 (0.006)−0.009 (0.008)−0.066 (0.027)−0.015 (0.005)H/L, ASN, TE1 × 10−3 (0.21)
 10q25.2ADRA2A rs11195502A/G0.090.130.340.070.25−0.036 (0.004)−0.014 (0.010)−0.012 (0.008)−0.022 (0.054)−0.013 (0.006)TE0.04 (0.62)
 10q25.2TCF7L2 rs4506565A/T0.700.710.560.930.64−0.024 (0.002)−0.030 (0.007)−0.019 (0.007)−0.137 (0.060)−0.025 (0.005)All3 × 10−7 (0.19)
 11p11.2CRY2 rs11605924A/C0.490.540.860.810.630.022 (0.002)0.017 (0.006)0.027 (0.011)−0.066 (0.034)0.018 (0.005)All1 × 10−3 (0.03)
 11p11.2MADD rs11039182A/G0.730.820.950.970.850.023 (0.003)0.000 (0.009)0.021 (0.016)−0.002 (0.091)0.004 (0.007)None0.55 (0.67)
 11q12.2FADS2 rs174550A/G0.660.520.910.570.600.018 (0.002)0.026 (0.007)0.036 (0.013)0.039 (0.027)0.029 (0.006)H/L, AA, TE7 × 10−7 (0.9)
 11q14.3MTNR1B rs10830963C/G0.710.790.930.600.81−0.078 (0.003)−0.062 (0.008)−0.090 (0.014)−0.078 (0.026)−0.068 (0.006)All7 × 10−27 (0.21)
 15q22.2C2CD4A rs4502156A/G0.550.400.260.520.350.023 (0.002)0.017 (0.007)0.006 (0.008)0.008 (0.026)0.012 (0.005)H/L, TE0.01 (0.77)
 9p21.3CDKN2B rs10811661A/G0.820.860.930.560.860.024 (0.003)0.021 (0.009)0.017 (0.014)0.072 (0.026)0.024 (0.007)H/L, ASN, TE0.02 (0.29)
 5q15PCSK1 rs4869272A/G0.690.750.780.730.760.018 (0.002)0.021 (0.007)0.019 (0.008)0.032 (0.029)0.020 (0.005)H/L, AA, TE1 × 10−3 (0.97)
 13q12.2PDX1 rs11619319A/G0.770.710.830.550.75−0.020 (0.002)−0.008 (0.007)−0.017 (0.010)−0.054 (0.026)−0.012 (0.006)AA, ASN, TE0.05 (0.32)
 8p23.1PPP1R3B rs983309A/C0.120.210.280.020.240.026 (0.003)0.023 (0.008)0.017 (0.008)0.004 (0.104)0.020 (0.006)H/L, AA, TE2 × 10−3 (0.96)
 7p12.1GRB10 rs6943153A/G0.340.450.680.280.540.015 (0.002)0.019 (0.006)−0.004 (0.008)−0.010 (0.030)0.009 (0.005)H/L, TE0.07 (0.11)
 11q13.4ARAP1 rs11603334A/G0.170.080.050.050.07−0.019 (0.003)−0.030 (0.011)−0.039 (0.016)−0.086 (0.067)−0.033 (0.009)H/L, AA, TE1 × 10−5 (0.69)
 20p11.21FOXA2 rs6113722A/G0.040.050.160.180.13−0.035 (0.005)−0.042 (0.014)−0.040 (0.010)−0.090 (0.033)−0.043 (0.008)All2 × 10−6 (0.55)
 9q31.3IKBKAP rs16913693A/C0.970.960.7710.810.043 (0.007)0.010 (0.017)−0.012 (0.008)0.334 (0.333)−0.008 (0.008)None0.51 (0.48)
 9q34.3DNLZ rs3829109A/G0.290.330.170.130.28−0.017 (0.003)−0.021 (0.007)−0.026 (0.010)0.000 (0.040)−0.022 (0.006)H/L, AA, TE5 × 10−4 (0.91)
 14q32.2WARS rs3783347A/C0.210.120.060.10.11−0.017 (0.003)−0.023 (0.010)0.000 (0.014)0.000 (0.044)−0.014 (0.008)H/L0.08 (0.40)
 19q13.32GIPR rs2302593C/G0.50.510.280.390.420.014 (0.002)−0.013 (0.006)−0.002 (0.008)0.019 (0.027)−0.008 (0.005)H/L0.05 (0.55)
 6p22.3CDKAL1 rs9368222A/C0.280.230.190.410.230.014 (0.002)0.025 (0.007)0.025 (0.009)0.041 (0.026)0.026 (0.006)H/L, AA, TE3 × 10−5 (0.94)
 12q24.33P2RX2 rs10747083A/G0.660.690.850.830.740.013 (0.002)0.010 (0.007)0.012 (0.011)−0.017 (0.034)0.010 (0.006)None0.12 (0.88)
 20q12TOP1 rs6072275A/G0.160.120.080.020.110.016 (0.003)0.021 (0.010)0.019 (0.013)−0.075 (0.121)0.021 (0.008)H/L, TE5 × 10−3 (0.53)
 3q27.2IGF2BP2 rs7651090A/G0.690.70.460.70.59−0.013 (0.002)−0.011 (0.007)−0.011 (0.007)−0.023 (0.029)−0.011 (0.005)TE0.07 (0.90)
 13q13.1KL rs576674A/G0.850.680.40.850.56−0.017 (0.003)−0.026 (0.007)−0.014 (0.007)0.054 (0.038)−0.019 (0.005)H/L, AA, TE7 × 10−4 (0.08)
 3p21.31AMT rs11715915A/G0.320.210.240.080.22−0.012 (0.002)−0.007 (0.008)0.003 (0.008)0.053 (0.051)−0.002 (0.006)None0.59 (0.56)
 6p24.3RREB1 rs17762454A/G0.260.330.160.410.280.012 (0.002)0.017 (0.007)0.012 (0.010)0.011 (0.027)0.015 (0.005)H/L, TE0.02 (0.97)
 5q13.3ZBED3 rs7708285A/G0.730.690.850.910.74−0.011 (0.003)−0.004 (0.007)0.003 (0.010)0.002 (0.060)−0.003 (0.006)None0.4 (0.47)
 12q13.3GLS2 rs2657879A/G0.820.810.93NA0.83−0.012 (0.003)−0.011 (0.008)0.016 (0.015)−0.005 (0.007)None0.43 (0.11)
 2p23.3DPYSL5 rs1371614A/G0.250.380.350.160.360.020 (0.004)0.021 (0.007)−0.006 (0.007)−0.021 (0.036)0.009 (0.005)H/L0.03 (0.05)
 15q22.2C2CD4B rs12440695*A/G0.630.570.830.710.650.008 (0.003)0.004 (0.007)−0.002 (0.009)−0.011 (0.028)0.003 (0.005)None0.63 (0.58)
 11p11.2OR4S1 rs1483121A/G0.140.090.030.030.08−0.027 (0.005)0.008 (0.011)−0.022 (0.022)−0.101 (0.220)0.002 (0.010)None0.59 (0.62)
Fasting insulin loci (NTE = 22,674, NEUR = 99,029)
 1q41LYPLAL1 rs4846565A/G0.330.410.090.340.32−0.013 (0.002)−0.023 (0.008)−0.007 (0.013)0.022 (0.028)−0.017 (0.007)H/L, TE0.01 (0.34)
 2p23.3GCKR rs780094A/G0.390.350.190.520.30−0.029 (0.002)−0.031 (0.008)−0.029 (0.010)−0.011 (0.027)−0.030 (0.006)H/L, AA, TE2 × 10−7 (0.41)
 2q24.3GRB14 rs10195252A/G0.600.670.280.890.490.017 (0.002)0.041 (0.008)0.036 (0.008)−0.044 (0.044)0.037 (0.006)H/L, AA, TE1 × 10−10 (0.29)
 2q36.3IRS1 rs2943645A/G0.630.740.630.900.680.019 (0.002)0.018 (0.009)0.012 (0.008)0.062 (0.046)0.016 (0.006)H/L, TE4 × 10−3 (0.54)
 3p25.2PPARG rs17036328A/G0.860.890.830.950.850.021 (0.003)0.038 (0.012)0.009 (0.010)0.036 (0.068)0.022 (0.007)H/L, TE2 × 10−3 (0.15)
 4q22.1FAM13A rs3822072A/G0.480.440.510.630.470.012 (0.002)0.008 (0.008)0.018 (0.010)0.024 (0.028)0.012 (0.006)AA, TE0.04 (0.82)
 4q24TET2 rs974801A/G0.620.580.720.400.64−0.014 (0.002)−0.018 (0.008)−0.009 (0.008)−0.023 (0.027)−0.015 (0.006)H/L, TE6 × 10−3 (0.31)
 4q32.1PDGFC rs6822892A/G0.680.590.270.700.450.014 (0.002)0.012 (0.008)0.003 (0.008)0.009 (0.029)0.009 (0.006)None0.12 (0.76)
 5q11.2ARL15 rs4865796A/G0.670.790.750.810.770.015 (0.002)0.016 (0.009)0.024 (0.008)0.006 (0.036)0.020 (0.006)AA, TE9 × 10−4 (0.80)
 5q11.2ANKRD55 rs459193A/G0.270.270.420.520.36−0.015 (0.002)−0.025 (0.009)−0.022 (0.008)−0.040 (0.026)−0.022 (0.006)All4 × 10−5 (0.30)
 6p21.31UHRF1BP1 rs6912327A/G0.800.690.35NA0.510.016 (0.003)0.004 (0.008)−0.004 (0.008)0.001 (0.006)None0.83 (0.08)
 6q22.33RSPO3 rs2745353A/G0.510.580.600.610.590.011 (0.002)0.016 (0.008)0.010 (0.008)−0.039 (0.027)0.011 (0.005)H/L, TE0.03 (0.25)
 7q11.23HIP1 rs1167800A/G0.540.670.840.690.730.011 (0.002)0.018 (0.008)0.009 (0.010)−0.004 (0.028)0.011 (0.006)H/L0.08 (0.07)
 8p23.1PPP1R3B rs983309A/C0.130.210.280.020.250.022 (0.003)0.026 (0.010)0.024 (0.008)−0.082 (0.103)0.026 (0.006)All2 × 10−5 (0.02)
 10q25.2TCF7L2 rs7903146A/G0.270.250.280.080.27−0.013 (0.002)−0.014 (0.009)−0.022 (0.008)0.023 (0.057)−0.019 (0.006)AA, TE1 × 10−3 (0.51)
 12q23.2IGF1 rs35767A/G0.180.240.440.330.36−0.003 (0.003)−0.014 (0.011)0.006 (0.008)−0.050 (0.032)−0.004 (0.006)None0.43 (0.28)
 19q13.11PEPD rs731839A/G0.660.610.630.480.61−0.015 (0.002)−0.016 (0.008)−0.003 (0.008)−0.037 (0.026)−0.012 (0.005)H/L, TE0.03 (0.23)

EUR, individuals of European descent from Scott et al. [11] genotyped on Metabochip. Models included continuous BMI covariate, *rs12440695 used as a linkage disequilibrium proxy (r 2 = 0.98) for the index SNP rs11071657, which did not pass quality control. β, allelic effect size for an additive genetic model corresponding to the coded (C) allele, is shown in units of mmol/l for fasting glucose and natural log-transformed pmol/l for fasting insulin. Full results for models with and without BMI covariate for fasting glucose and fasting insulin are shown in ESM Table 5 and ESM Table 6, respectively

p values are shown for the transethnic (TE) meta-analysis and heterogeneity (Het.) in effect across populations

Replication of European Metabochip index SNPs for 39 fasting glucose and 17 fasting insulin loci via transethnic meta-analysis EUR, individuals of European descent from Scott et al. [11] genotyped on Metabochip. Models included continuous BMI covariate, *rs12440695 used as a linkage disequilibrium proxy (r 2 = 0.98) for the index SNP rs11071657, which did not pass quality control. β, allelic effect size for an additive genetic model corresponding to the coded (C) allele, is shown in units of mmol/l for fasting glucose and natural log-transformed pmol/l for fasting insulin. Full results for models with and without BMI covariate for fasting glucose and fasting insulin are shown in ESM Table 5 and ESM Table 6, respectively p values are shown for the transethnic (TE) meta-analysis and heterogeneity (Het.) in effect across populations

Fine-mapping of European glycaemic trait loci

Among the 15 glycaemic trait loci for which fine-mapping was attempted on the Metabochip, ten fasting glucose loci and two fasting insulin loci had one or more SNPs that reached locus-specific significance (α = 0.05/number of SNPs in the locus) in the transethnic meta-analysis. The p values ranged from 1.0 × 10−29 at G6PC2-rs560887 to 1.5 × 10−4 at PROX1-rs10494973 (Table 3). Although AI/AN ancestries were included in the transethnic meta-analysis, the AI/AN results are not shown because the small sample size was underpowered for population-specific analysis. At four fasting glucose loci, the most significant lead SNP in PAGE transethnic meta-analysis was the same as the European index SNP from prior Metabochip evaluation (G6PC2, ADCY5, MTNR1B and FADS2). For six fasting glucose loci (PROX1, GCKR, SLC2A2, DGKB, GCK and GLIS3) and the one fasting insulin locus (GCKR), the lead SNP in PAGE transethnic meta-analysis was in moderate or weak linkage disequilibrium with the index SNP in 1000 Genomes Population EUR (r 2 > 0.2). At these six fasting glucose loci and one fasting insulin locus, the PAGE lead SNP and EUR index SNP were not independent of each other as only one of the two SNP associations maintained nominal significance in transethnic conditional meta-analysis where both lead and index variants were included in the model. This was further supported by investigation of potential fine-mapping through locus zoom plots.
Table 3

Most significant lead SNPs in ten fasting glucose and two fasting insulin fine-mapping loci identified in transethnic meta-analysis

RegionLead PAGE SNPFrequency of coded (C) alleleEffect β of coded allele (SE) p value r 2 with EUR index SNPc No. of LD SNPse
C/NTEa EURH/LAAASNTE MetaH/LAAASNTE Metab Het.EUR SNPd EURH/LAAASNEURTE (% red.)f
Fasting glucose loci
 1q32.3PROX1 rs10494973C/G0.030.480.030.010.010.060 (0.016)0.050 (0.018)**0.100 (0.036)**−0.274 (0.384)2 × 10−4 0.44rs340874<0.10<0.10<0.10<0.1041 (75)
 2p23.3GCKR rs1260326A/G0.290.410.340.150.52−0.032 (0.005)−0.036 (0.007)***−0.020 (0.010)*−0.051 (0.026)*2 × 10−9 0.44rs7800940.920.910.420.9327490 (67)
 2q31.1G6PC2 rs560887A/G0.140.310.170.070.03−0.079 (0.007)−0.086 (0.008)***−0.063 (0.014)***−0.065 (0.077)1 × 10−29 0.48Same11111189 (92)
 3q21.1ADCY5 rs11708067A/G0.780.820.750.840.970.031 (0.006)0.021 (0.007)**0.052 (0.010)***−0.254 (0.171)5 × 10−8 0.02Same11117218 (75)
 3q26.2SLC2A2 rs1604038A/G0.440.290.340.580.23−0.026 (0.005)−0.031 (0.007)***−0.023 (0.007)**0.037 (0.032)1 × 10−7 0.2rs12800.380.450.340.09318162 (49)
 7p21.2DGKB rs62448618A/T0.340.500.380.270.500.022 (0.005)0.030 (0.007)***0.014 (0.008)−0.001 (0.026)1 × 10−5 0.33rs21913490.810.610.030.3913312 (91)
 7p13GCK rs2908286A/G0.190.180.20.180.200.060 (0.006)0.064 (0.008)***0.061 (0.009)***0.002 (0.032)9 × 10−25 0.27rs7304970.990.90.520.912518 (28)
 9p24.2GLIS3 rs10974438A/C0.760.620.710.860.63−0.023 (0.006)−0.019 (0.007)**−0.021 (0.010)*−0.080 (0.028)**6 × 10−5 0.16rs108149160.530.270.080.69547 (87)
 11q12.2FADS2 rs174547A/G0.600.660.520.910.550.029 (0.006)0.026 (0.007)***0.038 (0.013)**0.039 (0.027)4 × 10−7 0.86Same111114744 (70)
 11q14.3MTNR1B rs10830963C/G0.810.780.790.930.59−0.068 (0.006)−0.062 (0.008)***−0.090 (0.014)***−0.078 (0.026)**7 × 10−27 0.21Same1111941 (99)
Fasting insulin loci
 2p23.3GCKR rs1260326A/G0.290.410.350.160.52−0.035 (0.006)−0.034 (0.008)***−0.034 (0.010)***−0.010 (0.027)1 × 10−8 0.20rs7800940.920.910.420.9327490 (67)
 12q23.2IGF1 rs10860845A/C0.60.830.480.740.65−0.023 (0.006)−0.025 (0.008)***−0.023 (0.008)**0.002 (0.028)3 × 10−5 0.76rs860598<0.10<0.10<0.10<0.1032264 (80)

β: effect size from an additive multivariate model including BMI and corresponding to the coded (C) allele, is shown in units of mmol/l for fasting glucose and natural log-transformed pmol/l for fasting insulin

aMAF averaged across ethnicities H/L, AI/AN and ASN from the transethnic (TE) meta-analysis for coded allele

b p value from the transethnic meta-analysis

cLinkage disequilibrium calculated from 1000 genomes Phase 3 super populations (EUR, AFR, AMR, and ASN)

dEuropean SNP index defined as most significant SNP from the Scott et al. [11] Metabochip analysis

eNo. of SNPs in linkage disequilibrium using r 2 > 0.2 calculated from 1000 genomes Phase 3 super populations with transethnic equal to the intersect of SNPs in EUR, AFR, AMR and ASN

fPercentage reduction in the number of SNPs

*p < 0.05, **p < 0.01 and ***p < 0.001 for race/ethnic-specific analyses

†Significant at region-specific Bonferroni-corrected transethnic meta-analysis p values (ranging from α = 1.41 × 10−5 to α = 4.1 × 10−4)

EUR, Europeans, LD, linkage disequilibrium, TE, transethnic

Most significant lead SNPs in ten fasting glucose and two fasting insulin fine-mapping loci identified in transethnic meta-analysis β: effect size from an additive multivariate model including BMI and corresponding to the coded (C) allele, is shown in units of mmol/l for fasting glucose and natural log-transformed pmol/l for fasting insulin aMAF averaged across ethnicities H/L, AI/AN and ASN from the transethnic (TE) meta-analysis for coded allele b p value from the transethnic meta-analysis cLinkage disequilibrium calculated from 1000 genomes Phase 3 super populations (EUR, AFR, AMR, and ASN) dEuropean SNP index defined as most significant SNP from the Scott et al. [11] Metabochip analysis eNo. of SNPs in linkage disequilibrium using r 2 > 0.2 calculated from 1000 genomes Phase 3 super populations with transethnic equal to the intersect of SNPs in EUR, AFR, AMR and ASN fPercentage reduction in the number of SNPs *p < 0.05, **p < 0.01 and ***p < 0.001 for race/ethnic-specific analyses †Significant at region-specific Bonferroni-corrected transethnic meta-analysis p values (ranging from α = 1.41 × 10−5 to α = 4.1 × 10−4) EUR, Europeans, LD, linkage disequilibrium, TE, transethnic For each of the 11 glycaemic trait loci with potential transethnic fine-mapping (fasting glucose loci–PROX1, G6PC2, ADCY5, MTNR1B, FADS2, GCKR, SLC2A2, DGKB, GCK and GLIS3; fasting insulin locus–GCKR), we found that the number of SNPs in linkage disequilibrium with the most significant marker in the transethnic results (r 2 ≥ 0.2 in the 1KG super populations AFR and AMR) were less than the number of SNPs tagged by the EUR marker (r 2 ≥ 0.2 in EUR). Visual inspection of locus zoom plots indicated that transethnic meta-analysis refined each of these loci by reducing the number of highly correlated SNPs reaching the same level of significance and/or narrowing the genomic region containing putative causal SNPs (ESM Fig. 2). On average, the number of variants in high linkage disequilibrium was reduced by 72.5% with the number of linkage disequilibrium SNPs ranging from one at MTNR1B to 162 at SLC2A2 in the PAGE transethnic meta-analysis results. Refinement was most evident at the SLC2A2 locus (Fig. 2). Bioinformatic functional follow-up was performed for each of the eleven glycaemic trait loci with one or more variants passing the region-specific significance threshold in our transethnic meta-analysis. We observed an overlap of promoter and enhancer sequences at each locus and identified potential target genes. These data not only provided further support for the fine-mapping results but also revealed additional insights into the aetiology of glycaemic traits. UCSC Genome Browser images of each locus are provided in ESM Fig. 3. The results of our in silico functional annotations are summarised in ESM Table 7.
Fig. 2

SLC2A2 regional plot. Regional plots of SNP associations (−log10(p value)) with fasting glucose are shown for the MAGIC European (a) and the PAGE transethnic (b) meta-analyses. Not all SNPs used in the transethnic meta-analysis were present in the available MAGIC data (www.magicinvestigators.org/downloads/, accessed 26 June 2017) because of mapping issues [11]. SNPs not passing quality control or outside the fine-mapping region were removed from the transethnic plots. The colour scale indicates linkage disequilibrium (r 2) between each fine-mapping SNP and the GWAS index SNP (rs1280, purple diamond), which was calculated using 1000 Genomes Populations (CEU for MAGIC and AMR for PAGE). The population chosen for linkage disequilibrium colouring in the transethnic meta-analysis was based on population-specific analysis results (choosing the one with strongest underlying SNP associations). The most significant SNPs in MAGIC fine-mapping (rs11709140) and PAGE (rs1604038) are labelled

SLC2A2 regional plot. Regional plots of SNP associations (−log10(p value)) with fasting glucose are shown for the MAGIC European (a) and the PAGE transethnic (b) meta-analyses. Not all SNPs used in the transethnic meta-analysis were present in the available MAGIC data (www.magicinvestigators.org/downloads/, accessed 26 June 2017) because of mapping issues [11]. SNPs not passing quality control or outside the fine-mapping region were removed from the transethnic plots. The colour scale indicates linkage disequilibrium (r 2) between each fine-mapping SNP and the GWAS index SNP (rs1280, purple diamond), which was calculated using 1000 Genomes Populations (CEU for MAGIC and AMR for PAGE). The population chosen for linkage disequilibrium colouring in the transethnic meta-analysis was based on population-specific analysis results (choosing the one with strongest underlying SNP associations). The most significant SNPs in MAGIC fine-mapping (rs11709140) and PAGE (rs1604038) are labelled

Secondary associations at known glycaemic trait loci

To identify additional independent association signals at significant loci, conditional analyses were performed. Results of these analyses and population-specific associations are shown in Table 4. For transethnic conditional meta-analyses, ten fasting glucose loci and two fasting insulin loci were analysed. Independent secondary associations were identified at two fasting glucose loci (G6PC2-rs477224 and GCK-rs2908286). The second round of conditional analyses did not identify significant tertiary signals. Bioinformatic follow-up of rs477224 suggested that the variant is positioned within a pancreatic islet enhancer. The rs2908290 variant was in weak linkage disequilibrium (AMR r 2 = 0.26, AFR r 2 = 0.23) with a variant, rs2971677, predicted to alter splicing efficiency of GCK.
Table 4

Independent secondary signals at known fasting glucose and fasting insulin loci

LocusSecondary SNPa Frequency of coded (C) allele for secondary SNPEffect of coded (C) allele for secondary SNP p valueb Primary SNPc LD r 2d Cond. p value (second./primary)e
C/NTEAAH/LAI/ANASNTEAAH/LAI/ANASN
Transethnic meta-analysis fasting glucose
   G6PC2 rs477224A/G0.5750.4860.6450.6590.820−0.036 (0.005)−0.034 (0.007)***−0.042 (0.007)***0.035 (0.042)−0.006 (0.035)3 × 10−14 rs560887<0.12 × 10−5/5 × 10−26
   GCK rs2908290A/G0.4500.5340.3880.3670.4270.040 (0.005)0.043 (0.007)***0.038 (0.006)***−0.009 (0.041)0.058 (0.027)*10 × 10−18 rs2908286<0.12 × 10−8/6 × 10−16
Population-specific AA fasting glucose
   G6PC2 rs77719485A/C0.9760.9730.9960.9950.138 (0.020)0.143 (0.022)***0.115 (0.054)−0.046 (0.283)6x10−11 rs560887<0.12 × 10−6/5 × 10−7

Sequential conditional analysis was performed on ten fasting glucose and two fasting insulin loci

In the AA fasting glucose analysis, rs77719485 was the most significant SNP in the locus and rs560887 was the second most significant. AA effects for rs560887 are shown in Table 3

aLead SNP from conditional analysis reaching locus-specific significance

b p value from the secondary SNP not adjusted for the primary SNP

cLead SNP from primary (unconditional) analysis

dLD r2 between primary and secondary SNP

e p values from conditional analysis

*p < 0.05 and ***p < 0.001 for race/ethnic-specific analyses

LD, linkage disequilibrium

Independent secondary signals at known fasting glucose and fasting insulin loci Sequential conditional analysis was performed on ten fasting glucose and two fasting insulin loci In the AA fasting glucose analysis, rs77719485 was the most significant SNP in the locus and rs560887 was the second most significant. AA effects for rs560887 are shown in Table 3 aLead SNP from conditional analysis reaching locus-specific significance b p value from the secondary SNP not adjusted for the primary SNP cLead SNP from primary (unconditional) analysis dLD r2 between primary and secondary SNP e p values from conditional analysis *p < 0.05 and ***p < 0.001 for race/ethnic-specific analyses LD, linkage disequilibrium To identify population-specific loci, we conducted separate conditional analyses for significant loci in the primary H/L (GCKR-rs1260326, G6PC2-rs560887, SLC2A2-rs1280, DGKB-rs1005256, GCK-rs1799884, FADS3-rs12577276, MTNR1B-rs10830963, C2CD4A-rs7167881), AA (G6PC2-rs77719485, GCK-rs2908286, CRY2-rs117493014, MADD-rs77082299, ADCY5-rs11708067, MTNR1B-rs10830963) and Asian populations (GLIS3-rs4395942). A population-specific variant was detected in the AA analysis of the G6PC2 locus. The lead fasting glucose SNP, rs77719485, is less frequent in AA population (MAF 2.4%) and rare or monomorphic in the other populations (MAF 0.4% in H/L). Like the transethnic lead SNP, rs560887, bioinformatic follow-up suggested that rs77719485 may affect splicing efficiency for exon 4 for G6PC2.

Association testing outside of glycaemic trait fine-mapping regions to identify potential novel variants

In secondary analyses, we conducted a Metabochip-wide scan to identify potential novel or pleiotropic variants, given that the chip included variants with suggestive signals in established loci for many known metabolic traits. Models were run with and without BMI as a covariate (ESM Table 8, ESM Figs 4,5). Using the Bonferroni significance threshold (0.05/182,055 = 2.7 × 10−7), we identified one novel association for fasting insulin (rs75862513, p = 4.3 × 10−8, Fig. 3) at the SLC17A2 locus previously associated with height and blood pressure [27, 28]. After BMI adjustment (ESM Fig. 5), the association was attenuated suggesting that the effects may be mediated by BMI.
Fig. 3

Fasting insulin association p values for each Metabochip variant from the transethnic meta-analysis in model without BMI. The –log10 of p values for each SNP on the Metabochip is plotted against chromosomal positions. Grey and black circles, SNPs alternating by chromosome; red squares, SNPs in previously reported glycaemic trait loci (within 1 Mb of index SNP n = 28,580); blue diamonds, novel SNP associations reaching Metabochip-wide significance (all are in the SLC17A2 locus); solid line, threshold for Metabochip-wide significance (0.05/174,898 = 2.9 × 10−7); dashed line, threshold for genome-wide significance α = 5.0 × 10−8

Fasting insulin association p values for each Metabochip variant from the transethnic meta-analysis in model without BMI. The –log10 of p values for each SNP on the Metabochip is plotted against chromosomal positions. Grey and black circles, SNPs alternating by chromosome; red squares, SNPs in previously reported glycaemic trait loci (within 1 Mb of index SNP n = 28,580); blue diamonds, novel SNP associations reaching Metabochip-wide significance (all are in the SLC17A2 locus); solid line, threshold for Metabochip-wide significance (0.05/174,898 = 2.9 × 10−7); dashed line, threshold for genome-wide significance α = 5.0 × 10−8

Discussion

In this large multiethnic study population of close to 30,000 participants, we used transethnic fine-mapping to narrow the list of putative causal variants for eleven glycaemic trait loci. On average, we observed a 72% reduction in the number of candidate SNPs, before bioinformatic follow-up. We further demonstrated that many of the genetic variants associated with glycaemic traits likely exert their effects through regulatory mechanisms (splicing or enhancer activity), and provide detailed annotations for subsequent laboratory follow-up. These regulatory annotations provide putative targets for laboratory follow-up (e.g. genome editing) and important insights into strong targets for future therapeutic interventions. For example, this study found that most of the implicated enhancer elements were binding sites for the transcription factor FOXA2 in pancreatic islets, and previous studies have suggested that differential expression of FOXA2 is a genetic determinant of fasting glucose levels, as well as type 2 diabetes risk [29, 30]. Like the previous European Metabochip analysis, we found that rs6113722, which is positioned within a lncRNA adjacent to FOXA2, was associated (p = 3.2 × 10−8) with fasting glucose. As such, expression levels of FOXA2 could be a particularly important regulator of glucose homeostasis and a putative target for genome editing. Although the clinical application of genome editing is in its infancy, in vivo studies have already demonstrated the utility of the CRISPR/Cas9 technique. For example, to mimic observations of the naturally occurring loss-of-function mutation in the gene encoding LDL receptor antagonist PCSK9, a previous study in mice used CRISPR/Cas9 vectors to decrease PCSK9 protein levels, which resulted in increased hepatic LDL receptor levels, and a subsequent decrease in blood cholesterol levels [31]. Identification of key targets, such as FOXA2, and potential regulatory elements of these targets for laboratory follow-up is a critical first step in the translation of GWAS findings. Analysis of known glycaemic trait loci in this diverse population study suggests the genetic determinants of glycaemic trait levels are likely to be similar across populations. In comparison with previous glycaemic trait studies conducted in diverse populations [7, 32], the replication of effects across populations is more extensive, likely due to the size of this study population. Although most of the loci in the European study were generalisable across populations, this study exemplifies the notion that analysis in diverse populations can refine known loci as well as help in the discovery of novel, sometimes population-specific, associations. For instance, in addition to the well-established splice variant rs560887 that has been robustly associated with fasting glucose, transethnic meta-analysis of the G6PC2 locus identified an additional signal that may implicate regulatory functionality in glycaemia-related tissues. At this same locus, an AA-specific variant, rs77719485, was found to be strongly associated with fasting glucose and, like rs560887 [33], is predicted to affect splicing efficiency. By expanding our analysis to the entire Metabochip, we discovered strong associations with SLC17A2, that were not previously reported by the Metabochip analysis carried out by Scott et al [11] in Europeans. rs75862513 is a relatively rare variant that appears to be monomorphic in Europeans and was most frequent in the Asian (MAF = 0.04-A) and H/L (MAF = 0.001-A) populations in this study. If replicated in an independent dataset, this finding may represent a new locus not previously detected in European- or AA-specific analyses. These examples illustrate the power of transethnic analysis for locus refinement and novel discovery. Strengths of this study include the large study size, high-density genotyping and representation of multiple diverse populations. In light of the heavy burden of hyperglycaemia in H/L and AA populations, this study begins to address the major gap in knowledge related to the genetic architecture of glycaemic traits in understudied American minority populations. The large study population, combined with new annotation resources, allowed transethnic fine-mapping and prediction of regulatory elements. However, there were several limitations that should be noted. Although this study included populations from four major racial/ethnic groups, the greatest proportions of participants were H/L and AA. As such, this study was limited in its ability to detect associations with more prominent effects in Asian populations [34, 35]. We also acknowledge that fine-mapping approaches only serve as an initial step in determining the underlying causal variant(s) driving association signals by prioritising likely causal candidates for more onerous laboratory follow-up. To further meet this objective, functional elements and variants were identified using bioinformatics databases. However, given that the functional evidence detected by these datasets is incomplete, future functional studies are critical in determining the underlying causal variants. That being said, the combination of fine-mapping with bioinformatics data is particularly useful for reducing both the physical genomic regions of interest and prioritising candidates for molecular characterisation. Furthermore, the in silico approaches help to provide richer inferences regarding the biological mechanisms modulating fasting glucose and insulin levels. As such, fine-mapping is an essential step in functional interpretation of GWAS signals because laboratory follow-up of all possible variants in GWAS loci is prohibitively expensive and time-intensive. This transethnic study comprehensively fine-mapped known common variants associated with concentrations of fasting glucose and insulin. Genomic regions harbouring known risk variants were refined, novel functional candidates were proposed, new independent signals in previously fasting glucose-implicated genes were identified and one novel locus was discovered. Thus, these results suggest that transethnic meta-analysis can help in transforming GWAS results into new biological insight. (PDF 4624 kb) (XLSX 71 kb)
  35 in total

1.  Heritability and familiality of type 2 diabetes and related quantitative traits in the Botnia Study.

Authors:  P Almgren; M Lehtovirta; B Isomaa; L Sarelin; M R Taskinen; V Lyssenko; T Tuomi; L Groop
Journal:  Diabetologia       Date:  2011-08-09       Impact factor: 10.122

2.  Transferability and fine-mapping of glucose and insulin quantitative trait loci across populations: CARe, the Candidate Gene Association Resource.

Authors:  C-T Liu; M C Y Ng; D Rybin; A Adeyemo; S J Bielinski; E Boerwinkle; I Borecki; B Cade; Y D I Chen; L Djousse; M Fornage; M O Goodarzi; S F A Grant; X Guo; T Harris; E Kabagambe; J R Kizer; Y Liu; K L Lunetta; K Mukamal; J A Nettleton; J S Pankow; S R Patel; E Ramos; L Rasmussen-Torvik; S S Rich; C N Rotimi; D Sarpong; D Shriner; M Sims; J M Zmuda; S Redline; W H Kao; D Siscovick; J C Florez; J I Rotter; J Dupuis; J G Wilson; D W Bowden; J B Meigs
Journal:  Diabetologia       Date:  2012-08-16       Impact factor: 10.122

3.  Multiple functional polymorphisms in the G6PC2 gene contribute to the association with higher fasting plasma glucose levels.

Authors:  D A Baerenwald; A Bonnefond; N Bouatia-Naji; B P Flemming; O C Umunakwe; J K Oeser; L D Pound; N L Conley; S Cauchi; S Lobbens; E Eury; B Balkau; O Lantieri; P K Dadi; D A Jacobson; P Froguel; R M O'Brien
Journal:  Diabetologia       Date:  2013-03-19       Impact factor: 10.122

4.  Permanent alteration of PCSK9 with in vivo CRISPR-Cas9 genome editing.

Authors:  Qiurong Ding; Alanna Strong; Kevin M Patel; Sze-Ling Ng; Bridget S Gosis; Stephanie N Regan; Chad A Cowan; Daniel J Rader; Kiran Musunuru
Journal:  Circ Res       Date:  2014-06-10       Impact factor: 17.367

5.  Gene-age interactions in blood pressure regulation: a large-scale investigation with the CHARGE, Global BPgen, and ICBP Consortia.

Authors:  Jeannette Simino; Gang Shi; Joshua C Bis; Daniel I Chasman; Georg B Ehret; Xiangjun Gu; Xiuqing Guo; Shih-Jen Hwang; Eric Sijbrands; Albert V Smith; Germaine C Verwoert; Jennifer L Bragg-Gresham; Gemma Cadby; Peng Chen; Ching-Yu Cheng; Tanguy Corre; Rudolf A de Boer; Anuj Goel; Toby Johnson; Chiea-Chuen Khor; Carla Lluís-Ganella; Jian'an Luan; Leo-Pekka Lyytikäinen; Ilja M Nolte; Xueling Sim; Siim Sõber; Peter J van der Most; Niek Verweij; Jing Hua Zhao; Najaf Amin; Eric Boerwinkle; Claude Bouchard; Abbas Dehghan; Gudny Eiriksdottir; Roberto Elosua; Oscar H Franco; Christian Gieger; Tamara B Harris; Serge Hercberg; Albert Hofman; Alan L James; Andrew D Johnson; Mika Kähönen; Kay-Tee Khaw; Zoltan Kutalik; Martin G Larson; Lenore J Launer; Guo Li; Jianjun Liu; Kiang Liu; Alanna C Morrison; Gerjan Navis; Rick Twee-Hee Ong; George J Papanicolau; Brenda W Penninx; Bruce M Psaty; Leslie J Raffel; Olli T Raitakari; Kenneth Rice; Fernando Rivadeneira; Lynda M Rose; Serena Sanna; Robert A Scott; David S Siscovick; Ronald P Stolk; Andre G Uitterlinden; Dhananjay Vaidya; Melanie M van der Klauw; Ramachandran S Vasan; Eranga Nishanthie Vithana; Uwe Völker; Henry Völzke; Hugh Watkins; Terri L Young; Tin Aung; Murielle Bochud; Martin Farrall; Catharina A Hartman; Maris Laan; Edward G Lakatta; Terho Lehtimäki; Ruth J F Loos; Gavin Lucas; Pierre Meneton; Lyle J Palmer; Rainer Rettig; Harold Snieder; E Shyong Tai; Yik-Ying Teo; Pim van der Harst; Nicholas J Wareham; Cisca Wijmenga; Tien Yin Wong; Myriam Fornage; Vilmundur Gudnason; Daniel Levy; Walter Palmas; Paul M Ridker; Jerome I Rotter; Cornelia M van Duijn; Jacqueline C M Witteman; Aravinda Chakravarti; Dabeeru C Rao
Journal:  Am J Hum Genet       Date:  2014-06-19       Impact factor: 11.025

6.  The Next PAGE in understanding complex traits: design for the analysis of Population Architecture Using Genetics and Epidemiology (PAGE) Study.

Authors:  Tara C Matise; Jose Luis Ambite; Steven Buyske; Christopher S Carlson; Shelley A Cole; Dana C Crawford; Christopher A Haiman; Gerardo Heiss; Charles Kooperberg; Loic Le Marchand; Teri A Manolio; Kari E North; Ulrike Peters; Marylyn D Ritchie; Lucia A Hindorff; Jonathan L Haines
Journal:  Am J Epidemiol       Date:  2011-08-11       Impact factor: 4.897

7.  An integrated map of genetic variation from 1,092 human genomes.

Authors:  Goncalo R Abecasis; Adam Auton; Lisa D Brooks; Mark A DePristo; Richard M Durbin; Robert E Handsaker; Hyun Min Kang; Gabor T Marth; Gil A McVean
Journal:  Nature       Date:  2012-11-01       Impact factor: 49.962

8.  The metabochip, a custom genotyping array for genetic studies of metabolic, cardiovascular, and anthropometric traits.

Authors:  Benjamin F Voight; Hyun Min Kang; Jun Ding; Cameron D Palmer; Carlo Sidore; Peter S Chines; Noël P Burtt; Christian Fuchsberger; Yanming Li; Jeanette Erdmann; Timothy M Frayling; Iris M Heid; Anne U Jackson; Toby Johnson; Tuomas O Kilpeläinen; Cecilia M Lindgren; Andrew P Morris; Inga Prokopenko; Joshua C Randall; Richa Saxena; Nicole Soranzo; Elizabeth K Speliotes; Tanya M Teslovich; Eleanor Wheeler; Jared Maguire; Melissa Parkin; Simon Potter; N William Rayner; Neil Robertson; Kathleen Stirrups; Wendy Winckler; Serena Sanna; Antonella Mulas; Ramaiah Nagaraja; Francesco Cucca; Inês Barroso; Panos Deloukas; Ruth J F Loos; Sekar Kathiresan; Patricia B Munroe; Christopher Newton-Cheh; Arne Pfeufer; Nilesh J Samani; Heribert Schunkert; Joel N Hirschhorn; David Altshuler; Mark I McCarthy; Gonçalo R Abecasis; Michael Boehnke
Journal:  PLoS Genet       Date:  2012-08-02       Impact factor: 5.917

9.  Transferability and fine mapping of type 2 diabetes loci in African Americans: the Candidate Gene Association Resource Plus Study.

Authors:  Maggie C Y Ng; Richa Saxena; Jiang Li; Nicholette D Palmer; Latchezar Dimitrov; Jianzhao Xu; Laura J Rasmussen-Torvik; Joseph M Zmuda; David S Siscovick; Sanjay R Patel; Errol D Crook; Mario Sims; Yii-Der I Chen; Alain G Bertoni; Mingyao Li; Struan F A Grant; Josée Dupuis; James B Meigs; Bruce M Psaty; James S Pankow; Carl D Langefeld; Barry I Freedman; Jerome I Rotter; James G Wilson; Donald W Bowden
Journal:  Diabetes       Date:  2012-11-27       Impact factor: 9.461

10.  Integrative analysis of 111 reference human epigenomes.

Authors:  Anshul Kundaje; Wouter Meuleman; Jason Ernst; Misha Bilenky; Angela Yen; Alireza Heravi-Moussavi; Pouya Kheradpour; Zhizhuo Zhang; Jianrong Wang; Michael J Ziller; Viren Amin; John W Whitaker; Matthew D Schultz; Lucas D Ward; Abhishek Sarkar; Gerald Quon; Richard S Sandstrom; Matthew L Eaton; Yi-Chieh Wu; Andreas R Pfenning; Xinchen Wang; Melina Claussnitzer; Yaping Liu; Cristian Coarfa; R Alan Harris; Noam Shoresh; Charles B Epstein; Elizabeta Gjoneska; Danny Leung; Wei Xie; R David Hawkins; Ryan Lister; Chibo Hong; Philippe Gascard; Andrew J Mungall; Richard Moore; Eric Chuah; Angela Tam; Theresa K Canfield; R Scott Hansen; Rajinder Kaul; Peter J Sabo; Mukul S Bansal; Annaick Carles; Jesse R Dixon; Kai-How Farh; Soheil Feizi; Rosa Karlic; Ah-Ram Kim; Ashwinikumar Kulkarni; Daofeng Li; Rebecca Lowdon; GiNell Elliott; Tim R Mercer; Shane J Neph; Vitor Onuchic; Paz Polak; Nisha Rajagopal; Pradipta Ray; Richard C Sallari; Kyle T Siebenthall; Nicholas A Sinnott-Armstrong; Michael Stevens; Robert E Thurman; Jie Wu; Bo Zhang; Xin Zhou; Arthur E Beaudet; Laurie A Boyer; Philip L De Jager; Peggy J Farnham; Susan J Fisher; David Haussler; Steven J M Jones; Wei Li; Marco A Marra; Michael T McManus; Shamil Sunyaev; James A Thomson; Thea D Tlsty; Li-Huei Tsai; Wei Wang; Robert A Waterland; Michael Q Zhang; Lisa H Chadwick; Bradley E Bernstein; Joseph F Costello; Joseph R Ecker; Martin Hirst; Alexander Meissner; Aleksandar Milosavljevic; Bing Ren; John A Stamatoyannopoulos; Ting Wang; Manolis Kellis
Journal:  Nature       Date:  2015-02-19       Impact factor: 69.504

View more
  14 in total

Review 1.  Evaluating the promise of inclusion of African ancestry populations in genomics.

Authors:  Amy R Bentley; Shawneequa L Callier; Charles N Rotimi
Journal:  NPJ Genom Med       Date:  2020-02-25       Impact factor: 8.617

Review 2.  The Future of Genomic Studies Must Be Globally Representative: Perspectives from PAGE.

Authors:  Stephanie A Bien; Genevieve L Wojcik; Chani J Hodonsky; Christopher R Gignoux; Iona Cheng; Tara C Matise; Ulrike Peters; Eimear E Kenny; Kari E North
Journal:  Annu Rev Genomics Hum Genet       Date:  2019-04-12       Impact factor: 8.929

Review 3.  Insights from population-based analyses of plasma lipids across the allele frequency spectrum.

Authors:  Gina M Peloso; Pradeep Natarajan
Journal:  Curr Opin Genet Dev       Date:  2018-02-13       Impact factor: 5.578

4.  Bivariate Genome-Wide Association Scan Identifies 6 Novel Loci Associated With Lipid Levels and Coronary Artery Disease.

Authors:  Katherine M Siewert; Benjamin F Voight
Journal:  Circ Genom Precis Med       Date:  2018-12

5.  Multi-Omic Approaches to Identify Genetic Factors in Metabolic Syndrome.

Authors:  Karen C Clark; Anne E Kwitek
Journal:  Compr Physiol       Date:  2021-12-29       Impact factor: 8.915

6.  Multi-ethnic GWAS and fine-mapping of glycaemic traits identify novel loci in the PAGE Study.

Authors:  Carolina G Downie; Sofia F Dimos; Stephanie A Bien; Yao Hu; Burcu F Darst; Linda M Polfus; Yujie Wang; Genevieve L Wojcik; Ran Tao; Laura M Raffield; Nicole D Armstrong; Hannah G Polikowsky; Jennifer E Below; Adolfo Correa; Marguerite R Irvin; Laura J F Rasmussen-Torvik; Christopher S Carlson; Lawrence S Phillips; Simin Liu; James S Pankow; Stephen S Rich; Jerome I Rotter; Steven Buyske; Tara C Matise; Kari E North; Christy L Avery; Christopher A Haiman; Ruth J F Loos; Charles Kooperberg; Mariaelisa Graff; Heather M Highland
Journal:  Diabetologia       Date:  2021-12-24       Impact factor: 10.460

7.  Candidate Gene and Genome-Wide Association Studies for Circulating Leptin Levels Reveal Population and Sex-Specific Associations in High Cardiovascular Risk Mediterranean Subjects.

Authors:  Carolina Ortega-Azorín; Oscar Coltell; Eva M Asensio; Jose V Sorlí; José I González; Olga Portolés; Carmen Saiz; Ramon Estruch; Judith B Ramírez-Sabio; Alejandro Pérez-Fidalgo; Jose M Ordovas; Dolores Corella
Journal:  Nutrients       Date:  2019-11-13       Impact factor: 5.717

8.  Association of diabetes-related variants in ADCY5 and CDKAL1 with neonatal insulin, C-peptide, and birth weight.

Authors:  Ivette-Guadalupe Aguilera-Venegas; Julia-Del-Socorro Mora-Peña; Marion Velazquez-Villafaña; Martha-Isabel Gonzalez-Dominguez; Gloria Barbosa-Sabanero; Hector-Manuel Gomez-Zapata; Maria-Luisa Lazo-de-la-Vega-Monroy
Journal:  Endocrine       Date:  2021-06-24       Impact factor: 3.633

9.  Genome-wide association study identifying novel variant for fasting insulin and allelic heterogeneity in known glycemic loci in Chilean adolescents: The Santiago Longitudinal Study.

Authors:  Victoria L Buchanan; Yujie Wang; Estela Blanco; Mariaelisa Graff; Cecilia Albala; Raquel Burrows; José L Santos; Bárbara Angel; Betsy Lozoff; Venkata Saroja Voruganti; Xiuqing Guo; Kent D Taylor; Yii-Der Ida Chen; Jie Yao; Jingyi Tan; Carolina Downie; Heather M Highland; Anne E Justice; Sheila Gahagan; Kari E North
Journal:  Pediatr Obes       Date:  2020-12-30       Impact factor: 4.000

Review 10.  Evaluating the promise of inclusion of African ancestry populations in genomics.

Authors:  Amy R Bentley; Shawneequa L Callier; Charles N Rotimi
Journal:  NPJ Genom Med       Date:  2020-02-25       Impact factor: 8.617

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.