Literature DB >> 25393876

Identification of allelic heterogeneity at type-2 diabetes loci and impact on prediction.

Yann C Klimentidis1, Jin Zhou1, Nathan E Wineinger2.   

Abstract

Although over 60 single nucleotide polymorphisms (SNPs) have been identified by meta-analysis of genome-wide association studies for type-2 diabetes (T2D) among individuals of European descent, much of the genetic variation remains unexplained. There are likely many more SNPs that contribute to variation in T2D risk, some of which may lie in the regions surrounding established SNPs--a phenomenon often referred to as allelic heterogeneity. Here, we use the summary statistics from the DIAGRAM consortium meta-analysis of T2D genome-wide association studies along with linkage disequilibrium patterns inferred from a large reference sample to identify novel SNPs associated with T2D surrounding each of the previously established risk loci. We then examine the extent to which the use of these additional SNPs improves prediction of T2D risk in an independent validation dataset. Our results suggest that multiple SNPs at each of 3 loci contribute to T2D susceptibility (TCF7L2, CDKN2A/B, and KCNQ1; p<5×10(-8)). Using a less stringent threshold (p<5×10(-4)), we identify 34 additional loci with multiple associated SNPs. The addition of these SNPs slightly improves T2D prediction compared to the use of only the respective lead SNPs, when assessed using an independent validation cohort. Our findings suggest that some currently established T2D risk loci likely harbor multiple polymorphisms which contribute independently and collectively to T2D risk. This opens a promising avenue for improving prediction of T2D, and for a better understanding of the genetic architecture of T2D.

Entities:  

Mesh:

Year:  2014        PMID: 25393876      PMCID: PMC4231111          DOI: 10.1371/journal.pone.0113072

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Approximately 65 loci have been shown to be associated with type-2 diabetes (T2D) through genome-wide association studies (GWAS). However, variation at these loci accounts for a small proportion of the expected heritability of T2D [1], [2]. Among several potential strategies for identifying additional contributing genetic variation, one approach is to determine whether there are additional genetic markers near established loci that act independently or jointly with the reported marker (lead SNP). Allelic heterogeneity is a feature of the genetic architecture of many traits, including common traits and diseases such as height, BMI, and T2D [3]–[6]. In the context of T2D genetics, both Morris et al. [2] and Yang et al. [7] have suggested that additional SNPs in established loci are associated with T2D risk. However, Morris et al. only considered SNPs in weak linkage disequilibrium (r2<0.05) with the lead SNP, and that were not in the same recombination interval. Hence, without formal conditional analysis, they identified two loci as having multiple associations at genome-wide significance (KCNQ1 and CDKN2A/B), and two more at suggestive levels (DGKB and MC4R). Yang et al. have recently developed a method for identifying additional associated SNPs based on conditional/joint (C/J) analysis using GWAS summary statistics and linkage disequilibrium (LD) information from a reference sample [7]. They applied their method to only a single established T2D locus (CDKN2A/B), and identified two novel SNPs at that locus that were significantly associated with T2D when fitted jointly. Finally, on a smaller scale (1,924 cases and 5,380 controls), Ke [8] identified multiple associated loci at the CDKN2A/B and TSPAN8 loci. Although higher power is afforded with the GWAS meta-analysis approach to identify associations with single SNPs, it does not allow for direct C/J analysis since the actual genotype data is not available. The advantage of the method developed by Yang et al. is that it takes advantage of the greater power of GWAS meta-analyses, while also testing for C/J associations, which would otherwise be impossible without individual level data. Here, we comprehensively examine allelic heterogeneity based on the method developed by Yang et al. at 65 T2D loci discovered by the DIAGRAM consortium, using the summary statistics from their recent meta-analysis of T2D GWAS. We then examine the extent to which these newly identified SNPs increase the accuracy of T2D risk prediction in an independent validation dataset.

Methods

Datasets

We used 6,054 nominally unrelated European-American subjects (genomic relationship coefficient <0.025, based on approximately 2.5 million SNPs) from the Atherosclerosis Risk in Communities (ARIC) study [9] to obtain linkage disequilibrium (LD) estimates. According to Yang et al. [7], this sample size is sufficient for LD estimation with minimal error. In order to maximize the overlap of SNPs between the meta-analysis summary statistics (see below) and the ARIC study, we used IMPUTE2 software [10] along with 1000 Genomes reference data to impute millions of additional SNPs. Prior to imputation, we excluded individuals with a high genotype missing rate (>10%). SNPs were excluded based on extreme minor allele frequency (<0.5%), a high missing rate (>10%), or failed Hardy-Weinberg equilibrium (p<0.005). After imputation, we excluded SNPs with ‘info’ <0.6 (measure of imputation quality), and SNPs with genotype dosage between 0.33 and 0.66, or between 1.33 and 1.66. Intermediate dosages outside of these specified ranges were rounded to the nearest integer. We did not use intermediate genotype dosages since this was not an option with the GCTA software, described below. The validation dataset consisted of European-American subjects from the Multi-Ethnic Study of Atherosclerosis (MESA) [11], which included 225 T2D cases and 1,985 controls. T2D cases were defined as having a fasting glucose level ≥126 mg/dL, a self-report of taking diabetes medication, or a physician diagnosis of T2D. This dataset can be considered an independent validation dataset since it was not part of the DIAGRAM meta-analysis, whereas ARIC was a part of this meta-analysis, thus precluding it from any validation assessment. We implemented genotype QC and imputation as detailed above. However, we did not round or remove intermediate genotype dosages. The MESA and ARIC dataset were obtained from dbGaP (database of Genotypes and Phenotypes). IRB approval was obtained from the University of Arizona.

Conditional/Joint Analysis

Using the summary statistics from the discovery phase of the latest version (v3) of the DIAbetes Genetics Replication And Meta-analysis (DIAGRAM) consortium, available to the public through an online source [12] and LD estimates from the ARIC dataset as described above, we used GCTA software [13] to perform stepwise model selection. Briefly, SNPs were selected into the model based on p-values in the meta-analysis. An iterative scheme was adopted in which C/J analyses were alternatively performed with the stepwise selection procedure. SNPs with a re-estimated (i.e. through C/J estimation as opposed to marginal estimation) p-value under a certain threshold were selected. For a full description of the method, see Yang et al. [7]. We restricted our analysis to only the genomic regions within 1 Mb of the top SNP at the 65 established T2D loci as reported in Morris et al [2]. This filtering along with the QC filtering described above resulted in 112,329 SNPs being used in this analysis. For each SNP, we recorded the following information as input for the C/J analysis: effect allele, effect size (log of odds ratio), corresponding standard error, p-value, allele frequency of the effect allele (based on ARIC sample described above, as this was not available in the DIAGRAM summary statistic file), and sample size (sum of cases and controls). We used PolyPhen-2 [14] to determine whether any of the newly identified SNPs had any predicted functional effect, and RegulomeDB [15] to determine whether these SNPs may lie in regulatory regions (e.g. transcription factor binding sites) or are associated with specific DNA features (e.g. DNAse sensitivity).

Validation/Prediction

We compared several prediction models. First we constructed a baseline model which only considered demographic information (sex and age). Then we added a weighted genetic risk score [16] based on only the set of lead SNPs with weights corresponding to the log odds ratios according to the DIAGRAM meta-analysis summary statistics. Lead SNPs were defined as those that had the lowest p-value in the respective 2 Mb region according to the DIAGRAM Stage 1 meta-analysis summary statistics. We then considered a weighted genetic risk score based on all SNPs identified by the C/J analysis with weights corresponding to the coefficients estimated from the C/J analysis. We conducted the above analyses at the following p-value thresholds based on the C/J results: 5×10−8, 5×10−7, 5×10−6, 5×10−5, and 5×10−4. We examined the proportion of variance explained by these additional SNPs by calculating the variance explained on the liability scale, estimated through the odds ratios and allele frequencies of the SNPs, and assuming a disease prevalence of 10%, using the Mangrove package [17] in R [18]. We also calculated Nagelkerke's R2 [19] of the models which include age and sex and each of the GRS, using the fmsb package [20] in R, and report the Akaike information criterion (AIC) [21] for each of these models. Prediction accuracy was estimated using the area under the receiver operating characteristic curve (AUC) as implemented in the pROC package [22] in R. Differences in AUC among models were compared by examining the change in AUC (ΔAUC) and assessed using the DeLong test [23] to determine statistical significance.

Results

Conditional/Joint analysis

We identified novel genome-wide significant (p<5×10−8) SNPs in the C/J analysis at the three following loci: TCF7L2, CDKN2A/B, and KCNQ1 (see Table 1). In the TCF7L2 region, we identified three SNPs (rs7917983, rs17747324, rs12266632) within a 32 kb region. The lead SNP (rs4506565) was not selected in this model, but is positioned in this region and is in moderate LD with each of the novel findings (r2 between 0.18 and 0.70). For each of these novel findings, the marginal effect sizes and p-values in the meta-analysis are similar to those estimated in the C/J analysis. By relaxing the p-value threshold to p<5×10−4, we discovered an additional SNP in this region (rs10128255). In the CDKN2A/B region, the lead SNP (rs2383208) was not selected in the C/J analysis. Instead, two SNPs (rs10757282 and rs10811661) approximately 1.9 kb downstream of the lead SNP were discovered. These SNPs are only 110 bases apart. rs10757282 is in relatively low LD with the lead SNP (r2 = 0.29). However, rs10811661 is in high LD with the lead SNP (r2 = 0.94). It should be noted that the correlation between the respective risk alleles is negative (r = −0.54), suggesting that estimates obtained through the single marker association were underestimated for both SNPs, as evidenced by the larger effect sizes and lower p-values estimated in the C/J analysis compared to the meta-analysis marginal association results (see Table 1). In the KCNQ1 region, we identified two SNPs in the C/J analysis (p<5×10−8). One SNP (rs462402) is in moderate LD with the lead SNP, rs231362 (r2 = 0.45). The other SNP (rs163177) is approximately 121 kb upstream and is not in LD with the lead SNP (r2<0.01). By relaxing the p-value threshold to p<5×10−6, we identified additional novel discoveries in the DGKB and TP53INP1 genes. Continuing to relax this threshold, we identify 17 (p<5×10−5) (see Table 1) and 34 (p<5×10−4) regions with multiple associated SNPs. According to PolyPhen, none of the SNPs identified through C/J analysis had any predicted functional effect. According to our query in RegulomeDB, rs387769 near HNF4A shows evidence of being linked to expression of a gene target, affecting binding of a transcription factor, and shows evidence of a DNase footprint. SNP rs7176681 near ZFAND6 also displays evidence of being a transcription factor binding site, and evidence of a DNase footprint. SNP rs17168486 in DGKB shows evidence of transcription factor binding and a DNase peak. Several other SNPs show evidence of transcription factor binding or a DNase peak (see Table 1).
Table 1

SNPs identified by conditional/joint analysis with p-value <5×10−5, and corresponding evidence of regulatory function from RegulomeDB.

Gene regionChrSNPbprefAfreqbsepnbJbJ_sepJLD (r)RegulomeDB score
BCL11A 2rs219251259854224C0.4750.0580.0146.50E-041105170.0600.0142.75E-05−0.017
2rs24301960439310C0.4510.0860.0142.70E-061176020.0870.0143.24E-100.0005
IGF2BP2 3rs12233623186712890C0.5480.0580.0141.00E-031112510.0650.0145.49E-06−0.074
3rs6767484187003272G0.3050.1220.0183.90E-1083684.80.1270.0187.89E-130.0005
ANKRD55 5rs968666155897543T0.1890.0950.0239.20E-0570923.60.1000.0231.05E-05−0.0634
5rs189545256096152T0.6350.0580.0143.10E-031188580.0610.0141.69E-050.000
ZBED3 5rs1252261875491695G0.4250.0580.0143.70E-031127540.0580.0144.99E-050.009
5rs770828576461623G0.3000.1220.0221.10E-0654523.40.1220.0223.70E-080.000
DGKB 7rs1028210113867616T0.5010.0770.0197.30E-0564914.80.0770.0193.27E-05−0.011
7rs1716848614864807T0.1820.1220.0226.90E-07768120.1260.0221.35E-08−0.0273a
7rs197462015031992T0.5310.0680.0141.00E-041127230.0690.0141.06E-060.000
TP53INP1 8rs473533796042641T0.4900.0770.0146.90E-051144120.0780.0142.01E-08−0.0225
8rs699174296533562T0.5960.0680.0141.80E-041165760.0690.0148.77E-070.0005
CDKN2A/B 9rs56439822019547T0.5790.0770.0141.70E-051172680.0590.0142.48E-050.102
9rs1075728222123984C0.4300.0680.0198.80E-0465000.90.1630.0211.92E-14−0.5365
9rs1081166122124094T0.8250.1660.0211.50E-1386410.90.2470.0241.08E-240.000
ZMIZ1 10rs391593280611942G0.5920.0950.0184.70E-0869629.20.0980.0187.10E-08−0.0405
10rs648094780906216G0.1560.1130.0272.20E-0459743.80.1190.0278.58E-060.000
TCF7L2 10rs7917983114722872C0.4790.1480.0131.50E-171318380.0820.0142.17E-090.344
10rs17747324114742493C0.2370.3580.0218.50E-5570086.80.3410.0223.07E-54−0.126
10rs12266632114754949G0.0630.2550.0428.50E-1153996.40.2900.0431.12E-110.0005
KCNQ1 11rs4624022673869C0.5070.0770.0141.10E-051143910.0760.0146.49E-08−0.002
11rs1631772794989C0.5060.0770.0144.80E-051143820.0750.0149.55E-080.0405
11rs4510413017301G0.4960.0680.0148.20E-051123030.0630.0148.89E-060.0005
KCNJ11 11rs792881017329019C0.3960.0680.0141.30E-041174230.0680.0141.50E-06−0.004
11rs75798417576484T0.8030.0770.0194.90E-041025110.0770.0193.09E-050.000
MTNR1B 11rs1083096292338075G0.4080.1040.0141.50E-081249660.1010.0141.61E-130.069
11rs53157392444531C0.1820.0860.0181.70E-041111500.0770.0182.75E-050.000
TSPAN8 12rs1117853169694957A0.4490.0770.0149.20E-061155650.0590.0144.73E-050.271
12rs153310469942814T0.3380.0860.0142.80E-061301150.0720.0144.26E-070.000
SPRY2 13rs161654778884037C0.5120.0580.0143.00E-031103080.0580.0144.84E-050.006
13rs132731679607064G0.7170.0950.0189.40E-0782902.80.0950.0181.92E-070.000
C2CD4A 15rs649430760181982C0.5720.0770.0141.80E-051167970.0770.0143.65E-08−0.001
15rs245693660502334C0.7850.0770.0191.10E-0396195.20.0770.0193.32E-050.000
ZFAND6 15rs717668177714599C0.7370.0860.0181.20E-0485321.90.0920.0185.42E-07−0.0102a
15rs135733578155918A0.6840.0860.0181.60E-0576497.60.0860.0182.92E-06−0.0065
15rs384817478531707T0.3800.0580.0143.80E-031170130.0610.0141.78E-050.0005
HNF4A 20rs38776941745269C0.8810.1130.0271.40E-0575082.10.1130.0272.34E-050.0081f
20rs607370843386291A0.5620.0580.0142.40E-031119490.0580.0144.81E-050.0005

Abbreviations: Chr: chromosome, bp: base pair position, refA: reference allele, freq: frequency of the risk allele, b: regression coefficient from meta-analysis summary statistics, se: standard error from meta-analysis summary statistics, p: p-value from meta-analysis summary statistics, n: sample size in meta-analysis, bJ: regression coefficient estimated from conditional/joint analysis, bJ_se: standard error estimated from conditional/joint analysis, pJ: p-value from estimated from conditional/joint analysis, LD (r): linkage disequilibrium between corresponding SNP and the following SNP at the same locus.

(1f: eQTL+transcription factor (TF) binding/DNase peak; 2a: TF binding+matched TF motif+matched DNase Footprint+DNase peak; 3a: TF binding+any motif+DNase peak; 5: TF binding or DNase peak).

Abbreviations: Chr: chromosome, bp: base pair position, refA: reference allele, freq: frequency of the risk allele, b: regression coefficient from meta-analysis summary statistics, se: standard error from meta-analysis summary statistics, p: p-value from meta-analysis summary statistics, n: sample size in meta-analysis, bJ: regression coefficient estimated from conditional/joint analysis, bJ_se: standard error estimated from conditional/joint analysis, pJ: p-value from estimated from conditional/joint analysis, LD (r): linkage disequilibrium between corresponding SNP and the following SNP at the same locus. (1f: eQTL+transcription factor (TF) binding/DNase peak; 2a: TF binding+matched TF motif+matched DNase Footprint+DNase peak; 3a: TF binding+any motif+DNase peak; 5: TF binding or DNase peak). The AUC of the baseline prediction model which included only sex and age was 0.5702. For each of the three loci with additional SNPs that were significant at the p<5×10−8 threshold (TCF7L2, CDKN2A/B, and KCNQ1), the inclusion of the SNPs identified by the C/J analysis resulted in a higher AUC than a model including only the lead SNP (although not statistically significant) in all regions except for KCNQ1 (see Figure 1).
Figure 1

Prediction accuracy in MESA at 3 loci with additional detected SNPs at the 5×10−8 threshold.

Considering all three loci with additional SNPs at the p<5×10−8 threshold collectively, we found that the use of the seven SNPs identified by the C/J analysis resulted in a slightly higher AUC (0.5979) than when using only the three lead SNPs (0.5803). This represents a doubling in ΔAUC over the age+ sex model (see Figure 2), although this difference is not quite statistically significant (p = 0.055), according to the DeLong test. The inclusion of all SNPs (lead and from C/J analysis) results in a statistically significant (p = 0.049), yet small, increase in AUC (see Figure 2). At the p<5×10−6 threshold, the use of 11 SNPs at 5 loci (TCF7L2, CDKN2A/B, KCNQ1, DGKB and TP53INP1), slightly, but not significantly, increased prediction accuracy (AUC = 0.5885) over a model considering only the corresponding 5 lead SNPs (AUC = 0.5779; p = 0.126). At the p<5×10−5 threshold, we observe a small increase in prediction accuracy when using the 39 SNPs identified by the C/J analysis instead of the corresponding 17 lead SNPs (AUC = 0.5892 vs. 0.5724; p = 0.079). Finally, at the p<5×10−4 threshold, the use of 120 SNPs identified by the C/J analysis and the lead SNPs results in a slightly higher and nearly statistically significant increase in AUC over that of a model which includes only the 34 lead SNPs at the corresponding loci (AUC = 0.5965 vs. 0.5858; p = 0.067).
Figure 2

Prediction accuracy in MESA using lead SNPs vs. SNPs identified in C/J analysis at different p-value thresholds.

Table 2 shows the proportion of variance explained by the additional SNPs identified by the C/J analysis. For each of the three loci, the SNPs identified by the C/J analysis explained slightly more of the variance in T2D risk than the lead SNP. Similarly, for the collection of SNPs identified by the C/J analysis at various p-value thresholds, we observe an increase in the proportion of T2D variance explained by the SNPs and the GRSs, along with decreasing AIC values.
Table 2

Variance explained at various p-value thresholds in the MESA validation dataset by the collection of individual SNPs on the liability scale, variance explained by, and model fit of, the weighted GRS, using Nagelkerke's R2, and AIC, respectively.

Liability-scale varianceNagelkerke R2 AIC
age+sex 0.01151448.1
TCF7L2 1 Lead SNP0.00970.01781443.3
3 C/J SNPs0.01890.02121439.7
4 Lead+C/J SNPs0.02820.01981441.1
CDKN2A/B 1 Lead SNP0.00040.01181449.7
2 C/J SNPs0.00250.01271448.8
2 Lead+C/J SNPs0.00290.01171449.8
KCNQ1 1 Lead SNP3.00E-50.01151450
2 C/J SNPs0.00290.01221449.3
3 Lead+C/J SNPs0.00140.01181449.8
<5.00E-08 3 Lead SNPs0.01010.01531446
7 GCTA SNPs0.02250.02301437.8
10 Lead+C/J SNPs0.03220.01971441.2
<5.00E-06 5 Lead SNPs0.01300.01341448.1
11 GCTA SNPs0.02580.01901442.1
16 Lead+C/J SNPs0.03810.01641444.8
<5.00E-05 17 Lead SNPs0.02770.01341448
39 GCTA SNPs0.06480.02091440
55 Lead+C/J SNPs0.08650.01711444.1
<5.00E-04 34 Lead SNPs0.06130.01581445.5
91 GCTA SNPs0.14430.02591434.6
119 Lead+C/J SNPs0.18010.01971441.2

Discussion

Our analyses confirm previous findings regarding the allelic heterogeneity present at the CDKN2A/B, KCNQ1, DGKB, and MC4R loci. We provide novel evidence of allelic heterogeneity at genome-wide significance at the TCF7L2 locus. We support our finding in TCF7L2 by showing that the use of the three identified SNPs results in a small increase in AUC (albeit not statistically significant) compared to using the lead TCF7L2 SNP (rs4506565) alone. We observe similar but much weaker trends at the CDKN2A/B and KCNQ1 loci. At less stringent p-value thresholds, we observe additional putatively associated SNPs at up to 34 loci. Considering the collective set of loci in which additional associated SNPs were identified through C/J analysis, prediction accuracy appears to slightly improve with the addition of these additional SNPs in our validation dataset. At all p-value thresholds, the ΔAUC over the sex + age model is at least two-fold greater when using the C/J identified SNPs compared to using the lead SNPs alone. The strength of the method developed by Yang et al. is well exemplified by the multiple associated SNPs identified at the TCF7L2 locus, since the use of the three SNPs (which do not include the lead SNP) appears to be more informative than only using the lead SNP, rs4506565. Another example of the strength of this method is the case in which two risk alleles are in negative LD. Without the C/J analysis, the additional SNPs in the CDKN2A/B region would not be identified when analyzed on their own. The main limitation of this method is that associations are not tested directly, but rather through knowledge of marginal associations, and LD patterns in a different dataset (of the same ancestral background). A major limitation of the validation stage of our study is the relatively small sample size which limits the statistical power to detect differences in prediction accuracy between different GRSs. From this perspective, it will be important to continue validating these findings in larger datasets, and to combine actual genotype data across multiple datasets instead of using summary statistics. Furthermore, it will be important to dissect the allelic heterogeneity on a locus-by-locus basis to closely examine the patterns/existence of dependencies and additive or interactive effects. Finally, it will be important to functionally characterize these as well as all GWAS findings to more firmly establish causality and better understand molecular mechanisms leading to T2D. Nevertheless, this approach is clearly promising for a greater understanding of the molecular basis of type-2 diabetes, and potentially for use in risk prediction scores. As additional loci are identified through GWAS, it will be important to systematically identify instances of allelic heterogeneity and to examine the extent to which additional SNPs can help to shed light on the functional basis of genetic variation.
  17 in total

1.  Presence of multiple independent effects in risk loci of common complex human diseases.

Authors:  Xiayi Ke
Journal:  Am J Hum Genet       Date:  2012-07-05       Impact factor: 11.025

2.  Sequence variants in SLC16A11 are a common risk factor for type 2 diabetes in Mexico.

Authors:  Amy L Williams; Suzanne B R Jacobs; Hortensia Moreno-Macías; Alicia Huerta-Chagoya; Claire Churchhouse; Carla Márquez-Luna; Humberto García-Ortíz; María José Gómez-Vázquez; Noël P Burtt; Carlos A Aguilar-Salinas; Clicerio González-Villalpando; Jose C Florez; Lorena Orozco; Christopher A Haiman; Teresa Tusié-Luna; David Altshuler
Journal:  Nature       Date:  2013-12-25       Impact factor: 49.962

3.  Twelve type 2 diabetes susceptibility loci identified through large-scale association analysis.

Authors:  Benjamin F Voight; Laura J Scott; Valgerdur Steinthorsdottir; Andrew P Morris; Christian Dina; Ryan P Welch; Eleftheria Zeggini; Cornelia Huth; Yurii S Aulchenko; Gudmar Thorleifsson; Laura J McCulloch; Teresa Ferreira; Harald Grallert; Najaf Amin; Guanming Wu; Cristen J Willer; Soumya Raychaudhuri; Steve A McCarroll; Claudia Langenberg; Oliver M Hofmann; Josée Dupuis; Lu Qi; Ayellet V Segrè; Mandy van Hoek; Pau Navarro; Kristin Ardlie; Beverley Balkau; Rafn Benediktsson; Amanda J Bennett; Roza Blagieva; Eric Boerwinkle; Lori L Bonnycastle; Kristina Bengtsson Boström; Bert Bravenboer; Suzannah Bumpstead; Noisël P Burtt; Guillaume Charpentier; Peter S Chines; Marilyn Cornelis; David J Couper; Gabe Crawford; Alex S F Doney; Katherine S Elliott; Amanda L Elliott; Michael R Erdos; Caroline S Fox; Christopher S Franklin; Martha Ganser; Christian Gieger; Niels Grarup; Todd Green; Simon Griffin; Christopher J Groves; Candace Guiducci; Samy Hadjadj; Neelam Hassanali; Christian Herder; Bo Isomaa; Anne U Jackson; Paul R V Johnson; Torben Jørgensen; Wen H L Kao; Norman Klopp; Augustine Kong; Peter Kraft; Johanna Kuusisto; Torsten Lauritzen; Man Li; Aloysius Lieverse; Cecilia M Lindgren; Valeriya Lyssenko; Michel Marre; Thomas Meitinger; Kristian Midthjell; Mario A Morken; Narisu Narisu; Peter Nilsson; Katharine R Owen; Felicity Payne; John R B Perry; Ann-Kristin Petersen; Carl Platou; Christine Proença; Inga Prokopenko; Wolfgang Rathmann; N William Rayner; Neil R Robertson; Ghislain Rocheleau; Michael Roden; Michael J Sampson; Richa Saxena; Beverley M Shields; Peter Shrader; Gunnar Sigurdsson; Thomas Sparsø; Klaus Strassburger; Heather M Stringham; Qi Sun; Amy J Swift; Barbara Thorand; Jean Tichet; Tiinamaija Tuomi; Rob M van Dam; Timon W van Haeften; Thijs van Herpt; Jana V van Vliet-Ostaptchouk; G Bragi Walters; Michael N Weedon; Cisca Wijmenga; Jacqueline Witteman; Richard N Bergman; Stephane Cauchi; Francis S Collins; Anna L Gloyn; Ulf Gyllensten; Torben Hansen; Winston A Hide; Graham A Hitman; Albert Hofman; David J Hunter; Kristian Hveem; Markku Laakso; Karen L Mohlke; Andrew D Morris; Colin N A Palmer; Peter P Pramstaller; Igor Rudan; Eric Sijbrands; Lincoln D Stein; Jaakko Tuomilehto; Andre Uitterlinden; Mark Walker; Nicholas J Wareham; Richard M Watanabe; Gonçalo R Abecasis; Bernhard O Boehm; Harry Campbell; Mark J Daly; Andrew T Hattersley; Frank B Hu; James B Meigs; James S Pankow; Oluf Pedersen; H-Erich Wichmann; Inês Barroso; Jose C Florez; Timothy M Frayling; Leif Groop; Rob Sladek; Unnur Thorsteinsdottir; James F Wilson; Thomas Illig; Philippe Froguel; Cornelia M van Duijn; Kari Stefansson; David Altshuler; Michael Boehnke; Mark I McCarthy
Journal:  Nat Genet       Date:  2010-07       Impact factor: 38.330

4.  Predicting functional effect of human missense mutations using PolyPhen-2.

Authors:  Ivan Adzhubei; Daniel M Jordan; Shamil R Sunyaev
Journal:  Curr Protoc Hum Genet       Date:  2013-01

5.  The Atherosclerosis Risk in Communities (ARIC) Study: design and objectives. The ARIC investigators.

Authors: 
Journal:  Am J Epidemiol       Date:  1989-04       Impact factor: 4.897

6.  Multi-Ethnic Study of Atherosclerosis: objectives and design.

Authors:  Diane E Bild; David A Bluemke; Gregory L Burke; Robert Detrano; Ana V Diez Roux; Aaron R Folsom; Philip Greenland; David R Jacob; Richard Kronmal; Kiang Liu; Jennifer Clark Nelson; Daniel O'Leary; Mohammed F Saad; Steven Shea; Moyses Szklo; Russell P Tracy
Journal:  Am J Epidemiol       Date:  2002-11-01       Impact factor: 4.897

7.  Hundreds of variants clustered in genomic loci and biological pathways affect human height.

Authors:  Hana Lango Allen; Karol Estrada; Guillaume Lettre; Sonja I Berndt; Michael N Weedon; Fernando Rivadeneira; Cristen J Willer; Anne U Jackson; Sailaja Vedantam; Soumya Raychaudhuri; Teresa Ferreira; Andrew R Wood; Robert J Weyant; Ayellet V Segrè; Elizabeth K Speliotes; Eleanor Wheeler; Nicole Soranzo; Ju-Hyun Park; Jian Yang; Daniel Gudbjartsson; Nancy L Heard-Costa; Joshua C Randall; Lu Qi; Albert Vernon Smith; Reedik Mägi; Tomi Pastinen; Liming Liang; Iris M Heid; Jian'an Luan; Gudmar Thorleifsson; Thomas W Winkler; Michael E Goddard; Ken Sin Lo; Cameron Palmer; Tsegaselassie Workalemahu; Yurii S Aulchenko; Asa Johansson; M Carola Zillikens; Mary F Feitosa; Tõnu Esko; Toby Johnson; Shamika Ketkar; Peter Kraft; Massimo Mangino; Inga Prokopenko; Devin Absher; Eva Albrecht; Florian Ernst; Nicole L Glazer; Caroline Hayward; Jouke-Jan Hottenga; Kevin B Jacobs; Joshua W Knowles; Zoltán Kutalik; Keri L Monda; Ozren Polasek; Michael Preuss; Nigel W Rayner; Neil R Robertson; Valgerdur Steinthorsdottir; Jonathan P Tyrer; Benjamin F Voight; Fredrik Wiklund; Jianfeng Xu; Jing Hua Zhao; Dale R Nyholt; Niina Pellikka; Markus Perola; John R B Perry; Ida Surakka; Mari-Liis Tammesoo; Elizabeth L Altmaier; Najaf Amin; Thor Aspelund; Tushar Bhangale; Gabrielle Boucher; Daniel I Chasman; Constance Chen; Lachlan Coin; Matthew N Cooper; Anna L Dixon; Quince Gibson; Elin Grundberg; Ke Hao; M Juhani Junttila; Lee M Kaplan; Johannes Kettunen; Inke R König; Tony Kwan; Robert W Lawrence; Douglas F Levinson; Mattias Lorentzon; Barbara McKnight; Andrew P Morris; Martina Müller; Julius Suh Ngwa; Shaun Purcell; Suzanne Rafelt; Rany M Salem; Erika Salvi; Serena Sanna; Jianxin Shi; Ulla Sovio; John R Thompson; Michael C Turchin; Liesbeth Vandenput; Dominique J Verlaan; Veronique Vitart; Charles C White; Andreas Ziegler; Peter Almgren; Anthony J Balmforth; Harry Campbell; Lorena Citterio; Alessandro De Grandi; Anna Dominiczak; Jubao Duan; Paul Elliott; Roberto Elosua; Johan G Eriksson; Nelson B Freimer; Eco J C Geus; Nicola Glorioso; Shen Haiqing; Anna-Liisa Hartikainen; Aki S Havulinna; Andrew A Hicks; Jennie Hui; Wilmar Igl; Thomas Illig; Antti Jula; Eero Kajantie; Tuomas O Kilpeläinen; Markku Koiranen; Ivana Kolcic; Seppo Koskinen; Peter Kovacs; Jaana Laitinen; Jianjun Liu; Marja-Liisa Lokki; Ana Marusic; Andrea Maschio; Thomas Meitinger; Antonella Mulas; Guillaume Paré; Alex N Parker; John F Peden; Astrid Petersmann; Irene Pichler; Kirsi H Pietiläinen; Anneli Pouta; Martin Ridderstråle; Jerome I Rotter; Jennifer G Sambrook; Alan R Sanders; Carsten Oliver Schmidt; Juha Sinisalo; Jan H Smit; Heather M Stringham; G Bragi Walters; Elisabeth Widen; Sarah H Wild; Gonneke Willemsen; Laura Zagato; Lina Zgaga; Paavo Zitting; Helene Alavere; Martin Farrall; Wendy L McArdle; Mari Nelis; Marjolein J Peters; Samuli Ripatti; Joyce B J van Meurs; Katja K Aben; Kristin G Ardlie; Jacques S Beckmann; John P Beilby; Richard N Bergman; Sven Bergmann; Francis S Collins; Daniele Cusi; Martin den Heijer; Gudny Eiriksdottir; Pablo V Gejman; Alistair S Hall; Anders Hamsten; Heikki V Huikuri; Carlos Iribarren; Mika Kähönen; Jaakko Kaprio; Sekar Kathiresan; Lambertus Kiemeney; Thomas Kocher; Lenore J Launer; Terho Lehtimäki; Olle Melander; Tom H Mosley; Arthur W Musk; Markku S Nieminen; Christopher J O'Donnell; Claes Ohlsson; Ben Oostra; Lyle J Palmer; Olli Raitakari; Paul M Ridker; John D Rioux; Aila Rissanen; Carlo Rivolta; Heribert Schunkert; Alan R Shuldiner; David S Siscovick; Michael Stumvoll; Anke Tönjes; Jaakko Tuomilehto; Gert-Jan van Ommen; Jorma Viikari; Andrew C Heath; Nicholas G Martin; Grant W Montgomery; Michael A Province; Manfred Kayser; Alice M Arnold; Larry D Atwood; Eric Boerwinkle; Stephen J Chanock; Panos Deloukas; Christian Gieger; Henrik Grönberg; Per Hall; Andrew T Hattersley; Christian Hengstenberg; Wolfgang Hoffman; G Mark Lathrop; Veikko Salomaa; Stefan Schreiber; Manuela Uda; Dawn Waterworth; Alan F Wright; Themistocles L Assimes; Inês Barroso; Albert Hofman; Karen L Mohlke; Dorret I Boomsma; Mark J Caulfield; L Adrienne Cupples; Jeanette Erdmann; Caroline S Fox; Vilmundur Gudnason; Ulf Gyllensten; Tamara B Harris; Richard B Hayes; Marjo-Riitta Jarvelin; Vincent Mooser; Patricia B Munroe; Willem H Ouwehand; Brenda W Penninx; Peter P Pramstaller; Thomas Quertermous; Igor Rudan; Nilesh J Samani; Timothy D Spector; Henry Völzke; Hugh Watkins; James F Wilson; Leif C Groop; Talin Haritunians; Frank B Hu; Robert C Kaplan; Andres Metspalu; Kari E North; David Schlessinger; Nicholas J Wareham; David J Hunter; Jeffrey R O'Connell; David P Strachan; H-Erich Wichmann; Ingrid B Borecki; Cornelia M van Duijn; Eric E Schadt; Unnur Thorsteinsdottir; Leena Peltonen; André G Uitterlinden; Peter M Visscher; Nilanjan Chatterjee; Ruth J F Loos; Michael Boehnke; Mark I McCarthy; Erik Ingelsson; Cecilia M Lindgren; Gonçalo R Abecasis; Kari Stefansson; Timothy M Frayling; Joel N Hirschhorn
Journal:  Nature       Date:  2010-09-29       Impact factor: 49.962

8.  pROC: an open-source package for R and S+ to analyze and compare ROC curves.

Authors:  Xavier Robin; Natacha Turck; Alexandre Hainard; Natalia Tiberti; Frédérique Lisacek; Jean-Charles Sanchez; Markus Müller
Journal:  BMC Bioinformatics       Date:  2011-03-17       Impact factor: 3.307

9.  Genotype imputation with thousands of genomes.

Authors:  Bryan Howie; Jonathan Marchini; Matthew Stephens
Journal:  G3 (Bethesda)       Date:  2011-11-01       Impact factor: 3.154

10.  Common variants near MC4R are associated with fat mass, weight and risk of obesity.

Authors:  Ruth J F Loos; Cecilia M Lindgren; Shengxu Li; Eleanor Wheeler; Jing Hua Zhao; Inga Prokopenko; Michael Inouye; Rachel M Freathy; Antony P Attwood; Jacques S Beckmann; Sonja I Berndt; Kevin B Jacobs; Stephen J Chanock; Richard B Hayes; Sven Bergmann; Amanda J Bennett; Sheila A Bingham; Murielle Bochud; Morris Brown; Stéphane Cauchi; John M Connell; Cyrus Cooper; George Davey Smith; Ian Day; Christian Dina; Subhajyoti De; Emmanouil T Dermitzakis; Alex S F Doney; Katherine S Elliott; Paul Elliott; David M Evans; I Sadaf Farooqi; Philippe Froguel; Jilur Ghori; Christopher J Groves; Rhian Gwilliam; David Hadley; Alistair S Hall; Andrew T Hattersley; Johannes Hebebrand; Iris M Heid; Claudia Lamina; Christian Gieger; Thomas Illig; Thomas Meitinger; H-Erich Wichmann; Blanca Herrera; Anke Hinney; Sarah E Hunt; Marjo-Riitta Jarvelin; Toby Johnson; Jennifer D M Jolley; Fredrik Karpe; Andrew Keniry; Kay-Tee Khaw; Robert N Luben; Massimo Mangino; Jonathan Marchini; Wendy L McArdle; Ralph McGinnis; David Meyre; Patricia B Munroe; Andrew D Morris; Andrew R Ness; Matthew J Neville; Alexandra C Nica; Ken K Ong; Stephen O'Rahilly; Katharine R Owen; Colin N A Palmer; Konstantinos Papadakis; Simon Potter; Anneli Pouta; Lu Qi; Joshua C Randall; Nigel W Rayner; Susan M Ring; Manjinder S Sandhu; André Scherag; Matthew A Sims; Kijoung Song; Nicole Soranzo; Elizabeth K Speliotes; Holly E Syddall; Sarah A Teichmann; Nicholas J Timpson; Jonathan H Tobias; Manuela Uda; Carla I Ganz Vogel; Chris Wallace; Dawn M Waterworth; Michael N Weedon; Cristen J Willer; Xin Yuan; Eleftheria Zeggini; Joel N Hirschhorn; David P Strachan; Willem H Ouwehand; Mark J Caulfield; Nilesh J Samani; Timothy M Frayling; Peter Vollenweider; Gerard Waeber; Vincent Mooser; Panos Deloukas; Mark I McCarthy; Nicholas J Wareham; Inês Barroso; Kevin B Jacobs; Stephen J Chanock; Richard B Hayes; Claudia Lamina; Christian Gieger; Thomas Illig; Thomas Meitinger; H-Erich Wichmann; Peter Kraft; Susan E Hankinson; David J Hunter; Frank B Hu; Helen N Lyon; Benjamin F Voight; Martin Ridderstrale; Leif Groop; Paul Scheet; Serena Sanna; Goncalo R Abecasis; Giuseppe Albai; Ramaiah Nagaraja; David Schlessinger; Anne U Jackson; Jaakko Tuomilehto; Francis S Collins; Michael Boehnke; Karen L Mohlke
Journal:  Nat Genet       Date:  2008-05-04       Impact factor: 38.330

View more
  2 in total

1.  Genetic-risk assessment of GWAS-derived susceptibility loci for type 2 diabetes in a 10 year follow-up of a population-based cohort study.

Authors:  Min Jin Go; Young Lee; Suyeon Park; Soo Heon Kwak; Bong-Jo Kim; Juyoung Lee
Journal:  J Hum Genet       Date:  2016-07-21       Impact factor: 3.172

2.  Evaluation of DNA variants associated with androgenetic alopecia and their potential to predict male pattern baldness.

Authors:  Magdalena Marcińska; Ewelina Pośpiech; Sarah Abidi; Jeppe Dyrberg Andersen; Margreet van den Berge; Ángel Carracedo; Mayra Eduardoff; Anna Marczakiewicz-Lustig; Niels Morling; Titia Sijen; Małgorzata Skowron; Jens Söchtig; Denise Syndercombe-Court; Natalie Weiler; Peter M Schneider; David Ballard; Claus Børsting; Walther Parson; Chris Phillips; Wojciech Branicki
Journal:  PLoS One       Date:  2015-05-22       Impact factor: 3.240

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.