| Literature DB >> 24982668 |
Suman Kundu1, Raluca Mihaescu1, Catherina M C Meijer1, Rachel Bakker1, A Cecile J W Janssens2.
Abstract
BACKGROUND: There is increasing interest in investigating genetic risk models in empirical studies, but such studies are premature when the expected predictive ability of the risk model is low. We assessed how accurately the predictive ability of genetic risk models can be estimated in simulated data that are created based on the odds ratios (ORs) and frequencies of single-nucleotide polymorphisms (SNPs) obtained from genome-wide association studies (GWASs).Entities:
Keywords: AUC; GWAS; genetic; modeling; predictive ability; risk prediction
Year: 2014 PMID: 24982668 PMCID: PMC4056181 DOI: 10.3389/fgene.2014.00179
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
AUC values in published prediction studies and their estimates in simulated data.
| Published prediction study | Simulated data | |||||
|---|---|---|---|---|---|---|
| Disease | First author, year | Study design | Sample size | Number of SNPs | AUC | AUC |
| Crohn disease | Peter et al., | Case-control | 872 | 7 | 0.70 | 0.67 |
| Prostate cancer | Johansson et al., | Case-control | 1508 | 33 | 0.64 | 0.67 |
| Type 1 diabetes | Yamashita et al., | Case-control | 1743 | 7 | 0.65 | 0.64 |
| Type 2 diabetes | Lin et al., | Cross-sectional | 5360 | 15 | 0.57 | 0.59 |
| Type 2 diabetes | Qi et al., | Prospective cohort | 3210 | 17 | 0.62 | 0.60 |
| Type 2 diabetes | Van Hoek et al., | Prospective cohort | 6544 | 18 | 0.56 | 0.60 |
| Type 2 diabetes | Meigs et al., | Prospective cohort | 2377 | 18 | 0.58 | 0.59 |
| Type 2 diabetes | Wang et al., | Cross-sectional | 7232 | 19 | 0.55 | 0.60 |
| Type 2 diabetes | Talmud et al., | Prospective cohort | 5535 | 20 | 0.54 | 0.60 |
| Prostate cancer | Sun et al., | Case-control | 4621 | 28 | 0.62 | 0.66 |
| Prostate cancer | Kader et al., | Case-control | 1654 | 33 | 0.59 | 0.67 |
| Type 2 diabetes | Lin et al., | Cross-sectional | 5360 | 15 | 0.59 | 0.60 |
| Type 2 diabetes | Talmud et al., | Prospective cohort | 5535 | 20 | 0.55 | 0.61 |
| AMD | Scholl et al., | Case-control | 179 | 3 | 0.73 | 0.69 |
| AMD | Hecker et al., | Case-control | 274 | 4 | 0.77 | 0.76 |
| AMD | Grassmann et al., | Case-control | 1782 | 13 | 0.82 | 0.78 |
| Colorectal cancer | Dunlop et al., | Case-control | 39266 | 10 | 0.57 | 0.60 |
| Colorectal cancer | Lubbe et al., | Prospective cohort | 14929 | 14 | 0.58 | 0.60 |
| Crohn disease | Peter et al., | Case-control | 872 | 7 | 0.71 | 0.68 |
| Prostate cancer | Aly et al., | Prospective cohort | 5241 | 36 | 0.67 | 0.69 |
| Prostate cancer | Helfand et al., | Case-control | 1464 | 9 | 0.61 | 0.62 |
| Type 2 diabetes | Weedon et al., | Case-control | 6077 | 3 | 0.58 | 0.59 |
| Type 2 diabetes | Vaxillaire et al., | Prospective cohort | 5212 | 3 | 0.56 | 0.58 |
| Type 2 diabetes | Hu et al., | Case-control | 3634 | 11 | 0.62 | 0.61 |
| Type 2 diabetes | Miyake et al., | Case-control | 2000 | 11 | 0.63 | 0.63 |
| Type 2 diabetes | Fontaine-Bisson et al., | Cross-sectional | 2751 | 17 | 0.59 | 0.61 |
| Type 2 diabetes | Van Hoek et al., | Prospective cohort | 6544 | 18 | 0.60 | 0.61 |
| Type 2 diabetes | Lango et al., | Case-control | 4907 | 18 | 0.60 | 0.61 |
| Type 2 diabetes | Sparso et al., | Case-control | 9395 | 19 | 0.60 | 0.61 |
AUC, area under the receiver operating characteristic curve; CI, confidence interval; AMD, age-related macular degeneration; SNP, single nucleotide polymorphism.
Adjusted for age;
Adjusted for sex.
Odds ratios of 18 single nucleotide polymorphisms in two prediction studies on type 2 diabetes and their corresponding values in the cited genome-wide association studies.
| Gene | SNP | OR in prediction study | OR in cited GWAS | ||
|---|---|---|---|---|---|
| GoDARTS study | Rotterdam study | GoDARTS study | Rotterdam study | ||
| rs2641348 | 1.15 (1.01, 1.30) | 1.01 (0.88, 1.17) | 1.10 (1.06, 1.15) | ||
| rs4607103 | 1.05 (0.96, 1.16) | 1.14 (1.03, 1.28) | 1.09 (1.06, 1.12) | ||
| rs12779790 | 1.10 (0.99, 1.21) | 1.05 (0.94, 1.19) | 1.11 (1.07, 1.14) | ||
| rs10946398 | 1.11 (1.02, 1.21) | 1.11 (1.02, 1.22) | 1.12 (1.08, 1.16) | ||
| rs10811661 | 1.21 (1.08, 1.35) | 1.10 (0.98, 1.24) | 1.20 (1.14, 1.25) | ||
| rs564398 | 1.13 (1.04, 1.22) | 1.04 (0.95, 1.14) | 1.12 (1.07, 1.17) | ||
| rs8050136 | 1.11 (1.02, 1.20) | 1.09 (0.99, 1.19) | 1.15 (1.09, 1.22) | ||
| rs1111875 | 1.02 (0.94, 1.11) | 1.06 (0.97, 1.15) | 1.13 (1.08, 1.17) | ||
| rs4402960 | 1.12 (1.03, 1.22) | 1.11 (1.01, 1.22) | 1.17 (1.10, 1.25) | ||
| rs864745 | 1.00 (0.93, 1.09) | 1.09 (1.00, 1.19) | 1.10 (1.07, 1.13) | ||
| rs5219 | 1.25 (1.15, 1.36) | 1.03 (0.93, 1.13) | 1.18 (1.04, 1.34) | 1.14 (1.10, 1.19) | |
| rs1801282 | 1.21 (1.07, 1.36) | 1.09 (0.95, 1.24) | 1.14 (1.08, 1.20) | ||
| rs13266634 | 1.10 (1.01, 1.20) | 1.13 (1.02, 1.24) | 1.12 (1.07, 1.16) | ||
| rs757210 | 1.07 (0.99, 1.16) | 1.07 (0.98, 1.18) | 1.12 (1.07, 1.18) | 1.22 (1.15, 1.30) | |
| rs7903146 | 1.36 (1.24, 1.48) | 1.31 (1.19, 1.44) | 1.47 (1.33, 1.62) | 1.38 (1.31, 1.46) | |
| rs7578597 | 1.04 (0.90, 1.19) | 1.10 (0.96, 1.27) | 1.15 (1.10, 1.20) | ||
| rs7961581 | 1.09 (1.00, 1.19) | 1.09 (0.99, 1.20) | 1.09 (1.06, 1.12) | ||
| rs10010131 | 1.07 (0.99, 1.16) | 1.12 (1.05, 1.27) | 1.11 (1.07, 1.16) | ||
Table is adapted from Janssens and Van Duijn (2009). The risk models of the GoDARTS study (Lango et al., 2008) and the Rotterdam Study (Van Hoek et al., 2008) included the same 18 genes and both had an AUC of 0.60. The AUC values from simulated data were the same and both were 0.61. The SNPs listed in the table are those used by the GoDARTS study. For several genes, the Rotterdam Study used different SNPs that were in linkage disequilibrium:
rs1493694, r2 = 0.74;
rs1412829, r2 = 0.97;
rs7754840, r2 = 1.00;
rs1353362, r2 = 0.96;
rs11257622; r2 = 0.83;
rs10012946, r2 = 1.00;
rs4430796, r2 = 0.61;
rs4411878, r2 = 0.95;
rs1635852, r2 = 0.97. SNP, single nucleotide polymorphism; GWAS, genome-wide association study.
When only one value is presented, both prediction studies cited the same GWAS.
GWAS studies reported for the SNP used by the Rotterdam study; all others are for SNPs used by the GoDARTS study.
Figure 1Plots published in empirical prediction studies and their reproductions in simulated data. (A) Receiver operating characteristic (ROC) curves for unweighted and weighted gene count scores (Peter et al., 2011). (B) Quintiles plot presenting the odds ratios with 95% confidence intervals by quintiles of the weighted genetic scores (Lin et al., 2009). (C) Scatter plot showing the variation in predicted risks stratified by the number of risk alleles (Dunlop et al., 2013). (D) Histogram showing the distribution of the number of risk alleles among patients and nonpatients (Peter et al., 2011).