| Literature DB >> 19341491 |
A Cecile Jw Janssens1, Cornelia M van Duijn.
Abstract
The translation of emerging genomic knowledge into public health and clinical care is one of the major challenges for the coming decades. At the moment, genome-based prediction of common diseases, such as type 2 diabetes, coronary heart disease and cancer, is still not informative. Our understanding of the genetic basis of multifactorial diseases is improving, but the currently identified susceptibility variants contribute only marginally to the development of disease. At the same time, an increasing number of companies are offering personalized lifestyle and health recommendations on the basis of individual genetic profiles. This discrepancy between the limited predictive value and the commercial availability of genetic profiles highlights the need for a critical appraisal of the usefulness of genome-based applications in clinical and public health care. Anticipating the discovery of a large number of genetic variants in the near future, we need to prepare a framework for the design and analysis of studies aiming to evaluate the clinical validity and utility of genetic tests. In this article, we review recent studies on the predictive value of genetic profiling from a methodological perspective and address issues around the choice of the study population, the construction of genetic profiles, the measurement of the predictive value, calibration and validation of prediction models, and assessment of clinical utility. Careful consideration of these issues will contribute to the knowledge base that is needed to identify useful genome-based applications for implementation in clinical and public health practice.Entities:
Year: 2009 PMID: 19341491 PMCID: PMC2664953 DOI: 10.1186/gm20
Source DB: PubMed Journal: Genome Med ISSN: 1756-994X Impact factor: 11.117
AUC and effect estimates of susceptibility variants for the prediction of three diseases
| Disease | Coronary heart disease | Systemic lupus erythematosus | Hypertriglyceridemia | |||
|---|---|---|---|---|---|---|
| AUC* | 0.55 | 0.67 | 0.80 | |||
| Reference | [ | [ | [ | |||
| Genes and effect estimates | ||||||
| 1.28 (1.02, 1.61) | 2.36 (2.11, 2.64)‡ | 7.36 (3.98, 13.6) | ||||
| 1.18 (0.97, 1.44) | 1.62 (1.47,1.78)‡ | 5.57 (3.13, 9.90) | ||||
| 1.21 (1.00, 1.45) | 1.54 (1.40, 1.70)‡ | 2.81 (1.46, 5.24) | ||||
| 1.22 (1.01, 1.48) | 0.78 (0.73, 0.85) | 2.14 (1.31, 3.49) | ||||
| 1.22 (1.01, 1.47) | 1.25 (1.16, 1.35) | 2.11 (1.21, 3.67) | ||||
| 0.72 (0.52, 1.01) | rs10798269 | 0.82 (0.76, 0.88) | 2.10 (1.15, 3.81) | |||
| 2.02 (1.24, 3.30) | ||||||
*AUC, area under the receiver operating characteristic curve. Values are hazard ratios [35] or odds ratios with 95% confidence intervals. ‡The original paper mentions several polymorphisms per gene and that one for each gene was included to assess the combined predictive value of six variants. The polymorphisms that had the highest odds ratios are reported here.
Methodological characteristics of recent studies on the prediction of complex diseases using multiple genes*
| First author (year) and reference | Design | Cases | Controls† | Variant selection§ | Analyses | Evaluation | Calibration | Validation | Compared with clinical prediction | |
|---|---|---|---|---|---|---|---|---|---|---|
| Cauchi (2008) [ | Case-control | T2D | Normal glucose-tolerant individuals (screened) | 4,232/4,595 | From GWAS in same population | LR | Distribution RASs, OR for RASs, AUC | No | No | No |
| Harley (2008) [ | Case-control | Women with SLE | Age-matched women without SLE | 720/2,337 | From GWAS in same population | LR | AUC | No | No | No |
| Humphries (2007) [ | Prospective cohort | Coronary heart disease | Caucasian men | 183/1,874 | 4 (out of 12) candidate genes | Cox PH, weighted risk score, risk score | Kaplan-Meier curves, AUC | No | No | Yes |
| Kathiresan (2008) [ | Prospective cohort | Myocardial infarction, ischemic stroke and death from coronary heart disease | General population | 238/3,994 | 9 (out of 11) candidate SNPs in 9 genes | Cox PH, RAS | Distribution RASs, incidence rates for RASs, Kaplan-Meier curves, AUC, reclassification | No | No | Only added value¶ |
| Lango (2008) [ | Case-control | T2D | Normoglycemic (screened) | 2,309/2,598 | 18 established variants | LR, RAS | Distribution RASs, OR for RASs, AUC | No | No | Yes |
| Lyssenko (2005) [ | Prospective cohort | T2D | Relatives and spouses | 132/2,161 | 3 (out of 6) SNPs in 5 genes | Cox PH | HR for genotype combinations | No | No | No |
| Lyssenko (2008) [ | Prospective cohort | T2D | Two cohorts: general population and non-diabetic relatives | 2,201/16,630 | 11 (out of 16) established variants | LR, RAS | Distribution RASs, incidence rates for RASs, AUC, reclassification | No | No | Yes |
| Maller (2006) [ | Case-control | Advanced AMD | Individuals without AMD or early AMD | 1,238/934 | 5 (out of 1,536 tag SNPs) in candidate genes | LR | Relative risk for genotype combinations¥ | No | No | No |
| Meigs (2008) [ | Prospective cohort | T2D | Offspring of general population cohort | 255/2,122 | 18 established variants | LR, RAS | Distribution RASs, incidence rates for RASs AUC, reclassification | Yes | No | Yes |
| Morrison (2007) [ | Prospective cohort | Coronary heart disease | General population | 1,452/12,455 | 11 (out of 116) SNPs | Cox PH, RAS | Distribution RASs, AUC | No | Internal | Only added value |
| Podgoreanu (2006) [ | Prospective cohort | Myocardial infarction | Patients undergoing elective cardiac surgery with cardio-pulmonary bypass | 52/382 | 3 (out of 48) SNPs in 23 candidate genes | LR | AUC | No | No | Yes |
| Van der Net (2009) [ | Prospective cohort | Coronary heart disease | FH patients | 387/950 | 14 SNPs previously associated | Cox PH, RAS | Predicted risk for RASs, AUC | No | No | Yes |
| Van Hoek (2008) [ | Prospective cohort | T2D | General population | 1,287/5,221 | 18 established variants | Cox PH, LR, RAS | Predicted risks for RASs, OR of RASs, AUC | No | No | Yes |
| Vaxillaire (2008) [ | Prospective cohort | T2D | General population | 307/3,570 | 3 (out of 19) SNPs in 14 candidate genes | LR | OR of RAS, AUC | No | No | Yes |
| Wang (2008) [ | Case-control | Severe hypertriglyceridemia | Normolipidemic controls | 132/351 | 7 established variants | LR | AUC Hosmer-Lemeshow goodness of fit | No | Only added value | |
| Weedon (2006) [ | Case-control | T2D | Individuals without T2D, including one normoglycemic subpopulation | 2,409/3,668 | 3 established variants | LR, RAS | Distribution RASs, OR of RASs, AUC | No | No | No |
| Weersma (2008) [ | Case-control | Chronic inflammatory bowel disease | Healthy controls | 2,804/1,350 | 5 established genes | LR, RAS | OR of RASs | No | No | No |
| Yeh (2007) [ | Case-control | Colorectal cancer | Healthy controls | 727/736 | 3 (out of 10) established variants | LR | OR for genotype combinations¥ | No | No | No |
| Zheng (2008) [ | Case-control | Prostate cancer | General population | 2,893/1,781 | 5 (out of 16) SNPs in 5 candidate regions | LR, genotype score | Distribution genotype score, AUC, PAR | No | No | No |
*Abbreviations: AMD, age-related macular degeneration; AUC, area under the receiver operating characteristic curve, sometimes measured by the c-statistic; Cox PH, Cox proportional hazard regression analysis; FH, familial hypercholesterolemia; GWAS, genome-wide association studies; HR, hazard ratio; LR, logistic regression; OR, odds ratio; PAR, population attributable risk; RAS, risk allele score; SLE, systemic lupus erythematosus; SNP, single nucleotide polymorphism; T2D, type 2 diabetes. †For prospective cohort studies, this column describes the total population from which the cases were obtained. ‡For prospective cohort studies, these numbers indicate the number of cases divided by the number of individuals who did not develop the disease during follow-up. §Numbers between parentheses indicate the total number of variants at the start of the analysis from which the most predictive variants were selected. Candidate means that the variants were selected from the literature, on the basis of association with disease risk in other studies. ¶This study compared prediction from clinical risk factors with clinical risk factors plus genetic factors, but did not consider prediction from genetic factors alone. ¥These studies did not intend to evaluate the predictive value, but investigated the combined effect of multiple variants on disease risk.
Effect estimates of 18 established susceptibility variants on type 2 diabetes risk in two studies
| Odds ratio (95% confidence interval)* | |||
|---|---|---|---|
| Gene | Locus | GoDARTS study [ | Rotterdam study [ |
| rs7903146 | |||
| rs5219 | 1.03 (0.93, 1.13) | ||
| rs10811661 | 1.10 (0.98, 1.24) | ||
| rs1801282 | 1.09 (0.95, 1.24) | ||
| rs2641348† | 1.01 (0.88, 1.17) | ||
| rs564398‡ | 1.04 (0.95, 1.14) | ||
| rs4402960 | |||
| rs8050136 | 1.09 (0.99, 1.19) | ||
| rs10946398§ | |||
| rs13266634 | |||
| rs7961581¶ | 1.09 (0.99, 1.20) | ||
| rs12779790¥ | 1.10 (0.99, 1.21) | 0.95 (0.84, 1.06) | |
| rs10010131** | 1.07 (0.99, 1.16) | ||
| rs757210†† | 1.07 (0.99, 1.16) | 0.93 (0.85, 1.02) | |
| rs4607103‡‡ | 1.05 (0.96, 1.16) | ||
| rs1111875 | 1.02 (0.94, 1.11) | 1.06 (0.97, 1.15) | |
| rs7578597 | 1.04 (0.90, 1.19) | 1.10 (0.96, 1.27) | |
| rs864745§ | 1.00 (0.93, 1.09) | ||
*AUC (area under the receiver operating characteristic curve) was 0.60 for both studies. Values were obtained using logistic regression analyses. For several genes, the Rotterdam study [14] uses different single nucleotide polymorphisms from those listed: ‡‡rs4411878, r2 = 0.95; ¥rs11257622; r2 = 0.83; ‡rs1412829, r2 = 0.97; §rs1635852, r2 = 0.97; †rs1493694; ¶rs1353362, r2 = 0.96; ††rs4430796, r2 = 0.61; **rs10012946, r2 = 1.00; §rs7754840, r2 = 1.00. Statistically significant associations are presented in bold and r2 is a measure of linkage disequilibrium.