| Literature DB >> 20193047 |
Douglas M Ruderfer1, Joshua Korn, Shaun M Purcell.
Abstract
Genome-wide association studies have detected dozens of variants underlying complex diseases, although it is uncertain how often these discoveries will translate into clinically useful predictors. Here, to improve genetic risk prediction, we consider including phenotypic and genotypic information from related individuals. We develop and evaluate a family-based liability-threshold prediction model and apply it to a simulation of known Crohn's disease risk variants. We show that genotypes of a relative of known phenotype can be informative for an individual's disease risk, over and above the same locus genotyped in the individual. This approach can lead to better-calibrated estimates of disease risk, although the overall benefit for prediction is typically only very modest.Entities:
Year: 2010 PMID: 20193047 PMCID: PMC2829927 DOI: 10.1186/gm123
Source DB: PubMed Journal: Genome Med ISSN: 1756-994X Impact factor: 11.117
Crohn's disease model specification
| RAF | GRR |
| VE |
|---|---|---|---|
| 0.018 | 3.99 | 0.504 | .0090 |
| 0.533 | 1.28 | 0.098 | .0048 |
| 0.425 | 1.25 | 0.083 | .0034 |
| 0.899 | 1.31 | 0.135 | .0033 |
| 0.387 | 1.25 | 0.083 | .0032 |
| 0.152 | 1.35 | 0.106 | .0029 |
| 0.677 | 1.22 | 0.080 | .0028 |
| 0.463 | 1.21 | 0.071 | .0025 |
| 0.478 | 1.20 | 0.067 | .0023 |
| 0.678 | 1.20 | 0.072 | .0022 |
| 0.780 | 1.21 | 0.079 | .0022 |
| 0.221 | 1.25 | 0.079 | .0022 |
| 0.933 | 2.50 | 0.130 | .0021 |
| 0.125 | 1.32 | 0.097 | .0021 |
| 0.565 | 1.18 | 0.062 | .0019 |
| 0.565 | 1.18 | 0.062 | .0019 |
| 0.697 | 1.18 | 0.064 | .0017 |
| 0.271 | 1.20 | 0.065 | .0016 |
| 0.090 | 1.33 | 0.099 | .0016 |
| 0.243 | 1.19 | 0.061 | .0014 |
| 0.386 | 1.16 | 0.053 | .0013 |
| 0.289 | 1.17 | 0.055 | .0013 |
| 0.345 | 1.16 | 0.053 | .0013 |
| 0.682 | 1.14 | 0.049 | .0010 |
| 0.389 | 1.13 | 0.043 | .0009 |
| 0.473 | 1.12 | 0.040 | .0008 |
| 0.348 | 1.12 | 0.040 | .0007 |
| 0.017 | 1.54 | 0.149 | .0007 |
| 0.708 | 1.11 | 0.038 | .0006 |
| 0.619 | 1.08 | 0.027 | .0004 |
Values used to generate simulated CD samples. RAF = risk allele frequency; GRR = genotypic relative risk, estimated from the reported odds ratios; a = additive genetic value; VE = variance explained.
Figure 1Predicted index disease risk. Predicted index disease risks from a single locus (MAF = 0.425, GRR = 1.25): unconditonal, P(D); conditional on index genotype, P(D|G); conditional on affected sibling phenotype, P(D|D); conditional on index genotype and affected sibling phenotype, P(D|G, D); conditional on index and sibling genotypes and affected sibling phenotype, P(D|G, G, D). The inserted table contains frequencies of sibling pair genotype combinations conditional on at least one sibling being affected. Red represents the homozygous risk-increasing genotype; green the heterozygous genotype; blue the homozygous risk-decreasing genotype.
Figure 2Predicted index disease risks from a single locus, under a variety of genetic models. Predicted index disease risk stratified by (a) effect size and (b) total sibling relative risk. See Figure 1 legend for details. In all cases, risk allele frequency is 0.425, disease prevalence is 1/250. (a) Varying the familial variance component of the residual variance from 20%, 50% to 80%, with corresponding sibling relative risks of 3.25, 12.25 and 35.5. (b) Varying additive genetic effect from a = 0.01, a = 0.05 to a = 0.1, with corresponding genotypic relative risks of 1.03, 1.16 and 1.30.
Crohn's disease simulation results
| Model | AUC |
|
|
|
|
|---|---|---|---|---|---|
| General population | 0.708 | 0.054 | 7.39 | 4.21 | 3.23 |
| | 0.726 | 0.085 | 15.90 | 5.71 | 3.91 |
| | 0.735 | 0.094 | 15.88 | 5.80 | 3.94 |
| | |||||
| Selected population (affected sibling) | |||||
| | 0.628 | 0.042 | 71.25 | 60.25 | 53.75 |
| | 0.648 | 0.056 | 82.00 | 67.20 | 58.48 |
Performance characteristics for tests based on the 30 Crohn's disease variants. Index individuals and their siblings were simulated in the unselected and selected (family history positive/affected sibling) scenarios. The prediction models estimate risk based on the index genotype G, and optionally sibling's phenotype Dand genotype G. The metrics are the area under the ROC curve (AUC), the squared correlation between disease state and risk (R2) and the relative enrichment of cases in the top 1, 5 and 10% of individuals with the highest risk scores relative to the baseline risk for that population (T1, T5 and T10). See main text for details.