| Literature DB >> 18852893 |
Hans D Daetwyler1, Beatriz Villanueva, John A Woolliams.
Abstract
BACKGROUND: The prediction of the genetic disease risk of an individual is a powerful public health tool. While predicting risk has been successful in diseases which follow simple Mendelian inheritance, it has proven challenging in complex diseases for which a large number of loci contribute to the genetic variance. The large numbers of single nucleotide polymorphisms now available provide new opportunities for predicting genetic risk of complex diseases with high accuracy. METHODOLOGY/PRINCIPALEntities:
Mesh:
Year: 2008 PMID: 18852893 PMCID: PMC2561058 DOI: 10.1371/journal.pone.0003395
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Predicted accuracy and percentage prediction error assessed by simulation with disease prevalence = 0.1 (SE range 0.0004–0.0065).
|
|
|
|
|
| |||||
| P | %error | P | %error | P | %error | P | %error | ||
| C | 0.1 | 0.045 | 4.0 | 0.218 | 3.6 | 0.301 | 2.2 | 0.577 | 0.4 |
| 0.5 | 0.100 | 2.1 | 0.447 | −0.5 | 0.577 | −0.2 | 0.845 | −0.1 | |
| 0.9 | 0.133 | −1.3 | 0.557 | 0.2 | 0.688 | −0.2 | 0.905 | −0.1 | |
| DP
| 0.1 | 0.026 | −14.1 | 0.130 | −6.6 | 0.182 | −2.2 | 0.382 | −1.6 |
| 0.5 | 0.058 | −1.1 | 0.281 | 0.6 | 0.382 | −1.1 | 0.679 | 0.2 | |
| 0.9 | 0.078 | −9.8 | 0.365 | 1.6 | 0.485 | 0.8 | 0.779 | 0.2 | |
| DC
| 0.1 | 0.043 | −0.6 | 0.209 | 2.4 | 0.290 | 3.5 | 0.560 | −1.9 |
| 0.5 | 0.089 | −4.3 | 0.407 | 3.0 | 0.533 | 0.8 | 0.816 | −2.9 | |
| 0.9 | 0.112 | −20.0 | 0.490 | −0.4 | 0.622 | −0.4 | 0.872 | −3.3 | |
λ = number of phenotypes per number of loci.
h 2 = heritability (observed scale for C and DP, liability scale for DC).
P = predicted accuracy of estimated additive genetic value.
% error = percentage prediction error = 100(P−accuracy from simulation)/P.
C = continuous phenotype.
DP = dichotomous phenotype, population study.
DC = dichotomous phenotype, case control study.
The effects of different distributions of allele frequency and effects on accuracy in a continuous phenotype with observed heritability = 0.5 (SE range 0.0004–0.0057).
|
| Predicted | Simulated | |||
| Beta | Beta/Exp | Uni | Uni/Exp | ||
| 0.02 | 0.100 | 0.095 | 0.093 | 0.100 | 0.097 |
| 0.50 | 0.447 | 0.442 | 0.436 | 0.451 | 0.450 |
| 1.00 | 0.577 | 0.577 | 0.579 | 0.576 | 0.578 |
| 2.00 | 0.707 | 0.709 | 0.714 | 0.704 | 0.709 |
| 5.00 | 0.845 | 0.849 | 0.848 | 0.846 | 0.846 |
| 10.00 | 0.913 | 0.914 | 0.914 | 0.913 | 0.912 |
λ = number of phenotypes per number of loci.
Beta = beta distribution (alpha = 0.3, theta = 0.3) of allele frequencies.
Nrm = normal distribution of allele effects.
Exp = exponential distribution of allele effects.
Uni = uniform distribution of allele frequencies.
Accuracy for continuous phenotype when setting 0.95 of n a loci to zero (λ = 0.02 = 400n b/20,000n, SE range 0.0042–0.0057).
|
| 0.95 of | 0.0 of | Predicted |
| 0.1 | 0.057 | 0.043 | 0.045 |
| 0.5 | 0.101 | 0.097 | 0.100 |
| 0.9 | 0.129 | 0.135 | 0.133 |
n = number of loci.
n = number of phenotypes.
= observed heritability.
Figure 1Predicted accuracy of estimated genetic values of a continuous phenotype.
Predicted accuracy of estimated additive genetic values of a continuous phenotype as a function of observed heritability and number of phenotypes per genotype tested, λ = 0.02, 0.1, 0.5, 1, 2, 5, 10 and 20 from minimum to maximum accuracy respectively.
Accuracy for a dichotomous disease trait as prevalence varies (a , b λ = 1, SE range 0.0026–0.0048).
| Prevalence | Study Type DP
| Study Type DC
| ||
| P | % Error | P | % Error | |
| 0.01 | 0.186 | −0.8 | 0.593 | −11.1 |
| 0.03 | 0.271 | −1.9 | 0.568 | −6.8 |
| 0.05 | 0.317 | 0.3 | 0.554 | −3.5 |
| 0.10 | 0.382 | −0.6 | 0.533 | 0.6 |
| 0.20 | 0.444 | 1.4 | 0.511 | −2.5 |
| 0.30 | 0.473 | 1.2 | 0.499 | −0.2 |
| 0.40 | 0.487 | −0.6 | 0.493 | 1.2 |
| 0.50 | 0.491 | 0.0 | 0.491 | 1.4 |
= heritability on liability scale.
λ = number of phenotypes per number of loci.
DP = population study of dichotomous phenotypes.
DC = case control study of dichotomous phenotypes.
P = predicted accuracy of additive genetic values.
% error = percentage prediction error = 100(P−accuracy from simulation)/P.
Simulated accuracy of a population study for a dichotomous phenotype as prevalence and a varies and b stays constant (λ c = 10, , predicted accuracy = 0.816, Equation (4), SE range 0.0025–0.0038).
| Prevalence |
| Accuracy |
| 0.05 | 0.893 | 0.810 |
| 0.10 | 0.584 | 0.814 |
| 0.20 | 0.408 | 0.814 |
| 0.30 | 0.347 | 0.813 |
| 0.40 | 0.322 | 0.813 |
| 0.50 | 0.314 | 0.813 |
= heritability on liability scale.
= heritability on observed scale.
λ = number of phenotypes per number of loci.
Figure 2Predicted accuracy of estimated genetic risk from population and case control designs of a dichotomous phenotype.
Contour plot of predicted accuracy for varied prevalence and additive heritability on the observed scale, in population studies (dashed vertical line) and case control studies (solid line) of dichotomous phenotypes. Each contour represents a line of constant accuracy, starting from the right 0.9, 0.8, 0.7, and 0.6. The narrowly dashed line is derived from Equation (5) with , so values below this line are not possible under the liability model.