| Literature DB >> 24670270 |
Nicholas A Furlotte1, David Heckerman2, Christoph Lippert2.
Abstract
The use of mixed models to determine narrow-sense heritability and related quantities such as SNP heritability has received much recent attention. Less attention has been paid to the inherent variability in these estimates. One approach for quantifying variability in estimates of heritability is a frequentist approach, in which heritability is estimated using maximum likelihood and its variance is quantified through an asymptotic normal approximation. An alternative approach is to quantify the uncertainty in heritability through its Bayesian posterior distribution. In this paper, we develop the latter approach, make it computationally efficient and compare it to the frequentist approach. We show theoretically that, for a sufficiently large sample size and intermediate values of heritability, the two approaches provide similar results. Using the Atherosclerosis Risk in Communities cohort, we show empirically that the two approaches can give different results and that the variance/uncertainty can remain large.Entities:
Mesh:
Year: 2014 PMID: 24670270 PMCID: PMC4521294 DOI: 10.1038/jhg.2014.15
Source DB: PubMed Journal: J Hum Genet ISSN: 1434-5161 Impact factor: 3.172
Figure 1Posterior distributions of SNP heritability for five phenotypes in the ARIC cohort on different subsets of the data. (a) Caucasian males, (b) Caucasian females, (c) African American males, (d) African American females, and (e) the complete data. Posterior distributions were calculated assuming a uniform prior over h2 and a relatively flat prior, Γ−1(1,1), over the variance σ2. The maximum a posteriori value for heritability is indicated by a red dot. The plots reveal a large degree of uncertainty in heritability.
Summary statistics for heritability posteriors in the ARIC cohort
| Caucasian male | Height | 3617 | 0.49 | 0.49 | 0.08 | 0.03 | 2.98 | 0.49 | 0.08 |
| BMI | 3607 | 0.13 | 0.15 | 0.08 | 0.4 | 2.82 | 0.13 | 0.09 | |
| Weight | 3607 | 0.2 | 0.21 | 0.08 | 0.15 | 2.87 | 0.2 | 0.08 | |
| QTi | 461 | 0.01 | 0.25 | 0.21 | 1.13 | 3.8 | 0 | 0.61 | |
| vWF | 3608 | 0.09 | 0.11 | 0.07 | 0.6 | 3.03 | 0.09 | 0.08 | |
| Caucasian female | Height | 4000 | 0.54 | 0.54 | 0.08 | 0.009 | 2.98 | 0.5 | 0.08 |
| BMI | 3983 | 0.35 | 0.35 | 0.08 | 0.008 | 2.99 | 0.35 | 0.08 | |
| Weight | 3983 | 0.25 | 0.25 | 0.08 | 0.04 | 2.94 | 0.25 | 0.08 | |
| QTi | 444 | 0.99 | 0.7 | 0.23 | −0.9 | 3.09 | 1 | 0.63 | |
| vWF | 3988 | 0.33 | 0.33 | 0.08 | 0.03 | 2.98 | 0.32 | 0.08 | |
| African American male | Height | 892 | 0.33 | 0.44 | 0.26 | 0.21 | 2.02 | 0.25 | 0.4 |
| BMI | 870 | 0.21 | 0.39 | 0.25 | 0.42 | 2.25 | 0.14 | 0.38 | |
| Weight | 870 | 0.05 | 0.33 | 0.23 | 0.69 | 2.72 | 0.01 | 0.35 | |
| QTi | 125 | 0.01 | 0.43 | 0.28 | 0.26 | 1.91 | 0 | 1.6 | |
| vWF | 871 | 0.08 | 0.32 | 0.23 | 0.71 | 2.8 | 0.03 | 0.33 | |
| African American female | Height | 1307 | 0.01 | 0.15 | 0.12 | 1.32 | 5.01 | 0 | 0.23 |
| BMI | 1248 | 0.3 | 0.35 | 0.19 | 0.47 | 2.85 | 0.28 | 0.23 | |
| Weight | 1248 | 0.37 | 0.41 | 0.21 | 0.3021 | 2.53 | 0.34 | 0.25 | |
| QTi | 160 | 0.76 | 0.51 | 0.28 | −0.06 | 1.83 | 0.69 | 1.35 | |
| vWF | 1251 | 0.22 | 0.3 | 0.18 | 0.66 | 3.07 | 0.2 | 0.23 | |
| All | Height | 5867 | 0.44 | 0.44 | 0.04 | 0.002 | 2.99 | 0.38 | 0.06 |
| BMI | 5844 | 0.21 | 0.21 | 0.05 | 0.006 | 2.99 | 0.21 | 0.06 | |
| Weight | 5844 | 0.19 | 0.19 | 0.05 | 0.02 | 2.97 | 0.18 | 0.06 | |
| QTi | 857 | 0.19 | 0.37 | 0.22 | 0.47 | 2.37 | 0.17 | 0.37 | |
| vWF | 5845 | 0.2 | 0.2 | 0.05 | 0.009 | 2.99 | 0.13 | 0.06 |
Abbreviations: ARIC, Atherosclerosis Risk in Communities; BMI, body mass index; QTi, QT interval; vWF, vonWillebrand factor.
The Bayesian posterior expectation corresponds to the frequentist restricted maximum-likelihood estimate. The Bayesian posterior s.d. corresponds to the frequentist s.e.
Figure 2Comparison of maximum-likelihood estimator distribution with posterior distribution for the examples of (a) height in Caucasian males, (b) height in African American females, and (c) QTi in Caucasian females. Shown are frequentist estimates of the variation of the heritability estimate obtained from GCTA (constrained to values of heritability between 0 and 1) and Bayesian posterior distributions. The area under each distribution is 1. A red dot indicates the maximum of each curve.
Figure 3Heatmaps showing the change in maximum a posteriori estimates of SNP heritability and the influence of the variance prior. In both heatmaps, MAP values for different hyperparameter settings for the prior distribution on σ2 are given. Each grid point represents one hyperparameter setting, and the intensity represents the MAP value, as indicated by the color bar. (a) Height in Caucasian males (sample size is 3617). (b) QTi in African American females (sample size is 160).