| Literature DB >> 25873936 |
Tao Wang1, Peng He2, Kwang Woo Ahn1, Xujing Wang3, Soumitra Ghosh4, Purushottam Laud1.
Abstract
The generalized linear mixed model (GLMM) is a useful tool for modeling genetic correlation among family data in genetic association studies. However, when dealing with families of varied sizes and diverse genetic relatedness, the GLMM has a special correlation structure which often makes it difficult to be specified using standard statistical software. In this study, we propose a Cholesky decomposition based re-formulation of the GLMM so that the re-formulated GLMM can be specified conveniently via "proc nlmixed" and "proc glimmix" in SAS, or OpenBUGS via R package BRugs. Performances of these procedures in fitting the re-formulated GLMM are examined through simulation studies. We also apply this re-formulated GLMM to analyze a real data set from Type 1 Diabetes Genetics Consortium (T1DGC).Entities:
Keywords: Bayesian methods; Cholesky decomposition; family data; generalized linear mixed models (GLMM); genetic correlation; genetic variance components; random genetic effects; re-parameterization
Year: 2015 PMID: 25873936 PMCID: PMC4379931 DOI: 10.3389/fgene.2015.00120
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
The true values of model parameters in simulation.
| The number of families with one child | 0, 500, 1000 | |
| The number of families with two children | 500, 1000, 2000 | |
| The intercept | 10 | |
| α | The fixed effect of Z | 2 |
| β1 | The genetic effects of marker 1 | (1, -2) |
| β2 | The genetic effects of marker 2 | (1, 0) |
| β3 | The genetic effects of marker 3 | (1, -1) |
Mean and standard deviation (SD) of the parameter estimates from 200 simulations for linear mixed models.
| 10.01 (0.07) | 10.00 (0.07) | 10.00 (0.06) | 9.99 (0.05) | 10.00 (0.05) | 10.00 (0.05) | |
| α | 2.00 (0.06) | 2.01 (0.06) | 2.00 (0.04) | 2.00 (0.04) | 2.00 (0.03) | 2.00 (0.03) |
| β1 | 1.00 (0.07) | 1.00 (0.07) | 1.00 (0.05) | 1.00 (0.05) | 1.00 (0.05) | 1.00 (0.05) |
| β1 | −2.00 (0.11) | −2.00 (0.11) | −2.01 (0.08) | −2.00 (0.07) | −1.99 (0.07) | −2.00 (0.07) |
| β2 | 0.99 (0.06) | 1.01 (0.06) | 1.01 (0.05) | 1.00 (0.05) | 1.00 (0.04) | 1.01 (0.05) |
| β2 | 0.04 (0.17) | −0.01 (0.18) | −0.00 (0.11) | −0.02 (0.11) | −0.00 (0.10) | 0.00 (0.11) |
| β3 | 1.00 (0.08) | 1.00 (0.08) | 1.00 (0.05) | 1.00 (0.05) | 1.00 (0.05) | 1.00 (0.05) |
| β3 | −0.98 (0.30) | −0.97 (0.33) | −1.00 (0.21) | −0.99 (0.21) | −1.00 (0.21) | −0.98 (0.17) |
| σ2 | 1.01 (0.18) | 0.98 (0.18) | 1.01 (0.12) | 1.00 (0.12) | 1.00 (0.11) | 1.00 (0.11) |
| σ2 | 0.54 (0.36) | 0.46 (0.31) | 0.53 (0.30) | 0.49 (0.27) | 0.51 (0.22) | 0.49 (0.22) |
| σ2 | 0.99 (0.11) | 1.01 (0.12) | 0.99 (0.08) | 1.00 (0.08) | 0.99 (0.08) | 1.01 (0.08) |
| σ2 | 0.95 (0.36) | 1.06 (0.32) | 0.97 (0.31) | 1.02 (0.27) | 0.99 (0.23) | 1.03 (0.23) |
The average length of the 95% confidence (or probability) intervals and the coverage rate for true parameters from 200 simulations for linear mixed models.
| 0.31, 96.0% | 0.31, 96.0% | 0.22, 94.5% | 0.22, 96.0% | 0.21, 96.0% | 0.21, 96.5% | |
| α | 0.21, 94.5% | 0.21, 94.5% | 0.15, 95.0% | 0.15, 97.0% | 0.14, 96.0% | 0.14, 95.0% |
| β1 | 0.28, 96.0% | 0.28, 94.0% | 0.20, 95.0% | 0.20, 96.5% | 0.18, 95.0% | 0.18, 96.5% |
| β1 | 0.43, 95.5% | 0.43, 95.0% | 0.30, 93.5% | 0.30, 97.0% | 0.28, 93.0% | 0.28, 92.0% |
| β2 | 0.26, 97.5% | 0.26, 96.5% | 0.18, 95.5% | 0.18, 94.5% | 0.17, 97.0% | 0.17, 90.5% |
| β2 | 0.67, 93.5% | 0.67, 92.0% | 0.47, 96.0% | 0.47, 97.5% | 0.44, 96.0% | 0.44, 96.0% |
| β3 | 0.32, 95.0% | 0.32, 96.5% | 0.23, 98.5% | 0.23, 96.5% | 0.21, 95.5% | 0.21, 94.5% |
| β3 | 1.22, 96.5% | 1.20, 92.5% | 0.85, 94.5% | 0.85, 97.0% | 0.78, 95.0% | 0.78, 98.5% |
| σ2 | 0.68, 94.5% | 0.68, 92.0% | 0.49, 96.0% | 0.48, 95.0% | 0.47, 97.0% | 0.46, 95.0% |
| σ2 | 1.61, 87.0% | 0.77, 74.0% | 1.16, 89.5% | 0.66, 75.0% | 0.84, 93.5% | 0.62, 84.0% |
| σ2 | 0.44, 94.5% | 0.44, 95.0% | 0.32, 94.5% | 0.31, 94.0% | 0.31, 96.0% | 0.31, 94.0% |
| σ2 | 1.54, 87.0% | 0.87, 74.5% | 1.12, 88.5% | 0.72, 75.5% | 0.86, 95.0% | 0.67, 85.0% |
Means (SD) of the parameter estimates and AL (CR) of the 95% probability intervals from 200 simulations for mixed logistic regression models.
| −2.69 (0.20) | 0.82, 69.0% | −2.69 (0.16) | 0.63, 56.0% | −2.69 (0.14) | 0.58, 47.0% | |
| α | 1.80 (0.13) | 0.58, 75.5% | 1.79 (0.11) | 0.44, 58.0% | 1.80 (0.10) | 0.41, 58.5% |
| β1 | 0.89 (0.14) | 0.55, 85.5% | 0.89 (0.10) | 0.40, 80.5% | 0.90 (0.09) | 0.37, 77.5% |
| β1 | −1.78 (0.22) | 0.90, 85.5% | −1.79 (0.16) | 0.66, 78.5% | −1.81 (0.16) | 0.62, 74.0% |
| β2 | 0.90 (0.12) | 0.49, 87.5% | 0.89 (0.11) | 0.36, 78.0% | 0.91 (0.09) | 0.34, 83.5% |
| β2 | 0.01 (0.31) | 1.24, 95.5% | 0.02 (0.23) | 0.87, 95.0% | −0.01 (0.22) | 0.82, 94.5% |
| β3 | 0.88 (0.15) | 0.58, 85.0% | 0.89 (0.11) | 0.42, 83.0% | 0.88 (0.10) | 0.39, 77.0% |
| β3 | −0.84 (0.55) | 2.20, 95.0% | −0.85 (0.40) | 1.56, 93.0% | −0.84 (0.36) | 1.46, 94.5% |
| σ2 | 1.11 (0.45) | 2.00, 95.5% | 1.08 (0.42) | 1.56, 93.0% | 1.02 (0.32) | 1.45, 98.5% |
| σ2 | 0.49 (0.18) | 0.80, 94.5% | 0.52 (0.20) | 0.73, 93.5% | 0.57 (0.17) | 0.72, 94.5% |
| σ2 | 0.74 (0.19) | 0.83, 78.0% | 0.72 (0.14) | 0.64, 62.0% | 0.75 (0.14) | 0.62, 70.0% |
Number of subjects and families in T1DGC by cohorts.
| No. of subjects | 741 | 664 | 1936 | 347 | 465 |
| No. of families | 184 | 147 | 475 | 77 | 113 |
| Maximum family size | 6 | 14 | 8 | 6 | 6 |
Posterior means and 2.5%, 97.5% percentiles of the odds ratios and variance components for type I diabetes in five cohorts of the T1DGC data set.
| Baseline intercept (μ) | 2.32 (1.58, 3.29) | 2.64 (1.72, 3.67) | 3.70 (3.05, 4.48) | 1.11 (0.12, 2.14) | 2.22 (1.44, 3.20) |
| 18<age≤30 vs. age≤18 ( | 1.23 (0.56, 2.82) | 0.50 (0.19, 1.39) | 0.35 (0.20, 0.59) | 0.69 (0.21, 2.32) | 0.45 (0.15, 1.35) |
| 30<age≤40 vs. age≤18 ( | 0.10 (0.036, 0.25) | 0.25 (0.09, 0.64) | 0.047 (0.024, 0.086) | 0.48 (0.15, 1.45) | 0.014 (0.003, 0.045) |
| 40<age≤50 vs. age≤18 ( | 0.01 (0.003, 0.04) | 0.07 (0.02, 0.19) | 0.007 (0.002, 0.014) | 0.06 (0.01, 0.22) | 0.004 (0.001, 0.013) |
| 50<age≤60 vs. age≤18 ( | 0.005 (0.001, 0.022) | 0.04 (0.01, 0.11) | 0.002 (0.0004, 0.004) | 0.009 (0.001, 0.063) | 0.005 (0.001, 0.025) |
| age>60 vs. age≤18 ( | 0.012 (0.002, 0.055) | 0.009 (0.002, 0.032) | 0.0004 (0.0001, 0.0015) | 0.003 (0.0001, 0.023) | 0.027 (0.0003, 1.25) |
| Female vs. Male ( | 1.27 (0.75, 2.21) | 0.60 (0.35, 0.97) | 0.60 (0.43, 0.83) | 0.60 (0.24, 1.38) | 1.36 (0.64, 2.92) |
| DQrisk = 0 vs. 2 ( | 0.09 (0.025, 0.26) | 0.02 (0.005, 0.06) | 0.04 (0.02, 0.08) | 0.11 (0.01, 0.71) | 0.02 (0.002, 0.12) |
| DQrisk = 1 vs. 2 ( | 0.14 (0.05, 0.30) | 0.30 (0.14, 0.59) | 0.25 (0.15, 0.39) | 0.19 (0.04, 0.63) | 0.24 (0.08, 0.71) |
| DQrisk = 3 vs. 2 ( | 2.26 (1.17, 4.80) | 5.84 (2.75, 14.47) | 5.91 (3.48, 10.61) | 15.89 (4.80, 82.60) | 8.36 (3.30, 26.13) |
| Additive variance (σ2 | 0.71 (0.20, 1.90) | 0.69 (0.21, 1.73) | 0.47 (0.17, 1.05) | 0.68 (0.19, 2.11) | 0.71 (0.17, 2.05) |
| Dominant variance (σ2 | 1.64 (0.28, 6.35) | 1.44 (0.21, 4.93) | 0.66 (0.23, 1.85) | 2.96 (0.41, 10.51) | 1.25 (0.28, 4.32) |
| Family-shared variance (σ2 | 0.62 (0.20, 1.45) | 0.66 (0.22, 1.42) | 2.36 (1.33, 3.92) | 0.56 (0.17, 1.42) | 0.63 (0.19, 1.67) |