| Literature DB >> 26635803 |
Kelly M Burkett1, Marie-Hélène Roy-Gagnon2, Jean-François Lefebvre2, Cheng Wang2, Bénédicte Fontaine-Bisson3, Lise Dubois2.
Abstract
The etiology of immune-related diseases or traits is often complex, involving many genetic and environmental factors and their interactions. While methodological approaches focusing on an outcome measured at one time point have succeeded in identifying genetic factors involved in immune-related traits, they fail to capture complex disease mechanisms that fluctuate over time. It is increasingly recognized that longitudinal studies, where an outcome is measured at multiple time points, have great potential to shed light on complex disease mechanisms involving genetic factors. However, longitudinal data require specialized statistical methods, especially in family studies where multiple sources of correlation in the data must be modeled. Using simulated data with known genetic effects, we examined the performance of different analytical methods for investigating associations between genetic factors and longitudinal phenotypes in twin data. The simulations were modeled on data from the Québec Newborn Twin Study, an ongoing population-based longitudinal study of twin births with multiple phenotypes, such as cortisol levels and body mass index, collected multiple times in infancy and early childhood and with sequencing data on immune-related genes and pathways. We compared approaches that we classify as (1) family-based methods applied to summaries of the observations over time, (2) longitudinal-based methods with simplifications of the familial correlation, and (3) Bayesian family-based method with simplifications of the temporal correlation. We found that for estimation of the genetic main and interaction effects, all methods gave estimates close to the true values and had similar power. If heritability estimation is desired, approaches of type (1) also provide heritability estimates close to the true value. Our work shows that the simpler approaches are likely adequate to detect genetic effects; however, interpretation of these effects is more challenging.Entities:
Keywords: family design; generalized estimating equations; genetic association; linear mixed models; longitudinal studies
Year: 2015 PMID: 26635803 PMCID: PMC4652172 DOI: 10.3389/fimmu.2015.00589
Source DB: PubMed Journal: Front Immunol ISSN: 1664-3224 Impact factor: 7.561
Simulation models.
| Parameter | Genetic effect modeled | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| (1) None | (2) Effect on average | (3) Effect on change (linear) | (4) Effect on change (segmented) | ||||||
| (a) | (b) | (c) | (d) | (a) | (b) | (a) | (b) | ||
| 11 | 11 | 11 | 11 | 11 | 11 | 11 | 11 | 11 | |
| 0.5 | 0.5 | 0.5 | 0.5 | 0.5 | 0.5 | 0.5 | 0.5 | 0.5 | |
| 0 | 0.1 | 0.15 | 0.2 | 0.3 | 0 | 0 | 0 | 0 | |
| 0.04 | 0.04 | 0.04 | 0.04 | 0.04 | 0.04 | 0.04 | 0.8 | 0.8 | |
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | −0.8 | −0.8 | |
| 0 | 0 | 0 | 0 | 0 | 0.005 | 0.01 | 0.005 | 0.01 | |
| 4.5 | 4.3 | 4.3 | 4.3 | 4.3 | 3 | 3 | 3 | 3 | |
| 2.25 | 2.25 | 2.25 | 2.25 | 2.25 | 1.5 | 1.5 | 1.5 | 1.5 | |
| 0 | 0 | 0 | 0 | 0 | 0.001 | 0.001 | 0.001 | 0.001 | |
| 0 | 0 | 0 | 0 | 0 | 0.001 | 0.001 | 0.001 | 0.001 | |
| 2.25 | 2.25 | 2.25 | 2.25 | 2.25 | 1.5 | 1.5 | 1.5 | 1.5 | |
Figure 1Example trajectories of observed and simulated BMI versus age in months. (A) Trajectories observed for two twin pairs in the QNTS. The monozygotic (MZ) twin pair is shown in red and the dizygotic (DZ) twin pair is shown in blue. (B) BMI trajectories in a simulated dataset under genetic model 2. The sample average at each age within each genotype category (G; coded as 0, 1, or 2 for the number of minor allele) is shown. (C) BMI trajectories in a simulated dataset under genetic model 3. (D) BMI trajectories in a simulated dataset under genetic model 4.
Figure 2Mean estimated values (SD) for fixed effects from selected simulation and analysis models. Analysis models shown are the classical twin analysis of the mean (twinlm mean in red) and slope (twinlm slope in blue), the marginal GEE model with unstructured working correlation matrix (GEE unstructured in black), the three-level hierarchical model (Hierarchical in yellow), and the Bayesian approach (Bayesian in green). The effect of genotype on rate of change (genotype–time interaction) was not modeled when not simulated.
Figure 3Estimated power or type I error from the 2000 simulated datasets for each model. (A) Estimated type I error from model 1 (simulated genetic effect at 0) and estimated power to detect a simulated genetic effect on the mean phenotype of 0.1, 0.15, 0.2, and 0.3 from model 2. (B) Estimated power to detect a simulated linear genetic effect on the rate of change of 0.005 and 0.01 from model 3. (C) Estimated power to detect a simulated segmented genetic effect on the rate of change of 0.005 and 0.01 from model 4. Methods shown are the classical twin analysis of the mean (twinlm mean in red) and slope (twinlm slope in blue), the marginal GEE model with unstructured working correlation matrix (GEE unstructured in black), the three-level hierarchical model (Hierarchical in yellow) and the Bayesian approach (Bayesian in green).