| Literature DB >> 24523728 |
Yun Ju Sung1, Jeannette Simino1, Rezart Kume1, Jacob Basson1, Karen Schwander1, D C Rao1.
Abstract
Gene-environment interaction (GEI) analysis can potentially enhance gene discovery for common complex traits. However, genome-wide interaction analysis is computationally intensive. Moreover, analysis of longitudinal data in families is much more challenging due to the two sources of correlations arising from longitudinal measurements and family relationships. GWIS of longitudinal family data can be a computational bottleneck. Therefore, we compared two methods for analysis of longitudinal family data: a methodologically sound but computationally demanding method using the Kronecker model (KRC) and a computationally more forgiving method using the hierarchical linear model (HLM). The KRC model uses a Kronecker product of an unstructured matrix for correlations among repeated measures (longitudinal) and a compound symmetry matrix for correlations within families at a given visit. The HLM uses an autoregressive covariance matrix for correlations among repeated measures and a random intercept for familial correlations. We compared the two methods using the longitudinal Framingham heart study (FHS) SHARe data. Specifically, we evaluated SNP-alcohol (amount of alcohol consumption) interaction effects on high density lipoprotein cholesterol (HDLC). Keeping the prohibitive computational burden of KRC in mind, we limited the analysis to chromosome 16, where preliminary cross-sectional analysis yielded some interesting results. Our first important finding was that the HLM provided very comparable results but was remarkably faster than the KRC, making HLM the method of choice. Our second finding was that longitudinal analysis provided smaller P-values, thus leading to more significant results, than cross-sectional analysis. This was particularly pronounced in identifying GEIs. We conclude that longitudinal analysis of GEIs is more powerful and that the HLM method is an optimal method of choice as compared to the computationally (prohibitively) intensive KRC method.Entities:
Keywords: Framingham heart study; HLM; SNP–alcohol interactions; gene–environment interactions; interactions in family data; longitudinal family data
Year: 2014 PMID: 24523728 PMCID: PMC3906599 DOI: 10.3389/fgene.2014.00009
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Descriptive statistics of the longitudinal family data used in the analysis.
| Characteristics | 1-visit | 3-visits | 5-visits |
|---|---|---|---|
| Unique individuals, | 3,012 | 3,946 | 4,190 |
| Observed data, | 3,012 | 9,620 | 16,480 |
| Male, | 1,397 (46.38%) | 4,496 (46.74%) | 7,703 (46.74%) |
| Anti-lipids med use, | 624 (20.72%) | 921 (9.57%) | 1,462 (8.87%) |
| Age, years | 60.75 ± 9.25 | 55.53 ± 11.68 | 55.89 ± 11.54 |
| BMI, kg/m2 | 28.16 ± 5.31 | 27.26 ± 5.01 | 27.28 ± 4.99 |
| Alcohol, oz./week | 2.61 ± 3.79 | 2.81 ± 4.11 | 2.73 ± 4.06 |
| HDLC, mg/dL | 53.74 ± 17.05 | 51.57 ± 15.75 | 51.05 ± 15.65 |
CPU time for running analysis with three mean models at each SNP.
| Model | 1-visit | 3-visits | 5-visits |
|---|---|---|---|
| Kronecker model | 0.55 s | 1.43 m | 42.33 m |
| Hierarchical linear model | 0.41 s | 1.42 s | 2.54 s |
Number of SNPs with P-values below thresholds.
| Genetic main effect | Gene–environment interaction effect | Joint main and interaction | |||||||
| 1-visit | 3-visits | 5 visits | 1-visit | 3-visits | 5-visits | 1-visit | 3-visits | 5-visits | |
| 0 | 1 | 1 | 8 | 10 | 31 | 9 | 10 | 26 | |
| 0 | 1 | 1 | 4 | 0 | 16 | 3 | 1 | 15 | |
| 0 | 1 | 1 | 0 | 0 | 6 | 0 | 1 | 4 | |