| Literature DB >> 25098835 |
Zhiyuan Xu1, Xiaotong Shen2, Wei Pan1.
Abstract
Most existing genome-wide association analyses are cross-sectional, utilizing only phenotypic data at a single time point, e.g. baseline. On the other hand, longitudinal studies, such as Alzheimer's Disease Neuroimaging Initiative (ADNI), collect phenotypic information at multiple time points. In this article, as a case study, we conducted both longitudinal and cross-sectional analyses of the ADNI data with several brain imaging (not clinical diagnosis) phenotypes, demonstrating the power gains of longitudinal analysis over cross-sectional analysis. Specifically, we scanned genome-wide single nucleotide polymorphisms (SNPs) with 56 brain-wide imaging phenotypes processed by FreeSurfer on 638 subjects. At the genome-wide significance level P < 1.8 x 10(9)) or a less stringent level (e.g. P < 10(7)), longitudinal analysis of the phenotypic data from the baseline to month 48 identified more SNP-phenotype associations than cross-sectional analysis of only the baseline data. In particular, at the genome-wide significance level, both SNP rs429358 in gene APOE and SNP rs2075650 in gene TOMM40 were confirmed to be associated with various imaging phenotypes in multiple regions of interests (ROIs) by both analyses, though longitudinal analysis detected more regional phenotypes associated with the two SNPs and indicated another significant SNP rs439401 in gene APOE. In light of the power advantage of longitudinal analysis, we advocate its use in current and future longitudinal neuroimaging studies.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25098835 PMCID: PMC4123854 DOI: 10.1371/journal.pone.0102312
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Trajectories of phenotype left hippocampus volume over time (in months) in three allele groups of SNP rs2075650.
Significant SNPs and each one's associated phenotype numbers at the significance level of .
| Gene | # Phenotypes | |||
| SNP | (Chr) | Position | Longitudinal | Baseline |
| rs2075650 | TOMM40 | 45,395,619 | 3 | 1 |
| (19) | LHippVol: | LHippVol: | ||
| RCerebCtx: | ||||
| LMeanTemp: | ||||
| rs439401 | APOE | 45,414,451 | 1 | 0 |
| (19) | LMeanLatTemp: | |||
| rs429358 | APOE | 45,411,941 | 42 | 4 |
| (19) | LHippVol: | LHippVol: | ||
| LEntCtx: | RHippVol: | |||
| LAmygVol: | LAmygVol: | |||
| - | - |
| 46 | 5 |
Top 3 SNP-phenotype associations are listed with corresponding P-values.
The number (percentage) of non-missing observations at each time point in Figure 1.
| Month | 0 | 6 | 12 | 18 | 24 | 36 | 48 |
| #Obs | 635 (99.5%) | 616 (96.6%) | 574 (90%) | 246 (38.6%) | 462 (72.4%) | 263 (41.2%) | 56 (8.8%) |
The baseline characteristics of 638 subjects, including gender, age, years of education, handedness (R/L) and intracranial volume (ICV).
| Name | HC | MCI | AD | P-value |
| number of subjects | 182 | 311 | 145 | - |
| Gender(M/F) | 103/79 | 204/107 | 80/65 | 0.0446 |
| Baseline age |
|
|
| 0.4153 |
| Education (years) |
|
|
| 0.0005 |
| Hand(R/L) |
|
|
| 0.6392 |
| ICV |
|
|
| 0.1463 |
|
|
|
|
P-values were calculated to test for differences among the diagnostic groups, HC, MCI and AD.
56 cortical thickness and volumetric phenotypes.
| Trait Name | Trait Description | Trait Name | Trait Description |
| AmygVol | Volume of amygdala | MidTemporal | Thickness of middle temporal gyrus |
| CerebCtx | Volume of cerebral cortex | Parahipp | Thickness of parahippocampal gyrus |
| CerebWM | Volume of cerebral white matter | PostCing | Thickness of posterior cingulate |
| HippVol | Volume of hippocampus | Postcentral | Thickness of postcentral gyrus |
| InfLatVent | Volume of inferior lateral ventricle | Precentral | Thickness of precentral gyrus |
| LatVent | Volume of lateral ventricle | Precuneus | Thickness of precuneus |
| EntCtx | Thickness of entorhinal cortex | SupFrontal | Thickness of superior frontal gyrus |
| Fusiform | Thickness of fusiform gyrus | SupParietal | Thickness of superior parietal gyrus |
| InfParietal | Thickness of inferior parietal gyrus | SupTemporal | Thickness of superior temporal gyrus |
| InfTemporal | Thickness of inferior temporal gyrus | Supramarg | Thickness of supramarginal gyrus |
| MeanCing | Mean thickness of caudal anterior | TemporalPole | Thickness of temporal pole |
| cingulate, isthmus cingulate, posterior | |||
| cingulate, and rostral anterior cingulate | |||
| MeanFront | Mean thickness of caudal midfrontal | MeanTemp | Mean thickness of inferior temporal, |
| rostral midfrontal, superior frontal, | middle temporal, superior temporal, | ||
| lateral orbitofrontal, and medial | fusiform, parahippocampal, ling- | ||
| orbitofrontal gyri and frontal pole | ual gyri temporal pole and transverse | ||
| temporal pole | |||
| MeanLatTemp | Mean thickness of inferior temporal, | MeanSensMotor | Mean thickness of precentral and |
| middle temporal, and superior | postcentral gyri | ||
| temporal gyri | |||
| MeanMedTemp | Mean thickness of fusiform, | MeanPar | Mean thickness of inferior and |
| parahippocampal, and lingual gyri, | superior parietal gyri, supramarginal | ||
| temporal pole and transverse | gyrus, and precuneus | ||
| temporal pole |
There are 2 phenotypes for each given phenotype name at the left and right sides of the brain respectively.
Significant SNPs and each one's associated phenotype numbers at the level of .
| Gene | # Phenotypes | |||
| SNP | (Chr) | Position | Longitudinal | Baseline |
| rs2075650 | TOMM40 | 45,395,619 | 2 | 1 |
| (19) | LHippVol: | LHippVol: | ||
| RCerebCtx: | ||||
| rs439401 | APOE | 45,414,451 | 1 | 0 |
| (19) | LMeanLatTemp: | |||
| rs429358 | APOE | 45,411,941 | 40 | 4 |
| (19) | LHippVol: | LHippVol: | ||
| LEntCtx: | RHippVol: | |||
| LAmygVol: | LAmygVol: | |||
| - | - |
| 43 | 5 |
Top 3 SNP-phenotype associations are listed with corresponding P-values.
Significant SNPs and each one's associated phenotype numbers at the level of .
| SNP | L | I | M | B |
| rs2075650 | 25 | 12 | 3 | 1 |
| LHippVol: | LHippVol: | |||
| RCerebCtx: | ||||
| LMeanTemp: | ||||
| rs11677350 | 1 | 0 | 1 | 1 |
| RCerebWM: | RCerebWM: | |||
| rs4902433 | 2 | 0 | 0 | 0 |
| LMeanLatTemp: | ||||
| LInfTemporal: | ||||
| rs439401 | 6 | 10 | 0 | 0 |
| RMeanLatTemp: | ||||
| RcerebCtx: | ||||
| RMeanTemp: | ||||
| rs11762610 | 2 | 0 | 0 | 0 |
| LFusiform: | ||||
| LInfTemporal: | ||||
| rs1800627 | 1 | 0 | 0 | 0 |
| RAmygVol: | ||||
| rs429358 | 46 | 40 | 5 | 5 |
| LHippVol: | LHippVol: | |||
| LEntCtx: | RHippVol: | |||
| LAmygVol: | LAmygVol: | |||
| rs2931352 | 0 | 0 | 0 | 1 |
| RParahipp: | ||||
| rs11875359 | 0 | 0 | 0 | 1 |
| RInfLatVent: | ||||
|
|
|
|
|
|
In longitudinal analysis, column name “L”, “I” and “M” indicate the number of traits associated with the SNP from the longitudinal joint testing (i.e. with ), testing for interaction (i.e. ) and testing for the main effects (i.e. ); the column named “B” is for cross-sectional analysis of the baseline data. Top 3 SNP-phenotype association are listed with corresponding P-values.
The numbers of the significant SNP-phenotype associations at various levels of false discovery rate (FDR).
| FDR |
|
|
|
|
|
|
| Longitudinal | 112 | 90 | 83 | 64 | 46 | 43 |
| Baseline | 5 | 5 | 5 | 5 | 3 | 3 |
Figure 2Comparison of the Manhattan plots for genome-wide p-values for phenotype left hippocampus volume from longitudinal analysis (left) and from cross-sectional analysis (right); SNP rs429358 is not included due to its small p-value.
Figure 5Comparison of the Q-Q plots for genome-wide p-values for phenotype volume of right inferior lateral ventricle from longitudinal analysis (left) and from cross-sectional analysis (right); SNP rs429358 is not included.
Figure 6Comparison of the Manhattan plots without (left) or with (right) top 10 PCs.
Figure 7Comparison of the Q-Q plots without (left) or with (right) top 10 PCs.
Figure 8The Q-Q plots for genome-wide p-values for phenotype left hippocampus volume from longitudinal analysis based on (a) GEE with the sandwich covariance estimator (left, inflation factor = 1.070), (b) GEE with the model-based covariance estimator (middle, inflation factor = 2.077), and (c) linear mixed model with only a random intercept term (right, inflation factor = 1.976).
Simulation results at significance level with different methods for phenotypic data generated from model (2).
| Type I Error | ||||||
| Model | rs2075650 | rs439401 | ||||
|
|
|
|
|
|
| |
| LME-RSI | 0.007 | 0.044 | 0.087 | 0.007 | 0.039 | 0.097 |
| LME-RI | 0.071 | 0.177 | 0.258 | 0.090 | 0.190 | 0.276 |
| GEE-Robust | 0.008 | 0.045 | 0.089 | 0.008 | 0.045 | 0.106 |
| GEE-Naive | 0.082 | 0.189 | 0.257 | 0.102 | 0.191 | 0.286 |
| Baseline | 0.006 | 0.042 | 0.084 | 0.006 | 0.059 | 0.112 |
LME-RSI: a linear mixed-effects model with random slope and intercept; LME-RI: a linear mixed-effects model with only a random intercept term; GEE-Robust: GEE with the sandwich covariance estimator; GEE-Naive: GEE with the model-based covariance estimator; Baseline: a linear model at the baseline testing for the main effects of an SNP.
Simulation results at significance level with different methods for phenotypic data generated from model (3).
| Type I Error | ||||||||||||
| Model | rs2075650 | rs439401 | rs429358 | |||||||||
|
|
|
|
|
|
|
|
|
|
|
|
| |
| LME-RSI | 0.006 | 0.038 | 0.079 | 0.136 | 0.007 | 0.043 | 0.096 | 0.143 | 0.007 | 0.043 | 0.097 | 0.140 |
| LME-RI | 0.006 | 0.039 | 0.079 | 0.135 | 0.008 | 0.043 | 0.096 | 0.147 | 0.007 | 0.044 | 0.096 | 0.141 |
| GEE-Robust | 0.004 | 0.040 | 0.088 | 0.135 | 0.007 | 0.052 | 0.104 | 0.148 | 0.009 | 0.045 | 0.100 | 0.145 |
| GEE-Naive | 0.000 | 0.012 | 0.031 | 0.043 | 0.001 | 0.010 | 0.033 | 0.059 | 0.001 | 0.010 | 0.023 | 0.044 |
| Baseline | 0.005 | 0.048 | 0.081 | 0.124 | 0.008 | 0.056 | 0.113 | 0.160 | 0.007 | 0.041 | 0.096 | 0.149 |