| Literature DB >> 25519365 |
Yun Ju Sung1, Jacob Basson1, Dabeeru C Rao1.
Abstract
Analysis of longitudinal family data is challenging because of 2 sources of correlations: correlations across longitudinal measurements and correlations among related individuals. We investigated whether analysis using long-term average (average of all 3 visits) can enhance gene discovery compared with a single-visit analysis. We analyzed all 200 replicates of simulated systolic blood pressure (SBP) in Genetic Analysis Workshop 18 (GAW18) family data using both single-marker and collapsing methods. We considered 2 collapsing approaches: collapsing all variants and collapsing low-frequency variants. Analysis using long-term average performed slightly better than SBP measured at a single visit. Collapsing all variants performed much better than collapsing low-frequency variants at MAP4 and FLNB, which included a common variant with a relatively large effect. For several variants in gene MAP4, single-marker analysis also provided high power. In contrast, collapsing only low-frequency variants performed much better for SCAP, DNASE1L3, and LOC152217, where rare variants in these genes had larger effect than common variants. However, for other causal variants, all approaches provided disappointingly poor performance. This poor performance appeared to occur because most of these causal variants explained a very small fraction of phenotypic variance. We also found that collapsing multiple variants did worse than single-marker analysis for several genes when they contained causal single-nucleotide polymorphisms (SNPs) with both positive and negative effects. Because half of causal SNPs were not found in the annotation file based on the 1000 Genomes Project, we found that power was also affected by our use of incomplete annotation information.Entities:
Year: 2014 PMID: 25519365 PMCID: PMC4143632 DOI: 10.1186/1753-6561-8-S1-S12
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Summary statistics of single-marker analysis at 134 causal variants on chromosome 3
| MAF | % Variance SBP | Power SBP1 | Power SBP3 | Power LTA | |
|---|---|---|---|---|---|
| Minimum | 0.0016 | 0.00000 | 0.0050 | 0.0050 | 0.0000 |
| 1st quartile | 0.0049 | 0.00001 | 0.0250 | 0.0250 | 0.0150 |
| Median | 0.0082 | 0.00001 | 0.0350 | 0.0400 | 0.0350 |
| 3rd quartile | 0.0636 | 0.00004 | 0.0925 | 0.0800 | 0.1025 |
| Maximum | 0.4947 | 0.02785 | 1.0000 | 1.0000 | 1.0000 |
| Mean | 0.0698 | 0.00064 |
LTA, long-term average; MAF, minor allele frequency; SBP, systolic blood pressure.
The bold text indicates the overall power (power averaged across all 134 causal variants.
Causal variants with empirical power over 0
| Position | Gene | MAF | % Variance explained | SBP1 | SBP3 | LTA |
|---|---|---|---|---|---|---|
| 47956424 | 0.995 | 0.885 | 0.990 | |||
| 47957996 | 0.0301 | 1.000 | 1.000 | 1.000 | ||
| 47958037 | <1.0E-5 | 0.980 | 0.875 | 0.970 | ||
| 47973345 | 0.0082 | 0.00049 | 0.955 | 0.870 | 0.975 | |
| 48040283 | 0.0318 | 1.000 | 1.000 | 1.000 | ||
| 48040284 | 0.0131 | 0.775 | 0.515 | 0.775 | ||
| 48054461 | 0.00030 | 0.640 | 0.420 | 0.670 | ||
| 58109162 | 0.00273 | 0.580 | 0.510 | 0.695 | ||
| 141160882 | 0.00022 | 0.525 | 0.315 | 0.550 | ||
| 141162128 | 0.00061 | 0.505 | 0.310 | 0.550 |
LTA, long-term average; MAF, minor allele frequency; SBP, systolic blood pressure.
Bold text for MAF indicates MAF greater than 0.05; Also bold text for proportion (%) of variance indicates that with greater than 0.01.
Spearman correlation across empirical powers at all causal genes
| Correlation | Collapse all | Collapse rare | |||||
|---|---|---|---|---|---|---|---|
| Collapse all | SBP1 | −0.11 | 0.18 | 0.04 | |||
| SBP3 | −0.36 | 0.01 | −0.11 | ||||
| LTA | −0.14 | 0.13 | 0.04 | ||||
| Collapse rare | SBP1 | 0.78 | |||||
| SBP3 | |||||||
| LTA | |||||||
Bold text indicates correlation greater than 0.8 between empirical powers of two approaches.
LTA, long-term average; SBP, systolic blood pressure.
Empirical power at 0.05 using collapsing and single-marker approaches at all 22 causal genes for long-term average phenotype
| Causal genes | Total SNPs | Causal SNPs1 | % Variance explained2 | Empirical Power | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Common | Rare3 | Common | Rare | Common | Rare | Collapse all SNPs | Collapse rare SNPs | Best rare4 | ||
| 1 | 149 | 739 | 3 | 12 | 0.17 | 1.00 | ||||
| 321 | 628 | 1 | 5 | 0.00007 | 0.03 | 0.20 | ||||
| 899 | 1303 | 2 | 8 | 0.00001 | 0.00006 | 0.07 | 0.11 | |||
| 1 | 9 | 0 | 0 | 0.04 | ||||||
| 7 | 41 | 2 | 0 | 0.02 | ||||||
| 157 | 332 | 3 | 6 | 0.00014 | 0.00017 | 0.00 | 0.09 | |||
| 2 | 175 | 408 | 5 | 4 | 0.00087 | 0.00003 | 0.09 | 0.14 | ||
| 57 | 79 | 4 | 4 | 0.00005 | 0.01 | 0.08 | ||||
| 3 | 4 | 1 | 0 | 0.00001 | 0.04 | |||||
| 31 | 174 | 0 | 2 | 0.00004 | 0.03 | 0.05 | ||||
| 72 | 216 | 0 | 8 | 0.00011 | 0.03 | 0.07 | ||||
| 230 | 452 | 1 | 3 | 0.00002 | 0.00006 | 0.02 | 0.04 | |||
| 3 | 5 | 18 | 0 | 1 | <1.0E-5 | 0.12 | 0.03 | |||
| 145 | 318 | 2 | 3 | 0.00025 | 0.00003 | 0.04 | 0.14 | |||
| 236 | 502 | 1 | 2 | 0.00008 | <1.0E-5 | 0.03 | 0.06 | |||
| 130 | 275 | 0 | 5 | 0.00007 | 0.02 | 0.03 | ||||
| 158 | 329 | 1 | 2 | 0.00002 | 0.00004 | 0.02 | 0.03 | |||
| 198 | 860 | 1 | 11 | <1.0E-5 | 0.00010 | 0.02 | 0.01 | |||
| 4 | 51 | 83 | 0 | 2 | 0.00001 | 0.05 | 0.03 | 0.05 | ||
| 363 | 852 | 0 | 5 | 0.00005 | 0.04 | 0.03 | 0.04 | |||
| 294 | 516 | 0 | 0 | 0.04 | 0.02 | |||||
| 90 | 126 | 1 | 0 | 0.00002 | 0.01 | 0.00 | ||||
| Overall power | 0.139 | 0.095 | ||||||||
| Type I error | 0.071 | 0.057 | ||||||||
Bold type for % variance indicates 3 largest among common variants and 2 largest among rare variants; bold type for empirical power indicates the highest across three analysis options.
1Number of causal single-nucleotide polymorphisms (SNPs) is computed based on the used annotation file (not based on the answers).
2% Variance explained is computed by adding the percent variance explained by each SNP based on the used annotation file (not based on the answers).
3Rare SNPs are defined as SNPs with minor allele frequencies (MAFs) less than 0.5.
4Best rare is the best power across all rare variants in a gene.