| Literature DB >> 25070353 |
Abstract
BACKGROUND: Both common and rare genetic variants have been shown to contribute to the etiology of complex diseases. Recent genome-wide association studies (GWAS) have successfully investigated how common variants contribute to the genetic factors associated with common human diseases. However, understanding the impact of rare variants, which are abundant in the human population (one in every 17 bases), remains challenging. A number of statistical tests have been developed to analyze collapsed rare variants identified by association tests. Here, we propose a haplotype-based approach. This work inspired by an existing statistical framework of the pedigree disequilibrium test (PDT), which uses genetic data to assess the effects of variants in general pedigrees. We aim to compare the performance between the haplotype-based approach and the rare variant-based approach for detecting rare causal variants in pedigrees.Entities:
Mesh:
Year: 2014 PMID: 25070353 PMCID: PMC4131059 DOI: 10.1186/1471-2164-15-632
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Type I errors and power on effect of the proportion of causal SNVs ( ) under significance level of 0.05
| # of variants |
|
|
|
|
|
| |||
|---|---|---|---|---|---|---|---|---|---|
| Type I error |
|
| 21.82(sd=4.63) | 0.053 | 0.050 | 0.035 | 0.045 | 0.052 | 0.041 |
| Power |
|
| 23.02(sd=4.90) | 0.456 | 0.906 | 0.275 | 0.465 | 0.894 | 0.326 |
|
| 22.63(sd=4.64) | 0.373 | 0.734 | 0.230 | 0.396 | 0.744 | 0.242 | ||
|
| 22.65(sd=5.11) | 0.316 | 0.515 | 0.163 | 0.306 | 0.525 | 0.177 | ||
|
| 22.31(sd=4.56) | 0.261 | 0.265 | 0.096 | 0.251 | 0.272 | 0.132 | ||
|
| 21.89(sd=4.65) | 0.148 | 0.090 | 0.073 | 0.154 | 0.083 | 0.065 |
Note: (1) “# of variants” indicates the mean and standard deviation (sd) values of the number of SNVs in a 20Kb test region on 1000 replicates (10 permutations for each replicate); (2) d% is the proportion of rare variants in the causal region to be causal; (3) r% is the proportion of causal variants increase risk.
Abbreviations: maxH: maximal haplotype test; hPDT: haplotype-based pedigree disequilibrium test; hPDT-t: haplotype-based pedigree disequilibrium test with training data; maxV: maximal rare variant test; vPDT: variant-based pedigree disequilibrium test; vPDT-t: rare variant based pedigree disequilibrium test with training data.
Power and effect of the proportion of positive causal SNVs ( ) under significance level of 0.05
| Power | # of variants |
|
|
|
|
|
| |
|---|---|---|---|---|---|---|---|---|
|
|
| 23.10(sd=5.04) | 0.439 | 0.882 | 0.255 | 0.463 | 0.879 | 0.313 |
|
| 22.21(sd=4.87) | 0.390 | 0.617 | 0.229 | 0.390 | 0.598 | 0.238 | |
|
| 21.60(sd=4.84) | 0.294 | 0.161 | 0.174 | 0.293 | 0.152 | 0.199 | |
|
| 20.93(sd=4.68) | 0.167 | 0.089 | 0.087 | 0.158 | 0.095 | 0.090 | |
|
| 20.17(sd=4.72) | 0.045 | 0.288 | 0.048 | 0.044 | 0.293 | 0.062 |
Note: (1) “# of variants” indicates the mean and standard deviation (sd) values of the number of SNVs in a 20Kb test region on 1000 replicates (10 permutations for each replicate); (2) d% is the proportion of rare variants in the causal region to be causal; (3) r% is the proportion of causal variants increase risk.
Abbreviations: maxH: maximal haplotype test; hPDT: haplotype-based pedigree disequilibrium test; hPDT-t: haplotype-based pedigree disequilibrium test with training data; maxV: maximal rare variant test; vPDT: variant-based pedigree disequilibrium test; vPDT-t: rare variant based pedigree disequilibrium test with training data.
Power and effects of LD under special haplotype setting
| β settings |
|
|
|
|
|
|
|---|---|---|---|---|---|---|
| (1) | 0.589 | 0.114 | 0.394 | 0.158 | 0.110 | 0.065 |
| (2) | 0.541 | 0.088 | 0.331 | 0.505 | 0.258 | 0.168 |
| (3) | 0.488 | 0.412 | 0.230 | 0.691 | 0.616 | 0.391 |
Note: (1) Other βs are zero if not specified under each β setting; (2) For fifteen haplotypes, 1010000000, 0110000000, 1001000000, 0101000000, 1000000000, 0100000000, 0010000000, 0001000000, 0000100000, 0000010000, 0000001000, 0000000100, 0000000010, 0000000001, 0000000000, the first fourteen rare haplotypes have equal frequencies of 0.01 and the last common haplotype has frequency of 0.86; (3) Simulation results are based on 1000 replicates (10 permutations for each replicate).
Abbreviations: maxH maximal haplotype test, hPDT haplotype-based pedigree disequilibrium test; hPDT-t haplotype-based pedigree disequilibrium test with training data, maxV maximal rare variant test, vPDT variant-based pedigree disequilibrium test, vPDT-t rare variant based pedigree disequilibrium test with training data.
The significant genes from FHS study with p-values < 0.001 at any test
| gene | Chromosome | Nsnp |
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|
| PLG | chr6 | 10 | 0.02498 |
| 0.08819 | 0.43656 | 0.50666 | 1 |
| TFPI | Chr2 | 18 | 0.00099 | 0.00146 | 9.76E-05 | 0.45654 | 0.88330 | 1 |
| TNFRSF4 | chr1 | 7 | 1 | 0.71721 | 0.00056 | 0.64336 | 0.37994 | 1 |
| TGFB3 | chr14 | 10 | 0.01998 | 0.00060 | 1 | 0.16284 | 0.62846 | 1 |
| IL1R2 | chr2 | 21 | 0.00200 | 0.19275 | 0.00067 | 0.16783 | 0.91804 | 1 |
| MMP16 | chr8 | 9 | 0.09790 | 0.00090 | 1 | 1 | 0.31731 | 1 |
| LRP2 | chr2 | 21 | 0.00100 | 0.61149 | 0.00091 | 0.03197 | 0.14087 | 1 |
| LEP | chr7 | 18 | 0.05395 | 0.00093 | 0.06325 | 0.66533 | 0.35130 | 1 |
Abbreviations: maxH maximal haplotype test, hPDT haplotype-based pedigree disequilibrium test, hPDT-t haplotype-based pedigree disequilibrium test with training data, maxV maximal rare variant test, vPDT variant-based pedigree disequilibrium test, vPDT-t rare variant based pedigree disequilibrium test with training data.