| Literature DB >> 30275896 |
Jason Vander Woude1,2, Jordan Huisman1, Lucas Vander Berg1, Jenna Veenstra1,3, Abbey Bos3, Anya Kalsbeek3, Karissa Koster1, Nathan Ryder1, Nathan L Tintle1.
Abstract
Although methylation data continues to rise in popularity, much is still unknown about how to best analyze methylation data in genome-wide analysis contexts. Given continuing interest in gene-based tests for next-generation sequencing data, we evaluated the performance of novel gene-based test statistics on simulated data from GAW20. Our analysis suggests that most of the gene-based tests are detecting real signals and maintaining the Type I error rate. The minimum p value and threshold-based tests performed well compared to single-marker tests in many cases, especially when the number of variants was relatively large with few true causal variants in the set.Entities:
Year: 2018 PMID: 30275896 PMCID: PMC6157195 DOI: 10.1186/s12919-018-0124-y
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Overview of gene-based test statistics considered
| SNP*CPG, | |
|---|---|
| Sum of natural log-transformed |
|
| Sum of negative squared natural log-transformed |
|
| Minimum |
|
|
| |
|
| |
|
|
Proportion of times test statistic, G, was rejected (p < 0.05) across 200 simulations, by choice of test statistic and by type of gene
| Statistic | Contains major causal variants | Contains minor causal variants | Contains no causal variants |
|---|---|---|---|
| Sum ln | 0.367 | 0.07 | 0.04 |
| Sum −(ln | 0.398 | 0.06 | 0.04 |
| Min | 0.431 | 0.03 | 0.02 |
| 0.460 | 0.04 | 0.03 | |
| 0.469 | 0.04 | 0.03 | |
| 0.467 | 0.03 | 0.02 | |
| Single markera | 0.403 | 0.03 | 0.02 |
aSingle marker test used a Bonferroni-corrected significance threshold of
erformance across major-effect genes
| Gene | SNP heritability | No. SNP-CpG pairs | MAF of causal variant | Sum ln | Sum −ln2 | Min | Single marker | |||
|---|---|---|---|---|---|---|---|---|---|---|
| SIPA1L2a | .125 | 141 | 0.11 | 0.35 | 0.42 | 0.73 | 0.58 | 0.65 | 0.72 | 0.69 |
| SYNTH1b | .100 | 23 | 0.19 | 0.79 | 0.74 | 0.48 | 0.65 | 0.61 | 0.52 | 0.41 |
| LYRM4 | .075 | 63 | 0.10 | 0.17 | 0.18 | 0.22 | 0.23 | 0.22 | 0.25 | 0.21 |
| HS3ST3A1 | .050 | 29 | 0.41 | 0.24 | 0.22 | 0.21 | 0.24 | 0.24 | 0.21 | 0.19 |
| MSRB2a | .025 | 32 | 0.14 | 0.72 | 0.72 | 0.41 | 0.65 | 0.59 | 0.48 | 0.39 |
aNearest gene within 50,000 bp of major-effect SNP
bArtificial “gene” containing all SNPs within 50,000 bp of major effect SNP
Performance across sets of SNP-CpG variant pairs containing major-effect variants
| Gene | Total SNP heritability | No. SNP-CpG pairs | Sum ln | Sum-ln2 | Min | Single marker | |||
|---|---|---|---|---|---|---|---|---|---|
| CAUSAL5 | 0.375 | 5 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
| UNION5 | 0.375 | 288 | 0.73 | 0.86 | 0.74 | 0.93 | 0.92 | 0.89 | 0.70 |
| UNION2 | 0.125 | 92 | 0.23 | 0.26 | 0.26 | 0.25 | 0.26 | 0.27 | 0.23 |
| NOISE5 | 0.375 | 288 | 0.06 | 0.11 | 0.64 | 0.50 | 0.61 | 0.70 | 0.64 |
| NOISE2 | 0.125 | 92 | 0.03 | 0.09 | 0.20 | 0.14 | 0.15 | 0.18 | 0.20 |