| Literature DB >> 36229773 |
Benjamin F Voight1,2,3,4, Wei-Ting Hwang5, Zhuoran Ding6, Marylyn D Ritchie6,7,8.
Abstract
BACKGROUND: Observational studies and Mendelian randomization experiments have been used to identify many causal factors for complex traits in humans. Given a set of causal factors, it is important to understand the extent to which these causal factors explain some, all, or none of the genetic heritability, as measured by single-nucleotide polymorphisms (SNPs) that are associated with the trait. Using the mediation model framework with SNPs as the exposure, a trait of interest as the outcome, and the known causal factors as the mediators, we hypothesize that any unexplained association between the SNPs and the outcome trait is mediated by an additional unobserved, hidden causal factor.Entities:
Keywords: Gaussian mixture model; Markov-Chain Monte-Carlo; Mediation; Obesity
Mesh:
Substances:
Year: 2022 PMID: 36229773 PMCID: PMC9559019 DOI: 10.1186/s12859-022-04977-4
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.307
Fig. 1The mediation model framework. (A) The observed model. (B) The hypothesized model
Fig. 2Flowchart of the multi-step method. In the first step, we use linear regression models to estimate the SNP effects on the known mediators () and the direct effects between the SNPs and the outcome (). In the second step, we apply the EM algorithm to fit GMMs on and . In the third step, a MCMC procedure is performed using the estimated GMM parameters from the last step in the priors to generate a posterior distribution for the hidden mediator’s effect size, b
Parameters specifications of the 9 simulation settings
| Setting | Know mediator effects | Sample size | Number of known mediators | Associations among mediators | Level 1 and Level 2 standard deviation ratios | Hidden mediator effect | Number of SNPs | |
|---|---|---|---|---|---|---|---|---|
| Base case | 0.8 | (0.4, 0.2, 0.3, 0.2, 0.4) | 100,000 | 5 | Independent | 0:1 | Non-zero | 70; 500 |
| Setting 1 | 0.3; 0.5; 1 | (0.4, 0.2, 0.3, 0.2, 0.4) | 100,000 | 5 | Independent | 0:1 | Non-zero | 70; 500 |
| Setting 2 | 0.8 | (0.4, 0.2, 0.3, -0.2, -0.4) | 100,000 | 5 | Independent | 0:1 | Non-zero | 70; 500 |
| Setting 3 | 0.8 | (0.4, 0.2, 0.3, 0.2, 0.4) | 25,000; 50,000 | 5 | Independent | 0:1 | Non-zero | 70; 500 |
| Setting 4 | 0.8 | (0.4); (0.4, 0.2, 0.3, 0.2, 0.4, 0.2, 0.1, 0.3, 0.2, 0.2) | 100,000 | 1; 10 | Independent | 0:1 | Non-zero | 70; 500 |
| Setting 5 | 0.8 | (0.4, 0.2, 0.3, 0.2, 0.4) | 100,000 | 5 | 0:1 | Non-zero | 70; 500 | |
| Setting 6 | 0.8 | (0.4, 0.2, 0.3, 0.2, 0.4) | 100,000 | 5 | Independent | 1:3; 1:1 | Non-zero | 70; 500 |
| Setting 7 | / | (0.4, 0.2, 0.3, 0.2, 0.4) | 100,000 | 5 | Independent | 0:1 | Zero | 70; 500 |
| Setting 8 | 0.8 | (0.4, 0.2, 0.3, 0.2, 0.4) | 100,000 | 5 | Independent | 0:1 | Non-zero | 20; 40; 700 |
Parameters separated by ";" belong to separate simulations
Fig. 3Results of one simulation setting: base case, varying b, 1000 simulations. The first and second row presents the results for 70 SNPs and 500 SNPs, respectively. (A, E) Box plots of the posterior median and the mean of b. The purple lines indicate the true values. (B, F) Box plots of the widths of 90% HDIs and QIs. (C, G) The posterior medians and the 90% HDIs of the 49 equally spaced values of b between 0.02 and 0.5. (D, H) The posterior means and the 90% QIs of the 49 equally spaced values of b between 0.02 and 0.5. Outliers are defined as the values more extreme than the third quartile + 1.5 * (the third quartile—the first quartile) or the first quartile—1.5 * (the third quartile—the first quartile)
Fig. 4Histograms of the regression estimated effects. (A) Estimated SNP effects on BMI (a). (B) Estimated SNP effects on BMI with outliers removed. (C) Estimated direct effects between the SNPs and WHR (c). (D) Estimated direct effects between the SNPs and WHR with outliers removed