| Literature DB >> 22373183 |
Kith Pradhan1, Seungtai Chris Yoon, Tao Wang, Kenny Ye.
Abstract
We propose a factor-screening method based on a Bayesian model selection framework and apply it to Genetic Analysis Workshop 17 simulated data with unrelated individuals to identify genes and SNP variants associated with the quantitative trait Q1. A Metropolis-Hasting algorithm is implemented to generate a posterior distribution in a restricted model space and thus the marginal posterior distribution of each variant. Our framework provides flexibility to make inferences on either individual variants or genes. We obtained results for 10 simulated data sets. Our methods are able to identify FTP1 and KDR, two genes that are associated with Q1 in a majority of replicates.Entities:
Year: 2011 PMID: 22373183 PMCID: PMC3287876 DOI: 10.1186/1753-6561-5-S9-S4
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Figure 1Marginal posterior probability of SNP variants The marginal posterior probability of 24,487 variants obtained for the first three replicates are displayed. Red dots mark the Q1-associated variants.
SNPs with high marginal posterior probabilities
| Appearances in top 20 SNPs among 10 replicates | SNPs (genes), |
|---|---|
| 9 | C13S523 ( |
| 8 | C13S522 ( |
| 5 | C13S431 ( |
| 4 | C4S1884 ( |
| 2 | C4S1878 ( |
| 1 | C13S524 ( |
The SNP variants that have posterior probabilities in the top 20, including both Q1-associated SNPs and false positives, are listed. The symbols of Q1-associated variants are given and those not associated with Q1 are omitted. The first column shows the frequency of a variant in the top 20 among 10 replicates.
SNPs with high marginal posterior probabilities
| SNP | Gene | Posterior probability > 0.5 | Posterior probability > 0.1 |
|---|---|---|---|
| C13S431 | 1 | 4 | |
| C13S522 | 6 | 8 | |
| C13S523 | 9 | 9 | |
| C13S534 | 0 | 1 | |
| C4S1878 | 0 | 2 | |
| C4S1884 | 2 | 4 | |
| C6S2981 | 0 | 1 | |
| Other Q1 | 0 | 0 | |
| Non-Q1 | 3 | 36 | |
| False-positive discovery rate | 3/21 | 36/65 |
The first column lists Q1-associated SNP variants that have marginal posterior probabilities larger that 0.1 in any of the 10 replicates. The second column gives the genes on which these variants sit. The third and fourth columns are frequency of their posterior probabilities larger than 0.1 and 0.5, respectively; they summarize all variants not associated with Q1 but with a posterior probability greater than 0.1 or 0.5. The false discovery rates are given in the last row.
Average marginal posterior/prior probability ratio per variants for Q1-associated genes
| Gene | Posterior/prior ratio |
|---|---|
| 1.79 | |
| 1.31 | |
| 113 | |
| 0.649 | |
| 0.970 | |
| 0.483 | |
| 50.5 | |
| 11.6 | |
| 2.50 |
For each of the Q1-associated genes, the average marginal posterior probability over all its SNP variants is computed and divided by the average marginal prior probability 10/24,487. The second column lists such ratios averaged over 10 replicates. A higher ratio indicates more importance of the gene.
Genes with high marginal posterior probabilities
| Appearances in the top 10 genes among 10 replicates | ||
|---|---|---|
| 10 | ||
| 6 | − | |
| 5 | − | |
| 2 | Two genes | Four genes |
| 1 |
The genes that have posterior probabilities in the top 10, including both Q1-associated genes and false positives, are listed. The symbols of Q1-associated genes are given and those not associated with Q1 are omitted. The first column shows the frequency of a variant in the top 10 among 10 replicates. The second and third columns show the results at m = 3 and m = 6, respectively, where m is the fixed number of factors entertained in a model of the model space.
Marginal posterior probability for Q1-associated genes
| Gene | Posterior probability | Posterior probability > 0.5 | Posterior probability > 0.1 | |||
|---|---|---|---|---|---|---|
| 0.00521 | 0.00175 | 0 | 0 | 0 | 0 | |
| 0.00001 | 0.00065 | 0 | 0 | 0 | 0 | |
| 0.98967 | 0.96252 | 10 | 10 | 10 | 10 | |
| 0 | 0.00053 | 0 | 0 | 0 | 0 | |
| 0.00596 | 0.00704 | 0 | 0 | 0 | 0 | |
| 0 | 0 | 0 | 0 | 0 | 0 | |
| 0.32353 | 0.40953 | 4 | 3 | 4 | 6 | |
| 0.0006 | 0.0434 | 0 | 0 | 0 | 1 | |
| 0.00129 | 0.00256 | 0 | 0 | 0 | 0 | |
| Non-Q1 | ||||||
| False-positive discovery rate | 5/19 | 7/20 | 18/32 | 51/68 | ||
Marginal posterior probabilities of nine Q1-associated genes averaged over 10 replicates are listed in the second and third columns. For each gene, the number of times that its posterior probability is greater than 0.5 is listed in columns 4 and 5, and the number of times that it is greater than 0.1 is listed in columns 6 and 7. The second to the last row lists the total number of times that genes not associated with Q1 have a posterior probability greater than 0.1 and 0.5. The false-positive discovery rate is given in the last row. The results at m = 3 and m = 6 are listed in separate columns, where m is the fixed number of factors entertained in a model of the model space.