| Literature DB >> 25491445 |
Xiaoshuai Zhang1, Fuzhong Xue2, Hong Liu3, Dianwen Zhu4, Bin Peng5, Joseph L Wiemels6, Xiaowei Yang7.
Abstract
BACKGROUND: Genome-wide Association Studies (GWAS) are typically designed to identify phenotype-associated single nucleotide polymorphisms (SNPs) individually using univariate analysis methods. Though providing valuable insights into genetic risks of common diseases, the genetic variants identified by GWAS generally account for only a small proportion of the total heritability for complex diseases. To solve this "missing heritability" problem, we implemented a strategy called integrative Bayesian Variable Selection (iBVS), which is based on a hierarchical model that incorporates an informative prior by considering the gene interrelationship as a network. It was applied here to both simulated and real data sets.Entities:
Mesh:
Year: 2014 PMID: 25491445 PMCID: PMC4275962 DOI: 10.1186/s12863-014-0130-7
Source DB: PubMed Journal: BMC Genet ISSN: 1471-2156 Impact factor: 2.797
Figure 1Hierarchical model structure and relationships among the stochastic nodes.
Average AUC values of iBVS, LASSO and Stepwise
|
|
| ||
|---|---|---|---|
|
|
|
| |
|
| 0.911 | 0.891 | 0.869 |
|
| 0.894 | 0.882 | 0.853 |
|
| 0.792 | 0.779 | 0.774 |
Figure 2ROC curves of the three SNP selection strategies on the simulated data. Figure (a) depicts the ROC curves of the simulated data in scenario H70, and Figure (b) depicts the ROC curves of the simulated data in scenario H50.
Figure 3Posterior selection probabilities of SNPs and result of cross-validation in leprosy training Data. Figure (a) depicts the posterior SNP selection probabilities for the 3388 SNPs from the leprosy training data set. Figure (b) depicts the classification error in the conduct of cross-validation on the training data set using the PLS logistic regression model with different selection of top SNPs.
Information of 24 common SNPs selected by both iBVS and Lasso
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|
| rs9270984 | 6 | 32681969 | HLA-DR–DQ | 0.583 | 0.307 | 0.195 |
| rs7595482 | 2 | 38106517 | FAM82A1 | 0.329 | −0.031 | - |
| rs10133203 | 14 | 51425137 | GNG2 | 0.311 | −0.314 | - |
| rs2517467 | 6 | 30997239 | VARS2 | 0.283 | 0.237 | 0.148 |
| rs3764147 | 13 | 43355925 | C13orf31 | 0.272 | 0.227 | 0.114 |
| rs1446297 | 2 | 38061737 | FAM82A1 | 0.256 | −0.208 | - |
| rs2237585 | 7 | 94887754 | PON2 | 0.187 | −0.222 | - |
| rs42490 | 8 | 90847650 | RIPK2 | 0.135 | −0.143 | - |
| rs602875 | 6 | 32681607 | HLA-DR–DQ | 0.104 | −0.090 | - |
| rs16945848 | 15 | 60913837 | TLN2 | 0.093 | 0.245 | - |
| rs241409 | 6 | 32969898 | LOC100294145 | 0.082 | 0.018 | - |
| rs12817755 | 12 | 38585079 | SLC2A13 | 0.08 | −0.137 | - |
| rs1343104 | 20 | 57607136 | PHACTR3 | 0.075 | −0.06 | - |
| rs10502281 | 11 | 123261833 | TMEM225 | 0.071 | −0.105 | - |
| rs2305100 | 13 | 43346934 | CCDC122 | 0.066 | 0.001 | - |
| rs447833 | 20 | 42696770 | ADA | 0.058 | 0.209 | - |
| rs11632705 | 15 | 25141046 | GABRG3 | 0.057 | −0.043 | - |
| rs17065164 | 13 | 43342706 | CCDC122 | 0.051 | −0.066 | - |
| rs11900859 | 2 | 138039737 | THSD7B | 0.051 | 0.071 | - |
| rs241443 | 6 | 32905093 | TAP2 | 0.045 | 0.18 | - |
| rs1897419 | 2 | 137473187 | THSD7B | 0.045 | 0.023 | - |
| rs1805867 | 8 | 91100250 | DECR1 | 0.043 | −0.126 | - |
| rs2517598 | 6 | 30188253 | TRIM31 | 0.043 | 0.248 | - |
| rs17110817 | 14 | 80120188 | CEP128 | 0.04 | 0.005 | - |
Figure 4ROC curves from leprosy testing data set under additive and dominant genetic model. Figure (a) depicts the ROC curves of testing data set with different SNP selection strategies (iBVS, LASSO, and Stepwise) under additive genetic model, and Figure (b) under dominant genetic model.