| Literature DB >> 21304821 |
Feng Zhang1, Xiong Guo, Hong-Wen Deng.
Abstract
Because of combining the genetic information of multiple loci, multilocus association studies (MLAS) are expected to be more powerful than single locus association studies (SLAS) in disease genes mapping. However, some researchers found that MLAS had similar or reduced power relative to SLAS, which was partly attributed to the increased degrees of freedom (dfs) in MLAS. Based on partial least-squares (PLS) analysis, we develop a MLAS approach, while avoiding large dfs in MLAS. In this approach, genotypes are first decomposed into the PLS components that not only capture majority of the genetic information of multiple loci, but also are relevant for target traits. The extracted PLS components are then regressed on target traits to detect association under multilinear regression. Simulation study based on real data from the HapMap project were used to assess the performance of our PLS-based MLAS as well as other popular multilinear regression-based MLAS approaches under various scenarios, considering genetic effects and linkage disequilibrium structure of candidate genetic regions. Using PLS-based MLAS approach, we conducted a genome-wide MLAS of lean body mass, and compared it with our previous genome-wide SLAS of lean body mass. Simulations and real data analyses results support the improved power of our PLS-based MLAS in disease genes mapping relative to other three MLAS approaches investigated in this study. We aim to provide an effective and powerful MLAS approach, which may help to overcome the limitations of SLAS in disease genes mapping.Entities:
Mesh:
Year: 2011 PMID: 21304821 PMCID: PMC3033421 DOI: 10.1371/journal.pone.0016739
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Power comparing results of PLS-based MLAS (PLS_MLAS), PCA-based MLAS (PCA_MLAS), tagSNPs-based MLAS (tagSNPs_MLAS), TSM-based MLAS using F test (FTSM) and TSM-based MLAS using Wald test (WTSM) under the epistatic model.
Figure 2Power comparing results of PLS-based MLAS (PLS_MLAS), PCA-based MLAS (PCA_MLAS), tagSNPs-based MLAS (tagSNPs_MLAS), TSM-based MLAS using F test (FTSM) and TSM-based MLAS using Wald test (WTSM) under the additive model.
Figure 3Plot of genome-wide MLAS results of lean body mass implemented by PLS-based MLAS.
Significant genes are highlighted in red.
Comparison of MLAS and SLAS results of the 17 genes detected by PLS-based MLAS of lean body mass.
| Genes | PLS-based MLAS | SLAS | ||
| P values | FDR | P values | FDR | |
| ADAMTS1 | 1.00×10−5 | 0.045 | 3.31×10−5 | 0.514 |
| ANGPT2 | 2.00×10−5 | 1.00×10−5 | 4.76×10−4 | 0.906 |
| ATP8A2 | 1.30×10−4 | 0.045 | 0.016 | 0.919 |
| DKK2 | 3.00×10−5 | 0.037 | 5.66×10−3 | 0.910 |
| FAM13A1 | 8.10×10−4 | 0.022 | 5.64×10−4 | 0.906 |
| FGF10 | 1.00×10−5 | 0.037 | 1.16×10−4 | 0.753 |
| GPR158 | 4.00×10−5 | 1.00×10−5 | 1.56×10−4 | 0.798 |
| PTPRM | 3.00×10−5 | 2.00×10−5 | 6.40×10−3 | 0.910 |
| SDK2 | 7.00×10−5 | 3.00×10−5 | 0.029 | 0.935 |
| SLC28A3 | 3.60×10−4 | 5.00×10−5 | 1.36×10−3 | 0.909 |
| TNFRSF21 | 0.020 | 6.68×10−3 | 1.74×10−4 | 0.811 |
| TNFSF10 | 3.00×10−5 | 0.039 | 0.020 | 0.919 |
| TRHR | 2.20×10−3 | 0.021 | 7.55×10−8 | 0.029 |
| TRPC6 | 1.24×10−3 | 0.013 | 2.97×10−3 | 0.910 |
| TSPYL5 | 1.90×10−4 | 0.016 | 1.62×10−3 | 0.910 |
| ZBTB43 | 1.00×10−5 | 0.018 | 3.53×10−3 | 0.910 |
| ZFP37 | 3.00×10−5 | 0.028 | 0.011 | 0.910 |
denote the smallest P value of each gene obtained from our previous genome-wide SLAS of lean body mass.
Parameter configurations used in our simulation study.
| Genetic effect | ||||
| Epistatic model | D' | SNP6 | SNP10 | SNP6×SNP10 |
| 0.9∼1.0 | 0.020 | 0.010 | 0.000 | |
| 0.8∼0.9 | 0.018 | 0.008 | 0.004 | |
| 0.7∼0.8 | 0.016 | 0.006 | 0.008 | |
| 0.6∼0.7 | 0.014 | 0.004 | 0.012 | |
denote the phenotypic variance explained by additive effects of causal SNP 6 and SNP 10 as well as interactive effect between SNP 6 and SNP 10, respectively.
denote the phenotypic variance explained by additive effect of causal SNP 8.
the basic parameter configuration is highlighted in bold. Each possible parameter setting can be obtained by replacing one entry of the basic parameter configuration with a different entry of corresponding parameter.