| Literature DB >> 22373126 |
Abstract
Genome-wide association studies are largely based on single-nucleotide polymorphisms and rest on the common disease/common variants (single-nucleotide polymorphisms) hypothesis. However, it has been argued in the last few years and is well accepted now that rare variants are valuable for studying common diseases. Although current genome-wide association studies have successfully discovered many genetic variants that are associated with common diseases, detecting associated rare variants remains a great challenge. Here, we propose two partial least-squares approaches to aggregate the signals of many single-nucleotide polymorphisms (SNPs) within a gene to reveal possible genetic effects related to rare variants. The availability of the 1000 Genomes Project offers us the opportunity to evaluate the effectiveness of these two gene-based approaches. Compared to results from a SNP-based analysis, the proposed methods were able to identify some (rare) SNPs that were missed by the SNP-based analysis.Entities:
Year: 2011 PMID: 22373126 PMCID: PMC3287853 DOI: 10.1186/1753-6561-5-S9-S19
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Figure 1The GBPLS algorithm for Q (j = 1, 2).
Figure 2Receiver operating characteristic curves for trait Q TPR (or FPR) is the proportion of significant genes that are among true associated (or true unassociated) genes in at least c replications (c = 1, 2, 3, …, 25).
False-positive and true-positive rates for the GBPLS1 and GBPLS2 methods for selected values of c, t, and q
| Trait | GBPLS1 | GBPLS2 | ||
|---|---|---|---|---|
| TPR | FPR | TPR | FPR | |
| Q1 | 0.89 | 0.13 | 1 | 0.15 |
| Q2 | 0.85 | 0.12 | 0.85 | 0.10 |
Summary of results for Q1
| Gene | GBPLS1 SNPs | GBPLS2 SNPs |
|---|---|---|
| C1S6533, C1S6542, | C1S6533, C1S6540, C1S6542 | |
| None | ||
| C13S431, C13S522, C13S523, C13S524 | C13S431, C13S522, C13S523, C13S524 | |
| C5S5133, | C5S5133 | |
| C14S1729, C14S1734 | C14S1729, C14S1734 | |
| N/A | None | |
| C4S1861, | ||
| C6S2981 | C6S2981 | |
Listed in the first column are the genes that are associated with Q1 in the simulation model. “N/A” means that the gene is not detected, and “None” means that none of the causal SNPs are detected even though the gene is detected by the corresponding method. The SNP names in bold are those with MAF = 0.000717.