| Literature DB >> 22373362 |
Aimin Yan1, Nan M Laird, Cheng Li.
Abstract
Recent advances in next-generation sequencing technologies have made it possible to generate large amounts of sequence data with rare variants in a cost-effective way. Statistical methods that test variants individually are underpowered to detect rare variants, so it is desirable to perform association analysis of rare variants by combining the information from all variants. In this study, we use a Bayesian regression method to model all variants simultaneously to identify rare variants in a data set from Genetic Analysis Workshop 17. We studied the association between the quantitative risk traits Q1, Q2, and Q4 and the single-nucleotide polymorphisms and identified several positive single-nucleotide polymorphisms for traits Q1 and Q2. However, the model also generated several apparent false positives and missed many true positives, suggesting that there is room for improvement in this model.Entities:
Year: 2011 PMID: 22373362 PMCID: PMC3287941 DOI: 10.1186/1753-6561-5-S9-S99
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Association analysis for trait Q1
| Gene | SNP | Regression coefficient | PPA | MAF |
|---|---|---|---|---|
| C13S431 | 2.501 (2) | 0.243 (3) | 0.017217 | |
| C13S522 | 2.188 (3) | 0.264 (2) | 0.027977 | |
| C13S523 | 9.027 (1) | 0.998 (1) | 0.066714 | |
| C1S6533 | 0.478 (6) | 0.043 (4) | 0.011478 | |
| C4S1884 | 0.126 (42) | 0.016 (8) | 0.020803 |
The third and fourth columns are the average regression coefficients and posterior probabilities of association (PPAs) out of 200 replications, respectively. The numbers in parentheses for the regression coefficients indicate the rank of the SNP based on sorting the absolute values of average regression coefficients in decreasing order. The numbers in parentheses for the PPAs indicate the rank of the SNP based on sorting the PPAs in decreasing order. We use the same notation in Table 2.
Association analysis for trait Q2
| Gene | SNP | Regression coefficient | PPA | MAF |
|---|---|---|---|---|
| C6S5380 | 0.251 (1) | 0.077 (1) | 0.170732 | |
| C6S5449 | 0.194 (2) | 0.015 (3) | 0.010043 | |
| C6S5441 | 0.038 (25) | 0.010 (4) | 0.098278 | |
| C8S442 | 0.152 (5) | 0.016 (2) | 0.015782 | |
| C10S3050 | 0.170 (3) | 0.008 (6) | 0.002152 |
See Table 1 notes for an explanation of the notation.
AUC of the model using all SNPs and the rare variants only for traits Q1 and Q2
| Trait | AUC based on regression coefficients | AUC based on PPAs |
|---|---|---|
| Q1 | 0.791 (all SNPs ) | 0.808 (all SNPs) |
| 0.699 (RV only) | 0.724 (RV only) | |
| Q2 | 0.776 (all SNPs) | 0.774 (all SNPs) |
| 0.724 (RV only) | 0.718 (RV only) |
“All SNPs” indicates our analysis based on all SNPs; “RV only” indicates our analysis based on only the rare variants (MAF < 0.01).