| Literature DB >> 22373501 |
Heejong Sung1, Yoonhee Kim, Juanliang Cai, Cheryl D Cropp, Claire L Simpson, Qing Li, Brian C Perry, Alexa Jm Sorant, Joan E Bailey-Wilson, Alexander F Wilson.
Abstract
Tiled regression is an approach designed to determine the set of independent genetic variants that contribute to the variation of a quantitative trait in the presence of many highly correlated variants. In this study, we evaluate the statistical properties of the tiled regression method using the Genetic Analysis Workshop 17 data in unrelated individuals for traits Q1, Q2, and Q4. To increase the power to detect rare variants, we use two methods to collapse rare variants and compare the results with those from the uncollapsed data. In addition, we compare the tiled regression method to traditional tests of association with and without collapsed rare variants. The results show that collapsing rare variants generally improves the power to detect associations regardless of method, although only variants with the largest allelic effects could be detected. However, for traditional simple linear regression, the average estimated type I error is dependent on the trait and varies by about three orders of magnitude. The estimated type I error rate is stable for tiled regression across traits.Entities:
Year: 2011 PMID: 22373501 PMCID: PMC3287849 DOI: 10.1186/1753-6561-5-S9-S15
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Figure 1Proportion of 200 replicates identifying each causal variant and gene significant for trait Q1.
Figure 2Proportion of 200 replicates identifying each causal variant and gene significant for trait Q2.
Proportion of 200 replicates identifying causal variants in traits Q1 and Q2
| Trait | Gene | Variant | PoR for uncollapsed variants | PoR for collapsed variants | ||||
|---|---|---|---|---|---|---|---|---|
| TR 10−2 | TR 10−7 | SLR 10−7 | TR 10−2 | TR 10−7 | SLR 10−7 | |||
| Q1 | C1S6533 | 0.245 | 0.005 | 0.045 | 0.31 | 0.02 | 0.045 | |
| C4S1877 | 0 | 0 | 0.255 | 0.16 | 0.005 | 0.055 | ||
| C4S1884 | 0.16 | 0.015 | 0.065 | 0.2 | 0.01 | 0.065 | ||
| C4S1889 | 0 | 0 | 0.255 | 0.025 | 0 | 0.01 | ||
| C13S431 | 0.475 | 0.095 | 0.12 | 0.495 | 0.08 | 0.12 | ||
| C13S522 | 0.745 | 0.115 | 0.99 | 0.78 | 0.165 | 0.99 | ||
| C13S523 | 1 | 0.72 | 1 | 1 | 0.72 | 1 | ||
| C13S524 | 0.115 | 0.01 | 0.39 | 0.03 | 0 | 0.005 | ||
| Average PoR | 7.9 × 10−2 | 2.5 × 10−2 | 8.2 × 10−2 | 1.6 × 10−1 | 4.8 × 10−2 | 1.1 × 10−1 | ||
| Q2 | C6S5380 | 0.42 | 0.01 | 0.03 | 0.415 | 0.025 | 0.03 | |
| Average PoR | 2.2 × 10−2 | 6.3 × 10−4 | 4.9 × 10−4 | 7.2 × 10−2 | 2.7 × 10−3 | 1.8 × 10−3 | ||
TR, tiled regression. SLR, simple linear regression. PoR, proportion of 200 replicates. For the tiled regression, 10−2 and 10−7 are the critical values; for the simple linear regression, 10−7 is the significance level.
Average proportion of 200 replicates identifying noncausal variants in traits Q1, Q2, and Q4
| Trait | PoR for uncollapsed variants | PoR for collapsed variants | ||||
|---|---|---|---|---|---|---|
| TR 10−2 | TR 10−7 | SLR 10−7 | TR 10−2 | TR 10−7 | SLR 10−7 | |
| Q1 | 9.7 × 10−4 | 5.5 × 10−6 | 6.8 × 10−4 | 1.3 × 10−3 | 7.3 × 10−6 | 1.0 × 10−3 |
| Q2 | 1.3 × 10−3 | 6.2 × 10−6 | 3.1 × 10−6 | 1.7 × 10−3 | 3.8 × 10−6 | 2.1 × 10−6 |
| Q4 | 1.3 × 10−3 | 3.5 × 10−6 | 2.0 × 10−7 | 1.7 × 10−3 | 3.2 × 10−6 | 0.0 |
TR, tiled regression. SLR, simple linear regression. PoR, proportion of 200 replicates. For the tiled regression, 10−2 and 10−7 are the critical values; for the simple linear regression, 10−7 is the significance level.