| Literature DB >> 19956578 |
Lei Zhang1, Aaron J Bonham, Jian Li, Yu-Fang Pei, Jie Chen, Christopher J Papasian, Hong-Wen Deng.
Abstract
The availability of a large number of dense SNPs, high-throughput genotyping and computation methods promotes the application of family-based association tests. While most of the current family-based analyses focus only on individual traits, joint analyses of correlated traits can extract more information and potentially improve the statistical power. However, current TDT-based methods are low-powered. Here, we develop a method for tests of association for bivariate quantitative traits in families. In particular, we correct for population stratification by the use of an integration of principal component analysis and TDT. A score test statistic in the variance-components model is proposed. Extensive simulation studies indicate that the proposed method not only outperforms approaches limited to individual traits when pleiotropic effect is present, but also surpasses the power of two popular bivariate association tests termed FBAT-GEE and FBAT-PC, respectively, while correcting for population stratification. When applied to the GAW16 datasets, the proposed method successfully identifies at the genome-wide level the two SNPs that present pleiotropic effects to HDL and TG traits.Entities:
Mesh:
Year: 2009 PMID: 19956578 PMCID: PMC2779861 DOI: 10.1371/journal.pone.0008133
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Type I error rates at various levels of residual correlations under homogeneous population.
|
| |||||||||||||||||
| −0.8 | −0.4 | 0.0 | 0.4 | 0.8 | |||||||||||||
|
| T | FBAT | PBAT | T | FBAT | PBAT | T | FBAT | PBAT | T | FBAT | PBAT | T | FBAT | PBAT | QTDT | UT |
| Linkage | |||||||||||||||||
| −0.8 | 4.9 | 4.6 | 5.1 | 3.8 | 4.5 | 5.3 | 5.0 | 5.8 | 5.1 | 4.7 | 5.7 | 5.3 | 4.1 | 4.6 | 3.8 | 4.9 | 5.5 |
| −0.4 | 7.4 | 5.8 | 5.2 | 5.2 | 4.6 | 5.2 | 5.1 | 5.7 | 4.4 | 5.0 | 4.5 | 4.9 | 5.2 | 6.0 | 4.8 | ||
| 0.0 | 5.3 | 4.5 | 5.3 | 4.7 | 5.8 | 6.8 | 5.3 | 3.9 | 4.3 | 4.7 | 6.6 | 4.0 | 5.8 | 5.7 | 5.2 | ||
| 0.4 | 5.3 | 4.8 | 4.2 | 5.3 | 4.5 | 5.1 | 5.1 | 3.7 | 4.6 | 5.0 | 5.2 | 5.8 | 4.3 | 4.6 | 5.5 | ||
| 0.8 | 4.7 | 4.6 | 5.3 | 4.3 | 4.2 | 4.9 | 4.8 | 4.8 | 5.4 | 4.8 | 5.7 | 5.3 | 3.9 | 4.9 | 3.9 | ||
| No Linkage | |||||||||||||||||
| −0.8 | 4.7 | 4.5 | 3.8 | 5.7 | 4.9 | 5.4 | 4.6 | 4.6 | 4.2 | 5.8 | 4.9 | 4.2 | 5.9 | 4.3 | 4.9 | 5.2 | 4.0 |
| −0.4 | 5.1 | 3.9 | 4.5 | 5.7 | 5.1 | 5.4 | 5.4 | 5.5 | 5.2 | 4.1 | 6.5 | 4.1 | 4.3 | 4.8 | 5.3 | ||
| 0.0 | 5.5 | 4.8 | 3.9 | 4.4 | 4.2 | 5.1 | 3.8 | 6.5 | 5.8 | 4.0 | 6.0 | 5.1 | 4.3 | 5.2 | 4.1 | ||
| 0.4 | 5.2 | 5.4 | 3.7 | 4.6 | 4.9 | 4.2 | 4.9 | 6.1 | 3.8 | 4.1 | 4.6 | 4.9 | 5.6 | 5.4 | 4.6 | ||
| 0.8 | 5.2 | 6.2 | 5.2 | 6.4 | 4.1 | 4.4 | 4.4 | 5.0 | 4.8 | 4.3 | 5.4 | 6.0 | 4.2 | 4.5 | 4.6 | ||
Two hundred nuclear families were simulated, with the number of children per family being drawn from a Beta distribution with mean 2. Type I error rate was estimated at nominal level 5% on 1,000 replicates, with various levels of polygenic correlation (ρ) and environmental correlation (ρ).
the test site was linked to but not associated with the causal site.
the test site was neither linked to nor associated with the causal site.
Abbreviations: T, the proposed bivariate method; FBAT, the method FBAT-GEE [10] implemented in the software FBAT [24]; PBAT, the method FBAT-PC [9] implemented in the software PBAT [25]; QTDT, the method proposed by Abecasis et al. [23] and implemented in the software QTDT; UT, the univariate test in our previous study [18].
Type I error rates at various levels of residual correlations under admixed population.
|
| |||||||||||||||||
| −0.8 | −0.4 | 0.0 | 0.4 | 0.8 | |||||||||||||
|
| T | FBAT | PBAT | T | FBAT | PBAT | T | FBAT | PBAT | T | FBAT | PBAT | T | FBAT | PBAT | QTDT | UT |
| Linkagea | |||||||||||||||||
| −0.8 | 2.0 | 4.5 | 4.5 | 3.5 | 5.7 | 4.0 | 3.2 | 4.0 | 4.3 | 3.8 | 5.4 | 5.5 | 4.3 | 5.7 | 5.4 | 4.6 | 4.4 |
| −0.4 | 3.5 | 5.4 | 5.7 | 3.3 | 4.1 | 5.1 | 4.1 | 5.2 | 5.8 | 2.8 | 3.9 | 5.7 | 3.8 | 4.1 | 4.0 | ||
| 0.0 | 4.3 | 4.2 | 3.7 | 2.9 | 4.5 | 4.3 | 3.8 | 4.2 | 5.1 | 5.5 | 4.7 | 5.1 | 4.0 | 5.0 | 5.1 | ||
| 0.4 | 2.8 | 4.0 | 4.9 | 2.7 | 4.4 | 4.5 | 3.0 | 5.3 | 3.8 | 2.2 | 3.9 | 4.3 | 3.2 | 4.9 | 3.7 | ||
| 0.8 | 2.9 | 5.8 | 5.8 | 3.2 | 4.9 | 3.2 | 2.8 | 4.9 | 4.8 | 2.6 | 4.0 | 5.8 | 3.6 | 6.0 | 4.8 | ||
| No Linkageb | |||||||||||||||||
| −0.8 | 1.9 | 5.2 | 4.2 | 2.6 | 4.9 | 5.7 | 3.6 | 5.1 | 4.9 | 2.9 | 4.8 | 4.0 | 4.1 | 4.6 | 6.0 | 5.0 | 4.5 |
| −0.4 | 3.7 | 4.9 | 5.0 | 4.2 | 4.4 | 5.2 | 3.0 | 6.0 | 5.8 | 3.4 | 5.4 | 5.4 | 4.6 | 5.3 | 5.3 | ||
| 0.0 | 2.8 | 4.6 | 4.4 | 3.1 | 5.4 | 5.9 | 3.4 | 5.8 | 5.0 | 3.8 | 6.0 | 5.5 | 4.5 | 4.8 | 5.0 | ||
| 0.4 | 3.9 | 5.4 | 5.3 | 2.8 | 4.8 | 4.6 | 4.1 | 5.4 | 5.5 | 4.0 | 4.9 | 4.8 | 4.7 | 4.5 | 4.7 | ||
| 0.8 | 3.7 | 5.1 | 5.9 | 4.4 | 5.1 | 5.8 | 4.1 | 4.8 | 5.3 | 3.9 | 6.5 | 5.7 | 3.9 | 4.2 | 4.7 | ||
Two hundred nuclear families were simulated by admixing 100 from two populations A and B. See Table 1 legend for simulation and abbreviation details.
Type I error rates when parents are missing.
| No. of children per family | |||||||||||||||
| 2 | 3 | 4 | |||||||||||||
| T | FBAT | PBAT | QTDT | UT | T | FBAT | PBAT | QTDT | UT | T | FBAT | PBAT | QTDT | UT | |
| Homogeneous | |||||||||||||||
| Linkage | 6.3 | 4.7 | 5.9 | 4.5 | 4.7 | 3.3 | 5.1 | 3.6 | 5.2 | 4.9 | 5.3 | 4.7 | 5.0 | 5.2 | 4.2 |
| No Linkage | 5.0 | 3.3 | 4.3 | 4.7 | 3.6 | 4.9 | 4.1 | 4.6 | 4.9 | 4.7 | 5.9 | 4.7 | 5.0 | 4.8 | 5.3 |
| Admixture | |||||||||||||||
| Linkage | 3.9 | 4.7 | 5.2 | 5.2 | 3.5 | 4.6 | 5.3 | 5.0 | 4.8 | 3.9 | 4.2 | 4.3 | 5.1 | 4.1 | 4.5 |
| No Linkage | 4.0 | 4.5 | 4.6 | 4.7 | 3.8 | 4.7 | 5.9 | 6.1 | 5.8 | 4.7 | 5.3 | 4.5 | 5.2 | 5.1 | 4.8 |
The number of children per family varied from 2 to 4, and the number of families varied accordingly with the constraint that the total number of children was fixed at 480. Both polygenic and environmental correlations were set to 0.4. See Table 1 legend for simulation and abbreviation details.
Power estimates at various levels of residual correlations.
|
| |||||||||||||||||
| −0.8 | −0.4 | 0.0 | 0.4 | 0.8 | |||||||||||||
|
| T | FBAT | PBAT | T | FBAT | PBAT | T | FBAT | PBAT | T | FBAT | PBAT | T | FBAT | PBAT | QTDT | UT |
| Homogeneous | |||||||||||||||||
| −0.8 | 100.0 | 100.0 | 100.0 | 100.0 | 99.3 | 99.7 | 100.0 | 95.9 | 98.3 | 100.0 | 87.5 | 94.5 | 100.0 | 80.0 | 88.7 | 61.3 | 93.7 |
| −0.4 | 100.0 | 97.3 | 99.5 | 100.0 | 90.9 | 96.1 | 100.0 | 83.8 | 92.0 | 99.8 | 74.9 | 85.7 | 99.3 | 71.5 | 82.4 | ||
| 0.0 | 100.0 | 89.2 | 93.2 | 100.0 | 80.5 | 89.6 | 99.4 | 75.2 | 84.6 | 99.2 | 65.4 | 77.1 | 98.4 | 59.7 | 70.5 | ||
| 0.4 | 100.0 | 79.0 | 88.4 | 99.4 | 70.6 | 79.7 | 98.0 | 62.8 | 76.5 | 97.4 | 60.3 | 71.8 | 96.1 | 53.7 | 65.8 | ||
| 0.8 | 99.7 | 66.6 | 76.9 | 98.2 | 61.8 | 73.6 | 96.0 | 58.2 | 65.5 | 95.6 | 55.1 | 64.1 | 90.5 | 48.8 | 58.0 | ||
| Admixture | |||||||||||||||||
| −0.8 | 100.0 | 85.8 | 94.5 | 99.7 | 76.0 | 83.1 | 99.9 | 71.2 | 80.3 | 100.0 | 64.5 | 74.6 | 99.8 | 57.7 | 66.8 | 55.1 | 91.4 |
| −0.4 | 99.8 | 75.1 | 82.4 | 100.0 | 69.8 | 80.3 | 99.3 | 61.0 | 74.8 | 99.3 | 56.3 | 63.1 | 98.4 | 49.7 | 57.2 | ||
| 0.0 | 99.9 | 64.0 | 78.3 | 99.3 | 57.9 | 64.7 | 98.0 | 53.3 | 57.6 | 97.5 | 50.9 | 54.2 | 94.9 | 44.7 | 50.3 | ||
| 0.4 | 98.4 | 55.7 | 64.7 | 97.6 | 50.4 | 58.6 | 96.3 | 49.1 | 53.4 | 93.4 | 44.2 | 47.7 | 92.0 | 40.4 | 44.5 | ||
| 0.8 | 95.6 | 48.8 | 55.5 | 94.0 | 45.5 | 52.2 | 92.3 | 40.4 | 48.0 | 91.3 | 35.7 | 42.5 | 87.6 | 37.4 | 39.6 | ||
Power was estimated at the nominal level of 5% based on 1,000 replicates. The value of r 2 between test and causal sites was set to 1.0, and the recombination rate between the sites was set to 0.01. See Table 1 and 2 legends for simulation and abbreviation details.
Power estimates at various levels of residual correlations when parents are missing.
|
| |||||||||||||||||
| −0.8 | −0.4 | 0.0 | 0.4 | 0.8 | |||||||||||||
|
| T | FBAT | PBAT | T | FBAT | PBAT | T | FBAT | PBAT | T | FBAT | PBAT | T | FBAT | PBAT | QTDT | UT |
| Homogeneous | |||||||||||||||||
| −0.8 | 99.0 | 100.0 | 100.0 | 99.2 | 96.7 | 99.2 | 99.3 | 90.1 | 97.5 | 98.4 | 85.5 | 94.4 | 97.5 | 77.5 | 88.9 | 43.4 | 72.8 |
| −0.4 | 99.7 | 85.8 | 95.7 | 99.2 | 79.1 | 92.8 | 96.8 | 71.2 | 85.8 | 94.7 | 66.9 | 83.1 | 90.3 | 60.5 | 76.9 | ||
| 0.0 | 99.0 | 67.7 | 86.1 | 95.8 | 60.2 | 80.7 | 92.3 | 57.2 | 76.6 | 85.6 | 51.4 | 69.9 | 81.6 | 46.4 | 66.0 | ||
| 0.4 | 94.3 | 54.3 | 74.5 | 89.0 | 46.1 | 70.2 | 84.8 | 42.6 | 61.9 | 79.7 | 40.7 | 56.8 | 76.4 | 39.1 | 60.2 | ||
| 0.8 | 88.2 | 42.7 | 66.7 | 81.4 | 37.8 | 59.5 | 77.7 | 37.2 | 58.2 | 76.3 | 35.7 | 57.7 | 68.9 | 31.4 | 48.8 | ||
| Admixture | |||||||||||||||||
| −0.8 | 99.9 | 99.3 | 99.7 | 99.4 | 94.7 | 97.3 | 98.7 | 87.9 | 94.6 | 94.6 | 77.5 | 88.2 | 93.1 | 70.1 | 78.6 | 40.3 | 63.2 |
| −0.4 | 98.2 | 83.4 | 96.2 | 94.9 | 70.2 | 90.7 | 90.8 | 64.1 | 83.1 | 86.8 | 58.9 | 78.4 | 81.0 | 54.7 | 74.3 | ||
| 0.0 | 92.0 | 61.2 | 87.7 | 87.6 | 57.0 | 78.9 | 83.3 | 52.6 | 75.5 | 78.9 | 47.3 | 72.3 | 74.5 | 42.9 | 65.8 | ||
| 0.4 | 82.7 | 45.8 | 72.1 | 78.8 | 42.6 | 69.9 | 73.2 | 40.8 | 63.4 | 68.7 | 39.0 | 59.6 | 64.1 | 35.0 | 57.2 | ||
| 0.8 | 76.9 | 41.3 | 65.2 | 69.8 | 36.4 | 61.7 | 63.1 | 34.7 | 55.4 | 62.3 | 32.5 | 54.1 | 57.8 | 29.5 | 39.5 | ||
Parents were deleted from the simulation. See Table 1 and 2 legends for simulation and abbreviation details.
The effects of LD level and family structures on power estimates.
| No. of children per family | |||||||||||||||
| 1 | 2 | 3 | |||||||||||||
|
| T | FBAT | PBAT | QTDT | UT | T | FBAT | PBAT | QTDT | UT | T | FBAT | PBAT | QTDT | UT |
| Homogeneous | |||||||||||||||
| 0.25 | 46.2 | 14.7 | 17.3 | 14.5 | 41.1 | 42.1 | 15.7 | 20.9 | 18.1 | 43.6 | 41.9 | 18.9 | 23.4 | 20.8 | 40.8 |
| 0.50 | 72.7 | 18.2 | 26.4 | 20.5 | 70.3 | 74.6 | 26.1 | 37.9 | 30.4 | 66.2 | 72.2 | 34.2 | 41.1 | 36.3 | 67.9 |
| 0.75 | 89.4 | 28.8 | 39.0 | 31.5 | 86.0 | 91.0 | 41.3 | 53.3 | 42.0 | 85.1 | 89.5 | 49.5 | 59.7 | 50.8 | 83.7 |
| 1.00 | 97.1 | 36.9 | 49.5 | 39.6 | 92.8 | 96.0 | 52.2 | 63.7 | 54.0 | 92.4 | 95.7 | 59.9 | 70.5 | 62.5 | 92.5 |
| Admixture | |||||||||||||||
| 0.25 | 32.5 | 11.4 | 15.4 | 13.2 | 31.6 | 33.0 | 13.7 | 18.9 | 16.5 | 29.0 | 29.2 | 12.8 | 19.5 | 16.4 | 33.1 |
| 0.50 | 60.8 | 13.9 | 18.2 | 17.7 | 58.2 | 61.0 | 22.9 | 34.8 | 28.8 | 58.7 | 60.6 | 25.1 | 35.4 | 30.7 | 57.5 |
| 0.75 | 80.9 | 22.2 | 36.3 | 26.7 | 77.4 | 81.6 | 32.2 | 48.6 | 37.6 | 75.7 | 79.5 | 33.4 | 55.9 | 43.7 | 74.2 |
| 1.00 | 92.3 | 25.4 | 41.6 | 34.1 | 87.1 | 92.4 | 42.0 | 57.1 | 49.3 | 86.3 | 91.9 | 45.7 | 66.8 | 55.6 | 89.5 |
The number of children per family varied from 1 to 3, with the total number of individuals being constrained at 800. See Table 1 and 2 legends for simulation and abbreviation details.
Figure 1Application of the proposed method to the GAW16 simulated datasets.
The GAW16 simulated HDL and TG traits were analyzed. Figure 1A, the quantile-quantile (QQ) plot (left), and log-QQ plot (right); Figure 1B, raw p-values of the genome-wide scan.
P-values at pleiotropic SNPs.
| p-value | |||||
| SNP | T | FBAT | PBAT | QTDT | UT |
|
| 3.47e-12 | 0.03 | 5.20E-04 | 1.91E-04 | 2.72e-08 |
|
| 3.07e-6 | 0.94 | 0.04 | 0.04 | 2.38e-06 |
For univariate tests, e.g., QTDT and UT, the uniform bivariate p-value was obtained by adjusting the minimum of the two univairate p-values by multiple testing correction (multiplying by 2). See Table 1 legend for abbreviation details.