| Literature DB >> 22373079 |
Jennifer H Barrett1, Jérémie Nsengimana.
Abstract
We propose a two-stage design for the analysis of sequence variants in which a proportion of genes that show some evidence of association are identified initially and then followed up in an independent data set. We compare two different approaches. In both approaches the same summary measure (total number of minor alleles) is used for each gene in the initial analysis. In the first (simple) approach the same summary measure is used in the analysis of the independent data set. In the second (alternative) approach a more specific hypothesis is formed for the second stage; the summary measure used is the count of minor alleles in only those variants that in the initial data showed the same direction of association as was seen overall. We applied the methods to the simulated quantitative traits of Genetic Analysis Workshop 17, blind to the simulation model, and then evaluated their performance once the underlying model was known. Performance was similar for most genes, but the simple strategy considerably out-performed the alternative strategy for one gene, where most of the effect was due to very rare variants; this suggests that the alternative approach would not be advisable when the effect is seen in very rare variants. Further simulations are needed to investigate the potential superior power of the alternative method when some variants within a gene have opposing effects. Overall, the power to detect associations was low; this was also true when using a more powerful joint analysis that combined the two stages of the study.Entities:
Year: 2011 PMID: 22373079 PMCID: PMC3287891 DOI: 10.1186/1753-6561-5-S9-S53
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Analysis of genes in relation to trait Q1 using the simple strategy, the alternative strategy based on more specific hypotheses, and the combined analysis with appropriate adjustment for multiple testing
| Number of significant replicates | ||||||
|---|---|---|---|---|---|---|
| Gene | Chromosome | Number of variants in gene | Number of times passing stage 1 threshold | Simple strategy | Alternative strategy | Joint analysis |
| 1 | 16 | 166 | 79 | 77 | 140 | |
| 12 | 34 | 159 | 57 | 60 | 117 | |
| 1 | 12 | 150 | 49 | 45 | 105 | |
| 19 | 12 | 149 | 37 | 38 | 97 | |
| 12 | 12 | 148 | 41 | 53 | 102 | |
| 19 | 9 | 145 | 45 | 42 | 87 | |
| 6 | 28 | 143 | 33 | 36 | 89 | |
| 19 | 16 | 138 | 42 | 39 | 91 | |
| 19 | 5 | 125 | 24 | 21 | 73 | |
| 16 | 30 | 123 | 28 | 32 | 64 | |
Only genes significant in the first analysis (at the 1% level) in at least 120 of the 200 replicates are shown. The two genes in bold were simulated to be associated with the trait. All analyses are adjusted for population stratification.
Figure 1Properties of the associated variants within a gene. (A) Sum of MAFs of associated variants within each of the simulated genes (9 genes for Q1 and 13 genes for Q2) as a function of the number of associated variants. (B) Mean simulated variant effect within the four genes with highest total MAF.
Figure 2Simulated variants effect (regression β coefficient) in relation to their MAFs within FLT1, KDR, and VNN1