| Literature DB >> 25519319 |
Abstract
The cost of next-generation sequencing is now approaching that of the first generation of genome-wide single-nucleotide genotyping panels, but this is still out of reach for large-scale epidemiologic studies with tens of thousands of subjects. Furthermore, the anticipated yield of millions of rare variants poses serious challenges for distinguishing causal from noncausal variants for disease. We explore the merits of using family-based designs for sequencing substudies to identify novel variants and prioritize them for their likelihood of causality. While the sharing of variants within families means that family-based designs may be less efficient for discovery than sequencing of a comparable number of unrelated individuals, the ability to exploit cosegregation of variants with disease within families helps distinguish causal from noncausal ones. We introduce a score test criterion for prioritizing discovered variants in terms of their likelihood of being functional. We compare the relative statistical efficiency of 2-stage versus1-stage family-based designs by application to the Genetic Analysis Workshop 18 simulated sequence data.Entities:
Year: 2014 PMID: 25519319 PMCID: PMC4143728 DOI: 10.1186/1753-6561-8-S1-S32
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Mean score tests for the complete pedigrees.
| Phenotype | Parameter | Mean score test | Proportion of variants (%) | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Prioritized | Replicated | 1-Stage QTDTM | ||||||||
| Simulated | - | Null | + | Null | +/- | Null | +/- | Null | +/- | |
| DBP | Baseline | -1.42 | -0.25 | +0.42 | 10.2 | 14.7 | 1.8 | 5.3 | 1.7 | 9.3 |
| Intercept | -1.36 | -0.32 | +0.24 | 10.1 | 17.3 | 1.4 | 4.0 | 3.5 | 8.0 | |
| Slope | +0.70 | +0.14 | -0.12 | 10.2 | 13.3 | 1.7 | 0.0 | 3.3 | 4.0 | |
| SBP | Baseline | -1.40 | -0.35 | +0.20 | 10.2 | 16.0 | 0.3 | 2.7 | 1.7 | 9.3 |
| Intercept | -1.34 | -0.31 | +0.37 | 10.1 | 17.3 | 0.0 | 4.0 | 2.0 | 13.3 | |
| Slope | +0.85 | +0.21 | +0.11 | 10.2 | 12.0 | 0.5 | 0.0 | 2.3 | 0.0 | |
Mean score tests for the complete pedigrees for protective, null, and deleterious variants, along with the proportion of the top 100 variants prioritized using only the related members and replicated at α= 0.05/100 using only the unrelated members, and the proportion of variants significant at α= 0.05/1000 in a single-stage QTDTM test. The "+" and "−" represent variants with positive and negative association with the phenotypes respectively. If higher blood pressure is assumed to have more risk, "+" would correspond to deleterious variants and "−" would correspond to protective variants. In total, there are 6 deleterious variants, 9 protective variants, and 961 noncausal variants being discovered and tested.
Figure 1Comparison of 1-stage and 2-stage designs. The proportion of being prioritized and being significantfor all 15 functional variants in the MAP4 gene (sorted by %VAR explained in descending order):QTDTM for the 1-stage procedure using all members; score test for prioritizing the top 100 variants using 2, 4, or 6 randomly selected related members for sequencing and prioritizing). SBPintercepts model only.