| Literature DB >> 23860716 |
Manuel A Rivas1, Matti Pirinen, Matthew J Neville, Kyle J Gaulton, Loukas Moutsianas, Cecilia M Lindgren, Fredrik Karpe, Mark I McCarthy, Peter Donnelly.
Abstract
MOTIVATION: In sequencing studies of common diseases and quantitative traits, power to test rare and low frequency variants individually is weak. To improve power, a common approach is to combine statistical evidence from several genetic variants in a region. Major challenges are how to do the combining and which statistical framework to use. General approaches for testing association between rare variants and quantitative traits include aggregating genotypes and trait values, referred to as 'collapsing', or using a score-based variance component test. However, little attention has been paid to alternative models tailored for protein truncating variants. Recent studies have highlighted the important role that protein truncating variants, commonly referred to as 'loss of function' variants, may have on disease susceptibility and quantitative levels of biomarkers. We propose a Bayesian modelling framework for the analysis of protein truncating variants and quantitative traits.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23860716 PMCID: PMC3777107 DOI: 10.1093/bioinformatics/btt409
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.(a) Prior and sampling distribution for the SEM: (i) distribution of the trait values under the null model, ; (ii) distribution of the trait values under the alternative model, ; and (iii) 50:50 mixture of two normal distributions as prior for . (b) Prior and sampling distribution for the GEM: (i) distribution of the trait values under the null model; (ii) under the alternative model, trait values are grouped around and ; and (iii) priors for and
Fig. 2.Plots of mean trait value for PTV carriers (X-axis) and BF from SEM () (Y-axis), for different values of n (written next to each point) the number of PTV carriers, for a fixed P-value of 0.001. The dashed line shows the density of the prior (restricted to positive values) on mean effect size under SEM (not on scale)
Critical values for BFs from SEM and GEM-NMD corresponding to different Type I error rates (α) with 3, 4 and 5 PTVs
| α | NPTV | SEM | GEM-NMD |
|---|---|---|---|
| 0.01 | 3 | 8.765 | 9.34 (9.30,9.39) |
| 4 | 7.77 | 8.93 (8.88,8.98) | |
| 5 | 6.81 | 8.36 (8.32,8.41) | |
| 0.001 | 3 | 64.56 | 59.94 (59.01,60.84) |
| 4 | 63.98 | 63.99 (62.76,65.01) | |
| 5 | 60.07 | 63.81 (62.63,64.94) | |
| 0.0001 | 3 | 438.45 | 362.10 (349.11,380.70) |
| 4 | 489.86 | 438.52 (412.02,460.90) | |
| 5 | 496.85 | 459.96.72 (434.96,484.73) | |
| 0.00001 | 3 | 2814.95 | 2035.51 (1790.69,2277.75) |
| 4 | 3564.12 | 2703.12 (2336.41,3197.85) | |
| 5 | 3924.31 | 3199.53 (2778.93,3765.76) |
Note: aA total of 10 000 000 replicates were generated to evaluate critical values at the corresponding Type I error rate together with its 95% confidence intervals (L95, U95). The critical values for SEM were calculated analytically.
Power, expressed as a percentage, at , to detect association for two scenarios: (i) variants impact trait values in same direction with similar effects, and (ii) variants impact trait values in different directions with direction of effect determined by NMD
| Similar | Grouped | |||||
|---|---|---|---|---|---|---|
| NPTV | 3 | 4 | 5 | 3 | 4 | 5 |
| SKAT | 42 | 49 | 69 | 44 | 55 | 66 |
| SKAT-O | 53 | 65 | 86 | 37 | 45 | 60 |
| Collapse | 55 | 81 | 86 | 2 | 0 | 1 |
| SEM | 55 | 81 | 86 | 2 | 0 | 1 |
| GEM-NMD | — | — | — | 51 | 65 | 79 |
| SKATw | — | — | — | 28 | 46 | 56 |
| Multiple linear regression | 42 | 49 | 69 | 44 | 55 | 66 |
Note: For GEM-NMD, thresholds for the given type I error rate are given in Table 1.
Fig. 3.(a) Protein and transcript locations of APOC3 mutations, including predicted impact of splice variant on transcript splicing. Transcript diagram demonstrates that variant c.IVS2 + 1G >A will disrupt proper splicing of the second exon and create a new spliced mRNA with exon 1 and exon 3 joining because of proper recognition of splice sequence in the donor site of exon 1 and acceptor site of exon 3. Genomic codon position is shown for the stop-gain mutation, i.e. g.55C >T. (b) Prior, likelihood and posterior of mean trait value after combining data from the Oxford Biobank study and GoT2D study. The shaded histogram in each panel represents the distribution of trait values for the relevant PTV carriers
Comparison of association P-values and BFs for APOC3 truncating variants and plasma triglyceride levels in the GoT2D exome-sequencing study
| Measure | Collapse | SKAT | SKAT-O | lm | SEM | GEM-NMD |
|---|---|---|---|---|---|---|
| 0.00065 | 0.0016 | 0.00063 | 0.003 | 0.00065 | 0.00066 | |
| BF | — | — | — | — | 63.4 | 42.3 |
Note: P-value for GEM-NMD is based on 1 000 000 simulations and its 95% empirical confidence interval is (0.000607–0.000707).