| Literature DB >> 28536599 |
Meng Wang1, William C L Stewart1,2.
Abstract
The efficient analysis of hybrid designs [e.g., affected families, controls, and (optionally) independent cases] is attractive because it should have increased power to detect associations between genetic variants and disease. However, the computational complexity of such an analysis is not trivial, especially when the data contain pedigrees of arbitrary size and structure. To address this concern, we developed a pragmatic test of association that summarizes all of the available evidence in certain hybrid designs, irrespective of pedigree size or structure. Under the null hypothesis of no association, our proposed test statistic (POPFAM+) is the quadratic form of two correlated tests: a population-based test (e.g., wQLS), and a family-based test (e.g., PDT). We use the parametric bootstrap in conjunction with an estimate of the correlation to compute p-values, and we illustrate the potential for increased power when (1) the heritability of the trait is high; and, (2) the marker-specific association is driven by the over-representation of risk alleles in cases, and by the preferential transmission of risk alleles from heterozygous parents to their affected offspring. Based on simulation, we show that type I error is controlled, and that POPFAM+ is more powerful than wQLS or PDT alone. In a real data application, we used POPFAM+ to analyze 43 genes of a hybrid epilepsy study containing 85 affected families, 80 independent cases, 234 controls, and 118 reference samples from the International HapMap Project. The results of our analysis identified a promising epilepsy candidate gene for follow-up sequencing: malic enzyme 2 (ME2; min p < 0.0084).Entities:
Keywords: POPFAM+; candidate gene association; meta-analysis; sequencing; transmission disequilibrium
Year: 2017 PMID: 28536599 PMCID: PMC5422425 DOI: 10.3389/fgene.2017.00049
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Figure 1Three contours of constant probability are shown for POPFAM+. Dashed curve (τ = 1) is the density contour of the χ2 distribution with 2 degrees of freedom. The dotted line corresponds to τ = 0, and the solid line is the contour of constant probability for POPFAM+ when τ = 0.5. The X- and Y-axes are the values of (normalized) test statistics, with X representing the appropriately signed square root of wQLS, and Y representing PDT.
Summary statistics for families in simulated datasets.
| 3 | 47 | 0 | 47 |
| 4 | 117 | 9 | 126 |
| 5 | 74 | 15 | 89 |
| 6 | 39 | 9 | 48 |
| 7 | 27 | 2 | 29 |
All affected families were ascertained on the basis of a single IGE (idiopathic generalized epilepsy) proband.
Type I error for POPFAM+, PDT, and wQLS.
| POPFAM+ | 0.051 | 0.052 | 0.046 |
| wQLS | 0.056 | 0.056 | 0.045 |
| PDT | 0.047 | 0.054 | 0.045 |
Type I error for various simulation settings for POPFAM+, PDT, and wQLS. The 95% CIs for type I error for all scenarios considered included the nominal value 0.05.
Figure 2A scatter plot of PDT and wQLS; the .
Figure 3Power Comparison for POPFAM+ (solid line), MAX (dash-dot), PDT (dash), and wQLS (dot). POPFAM+ and MAX have comparable power, and both outperform PDT and wQLS. Panel (A) shows power for a dominant model with relative risk (RR) of 2.0, while panel (B) shows power for RR = 3.0. Similarly, panel (C) shows power for a recessive model with RR = 5, while panel (D) shows power for RR = 10.
Top 10 genes ranked by POPFAM+, the ILAE Consortium, wQLS, and PDT.
| CTIF | TXNL1 | SETBP1 | CTIF |
| SETBP1 | PSTPIP2 | ME2 | SERPINB5 |
| ME2 | KATNAL2 | SLC14A2 | CFAP53 |
| RNF165 | SMAD2 | ||
| SERPINB5 | DYM | PSTPIP2 | |
| DYM | MEX3C | ST8SIA5 | MYO5B |
| MYO5B | SKOR2 | EPG5 | SKOR2 |
| LOXHD1 | ZBTB7C | LOXHD1 | |
| ST8SIA5 | SMAD7 | MBD2 | SETBP1 |
| RNF165 | HDHD2 | CTIF | ZBTB7C |
Genes with an asterisk (
) are epilepsy-related genes according to previous reports in the literature. Genes in bold are ranked in the top 10 by both POPFAM+ and ILAE.