| Literature DB >> 26406920 |
Iksoo Huh1, Min-Seok Kwon2, Taesung Park3.
Abstract
Recent advances in genotyping methodologies have allowed genome-wide association studies (GWAS) to accurately identify genetic variants that associate with common or pathological complex traits. Although most GWAS have focused on associations with single genetic variants, joint identification of multiple genetic variants, and how they interact, is essential for understanding the genetic architecture of complex phenotypic traits. Here, we propose an efficient stepwise method based on the Cochran-Mantel-Haenszel test (for stratified categorical data) to identify causal joint multiple genetic variants in GWAS. This method combines the CMH statistic with a stepwise procedure to detect multiple genetic variants associated with specific categorical traits, using a series of associated I × J contingency tables and a null hypothesis of no phenotype association. Through a new stratification scheme based on the sum of minor allele count criteria, we make the method more feasible for GWAS data having sample sizes of several thousands. We also examine the properties of the proposed stepwise method via simulation studies, and show that the stepwise CMH test performs better than other existing methods (e.g., logistic regression and detection of associations by Markov blanket) for identifying multiple genetic variants. Finally, we apply the proposed approach to two genomic sequencing datasets to detect linked genetic variants associated with bipolar disorder and obesity, respectively.Entities:
Mesh:
Year: 2015 PMID: 26406920 PMCID: PMC4583484 DOI: 10.1371/journal.pone.0138700
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Contingency table for CMH test between trait and p SNPs. This test can be represented as H0: Y⊥SNP|(number of p-1 SNPs). “A” and “a” represent major and minor alleles, respectively.
Odds table for simulation studies and binary trait in three causal SNP model (θ = 0).
| ((#of c) = |
|
|
|
|---|---|---|---|
|
| α(1+ | α(1+ | α(1+ |
|
| α(1+ | α(1+ | α(1+ |
|
| α(1+ | α(1+ | α(1+ |
Penetrance table for Table 1 in three causal SNP model (θ = 0).
| (#of |
|
|
|
|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Fig 2Performance comparison of stepwise CMH method, stepwise logistic and DASSO-MB methods for the codominant model with two causal SNPs.
Blue bars represent the result of the DASSO-MB method, green bars represent the result of the stepwise logistic method, and red bars represent the result of the stepwise CMH method. Bars with diagonal lines are the results of power and solid bars are those of Dprob. The x-axis represents the MAF of true causal SNPs and the y-axis represents the value of two accuracy measures based on three approaches.
Fig 3Performance comparison of stepwise CMH method, stepwise logistic and DASSO-MB methods for the codominant model with three causal SNPs.
Fig 4Toy example dataset structure to show superiority of the stepwise CMH method.
We counted the numbers of cases and controls for each genotype combination and expressed them as vertical bars to visualize. Orange bars represent the counts of cases and the green bars do counts of controls.
Toy example dataset application result.
| Methods | p-value (1st forward step) | DF | p-value (2nd forward step) | DF |
|---|---|---|---|---|
| Stepwise CMH | 0.0146 | 2 | 0.0304 | 2 |
| Stepwise Logistic | 0.0203 | 2 | 0.2450 | 2 |
| DASSO-MB | 0.0498 | 2 | 0.1330 | 6 |
WTCCC bipolar disorder data analysis result (Entrance cutoff = 5×10−5, Removal cutoff = 5×10−5).
| Set | SNP name | GENE |
|---|---|---|
| 1st Set |
|
|
| 2nd Set |
|
|
KARE BMI data analysis result (Entrance cutoff = 5×10−5, Removal cutoff = 5×10−5).
| Set | SNP name | GENE |
|---|---|---|
| 1st Set |
|
|
| 2nd Set |
|
|