| Literature DB >> 20799953 |
Ming Li1, Roberto Romero, Wenjiang J Fu, Yuehua Cui.
Abstract
BACKGROUND: The genetic etiology of complex diseases in human has been commonly viewed as a complex process involving both genetic and environmental factors functioning in a complicated manner. Quite often the interactions among genetic variants play major roles in determining the susceptibility of an individual to a particular disease. Statistical methods for modeling interactions underlying complex diseases between single genetic variants (e.g. single nucleotide polymorphisms or SNPs) have been extensively studied. Recently, haplotype-based analysis has gained its popularity among genetic association studies. When multiple sequence or haplotype interactions are involved in determining an individual's susceptibility to a disease, it presents daunting challenges in statistical modeling and testing of the interaction effects, largely due to the complicated higher order epistatic complexity.Entities:
Mesh:
Year: 2010 PMID: 20799953 PMCID: PMC2946267 DOI: 10.1186/1471-2156-11-79
Source DB: PubMed Journal: BMC Genet ISSN: 1471-2156 Impact factor: 2.797
The configuration of two SNP combinations
| Observed Genotype | Diplotype | Composite Diplotype | ||
|---|---|---|---|---|
| Configuration | Frequency | |||
| 11/11 | [11][11] | 1 | ||
| 11/12 | [11][12] | 2 | 1 | |
| 11/22 | [12][12] | 1 | ||
| 12/11 | [11][21] | 2 | 1 | |
| 12/12 | ||||
| 12/22 | [12][22] | 2 | 1 | |
| 22/11 | [21][21] | 1 | ||
| 22/12 | [21][22] | 2 | 1 | |
| 22/22 | [22][22] | 1 | ||
Where
List of parameter values under different simulation designs
| Scenario | ||||||||
|---|---|---|---|---|---|---|---|---|
| S0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| S1 | 0.8 | 0.8 | 0.8 | 0.8 | 0.8 | 0.8 | 0.8 | 0.8 |
| S2 | 0.8 | 0.8 | 0 | 0 | 0 | 0 | 0 | 0 |
| S3 | 0.8 | 0.8 | 0.8 | 0.8 | 0 | 0 | 0 | 0 |
| S4 | 0.8 | 0 | 0.8 | 0 | 0.8 | 0.8 | 0.8 | 0.8 |
Figure 1The bar plot of variable selection results under different simulation scenarios. Parameter values are listed in Table 2. The three sets of colored bars correspond to different sample sizes (Blue:200; Green:500; Red:1000). The horizontal dashed line indicates the nominal level of 0.05.
List of selected genes, corresponding "risk" haplotype structure, effect estimates and permutation p-values
| SNP ID (allele) | Gene (region) | "Risk" haplotype | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| 9508994 | PON1 | [TC]M | 0 | 0 | 0 | 0 | 0 | -0.45 | 0 | 0 |
| 20209376 | PON1 | [CC]O | p* = 0.001 | |||||||
| 659435566 | NFKB1 | [CC]M | 0 | 0 | 0 | 0 | -0.33 | 0 | 0 | 0 |
| 659435702 | NFKB1 | [TC]O | p* = 0.001 | |||||||
| 22767327 (A/T) | FLT4 | [AT]M | 0 | 0 | 0 | 0 | 0 | -0.30 | 0 | 0 |
| 22175087 (C/T) | FLT4 | [TC]O | p* < 0.001 | |||||||
| 1125300 (G/T) | SPARC (intron 3) | [TT]M | 0 | -0.38 | 0 | 0 | 0 | 0 | 0 | 0.245 |
| 1125290 (G/T) | SPARC (intron 5) | [TT]O | p* = 0.001 | p* < 0.001 | ||||||
| 634841108 (A/C) | TIMP2 (intron 2) | [AG]M | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.68 |
| 634841123 (A/G) | TIMP2 | [CG]O | p* < 0.001 | |||||||
| 634018768 (A/G) | HPGD (promoter) | [AG]M | 0 | 0 | 0.44 | 0 | 0 | 0 | 0 | 0 |
| 636105057 (A/G) | HPGD (promoter) | [GA]O | p* < 0.001 | |||||||
| 17252653 (G/T) | MMP9 (intron) | [GC]M | 0 | 0 | 0.53 | 0 | 0 | 0 | 0 | 0 |
| 17254821 (C/G) | MMP9 | [TC]O | p* < 0.001 | |||||||
M mother's "risk" haplotype information; O offspring's "risk" haplotype information
p* is the permutation p-value.