| Literature DB >> 22303409 |
Li Zhang1, Ruitao Liu, Zhong Wang, Daniel A Culver, Rongling Wu.
Abstract
Haplotype analysis has been increasingly used to study the genetic basis of human diseases, but models for characterizing genetic interactions between haplotypes from different chromosomal regions have not been well developed in the current literature. In this article, we describe a statistical model for testing haplotype-haplotype interactions for human diseases with a case-control genetic association design. The model is formulated on a contingency table in which cases and controls are typed for the same set of molecular markers. By integrating well-established quantitative genetic principles, the model is equipped with a capacity to characterize physiologically meaningful epistasis arising from interactions between haplotypes from different chromosomal regions. The model allows the partition of epistasis into different components due to additive × additive, additive × dominance, dominance × additive, and dominance × dominance interactions. We derive the EM algorithm to estimate and test the effects of each of these components on differences in the pattern of genetic variation between cases and controls and, therefore, examine their role in the pathogenesis of human diseases. The method was further extended to investigate gene-environment interactions expressed at the haplotype level. The statistical properties of the models were investigated through simulation studies and its usefulness and utilization validated by analyzing the genetic association of sarcoidosis from a human genetics project.Entities:
Keywords: EM algorithm; epistasis; haplotype; linkage disequilibrium; risk haplotype
Year: 2012 PMID: 22303409 PMCID: PMC3260479 DOI: 10.3389/fgene.2012.00002
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Frequencies of 2-SNP genotypes at block 1 calculated from genotypic observations for 2 SNPs at block 1 in both cases and controls. The composite diplotypes (CD) of each genotype are also given assuming that AB is the risk haplotype.
| Genotype | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| CD | ||||||||||
| Cases | ||||||||||
| Controls | ||||||||||
Frequencies of composite diplotypes for 2 SNPs at block 1 in both cases and controls assuming that .
| Disease status | |||
|---|---|---|---|
| Cases | |||
| Controls |
Frequencies of across-block composite diplotypes in both cases and controls, assuming that .
| Disease status | Composite diplotype | Total | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| (22) | (21) | (20) | (12) | (11) | (10) | (02) | (01) | (00) | ||
| Cases | ||||||||||
| Controls | ||||||||||
Frequencies of across-block composite diplotypes calculated from across-block genotypes in cases, assuming that .
The genotypic values of across-block composite diplotypes in terms of additive and dominant effects at each block and interactions between two blocks.
| μ22 = μ + | μ21 = μ + | μ20 = μ + | |
| μ12 = μ + | μ11 = μ + | μ10 = μ + | |
| μ02 = μ − | μ01 = μ − | μ00 = μ − |
The genotypic values of 9 across-block composite diplotypes stratified by a binary environmental factor (.
| Environ. factor | Composite diplotype | |||
|---|---|---|---|---|
| μ0,22 = μ + | μ0,21 = μ + | μ0,20 = μ + | ||
| μ0,12 = μ + | μ0,11 = μ + | μ0,10 = μ + | ||
| μ0,02 = μ − | μ0,02 = μ − | μ0,00 = μ − | ||
| μ1,22 = μ + | μ1,21 = μ + | μ1,20 = μ + | ||
| μ1,12 = μ + | μ1,11 = μ + | μ1,10 = μ + | ||
| μ1,02 = μ − | μ1,01 = μ − | μ1,00 = μ − | ||
Estimation and significance test (corrected for multiple comparisons using FDR) of genetic effects on sarcoidosis risk by haplotypes composed of SNPs from MMP7 and MMP9.
| Effect | [MMP7 SNPs][MMP9 SNPs] | Risk Hap | Risk Hap freq. | Est(SD) | FDR | ||||
|---|---|---|---|---|---|---|---|---|---|
| MMP7 | MMP9 | ||||||||
| Case | Control | Case | Control | ||||||
| [rs11568819–rs11568818][rs2250889–rs17577] | CA-CG | 0.596 | 0.522 | 0.678 | 0.681 | 1.21(0.35) | 0.0006 | 0.036 | |
| [rs11568818–rs17098318][rs4810482–rs3918241] | GA-CT | 0.364 | 0.410 | 0.460 | 0.519 | −0.75(0.27) | 0.0059 | 0.089 | |
| [rs11568818–rs17098318][rs2250889–rs17577] | AG-CG | 0.597 | 0.521 | 0.678 | 0.681 | 0.94(0.32) | 0.0035 | 0.089 | |
| [rs17098318–rs880197][rs4810482–rs3918241] | AA-CT | 0.365 | 0.422 | 0.460 | 0.519 | −0.76(0.27) | 0.0045 | 0.089 | |
| [rs11568818–rs17098318][rs3918249–rs3918253] | AA-TT | 0.365 | 0.422 | 0.295 | 0.234 | −1.00(0.35) | 0.0040 | 0.080 | |
| [rs11568818–rs17098318][rs3918253–rs3918254] | AA-CC | 0.365 | 0.422 | 0.648 | 0.697 | −1.17(0.33) | 0.0004 | 0.018 | |
| [rs17098318-rs880197][rs3918249–rs3918253] | GA-TT | 0.364 | 0.410 | 0.295 | 0.234 | −0.96(0.35) | 0.0056 | 0.084 | |
| [rs17098318–rs880197][rs3918253–rs3918254] | GA-CC | 0.364 | 0.410 | 0.648 | 0.697 | −1.14(0.33) | 0.0006 | 0.018 | |
| [rs11568819–rs11568818][rs3918253–rs3918254] | CA-CC | 0.596 | 0.522 | 0.648 | 0.697 | 0.81(0.31) | 0.0085 | 0.102 | |
Risk haplotypes in the two genes and their frequencies in cases and controls are given.
Figure 1Pairwise significance tests of additive × dominant epistasis between 7 SNPs at MMP7 and 11 SNPs at MMP9 in a genetic association study of sarcoidosis risk.
Estimation of genetic parameters and power of epistatic detection from a simulated data for two independent blocks under Scenario 1 for all 1000 simulations.
| Parameter | True | 250 Cases/250 controls | 500 Cases/500 controls | ||||
|---|---|---|---|---|---|---|---|
| Est | MSE | Power | Est | MSE | Power | ||
| −0.125 | −0.123 | 0.061 | −0.120 | 0.028 | |||
| −0.275 | −0.301 | 0.062 | −0.293 | 0.029 | |||
| −0.485 | 0.484 | 0.115 | −0.472 | 0.057 | |||
| −0.363 | −0.392 | 0.174 | −0.374 | 0.092 | |||
| −0.125 | −0.123 | 0.062 | −0.120 | 0.028 | |||
| 0.810 | 0.827 | 0.110 | 0.719 | 0.810 | 0.050 | 0.961 | |
| −0.506 | −0.481 | 0.117 | −0.498 | 0.055 | |||
| 0.301 | 0.321 | 0.197 | 0.296 | 0.103 | |||
| 0.087 | 0.122 | ||||||
| 0.065 | 0.074 | ||||||
| 0.129 | 0.214 | ||||||
| 0.050 | 0.056 | ||||||
| 0.081 | 0.115 | ||||||
| 0.852 | 0.996 | ||||||
| 0.756 | 0.969 | ||||||
| 0.244 | 0.430 | ||||||
.
.
Type I error rate for all epistatic detection of two independent blocks under Scenario 3 for 1000 simulations.
| Parameter | True | 250 Cases/250 controls | 500 Cases/500 controls | ||||
|---|---|---|---|---|---|---|---|
| Est | MSE | Type I error | Est | MSE | Type I error | ||
| 0 | −0.127 | 0.070 | 0.049 | −0.127 | 0.044 | 0.050 | |
| 0 | −0.008 | 0.098 | 0.054 | 0.004 | 0.047 | 0.048 | |
| 0 | 0.019 | 0.102 | 0.043 | 0.007 | 0.055 | 0.053 | |
| 0 | 0.037 | 0.188 | 0.045 | 0.005 | 0.093 | 0.049 | |
| 0.060 | 0.050 | ||||||
| 0.064 | 0.043 | ||||||
| 0.060 | 0.050 | ||||||
| 0.048 | 0.049 | ||||||
| 0.050 | 0.044 | ||||||
| 0.156 | 0.235 | ||||||
| 0.046 | 0.098 | ||||||
| 0.062 | 0.081 | ||||||
.
.
| Disease status ( | Composite diplotype | Total | ||
|---|---|---|---|---|
| Cases | m2 | m1 | m0 | |
| Controls | n2 | n1 | n0 | |
Estimation and significance test (corrected for multiple comparisons using FDR) of genetic effects on sarcoidosis risk by haplotypes composed of SNPs from MMP7 and MMP9.
| Disease status | Composite diplotype | Total | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| (22) | (21) | (20) | (12) | (11) | (10) | (02) | (01) | (00) | ||
| Cases | ||||||||||
| Controls | ||||||||||
Estimation of genetic parameters and power of epistatic detection from a simulated data for two independent blocks under Scenario 2 for all 1000 simulations.
| Parameter | True | 250 Cases/250 controls | 500 Cases/500 controls | ||||
|---|---|---|---|---|---|---|---|
| Est | MSE | Power | Est | MSE | Power | ||
| 0.341 | 0.357 | 0.053 | 0.353 | 0.024 | |||
| −0.713 | −0.735 | 0.051 | −0.725 | 0.024 | |||
| −0.459 | −0.493 | 0.117 | −0.467 | 0.051 | |||
| −0.746 | −0.784 | 0.101 | 0.731 | −0.749 | 0.044 | 0.950 | |
| 0.341 | 0.357 | 0.050 | 0.353 | 0.024 | |||
| −0.210 | −0.230 | 0.089 | −0.231 | 0.045 | |||
| −0.499 | −0.500 | 0.104 | −0.499 | 0.050 | |||
| 1.038 | 1.089 | 0.202 | 0.734 | 1.053 | 0.083 | 0.955 | |
| 0.050 | 0.040 | ||||||
| 0.053 | 0.057 | ||||||
| 0.042 | 0.046 | ||||||
| 0.052 | 0.049 | ||||||
| 0.365 | 0.600 | ||||||
| 0.438 | 0.715 | ||||||
| 0.995 | 1.000 | ||||||
| 0.696 | 0.929 | ||||||
.
.