| Literature DB >> 26126977 |
Cheng-Hong Yang1, Yu-Da Lin2, Cheng-San Yang3, Li-Yeh Chuang4.
Abstract
BACKGROUND: Multifactor dimensionality reduction (MDR) is widely used to analyze interactions of genes to determine the complex relationship between diseases and polymorphisms in humans. However, the astronomical number of high-order combinations makes MDR a highly time-consuming process which can be difficult to implement for multiple tests to identify more complex interactions between genes. This study proposes a new framework, named fast MDR (FMDR), which is a greedy search strategy based on the joint effect property.Entities:
Mesh:
Year: 2015 PMID: 26126977 PMCID: PMC4487567 DOI: 10.1186/s12864-015-1717-8
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1FMDR flowchart
A paired t-test comparison of the power analysis between MDR and FMDR for 2- to 5-loci
| Model1 | 2-loci | 3-loci | 4-loci | 5-loci | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| MDR | FMDR |
| MDR | FMDR |
| MDR | FMDR |
| MDR | FMDR |
| |
| Mean (SD) | Mean (SD) | Mean (SD) | Mean (SD) | Mean (SD) | Mean (SD) | Mean (SD) | Mean (SD) | |||||
| Model 1 | 0.12 (±0.11) | 0.12 (±0.11) | −3 | 0.27 (±0.21) | 0.27 (±0.21) | − | 0.67 (±0.21) | 0.67 (±0.21) | 0.17 | 0.94 (±0.08) | 0.91 (±0.11) | <0.001 |
| Model 2 | 0.05 (±0.00) | 0.05 (±0.00) | − | 0.24 (±0.18) | 0.24 (±0.18) | − | 0.78 (±0.14) | 0.78 (±0.14) | 0.65 | 0.96 (±0.04) | 0.96 (±0.05) | 0.001 |
| Model 3 | 0.05 (±0.00) | 0.05 (±0.00) | − | 0.73 (±0.14) | 0.72 (±0.14) | 0.01 | 0.90 (±0.08) | 0.88 (±0.09) | 0.003 | 0.99 (±0.01) | 0.99 (±0.01) | <0.001 |
| Model 4 | 0.24 (±0.10) | 0.24 (±0.10) | − | 0.50 (±0.14) | 0.50 (±0.14) | 0.32 | 0.88 (±0.09) | 0.87 (±0.09) | 0.01 | 0.99 (±0.02) | 0.98 (±0.03) | 0.001 |
| Model 5 | 0.06 (±0.01) | 0.06 (±0.01) | − | 0.50 (±0.13) | 0.50 (±0.12) | 0.89 | 0.95 (±0.06) | 0.91 (±0.10) | <0.001 | 1.00 (±0.00) | 1.00 (±0.00) | − |
| Model 6 | 0.05 (±0.00) | 0.05 (±0.00) | − | 0.11 (±0.04) | 0.11 (±0.04) | − | 0.70 (±0.12) | 0.70 (±0.12) | 0.001 | 1.00 (±0.00) | 1.00 (±0.00) | 0.001 |
1Model 1: MAF = 0.1, sample = 800 (400 cases and 400 controls), Model 2: MAF = 0.1, sample = 1600 (800 cases and 800 controls), Model 3: MAF = 0.2, sample = 800 (400 cases and 400 controls), Model 4: MAF = 0.2, sample = 1600 (800 cases and 800 controls), Model 5: MAF = 0.4, sample = 800 (400 cases and 400 controls), Model 6: MAF = 0.4, sample = 1600 (800 cases and 800 controls); 2 P-value were estimated from pairwise t-test; 3-: the same power analyses between MDR and FMDR
Fig. 2Performance comparison between MDR and FMDR on six simulated models for different minor allele frequencies (MAFs) and different sample sizes (a–f of Fig. 2). For all models, heritability h 2 = 0.2, and MAF includes 0.1, 0.2, and 0.4. For each model, 100 datasets are generated by randomly sorted samples. The figures show the box plot, where the boundary of the box closest to zero indicates the 25th percentile, a line within the box marks the median, and the boundary of the box farthest from zero indicates the 75th percentile. Error bars near the top and bottom of the boxes respectively indicate the 90th and 10th percentiles
Fig. 3MDR and FMDR execution times on six simulated models for different MAFs and different sample sizes (a-f of Fig. 3). The horizontal axis represents the execution time in log10 milliseconds, while the vertical axis represents the number of loci in the model
Analysis results of the chronic dialysis data sets from MDR and FMDR
| Models | Method | 3-loci | 4-loci | 5-loci | 6-loci | ||||
|---|---|---|---|---|---|---|---|---|---|
| MDR | FMDR | MDR | FMDR | MDR | FMDR | MDR | FMDR | ||
| Best | |||||||||
| Candidate model | 40,56,64 | 40,56,64 | 21,59,64,71 | 21,59,64,71 | 21,59,62, 64,71 | 21,59,62, 64,71 | 21,45,59, 62,64,71 | 21,45,59, 62,64,71 | |
| Consistency | 2 / 10 | 2 / 10 | 4 / 10 | 2 / 10 | 1 / 10 | 1 / 10 | 2 / 10 | 3 / 10 | |
| Accuracy | 0.56 | 0.56 | 0.58 | 0.58 | 0.58 | 0.58 | 0.60 | 0.60 | |
|
| 1.79 | 1.79 | 1.91 | 1.91 | 2.13 | 2.13 | 2.30 | 2.30 | |
| (1.27–2.52) | (1.27–2.52) | (1.38–2.63) | (1.38–2.63) | (1.54–2.94) | (1.54–2.94) | (1.66–3.18) | (1.66–3.18) | ||
| Worst | |||||||||
| candidate model | 45,56,62 | 45,56,65 | 40,45,56,62 | 19,21,34, 56 | 41,43,45, 56,62 | 40,56,64, 71,77 | 4,21,56, 60,62,70 | 40,21,56, 64,71,77 | |
| Consistency | 1 / 10 | 1 / 10 | 2/10 | 1/10 | 1 / 10 | 1 / 10 | 1 / 10 | 1 / 10 | |
| Accuracy | 0.56 | 0.56 | 0.57 | 0.56 | 0.57 | 0.57 | 0.58 | 0.59 | |
|
| 1.64 | 1.65 | 1.82 | 1.78 | 1.86 | 1.90 | 2.13 | 2.28 | |
| (1.18–2.27) | (1.19–2.31) | (1.31–2.54) | (1.27–2.49) | (1.34–2.60) | (1.36–2.64) | (1.50–3.03) | (1.59–3.26) | ||
| Accuracy | |||||||||
| Mean (SD) | 0.56 (±0.00) | 0.56 (±0.00) | 0.58 (±0.01) | 0.58 (±0.01) | 0.58 (±0.002) | 0.58 (±0.002) | 0.60 (±0.01) | 0.60 (±0.01) | |
|
| −2 | 0.07 | 0.32 | 0.85 | |||||
|
| |||||||||
| Mean (SD) | 1.73 (±0.05) | 1.73 (±0.05) | 2.04 (±0.19) | 2.03 (±0.19) | 2.21 (±0.18) | 2.17 (±0.19) | 2.31 (±0.08) | 2.30 (±0.08) | |
|
| 0.66 | 0.44 | 0.03 | 0.46 | |||||
| Power | |||||||||
| Mean (SD) | 0.97 (±0.03) | 0.99 (±0.02) | 0.99 (±0.02) | 0.99 (±0.02) | 0.99 (±0.003) | 0.99 (±0.002) | 0.99 (±0.004) | 0.99 (±0.003) | |
|
| <0.001 | 0.83 | 0.71 | <0.001 | |||||
1 P-value were estimated from pairwise t-test; 2-: the same accuracies between MDR and FMDR
Fig. 4Performance comparison between MDR and FMDR for the chronic dialysis data set. (a) accuracy box plot for MDR and FMDR, (b) OR box plot for MDR and FMDR, and (c) power analysis box plot for MDR and FMDR for 100 tests. For each test, the samples in the data set are randomly sorted, and then applied to MDR and FMDR
Fig. 5Execution times of MDR and FMDR for the real data set with chronic dialysis. The horizontal axis represents execution time in log10 milliseconds, while the vertical axis represents the number of loci in the model