| Literature DB >> 32908899 |
Ying Yin1, Boxin Guan1, Yuhai Zhao1, Yuan Li2.
Abstract
Detecting SNP-SNP interactions associated with disease is significant in genome-wide association study (GWAS). Owing to intensive computational burden and diversity of disease models, existing methods have drawbacks on low detection power and long running time. To tackle these drawbacks, a fast self-adaptive memetic algorithm (SAMA) is proposed in this paper. In this method, the crossover, mutation, and selection of standard memetic algorithm are improved to make SAMA adapt to the detection of SNP-SNP interactions associated with disease. Furthermore, a self-adaptive local search algorithm is introduced to enhance the detecting power of the proposed method. SAMA is evaluated on a variety of simulated datasets and a real-world biological dataset, and a comparative study between it and the other four methods (FHSA-SED, AntEpiSeeker, IEACO, and DESeeker) that have been developed recently based on evolutionary algorithms is performed. The results of extensive experiments show that SAMA outperforms the other four compared methods in terms of detection power and running time.Entities:
Mesh:
Year: 2020 PMID: 32908899 PMCID: PMC7468611 DOI: 10.1155/2020/5610658
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Figure 1The framework of MA.
Algorithm 1SAMA.
Algorithm 2HC.
Algorithm 3DBM.
Algorithm 4SLS.
Figure 2A running instance of SAMA.
Details of three two-locus disease models.
| MAF | 0.05 | 0.10 | 0.20 | 0.50 | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| AA | Aa | aa | AA | Aa | aa | AA | Aa | aa | AA | Aa | aa | ||||
| Model 1 (P(D) = 0.1, h2 = 0.005) | |||||||||||||||
| BB | 0.098 | 0.098 | 0.098 | BB | 0.096 | 0.096 | 0.096 | BB | 0.092 | 0.092 | 0.092 | BB | 0.078 | 0.078 | 0.078 |
| Bb | 0.098 | 0.299 | 0.522 | Bb | 0.096 | 0.197 | 0.282 | Bb | 0.092 | 0.145 | 0.181 | Bb | 0.078 | 0.105 | 0.122 |
| bb | 0.098 | 0.522 | 0.912 | Bb | 0.096 | 0.282 | 0.408 | Bb | 0.092 | 0.181 | 0.227 | Bb | 0.078 | 0.122 | 0.142 |
| Model 2 (P(D) = 0.1, h2 = 0.02) | |||||||||||||||
| BB | 0.096 | 0.096 | 0.096 | BB | 0.092 | 0.092 | 0.092 | BB | 0.084 | 0.084 | 0.084 | BB | 0.052 | 0.052 | 0.052 |
| Bb | 0.096 | 0.533 | 0.533 | Bb | 0.092 | 0.319 | 0.319 | Bb | 0.084 | 0.210 | 0.210 | Bb | 0.052 | 0.138 | 0.138 |
| bb | 0.096 | 0.533 | 0.533 | Bb | 0.092 | 0.319 | 0.319 | Bb | 0.084 | 0.210 | 0.210 | Bb | 0.052 | 0.138 | 0.138 |
| Model 3 (P(D) = 0.1, h2 = 0.02) | |||||||||||||||
| BB | 0.080 | 0.192 | 0.192 | BB | 0.072 | 0.164 | 0.164 | BB | 0.061 | 0.146 | 0.146 | BB | 0.067 | 0.155 | 0.155 |
| Bb | 0.192 | 0.080 | 0.080 | Bb | 0.164 | 0.072 | 0.072 | Bb | 0.146 | 0.061 | 0.061 | Bb | 0.155 | 0.067 | 0.067 |
| bb | 0.192 | 0.080 | 0.080 | Bb | 0.164 | 0.072 | 0.072 | Bb | 0.146 | 0.061 | 0.061 | Bb | 0.155 | 0.067 | 0.067 |
Parameter setting of five algorithms.
| Algorithm | Parameters |
|---|---|
| SAMA | The crossover probabilities pc1 and pc2 = 0.8; the switch parameter |
| FHSA-SED | The harmony memory considering rate HMCR =0.9; the pitch-adjusting rate PAR =0.35; the number of harmonies evaluated with Bayesian network scoring ‖HM1‖ = 250; the number of harmonies evaluated with Gini scoring ‖HM2‖ = 250 |
| AntEpiSeeker | The size of large SNP sets largesetsize = 6; the size of small SNP sets smallsetsize = 3; the weight parameters |
| IEACO | The switch parameter |
| DESeeker | The number of SNPs in a large size SNP combination W = 6; the number of vectors M = 500 |
Figure 3Power comparison of five compared algorithms on the datasets with 200 SNPs.
Figure 4Power comparison of five compared algorithms on the datasets with 2000 SNPs.
Running time of five compared algorithms on the datasets with 200 SNPs.
| Model | MAF | SAMA | FHSA-SED | AntEpiSeeker | IEACO | DESeeker |
|---|---|---|---|---|---|---|
| Model 1 | 0.05 | 9.12 ± 0.53 | 10.55 ± 0.59 | 46.63 ± 2.31 | 11.21 ± 0.76 | 10.03 ± 0.64 |
| 0.10 | 8.97 ± 0.51 | 10.32 ± 0.53 | 48.52 ± 2.40 | 12.45 ± 0.81 | 9.89 ± 0.70 | |
| 0.20 | 9.32 ± 0.49 | 10.47 ± 0.58 | 47.71 ± 2.29 | 10.93 ± 0.79 | 9.93 ± 0.66 | |
| 0.50 | 9.55 ± 0.44 | 10.62 ± 0.62 | 45.63 ± 1.99 | 13.06 ± 0.82 | 10.32 ± 0.73 | |
|
| ||||||
| Model 2 | 0.05 | 9.53 ± 0.48 | 11.04 ± 0.65 | 48.57 ± 2.37 | 10.90 ± 0.71 | 10.54 ± 0.77 |
| 0.10 | 9.29 ± 0.57 | 10.86 ± 0.68 | 49.12 ± 2.30 | 11.35 ± 0.66 | 9.98 ± 0.69 | |
| 0.20 | 8.86 ± 0.46 | 11.06 ± 0.64 | 46.83 ± 2.12 | 12.52 ± 0.73 | 10.74 ± 0.65 | |
| 0.50 | 9.22 ± 0.50 | 10.75 ± 0.70 | 46.89 ± 2.06 | 11.83 ± 0.68 | 9.76 ± 0.59 | |
|
| ||||||
| Model 3 | 0.05 | 9.06 ± 0.55 | 10.63 ± 0.63 | 50.02 ± 2.55 | 12.04 ± 0.74 | 10.63 ± 0.62 |
| 0.10 | 9.52 ± 0.59 | 11.05 ± 0.68 | 47.74 ± 2.19 | 11.67 ± 0.80 | 10.72 ± 0.58 | |
| 0.20 | 9.32 ± 0.51 | 10.64 ± 0.57 | 48.82 ± 2.49 | 12.42 ± 0.69 | 9.48 ± 0.61 | |
| 0.50 | 9.94 ± 0.60 | 10.74 ± 0.61 | 45.90 ± 2.05 | 11.53 ± 0.78 | 9.80 ± 0.65 | |
Running time of five compared algorithms on the datasets with 2000 SNPs.
| Model | MAF | SAMA | FHSA-SED | AntEpiSeeker | IEACO | DESeeker |
|---|---|---|---|---|---|---|
| Model 1 | 0.05 | 84.63 ± 3.76 | 98.74 ± 5.32 | 431.53 ± 11.57 | 108.64 ± 5.96 | 97.56 ± 4.97 |
| 0.10 | 87.53 ± 4.02 | 103.63 ± 5.67 | 427.87 ± 10.94 | 109.42 ± 6.03 | 100.55 ± 5.17 | |
| 0.20 | 90.89 ± 3.90 | 98.85 ± 5.15 | 442.35 ± 10.52 | 111.34 ± 6.12 | 99.74 ± 5.20 | |
| 0.50 | 88.16 ± 3.95 | 101.15 ± 4.96 | 425.84 ± 12.02 | 104.44 ± 6.04 | 103.85 ± 5.06 | |
|
| ||||||
| Model 2 | 0.05 | 91.48 ± 4.12 | 97.88 ± 4.87 | 435.14 ± 12.53 | 110.45 ± 5.64 | 102.66 ± 5.07 |
| 0.10 | 89.86 ± 3.79 | 100.56 ± 5.04 | 448.57 ± 10.89 | 102.63 ± 6.23 | 98.85 ± 5.12 | |
| 0.20 | 89.17 ± 4.03 | 99.95 ± 4.78 | 459.84 ± 11.78 | 101.34 ± 5.98 | 105.05 ± 5.31 | |
| 0.50 | 92.74 ± 3.87 | 100.13 ± 4.83 | 418.52 ± 10.97 | 105.65 ± 5.95 | 104.43 ± 5.13 | |
|
| ||||||
| Model 3 | 0.05 | 90.63 ± 3.93 | 97.73 ± 5.01 | 451.45 ± 12.32 | 112.56 ± 6.46 | 101.89 ± 5.44 |
| 0.10 | 86.73 ± 3.89 | 103.54 ± 5.21 | 432.85 ± 11.67 | 109.93 ± 6.15 | 104.92 ± 5.19 | |
| 0.20 | 87.83 ± 4.07 | 96.97 ± 4.89 | 429.50 ± 12.02 | 113.56 ± 5.96 | 99.71 ± 5.08 | |
| 0.50 | 90.09 ± 3.86 | 101.34 ± 5.36 | 440.86 ± 12.63 | 114.37 ± 6.07 | 103.67 ± 5.32 | |
Figure 5The number of two-locus SNP-SNP interactions detected by five algorithms.
Results of two-locus SNP-SNP interactions detected by SAMA on AMD dataset.
| SNP 1 | Gene | SNP 2 | Gene | p values |
|---|---|---|---|---|
| rs380390 | CFH | rs1363688 | NA | <1.0e-08 |
| rs380390 | CFH | rs2224762 | KDM4C | |
| rs380390 | CFH | rs555174 | NA | |
| rs380390 | CFH | rs1374431 | NA | |
| rs380390 | CFH | rs1740752 | NA | |
|
| ||||
| rs1329428 | CFH | rs7467596 | MED27 | <1.0e-07 |
| rs1329428 | CFH | rs9328536 | MED27 | |
| rs1329428 | CFH | rs3922799 | NA | |
| rs1329428 | CFH | rs10489076 | NA | |
| rs1740752 | N/A | rs3009336 | NA | |
| rs380390 | CFH | rs718263 | NCALD | |
| rs380390 | CFH | rs223607 | NA | |
| rs380390 | CFH | rs620511 | NA | |
| rs380390 | CFH | rs2178692 | COPS7A | |
| rs380390 | CFH | rs34512 | NA | |
| rs380390 | CFH | rs3853728 | EGFEM1P | |
| rs380390 | CFH | rs210758 | NA | |
| rs380390 | CFH | rs2446023 | ZNF518A | |
| rs380390 | CFH | rs2167167 | NA | |
| rs380390 | CFH | rs956275 | PPAT | |
|
| ||||
| rs380390 | CFH | rs1896373 | NA | <1.0e-06 |
| rs380390 | CFH | rs1896373 | NA | |
| rs380390 | CFH | rs143627607 | DDX3X | |
| rs1329428 | CFH | rs10504043 | ANK1 | |
| rs1329428 | CFH | rs10272438 | BBS9 | |
| rs1329428 | CFH | rs2695214 | PPP3CA | |
| rs1329428 | CFH | rs78812154 | NA | |
| rs1329428 | CFH | rs74412587 | NA | |
| rs1329428 | CFH | rs1363688 | NA | |
| rs1329428 | CFH | rs9328536 | MED27 | |
| rs1740752 | NA | rs943008 | NEDD9 | |
Number of two-locus SNP-SNP interactions detected by SAMA under different parameters.
|
| 0.1 | 0.2 | 0.3 | 0.4 | 0.5 | 0.6 | 0.7 | 0.8 | 0.9 |
|---|---|---|---|---|---|---|---|---|---|
| pc1 and pc2 | |||||||||
| .1 | 9 | 12 | 14 | 17 | 19 | 18 | 17 | 13 | 10 |
| .2 | 12 | 14 | 17 | 20 | 23 | 21 | 18 | 16 | 11 |
| .3 | 13 | 13 | 16 | 19 | 21 | 18 | 20 | 16 | 13 |
| .4 | 13 | 15 | 16 | 20 | 24 | 21 | 21 | 18 | 18 |
| .5 | 16 | 17 | 17 | 23 | 30 | 25 | 23 | 20 | 19 |
| .6 | 15 | 17 | 18 | 24 | 28 | 25 | 25 | 22 | 17 |
| .7 | 15 | 13 | 18 | 25 | 27 | 26 | 27 | 21 | 19 |
| .8 | 14 | 14 | 22 | 28 | 31 | 30 | 27 | 25 | 26 |
| .9 | 12 | 13 | 17 | 23 | 29 | 25 | 26 | 22 | 21 |