| Literature DB >> 34930147 |
Yu Zhong Peng1,2, Yanmei Lin1, Yiran Huang3, Ying Li1, Guangsheng Luo2, Jianping Liao4.
Abstract
BACKGROUND: Identification of epistatic interactions provides a systematic way for exploring associations among different single nucleotide polymorphism (SNP) and complex diseases. Although considerable progress has been made in epistasis detection, efficiently and accurately identifying epistatic interactions remains a challenge due to the intensive growth of measuring SNP combinations.Entities:
Keywords: Epistasis Analysis; Epistatic Interactions; Evolutionary Algorithm; Gene Expression Programming; Single Nucleotide Polymorphisms
Mesh:
Year: 2021 PMID: 34930147 PMCID: PMC8686218 DOI: 10.1186/s12864-021-08207-8
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1The framework of GEP-EpiSeeker
Fig. 2Power performance comparisons between GEP-EpiSeeker and other comparative methods on four multiplicative DME disease models
Fig. 3Power performance comparisons between GEP-EpiSeeker and other comparative methods on four threshold DME disease models
Fig. 4Power performance comparisons between GEP-EpiSeeker and other comparative methods on four concrete DME disease models
Fig. 5Power performance comparisons between GEP-EpiSeeker and other comparative methods on five different DNME disease models with h2=0.01 and MAF=0.2
Fig. 6Power performance comparisons between the GEP-EpiSeeker and other comparative methods on the five different DNME disease models with h2=0.01 and MAF=0.2
The recall of the comparative methods with different disease models
| model id | BOOST | BEAM | AntEpiSeeker | MACOED | EACO | GEP-EpiSeeker |
|---|---|---|---|---|---|---|
| model 1 | 0.05±0.00 (0.00) | 0.02±0.00 (0.00) | 0.01±0.00 (0.00) | 0.02±0.00 (0.00) | 0.09±0.00 (0.00) |
|
| model 2 | 0.05±0.01 (0.00) | 0±0.00 (0.00) | 0±0.00 (0.00) | 0.04±0.00 (0.00) | 0.11±0.01 (0.00) |
|
| model 3 | 0.01±0.00 (0.00) | 0±0.00 (0.00) | 0.19±0.00 (0.00) | 0.27±0.02 (0.00) | 0.34±0.01 (0.00) |
|
| model 4 | 0.01±0.00 (0.00) | 0±0.00 (0.00) | 0.31±0.02 (0.00) | 0.37±0.01 (0.00) | 0.39±0.01 (0.00) |
|
| model 5 | 0.58±0.03 (0.00) | 0.58±0.002 (0.00) | 0.35±0.03 (0.00) | 0.43±0.03 (0.00) | 0.62±0.03 (0.00) |
|
| model 6 | 0.70±0.02 (0.00) | 0.45±0.03 (0.00) | 0.82±0.04 (0.00) | 0.94±0.01 (0.00) | 0.94±0.02 (0.03) |
|
| model 7 | 0.74±0.03 (0.00) | 0.19±0.02 (0.00) | 0.84±0.03 (0.00) |
| 0.98±0.02 (0.03) | 0.96±0.02 |
| model 8 | 0.12±0.01 (0.00) | 0.03±0.00 (0.00) | 0.97±0.02 (0.00) | 0.98±0.02 (0.00) | 0.98±0.01 (0.00) |
|
| model 9 | 0.10±0.02 (0.00) | 0.92±0.02 (0.00) | 0.97±0.01 (0.00) | 0.97±0.02 (0.00) |
| 0.95±0.03 |
| model 10 | 0.84±0.03 (0.00) | 0.84±0.02 (0.00) | 0.98±0.01 (0.00) | 0.99±0.01 (0.00) | 0.98±0.01 (0.00) |
|
| model 11 | 0.98±0.02 | 0.13±0.03 (0.00) | 0.84±0.03 (0.00) | 0.97±0.01 (0.00) | 0.97±0.01 (0.00) |
|
| model 12 | 0.96±0.02 (0.00) | 0.11±0.03 (0.00) | 0.95±0.03 (0.00) | 0.98±0.02 (0.05) | 0.97±0.02 (0.00) |
|
| model 13 | 0.56±0.04 (0.00) | 0±0.00 (0.00) | 0.2±0.02 (0.00) | 0.20±0.03 (0.00) | 0.44±0.04 (0.00) |
|
| model 14 | 0.63±0.02 (0.00) | 0±0.00 (0.00) | 0.19±0.03 (0.00) | 0.22±0.03 (0.00) | 0.63±0.02 (0.00) |
|
| model 15 | 0.62±0.02 (0.02) | 0±0.00 (0.00) | 0.20±0.03 (0.00) | 0.20±0.02 (0.00) | 0.60±0.02 (0.00) |
|
| model 16 | 0.96±0.02 (0.12) | 0±0.00 (0.00) | 0.63±0.04 (0.00) | 0.64±0.02 (0.00) | 0.93±0.02 (0.00) |
|
| model 17 | 0.41±0.02 (0.00) | 0±0.00 (0.00) | 0.11±0.02 (0.00) | 0.12±0.03 (0.00) | 0.53±0.02 (0.00) |
|
| model 18 |
| 0±0.00 (0.00) | 0.75±0.03 (0.00) | 0.98±0.02 (0.00) | 0.98±0.02 (0.00) | 0.95±0.02 |
| model 19 | 0.59±0.02 (0.00) | 0±0.00 (0.00) | 0.11±0.03 (0.00) | 0.20±0.04 (0.00) | 0.61±0.02 (0.00) |
|
| model 20 |
| 0±0.00 (0.00) | 0.80±0.03 (0.00) |
|
|
|
| model 21 |
| 0±0.00 (0.00) | 0.81±0.03 (0.00) |
| 0.97±0.02 (0.00) |
|
| model 22 | 0.97±0.02 (0.02) | 0±0.00 (0.00) | 0.62±0.03 (0.00) | 0.97±0.02 (0.03) | 0.97±0.01 (0.00) |
|
Note that the values in brackets are the p-values of the t-test between results of GEP-EpiSeeker and the corresponding comparative method. The best performances of each disease model are shown in bold and italics
The precision of the comparative methods with different disease models
| model id | BOOST | BEAM | AntEpiSeeker | MACOED | EACO | GEP-EpiSeeker |
|---|---|---|---|---|---|---|
| model 1 | 0.15±0.02 (0.00) | 0.10±0.02 (0.00) | 0.25±0.02 (0.00) | 0.42±0.03 (0.00) | 0.51±0.03 (0.00) |
|
| model 2 | 0±0.00 (0.00) | 0.11±0.02 (0.00) | 0±0.00 (0.00) | 0.85±0.03 (0.01) | 0.81±0.03 (0.00) |
|
| model 3 | 0±0.00 (0.00) | 0.01±0.01 (0.00) | 0.66±0.04 (0.00) | 0.74±0.03 (0.00) | 0.71±0.03 (0.00) |
|
| model 4 | 0±0.00 (0.00) | 0.02±0.02 (0.00) | 0.68±0.04 (0.00) | 0.44±0.03 (0.00) | 0.65±0.03 (0.00) |
|
| model 5 | 0.50±0.03 (0.00) | 0.71±0.03 (0.00) | 0.90±0.03 (0.00) | 0.96±0.02 (0.00) | 0.96±0.03 (0.01) |
|
| model 6 | 0.55±0.04 (0.00) | 0.45±0.03 (0.00) | 0.91±0.03 (0.00) |
| 0.92±0.02 (0.00) | 0.96±0.01 |
| model 7 | 0.50±0.03 (0.00) | 0.12±0.03 (0.00) | 0.92±0.03 (0.00) | 0.96±0.02 (0.11) | 0.94±0.02 (0.00) |
|
| model 8 | 0.12±0.03 (0.00) | 0.01±0.02 (0.00) | 0.98±0.02 (1.00) | 0.94±0.02 (0.00) | 0.98±0.02 (1.00) |
|
| model 9 | 0.13±0.03 (0.00) | 0.76±0.03 (0.00) | 0.96±0.02 (0.00) | 0.99±0.01 (0.01) | 0.97±0.02 (0.00) |
|
| model 10 | 0.57±0.04 (0.00) | 0.75±0.03 (0.00) | 0.98±0.02 (0.04) | 0.98±0.02 (0.04) | 0.98±0.02 (0.04) |
|
| model 11 | 0.63±0.04 (0.00) | 0.34±0.04 (0.00) | 0.98±0.02 (0.04) | 0.98±0.02 (0.04) | 0.98±0.02 (0.04) |
|
| model 12 | 0.65±0.04 (0.00) | 0.02±0.01 (0.00) | 0.96±0.02 (0.00) | 0.99±0.02 (0.05) | 0.98±0.02 (0.00) |
|
| model 13 | 0.51±0.03 (0.00) | 0±0.00 (0.00) | 0.92±0.03 (0.00) | 0.95±0.02 (0.04) | 0.92±0.02 (0.00) |
|
| model 14 | 0.52±0.03 (0.00) | 0±0.00 (0.00) | 0.86±0.03 (0.00) | 0.88±0.02 (0.00) | 0.87±0.02 (0.00) |
|
| model 15 | 0.45±0.03 (0.00) | 0±0.00 (0.00) | 0.83±0.03 (0.00) | 0.83±0.02 (0.00) | 0.84±0.02 (0.00) |
|
| model 16 | 0.65±0.04 (0.00) | 0±0.00 (0.00) | 0.92±0.02 (0.00) |
| 0.98±0.02 (0.04) |
|
| model 17 | 0.41±0.04 (0.00) | 0±0.00 (0.00) | 0.71±0.03 (0.00) |
| 0.74±0.02 (0.00) | 0.68±0.03 |
| model 18 | 0.68±0.04 (0.00) | 0±0.00 (0.00) | 0.93±0.02 (0.00) | 0.97±0.02 (0.05) | 0.97±0.02 (0.05) |
|
| model 19 | 0.48±0.03 (0.00) | 0±0.00 (0.00) | 0.85±0.04 (0.00) | 0.92±0.02 (0.00) | 0.92±0.02 (0.00) |
|
| model 20 | 0.62±0.03 (0.00) | 0±0.00 (0.00) | 0.98±0.02 (0.00) |
|
|
|
| model 21 | 0.61±0.04 (0.00) | 0±0.00 (0.00) |
| 0.96±0.02 (0.00) |
|
|
| model 22 | 0.62±0.04 (0.00) | 0±0.00 (0.00) | 0.97±0.02 (0.04) |
|
|
|
Note that the values in brackets are the p-values of the t-test between results of GEP-EpiSeeker and the corresponding comparative method. The best performances of each disease model are shown in bold and italics
The F1 of the comparative methods with different disease models
| model id | BOOST | BEAM | AntEpiSeeker | MACOED | EACO | GEP-EpiSeeker |
|---|---|---|---|---|---|---|
| model 1 | 0.08±0.02 (0.00) | 0.03±0.02 (0.00) | 0.02±0.03 (0.00) | 0.04±0.02 (0.00) | 0.15±0.03 (0.00) |
|
| model 2 | 0±0.00 (0.00) | 0±0.00 (0.00) | 0±0.00 (0.00) | 0.08±0.02 (0.00) | 0.19±0.04 (0.00) |
|
| model 3 | 0±0.00 (0.00) | 0±0.00 (0.00) | 0.30±0.04 (0.00) | 0.40±0.04 (0.00) | 0.46±0.03 (0.00) |
|
| model 4 | 0±0.00 (0.00) | 0±0.00 (0.00) | 0.43±0.04 (0.00) | 0.40±0.03 (0.00) | 0.49±0.03 (0.00) |
|
| model 5 | 0.54±0.03 (0.00) | 0.64±0.02 (0.00) | 0.50±0.04 (0.00) | 0.59±0.03 (0.00) | 0.75±0.04 (0.00) |
|
| model 6 | 0.62±0.03 (0.00) | 0.45±0.02 (0.00) | 0.86±0.02 (0.00) |
| 0.93±0.03 (0.00) | 0.95±0.02 |
| model 7 | 0.60±0.04 (0.00) | 0.15±0.02 (0.00) | 0.88±0.03 (0.00) |
| 0.96±0.02 (1.00) | 0.96±0.02 |
| model 8 | 0.12±0.04 (0.00) | 0.02±0.02 (0.00) | 0.97±0.03 (0.00) | 0.96±0.02 (0.00) | 0.98±0.02 (0.04) |
|
| model 9 | 0.11±0.04 (0.00) | 0.83±0.03 (0.00) | 0.96±0.03 (0.07) |
| 0.97±0.02 (0.98) | 0.97±0.01 |
| model 10 | 0.68±0.04 (0.00) | 0.79±0.03 (0.00) | 0.98±0.02 (0.04) | 0.98±0.02 (0.04) | 0.98±0.02 (0.04) |
|
| model 11 | 0.77±0.03 (0.00) | 0.19±0.03 (0.00) | 0.90±0.03 (0.00) | 0.97±0.02 (0.00) | 0.97±0.02 (0.00) |
|
| model 12 | 0.78±0.03 (0.00) | 0.03±0.02 (0.00) | 0.95±0.02 (0.00) | 0.98±0.02 (0.04) | 0.97±0.03 (0.01) |
|
| model 13 | 0.53±0.03 (0.00) | 0±0.00 (0.00) | 0.33±0.04 (0.00) | 0.33±0.03 (0.00) | 0.60±0.04 (0.00) |
|
| model 14 | 0.57±0.03 (0.00) | 0±0.00 (0.00) | 0.31±0.05 (0.00) | 0.35±0.04 (0.00) | 0.73±0.03 (0.00) |
|
| model 15 | 0.52±0.04 (0.00) | 0±0.00 (0.00) | 0.32±0.04 (0.00) | 0.32±0.04 (0.00) | 0.70±0.04 (0.01) |
|
| model 16 | 0.78±0.03 (0.00) | 0±0.00 (0.00) | 0.75±0.02 (0.00) | 0.78±0.04 (0.00) | 0.95±0.02 (0.00) |
|
| model 17 | 0.41±0.05 (0.00) | 0±0.00 (0.00) | 0.19±0.04 (0.00) | 0.21±0.04 (0.00) |
| 0.61±0.04 |
| model 18 | 0.81±0.03 (0.00) | 0±0.00 (0.00) | 0.83±0.03 (0.00) |
|
| 0.96±0.01 |
| model 19 | 0.53±0.04 (0.00) | 0±0.00 (0.00) | 0.19±0.04 (0.00) | 0.33±0.04 (0.00) | 0.73±0.03 (0.04) |
|
| model 20 | 0.77±0.03 (0.00) | 0±0.00 (0.00) | 0.88±0.05 (0.00) |
|
|
|
| model 21 | 0.75±0.03 (0.00) | 0±0.00 (0.00) | 0.89±0.03 (0.00) | 0.97±0.01 (0.03) | 0.97±0.02 (0.02) |
|
| model 22 | 0.76±0.03 (0.00) | 0±0.00 (0.00) | 0.76±0.03 (0.00) | 0.97±0.01 (0.03) | 0.97±0.01 (0.03) |
|
Note that the values in brackets are the p-values of the t-test between results of GEP-EpiSeeker and the corresponding comparative method. The best performances of each disease model are shown in bold and italics
Fig. 7Comparison of GEP-EpiSeeker using the fuzzy adaptive genetic manipulation rate or not on the recall score
Fig. 8Comparison of GEP-EpiSeeker using the fuzzy adaptive genetic manipulation rate or not on the precision score
Fig. 9Comparison of GEP-EpiSeeker using the fuzzy adaptive genetic manipulation rate or not on the F1 score
Fig. 10Example of GEP Expression Tree (ET)
Fig. 11Flowchart of GEP algorithm
Fig. 12The pseudocode of EpiGEP
Fig. 13The expression trees of an EpiGEP chromosome
Fig. 14Input and output of the membership function
Fig. 15A m-SNP epistasis BN model between disease state y and m SNPs x1, x2, x3, . . . , x
Penetrance functions of the three types of DME epistasis models
| Multiplicative model | Loci 1 | |||
|---|---|---|---|---|
| AA | Aa | aa | ||
| Loci 2 | BB | |||
| Bb | ||||
| bb | ||||
| Threshold model | Loci 1 | |||
| AA | Aa | aa | ||
| Loci 2 | BB | |||
| Bb | α(1+ | |||
| bb | ||||
| Concrete model | Loci 1 | |||
| AA | Aa | aa | ||
| Loci 2 | BB | |||
| Bb | ||||
| bb | ||||
Note: The parameters {α, β} of the model 1~ model 12 are set as {0.0980, 0.7464}, {0.0960, 0.4329}, {0.0921, 0.2526}, {0.0782, 0.1610}, {0. 0958, 4.5647}, {0. 0918, 2.4771}, {0. 0836, 1.5108}, {0.0519, 1.6474}, {0.0804, 1.3856}, {0.0717, 1.2817}, {0.0608, 1.3997} and {0.0671, 1.3070}
Penetrance tables of the twenty-two epistasis models with a different set of parameters
| Model id | Penetrance function | |||||
|---|---|---|---|---|---|---|
| Genotypes (SNP | Genotypes (SNP | |||||
| model 1 | 0.005 | 0.05 | 0.0980 | 0.0980 | 0.0980 | |
| 0.0980 | 0.2989 | 0.5222 | ||||
| 0.0980 | 0.5222 | 0.9121 | ||||
| model 2 | 0.005 | 0.1 | 0.0960 | 0.0960 | 0.0960 | |
| 0.0960 | 0.1971 | 0.2824 | ||||
| 0.0960 | 0.2824 | 0.4047 | ||||
| model 3 | 0.005 | 0.2 | 0.0921 | 0.0921 | 0.0921 | |
| 0.0921 | 0.1445 | 0.1810 | ||||
| 0.0921 | 0.1810 | 0.2266 | ||||
| model 4 | 0.005 | 0.5 | 0.0782 | 0.0782 | 0.0782 | |
| 0.0782 | 0.1054 | 0.1223 | ||||
| 0.0782 | 0.1223 | 0.1420 | ||||
| model 5 | 0.02 | 0.05 | 0.0958 | 0.0958 | 0.0958 | |
| 0.0958 | 0.5331 | 0.5331 | ||||
| 0.0958 | 0.5331 | 0.5331 | ||||
| model 6 | 0.02 | 0.1 | 0.0918 | 0.0918 | 0.0918 | |
| 0.0918 | 0.3192 | 0.3192 | ||||
| 0.0918 | 0.3192 | 0.3192 | ||||
| model 7 | 0.02 | 0.2 | 0.0836 | 0.0836 | 0.0836 | |
| 0.0836 | 0.2099 | 0.2099 | ||||
| 0.0836 | 0.2099 | 0.2099 | ||||
| model 8 | 0.02 | 0.5 | 0.0519 | 0.0519 | 0.0519 | |
| 0.0519 | 0.1374 | 0.1374 | ||||
| 0.0519 | 0.1374 | 0.1374 | ||||
| model 9 | 0.02 | 0.05 | 0.0804 | 0.1918 | 0.1918 | |
| 0.1918 | 0.0804 | 0.0804 | ||||
| 0.1918 | 0.0804 | 0.0804 | ||||
| model 10 | 0.02 | 0.1 | 0.0717 | 0.1636 | 0.1636 | |
| 0.1636 | 0.0717 | 0.0717 | ||||
| 0.1636 | 0.0717 | 0.0717 | ||||
| model 11 | 0.02 | 0.2 | 0.0608 | 0.1459 | 0.1459 | |
| 0.1459 | 0.0608 | 0.0608 | ||||
| 0.1459 | 0.0608 | 0.0608 | ||||
| model 12 | 0.02 | 0.5 | 0.0671 | 0.1548 | 0.1548 | |
| 0.1548 | 0.0671 | 0.0671 | ||||
| 0.1548 | 0.0671 | 0.0671 | ||||
| model 13 | 0.01 | 0.2 | 0.6377 | 0.4884 | 0.3826 | |
| 0.4638 | 0.7645 | 0.9566 | ||||
| 0.5798 | 0.5624 | 0.7189 | ||||
| model 14 | 0.01 | 0.2 | 0.2216 | 0.2758 | 0.1414 | |
| 0.2587 | 0.1690 | 0.4013 | ||||
| 0.2781 | 0.1279 | 0.4196 | ||||
| model 15 | 0.01 | 0.2 | 0.2216 | 0.2758 | 0.1414 | |
| 0.2587 | 0.1690 | 0.4013 | ||||
| 0.2781 | 0.1279 | 0.4196 | ||||
| model 16 | 0.01 | 0.2 | 0.1391 | 0.1882 | 0.2214 | |
| 0.1901 | 0.1114 | 0.0198 | ||||
| 0.2056 | 0.0514 | 0.2530 | ||||
| model 17 | 0.01 | 0.2 | 0.1391 | 0.1882 | 0.2214 | |
| 0.1901 | 0.1114 | 0.0198 | ||||
| 0.2056 | 0.0514 | 0.2530 | ||||
| model 18 | 0.01 | 0.4 | 0.1032 | 0.0634 | 0.1242 | |
| 0.0978 | 0.0858 | 0.0693 | ||||
| 0.0210 | 0.1467 | 0.0595 | ||||
| model 19 | 0.01 | 0.4 | 0.1852 | 0.2908 | 0.2340 | |
| 0.2860 | 0.2009 | 0.2770 | ||||
| 0.2486 | 0.2661 | 0.1657 | ||||
| model 20 | 0.01 | 0.4 | 0.0731 | 0.0418 | 0.0146 | |
| 0.0240 | 0.0639 | 0.0591 | ||||
| 0.0682 | 0.0188 | 0.0946 | ||||
| model 21 | 0.01 | 0.4 | 0.0462 | 0.1275 | 0.0694 | |
| 0.1150 | 0.0667 | 0.0971 | ||||
| 0.1067 | 0.0691 | 0.1085 | ||||
| model 22 | 0.01 | 0.4 | 0.0950 | 0.1222 | 0.1267 | |
| 0.0973 | 0.1294 | 0.0999 | ||||
| 0.2014 | 0.0439 | 0.1222 | ||||