| Literature DB >> 28143579 |
Abstract
BACKGROUND: Susceptible barcode recognition plays an important role in the diagnosis and treatment of complex diseases. Numerous approaches have been proposed to identify risky barcodes involved in the progress of complex diseases. However, some methods only consider differences in barcode frequencies between the control and disease groups; as such, these methods may be partial or even wrong. For example, some barcodes with a high risk ratio yield a low frequency on cases or exhibit a high frequency on controls, which may unreasonable from a statistical point.Entities:
Keywords: Ant colony algorithm; Complex diseases; Epistasis; Single nucleotide polymorphisms
Mesh:
Year: 2017 PMID: 28143579 PMCID: PMC5286784 DOI: 10.1186/s12976-017-0050-0
Source DB: PubMed Journal: Theor Biol Med Model ISSN: 1742-4682 Impact factor: 2.432
Fig. 1Searching multiple potential barcodes by MDMC-ACO. For candidate SNPs, MDMC-ACO considers both epistasis and trait heterogeneity in case–control study. Multiple barcodes generated by MDMC-ACO may related to different subtypes of complex diseases.
Fig. 2OR values of various barcodes with different lengths. IGA denotes OR value of the most risk barcode generated by IGA method. The solution1, solution2 and solution3 are three different risk barcodes generated by MDMC-ACO. The unit of the length of barcodes is one SNP. It means that when the length is 6, there are 6 SNPs in a barcode.
Fig. 3RR values of various barcodes with different lengths. IGA represents RR value of the most risk barcode generated by IGA method. The solution1, solution2 and solution3 are three different risk barcodes which may lead to different subtypes of complex diseases. The unit of the length is one SNP.
Details of solution1
| Epistatic SNPs | Genotype Barcode | Cancer percentage (%) | RR (95% CI) | OR (95% CI) |
|---|---|---|---|---|
| 4-17 | 2-1 | 52.75 | 1.07 (1.0254, 1.1217) | 1.15 (1.0506, 1.2662) |
| 4-17-18 | 2-1-1 | 54.19 | 1.1 (1.0355, 1.1655) | 1.21 (1.0657, 1.3687) |
| 4-17-18-22 | 2-1-1-2 | 55.60 | 1.12 (1.0347, 1.2102) | 1.27 (1.0644, 1.5107) |
| 4-11-17-18-22 | 2-2-1-1-2 | 57.98 | 1.16 (1.0473, 1.2946) | 1.39 (1.0826, 1.788) |
| 4-11-16-17-18-22 | 2-2-2-1-1-2 | 60.00 | 1.2 (1.0347, 1.3982) | 1.51 (1.0356, 2.193) |
| 4-11-12-16-17-18-22 | 2-2-1-2-1-1-2 | 70.27 | 1.41 (1.1403, 1.7373) | 2.37 (1.1702, 4.8032) |
Details of solution2
| Epistatic SNPs | Genotype Barcode | Cancer percentage (%) | RR | OR |
|---|---|---|---|---|
| 2-4 | 0-0 | 55.69 | 1.12 (0.9974, 1.2509) | 1.26 (0.9799, 1.6308) |
| 2-4-6 | 0-0-1 | 57.97 | 1.16 (0.9485, 1.4203) | 1.38 (0.8557, 2.2332) |
| 2-4-6-7 | 1-0-1-0 | 58.82 | 1.18 (0.7903, 1.7525) | 1.43 (0.5437, 3.7583) |
| 2-4-6-7-20 | 0-0-0-0-0 | 66.67 | 1.33 (0.5989, 2.9689) | 2 (0.1813, 22.0689) |
| 2-4-6-7-17-20 | 0-0-2-2-1-1 | 52.38 | 1.05 (0.6965, 1.5760) | 1.1 (0.4668, 2.5929) |
| 2-4-6-7-12-17-20 | 2-2-1-1-0-1-1 | 66.67 | 1.33 (0.5989, 2.9689) | 2 (0.1813, 22.0689) |
Details of solution3
| Epistatic SNPs | Genotype Barcode | Cancer percentage (%) | RR | OR |
|---|---|---|---|---|
| 4-8 | 0-1 | 54.31 | 1.09 (0.9821, 1.2081) | 1.2 (0.9537, 1.4985) |
| 4-6-8 | 0-1-1 | 54.9 | 1.1 (0.9209, 1.3120) | 1.22 (0.8243, 1.8053) |
| 4-6-8-19 | 0-2-1-0 | 57.14 | 1.14 (0.6016, 2.1717) | 1.33 (0.2983, 5.9618) |
| 4-6-8-19-20 | 2-2-2-0-0 | 66.67 | 1.33 (0.5989, 2.9689) | 2 (0.1813, 22.0689) |
| 4-6-8-13-19-20 | 0-0-1-0-1-2 | 55.56 | 1.11 (0.6193, 1.9940) | 1.25 (0.3355, 4.6587) |
| 4-6-8-13-18-19-20 | 0-0-2-2-1-1-2 | 66.67 | 1.33 (0.5989, 2.9689) | 2 (0.1813, 22.0689) |
Statistical analysis of risky barcodes
| Epistatic SNPs | Genotype Barcode |
|
|---|---|---|
| 4-17 | 2-1 | 0.003 |
| 4-17-18 | 2-1-1 | 0.003 |
| 4-17-18-22 | 2-1-1-2 | 0.008 |
| 4-11-17-18-22 | 2-2-1-1-2 | 0.012 |
| 4-11-16-17-18-22 | 2-2-2-1-1-2 | 0.031 |
| 4-11-12-16-17-18-22 | 2-2-1-2-1-1-2 | 0.013 |
| 2-4 | 0-0 | 0.071 |
| 2-4-6 | 0-0-1 | 0.184 |
| 4-8 | 0-1 | 0.121 |
| 4-6-8 | 0-1-1 | 0.319 |