Kwan-Yeung Lee, Kwong-Sak Leung, Nelson L S Tang, Man-Hon Wong.
Abstract
In this paper, we aim to discover genetic factors of psoriasis by exhaustively searching for statistically significant SNP-SNP interactions in two real psoriasis genome-wide association study datasets (phs000019.v1.p1 and phs000982.v1.p1) downloaded from the database of Genotypes and Phenotypes. To deal with the enormous search space, our search algorithm is accelerated with eight biologically plausible interaction patterns and a pre-computed look-up table. Our search discovered several SNPs that have a stronger association with psoriasis when combined with another SNP, and these combinations may be non-linear interactions. Among the top 20 SNP-SNP interactions in terms of pairwise p-value and improvement metric value, we discovered 27 novel potential psoriasis-associated SNPs, most of which are reported to be eQTLs of a number of known psoriasis-associated genes. In addition, we inferred a gene network from the top 10000 SNP-SNP interactions in terms of improvement metric value and discovered a novel long-distance interaction between XXbac-BPG154L12.4 and RNU6-283P, which is not a long-distance haplotype and may be a new discovery. Finally, our experiments with synthetic datasets show that our pre-computed look-up table technique can significantly speed up the search process.
Year: 2018 PMID: 30315195 PMCID: PMC6185942 DOI: 10.1038/s41598-018-33493-w
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
This table shows an example of a GWAS dataset.
| | SNP1 | SNP2 | … | SNP | Status |
|---|---|---|---|---|---|
| Sample1 | 1 | 1 | … | 1 | T |
| Sample2 | 2 | 1 | … | 2 | F |
| … | … | … | … | … | … |
| Sample | 3 | 2 | … | 2 | F |
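As a rough illustration of what an exhaustive pairwise search iterates over, the sketch below tallies joint genotype counts for every SNP pair in a toy dataset. The three rows copy only the visible cells of the example table; the names `samples` and `pair_counts`, and the reading of Status `T` as "case", are assumptions made for this example. It is not the accelerated search described in the abstract.

```python
from collections import Counter
from itertools import combinations

# Toy rows mirroring the visible cells of the example table:
# (encoded genotypes for three SNP columns, status).
# Assumption: Status "T" means case (True), "F" means control (False).
samples = [
    ([1, 1, 1], True),
    ([2, 1, 2], False),
    ([3, 2, 2], False),
]


def pair_counts(samples, i, j):
    """Count samples per (genotype of SNP i, genotype of SNP j, status)."""
    counts = Counter()
    for genotypes, is_case in samples:
        counts[(genotypes[i], genotypes[j], is_case)] += 1
    return counts


# Enumerate every unordered SNP pair; this is the quadratic search space
# that makes exhaustive analysis expensive on real GWAS data.
n_snps = len(samples[0][0])
all_pairs = list(combinations(range(n_snps), 2))
tables = {pair: pair_counts(samples, *pair) for pair in all_pairs}
```

With m SNPs there are m(m − 1)/2 pairs, which is why the number of tables grows so quickly on genome-wide data.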
This table shows the encoding scheme for SNP genotype.
| Original Genotype | Encode Value |
|---|---|
| Missing Data | 0 |
| Major Allele, Major Allele | 1 |
| Major Allele, Minor Allele | 2 |
| Minor Allele, Minor Allele | 3 |
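The encoding above can be sketched as a small function. The function name `encode_genotype`, the two-character string representation of a genotype, and the choice to treat any unrecognised allele as missing data are assumptions made for illustration.

```python
def encode_genotype(genotype, major, minor):
    """Map a two-allele genotype to the 0-3 code from the encoding table.

    genotype: two-character string such as "AG", or None if missing
    major, minor: the major and minor allele symbols for this SNP
    """
    # Missing or malformed calls are encoded as 0.
    if genotype is None or len(genotype) != 2:
        return 0
    # Assumption: an allele that is neither major nor minor counts as missing.
    if any(allele not in (major, minor) for allele in genotype):
        return 0
    # 0 minor alleles -> 1, 1 minor allele -> 2, 2 minor alleles -> 3.
    n_minor = sum(1 for allele in genotype if allele == minor)
    return n_minor + 1
```

Allele order does not matter here, so `"AG"` and `"GA"` receive the same code.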
Figure 1. The table shows the distribution of cases and controls across the genotypes of SNP rs3130048 in dataset phs000019.v1.p1. The 1 d.f. chi-square p-value and odds ratio of rs3130048 are 3.96 × 10⁻⁶ and 1.6021 respectively.
Figure 2. The table shows the distribution of cases and controls across the genotypes of SNP rs3132486 in dataset phs000019.v1.p1. The 1 d.f. chi-square p-value and odds ratio of rs3132486 are 2.06 × 10⁻⁵ and 1.6810 respectively.
Figure 3. The table shows the distribution of cases and controls across the genotypes of the combination of SNPs rs3132486 and rs3130048 in dataset phs000019.v1.p1. The 1 d.f. chi-square p-value and odds ratio of the combination of rs3132486 and rs3130048 are 2.52 × 10⁻¹⁴ and 2.2719 respectively.
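For readers unfamiliar with the two statistics quoted in these captions, here is a minimal sketch of a 1 d.f. Pearson chi-square test and an odds ratio on a 2×2 case/control table. It assumes the genotypes have already been collapsed into "has the genotype (or genotype combination)" versus "does not", and it applies no continuity correction; how the study itself collapses genotypes is not stated here. The counts are hypothetical and are not taken from the datasets.

```python
import math


def chi_square_1df(a, b, c, d):
    """Pearson chi-square (no continuity correction) for a 2x2 table.

    a, b: cases with / without the genotype (or genotype combination)
    c, d: controls with / without it
    Returns (statistic, p_value).
    """
    n = a + b + c + d
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    if denom == 0:
        raise ValueError("a row or column total is zero")
    stat = n * (a * d - b * c) ** 2 / denom
    # For 1 d.f., P(X > stat) = erfc(sqrt(stat / 2)).
    p_value = math.erfc(math.sqrt(stat / 2.0))
    return stat, p_value


def odds_ratio(a, b, c, d):
    """Odds of carrying the genotype among cases divided by odds among controls."""
    if b == 0 or c == 0:
        raise ValueError("odds ratio is undefined when b or c is zero")
    return (a * d) / (b * c)


# Hypothetical counts, for illustration only.
a, b, c, d = 30, 10, 20, 40
stat, p_value = chi_square_1df(a, b, c, d)
or_value = odds_ratio(a, b, c, d)
```

Comparing the p-value of a SNP pair against the p-values of its two individual SNPs, as the three captions do, is one way to see whether the combination carries more signal than either SNP alone.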
Figure 4. This figure shows a gene network, drawn with a circular layout, constructed from the 60 common gene-gene interactions predicted from the top 10000 SNP-SNP interactions in terms of improvement metric value found in datasets phs000019.v1.p1 and phs000982.v1.p1. Genes already reported in the existing literature to be associated with psoriasis are coloured grey. If a gene-gene interaction is supported by a database, a thickened edge with a specific colour is added to the network.
Figure 5. Part (a) shows the bio-molecule interaction mechanism behind a second-order SNP-SNP interaction [14] in which SNP1 and SNP2 both have genotype (major, minor). Part (b) shows the eight biologically plausible second-order genotype interaction patterns and their corresponding disease-associated complexes. Major alleles are represented by upper-case letters (i.e. A, B) and minor alleles by lower-case letters (i.e. a, b).