| Literature DB >> 34440333 |
Atsuko Okazaki1,2, Sukanya Horpaopan3, Qingrun Zhang4, Matthew Randesi5, Jurg Ott2.
Abstract
Some genetic diseases ("digenic traits") are due to the interaction between two DNA variants, which presumably reflects biochemical interactions. For example, certain forms of Retinitis Pigmentosa, a type of blindness, occur in the presence of two mutant variants, one each in the ROM1 and RDS genes, while the occurrence of only one such variant results in a normal phenotype. Detecting variant pairs underlying digenic traits by standard genetic methods is difficult and is downright impossible when individual variants alone have minimal effects. Frequent pattern mining (FPM) methods are known to detect patterns of items. We make use of FPM approaches to find pairs of genotypes (from different variants) that can discriminate between cases and controls. Our method is based on genotype patterns of length two, and permutation testing allows assigning p-values to genotype patterns, where the null hypothesis refers to equal pattern frequencies in cases and controls. We compare different interaction search approaches and their properties on the basis of published datasets. Our implementation of FPM to case-control studies is freely available.Entities:
Keywords: digenic traits; diplotype; genotype pattern; pattern mining
Mesh:
Substances:
Year: 2021 PMID: 34440333 PMCID: PMC8391494 DOI: 10.3390/genes12081160
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Partitioning of chi-square for the two genotypes in the most significant genotype pattern for the Opioid dataset. The interaction effect is more significant (smallest p-value) than either of the two main effects.
| Source | Chi-Square | df |
|
|---|---|---|---|
| rs1918760 main | 2.329 | 2 | 0.3121 |
| rs6136667 main | 7.388 | 2 | 0.0249 |
| interaction | 12.592 | 4 | 0.0135 |
| Total table | 22.309 | 8 | 0.0002 |
Bivariate distribution of genotypes in the best genotype pattern for the Opioid dataset, separately for cases and controls. As shown in bold, the unique difference between cases and controls is the presence of pattern (2,2) in cases and its complete absence controls. The numbers of the other eight genotype patterns look comparable in cases and controls, so the effect of the (2,2) pattern could be “diluted” when contingency tables rather than genotype patterns are compared between cases and controls.
| rs136667 Genotypes | |||
|---|---|---|---|
| rs1918760 Genotypes |
|
|
|
|
| |||
| 1 | 0 | 1 | 4 |
| 2 | 1 |
| 39 |
| 3 | 1 | 16 | 65 |
|
| |||
| 1 | 0 | 1 | 4 |
| 2 | 1 |
| 45 |
| 3 | 1 | 15 | 86 |