Literature DB >> 24572353

Detection and replication of epistasis influencing transcription in humans.

Gibran Hemani¹, Konstantin Shakhbazov¹, Harm-Jan Westra², Tonu Esko³, Anjali K Henders⁴, Allan F McRae¹, Jian Yang⁵, Greg Gibson⁶, Nicholas G Martin⁴, Andres Metspalu⁷, Lude Franke², Grant W Montgomery⁸, Peter M Visscher⁹, Joseph E Powell⁹.

Abstract

Epistasis is the phenomenon whereby one polymorphism's effect on a trait depends on other polymorphisms present in the genome. The extent to which epistasis influences complex traits and contributes to their variation is a fundamental question in evolution and human genetics. Although often demonstrated in artificial gene manipulation studies in model organisms, and some examples have been reported in other species, few examples exist for epistasis among natural polymorphisms in human traits. Its absence from empirical findings may simply be due to low incidence in the genetic control of complex traits, but an alternative view is that it has previously been too technically challenging to detect owing to statistical and computational issues. Here we show, using advanced computation and a gene expression study design, that many instances of epistasis are found between common single nucleotide polymorphisms (SNPs). In a cohort of 846 individuals with 7,339 gene expression levels measured in peripheral blood, we found 501 significant pairwise interactions between common SNPs influencing the expression of 238 genes (P < 2.91 × 10(-16)). Replication of these interactions in two independent data sets showed both concordance of direction of epistatic effects (P = 5.56 × 10(-31)) and enrichment of interaction P values, with 30 being significant at a conservative threshold of P < 9.98 × 10(-5). Forty-four of the genetic interactions are located within 5 megabases of regions of known physical chromosome interactions (P = 1.8 × 10(-10)). Epistatic networks of three SNPs or more influence the expression levels of 129 genes, whereby one cis-acting SNP is modulated by several trans-acting SNPs. For example, MBNL1 is influenced by an additive effect at rs13069559, which itself is masked by trans-SNPs on 14 different chromosomes, with nearly identical genotype-phenotype maps for each cis-trans interaction. This study presents the first evidence, to our knowledge, for many instances of segregating common polymorphisms interacting to influence human traits.

Entities: Chemical

Mesh：

Year: 2014 PMID： 24572353 PMCID： PMC3984375 DOI： 10.1038/nature13005

Source DB: PubMed Journal: Nature ISSN： 0028-0836 Impact factor: 49.962

In the genetic analysis of complex traits it is usual for SNP effects to be estimated using an additive model where they are assumed to contribute independently and cumulatively to the mean of a trait. This framework has been successful in identifying thousands of associations[14]. But to date, though its contribution to phenotypic variance is frequently the subject of debate[1-3], there is little empirical exploration of the role that epistasis plays in the architecture of complex traits in humans[7,8]. Beyond the prism of human association studies there is evidence for epistasis, not only at the molecular scale from artificially induced mutations[4] but also at the evolutionary scale in fitness adaptation[15] and speciation[16]. Methods are now available to overcome the computational problems involved in searching for epistasis, but its detection still remains problematic due to reduced statistical power. For example, increased dependence on linkage disequilibrium (LD) between causal SNPs and observed SNPs[17,18], increased model complexity in fitting interaction terms[19], and more extreme significance thresholds to account for increased multiple testing[9] all make it more difficult to detect epistasis in comparison to additive effects. Thus, with small genetic effect sizes, as is expected in most complex traits of interest[14], the power to detect epistasis diminishes rapidly. There are two simple ways to overcome this problem. One is by using extremely large sample sizes[20]; another is by analysing traits that are likely to have large effect sizes among common variants. Because our focus was to ascertain the extent to which instances of epistasis arises from natural genetic variation we designed a study around the latter approach and searched for epistatic genetic effects that influence gene expression levels. Transcription levels can be measured for thousands of genes and like most complex diseases, these expression traits are typically heritable[21]. But unlike complex diseases, genetic associations with gene expression commonly have very large effect sizes that explain large proportions of the genetic variance[22], making them good candidates to search for epistasis, should it exist. In our discovery dataset (Brisbane Systems Genetics Study, BSGS[23]) of 846 individuals genotyped at 528,509 SNPs, we used a two stage approach to identify genetic interactions. First, we exhaustively test every pair of SNPs for pairwise effects against each of 7339 expression traits in peripheral blood (1.03 × 1015 statistical tests, family-wise error rate of 5% corresponding to a significance threshold of p < 2.91 × 10−16, Methods). Second, we filtered the SNP pairs from stage 1 on LD and genotype class counts, and tested the remaining pairwise effects for significant interaction terms and used a Bonferroni correction for multiple testing (estimated type 1 error rate 0.05 ≤ α ≤ 0.14, Methods, Supplementary Figure S1). Using this design we identified 501 putative genetic interactions influencing the expression levels of 238 genes (Supplementary Table S1). We used strict quality control measures to avoid statistical associations being driven by technical artifacts (Methods). However it remains possible that unexplained technical artifacts may have led to the significant discovery interactions. Of the 501 discovery interactions, 434 had available data and passed filtering (Methods) in two independent replication datasets, Fehrmann[12] and the Estonian Genomics Centre University of Tartu (EGCUT)[11], in which we saw convincing evidence for replication. We used the summary statistics from the replication datasets to perform a meta analysis to obtain an independent p-value for the putative interactions, and 30 were significant after applying a Bonferroni correction for multiple testing (5% significance threshold p < 0.05/501, Table 1). To quantify the similarity of GP maps between the independent datasets (Figure 1) we decomposed the genetic effects of each of the SNP pairs into orthogonal additive, dominance and epistatic effects (A1, A2, D1, D2, A1 × A2, A1 × D2, D1 × A2, D1 × D2) and tested for concordance of the sign of the most significant effect (Supplementary Table S3, Methods). Sign concordance between the discovery and both replication datasets was observed in 22 out of the 30 significantly replicated interactions (expected value = 7.5 under the null hypothesis of no interactions, p = 3.76 × 10−8).

Table 1

Epistatic interactions significant at the Bonferroni level in two replication sets

	Gene (chr.)	SNP 1 (chr.)	SNP 2 (chr.)	BSGS2	Fehrmann3	EGCUT3	Meta4
1	ADK (10)	rs2395095 (10)	rs10824092 (10)	6.691	18.331	21.211	39.821
2	ATP13A1 (19)	rs4284750 (19)	rs873870 (19)	5.30	12.18	3.25	14.23
3	C21ORF57 (21)	rs9978658 (21)	rs11701361 (21)	9.42	6.08	16.36	21.67
4	CSTB (21)	rs9979356 (21)	rs3761385 (21)	11.99	25.20	16.72	42.27
5	CTSC (11)	rs7930237 (11)	rs556895 (11)	7.16	18.76	15.06	33.53
6	FN3KRP (17)	rs898095 (17)	rs9892064 (17)	16.16	28.24	29.39	59.95
7	GAA (17)	rs11150847 (17)	rs12602462 (17)	13.91	19.98	12.99	32.60
8	HNRPH1 (5)	rs6894268 (5)	rs4700810 (5)	15.38	8.55	3.01	10.37
9	LAX1 (1)	rs1891432 (1)	rs10900520 (1)	19.16	18.60	11.22	29.24
10	MBNL1 (3)	rs16864367 (3)	rs13079208 (3)	13.49	16.25	24.74	41.56
11	MBNL1 (3)	rs7710738 (5)	rs13069559 (3)	7.92	2.55	7.89	9.28
12	MBNL1 (3)	rs2030926 (6)	rs13069559 (3)	7.10	0.91	5.80	5.53
13	MBNL1 (3)	rs2614467 (14)	rs13069559 (3)	5.74	4.13	2.22	5.30
14	MBNL1 (3)	rs218671 (17)	rs13069559 (3)	7.63	0.62	5.82	5.23
15	MBNL1 (3)	rs11981513 (7)	rs13069559 (3)	7.71	0.43	5.36	4.58
16	MBP (18)	rs8092433 (18)	rs4890876 (18)	5.40	7.06	21.91	28.73
17	NAPRT1 (8)	rs2123758 (8)	rs3889129 (8)	8.45	15.12	16.08	30.77
18	NCL (2)	rs7563453 (2)	rs4973397 (2)	7.31	7.51	6.33	12.70
19	PRMT2 (21)	rs2839372 (21)	rs11701058 (21)	4.81	0.69	4.47	4.06
20	RPL13 (16)	rs352935 (16)	rs2965817 (16)	4.98	3.79	14.41	17.24
21	SNORD14A (11)	rs2634462 (11)	rs6486334 (11)	7.31	13.11	10.96	23.22
22	TMEM149 (19)	rs807491 (19)	rs7254601 (19)	12.16	81.55	45.78	145.78
23	TMEM149 (19)	rs8106959 (19)	rs6926382 (6)	5.80	3.06	8.80	10.72
24	TMEM149 (19)	rs8106959 (19)	rs914940 (1)	6.22	3.36	6.96	9.20
25	TMEM149 (19)	rs8106959 (19)	rs2351458 (4)	7.30	0.04	9.61	8.00
26	TMEM149 (19)	rs8106959 (19)	rs6718480 (2)	8.55	3.31	5.15	7.36
27	TMEM149 (19)	rs8106959 (19)	rs1843357 (8)	6.21	3.72	3.33	6.00
28	TMEM149 (19)	rs8106959 (19)	rs9509428 (13)	9.44	0.10	5.75	4.47
29	TRA2A (7)	rs7776572 (7)	rs11770192 (7)	8.23	3.19	1.89	4.09
30	VASP (19)	rs1264226 (19)	rs2276470 (19)	5.09	0.94	5.14	4.95

−log10 p-values for 4 d.f. interaction tests

Discovery dataset

Independent replication dataset

Meta analysis of interaction terms between replication datasets only

Figure 1

Replication of GP maps in two independent populations

The GP maps for each epistatic interaction that is significant at the Bonferroni level in both replication datasets are shown. Each GP map consists of nine tiles where each tile represents the expression level for that two-locus genotype class. Phenotypes are for gene transcript levels (dark coloured tiles = high expression, light coloured tiles = low expression). Columns of GP maps are for each independent dataset. Rows of GP maps are for each of 30 significantly replicated interactions at the Bonferroni level, corresponding to the rows in Table 1. There is a clear trend of the GP maps replicating across all three datasets.

In addition, using the meta analysis from the replication samples only, we observed that 316 of the remaining 404 discovery SNP pairs had replication interaction p-values more extreme than the 2.5% confidence interval of the quantile-quantile plot against the null hypothesis of no interactions where p-values are assumed to be uniformly distributed (p ≪ 1.0×10−16, Figure 2 and Supplementary Figure S2). Concordance of the direction of the effect of the largest variance component was also highly significant (p = 5.71 × 10−31, Supplementary Table S3). The congruence of the epistatic networks in discovery and replication datasets is shown in Figure 3, demonstrating that these complex genetic patterns are common even across independent datasets. A further replication was attempted using the Centre for Health Discovery and Wellbeing (CHDWB) dataset[24], but only 20 of the SNP pairs passed filtering because the sample size was small (n = 139), and likely due to insufficient power we found no evidence for replication (Supplementary Figure S6). It should be noted that although it is a necessary step to establish the veracity of the interactions from the discovery set, replication of epistatic effects in independent samples is difficult in practice due to LD (Methods).

Figure 2

Q-Q plots of interaction p-values from replication datasets

The top panel shows all 434 discovery SNPs that were tested for interactions. Observed p-values (y-axis, −log10 scale) are plotted against the expected p-values (x-axis, −log10 scale). The multiple testing correction threshold for significance following Bonferroni correction is denoted by a dotted line. The bottom panel shows the same data as the top panel but excluding the 30 interactions that were significant at the Bonferroni level in the replication datasets. The shaded grey area represents the 5% confidence interval for the expected distribution of p-values. Dark blue points represent p-values that exceed the confidence interval, light blue are within the confidence interval.

Figure 3

Discovery and replication of epistatic networks

All 434 putative genetic interactions (edges) with data common to discovery and replication sets is shown, where black nodes represent SNPs and red nodes represent traits (gene expression probes). Three hundred and forty-five interactions had p-values exceeding the 2.5% confidence interval following meta analysis of the replication data The remaining 89 interactions that did not replicate are depicted in grey. It is evident that a large proportion of the complex networks identified in the discovery set also exist in independent populations. An interactive version of this graph can be found here: http://kn3in.github.io/detecting_epi/

Though seldom the focus of association studies, SNPs with known main effects are often tested for A×A genetic interactions[9], but our analysis suggests this is unlikely to be the best strategy for its detection. The majority of our discovery interactions comprised of one SNP that was significantly associated with the gene expression level in the discovery dataset, and one SNP that had no previous association[22] (439 out of 501, Methods). Only nine interactions were between SNPs that both had known main effects while 64 were between SNPs that had no known main effects. Additionally, we observed that the largest epistatic variance component for the 501 interactions was equally divided amongst A × A, A × D, D × A and D × D at the discovery stage (p = 0.22 for departure from expectation). This is not surprising because these patterns of epistasis used for statistical decomposition are simply convenient orthogonal parameterisations of a two locus model, and are not intended to model biological function[25]. Of the discovery interactions, 26 were cis-cis acting (within 1Mb of the transcription start site, mean distance between SNPs was 0.53Mb), 462 were cis-trans-acting, and 13 were trans-trans-acting. We observed a wide range of significant GP maps (Figure 1) but the most common pattern of epistasis that we detected involved a trans-SNP masking the effect of an additive cis-SNP. For example, MBNL1 (involved in RNA modification and regulation of splicing[26]) has a cis effect at rs13069559 which in turn is controlled by 13 trans-SNPs and one cis-SNP that each exhibit a masking pattern, such that when the trans-SNP is homozygous for the masking allele the decreasing allele of the cis-SNP no longer has an effect (Supplementary Figure S10). Each of these interactions has evidence for replication in at least one dataset and six are significantly replicated at the Bonferroni level (Supplementary Figure S3). We see similar epistatic networks involving multiple (eight or more) trans-acting SNPs for other gene expression levels too, for example TMEM149 (Supplementary Figure S11), NAPRT1 (Supplementary Figure S12), TRAPPC5 (Supplementary Figure S13), and CAST (Supplementary Figure S14). We observed that from pedigree analysis these five gene expression phenotypes had non-additive variance component estimates within the 95th percentile of the 17,994 gene expression phenotypes that were analysed previously[22] (Supplementary Table S2, Methods). In total the 501 interactions comprised 781 unique SNPs, which we analysed for functional enrichment (Methods). We tested the SNPs for cell-type specific overlap with transcriptionally active chromatin regions, tagged by histone-3-lysine-4,tri-methylation (H3K4me3) chromatin marks, in 34 cell types[27] (Supplementary Figure S5). There was significant enrichment for cis-acting SNPs in haematopoietic cell types only (p < 1 × 10−4 for the three tissues with the strongest enrichment after adjusting for multiple testing). However trans-acting SNPs did not show any tissue specific enrichment (p > 0.1 for all tissues). This difference between cis and trans SNPs suggests different roles in epistatic interactions where tissue specificity is provided by the cis SNPs. There is also enrichment for cis-SNPs to be localised in regions with regulatory genomic features as measured by chromatin states[28] (Supplementary Figure S4). We also demonstrate physical organisation of interacting loci within the cell, suggesting a mechanism by which biological function can lead to epistatic genetic variance. It has been shown that different chromosomal regions spatially colocalise in the cell through chromatin interactions[13]. We cross-referenced our epistatic SNPs with a map of chromosome interacting regions (n = 96, 139) in K562 blood cell lines[29] (Methods) and found that 44 epistatic interactions mapped to within 5Mb (p < 1.8 × 10−10), (Supplementary Figure S15). Interaction of distant loci may occur through physical proximity in transcriptional factories that organise across different chromosome regions and can regulate transcription of related genes[30]. Quantifying the importance of epistasis in complex traits in humans remains an open question. Here we are able to identify 238 gene expression traits with at least one significant interaction given our experiment-wide threshold, where the minimum estimated variance explained by the epistatic effects of any interaction was 2.1% of phenotypic variance. Taking results from our previously published eQTL[23] we calculated that 1848 of the 7339 gene expression levels analysed were influenced by additive effects where the estimated additive variance of a locus was 2.1% or greater. Thus, we can infer that the number of instances of large additive effects is significantly greater than the number of instances of large epistatic effects. In terms of their contribution to complex traits a more important metric might be the proportion of the variance that the epistatic loci explain[2]. Taking all additive effects detected in Powell et al (2012) that have additive variance explaining 2.1% or greater of phenotypic variance, we calculated that the proportion of total phenotypic variance of all 7339 gene expression levels explained by additive effects alone was 2.16%. By contrast, the estimated epistatic variance from the interacting SNPs detected in this study on average explain a total of 0.22% of phenotypic variance, approximately ten times lower than the estimated additive variance. There are several caveats to this comparison which we discuss in the Methods. Overall, we have demonstrated that it is possible to identify and replicate epistasis in complex traits amongst common human variants, despite the relative contribution of pairwise epistasis to phenotypic variation being small. The bioinformatic analysis of the significant epistatic loci suggests that there are a large number of possible mechanisms that can lead to non-additive genetic variation. Further research into such epistatic effects may provide a useful framework for understanding molecular mechanisms and complex trait variation in greater detail. With computational techniques and data now widely available the search for epistasis in larger datasets for traits of broader interest is warranted.

Online methods

1 Discovery data

1.1 Data description

The Brisbane Systems Genetics Study (BSGS) comprises 846 individuals of European descent from 274 independent families[23]. DNA samples from each individual were genotyped on the Illumina 610-Quad Beadchip by the Scientific Services Division at deCODE Genetics Iceland. Full details of genotyping procedures are given in Medland et al.[31] Standard quality control (QC) filters were applied and the remaining 528,509 autosomal SNPs were carried forward for further analysis. Gene expression profiles were generated from peripheral blood collected with PAXgene TM tubes (QIAGEN, Valencia, CA) using Illumina HT12-v4.0 bead arrays. The Illumina HT-12 v4.0 chip contains 47,323 probes, although some probes are not assigned to RefSeq genes. We removed any probes that did not match the following criteria: contained a SNP within the probe sequence with MAF > 0.05 within 1000 genomes data; did not map to a listed RefSeq gene; were not significantly expressed (based on a detection p-value < 0.05) in at least 90% of samples. After this stringent QC 7339 probes remained for 2D-eQTL mapping. These data are accessible through GEO Series accession number GSE53195.

1.2 Normalisation

Gene expression profiles were normalised and adjusted for batch and polygenic effects. Profiles were first adjusted for raw background expression in each sample. Expression levels were then adjusted using quantile and log2 transformation to standardise distributions between samples. Batch and polygenic effects were adjusted using the linear model where μ is the population mean expression levels, c, p, s and a are vectors of chip, chip position, sex and generation respectively, fitted as fixed effects; and g is a random additive polygenic effect with a variance covariance matrix The parameter is the variance component for additive background genetic. Here, we are using family based pedigree information rather than SNP based IBD to account for relationships between individuals and so ϕ is the kinship coefficient between individuals j and k. The residual, e, from equation 1 is assumed to follow a multivariate normal distribution with a mean of zero. Residuals were normalised by rank transformation and used as the adjusted phenotype for the pairwise epistasis scan to remove any skewness and avoid results being driven by outliers. The GenABEL package for R was used to perform the normalisation[32].

2 Exhaustive 2D-eQTL analysis

2.1 Two stage search

We used epiGPU[10] software to perform an exhaustive scan for pairwise interactions, such that each SNP is tested against all other SNPs for statistical association with the expression values for each of the 7339 probes. This uses the massively parallel computational architecture of graphical processing units (GPUs) to speed up the exhaustive search. For each SNP pair there are 9 possible genotype classes. We treat each genotype class as a fixed effect and fit an 8 d.f. F-test to test the following hypotheses: where μ is the mean expression level and x is the pairwise genotype class mean for genotype i at SNP 1 and genotype j at SNP 2. This type of test does not parameterize for specific types of epistasis, rather it tests for the joint genetic effects at two loci. This has been demonstrated to be statistically more efficient when searching for a wide range of epistatic patterns, although will also include any marginal effects of SNPs which must be dealt with post-hoc[18].

2.1.1 Stage 1

The complete exhaustive scan for 7339 probes comprises 1.03 × 1015 F-tests. We used permutation analysis to estimate an appropriate significance threshold for the study. To do this we performed a further 1600 exhaustive 2D scans on permuted phenotypes to generate a null distribution of the extreme p-values expected to be obtained from this number of multiple tests given the correlation structure between the SNPs. We took the most extreme p-value from each of the 1600 scans and set the 5% FWER to be the 95% most extreme of these p-values, T* = 2.13 × 10−12. The effective number of tests in one 2D scan being performed is therefore N* = 0.05/T* ≈ 2.33 × 1010. To correct for the testing of multiple traits we established an experiment wide threshold of T = 0.05/(N* × 7339) = 2.91 × 10−16. This is likely to be conservative as it assumes independence between probes.

Filtering

We used two approaches to filter SNPs from stage 1 to be tested for significant interaction effects in stage 2.

Filter 1

After keeping SNP pairs that surpassed the 2.91 × 10−16 threshold in stage 1 only SNP pairs with at least 5 data points in all 9 genotype classes were kept. We then calculated the LD between interacting SNPs (amongst unrelated individuals within the discovery sample and also from 1000 genomes data) and removed any pairs with r2 > 0.1 or D′2 > 0.1 to avoid the inclusion of haplotype effects and to increase the accuracy of genetic variance decomposition. If multiple SNP pairs were present on the same chromosomes for a particular expression trait then only the sentinel SNP pair was retained, i.e. if a probe had multiple SNP pairs that were on chromosomes one and two then only the SNP pair with the most significant p-value was retained. At this stage 6404 filtered SNP pairs remained.

Filter 2

We also performed a second filtering screen applied to the list of SNP pairs from stage 1 that was identical to filter 1 but an additional step was included where any SNPs that had previously been shown to have a significant additive or dominant effect (p < 1.29×10−11) were removed[22], creating a second set of 4751 unique filtered SNP pairs.

2.1.2 Stage 2

To ensure that interacting SNPs were driven by epistasis and not marginal effects we performed a nested ANOVA on each pair in the filtered set to test if the interaction terms were significant. We did this by contrasting the full genetic model (8 d.f.) against the reduced marginal effects model which included the additive and dominance terms at both SNPs (4 d.f.). Thus, a 4 d.f. F-test was performed on the residual genetic variation, representing the contribution of epistatic variance. Significance of epistasis was determined using a Bonferroni threshold of 0.05/(6404+4751) = 4.48×10−6. This resulted in 406 and 95 SNP pairs with significant interaction terms from filters 1 and 2, respectively.

2.2 Type 1 error rate

Using a Bonferroni correction of 0.05 in the second stage of the two stage discovery scan implies a type 1 error rate of α = 0.05. However, this could be underestimated because the number tests performed in the second stage depends on the number of tests in the first stage, and this depends on statistical power and model choice. We performed simulations to estimate the type 1 error rate of this study design. We assumed a null model where there was one true additive effect and 7 other terms with no effect. To simulate a test statistic we simulated 8 z-scores, and z2..8 ~ N(0, 1). Thus (representing the 8 d.f. test) and (representing the 4 d.f. test where the null hypothesis of no epistasis is true). For a particular value of NCP we simulated 100,000 z values, and calculated the p-value for the z test statistic. The n test statistics with p < 2.31 × 10−16 were kept for the second stage, where the type 1 error rate of stage 2 was calculated as the proportion of p < 0.05/n. The power at stage 1 was calculated as n/100, 000. This procedure was performed for a range of NCP parameters that represented power ranging from ~ 0 to ~ 1.

2.3 Population stratification

We ruled out population stratification as a possible cause of inflated test statistics. To test for cryptic relatedness driving the interaction terms we tested for increased LD among the SNPs[33]. We calculated the mean of the off-diagonal elements of the correlation matrix of all unique SNPs from the 501 interactions (731 SNPs) using only unrelated individuals, . This is not significantly different from the null hypothesis of zero (sampling error = 1/nunrelated = 0.0039).

2.4 Probe mapping

To avoid possibility that epistatic signals might arise due to expression probes hybridising in multiple locations we verified that probe sequences for genes with significant interactions mapped to only a single location. As an initial verification we performed a BLAST search of the full probe sequence against 1000 genomes phase 1 version 3 human genome reference and ensured that only one genomic location aligned significantly (p < 0.05). As a second step, to mitigate the possibility of weak hybridisation elsewhere in the genome we divided the probe sequence into three sections (1–25bp, 13–37bp, 26–50bp) and performed a BLAST search of these probe sequence fragments. No probe sequemces or probe sequence fragments mapped to positions other than the single expected genomic target (p < 0.05).

3 Replication

3.1 Data description

We attempted replication of the 501 significant interactions from the discovery set using three independent cohorts; Fehrmann, EGCUT, and CHDWB. It was required that LD r2 < 0.1 and D′2 < 0.1 between interacting SNPs (as measured in the replication sample directly), and all nine genotype classes had at least 5 individuals present in order to proceed with statistical testing for replication in both datasets. We also excluded any putative SNPs that had discordant allele frequencies in any of the datasets. Details of the cohorts are as follows.

Fehrmann

n = 1240 The Fehrmann dataset[12] consists of peripheral blood samples of 1240 unrelated individuals from the United Kingdom and the Netherlands. Some of these individuals are patients, while others are healthy controls. Individuals were genotyped using the Illumina HumanHap300, Illumina Human-Hap370CNV, and Illumina 610 Quad platforms. RNA levels were quantified using the Illumina HT-12 V3.0 platform. These data are accessible through GEO Series accession numbers GSE20332 and GSE20142.

EGCUT

n = 891 The Estonian Genome Center of the University of Tartu (EGCUT) study[11] consists of peripheral blood samples of 891 unrelated individuals from Estonia. They were genotyped using the Illumina HumanHap370CNV platform. RNA levels were quantified using the Illumina HT-12 V3.0 platform. These data are accessible through GEO Series accession number GSE48348.

CDHWB

n = 139 The Center for Health Discovery and Well Being (CD-HWB) Study[24] is a population based cohort consisting of 139 individuals of European descent collected in Atlanta USA. Gene expression profiles were generated with Illumina HT-12 V3.0 arrays from peripheral blood collected from Tempus tubes that preserve RNA. Whole genome genotypes were measured using Illumina OmniQuad arrays. Due to the small sample size, most SNP pairs did not pass filtering in this dataset (20 SNP pairs remained) and so we have excluded it from the rest of the analysis.

3.2 Meta Analysis

The 4 d.f. interaction p-values for each independent replication dataset were calculated using the same statistical test as was performed in the discovery dataset. We then took the interaction p-values from EGCUT and Fehrmann and calculated a joint p-value using Fisher’s method of combining p-values for a meta analysis as . As in the discovery analysis, all gene expression levels were normalised using rank transformation to avoid skew or outliers in the distribution[34].

3.3 Concordance of direction of effects

We used four methods to calculate the concordance of the direction of effects between the discovery and replication datasets.

Test 1

Is the most significant epistatic effect in the discovery set in the same direction as the same epistatic effect in the replication sets? We decomposed the genetic variance into 8 orthogonal effects, four of which are epistatic (A×A, A × D, D × A, D × D). The sign of the epistatic effect that had the largest variance in the discovery was recorded, and then was compared to the same epistatic effect in the two replication datasets (regardless of whether or not the same epistatic effect was the largest in the replication datasets). The probability of the sign being the same in one dataset is 1/2. The probability of the sign being the same in two is 1/4.

Test 2

Is the most significant epistatic effect in the discovery the same as the largest epistatic effect in the replication set with the sign being concordant. As in Test 1, but this time we required that the largest effect was the same in the discovery and the replication, and that they had the same sign (e.g. if the largest effect in the discovery is A×A, with a positive effect, then concordance is achieved if the same is true in the replication). The probability of one replication dataset being concordant by chance is 1/8, and concordance in both is 1/64.

Test 3

Do the epistatic effects that are significant at nominal p < 0.05 in the discovery have the same direction of effect as in the replication? Here we count all the epistatic variance components in the discovery that have p < 0.05 (1133 amongst the 434 discovery SNP pairs, i.e. each SNP pair has at least 1 and at most 4 significant epistatic variance components). Then we compare the direction of the effect in the replication dataset. The probability of the sign being the same in one dataset for any one significant effect is 1/2. The probability of the sign being the same in two is 1/4.

Test 4

If we count how many of the 4 epistatic effects are concordant between the discovery and replication data for each interaction then is this significant from what we expect by chance? There can be either 0, 1, 2, 3 or 4 concordant signs at each interaction, each with expectation of p = 1/16, 4/16, 6/16, 4/16, 1/16 under the null, respectively. Observed counts are multinomially distributed, and we tested if the observed proportions were statistically different from the expected proportions using an approximation of the multinomial test[35]. The probability of observing the number of concordant signs in tests 1–3 is calculated using a binomial test. All variance decompositions were calculated using the NOIA method[36].

4 Effects of LD on detection and replication

The power to detect genetic effects, when the observed markers are in LD with the causal variants, is proportional to r. For additive effects x = 2, but for non-additive effects x is larger, i.e. x = 4 for dominance or A × A, x = 6 for A × D or D × A, and x = 8 for D × D. Many biologically realistic GP maps may be comprised of all 8 variance components[18]. This is important for both detection and for replication of epistasis. For detection, if the epistatic effect includes the D × D term then if the two causal variants are tagged by observed markers that are each in LD r = 0.9, then if the true variance is V then the observed variance V at the markers will be 0.98V = 0.43V. Therefore, it is important to consider the sampling variation of r̂ in a sample given some true population value of r.

4.1 Simulation 1

For some values of fixed population parameters, p1 (minor allele frequency at observed marker), q1 (minor allele frequency at causal variant), and r (LD between marker and causal variant), the expected haplotype frequencies are where p2 = 1 − p1 and q2 = 1 − q1. For a range of population parameters we randomly sampled 2n haplotypes where the expected haplotype frequencies were h11, h12, h21, h22. From the sample haplotype frequencies we then calculated sample estimates of r̂ where For each value of combination of the parameters p1, q1, r, n 1000 simulations were performed and the sampling mean and sampling standard deviation of r̂, r̂2, r̂4, r̂6, r̂8 were recorded. It was observed that sampling variance increases for increasing x in r̂.

4.2 Simulation 2

We assume that the discovery SNP pairs are ascertained (from a very large number of tests) have high r̂ between observed SNPs and causal variants because otherwise power of detection would be low. We can hypothesis that the distribution of r̂ in this ascertained sample will be a mixture of r that is high and r that is lower but with ascertained higher values from sampling. Therefore, we would expect those with truly high r to have a higher replication rate in independent datasets, and those with ascertained high r̂ to have lower replication because resampling is unlikely to result in the same extreme ascertainment. To obtain empirical estimates of r̂ in discovery and replication datasets we conducted the following simulation. Using 1000 genomes data (phase 1, version 3, 379 European samples) we selected the 528,509 “markers” used in the original discovery analysis, plus 100,000 randomly chosen “causal variants” (CVs) with minor allele frequence > 0.05. The 379 individuals were split into discovery (190) and replication (189) sets. For each CV the marker with the maximum from the marker panel was recorded in the discovery set. This marker was known as the “discovery marker” (DM). The for each CV/DM pair was then calculated in the replication set where the discovery LD was ascertained to be high, such that . We observed that there was an average decrease in relative to , and that this decrease was larger with increasing x. We observed that whereas . The average drop in in replication r̂8 was3 times higher than the drop in r̂2.

4.3 Interpretation

Simulation 1 shows that sampling variance of r increases as x increases. Detection of epistatis is highly dependent upon high r̂. Amongst the discovery SNPs there will be a mixture of interactions where observed SNPs are either in true high LD with causal variants, or will have highly inflated sample r̂ compared to the population r. Simulation 2 shows that as x gets larger, the average decrease in r̂ between discovery and replication becomes larger, likely to be a result of ascertained high r̂ in the discovery and increased sampling variance with increasing x in the replication. These results demonstrate that if all else is equal, the impact of sampling variance of r alone will reduce the replication rate of epistatic effects compared to additive effects.

5 Additive and non-additive variance estimation

5.1 Fixed effects

To compare the relative contribution to the phenotypic variance of gene expression levels between additive and epistatic effects we are constrained by the problem that non-additive variance components for a phenotype cannot be calculated directly. Here, we only have SNP pairs that exceed a threshold of p < 2.91 × 10−16 = T. A strong conclusion cannot be made about the genome-wide variance contribution, but we can compare the variance explained by SNP effects at this threshold for additive scans and epistatic scans. In Powell et al 2012[23] an expression quantitative trait locus (eQTL) study was performed searching for additive effects in the same BSGS dataset as was used for the discovery here. Using the threshold T for the additive eQTL study, 453 of the 7339 probes analysed here had at least one significant additive effect. Assuming that the phenotypic variance for each of the probes is normalised to 1, the total phenotypic variance of all 7339 explained by the significant additive effects was 1.73%. Following the same procedure, at the threshold T there were 238 gene expression probes with at least one significant pairwise epistatic interaction out of the 7339 tested. In total the proportion of the phenotypic variance explained by the epistatic effects at these SNP pairs was 0.25%.

5.2 Limitations of this type of comparison

Though it is useful to compare the relative variances of epistatic and additive effects, it must be stressed that our results here are approximations that are very limited by the study design. We estimate that additive effects explain approximately 10 times more variance than epistatic effects, but this could be an overestimate or an underestimate due to a number of different caveats. Firstly, the ratio of additive to epistatic variance may differ at different minimum variance thresholds, and our estimate is determined by the threshold used. Secondly, the power of a 1 d.f. test exceeds that of an 8 d.f. test. Thirdly, the non-additive variance at causal variants is expected to be underestimated by observed SNPs in comparison to estimates for additive variance. And forthly, the extent of winner’s curse in estimation of effect sizes may differ between the two studies.

5.3 Pedigree estimates

The gene expression levels for MBNL1, TMEM149, NAPRT1, TRAPPC5 and CAST are influenced by large cis-trans epistatic networks (eight interactions or more). Though it is not possible to orthogonally estimate the non-additive genetic variance for non-clonal populations, an approximation of a component of non-additive variance can be estimated using pedigree information. The BSGS data is comprised of some related individuals and standard quantitative genetic analysis was used to calculate the additive and dominance variance components for each gene expression phenotype in Powell et al 2013[22]. The dominance effect is likely to capture additive × additive genetic variance plus some fraction of other epistatic variance components. We found that the aforementioned genes had dominance variance component estimates within the top 5% of all 17,994 gene expression probes that were analysed in Powell et al 2013.

6 Functional enrichment analysis

6.1 Tissue specific transcriptionally active regions

We employed a recently published method (http://www.broadinstitute.org/mpg/epigwas/)[27] that tests for cell-type-specific enrichment of active chromatin, measured through H3K4me3 chromatin marks[37] in regions surrounding the 731 SNPs that comprise the 501 discovery interactions. The exact method used to perform this analysis has been described previously[38]. Briefly, we tested the hypothesis that the 731 SNPs were more likely to be in transcriptionally active regions (as measured by chromatin marks) than a random set of SNPs selected from the same SNP chip. This hypothesis was tested for 34 cell types across four broad tissue types (haematopoietic, gastrointenstinal, musculoskeletal and endocrine, and brain).

6.2 Chromosome interactions

It has been shown[13] that different regions on different chromosomes or within chromosomes spatially colocalise within the cell. We shall refer to the colocalisation of two chromosome regions as a chromosome interaction. A map of pairwise chromosome interactions for K562 blood cell lines was recently produced[29], and we hypothesised that part of the underlying biological mechanism behind some of the 501 epistatic interactions may arise from chromosome interactions. We found that 44 of the putative epistatic interactions were amongst SNPs that were within 5Mb of known chromosome interactions. This means that SNP A was no more than 2.5Mb from the focal point of the chromosome interaction on chromosome A, and SNP B was no more than 2.5Mb from the focal point on chromosome B. We performed simulations to test how extreme the observation of 44 epistatic interactions overlapping with chromosome interactions is compared to chance. Chromosome interactions fall within functional genomic regions[13,29], and the SNPs in our epistatic interactions are enriched for functional genomic regions. Therefore, we designed the simulations to ensure that the null distribution was of chromosome interactions between SNPs enriched for functional genomic regions but with no known epistatic interactions. To do this we used the 731 SNPs that form the 501 putative epistatic interactions and randomly shuffled them to create new sets of 501 pairs, disallowing any SNP combinations that were in the original set. Therefore, each new random set was enriched for functional regions but had no genetic interactions. We scanned the map of chromosome interactions for overlaps with the new sets and then repeated the random shuffling process. We performed 1,000 such permutations to generate a null distribution of chromosome interaction overlaps. We repeated this process, searching for overlaps within 1Mb, 250kb, and 10kb.

6.3 SNP colocalisation with genomic features

We tested for enrichment of genomic features for the 687 IndexSNPs that comprise the 434 epistatic interactions with data present in discovery and replication datasets. For each of the 687 IndexSNPs we calculated LD with all regional SNPs within a radius of 0.5Mb and kept all regional SNPs with LD r2 > 0.8. We then cross-referenced the remaining regional SNPs with the annotated chromatin structure reference[28]) querying whether the regional SNPs fell in Predicted promoter region including TSS (TSS), Predicted promoter flanking region (PF), Predicted enhancer (E), Predicted weak enhancer or open chromatin cis regulatory element (WE), CTCF enriched element (CTCF), Predicted transcribed region (T), or Predicted Repressed or Low Activity region (R) positions. Therefore a particular IndexSNP might cover multiple genomic features through LD. We then performed the whole querying process for each of the 528,509 SNPs present in the SNP chip used in the scan, and used the results from this second analysis to establish a null distribution for the expected proportion of SNPs for each genomic feature. We calculated p-values for enrichment of each of the seven genomic features independently, and for cis- and trans-SNPs separately, using a binomial test. For each genomic feature we used the expected proportion of SNPs as the expected probability of “success” (p). Here, a success is defined as an IndexSNP residing in a region that includes the genomic feature. The observed number of successes for each IndexSNP (k) out of the total count of IndexSNPs (n) was then modelled as .

6.4 Transcription factor enrichment

To test for enrichment of transcription factor binding sites (TFBS) we followed a procedure similar to that described in Section 6.3. For each of the 687 IndexSNPs we extracted regional SNPs as previously described. We then used the PWMEnrich package in Bioconductor (http://www.bioconductor.org/packages/2.12/bioc/html/PWMEnrich.html) to identify which TFBSs each of the regional SNPs for one IndexSNP falls in (within a radius of 250bp). Thus, the number of occurrences of a particular TFBS was counted for each IndexSNP. We used the “Threshold-free affinity” method for identifying TFBSs[39]. We constructed a null distribution of expected TFBS occurrences based on the same null hypothesis as described in Section 6.3 - the probability of an IndexSNP covering a particular TFBS is identical to any of the 528,509 SNPs in the discovery SNP chip. To do this, we performed the same procedure for each SNP in the discovery SNP chip as was performed for each IndexSNP to obtain an expected probability of covering a particular TFBS. We then tested the IndexSNPs for enrichment of each TFBS independently, and for cis- and trans-SNPs separately. p-values were obtained using Z-scores, calculated by using a normal approximation to the sum of binomial random variables representing motif hits along the sequence[40].

6.5 Defining previously identified SNP associations

The discovery dataset (BSGS) had previously been analysed for additive and dominant marginal effects for all gene expression levels[22,23]. To define SNPs that had been previously detected to have effects for a particular gene expression level we used a significance threshold accounting for multiple testing across SNPs and expression probes, T = 0.05/(528509 × 7339) = 1.29 × 10−11. From this, we found that only nine of the 501 discovery interactions had known main effects, 64 were between SNPs that had no known marginal effects, and 439 were between a SNP with a known marginal effect and a SNP with no known marginal effect.

38 in total

Review 1. DNA binding sites: representation and discovery.

Authors: G D Stormo
Journal: Bioinformatics Date: 2000-01 Impact factor: 6.937

2. Darwinian evolution can follow only very few mutational paths to fitter proteins.

Authors: Daniel M Weinreich; Nigel F Delaney; Mark A Depristo; Daniel L Hartl
Journal: Science Date: 2006-04-07 Impact factor: 47.728

3. An Extension of the Concept of Partitioning Hereditary Variance for Analysis of Covariances among Relatives When Epistasis Is Present.

Authors: C C Cockerham
Journal: Genetics Date: 1954-11 Impact factor: 4.562

4. GenABEL: an R library for genome-wide association analysis.

Authors: Yurii S Aulchenko; Stephan Ripke; Aaron Isaacs; Cornelia M van Duijn
Journal: Bioinformatics Date: 2007-03-23 Impact factor: 6.937

5. Genetics of gene expression surveyed in maize, mouse and man.

Authors: Eric E Schadt; Stephanie A Monks; Thomas A Drake; Aldons J Lusis; Nam Che; Veronica Colinayo; Thomas G Ruff; Stephen B Milligan; John R Lamb; Guy Cavet; Peter S Linsley; Mao Mao; Roland B Stoughton; Stephen H Friend
Journal: Nature Date: 2003-03-20 Impact factor: 49.962

6. Common variants in the trichohyalin gene are associated with straight hair in Europeans.

Authors: Sarah E Medland; Dale R Nyholt; Jodie N Painter; Brian P McEvoy; Allan F McRae; Gu Zhu; Scott D Gordon; Manuel A R Ferreira; Margaret J Wright; Anjali K Henders; Megan J Campbell; David L Duffy; Narelle K Hansell; Stuart Macgregor; Wendy S Slutske; Andrew C Heath; Grant W Montgomery; Nicholas G Martin
Journal: Am J Hum Genet Date: 2009-11-05 Impact factor: 11.025

7. GWAS of 126,559 individuals identifies genetic variants associated with educational attainment.

Authors: Cornelius A Rietveld; Sarah E Medland; Jaime Derringer; Jian Yang; Tõnu Esko; Nicolas W Martin; Harm-Jan Westra; Konstantin Shakhbazov; Abdel Abdellaoui; Arpana Agrawal; Eva Albrecht; Behrooz Z Alizadeh; Najaf Amin; John Barnard; Sebastian E Baumeister; Kelly S Benke; Lawrence F Bielak; Jeffrey A Boatman; Patricia A Boyle; Gail Davies; Christiaan de Leeuw; Niina Eklund; Daniel S Evans; Rudolf Ferhmann; Krista Fischer; Christian Gieger; Håkon K Gjessing; Sara Hägg; Jennifer R Harris; Caroline Hayward; Christina Holzapfel; Carla A Ibrahim-Verbaas; Erik Ingelsson; Bo Jacobsson; Peter K Joshi; Astanand Jugessur; Marika Kaakinen; Stavroula Kanoni; Juha Karjalainen; Ivana Kolcic; Kati Kristiansson; Zoltán Kutalik; Jari Lahti; Sang H Lee; Peng Lin; Penelope A Lind; Yongmei Liu; Kurt Lohman; Marisa Loitfelder; George McMahon; Pedro Marques Vidal; Osorio Meirelles; Lili Milani; Ronny Myhre; Marja-Liisa Nuotio; Christopher J Oldmeadow; Katja E Petrovic; Wouter J Peyrot; Ozren Polasek; Lydia Quaye; Eva Reinmaa; John P Rice; Thais S Rizzi; Helena Schmidt; Reinhold Schmidt; Albert V Smith; Jennifer A Smith; Toshiko Tanaka; Antonio Terracciano; Matthijs J H M van der Loos; Veronique Vitart; Henry Völzke; Jürgen Wellmann; Lei Yu; Wei Zhao; Jüri Allik; John R Attia; Stefania Bandinelli; François Bastardot; Jonathan Beauchamp; David A Bennett; Klaus Berger; Laura J Bierut; Dorret I Boomsma; Ute Bültmann; Harry Campbell; Christopher F Chabris; Lynn Cherkas; Mina K Chung; Francesco Cucca; Mariza de Andrade; Philip L De Jager; Jan-Emmanuel De Neve; Ian J Deary; George V Dedoussis; Panos Deloukas; Maria Dimitriou; Guðny Eiríksdóttir; Martin F Elderson; Johan G Eriksson; David M Evans; Jessica D Faul; Luigi Ferrucci; Melissa E Garcia; Henrik Grönberg; Vilmundur Guðnason; Per Hall; Juliette M Harris; Tamara B Harris; Nicholas D Hastie; Andrew C Heath; Dena G Hernandez; Wolfgang Hoffmann; Adriaan Hofman; Rolf Holle; Elizabeth G Holliday; Jouke-Jan Hottenga; William G Iacono; Thomas Illig; Marjo-Riitta Järvelin; Mika Kähönen; Jaakko Kaprio; Robert M Kirkpatrick; Matthew Kowgier; Antti Latvala; Lenore J Launer; Debbie A Lawlor; Terho Lehtimäki; Jingmei Li; Paul Lichtenstein; Peter Lichtner; David C Liewald; Pamela A Madden; Patrik K E Magnusson; Tomi E Mäkinen; Marco Masala; Matt McGue; Andres Metspalu; Andreas Mielck; Michael B Miller; Grant W Montgomery; Sutapa Mukherjee; Dale R Nyholt; Ben A Oostra; Lyle J Palmer; Aarno Palotie; Brenda W J H Penninx; Markus Perola; Patricia A Peyser; Martin Preisig; Katri Räikkönen; Olli T Raitakari; Anu Realo; Susan M Ring; Samuli Ripatti; Fernando Rivadeneira; Igor Rudan; Aldo Rustichini; Veikko Salomaa; Antti-Pekka Sarin; David Schlessinger; Rodney J Scott; Harold Snieder; Beate St Pourcain; John M Starr; Jae Hoon Sul; Ida Surakka; Rauli Svento; Alexander Teumer; Henning Tiemeier; Frank J A van Rooij; David R Van Wagoner; Erkki Vartiainen; Jorma Viikari; Peter Vollenweider; Judith M Vonk; Gérard Waeber; David R Weir; H-Erich Wichmann; Elisabeth Widen; Gonneke Willemsen; James F Wilson; Alan F Wright; Dalton Conley; George Davey-Smith; Lude Franke; Patrick J F Groenen; Albert Hofman; Magnus Johannesson; Sharon L R Kardia; Robert F Krueger; David Laibson; Nicholas G Martin; Michelle N Meyer; Danielle Posthuma; A Roy Thurik; Nicholas J Timpson; André G Uitterlinden; Cornelia M van Duijn; Peter M Visscher; Daniel J Benjamin; David Cesarini; Philipp D Koellinger
Journal: Science Date: 2013-05-30 Impact factor: 47.728

8. An evolutionary perspective on epistasis and the missing heritability.

Authors: Gibran Hemani; Sara Knott; Chris Haley
Journal: PLoS Genet Date: 2013-02-28 Impact factor: 5.917

9. Blood-informative transcripts define nine common axes of peripheral blood gene expression.

Authors: Marcela Preininger; Dalia Arafat; Jinhee Kim; Artika P Nath; Youssef Idaghdour; Kenneth L Brigham; Greg Gibson
Journal: PLoS Genet Date: 2013-03-14 Impact factor: 5.917

Review 10. Data and theory point to mainly additive genetic variance for complex traits.

Authors: William G Hill; Michael E Goddard; Peter M Visscher
Journal: PLoS Genet Date: 2008-02-29 Impact factor: 5.917

94 in total

1. Leveraging epigenomics and contactomics data to investigate SNP pairs in GWAS.

Authors: Elisabetta Manduchi; Scott M Williams; Alessandra Chesi; Matthew E Johnson; Andrew D Wells; Struan F A Grant; Jason H Moore
Journal: Hum Genet Date: 2018-05-24 Impact factor: 4.132

2. A random forest approach to capture genetic effects in the presence of population structure.

Authors: Johannes Stephan; Oliver Stegle; Andreas Beyer
Journal: Nat Commun Date: 2015-06-25 Impact factor: 14.919

3. A model to investigate SNPs' interaction in GWAS studies.

Authors: Enrico Cocchi; Antonio Drago; Chiara Fabbri; Alessandro Serretti
Journal: J Neural Transm (Vienna) Date: 2014-11-29 Impact factor: 3.575

Review 4. The role of regulatory variation in complex traits and disease.

Authors: Frank W Albert; Leonid Kruglyak
Journal: Nat Rev Genet Date: 2015-02-24 Impact factor: 53.242

5. Widespread Allelic Heterogeneity in Complex Traits.

Authors: Farhad Hormozdiari; Anthony Zhu; Gleb Kichaev; Chelsea J-T Ju; Ayellet V Segrè; Jong Wha J Joo; Hyejung Won; Sriram Sankararaman; Bogdan Pasaniuc; Sagiv Shifman; Eleazar Eskin
Journal: Am J Hum Genet Date: 2017-05-04 Impact factor: 11.025

6. Epistasis and destabilizing mutations shape gene expression variability in humans via distinct modes of action.

Authors: Ence Yang; Gang Wang; Jizhou Yang; Beiyan Zhou; Yanan Tian; James J Cai
Journal: Hum Mol Genet Date: 2016-11-15 Impact factor: 6.150

7. Significance Testing for Allelic Heterogeneity.

Authors: Yangqing Deng; Wei Pan
Journal: Genetics Date: 2018-06-29 Impact factor: 4.562

8. Additive gene-environment effects on hippocampal structure in healthy humans.

Authors: Ulrich Rabl; Bernhard M Meyer; Kersten Diers; Lucie Bartova; Andreas Berger; Dominik Mandorfer; Ana Popovic; Christian Scharinger; Julia Huemer; Klaudius Kalcher; Gerald Pail; Helmuth Haslacher; Thomas Perkmann; Christian Windischberger; Burkhard Brocke; Harald H Sitte; Daniela D Pollak; Jean-Claude Dreher; Siegfried Kasper; Nicole Praschak-Rieder; Ewald Moser; Harald Esterbauer; Lukas Pezawas
Journal: J Neurosci Date: 2014-07-23 Impact factor: 6.167

9. A meta-analytic framework for detection of genetic interactions.

Authors: Yulun Liu; Yong Chen; Paul Scheet
Journal: Genet Epidemiol Date: 2016-08-15 Impact factor: 2.135

10. An ancestry-based approach for detecting interactions.

Authors: Danny S Park; Itamar Eskin; Eun Yong Kang; Eric R Gamazon; Celeste Eng; Christopher R Gignoux; Joshua M Galanter; Esteban Burchard; Chun J Ye; Hugues Aschard; Eleazar Eskin; Eran Halperin; Noah Zaitlen
Journal: Genet Epidemiol Date: 2017-11-08 Impact factor: 2.135