Literature DB >> 28592644

Epistasis: Searching for Interacting Genetic Variants Using Crosses.

Ian M Ehrenreich¹.

Abstract

Entities: Disease Gene Species

Keywords: crosses; epistasis; genetic interactions; linkage mapping; multiparental populations, MPP

Mesh：

Year: 2017 PMID： 28592644 PMCID： PMC5473743 DOI： 10.1534/g3.117.042770

Source DB: PubMed Journal: G3 (Bethesda) ISSN： 2160-1836 Impact factor: 3.154

× No keyword cloud information.

Epistasis Matters In Quantitative Genetics

Within quantitative genetics, the term “epistasis” is used to broadly describe situations in which combinations of genetic variants show nonadditive phenotypic effects (Phillips 1998, 2008; Mackay 2014). Although most work on epistasis has focused on pairs of variants that interact (Brem ; Bloom ), more complicated forms of epistasis can also occur (Taylor and Ehrenreich 2015a). These include higher-order interactions between three or more variants (Rowe ; Pettersson ; Taylor and Ehrenreich 2014) and cases in which one variant acts as a hub of interactions with a number of other variants (Carlborg ; Forsberg ). Despite many reports of epistasis, its importance to quantitative genetics remains under active debate (Huang and Mackay 2016). This is in part because theory suggests that, even if epistasis is present, most genetic variance will be additive (Hill ; Maki-Tanila and Hill 2014). Consistent with this argument, purely additive models explain most of the heritability of many quantitative traits (Bloom ) and have proven quite effective in crop and livestock breeding programs (Crow 2010). Given that epistasis can be ignored to little detriment, what do we gain by studying epistasis? Epistasis matters for multiple reasons. A central goal of quantitative genetics is to determine the genetic architectures that underlie heritable traits (Mackay 2001). By definition, this endeavor entails identifying nearly all of the genetic effects that appreciably influence phenotypes, including epistatic effects. Achieving such a precise understanding of genotype–phenotype relationships advances our basic knowledge of genetics and can improve our ability to predict traits, such as disease risk and crop yield, from genome sequences (Forsberg ). Because epistasis often reflects functional relationships between genes, finding interacting variants can also shed light on molecular mechanisms that give rise to trait variability (Aylor and Zeng 2008; Rowe ; Cordell 2009; Huang ; Taylor ). Furthermore, epistasis impacts our understanding of why genetically distinct individuals respond differently to new spontaneous and induced mutations (Nadeau 2001; Queitsch ; Mackay 2014; Siegal and Leu 2014; Schell ). Such background effects are common across species and traits, and are known to contribute to clinically relevant phenotypes (Nadeau 2001; Chandler ). Recent work has shown that genetic background effects often reflect complex interactions between new mutations and multiple segregating variants (Dowell ; Chari and Dworkin 2013; Chandler ; Paaby ; Taylor and Ehrenreich 2015b; Geiler-Samerotte ; Lee ; Taylor ). Thus, predicting how individuals will respond to new mutations, including genetic changes introduced by genome editing (Cong ; Mali ), will likely require accounting for epistasis.

Challenges in Using Genetic Mapping to Detect Epistasis

Identifying epistasis is difficult because most genetic mapping studies are only capable of detecting the simplest and largest effect interactions (Taylor and Ehrenreich 2015a). Although selective genotyping approaches can be used to find interacting variants (Ehrenreich ; Taylor and Ehrenreich 2014, 2015b; Lee ; Taylor ), usually epistasis is identified by association or linkage mapping (Marchini ; Cordell 2009; Verhoeven ; Bloom ; Forsberg ). A common challenge in genome-wide scans for epistasis is multiple testing (Cordell 2009; Sham and Purcell 2014). The number of tests in a scan for epistasis will scale almost exponentially with the order of the interactions being considered (Cordell 2009). For example, assuming the number of variants in a population equals 10,000, then the number of tests in genome-wide scans for pairwise, three-way, and four-way epistasis will be ∼5×107, ∼2×1011, and ∼4×1014. With these large numbers of tests, stringent statistical approaches must be employed to minimize false positives (Sham and Purcell 2014). A related difficulty that genome-wide scans for epistasis face is statistical power. Leveraging data from multiple traits (Tyler , 2017), searching for epistatic effects involving variants that also have additive effects (Storey ; Laurie ), jointly modeling additive and epistatic effects (Marchini ; Verhoeven ), and identifying variants that respond to genetic background (Jannink and Jansen 2001) or show effects on phenotypic variance (Ronnegard and Valdar 2011) are just some of the approaches that can aid in the detection of interacting variants. Yet, arguably the best solution to the statistical power problem is to use large sample sizes in genome-wide scans for epistasis (Bloom , 2015; Hallin ). Notably, both overall sample size in a study and sample sizes within multi-locus genotype classes must be considered (Carlborg and Haley 2004). Samples sizes within multi-locus genotype classes should ideally be balanced, but in some cases this may not be possible, for example when association mapping is performed on natural isolates that possess population structure and a spectrum of allele frequencies (Mackay ). Another factor that may be important to detecting epistasis is how often involved variants also show additive effects. This question has bearing on whether efforts to identify epistasis can be simplified into a two-step process in which additive variants are first identified and then their interactions are measured. Recent work indicates that interacting variants also tend to exhibit additive effects (Bloom ). However, in some cases, new mutations appear to interact with “cryptic” variants that do not typically influence phenotype (Gibson and Dworkin 2004; Paaby and Rockman 2014), suggesting that major epistatic effects can involve variants that lack additive effects.

Exploring Epistasis with Crosses

One of the best opportunities for identifying interacting variants is using linkage mapping in crosses of genetically diverse isolates from model species (Carlborg and Haley 2004; Mackay ; Taylor and Ehrenreich 2015a). In many of these organisms, isolates can be made homozygous by inbreeding [e.g., Drosophila (Mackay ) and mouse (Beck )], sporulation [e.g., budding yeast (Liti ; Schacherer )], or the creation of doubled haploids [e.g., many plants (Maluszynski )], enabling the generation of stable genotypes that minimize heterozygosity. Using inbred lines as the founders of crosses is desirable because it allows unambiguous cataloging of the variants that will segregate among progeny. RILs can then be produced from cross progeny in the same way that the inbred founders were generated (Carlborg and Haley 2004; Mackay ; Taylor and Ehrenreich 2015a). RILs represent a powerful resource for identifying epistatic effects because they carry random combinations of the variants that differentiate their founders and have minimal to no population structure (Carlborg and Haley 2004; Rockman 2008; Mackay ; Taylor and Ehrenreich 2015a). There are many experimental design choices to make when constructing RIL populations (Verhoeven ; Rockman and Kruglyak 2008; Mackay ). Assuming sample size is not limiting, one of the key decisions in constructing a cross is the number of founders (Kover ; Aylor ; Long ). While two-parent RIL populations are commonly used, multiparent RILs can be generated from dozens of founders or more (Ladejobi ). As highlighted by the rapidly growing “Multiparental Populations” series in GENETICS and G3: Genes|Genomes|Genetics (de Koning and McIntyre 2014), there is tremendous interest in using RIL populations derived from more than two founders to examine the genetic basis of quantitative traits. A number of crossing designs have been described for generating multiparent RILs. These include, but are not limited to, employing multiple rounds of crossing to ensure that each founder contributes equally to each RIL (Churchill ), Nested Association Mapping (NAM) in which one common founder is crossed to many others (McMullen ), and crossing each founder to two or more of the other founders in a full or partial diallel design (Verhoeven ; Treusch ). Multiparent RILs can also be interbred to produce outbred populations that resemble natural populations but lack population structure (Svenson ). Relative to more traditional two-parent crosses, multiparent populations have some clear advantages: they sample a greater fraction of the genetic diversity that exists within a species and can lead to finer mapping resolution (Yu ; Kover ; Aylor ; Long ).

Tradeoffs in Searching for Epistasis Using Multiparent Crosses

Regarding epistasis, the main strength of multiparent populations relative to two-parent crosses is a more complete sampling of the combinations of interacting variants that segregate in a species. However, the specific crossing design used to generate multiparent RILs will influence the epistatic effects that are detectable. For example, the maize NAM population was generated by mating 25 genetically diverse founders to the same reference line (B73) and producing RILs from each two-parent cross (Yu ; Buckler ; McMullen ). The NAM panel provides a compelling opportunity to identify interactions involving variants carried by B73 (Yu ; Peiffer ). However, this population might have more limited potential to identify other epistatic effects. Generating multiparent RILs that are equally derived from each founder can maximize the epistatic effects present in a cross, but has consequences for multi-locus genotype frequencies at interacting variants. While two-parent RILs have the advantage that all variants and two-locus combinations should segregate at ∼1/2 and ∼1/4, respectively, this is not the case for multiparent RILs. For example, the eight founders of the mouse Collaborative Cross contribute almost equally to each RIL (Churchill ; Aylor ; Collaborative Cross Consortium 2012), implying that minor allele frequencies should be between ∼1/8 and ∼1/2 among the RILs. This variability in allele frequencies can lead to low and unbalanced multi-locus genotype frequencies at interacting variants, which may result in false negatives in genome-wide scans for epistasis. In an extreme case where two founder-specific variants interact, each will occur in roughly an eighth of the RILs and the four multi-locus genotype frequencies involving the variants will have frequencies of ∼1/64, ∼7/64, ∼7/64, and ∼49/64. Despite this issue, multiparent populations like the Collaborative Cross can be a very useful resource for studying epistasis, especially when systems level data are available or information is leveraged across traits (Tyler ). An additional factor to consider when using multiparent populations to study epistasis is allelic heterogeneity, which occurs when multiple causal variants reside in either the same gene or different, closely-linked genes (Risch 2000; Long ; Matsui ; Linder ). Many cases of allelic heterogeneity have been found in both multiparent genetic mapping (Buckler ; Ehrenreich ; King , 2014; Peiffer ) and association studies (Lango Allen ; Hormozdiari ). With respect to epistasis, this allelic heterogeneity may make it more difficult to detect interacting variants in multiparent populations than in comparably sized two-parent populations.

Conclusion

Epistasis has important phenotypic effects, but can be difficult to identify. RILs produced by crossing genetically distinct isolates can facilitate the detection of interacting variants, but experimental design criteria must be considered, including how many founders to employ. Expanding the genetic variation that is present in a cross by using more founders has both advantages and disadvantages. For example, RILs produced by crossing two founders will have balanced multi-locus genotype frequencies, which can provide statistical power to identify pairwise and higher-order epistasis. However, comprehensively mapping epistatic effects across a species requires using a number of founders. These considerations speak to how epistasis is a complex and incompletely understood phenomenon that has no single form. Thus, assuming finite resources, the most appropriate experimental design for studying epistasis may depend on the specific question one wants to address.

77 in total

1. Mapping epistatic quantitative trait loci with one-dimensional genome searches.

Authors: J L Jannink; R Jansen
Journal: Genetics Date: 2001-01 Impact factor: 4.562

2. Epistasis and the release of genetic variation during long-term selection.

Authors: Orjan Carlborg; Lina Jacobsson; Per Ahgren; Paul Siegel; Leif Andersson
Journal: Nat Genet Date: 2006-03-12 Impact factor: 38.330

Review 3. The genetics of quantitative traits: challenges and prospects.

Authors: Trudy F C Mackay; Eric A Stone; Julien F Ayroles
Journal: Nat Rev Genet Date: 2009-08 Impact factor: 53.242

4. Influence of gene interaction on complex trait variation with multilocus models.

Authors: Asko Mäki-Tanila; William G Hill
Journal: Genetics Date: 2014-07-01 Impact factor: 4.562

Review 5. Epistasis--the essential role of gene interactions in the structure and evolution of genetic systems.

Authors: Patrick C Phillips
Journal: Nat Rev Genet Date: 2008-11 Impact factor: 53.242

6. The Drosophila melanogaster Genetic Reference Panel.

Authors: Trudy F C Mackay; Stephen Richards; Eric A Stone; Antonio Barbadilla; Julien F Ayroles; Dianhui Zhu; Sònia Casillas; Yi Han; Michael M Magwire; Julie M Cridland; Mark F Richardson; Robert R H Anholt; Maite Barrón; Crystal Bess; Kerstin Petra Blankenburg; Mary Anna Carbone; David Castellano; Lesley Chaboub; Laura Duncan; Zeke Harris; Mehwish Javaid; Joy Christina Jayaseelan; Shalini N Jhangiani; Katherine W Jordan; Fremiet Lara; Faye Lawrence; Sandra L Lee; Pablo Librado; Raquel S Linheiro; Richard F Lyman; Aaron J Mackey; Mala Munidasa; Donna Marie Muzny; Lynne Nazareth; Irene Newsham; Lora Perales; Ling-Ling Pu; Carson Qu; Miquel Ràmia; Jeffrey G Reid; Stephanie M Rollmann; Julio Rozas; Nehad Saada; Lavanya Turlapati; Kim C Worley; Yuan-Qing Wu; Akihiko Yamamoto; Yiming Zhu; Casey M Bergman; Kevin R Thornton; David Mittelman; Richard A Gibbs
Journal: Nature Date: 2012-02-08 Impact factor: 49.962

7. Genetic mapping of MAPK-mediated complex traits Across S. cerevisiae.

Authors: Sebastian Treusch; Frank W Albert; Joshua S Bloom; Iulia E Kotenko; Leonid Kruglyak
Journal: PLoS Genet Date: 2015-01-08 Impact factor: 5.917

8. Transcriptional Derepression Uncovers Cryptic Higher-Order Genetic Interactions.

Authors: Matthew B Taylor; Ian M Ehrenreich
Journal: PLoS Genet Date: 2015-10-20 Impact factor: 5.917

Review 9. Maximizing the potential of multi-parental crop populations.

Authors: Olufunmilayo Ladejobi; James Elderfield; Keith A Gardner; R Chris Gaynor; John Hickey; Julian M Hibberd; Ian J Mackay; Alison R Bentley
Journal: Appl Transl Genom Date: 2016-10-26

10. Finding the sources of missing heritability in a yeast cross.

Authors: Joshua S Bloom; Ian M Ehrenreich; Wesley T Loo; Thúy-Lan Võ Lite; Leonid Kruglyak
Journal: Nature Date: 2013-02-03 Impact factor: 49.962

4 in total

1. Scalable Nonparametric Prescreening Method for Searching Higher-Order Genetic Interactions Underlying Quantitative Traits.

Authors: Juho A J Kontio; Mikko J Sillanpää
Journal: Genetics Date: 2019-10-04 Impact factor: 4.562