The molecular signature of selection depends strongly on whether new mutations are immediately favorable and sweep to fixation (hard sweeps) as opposed to when selection acts on segregating variation (soft sweeps). The prediction of reduced sequence variation around selected polymorphisms is much stronger for hard than soft sweeps, particularly when considering quantitative traits where sweeps are likely to be incomplete. Here, we directly investigate the genomic signal of soft sweeps within an artificial selection experiment on Mimulus guttatus. We first develop a statistical method based on Fisher's angular transformation of allele frequencies to identify selected loci. Application of this method identifies about 400 significant windows, but no fixed differences between phenotypically divergent populations. With two notable exceptions, we find a modest average effect of partial sweeps on the amount of molecular variation. The first exception is a polymorphic inversion on chromosome 6. The increase of the derived haplotype has a broad genomic effect due to recombination suppression coupled with substantial initial haplotype structure within the population. Second, we found significant increases in nucleotide variation around selected loci in the population evolving larger flowers. This suggests that "high" alleles for flower size were initially less frequent than "low" alleles. This result is consistent with prior studies of M. guttatus and illustrates how molecular evolution can depend on the allele frequency spectrum at quantitative trait loci.
The molecular signature of selection depends strongly on whether new mutations are immediately favorable and sweep to fixation (hard sweeps) as opposed to when selection acts on segregating variation (soft sweeps). The prediction of reduced sequence variation around selected polymorphisms is much stronger for hard than soft sweeps, particularly when considering quantitative traits where sweeps are likely to be incomplete. Here, we directly investigate the genomic signal of soft sweeps within an artificial selection experiment on Mimulus guttatus. We first develop a statistical method based on Fisher's angular transformation of allele frequencies to identify selected loci. Application of this method identifies about 400 significant windows, but no fixed differences between phenotypically divergent populations. With two notable exceptions, we find a modest average effect of partial sweeps on the amount of molecular variation. The first exception is a polymorphic inversion on chromosome 6. The increase of the derived haplotype has a broad genomic effect due to recombination suppression coupled with substantial initial haplotype structure within the population. Second, we found significant increases in nucleotide variation around selected loci in the population evolving larger flowers. This suggests that "high" alleles for flower size were initially less frequent than "low" alleles. This result is consistent with prior studies of M. guttatus and illustrates how molecular evolution can depend on the allele frequency spectrum at quantitative trait loci.
The most immediate purpose of population genetics is to predict allele frequency change. Throughout the 20th century, theorists developed increasingly detailed models to predict the simultaneous action of mutation, selection, migration, and genetic drift under various demographic and ecological scenarios. For most of this interval, there was little relevant data to evaluate the theory (Lewontin 1974). Direct observations of allele frequency change were largely limited to visible polymorphisms with a simple genetic basis (Ford 1971) or major chromosomal changes (Dobzhansky and Levene 1948). However, sequencing technologies now afford an unprecedented view of genetic change within populations.Second-generation sequencing has enabled “population genomics,” the characterization of whole-genome divergence among both natural (e.g. Emerson et al. 2010; Hohenlohe et al. 2010a) and experimental populations (e.g. Johansson et al. 2010; Turner et al. 2011). The method of population genomics typically involves two stages (Hohenlohe et al. 2010b; also see Pavlidis et al. 2008; Oleksyk et al. 2010). First, one estimates neutral demographic processes from a genome-wide integration of signal. Second, one identifies genomic outliers to be considered putative targets of selection. For example, polymorphisms that exhibit the greatest divergence in allele frequency among populations (FST outliers) may represent loci subject to divergent selection (Lewontin and Krakauer 1973; Beaumont and Nichols 1996). While polymorphism or “SNP-specific” analyses are intuitive, there are both population genetic and statistical reasons to seek evidence of selection within genomic windows (sets of closely linked polymorphisms). The population genetic reason is hitch-hiking. Selection on a polymorphism will cause allele frequency change at closely linked neutral sites (Maynard Smith and Haigh 1974). The statistical justification is that allele frequencies are estimated with error. As a consequence, aggregating information across polymorphisms within a window of sites can provide a more accurate characterization of divergence.The hitch-hiking effect is contingent on the nature of genetic variation and how selection acts on that variation. The simplest case is a so-called “hard sweep” where a new mutation is immediately favorable and its increase in frequency carries a single haplotype (a window of sites around the selected polymorphism) to fixation in the population (Maynard Smith and Haigh 1974). As a consequence, variation is eliminated around the selected site. The genomic extent of this effect depends on the rate of recombination, and also on the strength of selection, which determines how rapidly the favorable mutation fixes. In contrast, “soft sweeps” occur when extant alleles become favorable, say due to change in environmental conditions, and then increase in frequency. Here, the effect on linked neutral variation is more subtle because the favored allele occurs in a diversity of genetic backgrounds when it begins to increase in frequency (Innan and Kim 2004; Hermisson and Pennings 2005; Pennings and Hermisson 2006; Chevin and Hospital 2008).Hitch-hiking may be most difficult to detect when considering selection on quantitative traits. With multi-locus inheritance, large changes in traits means can be generated through incremental changes in allele frequency at many loci. As a consequence, sweeps are likely to be both soft and incomplete. Interestingly, this kind of selection can actually increase nucleotide diversity within a genomic region. If the less common allele at a Quantitative Trait Locus (QTL) is favored, allele frequencies at this site will become more intermediate due to selection. If allele frequencies at neighboring sites also become more intermediate, then the level of sequence variation will be inflated for the entire genomic window. A simple calculation (Appendix A) indicates that this will occur if the uncommon favored allele tends to be positively associated with the less common base at neighboring single-nucleotide polymorphisms (SNPs).This article investigates genomic variation in populations subject to divergent selection on a quantitative trait. Variation in flower size of Mimulus guttatus is highly polygenic (Kelly 2008; Lee 2009). Two independent populations (Low and High) were derived from a common source, a large and highly variable ancestral population, and subject to divergent artificial selection on flower size for nine generations. After nine generations, Low and High are largely non-overlapping in flower size (fig. 1) and substantially different in numerous morphological and life history traits (see table 2 of Kelly 2008). As in previous studies of fruit flies (Burke et al. 2010; Turner et al. 2011; Remolina et al. 2012), cattle (Hayes et al. 2009), and chickens (Johansson et al. 2010), we here investigate genomic divergence among artificially selected populations. However, the present study differs from previous experiments importantly in time scale; considering nine generations of evolution instead of hundreds (e.g. Burke et al. 2010). Divergence of our High and Low populations must be overwhelmingly due to the recruitment of standing genetic variation in flower size. Selective sweeps, if evident at all, must be soft. We also expect that most of the change in phenotype will be due to loci that have not yet fixed the favorable allele.
F
The density functions of corolla width (flower size) are depicted for the Low and High populations, respectively. The arrow denotes the mean of the Ancestral population.
The density functions of corolla width (flower size) are depicted for the Low and High populations, respectively. The arrow denotes the mean of the Ancestral population.This study is based on Illumina sequence data from pooled independent samples within the Low, High, and Ancestral populations. We first develop a simple statistical model based on classical population genetic methods to analyze whole genome divergence in allele frequency. Calibration of the model provides parameter estimates that are informative about the drift/sampling process and also thresholds for distinguishing significant outliers. Outliers are putative targets of selection. Surprisingly, we find that the contrast between the most divergent populations (Low vs High) indentifies fewer outliers than the comparisons of each selected population distinctly to the Ancestral population. The ancestral contrasts also identify a striking example of parallel evolution within Low and High.This experiment also confirms two predictions from the population genetic theory of hitch-hiking. First, the relatively weak genomic signal of soft, partial sweeps is expected to be more pronounced if 1) the selected region exhibits substantial linkage disequilibrium when selection is initiated and 2) recombination is suppressed. These conditions are satisfied by a polymorphic inversion previously mapped by Lee (2009). We find that evolution at this locus affects nucleotide diversity patterns over much of chromosome 6. Apart from the inversion on chromosome 6, we find that changes in nucleotide diversity near selected loci (divergent regions) are highly variable. However, there is a clear and significant tendency for sequence diversity to increase near selected loci within the High population. This suggests that selection has made allele frequencies more intermediate within High; a result consistent with previous quantitative genetic inferences about the allele frequency spectrum at flower size loci (Kelly 2008).
Materials and Methods
Experimental Procedures
The ancestral population (Zia-1) and selection experiment are described in detail in Kelly (2008). Briefly, the ancestral population was synthesized from a random sample of more than 1000 plants from a single natural population located at Iron Mountain in Oregon, USA. Distinct Low and High populations were established from this common source. In each generation, 1000 plants were grown to maturity within each population and measured for corolla width (the largest width of the lower lip of the flower). In each generation, we selected 200 plants (the smallest flowers in Low and largest in High), randomly paired these plants (within populations) assigning one as sire and the other as dam, and hand pollinated the dam. This process was repeated for nine successive generations.We grew progeny of the last generation simultaneously with plants germinated from the Ancestral population. We collected equal tissue per plant from 49 Low plants, 78 High plants, and 75 Ancestral population plants; each plant from an independent family within the population. We extracted DNA from each bulked tissue sample, made a single genomic DNA library for each population sample, and then sequenced each library within a distinct lane using the Illumina HiSeq 2000 instrument (v3 chemistry, 100 bp paired-end sequencing) at the University of Kansas Medical Center Genomics facility.The QTL mapping study of Lee (2009) identified a large inversion on chromosome 6 that is segregating in the Iron Mountain population of Mimulus guttatus. We conducted a distinct tissue collection from 96 plants of each population for individual genotyping of markers on chromosome 6. We extracted DNA from these samples using our standard protocol (Marriage et al. 2009) followed by polymerase chain reaction (PCR) amplification of three distinct markers: MgSTS_431, MgSTS_229, and MgSTS_480. Each of these loci is length polymorphic and a particular set of alleles—240 bp at MgSTS_431, 201 bp at MgSTS_229, and 270 bp at MgSTS_480—is diagnostic of the derived inversion haplotype on chromosome 6 (Scoville et al. 2009). Allele lengths were determined using capillary electrophoresis of PCR products using an ABI 3130 Genetic Analyzer (Applied Biosystems, Foster City, CA, USA). We sized fragments using GENEMAPPER 4.0 software (Applied Biosystems) calibrated with the ROX500 size standard (Applied Biosystems).
Processing of Sequence Data
Prior to read mapping, we trimmed low-quality ends and discarded reads reduced to fewer than 20 bases using Sickle (https://github.com/najoshi/sickle, last accessed July 30, 2013). For the High population, this process reduced the initial set of 394 million reads (197 million read pairs) to 360 million reads. For the Low population, read count was reduced from 358 to 333 million and the Ancestral population from 404 to 374 million. To map reads, we used BWA with default parameter values (Li and Durbin 2009) and a modified version of the M. guttatus reference genome (http://www.phytozome.net/, last accessed July 30, 2013) in which the repetitive regions were masked. After mapping, we removed PCR duplicates with Picard tools (http://picard.sourceforge.net, last accessed July 30, 2013). Duplicate removal followed by elimination of reads with a mapping quality less that 29 further reduced the number of reads to 110 million (High population), 96 million (Low), and 71 million (Ancestral), respectively.We identified candidate polymorphisms (hereafter SNPs) using Samtools 0.1.8 (Li et al. 2009). This version of Samtools-pileup does not assume the true allele frequency in the sample is ½ or 1. Candidates were subsequently filtered based on allele frequency information from the entire dataset with High, Low, and Ancestral samples considered jointly. To eliminate false SNPs caused by sequencing and/or mapping errors, candidate SNPs were eliminated from subsequent consideration if 1) the minor allele was not present in at least two copies and at a frequency of 1% or greater within the populations-combined sample, 2) there was significant (G > 3.84) and substantial (>20%) allele frequency divergence between strands, or 3) three bases were segregating at the site. For criterion (2), strand refers to whether a read aligned with the reference genome (plus) or as the reverse complement (minus). Without sequencing or mapping error, we expect that allele frequency should be the same in + and − strands; an expectation that we evaluate with a G test (Sokal and Rohlf 2000). Finally, we included a SNP within a particular contrast of populations (for example, High vs Low) only if each population had a minimum of 30× coverage and a maximum of 1000× coverage at the SNP. With these filters, the average coverage per SNP was 48× in the Ancestral population, 61× in Low, and 64× in High.After identifying valid SNPs, we recoded all read pairs to note location and allelic information at valid SNPs using custom Python scripts. These programs preserve haplotype information at the scale of read-pairs allowing estimation of linkage disequilibria (LD) between closely linked SNPs (2–500 bp). We wrote a series of C programs to calculate divergence statistics (see Statistical Model of Divergence section) and to estimate LD between all SNP pairs for which there were at least 20 read pairs within a population with valid base calls at each site. From these read pairs, we directly estimated the four “gamete frequencies” (y00, y10, y01, and y11), and then calculated p1 = y10 + y11, p2 = y01 + y11, D = y11 – p1p2, and r2 = D2/(p1p2 [1 − p1][1 − p2]). Our estimators for D and r2 are equivalent to Method 1 (direct inference) of Feder et al. (2012). When calculating D between two sites (A/a and B/b), we defined A and B as the most common bases at each site in the Ancestral population sample. Polarizing alleles this way does not affect r2, but is relevant to hitch-hiking effects on levels of variation. The variance increase condition of Appendix A requires positive D when alleles are scored this way.Divergence among populations was estimated for each SNP and also in “genomic windows.” For the latter, we defined windows as non-overlapping sets of S contiguous SNPs. In each window, we calculated B (eq. 2 below) as well as , the average of p(1 − p) across SNPs within a window. This is essentially the pooled sample equivalent of expected heterozygosity. Conditional on S, is directly proportional to nucleotide diversity (the average pairwise number of differences between sequences in a sample) and is thus a natural measure of sequence diversity within windows defined to have a fixed S. The full analytical pipeline is summarized in supplementary table S3, Supplementary Material online.
Statistical Model of Divergence
Figure 2 illustrates the populations of this study and the processes/events generating allele frequency differences among populations. Inference of evolutionary processes is complicated by the fact that allele frequencies are not directly observed. Let pA be the true allele frequency in the ancestral population; pL and pH for Low and High, respectively. The raw data are sequencing read counts; quantities that are removed from the allele frequencies by multiple sampling events. The first sampling event is collection of a finite number of individuals to form the “bulk” or “pool” for each population. Second, the various stages of library construction and events during the sequencing process may cause some genomes within a bulk to be overrepresented relative to others (library construction events in fig. 2). Finally, there is finite coverage of reads for any particular SNP.
F
Allele frequency (p) within the Ancestral (subscript A), Low (subscript L), and High (subscript H) populations is illustrated at various stages of the experiment.
Given marker genotypes, we scored each individual as CC, Cc, or cc (homozygous normal, heterozygous, or homozygous inversion) and then calculated the frequency of the inversion (P) in each population (Ancestral n = 91, Low n = 92, and High n = 96). In the Ancestral population, P = 0.15 with 95% confidence interval (0.10, 0.21). In Low, P = 0.66 (0.59, 0.73) while in High, P = 0.65 (0.58, 0.72). These data indicate a striking increase in the frequency of the Inverted haplotype within both High and Low populations over the course of nine generations of selection.
Allele Frequency Divergence between Populations
After filtering, we identified 1,224,974 SNPs with at least 30× coverage in the Low population, 1,405,249 in the High, and 836,385 in Ancestral. We calculated , , and for each SNP with sufficient coverage in the contrasted populations. Figure 3 illustrates that the distributions for these differences correspond closely to a normal distribution with mean zero and a contrast-specific variance (eqs. 1). The mean, variance, skew, and excess kurtosis are reported for all 14 of the main scaffolds (sequences corresponding to chromosomes 1–14) in supplementary table S1, Supplementary Material online. General agreement to the normal distribution is good for all chromosomes except chromosome 6 (supplementary fig. S1, Supplementary Material online). Skew is minimal. Excess kurtosis is typically small in magnitude, but on average positive across scaffolds. This suggests that a subset of SNPs have more extreme divergences than predicted from drift and sampling alone, consistent with genomically localized effects of selection.
F
The distribution of is depicted for the entire genome (n = 1,154,300).
The distribution of is depicted for the entire genome (n = 1,154,300).The distributions for chromosome 6 are clearly non-normal with positive excess kurtosis for Low vs High, and negative excess kurtosis for each contrast against the Ancestral population (supplementary fig. S1B, Supplementary Material online). The standard deviations of , and among SNPs on chromosome 6 were 0.56 and 0.59, respectively; much higher than for all other chromosomes (supplementary table S1, Supplementary Material online). These observations suggest that selection on the inversion, which suppresses recombination over a large portion of chromosome 6 (Lee 2009), has a genomically widespread effect on SNP variation. Given the evidence of a major selective event on chromosome 6, we excluded that data for subsequent model-fitting (e.g., estimating the v parameters of eqs. 1) and hypothesis testing.Using IQR as a robust estimator for the variance of transformed allele frequency divergence, σ2 = (IQR/1.349)2, and then subtracting variance attributable to SNP-specific finite coverage, we estimate that vL + vH = 0.13126, vL + vA = 0.07919, and vA + vH = 0.07602. Solving, vL = 0.06722, vH = 0.06404, and vA = 0.01198. These values, combined with the observed read counts, were employed to calculate SNP-specific d for each contrast, which were then compared with the standard normal distribution. For the Low versus High contrast, two SNPs were significantly divergent using a genome wide false discovery rate of 0.1. For High versus Ancestral, the 26 SNPs were significant, while no single SNP was significant for the Low versus Ancestral contrast. The significant SNPs are listed in supplemental table S2, Supplementary Material online.Considering divergence in genomic windows of 10 SNPs (S = 10), figure 4 is the observed distribution of B* (eq. 4) to the calibrated null distribution for Low versus High. There is excellent general agreement between observed and null, indicating that the estimation procedure effectively captures divergence owing to genetic drift and serial sampling (fig. 2). However, the window-based method is much more powerful than the single-SNP analysis for detecting genome-wide significant outliers. Figure 5 identifies significant outliers for each contrast: 304 windows for the High-Ancestral contrast (panel A), 50 windows for the Low-Ancestral contrast (panel B), and 40 windows for the Low-High contrast (panel A). The median size of windows was 1216 bp for High-Ancestral, 1221 bp for Low-Ancestral, and 837 bp for Low-High. We also performed the model fitting with larger windows (S = 20), but found fewer significant outliers than with S = 10.
F
The observed density function for B* (broken line) is compared with the fitted null density (solid line) with S = 10 for the Low-High contrast.
F
Observed B* is plotted as a function of genomic location for the contrast of (A) Low-High, (B) Low-Ancestral, and (C) High-Ancestral. The broken horizontal line in each panel is the genome-wide significance threshold for each contrast (significant windows red). Scaffolds 15–17 are autosomal, but not currently located to the 14 major chromosomes in the genome build.
The observed density function for B* (broken line) is compared with the fitted null density (solid line) with S = 10 for the Low-High contrast.Observed B* is plotted as a function of genomic location for the contrast of (A) Low-High, (B) Low-Ancestral, and (C) High-Ancestral. The broken horizontal line in each panel is the genome-wide significance threshold for each contrast (significant windows red). Scaffolds 15–17 are autosomal, but not currently located to the 14 major chromosomes in the genome build.There is clear clustering of significant windows when considering their distribution across the genome, particularly noting the concentration of outliers on chromosome 8 (fig. 5). However, at the scale of 10–100 kb, the signal of selection is fairly localized. If one considers the 10 windows bracketing each significant outlier of the High-Ancestral contrast, the mean P value for these windows is 0.14. This is substantially lower than the mean of all windows (P = 0.496) but not near the genome-wide threshold required for an individual window to be significant (P = 0.0004 for High-Ancestral). Notably, all of these calculations were conducted after excluding chromosome 6. On chromosome 6, there is fairly consistent divergence between both derived populations and the ancestral populations across a large fraction of the chromosome (fig. 6).
F
Averaged B is plotted versus genomic location on chromosome 6 (green = Low-High, red = High-Ancestral, red = High-Ancestral). B is calculated in windows with S = 10, but each trajectory is a moving average of 100 consecutive windows. The expected value for B under the null hypothesis is 10.
Averaged B is plotted versus genomic location on chromosome 6 (green = Low-High, red = High-Ancestral, red = High-Ancestral). B is calculated in windows with S = 10, but each trajectory is a moving average of 100 consecutive windows. The expected value for B under the null hypothesis is 10.The distributions for with S = 10 are similar across populations with nearly equivalent mean values (ca. 0.155). The minimal average change in from Ancestral to either Low or High indicates that genetic drift had a generally small effect on overall sequence variation in this experiment. However, there are interesting patterns if we classify windows on significance of (fig. 7). Across the 304 significant windows of the High-Ancestral contrast, mean essentially doubled over the course of nine generations of selection. In these windows, change in was highly variable (actually negative in 41 of the windows), but substantially positive on average owing to the fact that selection made allele frequencies more intermediate in most windows. For the Low-Ancestral and Low-High contrasts, differences in mean are more modest. For Low-Ancestral, mean is reduced by 25% from Ancestral to Low. For Low-High, mean increased in High and declined in Low relative to Ancestral.
F
Mean within windows (S = 10) is reported (bands are a 95% CI) for various classifications of windows. The first bar is for the entire genome of the Ancestral population. The succeeding bars are in three sets, one set for each contrast. Within each set, A, L, and H refer to the population sample (Ancestral, Low, and High).
Mean within windows (S = 10) is reported (bands are a 95% CI) for various classifications of windows. The first bar is for the entire genome of the Ancestral population. The succeeding bars are in three sets, one set for each contrast. Within each set, A, L, and H refer to the population sample (Ancestral, Low, and High).
LD among Closely Linked SNPs
Unfortunately, we cannot report LD estimates among the various SNPs within significant and non-significant windows identified in the previous section. This is because our D estimates are limited to SNPs within read pairs (most estimates are for inter-SNP distances of less than 50 bp). Even at this small genomic scale, the data are very incomplete owing to idiosyncratic coverage of SNP pairs. However, the data are sufficient to identify general trends. As expected, the strength of associations measured as r2 declines with distance between SNPs (supplemental fig. S2, Supplementary Material online). There is also a tendency for rare alleles to be positively associated. Of the 1,222,058 SNP pairs with sufficient coverage in the Ancestral population, about 60% of D estimates are positive. Mean D is about 0.05 consistently across chromosomes. Figure 8 depicts the distribution of D values for all SNP pairs in the Ancestral population where one SNP has a minor allele frequency in the range of 0.1–0.15. Not only are most D values positive, but the average magnitude of positive D estimates is greater than the average magnitude of negative D estimates.
F
The distribution of D values for SNP pairs in the Ancestral population where one SNP has a minor allele frequency of 0.1–0.15.
The distribution of D values for SNP pairs in the Ancestral population where one SNP has a minor allele frequency of 0.1–0.15.The level of LD is also relevant to our null model for window-based divergence (Appendix B). The variance of B depends partly on the pattern and strength of LD among SNPs within a window, and since LD will vary over the genome, the true neutral distribution for B (and B*) is itself a mixture distribution. Exaggerated values for B* could result from drift/sampling even in neutral regions if they exhibit unusually high LD. To evaluate whether this potential difficulty is relevant to the present dataset, we determined average r2 in the Ancestral population for genomic regions that either include or do not include significantly divergent windows. We delineated significant regions as the 10 kb interval containing a window start site, plus the 10 kb on either side. These criteria capture 560 distinct 10 kb intervals across the significant windows of all three contrasts. The mean r2 for these windows is 0.451 (SEM = 0.007). For the remainder of the genome (21,862 10 kb intervals), the mean r2 is 0.474 (SEM = 0.001). This indicates that significant values for B* are not a spurious effect of genetic drift on high LD portions of the genome.
Discussion
The divergence of High and Low populations from this study is trait evolution owing to changes in allele frequency at many loci. Mean corolla width differs between these populations by nearly 3-fold (fig. 1), typical of interspecific differences in flower size within Mimulus. Low and High have also separated substantially in morphology (Kelly 2008), in physiology (Kelly et al. 2008), and perhaps most importantly, in flowering phenology. Low plants flower an average of 6 days earlier than High plants. Days-to-flower is a critical fitness determinant in the field (Mojica and Kelly 2010), and flowering asynchrony between populations is an important reproductive isolating mechanism, both in M. guttatus (Martin and Willis 2007) and many other plants (e.g., Petit et al. 1997; Husband and Schemske 2000; Borchsenius 2002). The quantitative trait divergence of Low and High represents polygenic adaptation (sensu
Pritchard and Di Rienzo 2010) because we found not a single fixed difference between these populations across nearly a million polymorphisms.The three different contrasts (High vs Low, High vs Ancestral, and Low vs Ancestral) identified nearly 400 outlier windows (fig. 5). Some of these are likely redundant; significant tests in closely linked windows reflecting the same selective event. Corroboration for these divergent regions comes from a QTL mapping study based on these same populations (Lee 2009). Lee performed three crosses between distinct High and Low plants sampled from generation 6 of the selection experiment. She selfed each F1 to create three replicate F2 mapping panels (approximately 380 plants per panel) and then measured genotype–phenotype associations. For 22 corolla width QTLs in Lee (2009), we could locate the marker that was nearest the LOD peak (estimated QTL location on the linkage map) to a scaffold of the genome sequence (here we are ignoring chromosome 6 because of the inversion). For 10 of 22 QTLs, there was a significantly divergent window within 100 kb. For 16 of 22 QTLs, a divergent window is within 2 mb.To evaluate this level of congruence, we determined that there is about a 9% chance that a randomly selected location in the genome is within 100 kb of a significantly divergent window. A simple binomial calculation can then evaluate the null hypothesis that QTL and divergent windows are uncorrelated. The probability of getting 10 (or more) out of 22 QTLs within 100 kb is only 0.00000086. Thus, the correspondence is highly significant, although there are quite a few QTL markers that are not within 100 kb of a significantly divergent window. The reasons for this are multiple. There are likely some genuine differences, i.e., mapped flower size loci that did not contribute to response to selection. However, there are also several factors generating false negatives. For one, QTL mapping in F2s yields broad intervals for location. As a consequence, causal polymorphisms may be fairly distant from the QTL LOD peak. Second, the genome-wide significance threshold of the present study is quite stringent, requiring high divergence in allele frequency between populations. If one considers a less stringent threshold—windows with a P value <0.001—we find that 21 of 22 QTL markers are within 1 mb of a divergent window. It is also notable that a large fraction of significantly divergent windows in this study are on chromosome 8 (fig. 5), which yielded more QTL than any other chromosome in Lee (2009) and for which subsequent fine mapping has identified numerous distinct flower size loci (Scoville et al. 2011).
QTL Allele Frequency and the Effect of Selection on Levels of Molecular Variation
We measured sequence variation in terms of , the average of p(1 − p) within a window. is a distillation of the allele frequency spectrum at SNPs and is informative because short-term changes in sequence variation are caused by shifts in the relative abundance of distinct haplotypes. Conditional on the number of polymorphisms within a window (S), is directly proportional to nucleotide diversity (the average number of differences between sequences in a sample, Nei and Li 1979). However, nucleotide diversity is typically calculated in windows that vary in S. It thus integrates evolutionary processes over a much longer time scale over which mutation and the genealogical structure of the population influence the number of polymorphisms in a region.We find that soft, partial sweeps have a complicated and variable effect on levels of molecular variation (fig. 7). This is to be expected given that the predicted change in due to selection depends on the initial frequency of the favored allele, the final frequency of the favored allele, and on the particular nature of LD between the selected site and neighboring SNPs (Appendix A). Despite variability, there are some statistically significant trends. In significant windows from Low versus High, mean increased by about 25% in High and declined comparably in Low relative to the Ancestral population over the nine generations of selection (fig. 7). As consequence, mean differs significantly between Low and High. Importantly however, the magnitude of this difference (ca. 0.06) is not extreme when considered in relation to background variation. Across the entire genome, the standard deviation of differences in between Low and High is 0.04. Because the mean change within significant windows is only 1.5 SD of the background distribution, it is essentially impossible to assert an effect of selection strictly from changes in the amount of sequence variation.The most substantial signal in mean occurred within the High population where sequence variation was substantially increased by selection (see High-Ancestral windows of fig. 7). We hypothesize this is due the fact that “high” alleles (those increasing flower size) are routinely less common than alternative “low” alleles in the ancestral population. If the high allele at a flower size QTL increased from 10% to 50% due to selection, the change in pq at this site would be 0.16 contributing positively to Δ for the window in which it resides. Of course, Δ is mainly determined by changes in allele frequency at the non-causal SNPs within the window. Changes at these sites will generally be smaller in magnitude than for the causal SNP owing to imperfect associations of allele frequencies among neighboring sites; a salient feature of selection on standing variation (Innan and Kim 2004; Hermisson and Pennings 2005). Despite this, the inflation of mean indicates that, for most windows, allele frequencies at non-causal sites became more intermediate with selection. This requires that, on average, the uncommon allele at the causal site is positively associated with the less common allele at neighboring sites (Appendix A). Our pooled sample experimental design does not allow us to calculate LD among the various SNPs defining each window, except for a small fraction of the relevant SNP pairs. However, the admittedly incomplete data from closely linked SNPs indicate the genome-wide pattern of linkage disequilibrium in our ancestral population. As required for positive Δ, linkage disequilibrium is usually positive (fig. 8).The hypothesis that high flower size alleles are less frequent than their alternatives within the ancestral population is consistent with prior studies of this population. After six generations of the selection experiment, we conducted a breeding design to estimate VA, the additive genetic variance of flower size within each population (Kelly 2008). VA was slightly reduced in Low, but substantially elevated in High, relative to the Ancestral population. Response of the mean phenotype to selection was symmetric in early generations (equal magnitude of change in Low and High) but has become asymmetric over generations 5–9 with High more removed from Ancestral than Low (fig. 1). These phenotype-based observations are fully consistent with reduced mean at causal loci in Low and increased in High, but they do not conclusively indicate this model. Asymmetric response to selection can have numerous causes (Falconer and Mackay 1996, chapter 12). For example, epistasis can cause asymmetric changes in VA and in trait means even with perfectly symmetric changes in allele frequency. The present data speak more directly to changes in allele frequencies.Field experiments provide a potential explanation for why high alleles might be less frequent than low alleles at flower size loci. The ancestral population of this study is a large random sampling of genotypes from the Iron Mountain population of M. guttatus in central Oregon (Kelly 2008). We have conducted multiple field studies measuring the fitness consequences of Iron Mountain floral variants (Mojica and Kelly 2010; Mojica et al. 2012). High alleles for flower size tend to reduce viability (the probability that a plant survives to flower) but increase fecundity (seed set of plants that manage to flower). The relative strength of viability and fecundity selection vary both spatially and temporally. However, in most years and microsites, estimated lifetime fitness has been greater for small-flowered genotypes than large-flowered genotypes.
A Polymorphic Inversion Produces a Broad Genomic Signal of Selection
Lee (2009) identified the chromosome 6 inversion from recombination suppression (clustering of many markers on a genetic map) in the QTL mapping study described above. Subsequent genotyping of 93 outbred plants from the natural population showed that the “inversion” is a derived haplotype with a specific combination of alleles across 13 marker loci that span 4.2 mb of the reference genome (fig. 6 suggests that the inversion may be twice that size). The alternative ancestral gene order is composed of many distinct haplotypes as expected given historical recombination among marker loci. The increase of the derived inversion type from 15% in the Ancestral population to 65% in both Low and High produces a broad signal of divergence (fig. 6) because a single haplotype increased in frequency, largely unbroken by recombination.Despite the large change in inversion frequency, levels of sequence variation within this region of chromosome 6 (from the 1 mb to 8 mb position) are only marginally different among populations. Mean of windows in the Ancestral population is about 0.17; about 0.13 in both Low and High. If the inversion harbored many new mutations, these specific alleles would have been uncommon in the Ancestral population and then at intermediate frequency in both Low and High (based on the observed frequency of inversion haplotype). We would then have expected mean to have increased with the inversion. However, the marker genotyping of the natural population noted above clearly indicates that the inversion haplotype is composed of alleles sampled from variation resident in the ancestral population (Lee 2009). Given that the “inversion allele” at a SNP is not predictably the minor allele in the remainder of the Ancestral population, a slight decline in is not surprising as the inverted haplotype rose from 15% to 65%. However, we do predict that should decline precipitously if the inversion haplotype increases toward fixation, at least if it remains internally homogeneous.We do not yet know why selection generated parallel evolution of the inversion within both Low and High populations of this experiment. Previous mapping studies have found weak or inconsistent effects of the inversion on corolla width (Lee 2009; Scoville et al. 2009) which was our target of artificial selection. Moreover, the selection experiment had a contemporaneous control—a population maintained at the same size but without selection (Kelly 2008)—and the inversion is also at ca. 0.65 in this population (Kelly JK, unpublished results). The parallel increase of the inversion in three independent populations strongly implicates some sort of selection. This is surprising given that the methods for propagation in the artificial selection experiment substantially limited opportunities for incidental selection. Each surviving adult was mated to only one plant, and the number of progeny per family was approximately equal across populations and generations. There was opportunity for uncontrolled fitness variation at the seed germination stage. It is possible that the derived inversion type increases seed germination under greenhouse growth conditions and that this is responsible for its increase in frequency within Low, High, and Control populations.
Identifying Selected Loci in Pooled Samples
Artificial selection is increasingly used as a tool for mapping QTL. When this is the purpose, having replicated populations for each direction of selection is clearly advantageous. Parallel divergence, in which the same SNP exhibits extreme allele frequency change across independent populations, is a strong evidence that the SNP affects the trait under selection (Turner et al. 2011; Remolina et al. 2012). Unfortunately, replicate populations that have evolved under exactly the same selection regime may not be available, particularly if one is sampling populations from nature. Moreover, even true replicate populations can respond to the same selection regime via different genetic loci (Cohan 1984). For these reasons, it is useful to have rigorous statistical procedures that identify significantly divergent sites or windows within individual populations.We sequenced population samples in which many plants were combined equally into a single DNA pool. This strategy has been applied to both experimentally evolved populations (e.g., Burke et al. 2010; Turner et al. 2011; Remolina et al. 2012) and to divergent natural populations (e.g., Emerson et al. 2010; Kolaczkowski et al. 2011). Pooling has numerous advantages (Futschik and Schlötterer 2010) but also important limitations (Cutler and Jensen 2010). One caveat regards proper statistical testing for allele frequency differences between populations. The data structure allows a standard contingency table test, e.g., χ2 or Fisher’s exact test, on allele counts from each population. However, the resulting P value is not generally valid. In pooled samples, alleles cannot be treated as independent samples from their respective populations unless the two intermediate sampling events of figure 2 (individuals into pools and library construction events) are inconsequential.The statistical issue of serial sampling inherent to pooling has been treated in a number of ways depending on context (Futschik and Schlötterer 2010; Kolaczkowski et al. 2011; Magwene et al. 2011). The method applied here (eqs. 1–4) is intended not just to test whether populations are significantly different but whether differences are too large to be explained by genetic drift. An initially surprising result of our study is that the contrasts of Low and High populations distinctly to the Ancestral population identified more putatively selected windows than the contrast of Low to High. Relevant here is that our test statistics for both individual SNPs and genomic windows are essentially signal-to-noise ratios. Increased power can be achieved either by increased signal or reduced noise. The Low-High contrast exhibits the greatest phenotypic divegence, but is also encumbered by the greatest amount of noise owing to the accumulation of drift effects within two lineages instead of one.Our analysis differs from prior treatments of comparable genomic data in two major ways. First, we apply the arcsin square root transformation, a.k.a. the angular transformation, to allele frequencies prior to calculating divergence. This is an alternative to FST or other statistics that are functions of untransformed allele frequencies. The magnitude of changes in untransformed frequency due to genetic drift and sampling is contingent on the initial allele frequency (Fisher and Ford 1947), and as consequence, different SNPs have different null distributions. A common variance term applies genome-wide for divergence in transformed allele frequency, and as predicted by theory, the resulting distribution is well approximated by the normal curve (fig. 3).Secondly, we adopt a “non-parametric” method for identifying significant outliers. We treat the overall distribution of a divergence statistic (either for individual SNPs or within windows) as a mixture of neutral and selected loci. We assume the former to be far more abundant than the latter and then calculate robust estimators of dispersion from the overall distribution. These robust estimators are used to calibrate the null distribution under the assumption that they will be minimally affected by outliers (selected loci). Alternatively, we could have simulated the entire process (fig. 2) under the assumptions that sites are neutral. This parametric option would involve specifying the genetic composition of the Ancestral population as well as all parameters governing random changes in allele frequency. As is typical of parametric/non-parametric contrasts, the potential difficulty with the non-parametric option is loss of power. If selection has pervasive effects on genomic divergence, the null variances (eqs. 1) will be overestimated, and the testing procedure may become excessively conservative. The potential difficulty with the parametric procedure is that a simulation may fail to accurately characterize dispersive processes, and depending of the nature of the error, tests may be either conservative or anti-conservative. Of course, if one uses the observed distribution of divergence to calibrate a parametric simulation (say by choosing an effective population size that reiterates the observed variance), then parametric and non-parametric procedures may converge.
Supplementary Material
Supplementary tables S1–S3 and figures S1 and S2 are available at Genome Biology and Evolution online (http://www.gbe.oxfordjournals.org/).
Authors: Molly K Burke; Joseph P Dunham; Parvin Shahrestani; Kevin R Thornton; Michael R Rose; Anthony D Long Journal: Nature Date: 2010-09-15 Impact factor: 49.962
Authors: B J Hayes; A J Chamberlain; S Maceachern; K Savin; H McPartlan; I MacLeod; L Sethuraman; M E Goddard Journal: Anim Genet Date: 2008-12-05 Impact factor: 3.169
Authors: Timothy M Beissinger; Candice N Hirsch; Brieanne Vaillancourt; Shweta Deshpande; Kerrie Barry; C Robin Buell; Shawn M Kaeppler; Daniel Gianola; Natalia de Leon Journal: Genetics Date: 2013-12-30 Impact factor: 4.562
Authors: Jeremy B Yoder; John Stanton-Geddes; Peng Zhou; Roman Briskine; Nevin D Young; Peter Tiffin Journal: Genetics Date: 2014-01-17 Impact factor: 4.562
Authors: Sean Hoban; Joanna L Kelley; Katie E Lotterhos; Michael F Antolin; Gideon Bradburd; David B Lowry; Mary L Poss; Laura K Reed; Andrew Storfer; Michael C Whitlock Journal: Am Nat Date: 2016-08-15 Impact factor: 3.926
Authors: Thomas C Nelson; Patrick J Monnahan; Mariah K McIntosh; Kayli Anderson; Evan MacArthur-Waltz; Findley R Finseth; John K Kelly; Lila Fishman Journal: Mol Ecol Date: 2018-12-10 Impact factor: 6.185