
On reconciling single and recurrent hitchhiking models.

Abstract

A major focus of modern population genetics involves using polymorphism data in order to identify regions impacted by recent positive selection (so-called genomic scans). Recently, methodology has been proposed not to identify individual loci, but rather to quantify genomic recurrent hitchhiking (RHH) parameters using this same type of polymorphism data. I here examine to what extent genomic scans for adaptively important loci may be informed by recently estimated RHH parameters (and vice versa). I find that published results are largely incompatible with one another, with approximately an order of magnitude more sweeps being empirically identified than would be predicted under RHH estimates. Results demonstrate that making this connection between SHH and RHH models is crucial for a more complete and accurate characterization of adaptive evolution.


Keywords: genetic hitchhiking; genomic scans; recurrent selection; selective sweeps

Year: 2009 PMID: 20333201 PMCID: PMC2817426 DOI: 10.1093/gbe/evp031

Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416

Introduction

One of the most popular approaches for identifying loci recently impacted by positive selection is known as “hitchhiking mapping” (e.g., Harr et al. 2002). Broadly speaking, this approach involves scanning across a large number of regions in order to determine the average levels of variability that are characteristic of the genomic environment. Regions that show extreme values and fall in the tail of this observed empirical distribution are then subject to further investigation via resequencing—with the aim being the discernment of locus-specific adaptive effects from neutral genome-wide patterns of variation (e.g., Harr et al. 2002; Glinka et al. 2003; Tenaillon et al. 2004; Carlson et al. 2005; Haddrill et al. 2005; Nielsen 2005; Ometto et al. 2005; Williamson et al. 2005; Wright et al. 2005; Kelley et al. 2006). Problematically, major assumptions about the underlying adaptive substitutions responsible for these patterns are made in such attempts to identify selected loci. Namely, as these scans rely on the impact of beneficial mutations upon closely linked neutral variability (i.e., the genetic hitchhiking effect; Maynard Smith and Haigh 1974), it is implicitly assumed that selection is strong enough to impact large genomic regions. Simultaneously, it is assumed that these selective events occur rarely enough that recently impacted regions will indeed uniquely reside in the tails of genomic distributions, and yet frequently enough to be detectable from patterns of variation. This suggests that the assumptions underlying genomic scans may correspond to a very specific parameter space. 
This disconnect between hitchhiking mapping and the true underlying rates and strengths of beneficial mutations (known as “recurrent hitchhiking” [RHH]) owes to the fact that the former relies upon a model of a single hitchhiking (SHH) event, in which a single adaptive fixation is assumed to have occurred immediately prior to sampling, whereas the latter considers a constant input of beneficial mutations, occurring at a given rate. The first point of comparison between these two models comes from Wiehe and Stephan (1993), who predicted the expected level of reduction in variation at linked neutral sites under an RHH model, demonstrating that for sλ = constant (where s is the selection coefficient, and λ is the rate of adaptive substitutions per site per generation), the mean reduction is identical among models. This result implies that regions of reduced variation may be consistent with models of rarely occurring but strongly advantageous, or commonly occurring but weakly advantageous, mutations. Recently, attempts have been made to estimate RHH parameters (i.e., s and λ) directly from the same multilocus and genomic polymorphism data used in genomic scans (e.g., Kim 2006; Li and Stephan 2006; Andolfatto 2007; Macpherson et al. 2007; Jensen, Thornton, and Andolfatto 2008; and recently reviewed by Sella et al. 2009), in order to distinguish between these scenarios. Thus, rather than attempting to identify individual loci, these estimators attempt to quantify the average genomic strength and rate of adaptive evolution. As these recent estimators are fundamentally informed by the same underlying parameters as the hitchhiking mapping approach implemented in genomic scans, I here ask whether published results from both approaches are consistent with one another.
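The trade-off implied by a constant sλ can be sketched numerically. The snippet below is only an illustration: it assumes the product sλ = 4.1 × 10⁻¹⁴ that is quoted later in the notes to table 1 for a ∼20% reduction, and the function name is hypothetical.

```python
# Illustrative sketch: hold the product s * lambda fixed and see how the
# rate of adaptive substitutions must change with the selection coefficient.
# S_LAMBDA is the value quoted in the table 1 notes for a ~20% reduction.
S_LAMBDA = 4.1e-14

def substitution_rate(s, s_lambda=S_LAMBDA):
    """Adaptive substitutions per site per generation for a given s (hypothetical helper)."""
    return s_lambda / s

# The four selection coefficients considered in table 1.
rates = {s: substitution_rate(s) for s in (1e-1, 1e-2, 1e-3, 1e-4)}

for s, lam in rates.items():
    print(f"s = {s:.0e}  ->  lambda = {lam:.1e}")
```

Each tenfold decrease in s requires a tenfold increase in λ, which is why a given mean reduction cannot by itself separate rare strong sweeps from common weak ones.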

Relating Models of RHH to the Identification of Adaptive Loci

The ability to distinguish between models of weak and strong selection has significant implications for our ability to detect adaptively important regions of the genome. As shown in table 1 for a hypothetical 1-Mb region, the expected number of potentially identifiable sweeps differs strongly between models. For example, a 5% average reduction in variation implies that selection tends to be either weak or infrequent. Thus, strong selection (i.e., s > 0.01) would occur so rarely as to never be detectable, on average, from patterns of polymorphism. And although weaker selection occurs with an appreciable frequency, such that it may be detectable when scanning large genomic regions, there are still few sweeps, each resulting in a relatively small genomic impact. As such, any given marker would have an approximately 0.2% chance of falling within a swept region, necessitating an extremely dense screen in order to identify adaptively important loci.

Table 1

Details of Recurrent Sweeps under Four Selection Coefficients and Four Levels of Reduction for a 1-Mb Region

				5% reduction (f)			20% reduction (g)			60% reduction (h)			90% reduction (i)
s	2Ns	Size of sweep (bp) (a)	E(transit time), in 4N generations (b)	E(time between sweeps), in 4N generations, E(t) (c)	E(no. of sweeps), No. (d)	Fraction of markers swept, Frac. (e)	E(t) (c)	No. (d)	Frac. (e)	E(t) (c)	No. (d)	Frac. (e)	E(t) (c)	No. (d)	Frac. (e)
1 × 10⁻¹	2 × 10⁵	20,000	3.6 × 10⁻⁵	1.1	∼0	∼0	2.7 × 10⁻¹	∼0	∼0	1.0 × 10⁻¹	∼1	0.02	8.3 × 10⁻³	∼12	0.24
1 × 10⁻²	2 × 10⁴	2,000	3.6 × 10⁻⁴	1.1 × 10⁻¹	∼1	0.002	2.7 × 10⁻²	∼4	0.007	1.0 × 10⁻²	∼10	0.02	8.3 × 10⁻⁴	∼120	0.24
1 × 10⁻³	2 × 10³	200	3.6 × 10⁻³	1.1 × 10⁻²	∼8	0.002	2.7 × 10⁻³	∼36	0.007	1.0 × 10⁻³	∼100	0.02	8.3 × 10⁻⁵	∼1,200	0.24
1 × 10⁻⁴	2 × 10²	20	3.6 × 10⁻²	1.1 × 10⁻³	∼80	0.002	2.7 × 10⁻⁴	∼360	0.007	1.0 × 10⁻⁴	∼1,000	0.02	8.3 × 10⁻⁶	∼12,000	0.24

a. The size of the region impacted by a given sweep, calculated as 0.01s/r base pairs (Kaplan et al. 1989), with r = 5 × 10⁻⁸ per site per generation (Charlesworth 1996; Andolfatto and Przeworski 2001).

b. The expected transit time of a beneficial mutation, calculated as −(log ξ)/(2γ), in units of 4N generations, where ξ = 1/(2N), γ = 2Ns, and N = 10⁶.

c. The expected time between beneficial fixations occurring within the region, calculated as 1/(MΛ), in units of 4N generations, where Λ is the expected number of sweeps per recombination unit in the last 4N generations and M is the size of the region (=1 Mb).

d. The expected number of sweeps within the 1-Mb region that are recent enough to be detectable using polymorphism-based statistics, calculated as the average number of sweeps occurring within the last 0.1 4N generations (Przeworski 2002); importantly, only a fraction of this number may be identifiable, as power has been shown to rarely exceed 50% for commonly used summary statistics (Przeworski 2002; Jensen, Thornton, and Aquadro 2008).

e. The fraction of randomly placed markers across the 1 Mb under consideration that would fall within swept regions, calculated by determining the proportion of the total region impacted by a recent sweep (e.g., if eight sweeps, each affecting ∼200 bp, are expected across the 1-Mb region, then the probability for an individual marker to fall in a swept region is calculated as 1,600/1,000,000).

f. Estimated values for a 5% total reduction in variation due to RHH, calculated from the expression of Wiehe and Stephan (1993), in which θ is the scaled population mutation rate (=0.01), r is the unscaled recombination rate in Morgans per base pair per generation (=5 × 10⁻⁸ per site per generation), κ is a constant (=0.075), γ = 2Ns (where s is the selection coefficient), N is the effective population size (=10⁶), and λ is the rate of adaptive substitutions per site per generation. sλ = 9.0 × 10⁻¹⁵.

g. Estimated values for a 20% total reduction in variation (sλ = 4.1 × 10⁻¹⁴).

h. Estimated values for a 60% total reduction in variation (sλ = 2.5 × 10⁻¹³).

i. Estimated values for a 90% total reduction in variation (sλ = 3.0 × 10⁻¹²).
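The per-sweep quantities defined in the notes to table 1 (sweep size, transit time, and fraction of markers swept) can be reproduced with a few lines of arithmetic. The following is a minimal sketch using the parameter values stated in those notes; the function names are illustrative, and the logarithm is assumed to be the natural logarithm.

```python
import math

R = 5e-8            # recombination rate per site per generation (table 1 notes)
N = 1e6             # effective population size (table 1 notes)
REGION = 1_000_000  # length of the hypothetical region, in bp

def sweep_size_bp(s, r=R):
    """Region impacted by a single sweep, 0.01 * s / r, in base pairs."""
    return 0.01 * s / r

def transit_time(s, n=N):
    """Expected transit time, -log(xi) / (2 * gamma), in units of 4N generations.

    Assumes a natural logarithm, with xi = 1/(2N) and gamma = 2Ns.
    """
    xi = 1.0 / (2.0 * n)
    gamma = 2.0 * n * s
    return -math.log(xi) / (2.0 * gamma)

def fraction_swept(n_sweeps, s, region=REGION):
    """Proportion of the region covered by n_sweeps recent sweeps of strength s."""
    return n_sweeps * sweep_size_bp(s) / region

for s in (1e-1, 1e-2, 1e-3, 1e-4):
    print(f"s = {s:.0e}: size = {sweep_size_bp(s):,.0f} bp, "
          f"transit = {transit_time(s):.1e} (4N generations)")

# Worked example from the notes: eight sweeps of ~200 bp each across 1 Mb.
print(f"fraction swept = {fraction_swept(8, 1e-3):.4f}")
```

With these inputs the sweep sizes and transit times agree with the corresponding columns of table 1 to the precision shown there, and the worked example gives 1,600/1,000,000.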

In the other extreme, models positing a 90% reduction in variation are expected to have experienced a large number of recent sweeps at any given time of sampling. As such, ∼24% of markers may be linked to recent fixations. Although genomic scan studies rely on the premise that selected loci will appear as outliers when compared against the great majority of other (presumed neutral) loci, this result suggests that hitchhiked loci would effectively be compared with one another, upsetting the fundamental assumption of the approach, implying that in this RHH parameter space the vast majority of selected loci may be overlooked (Kelley et al. 2006; Sabeti et al. 2006; Teshima et al. 2006; Thornton and Jensen 2007). Under such a scenario, the meaning of outlier loci becomes unclear, as selected loci would comprise a large proportion of the empirical distribution. Although such strong reductions seem extreme, this scenario may be relevant in many recently domesticated species, which have experienced recent bouts of strong artificial selection (e.g., Wright and Gaut 2005; Wright et al. 2005).
Thus, whether selection is common or rare, the standard assumption that the loci in the 5% tail of an empirical distribution represent swept regions corresponds to an extremely specific assumption regarding the reduction in variation owing to hitchhiking, and thus also about the true underlying and unknown value of the joint parameter sλ. For the parameters examined in table 1 for instance, the reduction in variation owing to hitchhiking must be ∼70%, in order for standard genomic scan assumptions to be met.

Comparing Published RHH and SHH Results

In light of these calculations, I consider a number of recently published genomic scan studies (Harr et al. 2002; Glinka et al. 2003; Bauer DuMont and Aquadro 2005; Jensen et al. 2007). Although there is an extremely large literature utilizing empirical genomic scans across organisms (recently reviewed by Thornton et al. 2007 and Akey 2009), these particular data sets have been chosen in order to minimize, as much as possible, differences in estimates owing to species- or population-based differences. As such, all the considered studies have focused on X-linked regions in derived populations of Drosophila melanogaster. Also common among all studies are the site frequency outlier–based methods of detection used to identify swept regions. For comparison, these genomic scans are considered against recent estimates of RHH parameters (Li and Stephan 2006; Andolfatto 2007; Macpherson et al. 2007; Jensen, Thornton, and Andolfatto 2008). These published estimators have a number of important differences from one another, in both statistical framework (likelihood or Bayesian) and the type of data utilized (polymorphism or divergence). Despite these differences, and the fact that these studies estimate drastically different RHH parameter values (with estimated mean selection coefficients ranging from 0.01 to 0.00001), the mean reduction in variation is similarly estimated to be ∼20% by both Macpherson et al. (2007) and Andolfatto (2007). Li and Stephan (2006) and Jensen, Thornton, and Andolfatto (2008) estimate an ∼50% reduction. As these numbers represent either maximum likelihood or maximum a posteriori estimates, they are associated with measures of uncertainty. Considering the 95% confidence intervals across all studies, the minimum and maximum published estimates of reductions in variation owing to RHH are found to range from 14% to 54%, respectively. 
As in table 1, it is possible to calculate the expected number of sweeps occurring within these empirically scanned regions for given values of s and λ (table 2). For example, for the RHH values estimated by Andolfatto (2007), one may expect ∼3,060 sweeps of s = 0.00001 to have occurred within the last 0.1 4N generations across the 850-kb region examined by Harr et al. (2002). Despite estimating the same 20% reduction in genomic variation owing to RHH as Andolfatto (2007), Macpherson et al. (2007) estimate a much stronger s (=0.01), suggesting approximately three detectable sweeps on average across a region of this size. Given their relative strengths, both RHH estimators suggest that approximately 0.7% of markers should be impacted by a recent sweep. Using an SHH-based approach, Harr et al. (2002) identify 7% of their markers as being swept, and the combined scans of Bauer DuMont and Aquadro (2005) and Jensen et al. (2007), as well as Glinka et al. (2003), identify ∼12% of their markers as swept. Thus, the number of putatively swept markers identified empirically using SHH models far exceeds published RHH estimates, with roughly an order of magnitude more sweeps being detected than would be predicted (table 2).
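The size of the gap between the two classes of result can be made explicit by dividing each empirically reported fraction of swept markers by the fraction expected under the RHH estimates. This is a minimal sketch using only the values reported in table 2; the dictionary keys and function name are illustrative.

```python
# Fractions of markers reported as swept in the genomic scans (table 2),
# keyed by the size of the scanned region.
observed = {"256 kb": 0.12, "850 kb": 0.07, "17 Mb": 0.12}

# Fractions expected under the RHH estimates (table 2).
expected = {"~20% reduction": 0.007, "~50% reduction": 0.017}

def fold_excess(obs, exp):
    """How many times larger the observed fraction is than the expected one."""
    return obs / exp

excess = {
    (region, model): fold_excess(obs, exp)
    for region, obs in observed.items()
    for model, exp in expected.items()
}

for (region, model), ratio in excess.items():
    print(f"{region:>6} scan vs {model}: {ratio:.1f}-fold excess")
```

Under the ∼20% reduction estimates the excess is 10-fold or more for every scan, and it remains several-fold even under the ∼50% reduction estimates.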

Table 2

Empirical Genomic Scan Results Compared with Expectations under Estimated RHH Models for Drosophila

Region size (a)	No. of markers (b)	Fraction swept (c)	E(fraction) | ∼20% reduction (d)	E(fraction) | ∼50% reduction (e)	E(no. of sweeps | s = 1 × 10⁻², 2Nλ = 1 × 10⁻⁵) (f, g)	E(no. of sweeps | s = 2 × 10⁻³, 2Nλ = 2 × 10⁻⁴) (g, h, i)	E(no. of sweeps | s = 1 × 10⁻⁵, 2Nλ = 3 × 10⁻³) (g, j)
256 kb (k)	26	0.12	0.007	0.017	∼1	∼68	∼900
850 kb (l)	28	0.07	0.007	0.017	∼3	∼225	∼3,060
17 Mb (m)	105	0.12	0.007	0.017	∼61	∼4,620	∼61,200

a. Total length of the region spanned by the scan.

b. The number of scanned markers used in the study.

c. The fraction of scanned markers proposed by the authors to be linked to selective sweeps.

d. The expected fraction of markers that would fall in swept regions, for a ∼20% estimated reduction in variability (Andolfatto 2007; Macpherson et al. 2007).

e. The expected fraction of markers that would fall in swept regions, for a ∼50% estimated reduction in variability (Jensen, Thornton, and Andolfatto 2008; Li and Stephan 2006).

f. The expected number of sweeps that would fall in the sequenced regions within the last 0.1 4N generations, for parameters estimated by Macpherson et al. (2007) for D. simulans.

g. Only a fraction of this expected number may be identifiable, owing to the imperfect power of existing test statistics; see figure 2 (Przeworski 2002; Jensen, Thornton, and Aquadro 2008).

h. The expected number of sweeps that would fall in the sequenced regions within the last 0.1 4N generations, for parameters estimated by Jensen, Thornton, and Andolfatto (2008) for D. melanogaster.

i. The expected number of sweeps that would fall in the sequenced regions within the last 0.1 4N generations, for parameters estimated by Li and Stephan (2006) for D. melanogaster.

j. The expected number of sweeps that would fall in the sequenced regions within the last 0.1 4N generations, for parameters estimated by Andolfatto (2007) for D. melanogaster.

k. From Bauer DuMont and Aquadro (2005); Jensen et al. (2007) for an X-linked region of D. melanogaster.

l. From Harr et al. (2002) for an X-linked region of D. melanogaster.

m. From Glinka et al. (2003) for an X-linked region of D. melanogaster.


Figure 2. A simulated comparison of the impact of demography on the identification of selected loci in genomic scans. The demographic model is the out-of-Africa bottleneck estimated for D. melanogaster (Thornton and Andolfatto 2006). For each point, 1,000 data sets of 100 unlinked loci (each locus being 1 kb in size) were simulated in which some fraction of the loci have experienced a recent selective sweep (value given on the x axis). For example, a value of 0.05 corresponds to a model in which 5 of the 100 loci in each simulated data set have experienced a recent selective fixation. The selection coefficient is fixed at s = 0.01, and the age of the sweep is drawn from a uniform (0, 0.1) in units of 4N generations for each selected locus. The statistic utilized is the composite likelihood ratio test of Kim and Stephan (2002). The dotted line indicates the scenario in which selected loci are perfectly identifiable. The gray line gives the performance of the statistic under common usage, in which the null model is equilibrium neutrality. As shown, there is a tremendous false-positive rate associated with this implementation of hitchhiking mapping. The black line gives the performance when the null is the true underlying demographic model. Although this greatly reduces the false-positive rate, only roughly half of selected loci are identified, owing to the imperfect power of the test statistic.

Viewing these results graphically, figure 1 plots the reduction in genomic variation against the corresponding fraction of recently swept genomic regions, for both RHH- and SHH-based estimates. For genomic scan studies (grouped as “SHH model”), the expected reduction in variation is back-calculated based upon the empirically observed fraction of loci swept (i.e., what level of reduction is necessary in order for the identified number of loci to have experienced a sweep within the last 0.1 4N generations). Conversely, for the RHH estimators (grouped as “RHH model”), the expected fraction of loci swept is calculated from the estimated reduction in variation (i.e., for the estimated rate, how many sweeps will have occurred within the last 0.1 4N generations). The details of both calculations are given in table 2. As shown, RHH estimates as a whole suggest a less substantial reduction in variation, and thus a smaller fraction of swept loci.
Interestingly, estimates strongly group by model, despite large differences among the estimators with regard to the type of data used, summary statistics utilized, and statistical framework, suggesting possible systematic biases in estimation under one, or possibly both, SHH model– and RHH model–based approaches.

Figure 1. A comparison of RHH- and SHH-based results. As shown, RHH- and SHH-based analyses suggest dramatically different patterns, with the latter detecting a far greater number of swept loci than would be predicted under RHH estimation, thereby suggesting a greater reduction in genomic variation due to selection. The vertical dotted line indicates the point at which the common genomic scan assumptions would be met (i.e., the 5% tail of markers are swept). Assuming that recently selected loci will indeed enrich the tails of genomic distributions, this demonstrates that under RHH-based estimation the 5% tail would primarily contain false positives. Conversely, if SHH-based estimates are correct, the majority of positively selected loci would be missed using this cut-off. Points are taken from the four RHH- and three SHH-based studies presented in table 2.

Evaluating Possible Explanations for the Observed SHH–RHH Discrepancy

One possible explanation for the discrepancy in SHH model– and RHH model–based analyses is that the true reduction in variation due to hitchhiking in D. melanogaster may be much more severe—a genomic reduction in variation of ∼79% is necessary in order to accommodate the number of empirically identified sweep regions, compared with the maximum published RHH estimate of ∼50%—and thus that existing RHH estimators are greatly underestimating the rate of adaptive evolution. Alternatively, the majority of the loci identified in genomic scans may be false positives. Recent studies have suggested that both demographic perturbations (e.g., Nielsen 2001; Przeworski 2002; Jensen et al. 2005; Nielsen et al. 2007) and ascertainment biases (Teshima et al. 2006; Thornton and Jensen 2007) likely contribute to a high rate of false inferences of selection in genomic scans. Along with this, it is additionally important to note that the expected number of sweeps in these calculations is not tantamount to the expected number of “identifiable” sweeps, as test statistics do not have perfect power. For example, examining the performance of three of the most common summary statistics (D [Tajima 1989], H [Fay and Wu 2000], and the composite likelihood ratio test [Kim and Stephan 2002]) across a wide range of RHH parameters, Jensen, Thornton, and Aquadro (2008) found power to be less than 20% for RHH models of weak selection, and rarely in excess of 50% even under models of strong selection. As shown in figure 2, these factors may actually predict a pattern that is opposite to that which is observed—even if demography is properly modeled, fewer sweeps should be identified than have occurred, owing to this imperfect power. Thus, empirical observations appear more consistent with the scenario in which there is a large false-positive rate associated with genomic scans for selection, consistent with previous results (Teshima et al. 2006; Thornton and Jensen 2007). 
Other possibilities exist as well. The impact of violations of a constant-rate assumption on both SHH- and RHH-based approaches (particularly systematic increases or decreases in the rate of adaptation), as well as of the assumption that selection acts largely on new mutations (as opposed to segregating variation), remain areas in need of further investigation. Additionally, under RHH models in which variation is strongly reduced, the approximations of Kaplan et al. (1989) and Stephan et al. (1992) are violated, owing to overlapping sweep patterns (Przeworski 2002). The impact of such a model on both SHH- and RHH-based estimation remains to be seen.
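The power argument made in this section can be illustrated with simple arithmetic. The sketch below assumes, purely for illustration, that the fraction of markers identified as swept is the expected swept fraction multiplied by the power of the test, with no false positives; the power values of 20% and 50% are the bounds quoted above from Jensen, Thornton, and Aquadro (2008), and the function name is hypothetical.

```python
# Expected fractions of markers in swept regions under the RHH estimates (table 2).
expected_fraction = {"~20% reduction": 0.007, "~50% reduction": 0.017}

# Power bounds quoted in the text for weak and strong selection.
power_levels = (0.2, 0.5)

# Smallest fraction of markers reported as swept in the scans of table 2.
smallest_observed = 0.07

def identifiable_fraction(fraction_swept, power):
    """Fraction of markers expected to be flagged, assuming no false positives."""
    return fraction_swept * power

identified = {
    (model, power): identifiable_fraction(frac, power)
    for model, frac in expected_fraction.items()
    for power in power_levels
}

for (model, power), frac in identified.items():
    ratio = smallest_observed / frac
    print(f"{model}, power {power:.0%}: identifiable fraction = {frac:.4f} "
          f"({ratio:.0f}-fold below the smallest observed fraction)")
```

Under these assumptions even the most favorable combination leaves the identifiable fraction well under 1% of markers, so imperfect power widens rather than closes the gap with the 7% to 12% reported by the scans.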

Conclusions

Comparison of a number of published studies in D. melanogaster suggests a lack of correspondence between SHH model– and RHH model–based analyses. Specifically, genomic scan results imply a much higher rate of adaptation, and thus a far greater level of reduction in genomic variation (∼79% reduction, whereas the mean RHH estimate is ∼35%). Given the significant differences among RHH estimators particularly, this result may suggest systematic biases associated with the methodologies themselves. Although simulation results are suggestive of possible biases that may be inflating the number of loci identified in genomic scans, better disentangling these discrepancies has major implications. As RHH parameter estimates continue to come into focus for natural populations of interest, it may become evident that searching for specific adaptive loci is a difficult endeavor, owing to long expected waiting times between adaptive fixations. Alternatively, as putatively swept loci identified in genomic scans become functionally verified, it may appear more likely that existing RHH estimators are underestimating the true rate. Regardless of the species or population under consideration, these results highlight the need for future genomic studies to simultaneously consider and reconcile both classes of analyses in order to gain the most comprehensive and accurate understanding of the recent adaptive history of natural populations and suggest that SHH model– and RHH model–based approaches may indeed inform one another.

References

1. Demography and natural selection have shaped genetic variation in Drosophila melanogaster: a multi-locus approach.

Authors: Sascha Glinka; Lino Ometto; Sylvain Mousset; Wolfgang Stephan; David De Lorenzo
Journal: Genetics Date: 2003-11

2. Genomic regions exhibiting positive selection identified from dense genotype data.

Authors: Christopher S Carlson; Daryl J Thomas; Michael A Eberle; Johanna E Swanson; Robert J Livingston; Mark J Rieder; Deborah A Nickerson
Journal: Genome Res Date: 2005-11

3. Inferring the effects of demography and selection on Drosophila melanogaster populations from a chromosome-wide scan of DNA variation.

Authors: Lino Ometto; Sascha Glinka; David De Lorenzo; Wolfgang Stephan
Journal: Mol Biol Evol Date: 2005-06-29

4. The hitch-hiking effect of a favourable gene.

Authors: J M Smith; J Haigh
Journal: Genet Res Date: 1974-02

5. Positive natural selection in the human lineage.

Authors: P C Sabeti; S F Schaffner; B Fry; J Lohmueller; P Varilly; O Shamovsky; A Palma; T S Mikkelsen; D Altshuler; E S Lander
Journal: Science Date: 2006-06-16

6. Multilocus patterns of nucleotide variability and the demographic and selection history of Drosophila melanogaster populations.

Authors: Penelope R Haddrill; Kevin R Thornton; Brian Charlesworth; Peter Andolfatto
Journal: Genome Res Date: 2005-06

7. The effects of artificial selection on the maize genome.

Authors: Stephen I Wright; Irie Vroh Bi; Steve G Schroeder; Masanori Yamasaki; John F Doebley; Michael D McMullen; Brandon S Gaut
Journal: Science Date: 2005-05-27

8. Background selection and patterns of genetic diversity in Drosophila melanogaster.

Authors: B Charlesworth
Journal: Genet Res Date: 1996-10

9. Hitchhiking mapping: a population-based fine-mapping strategy for adaptive mutations in Drosophila melanogaster.

Authors: Bettina Harr; Max Kauer; Christian Schlötterer
Journal: Proc Natl Acad Sci U S A Date: 2002-09-26

10. Constructing genomic maps of positive selection in humans: where do we go from here?

Authors: Joshua M Akey
Journal: Genome Res Date: 2009-05
