Literature DB >> 33755671

Double drives and private alleles for localised population genetic control.

Abstract

Synthetic gene drive constructs could, in principle, provide the basis for highly efficient interventions to control disease vectors and other pest species. This efficiency derives in part from leveraging natural processes of dispersal and gene flow to spread the construct and its impacts from one population to another. However, sometimes (for example, with invasive species) only specific populations are in need of control, and impacts on non-target populations would be undesirable. Many gene drive designs use nucleases that recognise and cleave specific genomic sequences, and one way to restrict their spread would be to exploit sequence differences between target and non-target populations. In this paper we propose and model a series of low threshold double drive designs for population suppression, each consisting of two constructs, one imposing a reproductive load on the population and the other inserted into a differentiated locus and controlling the drive of the first. Simple deterministic, discrete-generation computer simulations are used to assess the alternative designs. We find that the simplest double drive designs are significantly more robust to pre-existing cleavage resistance at the differentiated locus than single drive designs, and that more complex designs incorporating sex ratio distortion can be more efficient still, even allowing for successful control when the differentiated locus is neutral and there is up to 50% pre-existing resistance in the target population. Similar designs can also be used for population replacement, with similar benefits. A population genomic analysis of CRISPR PAM sites in island and mainland populations of the malaria mosquito Anopheles gambiae indicates that the differentiation needed for our methods to work can exist in nature. Double drives should be considered when efficient but localised population genetic control is needed and there is some genetic differentiation between target and non-target populations.

Entities: Chemical Disease Species

Year: 2021 PMID： 33755671 PMCID： PMC8018619 DOI： 10.1371/journal.pgen.1009333

Source DB: PubMed Journal: PLoS Genet ISSN： 1553-7390 Impact factor: 5.917

Introduction

Gene drive is a natural phenomenon in which some genes are able to increase in frequency and spread through populations by contriving to be inherited at a greater-than-Mendelian rate [1,2]. Strong drive can cause genes to increase rapidly in frequency even if they also harm the organisms carrying them, and there is currently much effort trying to develop synthetic gene drive constructs (or gene drives) to control disease-transmitting mosquitoes and other pest populations that have thus far been difficult or impossible to manage satisfactorily [3-6]. If a species is harmful and subject to control measures wherever it exists, then, in principle (i.e., in the computer), highly efficient gene drive strategies can be devised that exploit natural processes of dispersal and gene flow such that relatively small inoculative releases in a few locations can lead to substantial and widespread impacts over subsequent generations [7-9]. However, some species are pests only in a part of their range (e.g., invasive species), and other approaches are needed. Two broad approaches have been proposed for restricting the impact of genetic control interventions to a target population. First, one can use a strategy requiring relatively large releases, which can be restricted to the target population, with any introductions into non-target populations (by dispersal, or by accidental or unauthorised releases) being too small to have a significant impact. Potentially suitable genetic constructs include those that do not drive (e.g., dominant lethals, autosomal X-shredders, or Y-linked editors; [10-12]), those that show transient drive due to a non-driving helper construct (e.g., killer-rescue systems and split drives; [13-15]) or those that drive, but only if they are above some threshold frequency (e.g., various underdominant [heterozygote inferiority] strategies, tethered drives, and split drive killer-rescue systems [16-19]). Some of these approaches are more efficient than others [12,20,21], but, by necessity, all of them require a non-trivial production and release effort. Alternatively, if there are pre-existing sequence differences between target and non-target populations, it may be possible to exploit these differences with a sequence-specific nuclease-based gene drive that would only spread in the target population, in which case the small release rates and overall efficiency of low threshold gene drive approaches may be retained [22,23]. Sudweeks et al. [23] present useful modelling of this approach, considering the case where there is a locally fixed allele of an essential gene in the target population, while non-target populations carry a functional cleavage-resistant allele at some frequency. A single-locus gene drive that uses the homing reaction (i.e., sequence-specific cleavage followed by homologous repair [4,24]) to disrupt the locally fixed allele could be released into and eliminate the target population, but have little impact, or only a transient impact, on non-target populations. However, as emphasised by the authors, if the target population has even a small frequency of the resistant allele, then that allele could be rapidly selected for and the intervention fail. Single locus homing drives targeting an essential gene in order to suppress a population necessarily generate strong selection pressure for resistant sequences if these can arise [24]; one approach to this problem is to target sites where functional resistance is unlikely to arise [25], but this is difficult to engineer if the target site is chosen such that a resistant allele exists at high frequency in non-target populations of the same species. In this paper we explore alternative two-locus “double drive” low release rate strategies to restrict population control based on pre-existing sequence differences between target and non-target populations. All our designs are based on a division of labour between the two constructs, with one imposing a reproductive load by disrupting a gene needed for survival or reproduction, and therefore responsible for the desired impact (population suppression), and the other responsible for the population restriction. The first construct can then be designed to target a well-conserved essential site where functional resistance is unlikely to arise, and selection for resistant alleles at the differentiated locus will be relatively weaker because it is not directly responsible for the reproductive load. As a result, these designs are substantially less susceptible to pre-existing resistance in the target population at the differentiated locus than single drive designs, and can even work if the differentiated locus is selectively neutral. Double drives may also be useful for population replacement. Finally, analyses of published genome sequences from island and mainland populations of the malaria mosquito Anopheles gambiae indicates that the sort of population differentiation we model can exist in nature.

Results

Simple double drives for population suppression

The simplest double drive designs we consider consist of one construct (call it α) inserted into and disrupting a haplo-sufficient female-essential gene, such that homozygous females die without reproducing while heterozygous females and all males are unaffected, and a second construct (β) inserted into a sequence that is significantly more common in the target than the non-target population(s). Both constructs are able to drive by the homing reaction but α can drive only in the presence of β, while β may either drive autonomously or rely on the presence of α. With CRISPR-based designs, α would encode its cognate gRNA, β would encode the Cas9, and either construct could encode the gRNA for the second locus (Fig 1, Designs 1 and 2). We assume the α construct has been designed such that functional resistance is not possible (e.g. by targeting sequences that are essential at the nucleotide level or by using multiple gRNAs), though non-functional resistant alleles can arise by end-joining repair [25-27]. For the β construct we initially suppose its insertion site (i.e., the differentiated locus) is selectively neutral and unlinked with the α insertion site, and that differentiation is nearly complete, with the recognition sequence present at a frequency of 99% in the target population and absent in the non-target population (i.e., it is a virtually fixed private allele).

Fig 1

Alternative double drive designs for population suppression.

Alternative double drive designs for population suppression.

Constructs α and β can drive autonomously (a) or non-autonomously (n); one or the other may encode an X-shredder; and the A target locus can be a gene that is haplo-sufficient (HS) or haplo-insufficient (HI) for female viability or fertility. fsHS—female-specific haplo-sufficient locus; fsHI—female-specific haplo-insufficient locus; diff—differentiated sequence; X-shr—X-shredder targeting an X-linked repeat. Under these conditions, a small (0.1%) release of males carrying Design 1 constructs into the target population leads to both constructs rapidly increasing in frequency and, as a result, an increasing fraction of female zygotes are homozygous for α and die without reproducing. The population size crashes to a minimum size of 3.58e-6 (relative to the pre-release equilibrium) after 25 generations (Fig 2A). Depending on the initial population size and the biology of the species (e.g., whether there are Allee effects [28] such that the population cannot persist at small sizes), this decline could be enough to eliminate the population. However, in our simple deterministic model population elimination is not possible. Instead, the population recovers due to selection of cleavage-resistant alleles at the differentiated locus (which either pre-existed or arose due to end-joining repair), leading to loss of the β construct, followed by loss of α, allowing the wild-type allele and population fertility to be restored. By contrast, the same releases into the non-target population have minimal effect: β cannot increase in frequency (because its target site is absent), and therefore α remains rare, and population size is little affected (Fig 2B).

Fig 2

Performance of double drives for population suppression.

Performance of double drives for population suppression.

(A, B) Timecourse for Design 1 in target and non-target populations, assuming 1% and 100% pre-existing resistance at the differentiated locus, respectively. In the target population the α and β constructs increase in frequency together (blue and red solid lines), causing the number of females to decline. If the population is not eliminated, then eventually the resistant b allele replaces β, followed by the wild-type A allele replacing α, allowing the population to recover. In the non-target population both constructs remain rare and the reduction in female numbers remains small. (C, D) Timecourse for Design 3 assuming 20% and 100% pre-existing resistance in the target and non-target populations. (E, F) Timecourse for Design 5 assuming 50% and 100% pre-existing resistance in the target and non-target populations. Because the spread of construct α in the target population depends on β, and therefore will be affected by the association between them, it might be expected that close linkage between the two constructs may increase construct spread and the extent of population suppression. Close linkage has been observed to affect the dynamics of other two-locus drive systems [15]. Furthermore, because population recovery (if it occurs) will be due to the evolution of resistance at the differentiated locus, additional improvements might be expected by using an essential gene as the differentiated locus while designing β to have minimal fitness effects (e.g., by containing a recoded, cleavage-resistant version of the essential gene [14,26,29], or by being inserted in an artificial intron [30]). End-joining repair will then tend to produce non-functional resistance alleles, increasing the load, and functional resistance at the differentiated locus will be slower to evolve, relying instead on pre-existing resistant alleles. Moreover, this effect may be stronger if the essential gene is haplo-insufficient than if it is haplo-sufficient, as found with some other gene drive designs [31]. Both these expectations about linkage and using an essential differentiated gene are met individually, and, in combination, can reduce the minimum population size achieved by many orders of magnitude (Fig 3; see also S1 Fig for the separate effect of each modification). If it is not possible to have close linkage, then the maximum level of suppression can also be increased by releasing the two constructs in different males rather than in the same males, which allows β to escape some of the fitness costs of α and get to a higher frequency than it otherwise would, though at the cost of the impact being delayed, and separate releases perform worse than combined releases when linkage is tight (S2 Fig).

Fig 3

Timecourse for the relative number of females over time for Designs 1 and 2.

Timecourse for the relative number of females over time for Designs 1 and 2.

Solid lines are for β in a neutral locus unlinked to the α construct, and dashed lines for β as a neutral insertion in a haplo-insufficient essential gene closely linked (r = 0.01) to the α construct. In all cases there is 1% pre-existing resistance at the differentiated locus. Also shown for comparison are results for a single construct drive targeting a haplo-sufficient female-specific viability gene, assuming 1% pre-existing functional resistance and that end-joining repair produces only nonfunctional resistant alleles (S). Design 2, which has the same components as Design 1, but arranged differently such that homing of the β construct only occurs in the presence of α, has dynamics qualitatively similar to Design 1, but quantitatively different (S3 Fig). Interestingly, if the two constructs are unlinked then the extent of suppression is less than with Design 1, but if they are closely linked then the suppression can be greater (Figs 3 and S1). For comparison we also model a single drive homing into a differentiated female-essential gene with 1% pre-existing resistance. The maximum extent of suppression is much less than with any of the double drives considered, because selection for resistance is much stronger, being directly at the fitness-determining locus (Fig 3).

Coping with higher frequencies of pre-existing resistance

Though these simple double drive designs work well with 1% pre-existing target site resistance at the differentiated locus, performance declines rapidly after that. For example, if there is 10% pre-existing resistance, then even the best of these designs (Design 2 with close linkage and the differentiated locus being haplo-insufficient) only suppresses the target population to a minimum of 2.38e-4 (Fig 4). In some situations the target population may not have a private allele with frequency over 90% and alternative approaches would need to be considered. One possibility is to increase the load imposed on the population by the α construct by adding to it an X-shredder locus that destroys the X-chromosome during spermatogenesis such that α now produces a male-biased sex ratio as well as killing homozygous females (Fig 1, Designs 3 and 4). Since population productivity in many species depends on the number of females, population size may thereby be further reduced. A single drive based on these components has previously been constructed in Anopheles gambiae by Simoni et al. [32]. Our modelling indicates that adding an X-shredder to a double drive gives a quantitative improvement in the dynamics, and even pre-existing resistance frequencies of 20% are compatible with good control, while still having minimal effect on non-target populations (Fig 2C and 2D).

Fig 4

Minimum population size for each of the 5 designs as a function of the pre-existing frequency of resistance at the differentiated locus.

Solid lines are for the baseline case (r = 0.5, β in a neutral locus), while dashed lines are for the improved case (r = 0.01, β as a neutral insertion in a haplo-insufficient essential gene).

Minimum population size for each of the 5 designs as a function of the pre-existing frequency of resistance at the differentiated locus.

Solid lines are for the baseline case (r = 0.5, β in a neutral locus), while dashed lines are for the improved case (r = 0.01, β as a neutral insertion in a haplo-insufficient essential gene). Even more robust control can be obtained by adding the X-shredder to the β construct and having the α construct drive autonomously in males and cause dominant sterility or lethality in females (e.g., target a female-specific haplo-insufficient locus; Fig 1, Design 5). As the Cas9 is encoded by α, homing of β also occurs only in males. The dynamics in this case are somewhat different from the others: the X-shredder does not function to directly increase the load, but instead it allows the α construct to spread in the population, because it will end up more often in males (where it homes), and less often in females (where it is a dead end). The male bias also protects the β construct from the female lethality produced by the α construct, and so selection against β is much weaker than in the previous designs, and resistance evolves more slowly (compare the rate of spread of the resistant b allele in Fig 2E to that in Fig 2A and 2C). As a result the design is able to perform well even with pre-existing resistance of up to 50%, but still not spread in the non-target population (Fig 2E and 2F). Moreover, if the population is not eliminated, it can nevertheless be suppressed for many generations. For example, with 50% pre-existing resistance the minimum population size reached is 2.15e-3, and the population remains below 5% of its pre-intervention size for 63 generations; with close linkage (r = 0.01), then the corresponding values are 8.79e-7 and 147 generations. A comparison of the maximum extent of suppression as a function of the pre-existing resistance frequency for the different designs is shown in Fig 4 (see also S4 Fig). Note that none of the modifications considered (linkage, use of an essential differentiated gene, or separate releases) has a qualitative effect on dynamics in the non-target population, as β is still unable to increase in frequency, and impacts on population size remain small (S5 Fig).

Evolutionary stability and impact of fitness costs

We now explore the consequences of relaxing two assumptions that have been implicit thus far in our modelling. First, we have assumed that our various constructs remain intact after release. In fact, mutations that destroy the function of one component or another will be expected to arise as the constructs spread through a population, particularly as homing may be associated with a higher mutation rate than normal DNA replication [33-35]. For components that contribute directly to their construct’s spread, one would expect that loss-of-function mutations would remain rare in the population and have little effect, whereas for other components (e.g., the X-shredder, especially in Designs 3 and 4), such mutations may be actively selected for. To investigate we allowed homing-associated loss-of-function mutations to occur in each component of each construct. Mutation rates of 10e-3 have a small but significant impact on the performance of the three designs with an X-shredder, due to the accumulation of mutant constructs missing that component, while mutation rates of 10e-4 have negligible impact for all designs (Figs 5A, S6 and S7).

Fig 5

The impact of evolutionary stability and added fitness costs on the performance of each of the 5 designs for population suppression.

The impact of evolutionary stability and added fitness costs on the performance of each of the 5 designs for population suppression.

(A) The effect of loss-of-function mutations on the minimum population size (number of adult females) reached. Solid lines are for the baseline case of no mutations, and dashed lines are with each component of each construct having a mutation rate of 10e-3 per homing event. Note that the effect is only visible for designs with an X-shredder, and if the mutation rate was 10e-4, the results for all designs would be virtually indistinguishable from the solid lines. (B, C) Contour plots showing combinations of fitness costs and pre-existing frequency of resistance giving a minimum population size of 10e-4 for different double drive designs. For (B) the costs are reductions in female fitness due to somatic expression of the nuclease targeting the A locus, and for (C) the costs are reductions in male fitness due to the X-shredder. Vertical lines indicate the cost is irrelevant, either because heterozygous females in any case have fitness 0 (Design 5 in (B)), or because the designs do not include an X-shredder (Designs 1 and 2 in (C)). Second, we have assumed thus far that the genetic constructs have little unintended impact on survival or reproduction. Experiments with An. gambiae have revealed at least two unintended fitness costs can occur, a reduced fitness of homing heterozygous females due to somatic expression of the nuclease [25,32], and reduced fitness of males expressing an X-shredder, possibly due to paternal deposition of the nuclease and/or reduced sperm production [36]. The first of these costs is not relevant to Design 5 (because heterozygous females die anyway), and the second is not relevant to Designs 1 and 2 (because they do not use an X-shredder), but in other contexts, as expected, these costs reduce performance, requiring a lower frequency of pre-existing resistance in order to achieve a particular level of suppression (Fig 5B and 5C).

Population replacement

Gene drive can be used not only for population suppression but also to introduce a new desirable ‘cargo’ gene into a target population for population replacement or modification–for example, a gene reducing a mosquito’s ability to transmit a pathogen [37,38]. In double drive designs for population replacement the α construct would carry the cargo and homing by α would require β, while that by β could be either autonomous or depend on α (analogous to Designs 1 and 2 for population suppression; Fig 6A). Both α and β could be inserted into neutral sites, or into essential genes in such a way as to minimise fitness effects. We have modelled these approaches assuming, for purposes of illustration, the cargo imposes a dominant 20% fitness cost on females, and find that, again, such double drives can spread rapidly through target populations even when there is significant pre-existing resistance, and would not spread in non-target populations fixed for the resistant allele (Fig 6B and 6C). Unless there is virtually no pre-existing resistance at the differentiated locus, double drives can keep the frequency of the cargo gene above 95% much longer than a single drive construct targeting a differentiated locus, either neutral or essential (Fig 6D). In the single locus case selection rapidly increases the frequency of a pre-existing functional resistant allele because there is both significant variation and significant fitness differences (arising from the cost of the cargo gene) at the same locus. By contrast, in the double drive case there is one locus at which there are fitness difference (due to presence/absence of the cargo) but very little variation (initially none, and arising only after release due to rare end-joining and loss-of-cargo events), and another (differentiated) locus at which there is pre-existing variation but much smaller fitness differences (arising only due to the statistical correlation between alleles at the two loci). Finally, as with double drives for population suppression, the protection provided by a double drive for replacement can be extended even further if the two loci are tightly linked and α is inserted in an essential gene, with end-joining repair producing nonfunctional alleles (Fig 6D). In the latter case the only source of functional cleavage-resistant cargo-less alleles are the constructs that lose their cargo during homing, and, since this is assumed to occur at a much lower rate than end-joining repair, more generations are needed for such alleles to become common and the duration of protection is extended.

Fig 6

Double drives for population replacement.

(A) Alternative double drive designs. (B) Timecourse of allele frequencies for Design 2r in a target population assuming 50% pre-existing resistance; “cargoless α construct” refers to α constructs that have lost their cargo gene. The dynamics for Design 1r are qualitatively similar. (C) Allele frequencies for Designs 1r and 2r in a non-target population with 100% pre-existing resistance, assuming insertion of both α and β into neutral unlinked loci (solid lines); or α as a neutral insertion into a haplo-insufficient essential gene closely linked to β (r = 0.01) (dashed lines). (D) Duration of at least 95% of adult females carrying a cargo gene (whether the rest of the construct is functional or not) as a function of the pre-existing frequency of resistance at the B locus for double drives 1r and 2r, where solid and dashed lines are as in (C). Also shown for comparison are results for a single drive (S) carrying the cargo at a neutral A locus (solid line) or a haplo-insufficient essential gene assuming end-joining repair produces only nonfunctional resistant alleles (dashed lines). Note that results for insertion of β into a haplo-insufficient essential gene would be virtually indistinguishable from the solid lines (C, D). All plots assume 20% fitness cost of the cargo on females and a homing-associated loss-of-function mutation rate of 10e-3.

Double drives for population replacement.

PAM site analysis in An. gambiae

To explore whether the type of population differentiation assumed in our modelling can exist in nature, we analysed published genome sequence data on An. gambiae mosquitoes from the Ag1000G project [39]. The Ag1000G dataset includes sequences from 16 mainland African populations and from populations on Mayotte and Bioko, two islands 500km off the east and 30km off the west coast of Africa, respectively. Note that in presenting this analysis we are not advocating the use of double drives on these islands, and merely wish to investigate whether the requisite differentiation can be found on island populations. For our analysis we focussed on potential PAM sequences (NGG or CCN), on the logic that a construct would be unlikely to mutate to recognise a new PAM, whereas this could occur for a protospacer. The entire dataset includes 57 million polymorphic sites, which we screened for PAM sites present in the island population and at a frequency <10%, <5%, or absent from all other populations. In Mayotte, for PAM sequences that were completely private to the island (i.e., not found in any other population), only 1 of them had no pre-existing resistance (i.e., was found in all 48 sequences from the island), whereas 25 had pre-existing resistance less than 20%, and 353 had pre-existing resistance less than 50%. PAM sequences with small but nonzero frequencies on the mainland were even more abundant (Fig 7). Bioko island is not as differentiated as Mayotte from the mainland populations, and the sample size is smaller (18 sequences), but still there are some potential candidate sites.

Fig 7

Frequency of PAM sites in island populations of An.

gambiae. Numbers of PAM sites (NGG or CCN) with varying frequencies of resistance in samples of An. gambiae from two oceanic islands (blue map circles): (A) Bioko island (n = 18 sequences, from 9 individuals), and (B) Mayotte island (n = 48 sequences, from 24 individuals), where the PAM site frequency in each non-target population (black map circles) is <10%, <5% or 0% (i.e., target site resistance is >90%, >95% or 100%). GG or CC dinucleotides which varied by at least one base were considered to be resistant.

Frequency of PAM sites in island populations of An.

Discussion

Given that some of the most promising gene drive approaches for population control use (CRISPR-based) sequence-specific nucleases, an obvious way to limit their spread and impact is to exploit sequence differences between target and non-target populations. In this paper we have proposed using a double drive design, here defined as one that uses two constructs, inserted at different locations in the genome, both of which can increase in frequency, at least initially, and which interact such that the transmission of at least one of them depends on the other. Previously published examples that fit this definition include those for 2-locus under-dominance [16,19,40,41], and Medusa [42], tethered [17], integral [30], and transcomplementing [43] gene drives. As with single-construct gene drives, these various proposed designs differ in purpose (suppression vs. modification), release rate needed to initiate spread (low vs high threshold), and the molecular basis for the superMendelian inheritance (homing, toxin-antidote interactions, or a combination of the two), and the suggested rationales for adopting these designs over single drives include allowing more localised population control and a more modular product development pipeline. The requirement that both constructs can increase in frequency over time excludes split drives [14,44-46] and killer-rescue systems [13,47], in which only one of the two components increases in frequency. In our proposed designs there is a division of labour between the two constructs, with one responsible for the desired impact (suppression or replacement) and the other for the population restriction, such that together they act as a double drive in the target population and as a split drive in non-target populations. Note that if there are multiple populations of the same species requiring control, each with a different private allele, the same α construct could be used in each case, with only a change in the insertion site of the β construct and the corresponding gRNA. This flexibility may be particularly useful when the α construct requires significant optimisation [30]. Moreover, the same strategies may also be useful for controlled suppression of a target population even when there is no concern about non-target populations (and therefore no need to target a private allele): by appropriate choice of construct components and insertion sites a form of “planned obsolescence” could be achieved, with a wider and more predictable range of suppression profiles (e.g., depth and duration of suppression) possible than with conventional single drives [see also [48]]. For designs that involve interacting insertions at two or more loci, their population genetic dynamics and impact will usually depend on the statistical correlation between the constructs, and therefore also on the degree of linkage between them [12]. As previously demonstrated for split cleave-and-rescue designs by Oberhofer et al. [15], the degree of linkage between constructs can therefore be used as a tunable parameter to control the dynamics. They found that the expected impacts of a release were stronger and longer-lasting with closely linked constructs than with distantly linked ones, and we found much the same with our double drive designs, though there are some differences in detail between the systems. In particular, with split cleave-and-rescue designs, which do not rely on homing, if there is complete linkage (i.e., no meiotic recombination) then the system behaves the same as a single locus construct, whereas that is not the case for our homing-based double drives, where constructs can be separated if one homes and the other does not, even if there is never any meiotic crossing over between them. Thus, in our model, setting r = 0 (while allowing separate homing events) does not reproduce the single drive dynamics. If the insertions are physically very close to each other, then there may be some direct mechanistic interaction between them (e.g., binding of one nuclease complex preventing the other from binding, or resection of DNA during the repair process leading to co-homing of the two insertions), or simultaneous cleavage may lead to a large deletion. We have assumed that our constructs are far enough apart as not to interact in this way (e.g., r = 0.01 corresponds on average to between 600kb and 1Mb in An. gambiae [49]). We have considered a range of double drive designs of increasing robustness, as judged by their ability to cope with an increasing frequency of pre-existing resistance at the differentiated locus. The simplest designs do not have any component beyond those needed for any CRISPR-based construct, and so should be widely applicable [43]. More powerful constructs can be made by adding an X-shredding sex ratio distorter to the load-inducing construct; these have been most effectively demonstrated in An. gambiae mosquitoes [11,50], but may also work more broadly [51]. Note that the optimal timing of homing and X-shredding during gametogenesis may be different, requiring different and compatible control sequences, which will need to be taken into account in construct design [50,52]. The ability to control gRNA expression in a tissue-specific manner would be helpful in this regard. In other species there are other ways to distort the sex ratio [53-55], and it would be interesting to model whether these alternatives would be expected to have the same impact as an X-shredder in the context of a double drive. Potentially an even simpler way to increase the load may be to include additional gRNAs in the α construct that cleave and knock out the function of other female fertility genes elsewhere in the genome [31,56]. The effect of such gRNAs would depend on the heterozygous and homozygous fitness effects of the mutations caused and, again, on the degree of linkage with the target site, and further modelling would be needed to investigate whether such a strategy is worthwhile. The most powerful design we considered targets a female-specific haplo-insufficient gene, or otherwise causes dominant female sterility or lethality. Such genes are not common, but there are some possible candidates [57-60], and our modelling motivates the search for others. Finally, performance (in terms of being able to cope with ever higher frequencies of pre-existing resistance) could presumably also be improved by using a third construct, to construct a triple drive, though modelling would be required to explore the implications of the many different configurations this extension would allow. The proposed strategy requires that there be a differentiated locus between target and non-target populations. It need not be an essential gene, and could even be selectively neutral. Our focus has been on using so-called private alleles–sequences that are present (but not necessarily fixed) in the target population, and absent (or of negligible frequency) in non-target populations. Our analysis of PAM sites in An. gambiae indicates that appropriately differentiated sites may exist in island populations of this species, though our analysis must be considered preliminary: the dataset does not include mainland sites in closest proximity to the island populations, where differentiation may be lower, and we have not considered potential polymorphism in the protospacer sequence (which, if present, may require the use of multiple gRNAs). We have focussed on nucleotide variation at PAM sites on the assumption that a construct is unlikely to mutate to recognise a new PAM; structural variation in the protospacer region may also be an appropriate basis for geographically restricting double drive spread. We have also not attempted to determine whether the observed differentiation is due solely to mutation and drift, or if selection may be involved as well. Note that the single drives modelled by Sudweeks et al. [23] require the opposite type of differentiation: sequences that are fixed in the target population, even if not private (i.e., even if found at appreciable frequencies in the non-target population [61]). In this latter scenario the challenge is not so much to have an impact on the target population as to not have an impact on the non-target population. What constitutes “acceptable non-impact” may differ widely from one use case to another and must be assessed on a case-by-case basis: in some circumstances spread of the construct and a transient decline in population size followed by recovery may be acceptable, whereas in others any significant spread of the construct may be unacceptable, regardless of impact on population size. Designs with non-autonomous homing of the β construct (Designs 2, 4, and 5) should be less likely to increase in frequency in the non-target population, and may therefore be preferable. We have focused in this paper on differentiated loci on autosomes, but note that for Design 5 the X-shredder is required for the spread of the α construct and, in principle, one could achieve population-restricted spread if the shredder targeted a population-specific sequence on the X chromosome (rather than inserting it into a population-specific autosomal sequence). In many species the X chromosome shows greater population differentiation than autosomes [62], so this alternative may be useful. Finally, if there are no private alleles in the target population, it may be worthwhile considering a two-step approach of first introducing a private allele into a population and then using that allele to control the population [22]. The ability of double drives to exploit private alleles that are selectively neutral and that have a frequency of only 50% (suppression) or 20% (modification) potentially makes this approach more feasible than would otherwise be the case. In this paper we have used a simple high-level modelling framework in which the generations are discrete, the population is well mixed, and dynamics are deterministic. This framework is appropriate for strategic models aiming to identify candidate approaches that are worthy of further investigation. For any specific use case the appropriate tactical models would need to be developed that incorporate more biological detail, including spatial and stochastic effects. In spatially distributed populations with local mating the statistical association between alleles at the two loci may differ from that in our well mixed model, and the quantitative dynamics thereby affected. Issues of evolutionary stability and the breakdown of constructs can also be more important in such models, as previously demonstrated for single drive homing constructs for population replacement [63]. Such extensions will be particularly important when the goal is to eliminate the target population, which is not possible in our deterministic models. Instead, we have reported the minimum relative population size achieved, which is expected to be related to the size of a population that could be eliminated, but determining the precise connection will require bespoke modelling tailored to a specific situation. Further extensions would be needed to allow for on-going movement between target and non-target populations–if there is on-going immigration into the target population, and this cannot be stopped, then it may not be possible to eliminate the target population with a single release of a double drive. Nevertheless, such a release may be sufficient to suppress the population to such an extent that it can be controlled by other means, including recurrent releases of the same constructs. If one is able to achieve an initial release rate of 1% into a target population, and that suppresses the population by a factor of 1000, then the same releases going forwards will constitute a 10-fold inundation, and self-limiting genetic approaches may be sufficient.

Methods

The basic deterministic modelling structure follows that of Burt & Deredec [12]. In brief, populations have discrete generations, mating is random, there are two life stages (juveniles and adults), and juvenile survival is density dependent according to the Beverton-Holt model with an intrinsic rate of increase (Rm) equal to 6 [56]. Genetic parameter values (rates of DNA cleavage, rates of alternative repair pathways, and the sex ratio produced by X-shredding) are as estimated from An. gambiae (S1 Table) [11,25,52]. Constructs may be inserted into a haplo-sufficient or haplo-insufficient female-essential gene (in which case gene function is disrupted), a selectively neutral sequence (in which case the insertion is also selectively neutral), or a haplo-sufficient or haplo-insufficient gene required for male and female viability (in which case the insertion is again selectively neutral, because it contains a re-coded version of the target gene [14,26,29], or is inserted in an artificial intron [30]). For constructs inserted into an essential gene we assume end-joining repair produces non-functional cleavage-resistant alleles [25,64], while for constructs inserted into selectively neutral sites the products of end-joining repair are also neutral. In all models we assume individuals with an intact CRISPR system suffer a 1% fitness cost for every different gRNA they carry as a cost of off-target cleavage, and for population replacement we assume the cargo gene imposes a 20% fitness cost on females. Both these costs are assumed to be dominant (i.e., not dosage-dependent). For simplicity, we assume all fitness costs affect survival after density dependent juvenile mortality and before censusing (e.g., as if pupae die). All results are for populations censused at the adult stage. Releases are of heterozygous adult males at 0.1% of the pre-release number of males, and if the two constructs are linked then they are assumed to be in cis; for constructs released in separate males we assume release rates of 0.05% of each. Additional details and a list of parameters and their baseline values is given in S1 Text, S1 and S2 Tables. Code for implementation of the simulations is available on GitHub (https://github.com/KatieWillis/DoubleDriveSimulator). For the PAM site analysis we screened the Ag1000G phase II SNP data for PAM sites (GG or CC dinucleotides) showing variation between samples at one or both nucleotides. PAM site frequencies were calculated per sampling location and filtered for those present in the island population and at <10%, 5%, or absent from all other populations, excluding those containing >5% missing data in at least one sampled population. Further details are given in the S1 Text. The map in Fig 7 was produced using the cartopy python package [65].

Supplemental methods.

(PDF) Click here for additional data file.

Model parameters and baseline values.

(PDF) Click here for additional data file.

Host gene disruption fitness costs.

(PDF) Click here for additional data file. Solid lines are for where ⍺ and β are unlinked and dashed lines for where they are linked (r = 0.01). Shown are the cases where β is inserted as a neutral insertion into (A) a neutral locus, (B) an essential haplo-sufficient gene or (C) an essential haplo-insufficient gene. Shown for comparison is a time course for a single drive targeting a haplo-sufficient female-specific viability gene (S). (TIF) Click here for additional data file.

Effect of releasing constructs in the same or different males.

Comparison of dynamics for Design 1 when the constructs are unlinked and are released in the same (A) or in different (B) males. If the constructs are released in separate males the initial correlation between ⍺ and β (black dotted line) is negative, allowing β (solid red line) to increase to a higher frequency than if released in the same males as ⍺ where it experiences higher fitness costs. Consequently ⍺ (solid blue line) is retained at high frequency in the population for longer resulting in a greater reduction in relative number of females, though there is a longer delay between release and impact. (C-G) Timecourse for the relative number of females over time for Designs 1–5 where constructs are unlinked and released in the same males (solid lines), linked and released in the same males (dotted lined), unlinked and released in different males (dashed lines) or linked and released in different males (dot-dashed). Pre-existing resistance is assumed to be 1% (C, D), 20% (E, F) and 50% (G). For Designs 2, 4 and 5 (D, F, G) separate releases only delay the impact because β cannot increase in frequency autonomously, whereas for Designs 1 and 3 separate releases can give a larger (though still delayed) impact when constructs are unlinked, but not when they are closely linked. Shown for comparison is a time course for a single drive targeting a female-specific viability gene with the same level of pre-existing resistance (1%, 20%, or 50%; black solid lines). (TIF) Click here for additional data file.

Example time courses for double drives for population suppression.

Design 1 (A, B) and 2 (B, C) assuming 1% and 100% pre-existing resistance in target and non-target populations. Design 3 (E, F) and 4 (G, H) assuming 20% and 100% pre-existing resistance in target and non-target populations. Design 5 (I, J) assuming 50% and 100% pre-existing resistance in target and non-target populations respectively. Plots for Designs 1, 3, and 5 are the same as in the main text, and presented here to facilitate comparisons. (TIF) Click here for additional data file.

The impact of alternative designs and their variants on population suppression as a function of the pre-existing frequency of resistance.

Solid lines are for baseline conditions (⍺ and β are released in the same males, β is in a neutral locus, and loci are unlinked), and are the same in each panel. Dashed lines are for variants where (A) β is inserted as a neutral insertion into an essential haplo-insufficient gene, (B) β is inserted as a neutral insertion into an essential haplo-sufficient gene, (C) loci are linked (r = 0.01), and (D) ⍺ and β are released in separate males, holding all other properties at baseline. (TIF) Click here for additional data file.

Timecourse of allele frequencies and population suppression (1-relative number of females) for Designs 1–5 in non-target populations where the resistant allele is present at 100%.

(A-E) ⍺ and β are unlinked (solid lines) or linked (dashed lines). (F-J) β is inserted into a neutral site (solid lines). Note that the effect of inserting β as a neutral insertion into a haplo-sufficient or haplo-insufficient essential gene would be virtually indistinguishable from the solid lines. (K-O) ⍺ and β are released in the same males (solid lines) or different males (dashed lines). (TIF) Click here for additional data file.

Timecourses for Designs 3, 4 and 5 assuming pre-existing frequency of resistance of 1%, where homing-associated loss-of-function mutations occur for each component of each construct with probability 10e-3.

For each design, the intact constructs (⍺, blue solid lines and β, red solid lines) increase in frequency together, causing the relative number of females (black dashed lines) to decline. For designs 3 and 4, loss-of-function mutations at the X-shredder (⍺-XS, blue dashed-dotted lines) are selected for, replacing ⍺ (A, C). Since ⍺-XS is identical to ⍺ in designs 1 and 2, the construct continues to reduce the relative number of females. If the population is not eliminated, β is eventually replaced by the resistant b allele and ⍺-XS is replaced by the wild-type A allele, allowing the population to recover. For design 5, loss-of-function mutations at the X-shredder (β-XS, red dashed-dotted lines) are also selected for, but increase in frequency more slowly than the ⍺-XS allele in Designs 3 and 4, resulting in the intact β construct persisting for longer. For all designs, loss-of-function mutations at each of the other components (Cas-9, gRNAA and gRNAB) remain at low frequency, having negligible impact on the efficacy of the designs (B, D, F). Note that these results are not directly comparable to Figs 2 and S3 due to differing pre-existing resistance frequencies. (TIF) Click here for additional data file.

Loss-of-function mutation rates of 10e-4 have minimal impact on the extent of population suppression.

Solid lines are for baseline conditions where constructs remain intact after release, while dashed lines are for homing-associated loss-of-function mutations occurring at each component of each construct with probability 10e-4. (TIF) Click here for additional data file. 8 Feb 2021 Dear Authors, We now have three detailed reviews of your manuscript. All of the reviewers have positive comments on your work. There is, however, a real need to revise the text to make the concepts and design more accessible to the general reader of PLOS Genetics. The reviewers provide some guidance. The manuscript is accepted with "minor revisions" because no further analyses are needed, but substantially improved accessibility of the material is a necessary condition for acceptance. This includes accessibility of the figures. One review had some concerns about whether this paper really broke new ground and another commented that it would have been useful (as well as traditional) in the introduction to mention other related work and concepts--the introduction could demonstrate the novelty of your ideas by presenting what came before them. One reviewer suggested further analyses to examine spatial structure, but also recognized that this could be beyond the scope of an initial paper--I agree. Finally, one reviewer requested that the model code be made available to others. This makes sense to me. Your revision should be accompanied with a point by point response to each reviewer comm In addition we ask that you: 1) Provide a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript. 2) Upload a Striking Image with a corresponding caption to accompany your manuscript if one is available (either a new image or an existing one from within your manuscript). If this image is judged to be suitable, it may be featured on our website. Images should ideally be high resolution, eye-catching, single panel square images. For examples, please browse our archive. If your image is from someone other than yourself, please ensure that the artist has read and agreed to the terms and conditions of the Creative Commons Attribution License. Note: we cannot publish copyrighted images. We hope to receive your revised manuscript within the next 30 days. If you anticipate any delay in its return, we would ask you to let us know the expected resubmission date by email to plosgenetics@plos.org. If present, accompanying reviewer attachments should be included with this email; please notify the journal office if any appear to be missing. They will also be available for download from the link below. You can use this link to log into the system when you are ready to submit a revised version, having first consulted our Submission Checklist. While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. Please be aware that our data availability policy requires that all numerical data underlying graphs or summary statistics are included with the submission, and you will need to provide this upon resubmission if not already present. In addition, we do not permit the inclusion of phrases such as "data not shown" or "unpublished results" in manuscripts. All points should be backed up by data provided with the submission. PLOS has incorporated Similarity Check, powered by iThenticate, into its journal-wide submission system in order to screen submitted content for originality before publication. Each PLOS journal undertakes screening on a proportion of submitted articles. You will be contacted if needed following the screening process. To resubmit, you will need to go to the link below and 'Revise Submission' in the 'Submissions Needing Revision' folder. Please let us know if you have any questions while making these revisions. Yours sincerely, Fred Gould Guest Editor PLOS Genetics Gregory Copenhaver Editor-in-Chief PLOS Genetics Dear Authors, We now have three detailed reviews of your manuscript. All of the reviewers have positive comments on your work. There is, however, a real need to revise the text to make the concepts and design more accessible to the general reader of PLOS Genetics. The reviewers provide some guidance. The manuscript is accepted with "minor revisions" because no further analyses are needed, but substantially improved accessibility of the material is a necessary condition for acceptance. This includes accessibility of the figures. One review had some concerns about whether this paper really broke new ground and another commented that it would have been useful (as well as traditional) in the introduction to mention other related work and concepts--the introduction could demonstrate the novelty of your ideas by presenting what came before them. One reviewer suggested further analyses to examine spatial structure, but also recognized that this could be beyond the scope of an initial paper--I agree. Finally, one reviewer requested that the model code be made available to others. This makes sense to me. Your revision should be accompanied with a point by point response to each reviewer comment. Reviewer's Responses to Questions Comments to the Authors: Please note here if the review is uploaded as an attachment. Reviewer #1: Synthetic gene drives have considerable potential for landscape-scale suppression of wild populations that carry disease agents (such as malaria) or cause significant environmental or agricultural damage (e.g. invasive pests). While there is considerable excitement about the prospect of gene drive deployment, there are also legitimate concerns about the potential impact of gene drive leakage into non-target populations. Recent studies have indicated that one possible strategy for target-population specific control is to exploit genetic differences in target and non-target populations – for example Sudweeks et al investigated if “private alleles” and be used for population suppression via single gene drive strategies. Such approaches are very sensitive to pre-existing resistance alleles – essentially the private allele needs to be completely fixed in the target population (but low levels in the non-target population can be tolerated). Here, Willis and Burt investigate if an alternative (more likely?) genetic architecture can be exploited – whereby the differentiated allele is present in the target population at high frequency (but not completely fixed) and not in the non-target population. Various double drives strategies are examined using a simple deterministic modelling approach. Some scenarios, even with relatively limited differentiation in the target population (e.g. 50% pre-resistance, Design 5), show quite remarkable suppression dynamics. Drive designs are realistic (based on existing studies at least in insects) and modelling parameters based on experimental data in An. Gambiae (noting that homing drives seem particularly efficient in Anopheles). It would be really interesting to see how the strategies perform with more realistic stochastic, individual-based (spatial) modelling but I accept their argument that this “first-pass” analysis. The manuscript is clearly written, logically presented and is a significant body of work. I do not have any major concerns. The following points should be addressed. 1. Line 201-5 This is confusing – the text mentions “…killing heterozygous female (Fig 1 Designs 3 and 4)” – but Fig 1 indicates design 3 and 4 are haplosufficient (only design 5 is haploinsufficient). Further the Simoni et al [27] referenced gene drive is inserted into a haplosufficient gene. 2. Design 5 is a little unclear. The alpha locus drives autonomously in males (line 210) – this should be indicated in Fig. 1. Given there is only one Cas9 source (from the alpha locus) – then locus B should also be homing only in males. I cant see this explicitly mentioned and want to confirm this is what was modelled. 3. Very tight linkage between the alpha and beta loci could be confounded by the generation of large deletions (generated by end-joining after simultaneous DBSs) – this should be acknowledged in the Discussion. 4. Line 125-6 Some explanation should be provided regarding the strategy whereby functional resistance alleles will be avoided. 5. At lines 145-6, given an essential gene is not used in this context, it should be made clearer how the population rebound is occurring i.e. end-joining indels versus selection for the 1% non-target alleles that are present at the outset. 6. The significance of including the An. coluzzii data (Fig. 7) should be make clearer. It seems only An. gambiae is relevant. Also, why include the “uncertain” population in the legend? 7. Fig 7 shows a “proof-of-concept” analysis for the existence of “private allele” PAM sites in hypothetical target (island) versus mainland populations. Although the limited analysis is probably sufficient to make their point it would be interesting to know location of the PAMs, (i.e. the types of genes they reside in, particularly given that some strategies rely on an essential gene for the Beta locus), predicted off-target profile and on-target activity of their cognate protospacer sequences. 8. Line 358 A small caveat here that is hinted at but could be made more obvious is that the X-shredder could not be Cas9- based (if the homing constructs also use Cas9) – Cas9 is used for X-shredding in ref 43 and 44 but of course I-PpoI has been developed in An. Gambiae. 9. Given that homing rates in other species are generally lower and Anopheles, it would be interesting to see an analysis of a high performing drive strategy with cleavage/indel/homing rates from another well characterised insect – e.g. Drosophila melanogaster. Indeed, a sensitivity analysis for a high performing strategy would enhance the paper. 10. Line 78 The phrase “in the computer” is a touch awkward. 11. Line 197 Reference Sup Fig S4 here. 12. Line 431-2 Reference table S1 here. 13. What does “alpha-cargo” signify in Fig 6B? Reviewer #2: This is an interesting paper in which methods (double drive) for local and transient (assuming extinction does not occur) population suppression, and modification, are described. Several other multi-component HEG-based systems have been previously described for modification. These are often somewhat baroque and have a number of requirements that may not be met in the real world in terms of target site conservation, etc. Other non-HEG-systems have also been proposed. This paper rises above this earlier work because it cleverly leverages the features of HEGs (in anopheles mosquitoes) that are known to work, in ways that make success likely. In particular, it leverages the fact of resistance allele formation in ways that are uniquely positive, rather than negative, as well as the previously reported ability to target the dsx locus without resistance allele formation, and to bias sex ratio using autosomal X shredders. In short, while this is a purely modeling paper, there is little doubt that multiple of the strategies described can be successfully implemented in Anopheles, and with some biological luck, elsewhere. The simplest version of the idea is clear. An alpha element is designed to insert into and disrupt the female-specific functions of dsx, resulting ultimately in unfit homozygous females and fit males. This element is non-autonomous, containing either gRNAs or Cas9, but not both. A second element, beta, is inserted at a neutral locus that is enriched within the target population, and rare outside it. Beta is autonomous, and complements the missing alpha function, resulting in homing of alpha when both elements are present in the same individual. What makes this design (and related designs in which there is cross-complementation of various sorts) interesting is the fact that failure of the beta element to spread to all versions of its target site is guaranteed, and does not disrupt the intent. Beta's job is simply to increase in frequency enough to push alpha towards fixation, at which point population suppression ensues. Beta can either target a private allele, or in a more general version of the system, simply a neutral locus, with the inevitable appearance of resistant alleles ultimately limiting the spread of it and alpha. Thus, the key here is that with the designs proposed the authors are able to take advantage of what we (the field) already know how to do really, really, well, which is to build a HEG that fails due to resistance allele formation or the presence of pre-existing polymorphisms that are resistant alleles. The designs are plausible, because they utilize the known: consistent targeting of dsx, inconsistent targeting of pretty much any other site, either due to pre-existing polymorphisms, or new resistant allele formation, and X shredding-based sex ratio bias. My one specific request is that the authors provide the code on github for their work, along with an explanation of how to use it. Referencing back to a paper from 2008, which also does not provide code, does not provide sufficient guidance to understand and explore the many variables, loci and alleles worked with in the current manuscript. It is necessary that others be able to not only read the text, but also work with tools that allowed its creation. In the absence of these methods the paper is hard to explore, and it is not possible to test other scenarios. The details of how the model was implemented are undoubtedly very interesting. It clearly involved a lot of work (which remains invisible in the background otherwise), as it involves many alleles at multiple loci, shredding, homing and recombination. One hopes that general models such as this that can handle a lot contribute to the field eventually adopting a coherent modeling platform rather than a series of lab-specific one-offs. The manuscript is very dense and almost always asks the reader to already be intimately familiar with the behavior of related systems. The text describing the different designs and outcomes reads a bit like a math textbook. There is no walk through for the uninitiated, and sections typically end with a statement of outcome, in much the way a math text book would end a proof with the phrase "from inspection it can now be inferred that...", which then leads to a figure and a turn to a new chapter. Or to put it another way, in reading and re-reading the text and very dense figures (which rarely or never guide the reader through the data), I was reminded of the first line from Ludwig Wittgenstein's Tractatus, Logico Philosophicus: "This book will perhaps only be understood by those who have themselves already thought the thoughts which are expressed in it—or similar thoughts." I would like to strongly suggest that the authors make more of an effort to walk readers through the designs and how they work. The figures are very dense and key features of the panels are never discussed. The figures are simply presented, along with very minimal text, and the authors then move on to the next section, with the implicit understanding that the reader will somehow muddle through. Other suggestions are below. Many of these also relate in one way or another to expanding discussion to make the work more accessible to non-specialists. The ideas presented are really beautiful and plausible, and I just think that they can be made more accessible in ways that will allow others to better intuit the forces at work and how the ideas presented could be extended further. The manuscript is not overly long, and is online only in any case. If the authors feel strongly for some (compelling) reason that they want to keep the text tight and telegraphic they could also consider an extended supplementary discussion of the various systems. A nice example of this is the remarkably complete supplemental notes file provided in another article from this group "How driving endonuclease genes can be used to combat pests and disease vectors" In the introduction the authors might consider introducing other double drives from the literature. This would allow the authors then to begin drawing distinctions between prior goals and their own, and how their constructs etc therefore differ. See related comments on discussion. Its also a bit hard, in figures like 2 and other similar figures, to keep track of what all the lines mean. It requires going back and forth between the figure legend and Figure 1. It would be nice to introduce these a bit more visually, perhaps in a box at the top of figure 2. And again, none of this is discussed in the text. It might be nice if the authors would note at some point that the idea of using linkage to increase drive strength has been noted and explored to some extent in other systems. In figure 3 it would be good to walk the reader a little bit through the significance of targeting a haploisufficient locus that manifests phenotypes in particular sexes/genotypes. Again, this is one of those points in the text where the reader really needs to have a deep familiarity with the prior literature in order to understand how and when this manifests itself, and how this contributes to suppression while still allowing strong drive. In the legend for figure 3 it is not immediately clear what is going on with the single locus drive and why it works so much less well. Since a single locus drive is presumably inserted into the dsx locus, which thus far does not accumulate resistance alleles, what causes the failure? The legend makes no mention of these or other potentially important variables. Is there a resistance allele in there somewhere that is not mentioned? If so, why is it present in single locus dsx, but not the alpha locus of the two locus versions? Legends should include all the relevant variables at work. In line 200 it would be nice to explain in a bit more detail why the x shredder kills heterozygous females. When is it expressed and how does this still allow rapid and strong drive In lines 211 onwards this is the first place where the authors make an attempt to walk the reader a bit through how the system, a particularly complicated one, does its job. Line 239. This is an odd reference for the idea that hegs have reduced replication fidelity. While the referenced construct does have reduced stability due to the presence of repeats, this is not the same thing as the reduced fidelity that may be associated with homing per se. Is it appropriate to also reference some of the evidence suggesting HR repair of a ds break may lead to higher frequency of mutation, or do you just mean to focus on the repetitive components, gRNAs, if multiple are used? Though if dsx is being targeted with a single gRNA then it is not clear there is a repetitive element. For Fig 6 I don't quite understand the lines in panel B. In particular why is the frequency of alpha different from that of alpha-cargo. The alpha construct contains the cargo, so shouldn't they be the same? This figure is also one where a bit more captioning in the figure itself would help the reader. Finally, If alpha-cargo is the entity of interest, wouldn't it make more sense to show its genotype frequency rather than allele frequency, or at least both? The allele frequency appears to saturate at about 65-70%, but it is unclear what this means in terms of carrier frequency. In Fig. 6C the non-target population seems to perhaps rise to a significant number, maybe 30%. It is hard to know what the number is since its a log scale. It would be useful here and perhaps a few other places to provide actual numbers so the reader can evaluate this, since non-target population effects are the topic that tends to get folks excited. The data presented on private allele frequencies is intriguing, and the authors are appropriately cautious about its interpretation given the limited data available. In lines 358-353 in the discussion, the authors bring up, for the first time, other two locus systems. Some, such as myself, would have placed this and more in a manuscript introduction that put the current work on two locus systems into a historical context that discussed what had come before, including other two locus homing systems. The authors have chosen not to do that, or provide any more than a quick reference nod to this and other literature in their discussion. It is an interesting strategy, as it keeps the readers attention squarely focused on the authors' thoughts and accomplishments, while minimizing any mental clutter from the literature that might interfere with this task. That said, while the prior literature is not discussed, its existence is noted, and thus represents a stylistic choice that is acceptable. In line 363 the authors discuss the idea of adding extra gRNAs to the alpha construct. They say this will increase the load, but they dont explain how this will work. Can they do this, briefly? Spitting off LOF mutations (or HEGs?) at independently segregating loci does this because......? Again, the diligent reader can at some point come up with a plausible scenario, but the authors could guide them to this point directly in a few lines. The two references provided may not be the most relevant. One is an old paper that talks about several uncloned maternal effect mutants and their effects on embryo development, and the other discusses roles of fruitless in males, not females. Is there something more relevant based on cloned genes with known behavior that could be cited? In any case, the key point is to walk the reader through how these alleles bring about increased load while still allowing drive. On line 144 the authors refer to dominant fitness costs. Can they clarify that this does not mean (I assume) completely dominant, as in heterozygote fitness cost = homozygote fitness cost. If it does mean completely dominant can they explain the basis for this, as I would normally imaging fitness costs to be additive in some sense. Are they meaning to imply that off target cleavage effects are 100% regardless of the dosage? That might explain it. Reviewer #3: Willis and Burt present a simulation study of gene drive approaches that could potentially be localized to specific target populations. They focus on a combination of two basic concepts: two-locus drives (aka “double drives”) and the targeting of “private” alleles. The key idea is that there is a division of labor between the constructs at the two loci, with the first responsible for the desired impact (population suppression or replacement) and the second for the localization, achieved through the targeting of a private allele. The first construct can only drive when the second is present. In the target population, both constructs are therefore expected to drive, whereas they would resemble a split drive in non-target populations. The paper conducts computer simulations of a number of different such strategies. This is a well-written paper on an interesting topic. It is definitely more plausible that a real-world use case of a gene drive might exist for a type of drive that can be localized, as compared to standard homing drives that have the potential to spread around the world. The inclusion of an analysis of actual polymorphism data in a mosquito species to see how common the required “private” alleles are observed is quite interesting as well. I think we generally expect these sorts of sequence variations to be present between differentiated populations, and I wouldn’t say that it is critical that a modeling paper include this kind of analysis, but it’s interesting to see. My main comment is concerned with the extent to which this study provides an advancement over previous studies that have already provided a proof-of-principle of double drives achieving localization by targeting private alleles (notably, the recent study by Sudweeks et al). The authors of the current study are very clear that their study is still intended to serve as a proof-principle, rather than an assessment of an actual potential application, which would certainly require more detailed and realistic modeling to be meaningful (although they do study specific populations of An. gambiae in their analysis of polymorphism data). As such, the current study appears to re-tread some conceptual ground that has already been explored in previous studies. Obviously, there are some differences between the specific drive designs studied in their paper and Sudweeks et al. The latter study also focused on sequences that are fixed in the target population, but absent or at low frequency in the non-target population (the opposite of the current study). Whether these differences constitute enough advancement for the journal is a subjective question I want to leave to the editor to decide. Another (admittedly very general) concern is how confident we can be that even basic, qualitative conclusions from this study will ultimately hold in a real-world population, given the rather simplistic assumptions made by the modelling. In particular, it is unclear whether results from a panmictic population model still apply for realistic populations that are inherently spatially structured, for example, because they occupy a continuous landscape with limited dispersal. This can affect dynamics not only quantitatively, but can give rise to new qualitative phenomena that fundamentally change the outcome of a drive (e.g. resulting in a suppression drive failing to suppress a population). At the core of such new phenomena often lies the heterogeneity that can arise in spatial populations, which doesn’t exist in panmictic models. I would expect that two-locus drive strategies would be particularly prone to such issues, as compared to single locus strategies. This is due to the assumption made by panmictic models that the frequency of allele combinations at the two loci is simply the product of their individual population frequencies. In a spatial population, however, the two alleles could spread at considerably different rates, and thus be present very heterogeneously. The panmictic model would then rely on fundamentally wrong predictions of how frequently such alleles would actually “meet” in an individual based on their overall population frequencies, and this could certainly affect the dynamics profoundly. I admit that a comprehensive analysis of such questions would be open-ended and probably beyond the scope of the current paper. The authors do already discuss several other limitations of their model in the discussion section. Maybe a mention of the potential impact of spatial heterogeneity and limited dispersal could be added as well. Minor comments: The authors might consider reworking and extending the methods section. It is very short and seems a bit rushed compared to the rest of the paper. Some parameters are specified in this section, while not really enough detail is provided about what those parameters do. E.g. on line 428-430: “juvenile survival is density dependent according to the Beverton-Holt model, which has two parameters, but since we report results in terms of relative population sizes, only one matters, the intrinsic rate of increase (Rm).” Somehow this sentence simultaneously tells me what specific implementation the model uses, while telling me very little about that implementation, and tells me that there are two parameters, and then tells me that there is only one parameter. It also seems a little awkward that the PAM material and the modeling material are not in separate paragraphs. I am aware that these authors focused more attention on the supplemental methods section, but I feel that if there is going to be a short methods section in the main body, it should focus on giving a rough sketch of how the model works without going into unnecessary specifics, but ideally at least introducing all aspects of the model. 2. I find the statement that their model is individual-based a bit misleading. The model used portions of the total starting number of individuals who are in a given state, and as the population declines, the simulation isn’t tracking an actual simulated population size, but rather, a size relative to the equilibrium size. I’m sure this allows for simulations to be performed with great computational efficiency, and no doubt this type of model has its place in gene drive research. Still it is not truly an individual-based model, resulting in somewhat awkward situations such as that actual population elimination is never possible (as frequencies can become arbitrarily small but never truly reach zero). 3. The caption of Figure 2 seems to be missing some details (e.g. it doesn’t state that the dashed lines indicate resistance). The figure caption could probably be simplified a lot if a legend were included with the figure. In general, I feel that several of the figures might benefit from having legends, especially because the same style of line means different things from figure to figure (e.g. in Figure 2, the dashed lines indicate resistance within the population, in Figure 3 the dashed lines indicate that construct β is inserted in a haplo-insufficient target). Including a legend would probably make the figures a fair bit easier to digest. ********** Have all data underlying the figures and results presented in the manuscript been provided? Large-scale datasets should be made available via a public repository as described in the PLOS Genetics data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information. Reviewer #1: Yes Reviewer #2: None Reviewer #3: Yes ********** PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #2: Yes: Bruce A Hay Reviewer #3: No 1 Mar 2021 Submitted filename: WillisandBurt_response_to_reviewers.pdf Click here for additional data file. 7 Mar 2021 Dear Dr Willis, We are pleased to inform you that your manuscript entitled "Double drives and private alleles for localised population genetic control" has been editorially accepted for publication in PLOS Genetics. Congratulations! Before your submission can be formally accepted and sent to production you will need to complete our formatting changes, which you will receive in a follow up email. Please be aware that it may take several days for you to receive this email; during this time no action is required by you. Please note: the accept date on your published article will reflect the date of this provisional acceptance, but your manuscript will not be scheduled for publication until the required changes have been made. Once your paper is formally accepted, an uncorrected proof of your manuscript will be published online ahead of the final version, unless you’ve already opted out via the online submission form. If, for any reason, you do not want an earlier version of your manuscript published online or are unsure if you have already indicated as such, please let the journal staff know immediately at plosgenetics@plos.org. In the meantime, please log into Editorial Manager at https://www.editorialmanager.com/pgenetics/, click the "Update My Information" link at the top of the page, and update your user information to ensure an efficient production and billing process. Note that PLOS requires an ORCID iD for all corresponding authors. Therefore, please ensure that you have an ORCID iD and that it is validated in Editorial Manager. To do this, go to ‘Update my Information’ (in the upper left-hand corner of the main menu), and click on the Fetch/Validate link next to the ORCID field. This will take you to the ORCID site and allow you to create a new iD or authenticate a pre-existing iD in Editorial Manager. If you have a press-related query, or would like to know about making your underlying data available (as you will be aware, this is required for publication), please see the end of this email. If your institution or institutions have a press office, please notify them about your upcoming article at this point, to enable them to help maximise its impact. Inform journal staff as soon as possible if you are preparing a press release for your article and need a publication date. Thank you again for supporting open-access publishing; we are looking forward to publishing your work in PLOS Genetics! Yours sincerely, Fred Gould Guest Editor PLOS Genetics Gregory P. Copenhaver Editor-in-Chief PLOS Genetics www.plosgenetics.org Twitter: @PLOSGenetics ---------------------------------------------------- Comments from the reviewers (if applicable): The authors have addressed all of the reviewers comments either by making changes to the manuscript or by explaining why changes are not needed or not appropriate. As such, I find the manuscript ready for publication. ---------------------------------------------------- Data Deposition If you have submitted a Research Article or Front Matter that has associated data that are not suitable for deposition in a subject-specific public repository (such as GenBank or ArrayExpress), one way to make that data available is to deposit it in the Dryad Digital Repository. As you may recall, we ask all authors to agree to make data available; this is one way to achieve that. A full list of recommended repositories can be found on our website. The following link will take you to the Dryad record for your article, so you won't have to re‐enter its bibliographic information, and can upload your files directly: http://datadryad.org/submit?journalID=pgenetics&manu=PGENETICS-D-20-01914R1 More information about depositing data in Dryad is available at http://www.datadryad.org/depositing. If you experience any difficulties in submitting your data, please contact help@datadryad.org for support. Additionally, please be aware that our data availability policy requires that all numerical data underlying display items are included with the submission, and you will need to provide this before we can formally accept your manuscript, if not already present. ---------------------------------------------------- Press Queries If you or your institution will be preparing press materials for this manuscript, or if you need to know your paper's publication date for media purposes, please inform the journal staff as soon as possible so that your submission can be scheduled accordingly. Your manuscript will remain under a strict press embargo until the publication date and time. This means an early version of your manuscript will not be published ahead of your final version. PLOS Genetics may also choose to issue a press release for your article. If there's anything the journal should know or you'd like more information, please get in touch via plosgenetics@plos.org. 18 Mar 2021 PGENETICS-D-20-01914R1 Double drives and private alleles for localised population genetic control Dear Dr Willis, We are pleased to inform you that your manuscript entitled "Double drives and private alleles for localised population genetic control" has been formally accepted for publication in PLOS Genetics! Your manuscript is now with our production department and you will be notified of the publication date in due course. The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript. Soon after your final files are uploaded, unless you have opted out or your manuscript is a front-matter piece, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers. Thank you again for supporting PLOS Genetics and open-access publishing. We are looking forward to publishing your work! With kind regards, Katalin Szabo PLOS Genetics On behalf of: The PLOS Genetics Team Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom plosgenetics@plos.org | +44 (0) 1223-442823 plosgenetics.org | Twitter: @PLOSGenetics

56 in total

Review 1. Evaluating genomic signatures of "the large X-effect" during complex speciation.

Authors: Daven C Presgraves
Journal: Mol Ecol Date: 2018-07-16 Impact factor: 6.185

2. Variation in recombination rate across the X chromosome of Anopheles gambiae.

Authors: Marco Pombi; Aram D Stump; Alessandra Della Torre; Nora J Besansky
Journal: Am J Trop Med Hyg Date: 2006-11 Impact factor: 2.345

3. A CRISPR homing gene drive targeting a haplolethal gene removes resistance alleles and successfully spreads through a cage population.

Authors: Jackson Champer; Emily Yang; Esther Lee; Jingxian Liu; Andrew G Clark; Philipp W Messer
Journal: Proc Natl Acad Sci U S A Date: 2020-09-14 Impact factor: 12.779

4. A CRISPR-Cas9 sex-ratio distortion system for genetic control.

Authors: Roberto Galizi; Andrew Hammond; Kyros Kyrou; Chrysanthi Taxiarchi; Federica Bernardini; Samantha M O'Loughlin; Philippos-Aris Papathanos; Tony Nolan; Nikolai Windbichler; Andrea Crisanti
Journal: Sci Rep Date: 2016-08-03 Impact factor: 4.379

5. Requirements for Driving Antipathogen Effector Genes into Populations of Disease Vectors by Homing.

Authors: Andrea Beaghton; Andrew Hammond; Tony Nolan; Andrea Crisanti; H Charles J Godfray; Austin Burt
Journal: Genetics Date: 2017-02-03 Impact factor: 4.562

6. Performance analysis of novel toxin-antidote CRISPR gene drive systems.

Authors: Jackson Champer; Isabel K Kim; Samuel E Champer; Andrew G Clark; Philipp W Messer
Journal: BMC Biol Date: 2020-03-12 Impact factor: 7.431

7. Split drive killer-rescue provides a novel threshold-dependent gene drive.

Authors: Matthew P Edgington; Tim Harvey-Samuel; Luke Alphey
Journal: Sci Rep Date: 2020-11-25 Impact factor: 4.379

8. Fruitless mutant male mosquitoes gain attraction to human odor.

Authors: Nipun S Basrur; Maria Elena De Obaldia; Takeshi Morita; Margaret Herre; Ricarda K von Heynitz; Yael N Tsitohay; Leslie B Vosshall
Journal: Elife Date: 2020-12-07 Impact factor: 8.140

9. Retarded nuclear migration in Drosophila embryos with aberrant F-actin reorganization caused by maternal mutations and by cytochalasin treatment.

Authors: K Hatanaka; M Okada
Journal: Development Date: 1991-04 Impact factor: 6.868

10. Development of a confinable gene drive system in the human disease vector Aedes aegypti.

Authors: Ming Li; Ting Yang; Nikolay P Kandul; Michelle Bui; Stephanie Gamez; Robyn Raban; Jared Bennett; Héctor M Sánchez C; Gregory C Lanzaro; Hanno Schmidt; Yoosook Lee; John M Marshall; Omar S Akbari
Journal: Elife Date: 2020-01-21 Impact factor: 8.140

2 in total

Review 1. Gene Editing and Genetic Control of Hemipteran Pests: Progress, Challenges and Perspectives.

Authors: Inaiara D Pacheco; Linda L Walling; Peter W Atkinson
Journal: Front Bioeng Biotechnol Date: 2022-06-07

2. Propagation of seminal toxins through binary expression gene drives could suppress populations.

Authors: Juan Hurtado; Santiago Revale; Luciano M Matzkin
Journal: Sci Rep Date: 2022-04-15 Impact factor: 4.996

2 in total