Hominin evolution is characterized by progressive regional differentiation, as well as migration waves, leading to anatomically modern humans that are assumed to have emerged in Africa and spread over the whole world. Why or whether Africa was the source region of modern humans and what caused their spread remains subject of ongoing debate. We present a spatially explicit, stochastic numerical model that includes ongoing mutations, demic diffusion, assortative mating and migration waves. Diffusion and assortative mating alone result in a structured population with relatively homogeneous regions bound by sharp clines. The addition of migration waves results in a power-law distribution of wave areas: for every large wave, many more small waves are expected to occur. This suggests that one or more out-of-Africa migrations would probably have been accompanied by numerous smaller migration waves across the world. The migration waves are considered "spontaneous", as the current model excludes environmental or other extrinsic factors. Large waves preferentially emanate from the central areas of large, compact inhabited areas. During the Pleistocene, Africa was the largest such area most of the time, making Africa the statistically most likely origin of anatomically modern humans, without a need to invoke additional environmental or ecological drivers.
Hominin evolution is characterized by progressive regional differentiation, as well as migration waves, leading to anatomically modern humans that are assumed to have emerged in Africa and spread over the whole world. Why or whether Africa was the source region of modern humans and what caused their spread remains subject of ongoing debate. We present a spatially explicit, stochastic numerical model that includes ongoing mutations, demic diffusion, assortative mating and migration waves. Diffusion and assortative mating alone result in a structured population with relatively homogeneous regions bound by sharp clines. The addition of migration waves results in a power-law distribution of wave areas: for every large wave, many more small waves are expected to occur. This suggests that one or more out-of-Africa migrations would probably have been accompanied by numerous smaller migration waves across the world. The migration waves are considered "spontaneous", as the current model excludes environmental or other extrinsic factors. Large waves preferentially emanate from the central areas of large, compact inhabited areas. During the Pleistocene, Africa was the largest such area most of the time, making Africa the statistically most likely origin of anatomically modern humans, without a need to invoke additional environmental or ecological drivers.
Hominins are generally supposed to have originated in Africa and settled most of Africa and the southern half of Eurasia in the early Pleistocene [1-5]. Fossil evidence suggests that earliest Homo sapiens appeared in Africa during the late Middle Pleistocene (Jebel Irhoud, Omo and Herto [6-8]). Anatomically modern humans (AMH) emerged, spread out of Africa during the Late Pleistocene and now occupy the whole world as the only Homo species [9-14]. In the intervening time, a number of Homo species, such as Neanderthals, Denisovans, Homo erectus, H. heidelbergensis, H. ergaster, etc. existed, developed and disappeared again [15-22]. Pleistocene human evolution is thus characterised by differentiation, speciation, migration waves and extinction events. Most authors assume that the out-of-Africa spread of AMH involved one or more migration waves that replaced Homo species that existed at the time with only limited genetic admixture, such as between Neanderthals and AMH [23-24]. Many studies have addressed the timing and origin of migration waves, as well as migration paths (e.g. [13,23,25-27]). Apart from the major out-of-Africa event(s), AMH populations experienced several more migration waves within already populated areas, such as Africa [28-29] and Europe [30-31]. Considering this, it is not unlikely that more migration waves occurred in the Pleistocene, but the sparse fossil record still makes it difficult to detect any.Assuming that migration waves did happen, the question arises what caused them, in particular in relation to the spread of AMH. Most authors favour some competitive advantage of AMH over other Homo species [32-34]. With climate change now central in the scientific discourse, many recent studies suggest that climate played an important role in the environmental changes making AMH more competitive than other Homo species, or allowing the opening of ecological corridors that triggered the dispersal of Homo sapiens out of Africa [35-44]. An opposite view is that AMH and Neanderthals actually had no competitive advantage over each other [45]. The argument, based on a numerical model, is that, because AMH's population was much larger than that of Neanderthals [12,46-48], AMH's were statistically more likely to reach fixation in both Africa and Europe.Although modelling is extensively used in studies of human evolution (e.g. [49-51]), relatively few studies so far employed forward approaches on an explicit map to, for example, determine amounts of admixture, stepping stones or the most probably origin of AMH [39,45,52-55]. Kolodny and Feldman [45] used a simple "map" with only two demes to determine the chances that AMH would replace Neanderthals purely due to the larger population of AMH in Africa compared to Neanderthals in Europe. The SPLATCHE2 simulations [55] and the similar model by Eriksson et al. [39] assumed a priori that AMH have some advantage and explored the advance of AMH in relation to climate factors. These various models have in common that the species (AMH) and its competitive advantage are predefined in the parameters of the model.Here we present a basic numerical model to simulate the spatial and temporal differentiation/speciation and the emergence, frequency and patterns of migration waves. The model is not restricted to any particular species, but is applied here to human evolution, because there is probably no other species than AMH whose evolutionary origins have attracted so much general and scientific interest. Contrary to the above-mentioned models, no human species are defined a priori. The only aim is to explore the statistics of patterns, without claiming or attempting to simulate the actual emergence and spread of AMH, which would only be one of an infinite number of realisations of the stochastic model. Although a range of environmental factors have been invoked to explain various aspects of human evolution, we expressly do not include these in the models presented here. The premise here is that neutral evolutionary processes already lead to complex and emergent behaviour that resulted in human evolution patterns, including migration waves. The aim is thus to provide a null hypothesis against which additional environmental factors and influences can be further tested.
Methods
The model is based on a regular 2-dimensional grid of demes, each a square area with a number of individuals. The genome (G; see Table 1 for list of symbols) of a deme is defined by a single string of N binary genes, with two alleles, either zero or one (similar to e.g. [56-57]). This genome is the single dominant or representative genome for the population of a deme. The temporal and spatial evolution of genomes is modelled in discrete time steps of length Δt in which mutations take place and genes are transferred between neighbouring demes. The distribution of genomes is visualised in RGB colour maps in which each deme is one pixel. Red is proportional to the number of ones in the deme's genome, blue proportional to the number of genes identical to G = {10101010, etc.} and green proportional to the number of genes identical to G = {00110011, etc.}. Colours are stretched to maximise the colour range.
Table 1
List of symbols and abbreviations in order of appearance in text.
Symbol
Definition/description
G
The genome of a single deme, expressed as a binary string of zeros and ones
N
Number of genes in a genome. Typically at least several hundreds.
t, Δt
Time and duration of a numerical time step
M
Number of mutations occurring in the whole model per time step. Set to 4 per time step.
m
Chance, per time step, that a mutation occurs in a particular deme
A
Total number of populated demes in model
ND
Number of individuals that populate one deme (Eqs 3 and 4)
T
Number of generations per time step (Eq 3)
S
Linear dimension of a deme (Eq 4)
m0
Mutation rate of a single individual
s
Competitive advantage
D
Parameter defining the chance, per time step, that an allele jumps from one deme to a neighbour (Eq 1).
α
Assortative mating factor (Eq 1). Set to either 0 or 0.05.
ΔG
Number of genes that have different alleles in two neighbouring demes (Eq 1)
D0
Reference diffusion coefficient (equals D when α = 0). Always set to unity here (Eqs 1 and 7).
p
Advantage factor, defining the rate at which an allele "1" spreads at the cost of its counterpart "0" (Eqs 2,4,6 and 7). Varied from 0.0125 to 0.05 in Fig 1; set to 0.05 in other figures.
F
"Fitness" of a genome: the sum of all alleles in one deme's genome (Eq 2).
ΔF
"Fitness" difference between two neighbouring demes (Eq 2).
ρ
Population density in number of individuals per km2 (Eq 4)
Amut
Area in number of demes occupied by a mutation (a gene with the allele "1" instead of "0") (Eq 5)
req
Radius of a circle with the same area as that occupied by a mutation (Amut) (Eqs 5 and 6)
v
Spreading velocity of a mutation (Eq 5)
Lif
Length of the perimeter of an area occupied by a mutation (Eq 6)
Pfix
Mean chance of a mutation to reach fixation in whole model
Nfix
Number of mutation that have reached fixation in whole model
tfix
Mean time for a mutation to reach fixation in whole model
f
Frequency of the occurrence of sweeps of a certain area (Eq 8)
An
Normalised area of a sweep (number of swept demes/total number of populated demes in model) (Eq 8)
q
Exponent in power-law distribution of sweep areas (Eq 8)
Spreading rates of mutations.
Distribution of a single mutation originating from the centre of a circular model (R = 100 demes) at the stage where the effective radius of the area occupied by the mutation is 60 demes. A. p = 0.0125 at t = 990. B. p = 0.025 at t = 720. C. p = 0.05 at t = 410. The effect of increasing p is an increase in spreading rate and a sharpening of the spreading front. D. Interface length divided by effective circle circumference versus drift factor in a double-log plot. Data approximately follow a power law. E. As a result, spreading velocity versus advantage factor also shows a power-law relationship. F. Distribution of six mutations seeded at the centre of the model (R = 100 demes) at t = 0. Colours show number of mutations on a deme from one (pink) to six (black).
Mutations
Mutations are carried out at the start of each time step (See S2 Fig for the workflow). For each mutation, a deme is first randomly picked. Next, a random gene that has allele "0" everywhere in the model is chosen. This gene will mutate by changing its allele from "0" to "1" in the deme. This procedure is repeated M times (the mutation rate) per time step. The mutation chance (m) per time step for one single deme is m = M/A, with A the area of the model. m is the chance that one mutation emanating from a single individual reaches fixation in the whole population (of size N) of a single deme in a time step of T generations. m thus depends on T, N (itself the product of population density ρ and area S2 of a deme) and the intrinsic mutation rate m0 of one individual per generation. The number of mutations that occur in one deme per generation is proportional to N·m. However, the chance that a mutation reaches fixation within the population is roughly inversely proportional to N for s≈0, with s the competitive advantage [58]. It follows that the rate of successful mutations within the deme is proportional to m0, and is not dependent on the population size of the deme. The model, however, uses time steps T of more than one generation. It will be shown below that T∝N, and hence, m∝m0·N.We use the term "active mutations" for those mutations that have not reached fixation, i.e. they occupy at least one, but not every deme in the map of all demes. Once a mutation has reached fixation by occupying every deme in the model it no longer plays an active role. The timing of fixation of a mutation is recorded, as well as the location of origin of the mutation.
Allele exchange between neighbours
A mutation step is followed by one round of interbreeding in which single alleles may be transferred between neighbouring demes (here using a Von Neumann direct neighbourhood scheme). All demes are considered on average once in a random order for transfer between a deme (deme a) and a randomly selected direct neighbour (deme b). Each pair is thus treated every two steps on average. The chance that a transfer between neighbouring deme is considered in one time step is defined by the parameter D. It can be regarded as a general diffusion parameter: the chance, per time step, that an allele jumps from one deme to a neighbour by following a random Brownian movement [58-61]. Assortative mating [62] is included in a way similar to the model of Barton and Riel-Salvatore [54]. With assortative mating, similar individuals are more likely to mate than different ones. This is implemented with the "assortative mating factor" (α), which reduces D as a function of ΔG, which is the number of genes that have different alleles in the two neighbouring demes under consideration:
D0 is the reference diffusion parameter, equal to D for the case that α = 0. It is first decided with a random-number generator whether a gene transfer will take place or not, according to (Eq 1). If this is to happen, all genes in the list are considered for allele transfer. The chance P(a→b) that an allele of the genome of deme a is copied to deme b is calculated with:ΔF is the difference in number of mutations (ones) between the two demes. The chance that a deme passes on its mutations to a neighbour is thus determined by the overall number of mutations relative to that neighbour (ΔF) and the advantage factor p that defines the competitive advantage of these mutations (assumed the same for all mutations). In this paper we define the sum of all genes (F = (G)) as the "fitness" of a deme. This is because the number of advantageous genes (F), multiplied by the advantage factor (p) determines the chance that genes are passed on to offspring. When p = 0, mutations are neutral and there is no preferential transfer of alleles and we have purely random spreading of zeros and ones, but, on average, no change in their frequency. A positive value of p leads to fitter demes to pass on their alleles to their neighbours. Note that the transfer chance of a single allele does not only depend on its own value, but on that of the whole genome. This can potentially lead to less competitive zeros replacing more competitive ones.To estimate the duration of one time step, we can consider the population of the two neighbouring demes as one during the transfer. Of interest is the case where the mutation only occurs in one of the two demes, i.e. when it is carried by 50% of the combined population. The time step T (in number of generations) is thus the time it takes for a mutation to reach fixation by genetic drift in the combined population with a chance of 0.5+p or extinction with a chance of 0.5-p. Using a basic Monte Carlo model for populations of up to a few hundred individuals, we find for the time step T (Equation C in S1 File):
Here, S is the size of a deme and ρ the population density. (Eq 3) only holds for weakly competitive mutations (s<<1), small values of p, and no variation in population density between demes. The factor p is related to s, the competitive advantage of the mutation [58-61,63] (Equation B in S1 File):
At a deme size of S = 50 km and a population density of 0.01 individuals/km2, T is about 175 generations (≈4000 years at 25 years/generation) for D0 = 1 as used throughout this paper. Using p = 0.05 results in a competitive advantage of s≈0.7%.
Spreading of mutations
A single mutation in a field of demes without any other mutations has a chance of 0.5-p to disappear the first time an interbreeding event is considered with one of its neighbours. The initial survival rate is thus a function of p. The low initial survival rate is due to two factors. The first is a numerical effect that the width of diffusional front is less than can be resolved at the scale of the demes. The second is related to genetic drift [64-65], where the effective population (cluster of demes around the new mutation) is small and therefore the chance of survival smaller than when the mutation has spread over a large area.Once a mutation has survived this initial nucleation phase by spreading over sufficient demes, the area occupied by the deme increases linearly with time. The expansion front is not sharp, but diffuse, in accordance with the diffusion-reaction model of [59]. The width of the diffusional front decreases with increasing p (Fig 1A–1C) as the advantage factor becomes more important relative to the diffusional spreading. After the initial nucleation phase, the area (A) occupied by the single mutation increases linearly with the square of time. We define the velocity (v, in deme size per time step) of the expanding front as the rate of increase in radius (r) of an equivalent circle with area A:
Fig 1
Spreading rates of mutations.
Distribution of a single mutation originating from the centre of a circular model (R = 100 demes) at the stage where the effective radius of the area occupied by the mutation is 60 demes. A. p = 0.0125 at t = 990. B. p = 0.025 at t = 720. C. p = 0.05 at t = 410. The effect of increasing p is an increase in spreading rate and a sharpening of the spreading front. D. Interface length divided by effective circle circumference versus drift factor in a double-log plot. Data approximately follow a power law. E. As a result, spreading velocity versus advantage factor also shows a power-law relationship. F. Distribution of six mutations seeded at the centre of the model (R = 100 demes) at t = 0. Colours show number of mutations on a deme from one (pink) to six (black).
The spreading velocity (v) as a function of p is determined from 2500 simulations of an expanding mutation after a stable diffusive front has been established (50The exponent is slightly larger than 0.5 due to the fact that the spreading front becomes less fuzzy with increasing p. This means that demes at the front have fewer neighbours without the mutation when p is large than when it is small.The initial spreading velocity is higher when multiple mutations are placed in the centre of the model (Fig 1F). This is expected, because ΔF>1 in (Eq 2). However, the steady-state velocity of these mutations is the same when the diffusion front is wide enough (low p). The explanation is that demes within this diffusive front mostly only have a low ΔF with their direct neighbours and thus spread as fast as a single mutation. This implies that gene surfing due to high ΔF is not effective during steady-state spreading of an ensemble of mutations. However, spreading rate increases when fitness gradients or clines are high due to a high mutation rate, or when two different populations, with high ΔF, would suddenly come into contact.
Replacement events
Full genome transfers or replacement events are considered after the interbreeding step. Again, all demes and one random direct neighbour are considered in a random order. A full genome replacement is carried out if the absolute fitness difference |ΔF| is equal or greater than a set critical fitness difference ΔF. In that case, the full genome of the fittest deme is copied to that of its neighbour. After one round where all demes and one random neighbour are considered once for a replacement, the replacements may have led to new pairs that exceed ΔF. The routine is therefore repeated until ΔF<ΔF everywhere in the model. When Δt is small, this semi-instantaneous spreading would be relatively fast (up to the order of a km/yr, depending on the size of demes). However, we use this scheme here to be able to track individual replacement "avalanches" within one time step. Contiguous areas that experienced a full genome replacement are termed "sweeps" here. The time and area of each sweep (A) is recorded at the end of each time step, as well as a map of all demes that experienced a genome replacement sweep.
Aims
The aim of this paper is to test the main types of evolutionary patterns that result from different combinations of diffusive spreading with competitive advantageous mutations, assortative mating and replacement waves. It is not our intention to systematically investigate and quantify the effect of each of the parameters. We therefore only evaluate four representative cases (Table 2), using a reference model with 3 square inhabited areas, connected by narrow isthmuses. The areas or "continents" are 100x100, 50x50, and 25x25 demes in size. In each individual simulation presented below we keep all parameters constant in space and time. This implies that any environmental factors that could affect the competitive advantage of individual mutations, their spreading rate, and the population density do not change spatially or in time. This serves the aim of this paper to investigate patterns that develop in the complete absence of any environmental or other external influences, with the exception of coastlines and mountain ranges that act as barriers. We consider this as a fundamental preliminary step in order to further discuss diversification processes during human evolution.
Table 2
Summary of the main effects of assortative mating and migration sweeps.
No assortative mating (α = 0)
With assortative mating (α >0)
No migration sweeps(ΔFcritnot reached)
Fig 2A. Spreading of mutations leads to gradual gradients in genetic signature (ΔG) and fitness (ΔF). Gradients are a function of the mutation rate (M) in relation to the demic diffusion rate (D0) in combination with the advantage factor (p), a function of the competitive advantage (s) of mutations. Fitness maxima (largest number of mutations) are found in the centre of large compact populated areas. Variation develops by isolation-by-distance, but no distinct subspecies develop.
Fig 2B. Spreading of mutations leads to gradual gradients in genetic signature (ΔG) and fitness (ΔF). With increasing ΔG, "nation-like" regions develop with sharp gradients at their boundaries. Genomes within regions are relatively homogeneous as mutations within a region can reach fixation within the whole region. Genomes within regions evolve relatively slowly as mutations do not, or only rarely cross regional boundaries. Size of regions depends on mutation rate (M), relative to the diffusion rate (D0) and advantage factor (p), and the assortative mating factor (α). Pattern is close to spatially static multi-regional or structured populations with multiple split events in populations that develop distinct genetic signatures.
Gradients in genetic signature (ΔG) and, hence, in fitness (ΔF) increase until they reach the threshold (ΔFcrit) for the initiation of migration sweeps. Migration sweeps follow a power law function for their frequency as a function of swept area (Eq 8), with many smaller sweeps for each large one. The very largest sweeps are more common than the power-law predicts ("dragon kings").
With migration sweeps(ΔFcritregularly reached)
Figs 3 and 8A. Migration sweeps bring originally spatially isolated genomic signatures in contact to allow for admixture in the absence of assortative mating. Admixture results in rapidly increasing genetic signature gradients that initiate new migration sweeps. Periods of gradual differentiation by isolation-by-distance are interspersed by shorter "bursts" of migration sweeps and a complete reorganisation of the genetic signature, resulting in a relatively homogeneous genetic signature in the whole populated area at the end of one cycle. Consistent with a punctuated equilibrium model with regular cycles.
Figs 4 and 8B. The amount of structuring of the population in distinct regional genetic types depends on the importance of assortative mating (α) relative to the threshold for the initiation of migration sweeps (ΔFcrit). These sweeps bring originally spatially isolated genomic signatures in contact with each other, but assortative mating inhibits/reduces admixture between these. Assortative mating therefore reduces the clustering in time of migration sweeps and reduces admixture. The latter allows the preservation of older genetic signatures, preferentially in the centre of large populated areas (e.g. Africa) that experience fewer migration sweeps. Consistent with multi-regionalism or structured populations, preservation of archaic signatures, as well the punctuated equilibrium model with the occurrence of both large and many small migration sweeps.
Time evolution of genome signatures over 1000 steps.
Left column shows the genome signature along the east-west x-y section through the centre of the three continents as a function of time along the vertical axis. Right column shows map views of genetic variations at stages 500 and 1000. Colours qualitatively represent variations in genome, with the red tones proportional to "fitness". Graphs show the fitness profiles along the x-y section at four time steps. A. In case of diffusion only, genetic signatures tend to emanate from the centre of the occupied areas and spread out towards the margins. B. When assortative mating is added, internally relative homogeneous "nation" regions with sharp boundaries develop from about t = 500 steps. As these boundaries inhibit spreading of mutations, overall fitness increases more slowly than without assortative mating.
Results and discussion
Diffusion effect
We first consider the case (Fig 2A) without genome replacements (ΔF = ∞) or assortative mating (α = 0). The mutation rate M is set to four mutations per time step and p is 0.05. Individual mutations spread leading to increasing variation in genomes within the model. This is comparable to the geographic differentiation through isolation-by-distance as proposed for the reticulate or multiregional evolution model [66-67] or the recent assimilation model [68-69]. Demes in the centre of the model are on average nearer to the origins of mutations than are demes on the periphery. Fitness therefore increases faster in the centre than the periphery, as can be seen in the fitness profile across the three "continents" (Fig 2A). After steady outward fitness gradients are established at about t = 250, genetic signatures migrate outwards, down the gradients. Signatures in the periphery are thus regularly overprinted by those coming from the centre.
Fig 2
Time evolution of genome signatures over 1000 steps.
Left column shows the genome signature along the east-west x-y section through the centre of the three continents as a function of time along the vertical axis. Right column shows map views of genetic variations at stages 500 and 1000. Colours qualitatively represent variations in genome, with the red tones proportional to "fitness". Graphs show the fitness profiles along the x-y section at four time steps. A. In case of diffusion only, genetic signatures tend to emanate from the centre of the occupied areas and spread out towards the margins. B. When assortative mating is added, internally relative homogeneous "nation" regions with sharp boundaries develop from about t = 500 steps. As these boundaries inhibit spreading of mutations, overall fitness increases more slowly than without assortative mating.
The effect of assortative mating
The effect of the assortative mating factor (α) is to reduce the rate of gene exchange when the genomes of neighbouring demes are different. Fig 2B shows the effect of assortative mating in the same model as Fig 2A, but with α set to 0.05. This means decreasing genetic exchange up to ΔG = 20, where exchange is reduced to zero. With initially low variations in genetic signature, the effect in the beginning is only to slow down differentiation. At about t = 500, first neighbouring demes cease exchanging alleles, which allows their ΔG to increase further. This finally leads to homogeneous regions, bounded by fixed, sharp borders. In Fig 2B these are visible as persistent, sharp changes in colour. New mutations cannot escape these "nations". Because the fixation time within a “nation” is smaller than for the whole model, these mutations can now spread significantly before more mutations occur, thus keeping ΔG and ΔF low within a "nation", while F for each "nation" keeps rising steadily. A large area, high mutation rate and low spreading rate (low D0 and p) all favour high values of both ΔF and ΔG (with ΔG≥ΔF). When these values remain too low, incipient borders shift and weaken again, which inhibits the establishment of permanent borders. This effect is visible in the medium and small "continents" that now behave as a closed system with highest fitness in the centre, but no internal "nation" borders. The development of "nations" or a structured population [70] results in a breakdown of the positive relationship (Fig 2A) between genetic signature and distance between points or isolation by distance [71]. This leads to a relative isolation of demes, which is strengthened through time.
Effect of replacement sweeps
The effect of replacement is again illustrated with the three-continent model (Fig 3), using the same p = 0.05 and M of four mutations per time step as before. ΔF is set at 10, so the genome of a deme is fully replaced by that of its neighbour if that neighbour has at least 10 mutations more.
Fig 3
Evolution of genetic signatures in case of replacement sweeps in the absence of assortative mating.
A. Temporal evolution along the line x-y through the centre of the model (see Fig 2). Sharp changes in colour indicate replacement sweeps. The main ones are highlighted in yellow. Graph shows the area (A, relative to total area) of individual sweeps as a function of time. Replacement sweeps are strongly clustered in time. Map views show the genetic distribution at five selected points in time. Direction and extent of selected sweeps at t = 274, 296 and 299 are shown by yellow arrows and white lines that trace the front at regular time intervals, respectively. These sweep events are highlighted with red circles in the A versus time graph. Doubling (B) or halving (C) the deme size (S) roughly doubles or halves, respectively, the average time interval between clusters of replacement sweeps, but not the general pattern.
Evolution of genetic signatures in case of replacement sweeps in the absence of assortative mating.
A. Temporal evolution along the line x-y through the centre of the model (see Fig 2). Sharp changes in colour indicate replacement sweeps. The main ones are highlighted in yellow. Graph shows the area (A, relative to total area) of individual sweeps as a function of time. Replacement sweeps are strongly clustered in time. Map views show the genetic distribution at five selected points in time. Direction and extent of selected sweeps at t = 274, 296 and 299 are shown by yellow arrows and white lines that trace the front at regular time intervals, respectively. These sweep events are highlighted with red circles in the A versus time graph. Doubling (B) or halving (C) the deme size (S) roughly doubles or halves, respectively, the average time interval between clusters of replacement sweeps, but not the general pattern.In the absence of assortative mating (α = 0), fitness increases steadily, especially in the centre of the model, until gradients exceeding ΔF are reached (at t = 274 in Fig 3A). This typically happens somewhere between the centre and the margin. This is because, although fitness is highest in the centre, gradients are generally low here. Gradients are highest near the margins, where fitness is lower than in the centre. As a result, replacements first sweep the margins of the model (t = 274), skirting the high-fitness centre (for example at t = 296). Demes in the swept area with a single genome subsequently exchange alleles with the unswept demes, which rapidly leads to genomes with enhanced fitness again, and, hence, new sweeps (t = 299). A rapid succession of admixture and replacement sweeps leads to homogenisation of the genome over the whole area. Although the new global genome is closest to that of the centre of the model, there are significant changes by admixture during the various successive sweeps.After homogenisation of the genome, it takes some time for gradients to develop again to initiate a new cycle of replacement sweeps. This leads to a regular cycle of 200–300 time steps of differentiation without any sweeps, followed by a rapid succession of many small and a few large sweeps that sweep almost the whole model area (Fig 3A). This pattern is in line with the punctuated-equilibrium model [72-74]. Reducing the resolution by a factor two, while keeping all other parameters the same, implies reducing the population density and the frequency of mutations in the model by a factor four. Gradients now increase at a lower rate and the duration of a full cycle is roughly doubled (Fig 3B). Doubling the resolution has the opposite effect and leads to a reduction of the cycle time (Fig 3C). Independent of resolution, admixture results in several large sweeps that together reset the genomes in the whole area.Adding assortative mating (α = 0.05) to the previous simulation significantly changes the evolution of the model (Fig 4A). After the initial differentiation period, first sweeps occur and, again, mostly sweep the margins. Contrary to the previous case where α = 0, the sweeping genome is now unlikely to interbreed with fit demes at the edge of the swept area owing to their large ΔG. New sweeps are thus not immediately triggered for lack of admixture, and sweeps are less clustered in time. Demes in the centre are rarely or even never swept, providing a genetic continuity here. These demes have a higher fitness than the surrounding homogenised swept areas, and thus have a higher chance to initiate future sweeps. Marginal areas show distinct extinction events, as can be seen by distinct colour changes in Fig 4B. Halving or doubling the resolution has the expected effect of increasing, respectively decreasing the time between sweeps (Fig 4B and 4C).
Fig 4
Evolution of genetic signatures in case of replacement sweeps as in Fig 3, but now with additional assortative mating.
A. Temporal evolution along the line x-y through the centre of the model (see Fig 2). Sharp changes in colour indicate replacement sweeps. The main ones are highlighted in yellow. Note the overall continuity in time of genomes in the centre of the large continent that is rarely swept by migration waves. Graph shows the area (A, relative to total area) of individual sweeps as a function of time. Replacement sweeps are less clustered in time than in case of no assortative mating. Map views show the genetic distribution at five selected points in time. Direction and extent of three selected sweeps at t = 266, 360 and 415 are shown by yellow arrows and white lines that trace the front at regular intervals, respectively. These sweep events are highlighted with red circles in the A versus time graph. Doubling (B) or halving (C) the deme size roughly doubles or halves, respectively, the average time interval between clusters of replacement sweeps, but not the general pattern.
Evolution of genetic signatures in case of replacement sweeps as in Fig 3, but now with additional assortative mating.
A. Temporal evolution along the line x-y through the centre of the model (see Fig 2). Sharp changes in colour indicate replacement sweeps. The main ones are highlighted in yellow. Note the overall continuity in time of genomes in the centre of the large continent that is rarely swept by migration waves. Graph shows the area (A, relative to total area) of individual sweeps as a function of time. Replacement sweeps are less clustered in time than in case of no assortative mating. Map views show the genetic distribution at five selected points in time. Direction and extent of three selected sweeps at t = 266, 360 and 415 are shown by yellow arrows and white lines that trace the front at regular intervals, respectively. These sweep events are highlighted with red circles in the A versus time graph. Doubling (B) or halving (C) the deme size roughly doubles or halves, respectively, the average time interval between clusters of replacement sweeps, but not the general pattern.Figs 3 and 4 show that marginal areas, in particular the small continent experience more sweeps than the centre of the area inside the large continent. The simulations shown in Figs 3A and 4A were also run for 10,000 steps, recording each time a deme was swept. Fig 5A shows that the chance for the two smaller continents and the margins of the large continent to be swept is about 1.5 times higher than in for the centre of the large continent in the absence of assortative mating. In case of assortative mating, the effect is even stronger. Demes in the centre of the largest continent thus have a much higher chance to be preserved, as these demes are rarely swept. This also affects the survival chance of mutations. The origins of mutations that reached fixation are plotted in Fig 5B. We see that these are strongly concentrated in the centre of the large continent. While this continent occupies 76% of the whole model area, it is the origin of >99% of all mutations that reached fixation. The medium continent with 19% of the area delivered only <1% of all mutations that reached fixation and the small continent not a single one. The chance of a mutation from the medium continent to reach fixation is only 3.5% that of a mutation in the large continent in the simulation without assortative mating. In the simulation with assortative mating this change reduced to 1.7%. Replacement sweeps thus strongly favour the survival of mutations from the centre of the largest populated landmass but not from the medium or small ones.
Fig 5
Origin of fixed mutations, as well as distribution and directions of sweeps.
A. Relative frequency that a deme is swept by a migration wave. Demes on the margins and small continents are swept more often than demes in the centre of the large continent. These patterns are more pronounced in case of assortative mating (below) relative to the run without assortative mating. B. Origin of mutations that reached fixation shown at black dots on the map. Mutations that form in the middle of the large continent have a much larger chance of reaching fixation in the whole model area than mutations deriving from the margins, especially the small continent. C. Average directions of migrations within sweeps. Setting as in Figs 3A (top) and 4A (bottom), run for 10,000 steps. C.
Origin of fixed mutations, as well as distribution and directions of sweeps.
A. Relative frequency that a deme is swept by a migration wave. Demes on the margins and small continents are swept more often than demes in the centre of the large continent. These patterns are more pronounced in case of assortative mating (below) relative to the run without assortative mating. B. Origin of mutations that reached fixation shown at black dots on the map. Mutations that form in the middle of the large continent have a much larger chance of reaching fixation in the whole model area than mutations deriving from the margins, especially the small continent. C. Average directions of migrations within sweeps. Setting as in Figs 3A (top) and 4A (bottom), run for 10,000 steps. C.Directions of sweeps without (settings of Fig 3A) and with assortative mating (settings of Fig 4A) were recorded for simulations running 10,000 steps. Mean sweep propagation directions can be determined from this, and in turn, mean migration paths (Fig 5C). Migration paths consistently emanate from the centre of the large continent and lead to its margin and to the smaller continents. Migration directions are more consistent in case of assortative mating, resulting in a more consistent pattern of paths in the centre of the large continent.
Fixation rate variation
After an initial period in which the system settles to a dynamic equilibrium, the number (N) of mutations that reach fixation (i.e. spreading over entire model area) increases linearly with time (Fig 6). In the absence of replacement sweeps, N increases steadily, whereas the N-time curve is stairway-like with replacement sweeps. This is because large replacement sweeps spread some mutations over large areas, resulting in a sudden, but temporary increase in fixation events. As the fitness landscape is flattened after these events, few mutations can subsequently reach fixation until the process is repeated again.
Fig 6
Number (N) of mutations that reached fixation as a function of time with linear regression lines (for t>600 steps).
Main graph shows the first 1000 time steps, while the inset shows graphs for the full 5000 time steps on which the linear regressions are based. Intersection of the linear regressions is the mean time to fixation (t), while the slope is the rate at which mutations reach fixation. Replacement sweeps reduce the fixation chance of mutations by 60–70%, but also their t by 3–5 times.
Number (N) of mutations that reached fixation as a function of time with linear regression lines (for t>600 steps).
Main graph shows the first 1000 time steps, while the inset shows graphs for the full 5000 time steps on which the linear regressions are based. Intersection of the linear regressions is the mean time to fixation (t), while the slope is the rate at which mutations reach fixation. Replacement sweeps reduce the fixation chance of mutations by 60–70%, but also their t by 3–5 times.The time-averaged fixation chance (P) of an individual mutation is derived from the slope of the N-time curve. P is significantly lowered by replacement sweeps. With interbreeding only, advantageous mutations are usually added to neighbouring genomes and few are lost by genetic drift before fixation, except at the nucleation phase just after the mutation occurred. Mutations in demes that are swept by replacement sweeps are lost, thus reducing the number of mutations that reach fixation [75]. Each wave causes a founder event, where only the limited genetic sample suddenly spreads over a large area [75-77]. The effect is more pronounced in case of assortative mating. The intersection of the linear regression of the N-time curve with the horizontal time axis (Fig 6) gives the mean time to fixation (t) for mutations that reach fixation. t is about 4 times smaller in case of replacement sweeps than without these. This is to be expected, as replacement sweeps provide an efficient means to spread mutations over the map.The reduced t resulting from replacement sweeps has the advantage that a species can more quickly adapt to changes in the environment. This would cause some mutations to lose, and other to gain competitiveness. The latter can spread quickly in case of replacement sweeps. However, an inclination of a species towards replacements (low ΔF) comes at a cost. Replacement sweeps imply that part of the population is excluded and inhibited from further contributing to the species through their offspring. Furthermore, our simulations that are intentionally without any environmental changes show that a low ΔF also leads to spontaneous replacement sweeps in the complete absence of any external factors.
Sweep area statistics
Although the largest sweeps are the most conspicuous in Figs 3 and 4, these are accompanied by many more smaller sweeps. Fig 7 shows the frequency (f) of sweep areas versus their normalised area (A = sweep area/model area). We see that that the data plot on a straight line in the double-log plot, indicating a power-law relationship between f and A, with an exponent -q:
at least when A is small (up to a few per cent of the total area). A power law (Eq 8) was fitted to the data for A<0.01 from simulations shown in Figs 3 and 4, but run for up to 10,000 steps, resulting in q-values ranging from 1.81 to 2.09. Frequencies were normalised such that the power-law best fit frequency for A = 1 is unity in each of the six simulations. All these normalised data together overlap remarkably well (Fig 7). We obtain q = 1.84 when applying a best fit to all data with A<0.01. Normalised sweep areas >0.02 are overrepresented, with their frequencies up to >10x higher than the power-law trend. Notwithstanding this, small sweeps are several orders of magnitude more common than the largest ones.
Fig 7
Normalised frequency distribution of areas swept by migration waves plotted against these areas (divided by total area) in a double-log plot.
Data follow a power-law, except for the very largest sweeps. Dots represent the simulations with the schematic 3-continent map shown in Figs 3 and 4, but with 10,000 time steps.
Normalised frequency distribution of areas swept by migration waves plotted against these areas (divided by total area) in a double-log plot.
Data follow a power-law, except for the very largest sweeps. Dots represent the simulations with the schematic 3-continent map shown in Figs 3 and 4, but with 10,000 time steps.The observed power-law relation is a hallmark of the self-organised criticality (SOC) model of Bak et al. [78] (but also see [79]) that has been used [74,80] to explain punctuated equilibria [73]. The classical SOC model is the (numerical) sand pile in which grains are randomly sprinkled on a stage. Once critical heights of grain piles, or gradients in these heights are reached, grains are redistributed locally. One redistribution event can lead to neighbouring sites reaching the criterion for redistribution, sometimes leading to large "avalanches". Sizes of these avalanches typically follow power-law distributions [79,81], as the critical state has no intrinsic time or length scale [78]. The current model is similar to the classical sand-pile model, as mutations are "sprinkled" on the map of demes. There is a criterion for redistribution (ΔFcrit), which leads to replacement sweeps that indeed follow a power-law frequency distribution (Fig 8). A first difference with the standard sand-pile model is that, contrarily to grains, mutations can multiply. The second is that the model includes diffusion as an additional transport mechanism for mutations. Models with two transport channels, one fast avalanche-like and one diffusional, have been applied to fluid flow through pores and fractures [82-83], earthquake evolution [84-85], and heat transport in plasmas [86]. These models show that such systems still exhibit SOC-characteristics, as long as the criterion for the fast transport is frequently reached. However, with increasing importance of diffusion, avalanches become more regularly spaced in time and larger, isolated events (so-called "dragon kings" [87-88]) become more common [86]. This behaviour is indeed observed in our "mutation-pile" model, where large sweeps are over-represented compared to small ones and there is strong (α = 0) and weak (α = 0.05) cyclical behaviour with periods of semi-stasis (gradual diffusional differentiation), alternating with short periods of replacement sweeps. As such the model shows punctuated-equilibrium behaviour.
Fig 8
Example of applying the model for 10,000 steps with settings as in Figs 3A and 4A to a map of the Old World.
Top row without (α = 0) and bottom row with assortative mating (α = 0.05). A. Mean fitness of demes over the whole simulation, showing that demes in Central Africa are, on average, fitter than the minimum at the margins of the populated area. B. Origin of all mutations that reached fixation. Most (α = 0) or all (α = 0.05) these mutations originated in Africa. C. Number of times that a deme has been swept by a migration wave. Demes in Central Africa experienced significantly fewer sweeps than the rest of the world, especially in case of α = 0.05. D. Mean migration paths, all emanating from central Africa and diverging towards the periphery of the continent and into Asia.
Example of applying the model for 10,000 steps with settings as in Figs 3A and 4A to a map of the Old World.
Top row without (α = 0) and bottom row with assortative mating (α = 0.05). A. Mean fitness of demes over the whole simulation, showing that demes in Central Africa are, on average, fitter than the minimum at the margins of the populated area. B. Origin of all mutations that reached fixation. Most (α = 0) or all (α = 0.05) these mutations originated in Africa. C. Number of times that a deme has been swept by a migration wave. Demes in Central Africa experienced significantly fewer sweeps than the rest of the world, especially in case of α = 0.05. D. Mean migration paths, all emanating from central Africa and diverging towards the periphery of the continent and into Asia.Although much scientific attention is devoted to large migration and extinction events, such as the expansion of AMH and the extinction of Neanderthals and Denisovans [26,32,45,69], smaller events have been recognised as well in Africa and Europe [28–31, 89–93]. The model predicts that if large sweeps can occur, a whole hierarchy of replacement sweeps of smaller ones should also occur. Unfortunately, the smaller these are, the more difficult these will be to discern in the fossil record. In view of the results presented here, the aforementioned evidence for such events in recent (Neolithic) times in Europe and Africa may be regarded as supporting the hypothesis that small events were also common earlier in human evolution. Such small events should not be discarded as not important because of their local extent. Each time a sweep occurs and populations in the swept area become extinct, part of the genetic diversity is lost. This affects the accumulation rate of mutations and hence the molecular clock rate, which by it self is already difficult to determine [24,94-95]. Not recognising migration sweeps may lead to additional uncertainty in estimated molecular clock rates and resulting split times of phylogenetic split events, or in the estimate of the size of founder populations.
Parameter settings
The question of what the appropriate parameter settings (M, p, α, ΔF) are can now be discussed as the effect of the various parameters was shown above. The advantage factor p determines the spreading rate of advantageous mutations by demic diffusion only. It depends on the competitive advantage (s) and population (N) of a deme, and, hence, on the assumed population density (ρ) and size of the demes (S). In combination with the mutation rate (M) it thus also determines the magnitude of gradients (ΔF) in the model. Actual values for M and s are largely unknown and in fact the "numerical mutations" would represent a number of real mutations with an effective competitive advantage of the ensemble. Depending on the choice of M and model resolution, p should be chosen such that the spreading rate is realistic. The scale of the three-continent model is comparable with that of the real world if we assume a deme size of S = 50 km. In that case the largest square continent would have an area of 25·106 km2, comparable with Africa at ca. 30·106 km2 and the middle one 6.25·106 km2, somewhat smaller than Europe at ca. 10·106 km2. The time step for a population density of between 0.1 and 1 individuals per km2 results in a time step between Δt≈400 y and 4,000 y (Eq 2). The simulations in Figs 2 and 3 would then range between 400 ky and 4 My. Using M = 4, p = 0.05 results in spreading of mutations over significant distances (up to thousands of km) in periods of around a hundred thousands of years, but only when population density is low. Choosing a higher value for M would need to be compensated by a lower value of p to achieve the same spreading rate. The value of M also determines memory use and computation time, as does increasing the spatial resolution of the model (which reduces the time step; Eq 2). The choice of p thus depends on the assumed spreading rate of mutations by demic diffusion, in combination with the desired resolution in genomic signatures, time and space, bearing in mind computational capacities in terms of memory use and calculation time.The choice of ΔF depends on those of M, p and S, since these determine the magnitude of gradients and the time for these to develop. The number of migration sweeps is inversely proportional with ΔF. Although the total number of sweeps is unknown (as many are too small to be resolved in the fossil or archaeological record), the number of very large ones (such as the spread of AMH) may be constrained to one or a few in the last few hundred thousands of years. This constraint enables setting the ΔF to achieve the desired frequency of large events. The assortative mating factor (α) should also be set according to the various parameters that determine the magnitude of gradients. The choice of α depends on the expected efficacy of assortative mating and the size of homogeneous areas ("nations") that develop as a result. With the settings used here, α = 0.05 results in homogenous areas that span in the order of hundreds to thousands of kilometres (Fig 2B), when not disturbed by replacement sweeps (Fig 4). In the current approach (Eq 1), assortative mating completely inhibits admixture between adjacent regions when |ΔG|≥1/2α. Genetic analyses, however, indicate admixture between hominin species, such as Neanderthals, Denisovans and AMH [24,96-97], but also admixture of more archaic into more modern AMH in Africa [29,98]. This could be included in the future by replacing (Eq 1) with an exponential function for the decay of genetic exchange as a function of increasing |ΔG|, in which case the exchange probability never reaches zero. It should, however, be borne in mind that the purpose of this study is not to determine what the specific parameters are for human evolution, but to show what the effects of the different parameters are on the patterns of evolution.
Application to a world map
Having assessed the effects of, and patterns resulting from the different parameters on a schematic map with three continents, we now briefly apply the model to a real-world map. For this purpose we used a Fuller-projection map of the Old World (Fig 8), roughly adapted to ice age conditions by linking Japan and the British Isles to the mainland and assuming that large parts of northern Europe and Asia are (effectively) not inhabited. Although population densities (ρ) would never have been equal throughout the inhabited area, we maintain the assumption of a constant ρ, but excluded high-elevation areas (especially Tibet and the Pamirs) and desert areas in North Africa and the Arabian Peninsula, adapted from Eriksson et al. [39] for ~21 ka BP. In this configuration inhabited areas in Africa occupy 44% of the whole inhabited area. Inhabited areas evidently varied over time, but this simplified model serves the purpose of predicting the main expected patterns. Deme size was set at 50x50 km, resulting in 20808 populated demes, and settings were equal to those for the simulations shown in Figs 3A and 4A. At a time step of about 4000 years (for ρ = 0.01 individuals/km2), p = 0.05 would lead to a mutation spreading rate of 0.002 km/yr (Eq 5). Replacement sweeps in the current model take place within one time step. The maximum distance to travel, from South Africa to Japan is about 17,500 km, resulting in a maximum sweep velocity of ~4 km/yr. Such high velocities, however, only apply to the few very largest sweeps. To achieve good statistics, the models are run for 10,000 time steps, thus representing ca. 40 million years, which is in the order of about 40 real-world evolutions.In both simulations, with and without assortative mating, mean fitness is highest in central Africa (Fig 8A). The fitness maximum is most distinct in case of α = 0, because the long and regular interval between clusters of sweep events allow large-scale gradients to develop. Mutations that reach fixation mostly come from central Africa (Fig 8B). When α = 0.05 no mutations that originated outside of Africa reach fixation. When α = 0, a few mutations from Asia reach fixation, because sweeps from Africa can trigger "counter" sweeps after admixture with genomes from the margin of the swept areas. The chance that a deme in central Africa is swept by a migration event is higher in case of α = 0 than when α = 0.05 (Fig 8C). Despite these differences, migration directions are mostly emanating from central Africa in the direction of the margins of the occupied area (Fig 8D).Sweep area frequencies follow the same power-law distribution (Eq 8 and Fig 9A) as in the abstract 3-continent model. Largest sweeps are again over-represented relative to the power-law trend for smaller sweeps. The effect becomes noticeable for sweeps that are larger than a few per cent of the total area. This is about 1/3 the area of Europe. The frequency distributions show two distinct peaks. One is at the area of Japan, which in the model forms a narrow peninsula connected to Asia. Sweeps apparently initiate at the connection with the peninsula (where mutations rarely occur) and then tend to sweep the whole peninsula. The second peak is at about 56% of the total area, which equals the area of Eurasia. This indicates that isthmuses, such as the Sinai Peninsula, play an important bottleneck role and sweeps from Africa that entered Asia have an increased chance of then sweeping the whole of Eurasia.
Fig 9
Sweep statistics of simulations for the old world.
A. Graph of normalised frequency as a function of sweep area. B. Cumulative graph of number of sweeps against sweep area. About 1% of all sweeps are the size of Eurasia (56% of total area) or larger. C. Cumulative graph of the areas swept as a function of sweep area. About 50% of all individual deme replacements result from sweeps are the size of Eurasia (56% of total area) or larger.
Sweep statistics of simulations for the old world.
A. Graph of normalised frequency as a function of sweep area. B. Cumulative graph of number of sweeps against sweep area. About 1% of all sweeps are the size of Eurasia (56% of total area) or larger. C. Cumulative graph of the areas swept as a function of sweep area. About 50% of all individual deme replacements result from sweeps are the size of Eurasia (56% of total area) or larger.The cumulative number of sweeps as a function of area (Fig 9B) shows that sweeps of the size of Eurasia or larger represent about 1% of all sweeps. However, these few sweeps are responsible for about 50% of all individual deme replacement events over time (Fig 9C). If in the past there were one or two major out-of-Africa sweeps, one can deduce that there were in the order of 100–200 replacement sweeps in total. The average number of sweeps that an individual population in one single deme would have experienced would be about 2–4, double the number of very large sweeps. Two to four replacement events in a million years, i.e. ~40,000 generations, means that individuals in a deme have a chance in the order of 0.005–0.01% of experiencing a replacement sweep in their lifetime. Although sweeps thus rarely affect individuals, they have a profound effect on the evolution of the human genome, because about half these locally experienced sweeps are part global-scale sweeps that span a significant part of the whole populated Old World.
Future extensions of the model
The model employed here intentionally excluded environmental or other external factors, with the exception of geographical barriers, such as coastlines, deserts and high mountains. This does not imply that the model is restricted to this simplification. Incorporation of environmental factors requires population densities that can vary in space and time. It is envisaged to include a carrying capacity, which is a function of environment, but potentially also of the genetic signature. For this purpose, mutations could be assigned variable "fitness", depending on the environment. These factors can be used to determine population density changes. Eqs (1) and (2) are currently based on the assumption that two neighbouring demes have the same population. These equations can be adapted to relax this assumption. The framework of the model is furthermore amenable to multiple competing populations or species that occupy a single deme. These extensions of the model are not restricted to human evolution. However, the model is particularly suitable for studying the emergence of modern humans, because mutations in the numerical genome may also represent cultural traits or innovations. These can be assigned their own mutation and diffusion rates, advantageousness (possibly related to environment), as well as effect on assortative mating and initiation of replacement waves. This will allow exploring the interaction between biological and cultural evolution [99-102].
Conclusions
Diffusional gene flow alone leads to a homogenisation of all populations [103]. However, ongoing mutations continuously produce local variations that take time to spread over the whole populated area. The combination of diffusion and mutations thus results in isolation by distance [71], with differences between local populations increasing with distance (Fig 2A). The magnitude of these differences or clines depends on the balance between diffusion and mutation rate. The effect of assortative mating is to create a structured population [70] with relatively homogeneous "nation" regions, separated by distinct clines. While Scerri et al. [70] argue that the structuring is due to environmental and ecological drivers, our model (Fig 2B) shows that population structuring could be developed due to the neutral process of assortative mating without any such additional drivers. Without other evolutionary mechanisms, assortative mating reduces or inhibits exchange between regions and this exchange is restricted to neighbouring regions. Migration events are an efficient way to bring populations from far-removed regions into contact. Ensuing exchange across such new contacts leads to reticulate phylogenies [67].Our model shows that, if population/species replacements do occur, replacement sweeps of all sizes up to the whole populated area are expected. The basic reason is that if one group of individuals can take over the area of their neighbours, the chance that they (or their offspring) can also take over the next area is larger than zero. This can, but must not, lead to "avalanches" of replacements that can span up to the whole populated area. Replacement-sweep-area frequencies systematically decrease as a power-law function of their area. For every world-spanning sweep, about two orders of magnitude smaller sweeps are to be expected, down to the size of one deme in the model. As expected, isthmuses appear to play a special "bottle neck" role during expansions. Once a sweep crosses an isthmus, there is an increased chance that the whole peninsula beyond is swept in its entirety. In simulations this leads to a distinct peak in the chance that the whole of Eurasia is swept in at an out-of-Africa event.Replacement sweeps reduce the chance of fixation of mutations, but also significantly reduce the time to fixation of those mutations that do finally reach fixation. The propensity of a population to usurp the area of its neighbours if these are, for some reason, less competitive, has the benefit that advantageous mutations can quickly spread, for example after a change in environmental conditions. However, our simulations show that this propensity inevitably also leads to spontaneous replacement sweeps that are not triggered by any external factors but driven by genetic drift.Our simulations show that replacement sweeps mostly emanate from the largest consolidated populated area. In the Old World this is Africa, especially during glaciation stages. The simulations indicate that the most likely origin for modern humans lies somewhere in (central) Africa, in line with what is deduced from the fossil record (e.g. [104]). However, East Asia also forms large and compact populated area, especially during warm periods, that would have been a second probable source for replacement sweeps. This emphasizes the need for further palaeoanthropological research in East Asia (e.g. [5,105]).Large migration sweeps generally emanate from the central regions of large compact areas and spread towards the margins. The spreading directions are mostly determined by coastlines, mountains and other uninhabited areas. This leads to a remarkable consistency of the directions, quite independent of the parameter settings (Fig 8C). The tendency to migrate towards coasts is consistent with the beachcomber model [106]. The spreading of a migration wave is more like that of an inkblot than along narrow, bifurcating migration paths that some authors envisage [107].Mutations that arise in the centre of large compact areas have the highest change to survive and spread. This is not only because these areas are the probably sources of migration sweeps, but they are also the least likely to be swept themselves (Fig 8C). This pattern illustrates the relevance of the central areas as potential sources of new variants, but also in preserving old traits.Considering a combination of semi-stasis periods alternating with replacement sweeps as a result of large-scale and many smaller expansions may contribute to understand the current biological distribution of human populations, as well as the variation in the hominin fossil record. For this reason we suggest an integration of concepts coming from punctuated equilibrium theory [72,74,79-80], multiregional postulates [25,66– 67,70,93], and current out-of-Africa migration models. Any migration wave that spanned the whole world is most likely to have come from Africa. If large migration waves can occur, more numerous smaller waves are to be expected too. Multiple out-of-Africa waves are, therefore, most likely if one happened. Although the propensity for replacement waves makes a species more adaptable to changes in its environment, a "side effect" is that such waves then also occur spontaneously. A spontaneous emergence and spread of modern humans from Africa should thus be regarded as a null hypothesis against which any hypothetical causes should be tested.
Determination of the relationship between p and s, as well as the model time step.
A. The factor p as a function of competitive advantage s and population size N. Each data point is represents the average 10,000 simulations for a give N and s. Slopes (dP/ds) are determined by linear least-squares best fits (dashed lines). B. A regression (dashed line) shows that dP/ds is a linear function of N. C. Modelled mean time to fixation of a neutral mutation is found to be a linear function of N.(TIF)Click here for additional data file.
Workflow of the numerical implementation of the model.
Individual loops are given in different colours.(TIF)Click here for additional data file.
Supporting information on methods.
The text explains the relationship between advantage factor (p) and competitive advantage (s) and how the duration of one time step (Δt) is derived.(PDF)Click here for additional data file.
Authors: Cosimo Posth; Gabriel Renaud; Alissa Mittnik; Dorothée G Drucker; Hélène Rougier; Christophe Cupillard; Frédérique Valentin; Corinne Thevenet; Anja Furtwängler; Christoph Wißing; Michael Francken; Maria Malina; Michael Bolus; Martina Lari; Elena Gigli; Giulia Capecchi; Isabelle Crevecoeur; Cédric Beauval; Damien Flas; Mietje Germonpré; Johannes van der Plicht; Richard Cottiaux; Bernard Gély; Annamaria Ronchitelli; Kurt Wehrberger; Dan Grigorescu; Jiří Svoboda; Patrick Semal; David Caramelli; Hervé Bocherens; Katerina Harvati; Nicholas J Conard; Wolfgang Haak; Adam Powell; Johannes Krause Journal: Curr Biol Date: 2016-02-04 Impact factor: 10.834
Authors: Huw S Groucutt; Michael D Petraglia; Geoff Bailey; Eleanor M L Scerri; Ash Parton; Laine Clark-Balzan; Richard P Jennings; Laura Lewis; James Blinkhorn; Nick A Drake; Paul S Breeze; Robyn H Inglis; Maud H Devès; Matthew Meredith-Williams; Nicole Boivin; Mark G Thomas; Aylwyn Scally Journal: Evol Anthropol Date: 2015 Jul-Aug
Authors: Indu Khatri; Magdalena A Berkowska; Erik B van den Akker; Cristina Teodosio; Marcel J T Reinders; Jacques J M van Dongen Journal: Genes Immun Date: 2021-06-12 Impact factor: 2.676