Literature DB >> 29087301

Estimating mobility using sparse data: Application to human genetic variation.

Liisa Loog^1,2,3,4, Marta Mirazón Lahr⁵, Mirna Kovacevic^6,7, Andrea Manica³, Anders Eriksson⁸, Mark G Thomas¹.

Abstract

Mobility is one of the most important processes shaping spatiotemporal patterns of variation in genetic, morphological, and cultural traits. However, current approaches for inferring past migration episodes in the fields of archaeology and population genetics lack either temporal resolution or formal quantification of the underlying mobility, are poorly suited to spatially and temporally sparsely sampled data, and permit only limited systematic comparison between different time periods or geographic regions. Here we present an estimator of past mobility that addresses these issues by explicitly linking trait differentiation in space and time. We demonstrate the efficacy of this estimator using spatiotemporally explicit simulations and apply it to a large set of ancient genomic data from Western Eurasia. We identify a sequence of changes in human mobility from the Late Pleistocene to the Iron Age. We find that mobility among European Holocene farmers was significantly higher than among European hunter-gatherers both pre- and postdating the Last Glacial Maximum. We also infer that this Holocene rise in mobility occurred in at least three distinct stages: the first centering on the well-known population expansion at the beginning of the Neolithic, and the second and third centering on the beginning of the Bronze Age and the late Iron Age, respectively. These findings suggest a strong link between technological change and human mobility in Holocene Western Eurasia and demonstrate the utility of this framework for exploring changes in mobility through space and time.

Entities: Chemical Disease Gene Species

Keywords: ancient DNA; cultural variation; mobility; morphological variation; time-series data

Mesh：

Substances：

Year: 2017 PMID： 29087301 PMCID： PMC5699029 DOI： 10.1073/pnas.1703642114

Source DB: PubMed Journal: Proc Natl Acad Sci U S A ISSN： 0027-8424 Impact factor: 11.205

One of the major goals of population history inference is to assess the role played by past mobility in shaping patterns of genetic, phenotypic, and cultural variation. It is well recognized that the past movement of people shapes geographic patterns of genetic variation (1) and the subsequent ecological and evolutionary properties of populations (2). This is due to the fact that gene flow changes allele frequencies, shapes genetic drift, and can affect (3) or even mimic (4) natural selection processes. It is also recognized that migration activity can influence cultural evolutionary processes (5, 6). However, despite the general agreement that mobility has played an important role in shaping past and present patterns of genetic, phenotypic, and cultural variation among humans, relatively little is known about its temporal and geographic variation in the past (7). Inferring past mobility is challenged by the sparseness and unevenness of sampling in time and space. As a result, studies of prehistorical mobility are typically limited to descriptive approaches, where major attested migration episodes or events are used as a proxy for general mobility. Data sources such as stable isotopes have enabled some quantification of mobility by allowing researchers to identify individuals within an archaeological community who have migrated into a region during their lifetime (e.g., ref. 8). The underlying logic behind this approach is that differences between isotope ratios—particularly strontium—within organisms reflect the isotope ratios acquired from the local environment (as a result of variation in underlying geology) (9). However, it is challenging to extrapolate within-community mobility rates to migration rates across larger geographic regions or over long time periods. Furthermore, isoscapes are still often poorly characterized, and isotope ratios can be relatively constant over large areas (9, 10) and so are not always informative. Most standard population genetic tools used for quantifying population structure, such as ADMIXTURE analysis (11), f statistics (12), and TREEMIX (11), are poorly suited for estimating underlying mobility change through time. In classical population genetic analysis, estimators of migration rates between hypothesized subpopulations have been developed, including statistics such as FST (13). Some of these statistics have also been applied to large sets of quantitative trait data, such as variation in craniometric morphology (e.g., refs. 14 and 15). However, such statistics quantify differentiation among a set of contemporaneous samples and only inform on migration rates under idealized demographic scenarios—such as gene flow between discrete subpopulations—and are also influenced by other factors, such as subpopulation split times and population size fluctuations. Furthermore, these estimators reflect past migration between hypothesized subpopulations over large periods of time and therefore lack temporal resolution. Some researchers interpret the estimated ages and geographic distribution of clades on a phylogenetic tree of uniparental genetic systems (mtDNA or the Y chromosome) as proxies for the rate of spread of populations (e.g., ref. 16). However, such approaches do not permit a formal quantification of mobility and have been criticized as a tool for demographic inference (17–19). Thus, existing methods allow us to identify migration episodes to some extent but lack the temporal resolution and formal quantification of underlying mobility, are poorly suited to spatially and temporarily sparsely sampled data, and do not permit systematic comparison between different time periods or geographic regions. To overcome these problems, we present an estimator of past mobility that is particularly suited to sparsely distributed morphological, cultural, or genetic variation data and provide a first application to a large set of genome-wide data from ancient individuals from across Western Eurasia. We define mobility as the average distance moved by entities in a given time period.

Estimating Past Migration Rates

Under a general model of identity-by-descent with modification and isolation by distance (IBD) (20, 21), trait (genetic, morphological, or cultural) differences between any two entities (individuals or populations) increase monotonically as a function of both the temporal and spatial distance between them. We therefore expect that trait differences between entities correlate with temporal as well as spatial distances. However, the extent to which spatial and temporal differences explain observed trait variation depends on the level of spatial population structure and therefore on the level of mobility. If mobility was low (i.e., strong spatial structure), then we would expect differences between entities to be more strongly correlated with space, relative to time, whereas if mobility was high we would expect time to explain a relatively larger proportion of differences between entities (because of the homogenizing effects of high mobility across space). Given that both spatial and temporal distances are expected to correlate with trait differences among entities, a matrix combining both spatial and temporal distance information should give a stronger correlation than either matrix alone (extra correlation, EC). However, since spatial and temporal distances are measured in different units (e.g., kilometers and years), combining them requires a scaling factor (S). Here, we show that the scaling factor value (Smax) that maximizes the correlation between a trait difference matrix and a Euclidian distance matrix combining the spatial and temporal distance matrices provides an estimator of mobility over the period and region covered by the data (see and Fig. 1). For convenience, we use a geometric interpretation of the scaling factor Smax as an angle, α, in the plane defined by the spatial and temporal distances [α = atan(Smax), illustrated in Fig. 1, Inset; see ].

Fig. 1.

Illustration of the principle of maximum time–space correlation. The black dots show a typical dependence of the correlation between genetic and time–space distances on the scaling factor angle α (in degrees). Here space alone (α = 0) is a better predictor of genetic differences than time alone (α = 90), but the best predictor (highest correlation) is found at an intermediate angle, indicated by the vertical red line. (Inset) Geometrical interpretation of the Scaling Factor (Smax) as an angle (α). To test the reliability and the robustness of Smax in recovering information about past mobility, we simulated data under a spatiotemporally explicit 2D model, which includes simple population dynamics with population growth, density dependence, and mobility (modeled as a Gaussian random walk), and generated variation data under different mobility parameter values (see ). We assessed the ability of Smax to infer simulated mobility values by correlation across simulations. We found a strong, positive linear relationship between the simulated average migration distance (i.e., mobility) and values of Smax (Fig. 2, R2 = 0.8), thus demonstrating the utility of this statistic as an estimator for relative mobility. However, for this result to hold, it is important that the trait differences are generated under an approximately constant mutation rate and vary neutrally within a population.

Fig. 2.

Correlation between simulated movement rate (dmig) and estimated scaling factor (Smax). Each black circle represents a single simulation. The colors correspond to the density of circles (see the color scale bar). The black line shows the best linear fit between dmig and Smax (R2 = 0.8), demonstrating that the scaling factor captures the underlying mobility in the simulated world.

Migration Rates Among Pleistocene Hunter–Gatherers and Early Farmers

Recent advances in sequencing technologies have allowed genomic data retrieval from a large sample of past individuals (e.g., refs. 22–26). Although these studies have not explicitly quantified underlying mobility in the past, they have suggested several periods of large-scale population turnover in Western Eurasia. Given that the Smax statistic is able to recover information on past mobility in simulated data, we applied the method to a sample (n = 329) of previously published genome-wide genotype data covering a time period from the beginning of the Upper Paleolithic to the Iron Age to explore changes in past human mobility in Western Eurasia (see ). We also constructed nonparametric confidence intervals to account for date and sampling uncertainty and estimated P values for the Smax statistic by permutation under the null hypothesis of no IBD in space and time, which allowed us to quantify the robustness of our estimates and identify time periods during which data are too sparse for the Smax statistic to be informative (see ). First, we explored the extent to which mobility differed between pre- and post-Last Glacial Maximum (LGM) hunter–gatherers (Fig. 3). We found the average (median) mobility rates to be higher (α = 18.1; 95% CI: 14.9–87.7; P = 0.08) among pre-LGM hunter–gatherers, temporally ranging from 37,000–26,000 y ago, compared with post-LGM hunter–gatherers (α = 9.9; 95% CI: 9.5–10.9; P = 0.03), temporally ranging from 19,000–5,000 y ago. We also estimated mobility rates for Holocene farmers, temporally ranging from 10,000–1,000 y ago, and found even higher values (α = 34.8; 95% CI: 33.9–35.3; P < 0.0001) than for both hunter–gatherer groups (see for full results).

Fig. 3.

Boxplot showing the mobility rate estimates (from jackknifing and date resampling) among pre-LGM hunter–gatherers temporally ranging from 37,000–26,000 y ago (n = 19), post-LGM hunter–gatherers temporally ranging from 19,000–5,000 y ago (n = 47), and Holocene farmers temporally ranging from 10,000–1,000 y ago (n = 263). The black solid lines are the medians of the distributions. The boxes represent the interquartile ranges, and the whiskers show the spans of the distributions. Because Holocene western Eurasia is particularly well sampled for ancient genomic DNA, we performed a sliding window analysis to explore changes in mobility over the last 14,000 y in more detail (Fig. 4), using 4,000 y-wide windows to ensure sufficient temporal signal within each window. We inferred a reduction in mobility rate between 14,000 and 9,000 y ago, before the start of the Neolithic transition (Fig. 4). However, throughout most of this period, the P values are not significant (Fig. 4). Because of the small sample size in the windows covering this time period (Fig. 4), there is no significant correlation between genetic and temporal distances, and as a result, we do not observe any EC and so lack power to estimate mobility (see and ). We consequently treat the inferred decline in mobility in this time range with caution. Second, we infer a substantial increase in mobility centered on the beginning of the Neolithic, with a peak centered around 7,500 y ago (Fig. 4). Notably, the inferred mobility rate does not remain at this level throughout the Holocene. Instead, we infer a Late Neolithic drop in mobility before a second increase centered on the beginning of the Bronze Age, around 5,000 y ago, then a decline in the Late Bronze Age and Early Iron Age, before a final increase centered on the Late Iron Age (Fig. 4 and for full results for each window).

Fig. 4.

Estimation of mobility through time from empirical data. (A) Relative mobility rate estimates in Western Eurasia over the last 14,000 y, using a 4,000-y sliding window (121 windows). The solid black line represents the mean α value from 10,000 date resampled iterations; the colored area represents the 95% confidence intervals of the jackknife distribution. (B) P values for each 4,000-y window under the null hypothesis of no EC, constructed by calculating the proportion of permuted datasets where the calculated EC value was as high or higher than the average EC value from the empirical dataset (see ). The red dotted line represents the level above which 5% or more of the permuted datasets result in EC values as high or higher than the empirical dataset. (C) Sample size for each 4,000-y window, averaged over 10,000 date resampled iterations. To validate the efficacy of our method to identify changes in migration rate on the time scales found in the empirical dataset (Fig. 4), we modified our simulations to represent a population experiencing two changes in migration rate, resulting in three episodes of constant migration rate. We observe a good correspondence between changes in Smax and the simulated migration rate (), supporting our interpretation of the empirical results in Fig. 4. Finally, we compare the performance of the Smax statistic to a simple IBD through time approach, where (the slope of) the linear relation between the genetic distances and geographic distances is used as an indicator of the level of past migratory activity: High levels of migration correspond to shallow IBD patterns. We observe a trend of decreasing spatial structure, consistent with the cumulative effects of a series of high migratory activity episodes over this period. However, this approach fails to recover the timing of those changes in migratory activity in specific periods (). Our method overcomes this lack of power to identify changes in migratory activity by explicitly considering the temporal dimension of the data.

Discussion

Through spatiotemporally explicit simulations, we have shown that the Smax statistic can be used as a reliable proxy for the underlying relative mobility of individuals within a given time period and geographic region. Because our statistic is based on correlations, it is well suited for analyzing data from archeological and paleontological contexts, where preservation can vary significantly across different geographical areas and temporal ranges, and samples are commonly sparsely distributed across space and time. Nevertheless, in the extreme case of just a small number of sites from different geographic locations or temporal periods, spurious estimates of migratory activity may arise. The permutation procedure introduced in this study can be used to identify when the Smax estimator is uninformative. We choose only to consider relative changes in the value of the Smax estimator and do not attempt to interpret its values in absolute terms. This is because, while our intuition is that mutation rate and population size will not affect the relationship between absolute values of the Smax estimator and the true mobility rate, we admit the possibility that other factors may. Selection in response to ecological and environmental factors could also reduce the utility of the Smax statistic as a proxy for mobility because local selection can create confounding spatial or temporal population structures. However, this is a common problem for any analysis assuming neutral evolution and can be dealt with by focusing on putatively neutrally varying traits or loci. The Smax statistic offers a robust alternative to existing methods for the quantification of IBD patterns in temporally heterogeneous datasets. In population genetics, correlations between trait differences and geographic distances are commonly used to infer past population structure and connectivity between populations (27). In such approaches, temporal structures in data are usually either ignored or mathematically controlled for by using partial least squares (e.g., ref. 28), but both of these practices have been criticized (29–31), and we show that while such approaches can inform the cumulative effects of migration in terms of structure reduction, they are unable to recover temporal changes in migratory activity. Partial least squares analysis assumes that the effect of time on genetic differences can be decoupled from the effect of space, which is generally not the case. We avoid this problem by integrating space and time into a single distance measure. Finally, because the statistic contains information about both spatial and temporal structuring of the populations, it can be used as a potentially informative summary statistic in quantitative model-fitting frameworks such as Approximate Bayesian Computation (32). Using the Smax statistic on ancient genomic data, we identified a sequence of changes in human mobility from the late Pleistocene to the Iron Age in western Eurasia. We find some support for reduced mobility in west Eurasian post-LGM hunter–gatherers compared with pre-LGM populations. The reasons for this result are, as yet, unclear, although possible explanations include reduced resource availability in pre-LGM Western Eurasia, requiring larger foraging ranges compared with post-LGM conditions (33, 34) and/or residual post-LGM population structure following recolonization of northern latitudes from LGM southern refugia (35). Using a sliding window analysis, we find some suggestion of a decline in post-LGM hunter–gatherer mobility leading up to the Neolithic transition. However, we caution against overinterpretation of this result as the estimated P values for the Smax statistic under the null hypothesis of no EC are mostly not significant. We find strong support for a rise in mobility during the Neolithic transition in western Eurasia, likely corresponding to a well-established demic expansion of farmers, originating in the Middle East and resulting in the spread of farming technologies throughout most of Western Eurasia (36–38). This is followed by an inferred mobility decline toward the end of the Neolithic, possibly related to the terminal phase of the spread of farming across most of Western Eurasia, and increased sedentism (39, 40). We also find strong support for a rise in mobility centered on the onset of the Bronze Age. From previous ancient DNA studies, this period has been associated with large-scale migration of Eurasian steppe populations, particularly those related to the Yamnaya culture, into Central and Northern Europe (22, 23). However, the emergence of the first civilizations and the concomitant establishment of far-reaching trade networks, as well as technological innovations such as horse-based transport (41), could also contribute to this increase in mobility (42). Finally, our sliding window analysis indicates a mobility reduction centered on the Late Bronze Age and Early Iron Age, starting around 3,000 y ago, before a final increase centered on the Late Iron Age in Western Eurasia (Fig. 4). One possible explanation for this pattern is a significant increase in trade and warfare during that period (43–45). Overall, our analysis suggests a strong link between technological change and human mobility in Holocene Western Eurasia. However, it should be noted that we have used wide windows (4,000 y), which necessarily reduces chronological resolution. A major strength of our method is its applicability to any set of neutrally evolving heritable traits where differences between individuals can be quantified and increase monotonically with geographic distance and temporal difference. This means that, in principle, the Smax statistics could allow the quantification of migratory activity in temporal and environmental contexts where obtaining ancient genetic data is not feasible, by using phenotypic data such as variation in cranial morphology, which has been shown to fit the pattern of neutral evolution and closely follow the patterns observed in analyses of neutral genetic data in humans (46, 47). Another exciting possibility is the quantification of movement based on cultural variation data, provided that appropriate near-neutral traits are used (e.g., refs. 48–50). While it should not be assumed that the movement of artifacts always coincides with that of people, contrasting measures of movement based on genetics and cultural artifacts obtained under the same conceptual framework would allow quantification of demic vs. cultural diffusion processes. This might permit identification periods and regions where genetic, phenotypic, and cultural processes are coupled, or decoupled. Given its robustness and flexibility, we anticipate that the Smax estimator will be applicable to a wide range of genetic, phenotypic, and cultural traits, allowing the quantification of mobility in a wide variety of scenarios in which this type of analysis has previously been challenging.

Materials and Methods

The proposed migration rate estimator, Smax, is the value of a scaling factor combining spatial and temporal distance matrices into a single distance matrix that maximizes its correlation with a matrix of trait distances. To estimate that value, the geographical, temporal, and trait distance matrices are calculated as described below.

Geographic, Temporal, and Trait Distances.

The geographic distance between all sample pairs was calculated in kilometers using the Haversine Eq. (51) to account for the curvature of the Earth as follows:where G is the distance in kilometers between individuals i and j; φ and φ are the latitude coordinates of individuals i and j, respectively; λ and λ are the longitude coordinates of individuals i and j, respectively; and r is the radius of the earth in kilometers. Temporal distances between samples were calculated as time in years between sample pairs. Previously reported date ranges based on stratigraphy or direct radiocarbon dating were used for all individuals (). In all analyses, sample dates were randomly drawn from a uniform distribution corresponding to the upper and lower bounds of a time period for a given specimen. Genetic distances were calculated as pairwise proportion of alleles that are not identical by state (pairwise heterozygosity), using the function ibs.dist from the Bioconductor package SNPstats v.1.18.0 (52) in the R statistical analysis environment v3.2.2 (53).

The Smax Estimator.

To consider the full range of scaling factors on a finite interval, we choose to represent S as the tangent of an angle α between 0° and 90°, where α = 0 corresponds to S = 0 (geographic variation alone explains the observed trait distances between entities) and α = π/2 corresponds to S = ∞ (temporal variation alone explains the observed trait distances between entities). Formally, the time–space product matrix (D) was calculated as follows:where i and j are the specimens considered, D is the time–space product matrix, G is the geographical distance matrix, T the temporal distance matrix (given by the difference in age of the samples), and S is the scaling factor [S = tan(α)]. To find the scaling factor, Smax, that maximizes the correlation between the trait distance matrix and D, the time–space product matrix, we calculated the Pearson correlation coefficient between these matrices for 200 (500 for the simulated data) scaling factor values (Fig. 1). The scaling factor value in the time–space product matrix that produced the strongest correlation with the trait distance matrix is recorded as Smax, the mobility estimator.

Simulation Tests.

The reliability and the robustness of the Smax statistic in recovering information about past mobility was explored using a spatiotemporally explicit simulation model. The simulation world consists of a grid of 8,000 by 8,000 demes. Each simulation starts with one entity placed in a randomly chosen deme and lasts 20,000 generations. The model simulated exponential population growth to a carrying capacity of 10,000 entities, followed by a stochastic birth–death process (54) mobility, and trait mutation. We generated spatiotemporal trait variation data under different mobility parameter values using the same Smax estimation protocols as described above for each dataset. A total of 10,000 independent replicates of the simulations and analyses were generated, and the utility of the Smax statistic in recovering information about mobility was assessed by correlation. The migratory process was modeled as Gaussian random walks: In each generation, each entity moves independently in the x and y directions by distances picked randomly from a normal distribution with mean = 0 and SD = σmig. This corresponds to the average distance moved in a single step (dmig) of σmi = 1.2533 σmig. Thus, dmig is the parameter of interest. We choose 1,000 random values of dmig from a uniform distribution with a range of 1–100. We modeled drift as a Moran-type birth–death process (54). At each generation, each entity undergoes binary fission with probability P = 0.1, creating a duplicate of itself at the same location. The two entities subsequently move and evolve independent of each other. When the number of entities reaches or exceeds the carrying capacity (10,000), excess entities are deleted at random among all entities present in that generation. Mutation was modeled as a one-dimensional Gaussian random walk for each trait (Ntraits = 50). Each trait was assigned an initial value of 1,000, and new (mutated) values were picked from a random normal distribution with mean equal to the current value and SD fixed at 0.05. Following a burn-in period of 10,000 generations, entities were sampled from simulations with a probability of 0.00001 at each generation. The x and y coordinates, time of sampling in generations, and the values for the 50 traits were recorded for all sampled entities. Pairwise trait distances between all sampled entities in each of the simulated datasets were calculated using the Euclidean distance formula as follows: where M is the distance between the two entities i and j; d and d are the values of the trait k for individuals i and j respectively; and n is the number of recorded traits. Out of 10,000 simulations, 9,866 (98.66%) resulted in an EC greater than zero. To match the simulated data with the empirical data, we filtered the simulated data based on the measured EC values and removed all simulations that produced an EC value smaller than 0.001. This resulted in 9,155 simulations being used in the correlation analysis. To assess the reliability of the Smax statistic in recovering information about mobility, R2 values were calculated for the correlation between the simulated dmig values and their corresponding Smax values.

Human Mobility in Late Pleistocene and Holocene.

We considered genome-wide data comprising 354,199 SNPs typed in 329 West Eurasian (i.e., west of the Ural mountains) individuals () temporally ranging from ∼39,000–1,000 y before present (see ). We merged the overlapping SNPs typed in archaeological samples published in refs. 22–26 and 55–61 (see for list of samples and references) that met the geographic and temporal criteria described above. No additional bioinformatic processing of the data were carried out for this study. The 329 individuals were assigned to one of following three groups based on their estimated age and subsistence strategy based on their archaeological context: pre-LGM hunter–gatherers n = 19 (temporally ranging from 39,000 y B.P. to 26,000 y B.P.), post-LGM hunter–gatherers n = 47 (temporally ranging from 19,000 y B.P. to 5,000 y B.P.), and Holocene farmers n = 263 (temporally ranging from 10,000 y B.P. to 500 y B.P.). Sliding window analysis was performed on all individuals in the dataset postdating 16,000 y B.P. The Smax statistic was estimated for 121 overlapping 4,000-y windows, each differing by 100 y. To take age uncertainty into account, we report the mean scaling factor angle from 10,000 replicates with sample dates randomly resampled from their age ranges. We estimated 95% confidence intervals through a jackknifing procedure in which a randomly chosen sample in each window was removed from analysis, and the 0.025 and 0.095 quantiles were calculated from the resulting distribution. To estimate the IBD signal through time, we fitted a linear model of genetic distances as a function of geographic distances in each time window (with sample jackknifing and age resampling as before, using the lm function from the R package base version 3.2.2) (53) and reported the slope of the line.

Confidence Intervals and Robustness of Smax Estimator.

We tested the assumption that there is an IBD pattern by correlating the genetic (trait) distance matrices in all time bins and in all windows with the respective geographic distance matrices and the date-resampled temporal distance matrices and calculated the P values for these correlations. We find a positive and statistically significant IBD pattern in space in all windows (, respectively and ). The isolation by temporal distance pattern is positive and significant for most windows, but some windows show negative correlations or are not significant. We find that these windows correspond to time periods where we observe low EC () and also low P values for the EC (Fig. 4). To account for the uncertainty in sample ages, we calculated the scaling factor angle 10,000 times using dates resampled at random from a uniform distribution for each sample, as described above, and report the average of the scaling factor angle of the given distribution as a point estimate. We also performed a leave-one-out analysis (10,000 replicates, combined with sample date resampling) to explore the combined effect of sampling and dating uncertainty and constructed approximate equal-tailed 95% confidence intervals for all groups and windows. To assess the statistical significance of Smax estimates, we consider the EC—defined as the Pearson correlation coefficient between the trait difference matrix and the time–space product matrix when S = Smax, minus the Pearson correlation coefficient between the trait difference matrix and either the temporal or geographical distance matrix alone, whichever is higher. To obtain a null-distribution of EC, we permuted trait data for individuals among the spatiotemporal sample locations 10,000 times and calculated EC for each permutation, as described above. Finally, we calculate the proportion of EC values from the permuted datasets that are equally high or higher than that obtained from the observed data. This permutation test permits assessment of how frequently the EC for the observed data are produced by chance alone or, alternatively, as the result of the method used for estimating the Smax statistic. The resultant P value is the probability of observing an equally high or higher EC value in permuted, supposedly signal-less data, and provides an indication of the information content of each dataset.

Simulated Scenario of Changing Migration Rate.

We modified our simulations to represent a population experiencing two changes in migration rate, resulting in three episodes of constant migration rate. We assumed a generation time of 25 y and chose the effective population size to be 2Ne = 10,000, standard figures in population genetic models of European populations (62). We next chose three levels of migration with relative magnitude on par with what was inferred from the empirical data: m1 = 0.0002, m2 = 0.01, and m3 = 0.05. To ensure equilibrium conditions during the start of the sampling period, we discarded the first 10,000 steps of the simulation (using migration rate m1). We then simulated a time period of 20,000 y, divided into three episodes with constant migration rate: m1 for 25,000–15,000 y ago, m2 for 15,000–10,000 y ago, and m3 for the last 5,000 y of the simulation. This roughly corresponds to the time spans associated with Mesolithic hunter–gatherers, Neolithic farmers, and post-Neolithic cultures in our empirical dataset. From a population genetic point of view, whole genome data as used in the empirical estimates correspond to a large number of approximately independent replicates. Because our model does not include recombination, we accounted for this effect by increasing the sample size to 10,000 individuals. shows the migration rate estimation using the Smax statistic using a 4,000 y-wide sliding window. R version 3.2.2 (53) was used for analyses throughout this manuscript. The correlations between temporal, geographic, and trait distance matrices were calculated using the mantel (method = “pearson”) function in R package Vegan version 2.3.0 (63). The permutation and bootstrap tests were performed using the function sample in the R package base version 3.2.2 (53).

39 in total

1. Approximate Bayesian computation in population genetics.

Authors: Mark A Beaumont; Wenyang Zhang; David J Balding
Journal: Genetics Date: 2002-12 Impact factor: 4.562

2. Late Pleistocene demography and the appearance of modern human behavior.

Authors: Adam Powell; Stephen Shennan; Mark G Thomas
Journal: Science Date: 2009-06-05 Impact factor: 47.728

Review 3. Statistical inferences in phylogeography.

Authors: Rasmus Nielsen; Mark A Beaumont
Journal: Mol Ecol Date: 2009-01-31 Impact factor: 6.185

4. Evolution in Mendelian populations. 1931.

Authors: S Wright
Journal: Bull Math Biol Date: 1990 Impact factor: 1.758

5. Early farmers from across Europe directly descended from Neolithic Aegeans.

Authors: Zuzana Hofmanová; Susanne Kreutzer; Garrett Hellenthal; Christian Sell; Yoan Diekmann; David Díez-Del-Molino; Lucy van Dorp; Saioa López; Athanasios Kousathanas; Vivian Link; Karola Kirsanow; Lara M Cassidy; Rui Martiniano; Melanie Strobel; Amelie Scheu; Kostas Kotsakis; Paul Halstead; Sevi Triantaphyllou; Nina Kyparissi-Apostolika; Dushka Urem-Kotsou; Christina Ziota; Fotini Adaktylou; Shyamalika Gopalan; Dean M Bobo; Laura Winkelbach; Jens Blöcher; Martina Unterländer; Christoph Leuenberger; Çiler Çilingiroğlu; Barbara Horejs; Fokke Gerritsen; Stephen J Shennan; Daniel G Bradley; Mathias Currat; Krishna R Veeramah; Daniel Wegmann; Mark G Thomas; Christina Papageorgopoulou; Joachim Burger
Journal: Proc Natl Acad Sci U S A Date: 2016-06-06 Impact factor: 11.205

6. The fate of mutations surfing on the wave of a range expansion.

Authors: Seraina Klopfstein; Mathias Currat; Laurent Excoffier
Journal: Mol Biol Evol Date: 2005-11-09 Impact factor: 16.240

7. Genomic and cranial phenotype data support multiple modern human dispersals from Africa and a southern route into Asia.

Authors: Hugo Reyes-Centeno; Silvia Ghirotto; Florent Détroit; Dominique Grimaud-Hervé; Guido Barbujani; Katerina Harvati
Journal: Proc Natl Acad Sci U S A Date: 2014-04-21 Impact factor: 11.205

8. Ancient human genomes suggest three ancestral populations for present-day Europeans.

Authors: Iosif Lazaridis; Nick Patterson; Alissa Mittnik; Gabriel Renaud; Swapan Mallick; Karola Kirsanow; Peter H Sudmant; Joshua G Schraiber; Sergi Castellano; Mark Lipson; Bonnie Berger; Christos Economou; Ruth Bollongino; Qiaomei Fu; Kirsten I Bos; Susanne Nordenfelt; Heng Li; Cesare de Filippo; Kay Prüfer; Susanna Sawyer; Cosimo Posth; Wolfgang Haak; Fredrik Hallgren; Elin Fornander; Nadin Rohland; Dominique Delsate; Michael Francken; Jean-Michel Guinet; Joachim Wahl; George Ayodo; Hamza A Babiker; Graciela Bailliet; Elena Balanovska; Oleg Balanovsky; Ramiro Barrantes; Gabriel Bedoya; Haim Ben-Ami; Judit Bene; Fouad Berrada; Claudio M Bravi; Francesca Brisighelli; George B J Busby; Francesco Cali; Mikhail Churnosov; David E C Cole; Daniel Corach; Larissa Damba; George van Driem; Stanislav Dryomov; Jean-Michel Dugoujon; Sardana A Fedorova; Irene Gallego Romero; Marina Gubina; Michael Hammer; Brenna M Henn; Tor Hervig; Ugur Hodoglugil; Aashish R Jha; Sena Karachanak-Yankova; Rita Khusainova; Elza Khusnutdinova; Rick Kittles; Toomas Kivisild; William Klitz; Vaidutis Kučinskas; Alena Kushniarevich; Leila Laredj; Sergey Litvinov; Theologos Loukidis; Robert W Mahley; Béla Melegh; Ene Metspalu; Julio Molina; Joanna Mountain; Klemetti Näkkäläjärvi; Desislava Nesheva; Thomas Nyambo; Ludmila Osipova; Jüri Parik; Fedor Platonov; Olga Posukh; Valentino Romano; Francisco Rothhammer; Igor Rudan; Ruslan Ruizbakiev; Hovhannes Sahakyan; Antti Sajantila; Antonio Salas; Elena B Starikovskaya; Ayele Tarekegn; Draga Toncheva; Shahlo Turdikulova; Ingrida Uktveryte; Olga Utevska; René Vasquez; Mercedes Villena; Mikhail Voevoda; Cheryl A Winkler; Levon Yepiskoposyan; Pierre Zalloua; Tatijana Zemunik; Alan Cooper; Cristian Capelli; Mark G Thomas; Andres Ruiz-Linares; Sarah A Tishkoff; Lalji Singh; Kumarasamy Thangaraj; Richard Villems; David Comas; Rem Sukernik; Mait Metspalu; Matthias Meyer; Evan E Eichler; Joachim Burger; Montgomery Slatkin; Svante Pääbo; Janet Kelso; David Reich; Johannes Krause
Journal: Nature Date: 2014-09-18 Impact factor: 49.962

9. Using classical population genetics tools with heterochroneous data: time matters!

Authors: Frantz Depaulis; Ludovic Orlando; Catherine Hänni
Journal: PLoS One Date: 2009-05-14 Impact factor: 3.240

10. Genome-wide patterns of selection in 230 ancient Eurasians.

Authors: Iain Mathieson; Iosif Lazaridis; Nadin Rohland; Swapan Mallick; Nick Patterson; Songül Alpaslan Roodenberg; Eadaoin Harney; Kristin Stewardson; Daniel Fernandes; Mario Novak; Kendra Sirak; Cristina Gamba; Eppie R Jones; Bastien Llamas; Stanislav Dryomov; Joseph Pickrell; Juan Luís Arsuaga; José María Bermúdez de Castro; Eudald Carbonell; Fokke Gerritsen; Aleksandr Khokhlov; Pavel Kuznetsov; Marina Lozano; Harald Meller; Oleg Mochalov; Vyacheslav Moiseyev; Manuel A Rojo Guerra; Jacob Roodenberg; Josep Maria Vergès; Johannes Krause; Alan Cooper; Kurt W Alt; Dorcas Brown; David Anthony; Carles Lalueza-Fox; Wolfgang Haak; Ron Pinhasi; David Reich
Journal: Nature Date: 2015-11-23 Impact factor: 49.962

11 in total

Review 1. Beyond broad strokes: sociocultural insights from the study of ancient genomes.

Authors: Fernando Racimo; Martin Sikora; Marc Vander Linden; Hannes Schroeder; Carles Lalueza-Fox
Journal: Nat Rev Genet Date: 2020-03-03 Impact factor: 53.242

2. Dairying, diseases and the evolution of lactase persistence in Europe.

Authors: Richard P Evershed; George Davey Smith; Mélanie Roffet-Salque; Adrian Timpson; Yoan Diekmann; Matthew S Lyon; Lucy J E Cramp; Emmanuelle Casanova; Jessica Smyth; Helen L Whelton; Julie Dunne; Veronika Brychova; Lucija Šoberl; Pascale Gerbault; Rosalind E Gillis; Volker Heyd; Emily Johnson; Iain Kendall; Katie Manning; Arkadiusz Marciniak; Alan K Outram; Jean-Denis Vigne; Stephen Shennan; Andrew Bevan; Sue Colledge; Lyndsay Allason-Jones; Luc Amkreutz; Alexandra Anders; Rose-Marie Arbogast; Adrian Bălăşescu; Eszter Bánffy; Alistair Barclay; Anja Behrens; Peter Bogucki; Ángel Carrancho Alonso; José Miguel Carretero; Nigel Cavanagh; Erich Claßen; Hipolito Collado Giraldo; Matthias Conrad; Piroska Csengeri; Lech Czerniak; Maciej Dębiec; Anthony Denaire; László Domboróczki; Christina Donald; Julia Ebert; Christopher Evans; Marta Francés-Negro; Detlef Gronenborn; Fabian Haack; Matthias Halle; Caroline Hamon; Roman Hülshoff; Michael Ilett; Eneko Iriarte; János Jakucs; Christian Jeunesse; Melanie Johnson; Andy M Jones; Necmi Karul; Dmytro Kiosak; Nadezhda Kotova; Rüdiger Krause; Saskia Kretschmer; Marta Krüger; Philippe Lefranc; Olivia Lelong; Eva Lenneis; Andrey Logvin; Friedrich Lüth; Tibor Marton; Jane Marley; Richard Mortimer; Luiz Oosterbeek; Krisztián Oross; Juraj Pavúk; Joachim Pechtl; Pierre Pétrequin; Joshua Pollard; Richard Pollard; Dominic Powlesland; Joanna Pyzel; Pál Raczky; Andrew Richardson; Peter Rowe; Stephen Rowland; Ian Rowlandson; Thomas Saile; Katalin Sebők; Wolfram Schier; Germo Schmalfuß; Svetlana Sharapova; Helen Sharp; Alison Sheridan; Irina Shevnina; Iwona Sobkowiak-Tabaka; Peter Stadler; Harald Stäuble; Astrid Stobbe; Darko Stojanovski; Nenad Tasić; Ivo van Wijk; Ivana Vostrovská; Jasna Vuković; Sabine Wolfram; Andrea Zeeb-Lanz; Mark G Thomas
Journal: Nature Date: 2022-07-27 Impact factor: 69.504

Review 3. Sometimes hidden but always there: the assumptions underlying genetic inference of demographic histories.

Authors: Liisa Loog
Journal: Philos Trans R Soc Lond B Biol Sci Date: 2020-11-30 Impact factor: 6.237

4. Ancient DNA suggests modern wolves trace their origin to a Late Pleistocene expansion from Beringia.

Authors: Liisa Loog; Olaf Thalmann; Mikkel-Holger S Sinding; Verena J Schuenemann; Angela Perri; Mietje Germonpré; Herve Bocherens; Kelsey E Witt; Jose A Samaniego Castruita; Marcela S Velasco; Inge K C Lundstrøm; Nathan Wales; Gontran Sonet; Laurent Frantz; Hannes Schroeder; Jane Budd; Elodie-Laure Jimenez; Sergey Fedorov; Boris Gasparyan; Andrew W Kandel; Martina Lázničková-Galetová; Hannes Napierala; Hans-Peter Uerpmann; Pavel A Nikolskiy; Elena Y Pavlova; Vladimir V Pitulko; Karl-Heinz Herzig; Ripan S Malhi; Eske Willerslev; Anders J Hansen; Keith Dobney; M Thomas P Gilbert; Johannes Krause; Greger Larson; Anders Eriksson; Andrea Manica
Journal: Mol Ecol Date: 2020-01-02 Impact factor: 6.185

5. Migration, acculturation, and the maintenance of between-group cultural variation.

Authors: Alex Mesoudi
Journal: PLoS One Date: 2018-10-16 Impact factor: 3.240

6. Whole-genome sequencing of 128 camels across Asia reveals origin and migration of domestic Bactrian camels.

Authors: Liang Ming; Liyun Yuan; Li Yi; Guohui Ding; Surong Hasi; Gangliang Chen; Tuyatsetseg Jambl; Nemat Hedayat-Evright; Mijiddorj Batmunkh; Garyaeva Khongr Badmaevna; Tudeviin Gan-Erdene; Batsukh Ts; Wenbin Zhang; Azhati Zulipikaer; Arkady Natyrov; Prmanshayev Mamay; Gendalai Meng; Choijilsuren Narangerel; Orgodol Khongorzul; Jing He; Le Hai; Weili Lin; Yixue Li; Zhen Wang
Journal: Commun Biol Date: 2020-01-07

7. The origins and spread of domestic horses from the Western Eurasian steppes.

Authors: Pablo Librado; Naveed Khan; Antoine Fages; Mariya A Kusliy; Tomasz Suchan; Laure Tonasso-Calvière; Stéphanie Schiavinato; Duha Alioglu; Aurore Fromentier; Aude Perdereau; Jean-Marc Aury; Charleen Gaunitz; Lorelei Chauvey; Andaine Seguin-Orlando; Clio Der Sarkissian; John Southon; Beth Shapiro; Alexey A Tishkin; Alexey A Kovalev; Saleh Alquraishi; Ahmed H Alfarhan; Khaled A S Al-Rasheid; Timo Seregély; Lutz Klassen; Rune Iversen; Olivier Bignon-Lau; Pierre Bodu; Monique Olive; Jean-Christophe Castel; Myriam Boudadi-Maligne; Nadir Alvarez; Mietje Germonpré; Magdalena Moskal-Del Hoyo; Jarosław Wilczyński; Sylwia Pospuła; Anna Lasota-Kuś; Krzysztof Tunia; Marek Nowak; Eve Rannamäe; Urmas Saarma; Gennady Boeskorov; Lembi Lōugas; René Kyselý; Lubomír Peške; Adrian Bălășescu; Valentin Dumitrașcu; Roxana Dobrescu; Daniel Gerber; Viktória Kiss; Anna Szécsényi-Nagy; Balázs G Mende; Zsolt Gallina; Krisztina Somogyi; Gabriella Kulcsár; Erika Gál; Robin Bendrey; Morten E Allentoft; Ghenadie Sirbu; Valentin Dergachev; Henry Shephard; Noémie Tomadini; Sandrine Grouard; Aleksei Kasparov; Alexander E Basilyan; Mikhail A Anisimov; Pavel A Nikolskiy; Elena Y Pavlova; Vladimir Pitulko; Gottfried Brem; Barbara Wallner; Christoph Schwall; Marcel Keller; Keiko Kitagawa; Alexander N Bessudnov; Alexander Bessudnov; William Taylor; Jérome Magail; Jamiyan-Ombo Gantulga; Jamsranjav Bayarsaikhan; Diimaajav Erdenebaatar; Kubatbeek Tabaldiev; Enkhbayar Mijiddorj; Bazartseren Boldgiv; Turbat Tsagaan; Mélanie Pruvost; Sandra Olsen; Cheryl A Makarewicz; Silvia Valenzuela Lamas; Silvia Albizuri Canadell; Ariadna Nieto Espinet; Ma Pilar Iborra; Jaime Lira Garrido; Esther Rodríguez González; Sebastián Celestino; Carmen Olària; Juan Luis Arsuaga; Nadiia Kotova; Alexander Pryor; Pam Crabtree; Rinat Zhumatayev; Abdesh Toleubaev; Nina L Morgunova; Tatiana Kuznetsova; David Lordkipanize; Matilde Marzullo; Ornella Prato; Giovanna Bagnasco Gianni; Umberto Tecchiati; Benoit Clavel; Sébastien Lepetz; Hossein Davoudi; Marjan Mashkour; Natalia Ya Berezina; Philipp W Stockhammer; Johannes Krause; Wolfgang Haak; Arturo Morales-Muñiz; Norbert Benecke; Michael Hofreiter; Arne Ludwig; Alexander S Graphodatsky; Joris Peters; Kirill Yu Kiryushin; Tumur-Ochir Iderkhangai; Nikolay A Bokovenko; Sergey K Vasiliev; Nikolai N Seregin; Konstantin V Chugunov; Natalya A Plasteeva; Gennady F Baryshnikov; Ekaterina Petrova; Mikhail Sablin; Elina Ananyevskaya; Andrey Logvin; Irina Shevnina; Victor Logvin; Saule Kalieva; Valeriy Loman; Igor Kukushkin; Ilya Merz; Victor Merz; Sergazy Sakenov; Victor Varfolomeyev; Emma Usmanova; Viktor Zaibert; Benjamin Arbuckle; Andrey B Belinskiy; Alexej Kalmykov; Sabine Reinhold; Svend Hansen; Aleksandr I Yudin; Alekandr A Vybornov; Andrey Epimakhov; Natalia S Berezina; Natalia Roslyakova; Pavel A Kosintsev; Pavel F Kuznetsov; David Anthony; Guus J Kroonen; Kristian Kristiansen; Patrick Wincker; Alan Outram; Ludovic Orlando
Journal: Nature Date: 2021-10-20 Impact factor: 49.962

8. The spread of agriculture in Iberia through Approximate Bayesian Computation and Neolithic projectile tools.

Authors: Alfredo Cortell-Nicolau; Oreto García-Puchol; María Barrera-Cruz; Daniel García-Rivero
Journal: PLoS One Date: 2021-12-28 Impact factor: 3.240

9. A genomic perspective on South American human history.

Authors: Marcos Araújo Castro E Silva; Tiago Ferraz; Tábita Hünemeier
Journal: Genet Mol Biol Date: 2022-07-29 Impact factor: 2.087

10. Did Our Species Evolve in Subdivided Populations across Africa, and Why Does It Matter?

Authors: Eleanor M L Scerri; Mark G Thomas; Andrea Manica; Philipp Gunz; Jay T Stock; Chris Stringer; Matt Grove; Huw S Groucutt; Axel Timmermann; G Philip Rightmire; Francesco d'Errico; Christian A Tryon; Nick A Drake; Alison S Brooks; Robin W Dennell; Richard Durbin; Brenna M Henn; Julia Lee-Thorp; Peter deMenocal; Michael D Petraglia; Jessica C Thompson; Aylwyn Scally; Lounès Chikhi
Journal: Trends Ecol Evol Date: 2018-07-11 Impact factor: 17.712