| Literature DB >> 23171862 |
Flora Jay1, Per Sjödin, Mattias Jakobsson, Michael G B Blum.
Abstract
Genetic differentiation among human populations is greatly influenced by geography due to the accumulation of local allele frequency differences. However, little is known about the possibly different increment of genetic differentiation along the different geographical axes (north-south, east-west, etc.). Here, we provide new methods to examine the asymmetrical patterns of genetic differentiation. We analyzed genome-wide polymorphism data from populations in Africa (n = 29), Asia (n = 26), America (n = 9), and Europe (n = 38), and we found that the major orientations of genetic differentiation are north-south in Europe and Africa, and east-west in Asia, but no preferential orientation was found in the Americas. Additionally, we showed that the localization of the individual geographic origins based on single nucleotide polymorphism data was not equally precise along all orientations. Confirming our findings, we obtained that, in each continent, the orientation along which the precision is maximal corresponds to the orientation of maximum differentiation. Our results have implications for interpreting human genetic variation in terms of isolation by distance and spatial range expansion processes. In Europe, for instance, the precise northnorthwest-southsoutheast axis of main European differentiation cannot be explained by a simple Neolithic demic diffusion model without admixture with the local populations because in that case the orientation of greatest differentiation should be perpendicular to the direction of expansion. In addition to humans, anisotropic analyses can guide the description of genetic differentiation for other organisms and provide information on expansions of invasive species or the processes of plant dispersal.Entities:
Mesh:
Year: 2012 PMID: 23171862 PMCID: PMC3563970 DOI: 10.1093/molbev/mss259
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
Sample Information for the SNP and Microsatellite Data Sets.
| Continent | Markers | Sample Size | Population No. | Population Name (Population Sample Size) | References |
|---|---|---|---|---|---|
| Africa | 55,098 SNPs | 587 | 29 | Algeria (19), Bamoun (18), Biaka Pygmy (22), Brong (8), Bulala (15), Egypt (19), Fang (15), Fulani (12), Hadza (17), Hausa (12), Igbo (15), Kaba (17), Kongo (9), Libya (17), Luhya (36), Maasai (30), Mada (12), Mandenka (22), Mbuti Pygmy (13), Morocco N (18), Morocco S (16), Mozabite (29), Sahara OCC (18), San NB (17), San SA (31), Sandawe (28), Tunisia (18), Xhosa (11), Yoruba (47) | |
| America | 439,046 SNPs | 118 | 9 | Aymara (24), Colombian (5), Guerrero (14), Karitiana (5), Maya (18), Yucatan (4), Pima (5), Quechua (24), Surui (5) | |
| Asia | 656,995 SNPs | 428 | 26 | Balochi (24), Brahui (25), Burusho (25), Cambodian (10), Dai (10), Daur (9), Han (44), Hazara (22), Hezhen (9), Japanese (28), Kalash (23), Lahu (8), Makrani (25), Miaozu (10), Mongola (10), Naxi (8), Oroqen (9), Pathan (22), She (10), Sindhi (24), Tu (10), Tujia (10), Uygur (10), Xibo (9), Yakut (25), Yizu (10) | |
| Europe | 279,344 SNPs | 1,466 | 38 | Netherlands (17), Norway (3), Albania (3), Austria (14), Belgium (43), Bosnia (9), Bulgaria (2), Croatia (8), Cyprus (4), Czech (11), Denmark (1), Finland (41), France (89), Germany (71), Greece (8), Hungary (19), Ireland (61), Italy (219), Kosovo (2), Latvia (1), LSFIN (41), Macedonia, (4), Poland (22), Portugal (128), Romania (14), Russian (6), Scotland (5), Serbia (44), Slovakia (1), Slovenia (2), Spain (136), Sweden (10), Swiss-French (125), Swiss-German (84), Swiss-Italian (13), Turkey (4), Ukraine (1), United Kingdom (200) | |
| America | 678 Micro | 530 | 29 | Ache (19), Arhuaco (17), Aymara (18), Cabecar (20), Chipewyan (29), Cree (18), Embera (11), Guarani (10), Guaymi (18), Huilliche (20), Inga (17), Kaingang (7), Kaqchikel (12), Karitiana (24), Kogi (17), Maya (25), Mixe (20), Mixtec (20), Ojibwa (20), Piapoco (13), Pima (25), Quechua (20), Surui (21), TicunaArara (17), TicunaTarapaca (18), Waunana (20), Wayuu (17), Zapotec (19), Zenu (18) | |
FSpatial interpolation of the first component of MDS. MDS was applied separately in each continent using the pairwise intracontinetal FST matrix as dissimilarity matrix. Spatial interpolation was performed using the Krig function of the R fields package by considering a trend surface of degree 2. The gray dots represent the locations of the sampled populations. For each continent, the colored bar gives the scale of the first component of MDS. The MDS values are not on the same scale for the four continents reflecting the different levels of genetic differentiation within continents.
FSchematic description of the regression and geometric approaches used for providing the angle of maximal differentiation.
Orientations of Maximum Differentiation.
| Africa | America | Asia | Europe | |
|---|---|---|---|---|
| Regression method | 9*** (160–33) | 92 (22–167) | 102*** (84–121) | 167* (140–14) |
| Geometric method | 6 (164–26) | 67 (10–118) | 102 (80–124) | 167 (138–14) |
Note.—All orientations are measured in degree clockwise from the N–S orientation (1°–180°). The 95% confidence intervals given in parenthesis should be read clockwise. The test for anisotropy in equation (1) was assessed with a partial Mantel test using 10,000 permutations. P value: *** < 0.001, * < 0.05.
FCorrelation between FST and orientational distances computed along the different bearing lines. Results are shown for the four continents and for several subregions within Africa, Asia, and Europe. The symbols next to each circle indicate the significance of the test for anisotropy. The P values have been obtained using a partial Mantel test in equation (1). The gray dots represent the locations of the sampled populations. The large circles correspond to the continental analyses, whereas the smaller circles correspond to the analyses of regions within continent.
FPrediction of average FST for pairs of populations separated by 0 and by 1,000 km along the axis of maximum differentiation. The logit-transformed FST was regressed with equation (1) to constrain the predicted FST to lie between 0 and 1.
FPrediction of geographic origin based on SNP data. One arrow is plotted for each population. The origin of an arrow corresponds to the true location, and the arrow points to the predicted location averaged over all individuals in the population.
Median Errors of Geographic Localization Based on SNP Data.
| Africa | America | Asia | Europe | |
|---|---|---|---|---|
| Optimal number of PCs | 52 | 17 | 34 | 17 |
| Error (km) | 430 | 250 | 510 | 280 |
| Relative error | 0.15 | 0.10 | 0.17 | 0.24 |
| Shrinkage | 0.91 | 0.94 | 0.95 | 0.92 |
Note.—The relative errors are computed with respect to a naive localizer which assigns each individual to a population that is chosen at random among the sampled populations. The shrinkage is computed as the ratio of the mean distance between the predicted locations and the continental centroid and the mean distance between the true locations and the same centroid.
FMedian error of the localization method in N–S and E–W orientation. The upper panel shows the errors in km and the lower panel shows the relative error when the individual errors have been standardized by the errors obtained with a naive localizer that picks one population in the continent at random. Confidence intervals were estimated with bootstrap and P values were obtained with a two-sided Wilcoxon test.