| Literature DB >> 21085689 |
Wolfgang Haak1, Oleg Balanovsky, Juan J Sanchez, Sergey Koshel, Valery Zaporozhchenko, Christina J Adler, Clio S I Der Sarkissian, Guido Brandt, Carolin Schwarz, Nicole Nicklisch, Veit Dresely, Barbara Fritsch, Elena Balanovska, Richard Villems, Harald Meller, Kurt W Alt, Alan Cooper.
Abstract
In Europe, the Neolithic transition (8,000-4,000 B.C.) from hunting and gathering to agricultural communities was one of the most important demographic events since the initial peopling of Europe by anatomically modern humans in the Upper Paleolithic (40,000 B.C.). However, the nature and speed of this transition is a matter of continuing scientific debate in archaeology, anthropology, and human population genetics. To date, inferences about the genetic make up of past populations have mostly been drawn from studies of modern-day Eurasian populations, but increasingly ancient DNA studies offer a direct view of the genetic past. We genetically characterized a population of the earliest farming culture in Central Europe, the Linear Pottery Culture (LBK; 5,500-4,900 calibrated B.C.) and used comprehensive phylogeographic and population genetic analyses to locate its origins within the broader Eurasian region, and to trace potential dispersal routes into Europe. We cloned and sequenced the mitochondrial hypervariable segment I and designed two powerful SNP multiplex PCR systems to generate new mitochondrial and Y-chromosomal data from 21 individuals from a complete LBK graveyard at Derenburg Meerenstieg II in Germany. These results considerably extend the available genetic dataset for the LBK (n = 42) and permit the first detailed genetic analysis of the earliest Neolithic culture in Central Europe (5,500-4,900 calibrated B.C.). We characterized the Neolithic mitochondrial DNA sequence diversity and geographical affinities of the early farmers using a large database of extant Western Eurasian populations (n = 23,394) and a wide range of population genetic analyses including shared haplotype analyses, principal component analyses, multidimensional scaling, geographic mapping of genetic distances, and Bayesian Serial Simcoal analyses. The results reveal that the LBK population shared an affinity with the modern-day Near East and Anatolia, supporting a major genetic input from this area during the advent of farming in Europe. However, the LBK population also showed unique genetic features including a clearly distinct distribution of mitochondrial haplogroup frequencies, confirming that major demographic events continued to take place in Europe after the early Neolithic.Entities:
Mesh:
Substances:
Year: 2010 PMID: 21085689 PMCID: PMC2976717 DOI: 10.1371/journal.pbio.1000536
Source DB: PubMed Journal: PLoS Biol ISSN: 1544-9173 Impact factor: 8.029
Summary of archaeological, genetic, and radiocarbon data.
| Sample | Feature | Grave | Age, Sex | Radiocarbon Date (Laboratory Code) (Uncalibrated BP, Cal | HVS-I Sequence (np 15997–16409), Minus np 16000 | Hg HVS-I | Hg GenoCoRe22 | Hg GenoY25 |
| deb09 | 420 | 9 | Adult, f | rCRS | H | H | ||
| deb06 | 421 | 10 | Adult/mature, n.d. | Ambiguous | n.d. | H | — | |
| deb11 | 569 | 16 | Adult, f? | n.d. | n.d. | T | ||
| deb10 | 566 | 17 | Adult, m | 093C, 224C, 311C | K | K | — | |
| deb23 | 565 | 18 | Infans I, m? | 093C, 223T, 292T | W | W | — | |
| deb12I | 568 | 20 | Infans I, m? | 6,015±35 BP (KIA30400), 4,910±50 cal | 298C | V | V | — |
|
| 591 | 21 | Adult, f | 6,147±32 BP (KIA30401), 5,117±69 cal | 147A, 172C, 223T, 248T, 320T, 355T | N1a | n.d. | |
| deb15 | 593 | 23 | Infans I, f? | 126C, 294T, 296T, 304C | T2 | T | — | |
|
| 604/2 | 29 | Infans II, f?? | 311C | HV | HV | ||
| deb22 | 604/3 | 30 | Adult/mature, f | 092C, 129A, 147A, 154C, 172C, 223T, 248T, 320T, 355T | N1a | N1 | — | |
| deb20 | 599 | 31 | Adult, m | 6,257±40 BP (KIA30403), 5,247±45 cal | 311C | HV | HV | F*(xG,H,I,J,K) |
| deb21 | 600 | 32 | Mature, f | 6,151±27 BP (KIA30404), 5,122±65 cal | rCRS | H | H | |
|
| 598 | 33 | Infans II/Juvenile, f?? | 147A, 172C, 223T, 248T, 355T | N1a | N1 | ||
|
| 596 | 34 | Adult, m | 6,141±33 BP (KIA30402), 5,112±73 cal | 311C | HV | HV | |
| deb26 | 606 | 37 | Juvenile, m?? | 069T, 126C | J | J | — | |
| deb32 | 640 | 38 | Adult/mature f | 6,142±34 BP (KIA30405), 5,112±73 cal | n.d. | n.d. | T | |
| deb30 | 592 | 40 | Adult, f? | 069T, 126C | J | J | — | |
| deb29II | 649 | 41 | Adult, f? | 6,068±31 BP (KIA30406), 4,982±38 cal | n.d. | n.d. | K | |
| deb34II | 484 | 42 | Adult/mature, m | 093C, 223T, 292T | W | W | G2a3 | |
| deb33 | 483 | 43 | Juvenile II, f?? | 126C, 147T, 293G, 294T, 296T, 297C, 304C | T2 | T | — | |
|
| 644 | 44 | Mature, f | 224C, 311C | K | K | — | |
| deb36 | 645 | 45 | Mature, f | 093C, 256T, 270T, 399G | U5a1a | U | ||
| deb38 | 665 | 46 | Adult/mature, m | 093C, 224C, 311C | K | K | F*(xG,H,I,J,K) | |
| deb35II | 662 | 47 | Adult, f? | 126C, 189C, 294T, 296T | T | T | ||
| deb37I | 643 | 48 | Adult/mature f | 069T, 126C | J | J | ||
| deb39 | 708 | 49 | Adult/mature, f | 6,148±33 BP (KIA30407), 5,117±69 cal | 126C, 294T, 296T, 304C | T2 | T | — |
Italicized samples had been described previously [19].
One versus two question marks after sex indicate two levels of insecurity in sexing.
Previously analyzed diagnostic SNP sites at np 7028 AluI (hg H) and np 12308 HinfI (hg U) per restriction fragment length polymorphism.
BP, before present; f, female; m, male; n.d., not determined.
Summary statistics, overview of population genetic analyses, and summary of haplogroup frequencies used for comparison with PCA vector loadings.
| Category | Variable, Simulation, or Hg | Modern Datasets | Ancient Datasets | |||||||
| Total Dataset | Pooled Geographic Sets of Equal Size ( | Pooled European Dataset | Pooled Near East Populations | DEB22 | LBK20 | LBK42 | LBK34 | Hunter–Gatherers | ||
|
| Populations | 55 | 37 | 41 | 14 | 1 | 1 | 1 | 1 | 1 |
| Samples | 23,394 | 18,039 | 22 | 20 | 42 | 34 | 20 | |||
|
| Shared haplotypes | X | X | |||||||
| PCA | X | X | X | X | X | X | ||||
| Relative hg frequencies | X | X | X | X | X | X | X | |||
| MDS | X | X | X | X | X | |||||
| Genetic distance maps | X | X | X | |||||||
| BayeSSC | X | X | X | X | ||||||
| Haplotype diversity | 0.957 | 0.989 | 0.969 | 0.982 | 0.932 | |||||
| Tajima's | −0.91645 | −0.90573 | −0.91374 | −0.88555 | −1.05761 | |||||
|
| Asian hgs | 1.62 | 2.09 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||
| African hgs | 0.65 | 6.43 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||
| R0/preHV | 0.37 | 3.26 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||
| H | 43.35 | 23.74 | 13.64 | 25.00 | 19.05 | 17.65 | 0.00 | |||
| HV | 1.40 | 5.80 | 13.64 | 0.00 | 7.14 | 2.94 | 0.00 | |||
| J | 8.49 | 10.59 | 13.64 | 5.00 | 9.52 | 5.88 | 4.76 | |||
| T | 9.26 | 8.91 | 13.64 | 25.00 | 19.05 | 23.53 | 9.52 | |||
| I | 2.23 | 1.97 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||
| N1a | 0.30 | 0.32 | 13.64 | 15.00 | 14.29 | 17.65 | 0.00 | |||
| K | 5.39 | 6.67 | 13.64 | 15.00 | 14.29 | 14.71 | 4.76 | |||
| V | 4.35 | 0.77 | 4.55 | 5.00 | 4.76 | 5.88 | 0.00 | |||
| W | 2.03 | 2.25 | 9.09 | 5.00 | 7.14 | 5.88 | 0.00 | |||
| X | 1.22 | 2.52 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||
| U2 | 1.04 | 1.52 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||
| U3 | 1.26 | 4.43 | 0.00 | 5.00 | 2.38 | 2.94 | 0.00 | |||
| U4 | 4.04 | 2.10 | 0.00 | 0.00 | 0.00 | 0.00 | 9.52 | |||
| U5a | 5.46 | 2.53 | 4.55 | 0.00 | 2.38 | 2.94 | 23.80 | |||
| U5b | 3.89 | 0.64 | 0.00 | 0.00 | 0.00 | 0.00 | 28.57 | |||
| Other rare hgs | 3.67 | 13.45 | 0.00 | 0.00 | 0.00 | 0.00 | 19.05 | |||
X's indicate which datasets were used in the genetic analyses.
For explanation of datasets, see Materials and Methods.
For BayeSSC analyses, representative samples of the key areas were randomly drawn from the larger meta-population pool (Table S6).
Figure 1Percentages of shared haplotype matches per population.
Populations are plotted on a northwest–southeast axis. Note that the percentage of non-informative matches (orange) is nearly identical to the percentage of all shared haplotypes (red) in most populations, whereas we observe elevated frequencies of informative matches (blue) in Southeast European and Near Eastern population pools, culminating in Iranians.
Figure 2PCA plot based on mtDNA haplogroup frequencies.
The two dimensions display 39% of the total variance. The contribution of each hg is superimposed as grey component loading vectors. Notably, the Derenburg dataset (DEB22) groups well with its meta-population (LBK20), supporting the unique status and characteristic composition of the LBK sample. Populations are abbreviated as follows (Table S6): ALB, Albanians; ARM, Armenians; ARO, Aromuns; AUT, Austrians; AZE, Azeris; BAS, Basques; BLR, Byelorussians; BOS, Bosnians; BUL, Bulgarians; CHE, Swiss; CHM, Mari; CHV, Chuvash; CRO, Croats; CZE, Czechs; DEB22, Derenburg; DEU, Germans; ENG, English; ESP, Spanish; EST, Estonians; FIN, Finns; FRA, French; GEO, Georgians; GRC, Greeks; HG, European Mesolithic hunter–gatherers.; HUN, Hungarians; IRL, Irish; IRN, Iranians; IRQ, Iraqis; ISL, Icelanders; ITA, Italians; JOR, Jordanians; KAB, Kabardinians; KAR, Karelians; KOM, Komis (Permyaks and Zyrian); KUR, Kurds; LBK20, LBK without Derenburg; LBK34, all LBK samples excluding potential relatives; LBK42, all LBK; LTU, Lithuanians; LVA, Latvians; MAR, Moroccans; MOR, Mordvinians; NOG, Nogais; NOR, Norwegians; OSS, Ossetians; POL, Poles; PRT, Portuguese; PSE, Palestinians; ROU, Romanians; RUS, Russians; SAR, Sardinians; SAU, Saudi Arabians; SCO, Scots; SIC, Sicilians; SVK, Slovaks; SVN, Slovenians; SWE, Swedes; SYR, Syrians; TAT, Tatars; TUR, Turkish; UKR, Ukrainians.
Figure 3Genetic matrilineal distances between 55 modern Western Eurasian populations (Table S6) and Neolithic LBK samples.
Mapped genetic distances are illustrated between 55 modern Western Eurasian populations and the total of 42 Neolithic LBK samples (A) or the single graveyard of Derenburg (B). Black dots denote the location of modern-day populations used in the analysis. The coloring indicates the degree of similarity of the modern local population(s) with the Neolithic sample set: short distances (greatest similarity) are marked by dark green and long distances (greatest dissimilarity) by orange, with fainter colors in between the extremes. Note that green intervals are scaled by genetic distance values of 0.02, with increasingly larger intervals towards the “orange” end of the scale.
Pairwise F ST values between ancient and modern-day population pools as used for goodness-of-fit estimates in BayeSSC analyses.
| Hunter–Gatherers | Near East | LBK | Central Europe | |
|
| 0 | — | — | — |
|
| 0.04192 | 0 | — | — |
|
| 0.09298 | 0.03019 | 0 | — |
|
| 0.03445 | 0.00939 | 0.03958 | 0 |
Details of the demographic models analyzed with BayeSSC and AIC goodness-of-fit estimates, and resulting model probabilities via Akaike weights.
| Model | H0a | H0b | H1 | H2 | H2 | H2 |
| Prior | U | U:100000,30000000 | U:100000,12000000 | U:100000,12000000 | U:100000,12000000 | U:100000,12000000 |
| Prior | U:100000,12000000 | U:100000,12000000 | U:100000,12000000 | U:100000,12000000 | ||
| Percent migrants from deme 0 to deme 1 | 25% | 50% | 75% | |||
| AIC | 97.78 | 120.37 | 89.19 | 82.56 | 78.52 | 78.07 |
| Akaike weight ω | 2.76164e−5 | 3.42478e−10 | 0.002018032 | 0.055596369 | 0.418527622 | 0.52383036 |
Of note, the smaller the AIC value, the better the fit of the model. While no threshold value can be assigned to AIC values at which any model can be rejected, the Akaike weights estimate a model probability given the six models tested.
U, uniform distribution of given range.
N e, effective population size.