| Literature DB >> 26098372 |
Qiaomei Fu1, Mateja Hajdinjak2, Oana Teodora Moldovan3, Silviu Constantin4, Swapan Mallick5, Pontus Skoglund6, Nick Patterson7, Nadin Rohland6, Iosif Lazaridis6, Birgit Nickel2, Bence Viola8, Kay Prüfer2, Matthias Meyer2, Janet Kelso2, David Reich9, Svante Pääbo2.
Abstract
Neanderthals are thought to have disappeared in Europe approximately 39,000-41,000 years ago but they have contributed 1-3% of the DNA of present-day people in Eurasia. Here we analyse DNA from a 37,000-42,000-year-old modern human from Peştera cu Oase, Romania. Although the specimen contains small amounts of human DNA, we use an enrichment strategy to isolate sites that are informative about its relationship to Neanderthals and present-day humans. We find that on the order of 6-9% of the genome of the Oase individual is derived from Neanderthals, more than any other modern human sequenced to date. Three chromosomal segments of Neanderthal ancestry are over 50 centimorgans in size, indicating that this individual had a Neanderthal ancestor as recently as four to six generations back. However, the Oase individual does not share more alleles with later Europeans than with East Asians, suggesting that the Oase population did not contribute substantially to later humans in Europe.Entities:
Mesh:
Year: 2015 PMID: 26098372 PMCID: PMC4537386 DOI: 10.1038/nature14558
Source DB: PubMed Journal: Nature ISSN: 0028-0836 Impact factor: 49.962
Extended Data Figure 1Mitochondrial DNA tree for Oase 1 and other modern humans
The consensus sequences for all Oase 1 fragments and for deaminated fragments are shown. The tree is rooted with a Neanderthal mtDNA (Vindija33.25) as an outgroup.
Figure 1Allele sharing between the Oase 1 individual and other genomes
Each point indicates the extent to which the Oase 1 genome shares alleles with one or the other of a pair of genomes from different populations indicated above and below (see Extended Data Table 1 for numbers). Z-scores larger and smaller than |2| indicate an excess of allele sharing (grey).
Allele sharing between early modern humans and other humans
We compute D(Non-African to test whether an early modern human (Oase 1, Ust’-Ishim, or Kostenki 14) shares more alleles with Non-African (in which case the statistic is positive) or Non-African (negative). We use a pool of six sub-Saharan African genomes (2 Mbuti, 2 Yoruba, 2 Dinka) as an outgroup; a pool of four genomes (2 French, 2 Sardinians) to represent Europeans; a pool of four genomes (2 Han, 2 Dai) to represent East Asians; and a pool of three genomes (2 Karitiana, 1 Mixe) to represent Native Americans. Results are based on 242,122 transition and transversion SNPs covered by at least one deaminated fragment in Oase 1, and covered in all other samples with the possible exception of MA1 (for MA1, 176,569 SNPs are used).
| Oase 1 | Ust’-Ishim | Kostenki 14 | |||||
|---|---|---|---|---|---|---|---|
| D | Z | D | Z | D | Z | ||
| Oase 1 | Ust’-Ishim | −0.0033 | −3.8 | ||||
| Oase 1 | Kostenki 14 | −0.0037 | −4.1 | ||||
| Oase 1 | MA1 | −0.0032 | −3.5 | −0.0092 | −9.8 | ||
| Oase 1 | Loschbour | −0.0032 | −3.9 | −0.0101 | −12.2 | ||
| Oase 1 | East Asian | −0.0027 | −3.8 | −0.0011 | −1.6 | ||
| Oase 1 | Native American | −0.0030 | −4.1 | −0.0039 | −5.5 | ||
| Ust’-Ishim | Kostenki 14 | −0.0005 | −0.6 | ||||
| Ust’-Ishim | MA1 | −0.0007 | −0.8 | −0.0059 | −6.4 | ||
| Ust’-Ishim | Loschbour | 0.0002 | 0.3 | −0.0068 | −8.5 | ||
| Ust’-Ishim | East Asian | 0.0000 | −0.1 | 0.0022 | 3.3 | ||
| Ust’-Ishim | Native American | −0.0007 | −1.0 | −0.0006 | −0.8 | ||
| Kostenki 14 | MA1 | −0.0004 | −0.6 | 0.0003 | 0.4 | ||
| Kostenki 14 | Loschbour | 0.0007 | 1.0 | 0.0006 | 0.8 | ||
| Kostenki 14 | East Asian | 0.0004 | 0.6 | 0.0011 | 1.6 | ||
| Kostenki 14 | Native American | −0.0002 | −0.3 | 0.0008 | 1.1 | ||
| MA1 | Loschbour | 0.0012 | 1.7 | 0.0005 | 0.7 | −0.0012 | −1.5 |
| MA1 | East Asian | 0.0008 | 1.2 | 0.0007 | 1.1 | 0.0079 | 10.6 |
| MA1 | Native American | 0.0001 | 0.1 | 0.0004 | 0.6 | 0.0051 | 7.0 |
| Loschbour | East Asian | −0.0002 | −0.4 | 0.0005 | 0.9 | 0.0090 | 13.7 |
| Loschbour | Native American | −0.0009 | −1.5 | 0.0002 | 0.3 | 0.0062 | 9.0 |
| East Asian | Native American | −0.0006 | −1.6 | −0.0003 | −0.8 | −0.0028 | −6.6 |
|
| |||||||
| European | Oase 1 | 0.0004 | 0.6 | 0.0049 | 7.3 | ||
| European | Ust’-Ishim | −0.0023 | −3.5 | 0.0016 | 2.4 | ||
| European | Kostenki 14 | −0.0028 | −4.7 | −0.0033 | −5.1 | ||
| European | MA1 | −0.0033 | −5.4 | −0.0031 | −5.1 | −0.0041 | −6.0 |
| European | Loschbour | −0.0021 | −4.5 | −0.0027 | −5.7 | −0.0052 | −9.1 |
| European | East Asian | −0.0024 | −5.2 | −0.0022 | −5.3 | 0.0039 | 9.2 |
| European | Native American | −0.0030 | −6.4 | −0.0025 | −5.9 | 0.0010 | 2.2 |
| European | Stuttgart | −0.0007 | −1.5 | −0.0001 | −0.2 | −0.0002 | −0.3 |
| Stuttgart | Oase 1 | 0.0005 | 0.6 | 0.0051 | 6.7 | ||
| Stuttgart | Ust’-Ishim | −0.0017 | −2.3 | 0.0018 | 2.3 | ||
| Stuttgart | Kostenki 14 | −0.0021 | −3.2 | −0.0032 | −4.6 | ||
| Stuttgart | MA1 | −0.0027 | −3.9 | −0.0029 | −4.2 | −0.0041 | −5.0 |
| Stuttgart | Loschbour | −0.0015 | −2.4 | −0.0027 | −4.6 | −0.0050 | −7.5 |
| Stuttgart | East Asian | −0.0017 | −2.9 | −0.0022 | −3.8 | 0.0040 | 6.8 |
| Stuttgart | Native American | −0.0024 | −3.9 | −0.0025 | −4.4 | 0.0012 | 1.9 |
Allele sharing between early modern humans and other humans (transversions only)
We compute D(Non-African, to test whether an early modern human (Oase 1, Ust’-Ishim, or Kostenki 14) shares more alleles with Non-African (in which case the statistic is positive) or Non-African (negative). We use a pool of six sub-Saharan African genomes (2 Mbuti, 2 Yoruba, 2 Dinka) as an outgroup; a pool of four genomes (2 French, 2 Sardinians) to represent Europeans; a pool of four genomes (2 Han, 2 Dai) to represent East Asians; and a pool of three genomes (2 Karitiana, 1 Mixe) to represent Native Americans. Statistics are as in Extended Data Table 1 but are based on 106,004 transversion SNPs covered by at least one deaminated fragment in Oase 1 and that also have coverage for all other samples although not necessarily MA1. For analyses involving MA1, a subset of 76,715 transversion SNPs is analyzed.
| Oase 1 | Ust’-Ishim | Kostenki 14 | |||||
|---|---|---|---|---|---|---|---|
| D | Z | D | Z | D | Z | ||
| Oase 1 | Ust’-Ishim | −0.0019 | −2.1 | ||||
| Oase 1 | Kostenki 14 | −0.0031 | −3.3 | ||||
| Oase 1 | MA1 | −0.0026 | −2.9 | −0.0071 | −6.5 | ||
| Oase 1 | Loschbour | −0.0023 | −2.6 | −0.0081 | −8.8 | ||
| Oase 1 | East Asian | −0.0013 | −1.9 | 0.0007 | 1.0 | ||
| Oase 1 | Native American | −0.0019 | −2.7 | −0.0018 | −2.3 | ||
| Ust’-Ishim | Kostenki 14 | −0.0012 | −1.4 | ||||
| Ust’-Ishim | MA1 | −0.0006 | −0.7 | −0.0050 | −5.1 | ||
| Ust’-Ishim | Loschbour | 0.0003 | 0.4 | −0.0062 | −7.1 | ||
| Ust’-Ishim | East Asian | 0.0005 | 0.7 | 0.0026 | 3.8 | ||
| Ust’-Ishim | Native American | −0.0003 | −0.4 | 0.0001 | 0.1 | ||
| Kostenki 14 | MA1 | 0.0001 | 0.1 | 0.0002 | 0.3 | ||
| Kostenki 14 | Loschbour | 0.0015 | 2.0 | 0.0008 | 1.1 | ||
| Kostenki 14 | East Asian | 0.0017 | 2.3 | 0.0017 | 2.5 | ||
| Kostenki 14 | Native American | 0.0009 | 1.2 | 0.0012 | 1.6 | ||
| MA1 | Loschbour | 0.0019 | 2.2 | 0.0010 | 1.3 | −0.0013 | −1.3 |
| MA1 | East Asian | 0.0011 | 1.4 | 0.0013 | 1.9 | 0.0075 | 8.5 |
| MA1 | Native American | 0.0006 | 0.7 | 0.0007 | 1.1 | 0.0051 | 6.0 |
| Loschbour | East Asian | 0.0001 | 0.2 | 0.0009 | 1.5 | 0.0088 | 12.3 |
| Loschbour | Native American | −0.0006 | −0.9 | 0.0004 | 0.6 | 0.0063 | 8.4 |
| East Asian | Native American | −0.0008 | −1.7 | −0.0006 | −1.3 | −0.0025 | −5.3 |
|
| |||||||
| European | Oase 1 | −0.0005 | −0.7 | 0.0029 | 3.9 | ||
| European | Ust’-Ishim | −0.0023 | −3.3 | 0.0010 | 1.4 | ||
| European | Kostenki 14 | −0.0035 | −5.1 | −0.0035 | −5.2 | ||
| European | MA1 | −0.0033 | −4.5 | −0.0033 | −5.2 | −0.0038 | −4.8 |
| European | Loschbour | −0.0020 | −3.6 | −0.0027 | −5.1 | −0.0052 | −8.4 |
| European | East Asian | −0.0018 | −3.6 | −0.0018 | −4.0 | 0.0036 | 7.8 |
| European | Native American | −0.0026 | −4.8 | −0.0023 | −5.2 | 0.0011 | 2.1 |
| European | Stuttgart | −0.0009 | −1.7 | −0.0010 | −2.2 | −0.0012 | −2.3 |
| Stuttgart | Oase 1 | 0.0005 | 0.7 | 0.0041 | 4.7 | ||
| Stuttgart | Ust’-Ishim | −0.0014 | −1.8 | 0.0022 | 2.6 | ||
| Stuttgart | Kostenki 14 | −0.0026 | −3.3 | −0.0025 | −3.5 | ||
| Stuttgart | MA1 | −0.0026 | −3.1 | −0.0023 | −3.2 | −0.0031 | −3.4 |
| Stuttgart | Loschbour | −0.0011 | −1.6 | −0.0017 | −2.8 | −0.0040 | −5.2 |
| Stuttgart | East Asian | −0.0010 | −1.4 | −0.0008 | −1.3 | 0.0048 | 7.2 |
| Stuttgart | Native American | −0.0017 | −2.4 | −0.0013 | −2.2 | 0.0023 | 3.4 |
Testing whether archaic genomes share more alleles with Oase 1 than with other modern humans
The statistic D(Test, Oase 1; Archaic, Outgroup) is negative if the archaic genomes share more alleles with Oase 1 than with a test sample. The outgroups are either chimpanzee or a sub-Saharan African (Mbuti).
| Test | Sites | Archaic = Altai | Archaic = Denisovan | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Chimp | Mbuti | Chimp | Mbuti | ||||||
| D | Z | D | Z | D | Z | D | Z | ||
| 115,300 | −0.0036 | −5.1 | −0.0071 | −7.6 | −0.0014 | −2.2 | −0.0049 | −6.3 | |
| 115,300 | −0.0035 | −5.0 | −0.0077 | −8.2 | −0.0013 | −2.1 | −0.0056 | −7.0 | |
| 115,300 | −0.0032 | −4.3 | −0.0063 | −6.9 | −0.0008 | −1.3 | −0.0040 | −5.3 | |
| 115,300 | −0.0049 | −6.9 | −0.0074 | −8.2 | −0.0021 | −3.4 | −0.0047 | −6.2 | |
| 115,300 | −0.0038 | −5.1 | −0.0071 | −7.8 | −0.0016 | −2.5 | −0.0050 | −6.5 | |
| 115,300 | −0.0026 | −3.6 | −0.0051 | −5.4 | 0.0009 | 1.5 | −0.0016 | −2.1 | |
| 115,100 | −0.0026 | −3.6 | −0.0052 | −5.5 | −0.0009 | −1.5 | −0.0035 | −4.4 | |
| 108,100 | −0.0032 | −4.1 | −0.0059 | −6.0 | −0.0017 | −2.4 | −0.0044 | −5.3 | |
| 83,200 | −0.0031 | −3.6 | −0.0050 | −4.7 | −0.0007 | −0.9 | −0.0028 | −2.8 | |
| 114,300 | −0.0043 | −5.7 | −0.0066 | −6.8 | −0.0019 | −2.9 | −0.0043 | −5.3 | |
| 111,000 | −0.0033 | −4.2 | −0.0072 | −7.3 | −0.0008 | −1.2 | −0.0047 | −5.4 | |
| 114,000 | −0.0037 | −5.1 | −0.0066 | −7.1 | −0.0013 | −2.1 | −0.0042 | −5.6 | |
Estimated fraction of the Oase 1 genome that derives from Neanderthals.
| Sample | Statistic 1 | Statistic 2 | Statistic 3 | ||||||
|---|---|---|---|---|---|---|---|---|---|
|
|
|
| |||||||
| Prop. | S.E. | 90% CI | Prop. | S.E. | 90% CI | Prop. | S.E. | 90% CI | |
| Oase 1 | 8.1% | 2.0% | 4.8%–11.3% | 9.4% | 1.1% | 7.5%–11.3% | 6.0% | 2.0% | 2.8%–9.3% |
| Ust’-Ishim | 3.6% | 0.9% | 2.2%–5.0% | 5.5% | 0.7% | 4.3%–6.6% | 0.4% | 1.2% | 0.0%–2.5% |
| Kostenki 14 | 3.8% | 1.0% | 2.1%–5.5% | 2.9% | 0.8% | 1.6%–4.2% | 1.7% | 1.3% | 0.0%–3.9% |
| MA1 | 1.2% | 1.1% | 0.0%–3.0% | 3.5% | 0.8% | 2.2%–4.8% | 2.3% | 1.3% | 0.1%–4.5% |
| Loschbour | 1.3% | 0.9% | 0.0%–2.8% | 3.9% | 0.7% | 2.7%–5.1% | 0.5% | 1.2% | 0.0%–2.6% |
| LaBrana | 3.1% | 1.0% | 1.4%–4.7% | 1.9% | 0.7% | 0.7%–3.1% | 1.4% | 1.2% | 0.0%–3.4% |
| Stuttgart | 3.0% | 0.9% | 1.5%–4.4% | 2.5% | 0.7% | 1.3%–3.7% | 0.4% | 1.2% | 0.0–2.4% |
| Han | 2.2% | 0.9% | 0.6%–3.7% | 2.2% | 0.8% | 1.0%–3.5% | 1.0% | 1.2% | 0.0%–3.1% |
| Dai | 2.6% | 0.9% | 1.1%–4.0% | 1.0% | 0.8% | 0.0%–2.3% | 0.7% | 1.2% | 0.0%–2.6% |
| French | 3.0% | 0.9% | 1.6%–4.5% | 3.0% | 0.7% | 1.8%–4.2% | 0.2% | 1.2% | 0.0%–2.2% |
| Sardinian | 2.0% | 0.8% | 0.6%–3.4% | 2.7% | 0.7% | 1.5%–3.9% | −0.3% | 1.2% | 0.0%–1.8% |
Estimated fraction of the Oase 1 genome that derives from Neanderthals
Estimates are as in Table 1 but restrict to transversions. Modern human samples are from Panel B.
| Sample |
|
|
| ||||||
|---|---|---|---|---|---|---|---|---|---|
| Prop. | S.E. | 90% CI | Prop. | S.E. | 90% CI | Prop. | S.E. | 90% CI | |
| Oase 1 | 11.3% | 2.8% | 6.7%–16% | 10.9% | 1.6% | 8.3%–13.6% | 8.4% | 2.7% | 4.0%–12.9% |
| Ust'-Ishim | 2.9% | 1.2% | 1.0%–4.9% | 6.0% | 0.8% | 4.7%–7.4% | 4.2% | 1.5% | 1.8%–6.6% |
| Kostenki 14 | 3.0% | 1.4% | 0.7%–5.3% | 3.0% | 0.9% | 1.6%–4.5% | 6.2% | 1.6% | 3.6%–8.7% |
| MA1 | 1.5% | 1.5% | 0.0%–4.0% | 3.6% | 1.0% | 1.9%–5.2% | 5.5% | 1.6% | 2.8%–8.2% |
| Loschbour | 1.1% | 1.2% | 0.0%–3.1% | 4.8% | 0.9% | 3.3%–6.2% | 3.6% | 1.5% | 1.2%–6.1% |
| LaBrana | 3.7% | 1.3% | 1.4%–5.9% | 2.4% | 0.9% | 0.9%–3.8% | 4.8% | 1.5% | 2.4%–7.2% |
| Stuttgart | 2.8% | 1.2% | 0.8%–4.8% | 3.4% | 0.9% | 2.0%–4.9% | 3.8% | 1.5% | 1.4%–6.2% |
| Han | 1.0% | 1.3% | 0.0%–3.1% | 2.8% | 0.9% | 1.3%–4.2% | 3.6% | 1.5% | 1.2%–6.1% |
| Dai | 2.1% | 1.2% | 0.2%–4.0% | 1.3% | 0.9% | 0.0%–2.8% | 3.8% | 1.5% | 1.4%–6.2% |
| French | 1.6% | 1.2% | 0.0%–3.5% | 3.3% | 0.9% | 1.9%–4.7% | 2.7% | 1.5% | 0.3%–5.2% |
| Sardinian | 2.7% | 1.2% | 0.8%–4.7% | 2.3% | 0.9% | 0.8%–3.7% | 3.7% | 1.4% | 1.3%–6.1% |
Number of alleles in selected modern humans that are present in at least one Neanderthal genome but are rare or absent in West Africans
The analysis is based on 78,055 sites covered by at least one deaminated fragment in Oase 1. To convert the counts to estimates of ancestry, we subtract the Dinka count as an estimate of the false positive rate and divide by the number of sites covered (as indicated for the whole genome on the bottom). This gives the rate of alleles per screened site on this chromosome for this individual. We then multiply this quantity by 2%/1.32% to recalibrate the 1.32% seen genome-wide in the French to an assumed 2% genome-wide Neanderthal ancestry in the French.
| Chr | Sites | Neanderthal allele counts | Neanderthal ancestry | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Oase 1 | Ust’-Ishim | Kostenki 14 | Han | French | Dinka | Oase 1 | Ust’-Ishim | Kostenki 14 | Han | French | ||
| 1 | 6740 | 323 | 196 | 148 | 129 | 117 | 25 | 6.70% | 3.84% | 2.77% | 2.34% | 2.07% |
| 2 | 7112 | 294 | 145 | 121 | 188 | 199 | 29 | 5.65% | 2.47% | 1.96% | 3.39% | 3.62% |
| 3 | 5417 | 177 | 102 | 96 | 74 | 98 | 28 | 4.17% | 2.07% | 1.90% | 1.29% | 1.96% |
| 4 | 4495 | 359 | 86 | 63 | 141 | 96 | 42 | 10.69% | 1.48% | 0.71% | 3.34% | 1.82% |
| 5 | 4330 | 446 | 108 | 66 | 103 | 95 | 23 | 14.80% | 2.97% | 1.50% | 2.80% | 2.52% |
| 6 | 4549 | 324 | 155 | 167 | 142 | 138 | 73 | 8.36% | 2.73% | 3.13% | 2.30% | 2.16% |
| 7 | 4422 | 147 | 68 | 65 | 102 | 72 | 34 | 3.87% | 1.16% | 1.06% | 2.33% | 1.30% |
| 8 | 4322 | 131 | 132 | 72 | 35 | 38 | 14 | 4.10% | 4.14% | 2.03% | 0.74% | 0.84% |
| 9 | 3107 | 500 | 69 | 120 | 118 | 49 | 15 | 23.65% | 2.63% | 5.12% | 5.02% | 1.66% |
| 10 | 4009 | 147 | 139 | 67 | 131 | 86 | 22 | 4.72% | 4.42% | 1.70% | 4.12% | 2.42% |
| 11 | 4193 | 153 | 93 | 88 | 81 | 73 | 26 | 4.59% | 2.42% | 2.24% | 1.99% | 1.70% |
| 12 | 3456 | 456 | 160 | 54 | 125 | 93 | 10 | 19.55% | 6.58% | 1.93% | 5.04% | 3.64% |
| 13 | 2457 | 96 | 81 | 33 | 54 | 30 | 18 | 4.81% | 3.89% | 0.93% | 2.22% | 0.74% |
| 14 | 2390 | 85 | 27 | 52 | 50 | 52 | 13 | 4.56% | 0.89% | 2.47% | 2.35% | 2.47% |
| 15 | 2327 | 73 | 78 | 47 | 38 | 32 | 5 | 4.43% | 4.75% | 2.73% | 2.15% | 1.76% |
| 16 | 3139 | 90 | 121 | 68 | 43 | 39 | 8 | 3.96% | 5.45% | 2.90% | 1.69% | 1.50% |
| 17 | 2543 | 72 | 89 | 37 | 85 | 75 | 56 | 0.95% | 1.97% | −1.13% | 1.73% | 1.13% |
| 18 | 2305 | 57 | 58 | 59 | 27 | 29 | 5 | 3.42% | 3.48% | 3.55% | 1.45% | 1.58% |
| 19 | 1769 | 79 | 49 | 33 | 43 | 35 | 12 | 5.74% | 3.17% | 1.80% | 2.66% | 1.97% |
| 20 | 2492 | 107 | 29 | 62 | 56 | 43 | 12 | 5.78% | 1.03% | 3.04% | 2.68% | 1.88% |
| 21 | 1026 | 36 | 53 | 22 | 8 | 11 | 10 | 3.84% | 6.35% | 1.77% | −0.30% | 0.15% |
| 22 | 1455 | 79 | 33 | 66 | 34 | 18 | 5 | 7.71% | 2.92% | 6.35% | 3.02% | 1.35% |
|
| ||||||||||||
| All | 78055 | 4231 | 2071 | 1606 | 1807 | 1518 | 485 | 7.27% | 3.08% | 2.18% | 2.57% | “2%” |
| Subtract | Dinka | 3746 | 1586 | 1121 | 1322 | 1033 | 0 | |||||
Figure 2Spatial distribution of alleles matching Neanderthals in modern humans
Colored vertical lines indicate alleles shared with Neanderthals and no color indicates alleles shared with the great majority of West Africans. (O)ase 1, (K)ostenki 14, (U)st’-Ishim, (F)rench, (H)an, and (D)inka. The seven yellow bars indicate segments of putative recent Neanderthal ancestry. This analysis is based on 78,055 sites.
Ancient DNA libraries made from the Oase 1 mandible
| Metainformation | Sequencing results | All fragments | Deaminated fragments | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Library | Extract | UDG treatment | Index 1 | Index 2 | Extract used (μ1) | Sequences going into alignment | Sequences ≥35bp mapped | After dup. removal | Coverage | % C→T 5′end | % C→T 3′end | Coverage | % C→T 5′ end | % C→T 3′end |
| A5227 | E1406 | Yes | ACTTGCG | AACTCCG | 8 | 206,982 | 118,976 | 34,486 | 112 | 8 | 19 | 5 | 19 | 36 |
| A5252 | E1843 | Yes | GTAAGCC | TTGAAGT | 40 | 74,384 | 46,394 | 31,368 | 114 | 7 | 25 | 5 | 18 | 55 |
| A9032 | E1406 | No | ATAACGT | ACTATCA | 6 | 9,321,903 | 5,904,210 | 51,810 | 178 | 20 | 21 | 12 | 31 | 39 |
| A9033 | E1406 | No | AATAGGA | ACCAACT | 6 | 7,932,271 | 4,816,314 | 55,878 | 193 | 21 | 20 | 13 | 36 | 38 |
| A9034 | E1406 | No | ATCACGA | AACTCCG | 6 | 10,422,467 | 6,861,634 | 59,883 | 207 | 20 | 20 | 14 | 35 | 38 |
|
| ||||||||||||||
| All | Both | Mix | 27,958,007 | 17,747,528 | 233,425 | 803 | 17 | 21 | 49 | 30 | 39 | |||
Contamination estimate based on the direct comparison of C→T substitutions of all fragments and deaminated fragments only.
Sequencing metrics on all five libraries for each of four capture probe panels
| Library | Panel | No. target SNPs | Fragments going into alignment | Fragments mapped to genome | Fragments on target after dup. removal and MAPQ37 filter | % SNPs hit at least once | Average coverage on SNPs |
|---|---|---|---|---|---|---|---|
| A9032 | 390k | 393,577 | 10,849,144 | 2,235,955 | 133,564 | 26.5% | 0.34 |
| A9033 | 390k | 393,577 | 17,159,085 | 2,808,704 | 73,824 | 15.9% | 0.19 |
| A9034 | 390k | 393,577 | 16,902,935 | 3,256,438 | 142,520 | 27.7% | 0.36 |
| A5227 | 390k | 393,577 | 63,441,719 | 22,124,247 | 195,161 | 36.0% | 0.5 |
| A5252 | 390k | 393,577 | 60,181,844 | 14,278,978 | 180,626 | 33.3% | 0.46 |
| All 5 | 390k | 393,577 | 168,534,727 | 44,704,322 | 724,653 | 73.0% | 1.84 |
|
| |||||||
| A9032 | 840k | 842,630 | 25,105,625 | 3,801,435 | 178,015 | 17.6% | 0.21 |
| A9033 | 840k | 842,630 | 29,196,969 | 4,655,434 | 183,093 | 17.9% | 0.22 |
| A9034 | 840k | 842,630 | 35,780,652 | 5,968,851 | 200,767 | 19.3% | 0.24 |
| A5227 | 840k | 842,630 | 28,209,496 | 4,276,439 | 152,411 | 15.3% | 0.18 |
| A5252 | 840k | 842,630 | 20,286,540 | 1,630,343 | 106,943 | 11.2% | 0.13 |
| All 5 | 840k | 842,630 | 138,579,282 | 20,332,502 | 818,648 | 51.7% | 0.97 |
|
| |||||||
| A9032 | 1000k | 997,780 | 26,088,835 | 2,964,094 | 159,162 | 13.5% | 0.16 |
| A9033 | 1000k | 997,780 | 26,641,358 | 4,490,372 | 158,614 | 13.3% | 0.16 |
| A9034 | 1000k | 997,780 | 28,795,043 | 4,985,140 | 154,177 | 13.0% | 0.15 |
| A5227 | 1000k | 997,780 | 25,848,311 | 4,395,413 | 71,537 | 6.4% | 0.07 |
| A5252 | 1000k | 997,780 | 25,691,323 | 2,254,636 | 53,932 | 5.0% | 0.05 |
| All 5 | 1000k | 997,780 | 133,064,870 | 19,089,655 | 596,107 | 36.1% | 0.6 |
|
| |||||||
| A9032 | Archaic | 1,749,385 | 19,329,832 | 2,086,208 | 205,095 | 10.0% | 0.12 |
| A9033 | Archaic | 1,749,385 | 24,629,023 | 2,768,355 | 237,818 | 11.4% | 0.14 |
| A9034 | Archaic | 1,749,385 | 31,200,466 | 3,783,805 | 257,351 | 12.2% | 0.15 |
| A5227 | Archaic | 1,749,385 | 27,659,125 | 3,606,375 | 195,356 | 9.6% | 0.11 |
| A5252 | Archaic | 1,749,385 | 31,472,143 | 2,435,080 | 136,637 | 6.8% | 0.08 |
| All 5 | Archaic | 1,749,385 | 134,290,589 | 14,679,823 | 1,022,046 | 34.6% | 0.58 |
|
| |||||||
| A9032 | Combined | 3,801,245 | 81,373,436 | 11,087,692 | 719,146 | 15.5% | 0.19 |
| A9033 | Combined | 3,801,245 | 97,626,435 | 14,722,865 | 698,890 | 15.1% | 0.18 |
| A9034 | Combined | 3,801,245 | 112,679,096 | 17,994,234 | 806,589 | 17.0% | 0.21 |
| A5227 | Combined | 3,801,245 | 145,158,651 | 34,402,474 | 666,195 | 14.2% | 0.18 |
| A5252 | Combined | 3,801,245 | 137,631,850 | 20,599,037 | 531,873 | 11.4% | 0.14 |
| All 5 | Combined | 3,801,245 | 574,469,468 | 98,806,302 | 3,406,685 | 45.5% | 0.90 |
Effect of filtering on amount of nuclear data available for analysis
| All fragments | Deaminated fragments only | ||||||
|---|---|---|---|---|---|---|---|
| Panel | Target SNPs | No. SNPs hit ≥1× | % SNPs hit ≥1× | Average coverage | No. SNPs hit ≥1× | % SNPs hit ≥1× | Average coverage |
| Panels 1–3 | 2,051,902 | 1,038,619 | 50.6% | 1.03 | 271,326 | 13.2% | 0.16 |
| Panel 4 subset | 954,849 | 361,681 | 37.9% | 0.69 | 87,803 | 9.2% | 0.11 |
| Panels 1–4 | 3,801,245 | 1,685,891 | 44.4% | 0.85 | 426,027 | 11.2% | 0.13 |
The Panel 4 subset excludes the sites where only Denisova differs from the African panel. Note: Numbers differ from Extended Data Table 7 because only sites with base quality ≥20 were used.
Genomes merged with the Oase 1 data
| Sample ID | Human | Data type | Mean | UDG-treated |
|---|---|---|---|---|
| Oasel | Modern | Low coverage | Capture | Mix of library types |
|
| ||||
| Vindija | Archaic | Low coverage | 1.3 | No |
| Mezmaiskaya | Archaic | Low coverage | 0.5 | Yes |
| Altai | Archaic | High coverage | 52 | Yes |
| Denisova | Archaic | High coverage | 31 | Yes |
| Kostenki14 | Modern | Low coverage | 2.4 | Mix of library types |
| MA1 | Modern | Low coverage | 1 | No |
| LaBrana | Modern | Low coverage | 3.4 | No |
| Loschbour | Modern | High coverage | 22 | Yes |
| Stuttgart | Modern | High coverage | 19 | Yes |
| Ust’-Ishim | Modern | High coverage | 42 | Yes |
|
| ||||
| DinkaA | Modern | High coverage | 28 | .. |
| FrenchA | Modern | High coverage | 27 | .. |
| PapuanA | Modern | High coverage | 26 | .. |
| SardinianA | Modern | High coverage | 25 | .. |
| HanA | Modern | High coverage | 28 | .. |
| YorubaA | Modern | High coverage | 32 | .. |
| KaritianaA | Modern | High coverage | 26 | .. |
| SanA | Modern | High coverage | 33 | .. |
| MandenkaA | Modern | High coverage | 25 | .. |
| DaiA | Modern | High coverage | 28 | .. |
| MbutiA | Modern | High coverage | 24 | .. |
|
| ||||
| DaiB | Modern | High coverage | 37 | .. |
| FrenchB | Modern | High coverage | 42 | .. |
| HanB | Modern | High coverage | 35 | .. |
| MandenkaB | Modern | High coverage | 37 | .. |
| MbutiB | Modern | High coverage | 37 | .. |
| PapuanB | Modern | High coverage | 42 | .. |
| SanB | Modern | High coverage | 38 | .. |
| SardinianB | Modern | High coverage | 38 | .. |
| YorubaB | Modern | High coverage | 39 | .. |
| KaritianaB | Modern | High coverage | 35 | .. |
| MixeB | Modern | High coverage | 42 | .. |
| AustralianB1 | Modern | High coverage | 42 | .. |
| AustralianB2 | Modern | High coverage | 37 | .. |
| DinkaB | Modern | High coverage | 35 | .. |
Note: For the 25 present-day humans, samples ending with a subscript “A” are from “Panel A” of 11 individuals and samples ending with a subscript “B” are from “Panel B” of 14 individuals. Unless otherwise specified, in the analyses in this study we used Panel B individuals to represent specified present-day human populations.