| Literature DB >> 33951239 |
Shyamalika Gopalan1,2, Elizabeth G Atkinson1,3, Laura T Buck4, Timothy D Weaver5, Brenna M Henn1,5,6.
Abstract
Questions surrounding the timing, extent, and evolutionary consequences of archaic admixture into human populations have a long history in evolutionary anthropology. More recently, advances in human genetics, particularly in the field of ancient DNA, have shed new light on the question of whether or not Homo sapiens interbred with other hominin groups. By the late 1990s, published genetic work had largely concluded that archaic groups made no lasting genetic contribution to modern humans; less than a decade later, this conclusion was reversed following the successful DNA sequencing of an ancient Neanderthal. This reversal of consensus is noteworthy, but the reasoning behind it is not widely understood across all academic communities. There remains a communication gap between population geneticists and paleoanthropologists. In this review, we endeavor to bridge this gap by outlining how technological advancements, new statistical methods, and notable controversies ultimately led to the current consensus.Entities:
Keywords: ancient DNA; archaic hominins; genetics; introgression
Mesh:
Substances:
Year: 2021 PMID: 33951239 PMCID: PMC8360192 DOI: 10.1002/evan.21895
Source DB: PubMed Journal: Evol Anthropol ISSN: 1060-1538
Estimates of initial Neanderthal genomic contribution to AMH based only on mtDNA evidence
|
| Model | Citation |
|---|---|---|
| <10% | Effective population size of AMH females is 16,000, and no archaic mtDNA is observed in a modern sample of 5,000 mtDNA sequences |
|
| Up to 25% | Single pulse, panmictic population |
|
| ~0% | No model, examined differences between mtDNA hypervariable regions of Neanderthals and AMHs (pairwise and in MDS space) |
|
| ~0% | Spatially explicit expansion of AMHs |
|
| Up to 24.3% | 1 generation of AMH‐Neanderthal coexistence |
|
| ~0% | 400 generations of AMH‐Neanderthal coexistence |
|
| ~0% | Coalescent simulations of early, late, and no Neanderthal introgression |
|
FIGURE 1The mitochondrial phylogeny of a Sima de los Huesos hominin, four Denisovan, 19 Neanderthal, 5 extant human, and 4 ancient AMH mitochondrial sequences (15,788 aligned base pairs in total) constructed using the neighbor joining method., The branch lengths are proportional to the evolutionary distances computed using maximum composite likelihood. All analyses were conducted in MEGA7. Branch tips are labeled with a sample name, the accession number of the downloaded sequence in brackets, and the approximate date of the specimen., , , , , , , , , , , , , , , , The tree shows that Neanderthal mitochondrial sequences are more highly diverged from extant humans than all AMH (ancient and extant) are from each other. Interestingly, the mitochondrial phylogeny places Neanderthals and AMHs as sister groups to the exclusion of Denisovans and the Sima de los Huesos hominin, as observed previously. This is in contrast to the phylogeny constructed from multiple loci of autosomal DNA, which instead places Neanderthals and Denisovans as sister groups. This discrepancy highlights the fact that inferences of population history based on single loci can be misleading, as they reflect the history of only one gene lineage
Estimates of average Neanderthal admixture (m and/or f)
| Proportion ( | Method used | Citation |
|---|---|---|
| 1.3–2.7% (non‐Africans) | S‐statistic |
|
| 1–4% (non‐Africans) | Parameterized population genetic model fit to D‐statistics, introgression occurring 50–80 ka |
|
| 1.9–3.1% (non‐Africans) | S‐statistic |
|
| 1–1.6% (Europeans) + 0.4–1.0% (“Eastern” non‐Africans) | S‐statistic |
|
| 1.5–2.0% (Europeans) | S‐statistic |
|
| 1.6–2.1% (East Asians and Native South Americans) | ||
| 0.8% (Europeans and East Asians) | LD‐based method |
|
| 1.0–1.3% (Europeans) | Conditional random field‐based model, combining allele matching, sequence divergence, and haplotype length information |
|
| 1.3–1.5% (East Asians) | ||
| 1.0–1.3% (Native Americans) | ||
| 0.1–0.6% (Africans & African Americans) | ||
| 3.4–7.3% (non‐Africans) | Likelihood maximization of parameterized demographic models |
|
| 0.3–2.6% (non‐Africans) | Bayesian approach using whole genome sequences (G‐PhoCS) |
|
| 0.9–1.2% (Western Eurasians) | Conditional random field‐based model |
|
| 1.3–1.5% (East and Central Asians) | ||
| 1.3–1.5% (Native Americans) | ||
| 1.1–1.3% (South Asians) | ||
| 1.4–1.7% Oceanians | ||
| 1.3–6.2% (non‐Africans) | Site frequency spectrum analysis |
|
| 0–1.3% (Oceanians) | ||
| 1.2% (Europeans) | Percentage of genome in putative archaic haplotypes | |
| 1.4% (East Asians) | ||
| 1.2% (South Asians) | ||
| 1.4% (Native Americans) | ||
| 1.2–1.4% (Oceanians) | ||
| 1.8–2.4% (Western Eurasians) | S‐statistic |
|
| 2.3–2.6% (East Asians) | ||
| 2.1% (Western non‐Africans) | F4 statistics |
|
| 2.4% (Eastern non‐Africans) | ||
| 0.8% (Europeans) | Percentage of genome in putative archaic haplotypes |
|
| 0.9% (East Asians) | ||
| 0.9% (South Asians) | ||
| 0.3% (Africans) |
Estimates of average Denisovan admixture (m or f)
| Proportion ( | Method used | Article |
|---|---|---|
| 0% (Eurasians) | S‐statistic |
|
| 1.2–6.8% (Melanesians) | ||
| 3.8–5.8% (Melanesians) | Parameterized population genetic model fit to D‐statistics |
|
| 0.8–6.2% (Oceanians) | S‐statistic |
|
| 0–2.4% (Southeast Asians) | ||
| 0% (South Asians) | ||
| 6% (Melanesians) | Inference of ancestral relationships using allele frequency data (TreeMix), without accounting for Neanderthal admixture |
|
| 2.2–3.8% (Melanesians) | S‐statistic |
|
| <0.1% (East Asians) | ||
| 2.3–3.7% (Melanesians) | Bayesian approach using whole genome sequences (G‐PhoCS) |
|
| 0.1–1.6% (East Asians) | ||
| 0.8% (Oceanians) | Conditional random field‐based model |
|
| 1.9–3.4% (Melanesians) | F4 statistics |
|
| 3.3–5.0% (Oceanians) | Site frequency spectrum analysis |
|
| 0%(Europeans) | Percentage of genome in putative archaic haplotypes |
|
| 0.1% (East Asians) | ||
| 0.1% (South Asians) | ||
| 0.1% (Native Americans) | ||
| 0.2–1.2% (Oceanians) | ||
| 2.8% (Oceanians) | F4 statistics |
|
FIGURE 2Illustrations of various alternative explanations that can often explain or bias genetic patterns that are interpreted as signatures of archaic introgression. Time progresses from top to bottom for all trees. (a) A structured ancestral population is comprised of two distinct ancestries (blue and orange) in distinct demes (dashed circles) that give rise to new demes over time. The two leftmost demes eventually give rise to AMHs, but one of them shares more ancestry with the deme that eventually gives rise to Neandertals and Denisovans. Due to recombination over generations, this ancestry is carried in the second AMH population in short tracts that are highly divergent in sequence from the blue ancestry carried by the first AMH population. This pattern occurs without needing to invoke post‐population split introgression from the archaic hominin. (b) Different gene lineages within individuals and populations can have different evolutionary histories. A concordant gene lineage is one that conforms to the topology of the overall population tree. However, depending on the depth of divergence between the groups and the size of the ancestral population, some fraction of these lineages is expected to be affected by ILS. (c) Balancing selection can maintain highly diverged variants (blue and gray) of a specific genetic trait within a population (dashed circle) on long haplotypes over evolutionary time. Alternatively, if there is no selection acting to maintain variation at a locus, a long, highly diverged tract of ancestry could come from an archaic source. Recent introgression (red arrow) could bring this diverged ancestry into an AMH population, where it would lie on a long ancestral tract because relatively few generations of recombination have occurred. (d) A reference sequence (top) is used to align ancient archaic reads (green) from a sequencing experiment to recover the full sequence. Ancient DNA reads are typically short and contain a relatively high proportion of mismatches, either due to damage or diverged ancestry, compared with the reference. Observed C to T mutations (red) are due to a common form of DNA damage. Real mismatches (blue) can also occur because the archaic individual is usually substantially diverged from the reference, which is based on modern humans. Contaminant sequences from modern humans (orange), even if rare, can be favored by mapping algorithms because those fragments are longer and are more similar to the reference sequence. This leads to a reference‐biased consensus sequence (bottom)