| Literature DB >> 35768506 |
Anders Bergström1, David W G Stanton2,3,4, Ulrike H Taron5, Laurent Frantz4,6, Mikkel-Holger S Sinding7,8,9,10, Erik Ersmark2,3, Saskia Pfrengle11,12, Molly Cassatt-Johnstone13, Ophélie Lebrasseur14, Linus Girdland-Flink15,16, Daniel M Fernandes17,18, Morgane Ollivier19, Leo Speidel20,21, Shyam Gopalakrishnan7, Michael V Westbury5,7, Jazmin Ramos-Madrigal7, Tatiana R Feuerborn7,9,11, Ella Reiter11, Joscha Gretzinger11,22, Susanne C Münzel11, Pooja Swali20, Nicholas J Conard23,24, Christian Carøe7, James Haile14, Anna Linderholm3,14,25,26, Semyon Androsov27, Ian Barnes28, Chris Baumann24,29, Norbert Benecke30, Hervé Bocherens24,31, Selina Brace28, Ruth F Carden32, Dorothée G Drucker24, Sergey Fedorov33, Mihály Gasparik34, Mietje Germonpré35, Semyon Grigoriev33, Pam Groves36, Stefan T Hertwig37,38, Varvara V Ivanova39, Luc Janssens40, Richard P Jennings16, Aleksei K Kasparov41, Irina V Kirillova42, Islam Kurmaniyazov43, Yaroslav V Kuzmin44, Pavel A Kosintsev45, Martina Lázničková-Galetová46, Charlotte Leduc47, Pavel Nikolskiy48, Marc Nussbaumer37, Cóilín O'Drisceoil49, Ludovic Orlando50, Alan Outram51, Elena Y Pavlova52, Angela R Perri53,54, Małgorzata Pilot55, Vladimir V Pitulko41, Valerii V Plotnikov56, Albert V Protopopov56, André Rehazek37, Mikhail Sablin57, Andaine Seguin-Orlando50, Jan Storå58, Christian Verjux59, Victor F Zaibert60, Grant Zazula61,62, Philippe Crombé63, Anders J Hansen7, Eske Willerslev7,64, Jennifer A Leonard65, Anders Götherström3,58, Ron Pinhasi17,66, Verena J Schuenemann11,12,17, Michael Hofreiter5, M Thomas P Gilbert7,67, Beth Shapiro13,68, Greger Larson14, Johannes Krause69, Love Dalén2,3, Pontus Skoglund70.
Abstract
The grey wolf (Canis lupus) was the first species to give rise to a domestic population, and they remained widespread throughout the last Ice Age when many other large mammal species went extinct. Little is known, however, about the history and possible extinction of past wolf populations or when and where the wolf progenitors of the present-day dog lineage (Canis familiaris) lived1-8. Here we analysed 72 ancient wolf genomes spanning the last 100,000 years from Europe, Siberia and North America. We found that wolf populations were highly connected throughout the Late Pleistocene, with levels of differentiation an order of magnitude lower than they are today. This population connectivity allowed us to detect natural selection across the time series, including rapid fixation of mutations in the gene IFT88 40,000-30,000 years ago. We show that dogs are overall more closely related to ancient wolves from eastern Eurasia than to those from western Eurasia, suggesting a domestication process in the east. However, we also found that dogs in the Near East and Africa derive up to half of their ancestry from a distinct population related to modern southwest Eurasian wolves, reflecting either an independent domestication process or admixture from local wolves. None of the analysed ancient wolf genomes is a direct match for either of these dog ancestries, meaning that the exact progenitor populations remain to be located.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35768506 PMCID: PMC9279150 DOI: 10.1038/s41586-022-04824-9
Source DB: PubMed Journal: Nature ISSN: 0028-0836 Impact factor: 69.504
Fig. 1Seventy-two ancient wolf genomes.
a, Sampling locations of ancient wolves and one ancient dhole analysed here, on a base map from Natural Earth (naturalearthdata.com). b, Ages and sequencing coverage of ancient wolves. c, PC1 from a PCA on outgroup f3-statistics plotted against sample age. PCs were calculated from ancient wolves only, with present-day wolves and dogs projected onto the plot. d, Heterozygosity estimates from sampling of two reads at sites ascertained as heterozygous in a coyote. Bars denote 95% CIs from block jackknifing.
Extended Data Fig. 1f-statistics informing on wolf population history.
Bars denote ±1.96 standard errors for f3-statistics, and ±3 standard errors for f4-statistics, estimated from a block jackknife. a) Outgroup f3-statistics quantifying shared genetic drift with a present-day wolf (Fig. S3). b) f4-statistics contrasting affinities to a pre-LGM and a post-LGM Siberian wolf (Fig. S4). c) f4-statistics contrasting affinities to a Siberian and a European pre-LGM wolf (Fig. S6). d) f4-statistics quantifying whether a ~60 ky old Siberian wolf is closer to a contemporaneous European wolf or other individuals (Fig. S7). e) f4-statistics quantifying whether a coyote is closer to a ~100ky old Siberian wolf or later individuals.
Fig. 2One hundred thousand years of wolf population history.
a, Admixture graph fit by qpGraph to selected ancient wolves, with two outlier (|Z| > 3) f-statistics (worst = 3.16). b, Best-fitting qpAdm models for post-LGM and present-day wolves. An ancient dhole was used as the outgroup for Eurasian wolves to capture any unsampled divergent ancestry, while a coyote was used as the outgroup for North American wolves. Bars denote ±1 standard error estimated from a block jackknife. c, FST for pairs of sample groups with mean dates separated by ≤12,500 years. Bars denote ±1.96 standard errors d, MSMC2 results for pairs of male X chromosomes, with sample ages indicated by blue lines. A sharp upwards spike in the curve corresponds to population divergence, with estimated timings indicated by red lines. Example curves for two pairs of wolves (left and middle) and a summary of results for all pairs (right) are shown. kyr, thousand years.
Fig. 3Natural selection in the ancient wolf time series.
a, –log10(P values) (two sided, not adjusted for multiple comparisons) from linear regression for association between allele frequency and sample age. b, Quantile–quantile plot comparing the P values in a to those expected under a uniform distribution (top) and likewise for results from a simulated neutrally evolving population (effective population size (Ne) = 50,000) (bottom). c, Allele observations in ancient wolves and frequencies in present-day populations for lead variants from the three strongest peaks. Bars denote 95% binomial CIs. d, Local P values (from a) and TMRCA inferred using Relate on modern wolves and dogs for the region surrounding IFT88. The genome-wide histogram (quantiles in grey lines) shows that this locus has the most recent TMRCA in the genome.
Selection peaks
| Chr | Start (Mb) | End (Mb) | Description and notes on genes within region |
|---|---|---|---|
| 1 | 103.7 | 103.8 | |
| 2 | 6.77 | 6.84 | |
| 3 | 72.35 | 72.45 | |
| 4 | 32.22 | 32.25 | No genes |
| 6 | 9.85 | 9.95 | |
| 6 | 13.85 | 14.05 | No genes |
| 6 | 43.8 | 43.82 | |
| 7 | 29 | 29.05 | |
| 9 | 2.2 | 2.3 | No genes, lncRNA |
| 9 | 8.95 | 9.6 | |
| 10 | 7.62 | 7.7 | Dog QTL locus associated to drop ears, body mass and other traits. |
| 10 | 7.95 | 8.09 | Dog QTL locus associated to drop ears, body mass and other traits. Human mutations in |
| 10 | 8.14 | 8.24 | Dog QTL locus associated to drop ears, body mass and other traits. |
| 11 | 0.75 | 1.15 | |
| 11 | 56.72 | 56.77 | No genes |
| 15 | 0.1 | 0.5 | Olfactory gene cluster, SLC2A1 is Glucose transporter 1 |
| 15 | 3.92 | 3.98 | No genes |
| 15 | 6.53 | 6.57 | |
| 15 | 13.5 | 13.7 | Three cytochrome P450 enzyme genes, involved in lipid and secondary metabolism |
| 21 | 28.02 | 28.07 | Olfactory gene cluster |
| 22 | 2.8 | 2.92 | lncRNA, just downstream of |
| 25 | 17.4 | 17.56 | |
| 25 | 19.77 | 19.9 | Uncharacterized gene |
| 30 | 2.69 | 2.75 | No genes |
Locations in the genome of regions displaying evidence of natural selection across the wolf time series, with comments on any genes within the region. For a more detailed table see Supplementary Data 3.
Fig. 4The ancestry of dogs.
a, PCA on post-LGM and present-day wolves (X), based on f4-statistics only of the form f4(X,A;B,C), where A, B and C are any of 21 wolves predating 28 ka. Dogs are projected, and coloured by f4(AndeanFox,X;Zhokhov dog 9.5 ka,Tel Hreiz dog 7.2 ka). b, For dogs (X), f4(AndeanFox,X;Zhokhov dog 9.5 ka,Tel Hreiz dog 7.2 ka) horizontally against f4(AndeanFox,X;Belaya Gora wolf 18 ka,Hohle Fels wolf 13 ka) vertically (Pearson’s r = 0.86, P = 3 × 10–23). Bars denote ±1 standard error estimated from a block jackknife. Silhouettes from phylopic.org. c, log10(P values) for qpAdm models fit to dog targets, where a low P value means the model can be rejected. An ancient dhole was used to represent unsampled, divergent ancestry; models including this source have black outlines. Points are jittered horizontally to avoid overlap. d, qpAdm ancestry proportions for dogs, using the Zhokhov (9.5 ka) dog and a present-day Syrian wolf as proxies for eastern and western dog progenitor ancestry, respectively. Bars denote ±1 standard error estimated from a block jackknife. e, Map of early and relevant later dogs and their ancestry proportions as in d. Black crosses indicate the locations of wolves from 25–10 ka that can be rejected as dog progenitors. Base map from the mapdata R package. k, thousand years. f, Admixture graph model of major dog lineage relationships, fit by qpGraph with no outlier f-statistics. Edge lengths are in units of FST (×1,000).
Extended Data Fig. 2Placing dogs into wolf diversity in a ‘pre-LGM f4 PCA’.
PCA on wolves that lived after 25 ka (including present-day), based on profiles of f4-statistics only of the form f4(X,A;B,C), where A, B, and C are wolves that lived prior to 28 kya. Dogs are projected. Dogs are coloured according to the f4-statistic f4(AndeanFox,X;Zhokhov dog 9.5ka,Tel Hreiz dog 7.2ka), with negative values going towards blue and positive values towards red. A few wolves (in colour) and dogs (in black) of particular interests are indicated with text labels. a) PC1 vs PC2 with the full set of wolves. b) PC3 vs PC4 with the full set of wolves. c) PC1 vs PC2 with western Chinese and North American outlier wolves removed. d) PC3 vs PC4 with western Chinese and North American outlier wolves removed.
Extended Data Fig. 3Affinities of dogs to ancient wolves.
a) f4-statistics of the form f4(AndeanFox,X;wolf A,wolf B), quantifying for all individuals X whether they share more drift with wolf A or wolf B. The ages of A and B are indicated with dashed lines, with positive values indicating affinity to the upper individual and negative values indicating affinity to the lower individual. Bars denote ±3 standard errors estimated from a block jackknife.
Extended Data Fig. 4A schematic model of how deep population structure could explain why dogs require ancestry from an outgroup population in qpAdm analyses.
Under this model, there is deep population structure between different wolf populations, including the wolf population that becomes the progenitor of dogs. High rates of gene flow over time largely homogenises the ancestry of all populations, but it does not completely erase the deep structure. If the true dog progenitor population is not sampled, a single-source qpAdm model involving one of the sampled wolf populations will not fit dog ancestry, because dogs do not share all of the genetic drift that has occurred in the history of the sampled population. But if an outgroup population is included as a source in qpAdm, this can account for the ‘missing’ deep ancestry in dogs, and therefore result in a model that fits dog ancestry.
qpWave tests of dog cladality
| Target sets | Individuals | ||||
|---|---|---|---|---|---|
| Eastern dogs | Karelia_Veretye.OL4061, Zhokhov.CGG6, PortauChoix.AL3194, Baikal.OL4223, NewGuineaSingingDog | ||||
| Southwestern dogs | Israel.THRZ02, Iran.AL2571, Israel.ASHQ01, Basenji | ||||
|
|
| ||||
| Ancient small (n=7) | Siberia_UlakhanSular.LOW008, Germany_Aufhausener.AH575, Germany_HohleFels.JK2183, Siberia_BungeToll.CGG29, Siberia_BelayaGora.IN18_016, Yukon_QuartzCreek.SC19.MCJ010, Altai_Razboinichya.AL2744 | ||||
| Ancient large (n=25) | Germany_Aufhausener.AH574, Germany_Aufhausener.AH577, Siberia_Yana.CGG27, Siberia_Badyarikha.CGG34, Alaska_Fairbanks.JAL385, Alaska_Fairbanks.JAL48, Alaska_Fairbanks.JAL65, Alaska_Fairbanks.JAL69, Yukon_HunkerCreek.SC19.MCJ017, Germany_HohleFels.JK2174, Germany_HohleFels.JK2175, Germany_HohleFels.JK2183, Siberia_BungeToll.LOW003, Siberia_UlakhanSular.LOW008, Czechia_Predmosti.PDM100, Alaska_LillianCreek.ALAS_024, Siberia_Tirekhtyakh.VAL_033, Siberia_Badyarikha.VAL_008, Siberia_Ogorokha.VAL_050, Siberia_BelayaGora.IN18_016, Siberia_Tirekhtyakh.CGG32 | ||||
|
|
|
|
|
| |
| Eastern dogs | Ancient small | 0.3667 | 0.9566 | 0.9992 | |
| Southwestern dogs | Ancient small | 0.0229 | 0.8850 | 0.8474 | |
| Eastern+Southwestern | Ancient small | 6.1E-05 | 0.1900 | 0.7610 | |
| Eastern dogs | Ancient large | 0.0656 | 0.5352 | 0.8292 | |
| Southwestern dogs | Ancient large | 0.1622 | 0.8989 | 0.9525 | |
| Eastern+Southwestern | Ancient large | 9.2E-18 | 2.9E-04 | 0.0659 | |
|
|
| ||||
| Base modern | WolfSaudiArabia, WolfSyria, Wolf01Altai, Wolf02Chukotka, Wolf03Bryansk, Wolf04InnerMongolia, Wolf05China, Wolf06Croatia, Wolf07Israel, Wolf19India, Wolf20Iran, Wolf21Italy, Wolf24Portugal, Wolf27Spain, Wolf31Liaoning, Wolf32Xinjiang, Wolf33Xinjiang, Wolf34Shanxi, Wolf35Xinjiang, Wolf36Xinjiang, Wolf37InnerMongolia, Wolf38Shanxi, Wolf39Iberia, Wolf41InnerMongolia, Wolf42Tibet, WolfTibetan01InnerMongolia, WolfTibetan02InnerMongolia, WolfTibetan03QinghaiHighland, WolfTibetan04QinghaiHighland, WolfTibetan05Tibet, WolfTibetan06Tibet, WolfTibetan07Xinjiang, WolfTibetan08Xinjiang, Wolf21-M-02-15Scandinavia, Wolf32-D-05-18Scandinavia | ||||
|
|
|
|
|
|
|
| Eastern dogs | (Sample of n=7 from base) x100 reps | 2.5E-11 | 0.0214 | 0.2643 | 0.9576 |
| Southwestern dogs | (Sample of n=7 from base) x100 reps | 0.0474 | 0.9247 | 0.3226 | 0.9990 |
| Eastern+Southwestern | (Sample of n=7 from base) x100 reps | 6.7E-77 | 1.2E-11 | 3.0E-05 | 0.4611 |
| Eastern dogs | (Sample of n=25 from base) x100 reps | 2.2E-46 | 1.1E-23 | 0.0033 | 0.3526 |
| Southwestern dogs | (Sample of n=25 from base) x100 reps | 7.1E-06 | 0.0525 | 0.0427 | 0.8939 |
| Eastern+Southwestern | (Sample of n=25 from base) x100 reps | 1.0E-100 | 1.0E-100 | 1.2E-54 | 1.0E-06 |
Two different dog target sets, and their union, are tested for cladality relative to reference sets consisting of ancient or modern wolves. From the modern wolves (bottom of table), for each target 100 different reference sets were constructed by randomly sampling either 7 or 25 individuals. The results across these 100 tests are summarised by displaying the mean (on a log-scale) and maximum p-values.
Extended Data Fig. 5Projecting dogs onto present-day wolf population structure.
Principal components analyses performed only on modern wolves, with modern dogs projected.
Extended Data Fig. 6"Ocean plot" searching for the best available wolf match for the ancestry of eastern dogs.
With the Siberian Zhokhov dog (9.5k BP) as the target, each candidate wolf X was added in turn into the rotating qpAdm analysis. When X is not part of the sources, it is placed in the reference list. Models placed within the gray space labelled “Failed” have p-values fall below the lower limit of the plot.
Extended Data Fig. 7"Ocean plot" searching for the best available wolf match for the west Eurasian wolf-related ancestry in western dogs.
With the African Basenji dog as a target, all available post-LGM and present-day wolf genomes X are tested as sources combined with the 9.5k-year old Siberian Zhokhov dog, which is assumed to represent a baseline for the Eastern-related dog progenitor ancestry. When X is not part of the sources, it is placed in the reference list. If a target has a model with p > 0.01, models with a larger number of sources are not plotted. Only four individuals achieve good fits in the two-source model (Zhokhov + X): WolfSyria, Wolf07Israel, Wolf20Iran and Wolf19India. For other individuals, including ancient and present-day European wolves, the two-source model can be rejected, and a three-source model with an unsampled ancestry component (Zhokhov + X + unsampled) is needed to fit the data.