| Literature DB >> 36254258 |
Andrius J Dagilis1, David Peede1,2,3, Jenn M Coughlan1,4, Gaston I Jofre1, Emmanuel R R D'Agostino1,5, Heidi Mavengere1, Alexander D Tate1, Daniel R Matute1.
Abstract
With the rise of affordable next-generation sequencing technology, introgression-or the exchange of genetic materials between taxa-has become widely perceived to be a ubiquitous phenomenon in nature. Although this claim is supported by several keystone studies, no thorough assessment of the frequency of introgression across eukaryotes in nature has been performed to date. In this manuscript, we aim to address this knowledge gap by examining patterns of introgression across eukaryotes. We collated a single statistic, Patterson's D, which can be used as a test for introgression across 123 studies to further assess how taxonomic group, divergence time, and sequencing technology influence reports of introgression. Overall, introgression has mostly been measured in plants and vertebrates, with less attention given to the rest of the Eukaryotes. We find that the most frequently used metrics to detect introgression are difficult to compare across studies and even more so across biological systems due to differences in study effort, reporting standards, and methodology. Nonetheless, our analyses reveal several intriguing patterns, including the observation that differences in sequencing technologies may bias values of Patterson's D and that introgression may differ throughout the course of the speciation process. Together, these results suggest the need for a unified approach to quantifying introgression in natural communities and highlight important areas of future research that can be better assessed once this unified approach is met.Entities:
Year: 2022 PMID: 36254258 PMCID: PMC9554761 DOI: 10.1002/evl3.294
Source DB: PubMed Journal: Evol Lett ISSN: 2056-3744
Figure 1Distribution of Patterson's D values sampled across the Eukaryote phylogeny of phyla (Hedges et al. 2015), with only phyla with labeled data. The number of records per taxonomic class (n) and number of source studies (s) are listed by the taxon labels, while the distribution of Patterson's D values is displayed on the inset. Significant values of Patterson's D (determined either by P‐value, Z score, or significance stated in source paper) are colored black, while nonsignificant values are colored red.
Summary of Models. Each model was subsequently bootstrapped using our custom phylogenetic bootstrap approach such that each species appeared at most once as either P2 or P3 in the subsampled data
| Model | Outcome | Fixed Effects | Random Effects |
|---|---|---|---|
| Model 1 | Significance (Pat | kingdom/phylum/class (k/p/c) | reference + species pair |
| Model 2 | “ ” | k/p/c + genetic distance (rbh) | “ ” |
| Model 3 | “ ” | k/p/c + genetic distance (coi/its/cytb) | “ ” |
| Model 4 | Magnitude of Pat | k/p/c | “ ” |
| Model 5 | “ “ | k/p/c + genetic distance (rbh) | “ ” |
| Model 6 | “ “ | k/p/c + genetic distance (coi/its/cytb) | “ ” |
| Model 7 | “ “ (only classes with >1 study) | k/p/c | “ ” |
| Model 8 | “ “ (only classes with >1 study) | k/p/c + genetic distance (rbh) | “ “ |
| Model 9 | “ “ (only classes with >1 study) | k/p/c + genetic distance (coi/its/cytb) | “ ” |
| Model 10 | Significance | gen. distance (rbh) * gen. distance to outgroup (rbh) | “ ” |
| Model 11 | Significance | gen. distance (coi/its/cytb) * gen. distance to outgroup (coi/its/cytb) | “ ” |
| Model 12 | Magnitude | gen. distance (rbh) * gen. distance to outgroup (rbh) | “ ” |
| Model 13 | Magnitude | gen. distance (coi/its/cytb) * gen. distance to outgroup (coi/its/cytb) | “ ” |
| Model 14 | Magnitude | gen. distance (rbh) | “ ” |
| Model 15 | Magnitude | gen. distance (coi/its/cytb) | “ ” |
| Model 16 | Magnitude | sequencing type + gen. distance(rbh) | “ ” |
| Model 17 | Magnitude | sequencing type + gen. distance(coi/its/rbh) | “ “ |
Genetic distance, genetic distance between P2 and P3; rbh, calculated using reciprocal best hits between species; coi/its/cytb, calculated using single genes (either ITS, COI, or CYTB); species pair, unique code for each combination of P2 and P3.
Figure 2The relationship between genetic distance and A) Significance of introgression tests and B) Magnitude of Patterson's D. Both relationships are significant in linear mixed models, but phylogenetic bootstrap estimates of effects overlap 0 for significance of Patterson's D (Figure S6) while remaining significant for the magnitude of Patterson's D (Figure S9). Solid lines represent the best fit from mixed models, while dashed lines show naïve linear model fits – accounting for the random effects of species pairs and reference reverses the slope in both cases.
Figure 3Phylogenetically independent clusters of introgression reports versus genetic distance. Reducing the data to 100 clusters of introgression events, we find no significant relationship between genetic distance and either significance or magnitude of Patterson's D. Dashed lines show linear model fits (inclusion of reference as a random effect was precluded by some clusters consisting of reports from many references).
Figure 4Sequencing type and Patterson's D. Studies using reduced representation sequencing (GBS/RAD) report significantly larger values of Patterson's D than studies that use whole genome sequences.