| Literature DB >> 32117355 |
Giovanni Zecca1, Massimo Labra1, Fabrizio Grassi2.
Abstract
Natural hybridization and introgression are central evolutionary processes in grape genus (Vitis). On the other hand, the interspecific relationships among grapes, the directionality of the inferred admixture events and the parents of hybrids are not yet completely clarified. The grapes are economically important crops characterized by tendrils used to climb on the trees and the fruits harvested by humans especially for the consumption or to produce wines and liquors. The American grapes (ca. 30 species) are recognized as an important resource because they show biotic and abiotic resistances. We analyzed 3,885 genome-wide SNPs from 31 American Vitis species using the TreeMix software combined with the f3 and f4 tests. This approach allowed us to infer phylogenetic relationships and to explore the natural admixture among taxa. Our results confirmed the existence of all hybrid species recognized in literature (V. x champinii, V. x doaniana, V. x novae-angliae, and V. x slavinii), identifying their most likely parent species and provided evidence of additional gene flows between distantly related species. We discuss our results to elucidate the origin of American wild grapes, demonstrating that admixture events have ancient origins. We observe that gene flows have involved taxa currently spread through the southern regions of North America. Consequently, we propose that glacial cycles could have triggered the contact between interfertile taxa promoting local hybridization events. We conclude by discussing the phylogenetic implications of our findings and showing that TreeMix can provide novel insights into the evolutionary history of grapes.Entities:
Keywords: TreeMix; Vitis; climate changes; gene flow; hybrids; introgression; migrations; phylogeny
Year: 2020 PMID: 32117355 PMCID: PMC7025467 DOI: 10.3389/fpls.2019.01814
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Figure 1(A) Linkage disequilibrium (LD) decay in ten grape species. The median r2 calculated in bins of 75 pairwise SNP comparisons was plotted against the mean physical distance of each bin (kb). Sample size for each species is shown in figure. Dotted line indicates r2 = 0.2. (B) Multistep procedure followed to identify the appropriate SNP block size used in TreeMix analysis. The SNP block size tested at each step is shown above plots. The physical distance (kb) between midpoints of consecutive blocks placed on the same chromosome is shown on y-axis while the x-axis shows the number of pairwise distances computed at each step. The dashed red line represents the assumed distance threshold (i.e., 200 kb).
Figure 2(A) The Maximum Likelihood (ML)-tree inferred by TreeMix under the strictly bifurcating model. The scale bar shows ten times the average standard error (s.e.) of the entries in the sample covariance matrix. Drift parameter is shown on the x-axis. (B) Scaled residuals from the fit of the model to the data. Without migration events 91.7% of the variance in relatedness between taxa was explained by the tree. Colors are described in the palette on the right.
Figure 3(A) Maximum Likelihood (ML)-tree inferred by TreeMix allowing eight migration events. Letters A–H refer to migration edges as defined in . Migration H, not confirmed by the f4 statistic after the Holm-Bonferroni correction (see ), is shown using a two-dashed line. Migration arrows are colored according to their weight and colors are explained in the palette on the left. The scale bar shows ten times the average standard error (s.e.) of the entries in the sample covariance matrix. Drift parameter is shown on the x-axis.
Migration edges inferred by TreeMix during the Maximum Likelihood (ML) searches carried out fitting eight gene flow events.
| Migration edge | Origin | Destination | N |
|
|
| p-value | MS | MSE |
|---|---|---|---|---|---|---|---|---|---|
|
| ( | ( | 40 | 0.492 | 0.492 | 0.020 | < < 1.000 E-10 | 69 | – |
|
| ( | ( | 40 | 0.444 | 0.444 | 0.025 | < < 1.000 E-10 | 63 | – |
|
|
| ( | 40 | 0.154 | 0.154 | 0.064 | 0.008 | 26 | 62 § |
|
| ( | ( | 40 | 0.431 | 0.431 | 0.025 | < < 1.000 E-10 | 16 | 23 # |
|
| ( | ( | 40 | 0.347 | 0.347 | 0.055 | 1.333 E-10 | 74 | – |
|
| ( | ( | 40 | 0.488 | 0.487 | 0.018 | < < 1.000 E-10 | 35 | 36 * |
|
| ( | ( | 40 | 0.358 | 0.358 | 0.023 | < < 1.000 E-10 | 32 | – |
|
| ( | ( | 40 | 0.140 | 0.140 | 0.035 | 3.843 E-05 | 10 | – |
Migration edges are labelled with capital letters from A to H. For each migration the number of independent runs (N) used to calculate mean values, the mean weight on the edge (), the jackknife estimate of the weight averaged over the N independent runs () and the jackknife estimate of the standard error averaged over the N independent runs () are shown in the table. The least significant p-value (p-value) recovered by TreeMix during the independent runs is also shown. Overall, results were found to be very stable across different runs (see and ). Origin, the set of species found in the subtree below the origin of the migration edge; Destination, the set of species found in the subtree below the destination of the migration edge; MS, migration support; MSE, extended migration support. Both MS and MSE are defined in . §, MSE value is computed considering as valid destination of migration the set of species shown in the column ‘Destination’ as well as any of its proper subsets with at least 2 elements. #, MSE value is computed considering as valid origin of migration the set of n species specified in the column ‘Origin’ as well as its proper subsets with at least n-3 elements. *, MSE value is computed considering as valid origin of migration the set of species shown in the column ‘Destination’ as well as any of its proper subsets.
Figure 4The figure shows the phylogenetic relationships of grapes referred to this work. Linking lines between corresponding tips are used to highlight changes in tree topology and taxa placements due to the addition of migration events. Left: the strict consensus tree between the two the best-scoring Maximum Likelihood (ML) tree topology obtained from 400 ML searches performed by TreeMix under the strictly bifurcating model. Right: strict consensus tree of the forty ML trees inferred by TreeMix allowing nine migration events. Bootstrap values (500 replicates) are shown for each node.
Test for treeness: f4 statistic.
|
|
| ||||||
|---|---|---|---|---|---|---|---|
|
|
|
|
| Z-score | p-value | H-B | |
|
|
|
|
|
| −1.21 | n.s. | n.s. |
|
|
|
|
| −2.93 | 3.4 x 10-03 | * | |
|
|
|
|
| −3.40 | 6.7 x 10-04 | ** | |
|
|
|
|
|
| −6.73 | 1.7 x 10-11 | **** |
|
| −0.35 | n.s. | n.s. | ||||
|
|
|
|
|
| −5.33 | 9.8 x 10-8 | **** |
|
| −3.85 | 1.2 x 10-4 | ** | ||||
|
|
| 0.03 | n.s. | n.s. | |||
|
| −0.77 | n.s. | n.s. | ||||
|
|
|
|
|
| −3.95 | 7.8 x 10-05 | ** |
|
| 1.35 | n.s. | n.s. | ||||
|
|
|
|
|
| 3.11 | 1.87 x 10-3 | * |
|
| 1.36 | n.s. | n.s. | ||||
|
|
| 3.17 | 1.5 x 10-3 | * | |||
|
| 1.85 | n.s. | n.s. | ||||
|
| 5.63 | 1.8 x 10-8 | **** | ||||
|
|
|
|
|
| 2.10 | 0.035 | n.s. |
|
| 1.86 | n.s. | n.s. | ||||
Tested hypotheses are given in the first column (Hp). Migrations A–H refer to migration edges shown in . When the parentage of a natural occurring hybrid is investigated, the name of the hybrid species is highlighted in bold. Z-scores computed using a block jackknifing procedure and their relative p-values are reported for each test. Statistical significance was further assessed according to Holm-Bonferroni (H-B) correction for multiple testing (n = 18). The interpretation of f4 test is described in Materials and Methods. n.s., not significant. *, p < 0.05; **, p < 0.01; ****, p<< 0.0001.
Admixture f3 statistic and ‘outgroup’ f3 statistic.
|
|
| ||||||
|---|---|---|---|---|---|---|---|
|
|
|
| Z-score | p-value | H-B | ||
|
|
|
|
| 3.29 | n.s. | n.s. | |
|
|
|
| 5.31 | n.s. | n.s. | ||
|
|
|
| 6.25 | n.s. | n.s. | ||
|
|
|
|
| −17.11 | 6.25 x 10-66 | **** | |
|
| 0.90 | n.s. | n.s. | ||||
|
|
|
|
| −17.03 | 2.4 x 10-65 | **** | |
|
| −10.31 | 3.3 x 10-25 | **** | ||||
|
|
|
|
| −13.55 | 4.0 x 10-42 | **** | |
|
|
| ||||||
|
|
|
|
|
|
|
| |
|
|
|
|
| 0.01487 | 10.18 | 4.5 x10-4 | ** |
|
|
|
| 0.00789 | 6.67 | |||
Tested hypotheses are given in the first column (Hp). Migrations A–H refer to migration edges shown in . When the parentage of a natural occurring hybrid is investigated, the name of the hybrid species is highlighted in bold. Z-scores computed using a block jackknifing procedure and their relative p-values are reported for each test. Statistical significance was further assessed according to Holm-Bonferroni (H-B) correction for multiple testing (n = 9). The interpretation of these tests is described in Materials and Methods. |Zdiff|: the absolute value of the difference between the Z-scores obtained testing alternative hypotheses in the outgroup f3 test; n.s., not significant; **, p < 0.01; ****, p<< 0.0001.