| Literature DB >> 35498660 |
Rebeca Hernández-Gutiérrez1,2, Cássio van den Berg3, Carolina Granados Mendoza2, Marcia Peñafiel Cevallos4, Efraín Freire M4, Emily Moriarty Lemmon5, Alan R Lemmon6, Susana Magallón2.
Abstract
The economically important cotton and cacao family (Malvaceae sensu lato) have long been recognized as a monophyletic group. However, the relationships among some subfamilies are still unclear as discordant phylogenetic hypotheses keep arising when different sources of molecular data are analyzed. Phylogenetic discordance has previously been hypothesized to be the result of both introgression and incomplete lineage sorting (ILS), but the extent and source of discordance have not yet been evaluated in the context of loci derived from massive sequencing strategies and for a wide representation of the family. Furthermore, no formal methods have been applied to evaluate if the detected phylogenetic discordance among phylogenomic datasets influences phylogenetic dating estimates of the concordant relationships. The objective of this research was to generate a phylogenetic hypothesis of Malvaceae from nuclear genes, specifically we aimed to (1) investigate the presence of major discordance among hundreds of nuclear gene histories of Malvaceae; (2) evaluate the potential source of discordance; and (3) examine whether discordance and loci heterogeneity influence on time estimates of the origin and diversification of subfamilies. Our study is based on a comprehensive dataset representing 96 genera of the nine subfamilies and 268 nuclear loci. Both concatenated and coalescence-based approaches were followed for phylogenetic inference. Using branch lengths and topology, we located the placement of introgression events to directly evaluate whether discordance is due to introgression rather than ILS. To estimate divergence times, concordance and molecular rate were considered. We filtered loci based on congruence with the species tree and then obtained the molecular rate of each locus to distribute them into three different sets corresponding to shared molecular rate ranges. Bayesian dating was performed for each of the different sets of loci with the same parameters and calibrations. Phylogenomic discordance was detected between methods, as well as gene histories. At deep coalescent times, we found discordance in the position of five subclades probably due to ILS and a relatively small proportion of introgression. Divergence time estimation with each set of loci generated overlapping clade ages, indicating that, even with different molecular rate and gene histories, calibrations generally provide a strong prior.Entities:
Keywords: Malvaceae; gene tree congruence; molecular heterogeneity; phylogenetic discordance; phylogenomic dating; species tree
Year: 2022 PMID: 35498660 PMCID: PMC9043901 DOI: 10.3389/fpls.2022.850521
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Figure 1Species trees of Malvaceae derived from two coalescence methods. (A) Species tree from SVDquartets. Numbers associated to nodes represent Bootstrap values. (B) Species tree from ASTRAL. Numbers associated to nodes represent local posterior probabilities (LPP). Red circles indicate relationships with low quartet score (<40%). To visualize similarities between the two analysis, purple lines connect species between trees.
Source of phylogenetic discordance due to introgression and ILS between pairs of taxa.
| Taxa pairs | Subfamily groups | ILS proportion | Non-ILS proportion | BIC2 | BIC1 | dBIC | Total non-ILS prop. (%) |
|---|---|---|---|---|---|---|---|
|
| Byttnerioideae-Brownlowioideae | 0.00 | 1.00 | −30.91 | −18.65 | −12.26 | 3.70 |
|
| Bombacoideae-Brownlowioideae+Dombeyoideae+Tilioideae | 0.00 | 1.00 | −33.99 | −21.12 | −12.87 | 2.78 |
|
| Byttnerioideae-Malvoideae | 0.75 | 0.25 | −45.91 | −35.53 | −10.37 | 1.85 |
|
| Sterculioideae-Malvoideae | 0.50 | 0.50 | −34.70 | −22.46 | −12.23 | 1.85 |
|
| Tilioideae-Malvoideae | 0.50 | 0.50 | −34.70 | −22.46 | −12.23 | 1.85 |
|
| Byttnerioideae-Dombeyoideae | 0.93 | 0.06 | −79.60 | −67.69 | −11.91 | 0.93 |
|
| Byttnerioideae- | 0.93 | 0.06 | −79.60 | −67.69 | −11.91 | 0.93 |
|
| Byttnerioideae-Sterculioideae | 0.93 | 0.06 | −79.60 | −67.69 | −11.91 | 0.93 |
|
| Byttnerioideae-Tilioideae | 0.93 | 0.06 | −79.60 | −67.69 | −11.91 | 0.93 |
|
| Helicterioideae- | 0.97 | 0.03 | −211.37 | −196.92 | −14.45 | 0.93 |
|
| Dombeyoideae-Tilioideae | 0.97 | 0.03 | −207.21 | −194.06 | −13.15 | 0.93 |
|
| Bombacoideae- | 0.97 | 0.03 | −187.35 | −174.36 | −12.99 | 0.93 |
|
| Helicterioideae-Dombeyoideae | 0.96 | 0.04 | −160.88 | −149.11 | −11.77 | 0.93 |
|
| Dombeyoideae-Tilioideae | 0.97 | 0.03 | −228.43 | −216.54 | −11.89 | 0.93 |
|
| Helicterioideae-Tilioideae | 0.97 | 0.03 | −176.27 | −163.27 | −13.00 | 0.93 |
|
| Helicterioideae-Sterculioideae | 0.96 | 0.04 | −166.77 | −152.58 | −14.19 | 0.93 |
|
| Helicterioideae-Brownlowioideae | 0.97 | 0.03 | −172.49 | −160.95 | −11.54 | 0.93 |
|
| Grewioideae-Bombacoideae | 0.87 | 0.13 | −43.04 | −30.94 | −12.10 | 0.93 |
|
| Grewioideae-Malvoideae | 0.50 | 0.50 | −24.16 | −3.71 | −20.45 | 0.93 |
|
| Malvoideae-Bombacoideae | 0.97 | 0.03 | −187.35 | −174.36 | −12.99 | 0.93 |
|
| Byttnerioideae-Malvoideae | 0.50 | 0.50 | −24.16 | −5.16 | −19.00 | 0.93 |
|
| Helicterioideae-Bombacoideae | 0.72 | 0.28 | −246.36 |
| 10.16 | 0.10 |
|
| Helicterioideae-Bombacoideae | 0.79 | 0.21 | −204.46 |
| 10.01 | 0.06 |
|
| Helicterioideae-Malvoideae | 0.84 | 0.16 | −204.18 |
| 10.25 | 0.05 |
Summary of the QuIBL results considering only significant (>10 dBIC) values. For simplicity, we refer to the included species by their genus name. ILS proportion reports the proportion of loci with ILS signal, whereas non-ILS proportion indicates the proportion of loci that show additionally an introgression pattern. Bayesian information criterion values are reported for BIC1 and BIC2, where BIC1 is the ILS-only model, while BIC2 model considers ILS and introgression. dBIC is the difference between BIC2 and BIC1 values and it was considered a measure of significance. Total non-ILS proportion is the number of introgressed loci between the two species in the taxa pairs. Bold numbers correspond to models where ILS was preferred over introgression. For detailed results, see Supplementary Table S4.
Figure 2Summary of the sources of phylogenetic discordance obtained from QuIBL. (A) Species tree with the relationships among Malvaceae subfamilies derived from ASTRAL. Arrows indicate the direction of introgression or ILS events: black arrows represent relative strong introgression (>1% total non-ILS proportion), gray arrows represent relative weak introgression (<1% total non-ILS proportion), and orange arrow represents ILS. (B) Total proportion (%) of loci that show introgression between pairs of subfamilies or groups of subfamilies; relative strong introgression (>1%) shown in bold. See Supplementary Table S4 for detailed results.
Figure 3Molecular rate heterogeneity and age estimates. (A) Molecular rate variance for 123 loci, where the first tercile has lower variance (set1) and the third tercile has a higher variance (set3) as calculated with SortaDate. (B) Crown age estimates derived from five different datasets for Malvaceae and subfamilies, obtained with BEAST2. Circles show mean values, while the bars show 95% highest posterior density (HPD) intervals.
Figure 4Maximum clade credibility tree derived from the concatenated dataset (123 loci) partitioned by set (set1, set2, and set3). Bars associated to age values are the 95% highest posterior density (HPD) intervals.