| Literature DB >> 35069621 |
Martha Kandziora1, Petr Sklenář1, Filip Kolář1,2, Roswitha Schmickl1,2.
Abstract
A major challenge in phylogenetics and -genomics is to resolve young rapidly radiating groups. The fast succession of species increases the probability of incomplete lineage sorting (ILS), and different topologies of the gene trees are expected, leading to gene tree discordance, i.e., not all gene trees represent the species tree. Phylogenetic discordance is common in phylogenomic datasets, and apart from ILS, additional sources include hybridization, whole-genome duplication, and methodological artifacts. Despite a high degree of gene tree discordance, species trees are often well supported and the sources of discordance are not further addressed in phylogenomic studies, which can eventually lead to incorrect phylogenetic hypotheses, especially in rapidly radiating groups. We chose the high-Andean Asteraceae genus Loricaria to shed light on the potential sources of phylogenetic discordance and generated a phylogenetic hypothesis. By accounting for paralogy during gene tree inference, we generated a species tree based on hundreds of nuclear loci, using Hyb-Seq, and a plastome phylogeny obtained from off-target reads during target enrichment. We observed a high degree of gene tree discordance, which we found implausible at first sight, because the genus did not show evidence of hybridization in previous studies. We used various phylogenomic analyses (trees and networks) as well as the D-statistics to test for ILS and hybridization, which we developed into a workflow on how to tackle phylogenetic discordance in recent radiations. We found strong evidence for ILS and hybridization within the genus Loricaria. Low genetic differentiation was evident between species located in different Andean cordilleras, which could be indicative of substantial introgression between populations, promoted during Pleistocene glaciations, when alpine habitats shifted creating opportunities for secondary contact and hybridization.Entities:
Keywords: cytonuclear discordance; gene tree discordance; hybridization; incomplete lineage sorting; rapid radiation; workflow
Year: 2022 PMID: 35069621 PMCID: PMC8777076 DOI: 10.3389/fpls.2021.765719
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Species and characteristics of Loricaria.
| Species | Section | Clade | Capitulum position | Distribution |
| Graveoleum | Graveolens | Axilliary | Peru ( | |
| Thyopsis | Unknown | Terminal | Ecuador ( | |
| Thyopsis | Axilliary | Axilliary | Ecuador, Colombia ( | |
| Thyopsis | Axilliary | Axilliary | Peru, Ecuador, Colombia ( | |
| Thyopsis | Axilliary | Axilliary | Ecuador ( | |
| Thyopsis | Axilliary | Axilliary | Ecuador ( | |
| Thyopsis | Axilliary | Axilliary | Ecuador ( | |
| Thyopsis | unknown | Axilliary and terminal | Ecuador ( | |
| Thyopsis | unknown | Axilliary | Colombia ( | |
|
|
|
|
| Peru ( |
| Terminalia | Terminal | Terminal | Colombia ( | |
| Terminalia | Unknown | Terminal | Peru ( | |
| Terminalia | Terminal | Terminal | Peru, Ecuador ( | |
| Terminalia | Terminal | Terminal | Peru ( | |
| Terminalia | Terminal | Terminal | Ecuador ( | |
| Terminalia | Terminal | Terminal | Ecuador ( | |
| Terminalia | Unknown | Terminal | Peru ( | |
|
|
|
|
| Colombia ( |
|
|
|
|
| Bolivia ( |
In bold are highlighted where the species’ sectional assignment does not match their placement in clades and/or position of the capitula.
FIGURE 1Illustration of the different steps employed for the discovery of samples and clades which show introgression to other clades. Each colored box represents a major analysis step, enumerated from (A–E), analysis scripts used are indicated in the respective boxes in parentheses. (A) Assembly of nuclear reads, identification of paralogs, and alignment building. (B) Assembly of plastid reads and alignment building. (C) Filtration of alignments to exclude samples with few assembled exons and sequences that are too short. (Da) Calculation of gene and species trees. (Db) Identification and filtration of single nucleotide polymorphisms and analysis of gene flow. (E) Identification of ILS and introgression. At first, the pipeline follows the solid arrows which results in dataset 1. This dataset 1 is used to follow the dashed-dotted lines in an iterative approach to remove samples that show gene flow, finally resulting in dataset 2. Using dataset 2 and following the dotted line allows to identify if all hybridigenous samples have been detected. (F) Visualization of phylogenetic discordance between phylogenies is done for dataset 1 and dataset 2, respectively.
FIGURE 2Nuclear species tree using ASTRAL III (A) in comparison to the concatenated tree calculated using RAxML-NG (B; dataset 1). Values at nodes in (A) represent local posterior probabilities and in (B) bootstrap values. Samples that differ in their phylogenetic placement are indicated by a dotted line. Red stars indicate samples that got removed from dataset 2. Labeled boxes indicate major clades discussed in the main text. Red unlabeled boxes indicate L. thuyoides samples with a different placement in the plastome tree (see Supplementary Figure 4).
FIGURE 3Frequency of alternative topologies supported by gene trees before (dataset 1; A) and after removing samples that showed gene flow according to Dsuite (dataset 2; B). The relative frequencies of the topologies are shown on the left. On the right, the main topologies are shown that are reduced to clades; the numbers on the branches indicate node numbers.
Summary of all methods that were used to detect samples showing phylogenetic discordance.
| Clade | Discordant taxon | ILS | Cytonuclear discordance | NeighborNet | Introgressed clades (PhyloNet) | Degree of introgression (Dsuite sub-analysis) |
| Ilinissae-clade | x | |||||
|
|
| |||||
| x | ||||||
|
| ||||||
| Leptothamna-clade | x | |||||
|
|
| |||||
| x | ||||||
|
| ||||||
| Complanata-scolopendra-clade | x | – | o | |||
|
|
| |||||
| o | ||||||
|
| ||||||
| x | x | o | 0.07 (2) | |||
|
| ||||||
| – | o | 0.045 (5) | ||||
|
| ||||||
| Thuyoides-clade1 | x | – | ||||
|
|
| |||||
| x | – | 0.06 (5) | ||||
|
|
| |||||
| x | – | x | 0.12 (3) | |||
|
|
| |||||
| x | 0.14 (1) | |||||
|
|
| |||||
| x | o | |||||
|
|
| |||||
| – | 0.10 (4) | |||||
|
|
| |||||
| x | ||||||
|
|
| |||||
| 0.06 (2) | ||||||
|
|
| |||||
| o | ||||||
|
|
| |||||
| o | ||||||
|
| ||||||
| Thuyoides-clade2 | o | |||||
A dash (“–”) indicates that the sample is absent from both phylogenies used for the comparison. The letter “o” indicates the placement in a different clade, the letter “x” indicates a different position within a clade. The degree of introgression using the f–branch statistic in Dsuite is only presented when values were γ > 0.05. In such cases, the number of the sub-analysis in which gene flow was detected is indicated in parentheses.
FIGURE 4(A–E) show the five best inferred networks with a maximum of three reticulation events according to PhyloNet using all gene trees (dataset 2). Phylogenies were reduced to clades and only nodes above a bootstrap value of 70% were considered. Gene flow between lineages is indicated by a blue line leading from two lineages to a third one. The γ-parameter indicates the degree of gene flow between different lineages. AIC values are presented and also provided in Supplementary Table 4.