| Literature DB >> 35170754 |
Edeline Gagnon1,2, Rebecca Hilgenhof1,2, Andrés Orejuela1,2, Angela McDonnell3, Gaurav Sablok4,5, Xavier Aubriot6, Leandro Giacomin7, Yuri Gouvêa8, Thamyris Bragionis8, João Renato Stehmann8, Lynn Bohs9, Steven Dodsworth10,11, Christopher Martine12, Péter Poczai4,13, Sandra Knapp14, Tiina Särkinen1.
Abstract
PREMISE: Evolutionary studies require solid phylogenetic frameworks, but increased volumes of phylogenomic data have revealed incongruent topologies among gene trees in many organisms both between and within genomes. Some of these incongruences indicate polytomies that may remain impossible to resolve. Here we investigate the degree of gene-tree discordance in Solanum, one of the largest flowering plant genera that includes the cultivated potato, tomato, and eggplant, as well as 24 minor crop plants.Entities:
Keywords: Angiosperms353; Solanaceae; hard polytomy; incomplete lineage sorting; incongruence; multilocus phylogenetic trees; nuclear-plastid discordances; plastomes; short backbone branches; target capture
Mesh:
Year: 2022 PMID: 35170754 PMCID: PMC9321964 DOI: 10.1002/ajb2.1827
Source DB: PubMed Journal: Am J Bot ISSN: 0002-9122 Impact factor: 3.325
Number of species and taxon sampling across major and minor clades of Solanum. Clades are based on groups identified in previous molecular phylogenetic studies (Bohs, 2005; Weese and Bohs, 2007; Stern et al., 2011; Stern and Bohs, 2012; Särkinen et al., 2013; Tepe et al., 2016). Species number for each clade is based on current updated taxonomy in the SolanaceaeSource database (website: solanaceaesource.org, accessed November 2020). The 19 clades sampled in the pruned trees for the principal coordinate analysis in this study are in bold. New associated major clade names are given where applicable. Rows shaded in gray represent major and minor clades belonging to Clade II. The Eastern Hemisphere Spiny clade (EHS, formerly known as Old World spiny clade) comprises almost all the spiny solanums occurring in the eastern hemisphere.
| Minor clade | Associated major clade (Särkinen et al., | New associated major clade (this study) | Species | Sampled species (%) | ||
|---|---|---|---|---|---|---|
| Supermatrix | Plastome (PL) | Target capture (TC) | ||||
|
|
|
|
|
|
| |
| African non‐spiny | M Clade | VANAns | 14 | 5 (36%) | 1 (7%) | — |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| Herpystichum | Potato | 10 | 10 (100%) | — | — | |
| Pteroidea | Potato | 10 | 10 (100%) | 1 (10%) | — | |
| Oxycoccoides | Potato | 1 | 1 (100%) | — | — | |
| Articulatum | Potato | 2 | 2 (100%) | — | — | |
|
|
|
|
|
|
| |
| Anarrhichomenum | Potato | 12 | 8 (82%) | |||
|
|
|
|
|
|
| |
|
|
|
|
|
|
| |
|
|
|
|
|
|
| |
|
|
|
|
|
|
| |
|
|
|
|
|
|
| |
| Nemorense | Nemorense | 4 | 4 (100%) | 1 (25%) | — | |
| Pachyphylla | Cyphomandra | 39 | 32 (82%) | 1 (3%) | — | |
| Cyphomandropsis | Cyphomandra | 11 | 7 (64%) | 1 (9%) | 1 (9%) | |
|
|
|
|
|
|
| |
| Reductum | Geminata | 2 | 2 (100%) | 1 (50%) | — | |
| Brevantherum | Brevantherum | 83 | 29 (35%) | 3 (4%) | — | |
| Gonatotrichum | Brevantherum | 7 | 7 (100%) | 1 (14%) | — | |
| Inornatum | Brevantherum | 5 | 2 (40%) | 1 (20%) | — | |
| Trachytrichium | Brevantherum | 2 | 2 (100%) | — | — | |
|
|
|
|
|
|
| |
| Micracantha | Leptostemonum | 14 | 9 (64%) | 1 (7%) | — | |
|
|
|
|
|
|
| |
| Erythrotrichum | Leptostemonum | 33 | 13 (39%) | 1 (3%) | — | |
| Thomasiifolium | Leptostemonum | 9 | 4 (44%) | 1 (11%) | — | |
| Gardneri | Leptostemonum | 10 | 8 (80%) | 1 (10%) | — | |
|
|
|
|
|
|
| |
| Lasiocarpa | Leptostemonum | 12 | 12 (100%) | — | — | |
|
|
|
|
|
|
| |
| Androceras | Leptostemonum | 16 | 15 (94%) | — | — | |
| Crinitum | Leptostemonum | 23 | 10 (43%) | — | — | |
| Bahamense | Leptostemonum | 3 | 3 (100%) | — | — | |
| Asterophorum | Leptostemonum | 4 | 2 (50%) | — | — | |
| Carolinense | Leptostemonum | 11 | 8 (73%) | 1 (9%) | — | |
| Hieronymi | Leptostemonum | 1 | 1 (100%) | 1 (100%) | — | |
|
|
|
|
|
|
| |
| Campechiense | Leptostemonum | 1 | 1 (100%) | — | — | |
| Crotonoides | Leptostemonum | 3 | 2 (67%) | 1 (33%) | — | |
| Multispinum | Leptostemonum | 1 | 1 (100%) | 1 (100%) | — | |
| Unplaced | Leptostemonum | 9 | 1 (13%) | — | — | |
| TOTALS: | 1228 | 746 (60%) | 140 (11%) | 39 (3%) | ||
Overview of the 10 different analyses conducted across the Sanger supermatrix, plastome (PL), and target capture (TC) datasets. Acronyms indicate how each analysis is referred to in the figures and text. ML = Maximum Likelihood; BI = Bayesian Inference, A353 = Angiosperms353 bait set. See Materials and Methods section for full details.
| Dataset | Taxon and genomic sampling | Phylogenetic method | Partitioning scheme | Acronym |
|---|---|---|---|---|
| Supermatrix | 746 taxa, 9 loci | ML: RaxML | — | Supermatrix ML |
| BI: Beast2 | — | Supermatrix BI | ||
| Plastome (PL) | 151 taxa, full + partial plastomes | ML: IQ‐TREE2 | Unpartitioned | PL‐151‐UP |
| 151 taxa, full + partial plastomes | ML: IQ‐TREE2 | Best‐Partition scheme | PL‐151‐BP | |
| 125 taxa, full plastomes only | ML: IQ‐TREE2 | Unpartitioned | PL‐125‐UP | |
| 125 taxa, full plastomes only | ML: IQ‐TREE2 | Best‐Partition scheme | PL‐125‐BP | |
| Target capture (TC) (A353) | 40 taxa, 338 exons | ML: IQ‐TREE2 | — | TC‐min04‐ML |
| 40 taxa, 338 exons | Coalescent: ASTRAL‐III | — | TC‐min04‐ASTRAL‐III | |
| 40 taxa, 303 exons | ML: IQ‐TREE2 | — | TC‐min20‐ML | |
| 40 taxa, 303 exons | Coalescent: ASTRAL‐III | — | TC‐min20‐ASTRAL‐III |
Figure 1Supermatrix phylogeny from Maximum Likelihood analysis (RaxML) of 742 Solanum species based on two nuclear and seven plastid regions. Bootstrap branch support values are color‐coded: black = strong (0.95–1.0), white = moderate to weak support (0.75–0.94). Dashed lines in phylogeny indicate relationships that were not recovered in the TC and PL analyses (see Figures 2, 3). Clade names refer to major and minor clades discussed in the text (see Table 1).
Figure 2Comparison of Solanum clades recovered in plastome (PL) and target‐capture (TC) phylogenomic datasets. (A) Plastome phylogeny from the unpartitioned maximum likelihood analysis (PL‐151‐UP) based on 160 loci representing exons, introns and intergenic regions; (B) Filtered supertree network of the TC dataset (min20) based on 303 gene trees with a 50% minimum tree threshold. (C) TC phylogeny with 40 species from coalescent analysis (TC‐min20‐ASTRAL‐III). Clades are shown in the same color in all three phylogenies to enable comparison. Branch support values (BS values in (A) and local PP values in (C)) are color coded: black = strong (0.95–1.0), white = moderate to weak (0.75–0.94). Scale bars = substitutions/site. Collection or GenBank numbers are indicated in the PL phylogeny for duplicate species sampled in the phylogenetic trees.
Figure 3Comparison of Solanum clades recovered in the three different datasets. (A) TC ASTRAL‐III phylogeny of the min20 dataset, with local posterior probabilities indicated at nodes; (B) ML and BI phylogenies of supermatrix dataset, with bootstrap support and posterior probabilities indicated at nodes; (C) TC ML phylogeny of the min20 dataset, with local posterior probabilities indicated at nodes; (D) PL ML phylogenies of the unpartitioned and best partition‐scheme of the 151 taxa dataset, with bootstrap for each respective analysis is indicated at nodes; (E) TC ML phylogeny and ASTRAL‐III phylogeny of the min04 dataset, with bootstrap support and local posterior probabilities indicated at nodes; (F) PL ML phylogenies of the unpartitioned and best partition‐scheme of the 125 taxa dataset, with bootstrap for each respective analysis indicated at nodes.
Figure 4Discordance analyses within and between the plastome (PL) and target capture (TC) phylogenomic datasets across Solanum. Rooted TC ASTRAL‐III phylogeny (left) and PL IQ‐TREE2 phylogeny (right) with gene concordance factor (gCF) and site concordance factor (sCF) values shown as pie charts, above and below each node respectively; the PL topology is the unpartitioned ML analysis of 151 taxa, whereas the TC topology is based on the analysis of 40 taxa and 303 genes recovered from the A353 bait set. Both trees have been pruned to retain a single tip for each of the major and minor clades present within the PL and TC datasets. For gCF pie charts, blue represents proportion of gene trees concordant with that branch (gCF), green is proportion of gene trees concordant for 1st alternative quartet topology (gDF1), yellow support for 2nd alternative quartet topology (gDF2), and red is the gene discordance support due to polylphyly (gDFP). For the sCF pie charts: blue represents proportion of concordance across sites (sCF), green support for 1st alternative topology (quartet 1), and yellow support for 2nd alternative topology (quartet 2) as averaged over 100 sites. Percentages of gCF and sCF are given above branches, in bold. Branch support (local posterior probability) values ≥0.95 are not shown, and 0.94 and below are shown in italic grey, on the right; double‐dash (‐‐) indicates that the branch support was unavailable due to rooting of the phylogenetic tree.