| Literature DB >> 32722033 |
Wuhe Jike1,2, Mingai Li1, Nicola Zadra3, Enrico Barbaro1, Gaurav Sablok4, Giorgio Bertorelle2, Omar Rota-Stabelli3, Claudio Varotto1.
Abstract
Polyploidization is a frequent phenomenon in plants, which entails the increase from one generation to the next by multiples of the haploid number of chromosomes. While tetraploidization is arguably the most common and stable outcome of polyploidization, over evolutionary time triploids often constitute only a transient phase, or a "triploid bridge", between diploid and tetraploid levels. In this study, we reconstructed in a robust phylogenomic and statistical framework the evolutionary history of polyploidization in Arundo, a small genus from the Poaceae family with promising biomass, bioenergy and phytoremediation species. Through the obtainment of 10 novel leaf transcriptomes for Arundo and outgroup species, our results prove that recurrent demiduplication has likely been a major driver of evolution in this species-poor genus. Molecular dating further demonstrates that the species originating by demiduplication stalled in the "triploid bridge" for evolutionary times in the order of millions of years without undergoing tetratploidization. Nevertheless, we found signatures of molecular evolution highlighting some of the processes that accompanied the genus radiation. Our results clarify the complex nature of Arundo evolution and are valuable for future gene functional validation as well as reverse and comparative genomics efforts in the Arundo genus and other Arundinoideae.Entities:
Keywords: Arundo genus; adaptive evolution; chromosome number evolution; molecular dating; phylogenomics; polyplodization
Year: 2020 PMID: 32722033 PMCID: PMC7432733 DOI: 10.3390/ijms21155247
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1Species tree reconstructions of the Arundo genus from the dataset of 144 nuclear genes. (a) maximum likelihood (ML) topology of the species tree performed by IQ-TREE using the edge-linked optimal partition scheme of the gene supermatrix. Numbers are ML ultrafast bootstrap (percentage), maximum parsimony (MP) non parametric bootstrap (percentage) and posterior probability (fraction) support values. (b) Coalescent reconstruction of the species tree performed by ASTRAL III using the supertree approach on the ML trees of each gene. Dotted lines represent topological connections as ASTRAL does not estimate branch lengths for terminal branches. Numbers are local posterior probabilities for the main quartet topology.
Figure 2Phylogenetic network of the Arundo genus from the nuclear dataset. Values are bootstrap support for each split. Only values above 50% are shown. Branches and bootstrap values in red indicate split support for the grouping of A. micrantha with A. donax varieties, while branches and bootstrap values in blue indicate split support for the grouping of A. micrantha with the A. plinii s.l. clade. (a) Complete network. (b) Detail of the part of the complete network contained in the dotted rectangle.
Figure 3Combined ML and Bayesian inference (BI) cladogram of the Arundinoideae and selected species from PACMAD and BOP clades based on 5 chloroplast intergenic regions. Taxa in red are those used in this study for leaf RNA-Seq. The taxon in purple corresponds to the misidentified A. formosana accession used in Hardion et al. 2014. Branch supports are poster support from BI analysis (first number) and ultrafast bootstrap support from ML analysis (second number). The reference topology is obtained by BI. Numbers in square brackets indicate the support for the alternative ML topology. Cladogram drawn with Tree Graph 2.
Figure 4Chronogram of the Arundo genus based on the set of 144 nuclear genes. Numbers close to nodes indicate posterior support. The scale under the tree is in million years and the 95% CIs are drawn in the color of the node they correspond to in the chronogram. The 95% CI for the Arundo ingroup is above the scale and under the scale for outgroups.
Likelihood and AIC scores for the data set analyzed for each model carried out by the ChromEvol software. ΔAIC is the AIC score difference between each model and the best-fitting one.
| MODEL | Parameters | Ln(Likelihood) | AIC | ΔAIC |
|---|---|---|---|---|
| CONST_RATE_DEMI_EST * | 4 | −14.92 | 37.83 | 0.00 |
| CONST_RATE_DEMI | 3 | −17.83 | 41.67 | 3.83 |
| BASE_NUM | 4 | −20.30 | 48.61 | 10.77 |
| BASE_NUM_DUPL | 5 | −20.18 | 50.36 | 12.53 |
| CONST_RATE | 3 | −29.81 | 65.61 | 27.78 |
| CONST_RATE_NO_DUPL | 2 | −31.15 | 66.30 | 28.47 |
| LINEAR_RATE_DEMI | 5 | −29.61 | 69.22 | 31.39 |
| LINEAR_RATE | 5 | −29.99 | 69.98 | 32.14 |
| LINEAR_RATE_NO_DUPL | 4 | −31.02 | 70.03 | 32.20 |
| LINEAR_RATE_DEMI_EST | 6 | −29.49 | 70.98 | 33.14 |
*: best evolutionary model.
Figure 5Chromosome number evolution and inferred ancestral chromosome state in the genus Arundo, including three outgroups, inferred under Bayesian optimization. For each node, the most probable ancestral haploid chromosome number is reported. Numbers in square brackets are posterior probabilities. Numbers at the tips are the most frequent known haploid chromosome numbers of each species. DP: inferred demipolyploidization event.
Homology-based functional annotation of the 7 genes under positive selection after multiple test correction (FDR: 0.05) according to the sequence alignment-wide method implemented in BUSTED.
| OG ID | Sites | LR | Fdr | TAIR_ID | Name | Description | |
|---|---|---|---|---|---|---|---|
| OG0018377 | 1531 | 39.72 | 2.38 × 10−9 | 2.88 × 10−7 | AT5G47690 |
| PO76/PDS5 cohesin cofactor |
| OG0018357 | 964 | 29.35 | 4.23 × 10−7 | 3.05 × 10−5 | AT4G34100 |
| RING/U-box-superfamily-protein |
| OG0018102 | 258 | 22.17 | 1.53 × 10−5 | 0.000735 | AT5G48300 |
| ADP glucose pyrophosphorylase 1 |
| OG0018342 | 973 | 17.99 | 0.000124 | 0.004471 | AT4G11420 |
| Eukaryotic translation initiation factor 3 subunit A |
| OG0018364 | 899 | 13.89 | 0.000965 | 0.027788 | AT4G16150 |
| Calmodulin-binding; transcription-regulators |
| OG0018157 | 445 | 12.44 | 0.001986 | 0.047669 | AT4G35140 | NA | Transducin/WD40-repeat-like-superfamily-protein |
| OG0018070 | 126 | 12.10 | 0.002359 | 0.048518 | NA | NA | NA |
Homology-based functional annotation of the 7 genes undergoing branch-specific selection intensification after multiple test corrections (FDR: 0.05) according to ABSREL and RELAX. The higher the K value, the higher the predicted entity of selection intensification. Node name abbreviations: adf, A. donaciformis; ama, A. donax var. macrophylla; ami, A. micrantha; hm, H. macra; mc, M. caerulea; pa, P. australis.
| OG ID | Test Nodes | K | LR | FDR | TAIR ID | Name | Description |
|---|---|---|---|---|---|---|---|
| OG0017620 | hm | 4.35 | 6.19 | 0.030 | AT5G19210 | NA | P-loop containing nucleoside triphosphate hydrolases superfamily protein |
| OG0018017 | ami | 3.69 | 10.02 | 0.014 | AT1G64710 | NA | GroES-like zinc-binding alcohol dehydrogenase family protein |
| OG0018069 | adf | 39.10 | 8.39 | 0.014 | AT5G42020 |
| Luminal binding protein involved in polar nuclei fusion during proliferation of endosperm nuclei. |
| OG0018170 | pa | 3.79 | 9.98 | 0.009 | AT2G01680 | NA | Ankyrin repeat family protein |
| OG0018204 | adf | 4.73 | 6.09 | 0.028 | AT5G08530 |
| 51 kDa subunit of mitochondrial complex I |
| OG0018205 | mc | 9.25 | 15.13 | 0 | AT3G15140 |
| Ribonuclease H-like superfamily functioning as siRNA exonuclease. It affects post-transcriptional gene silencing and growth rate /biomass. |
| OG0018357 * | ama, mc | 3.67 | 7.93 | 0.014 | AT4G34100 |
| Involved in cuticular wax biosynthesis. Arabidopsis mutants have leaf waxes nearly pure C24 and C26 acid, weakly glaucous stem surface, and reduced fertility in early flowers. |
a Benjamini-Hochberg multiple test correction. * genes identified as positively selected according to BUSTED.