| Literature DB >> 33173574 |
Junjie Tao1,2, Zhuan Hao3, Chunhui Huang1,2.
Abstract
Ascorbic acid (AsA) is a widespread antioxidant in living organisms, and plays essential roles in the growth and development of animals and plants as well as in the response to abiotic stress tolerance. The GDP-L-galactose phosphorylase (GGP) is a key regulatory gene in plant AsA biosynthesis that can regulate the concentration of AsA at the transcriptional and translational levels. The function and regulation mechanisms of GGP have been well understood; however, the molecular evolutionary patterns of the gene remain unclear. In this study, a total of 149 homologous sequences of GGP were sampled from 71 plant species covering the major groups of Viridiplantae, and the phylogenetic relationships, gene duplication and molecular evolution analyses of the genes were systematically investigated. Results showed that GGP genes are present throughout the plant kingdom and five shared whole-genome duplications and several lineage-specific whole-genome duplications were found, which led to the rapid expansion of GGPs in seed plants, especially in angiosperms. The structure of GGP genes was more conserved in land plants, but varied greatly in green algae, indicating that GGP may have undergone great differentiation in the early stages of plant evolution. Most GGP proteins had a conserved motif arrangement and composition, suggesting that plant GGPs have similar catalytic functions. Molecular evolutionary analyses showed that GGP genes were predominated by purifying selection, indicating that the gene is functionally conserved due to its vital importance in AsA biosynthesis. Most of the branches under positive selection identified by the branch-site model were mainly in the chlorophytes lineage, indicating episodic diversifying selection may contribute to the evolution of GGPs, especially in the chlorophyte lineage. The conserved function of GGP and its rapid expansion in angiosperms maybe one of the reasons for the increase of AsA content in angiosperms, enabling angiosperms to adapt to changing environments.Entities:
Keywords: AsA; GGP; L-galactose pathway; VTC2; ascorbate; gene duplication; molecular evolution
Year: 2020 PMID: 33173574 PMCID: PMC7640755 DOI: 10.1093/aobpla/plaa055
Source DB: PubMed Journal: AoB Plants Impact factor: 3.276
Figure 1.Ascorbic acid biosynthesis by the L-galactose pathway in plants. Enzymes involved in L-galactose pathway are labelled in the circles, including (1) PGI: glucose-6-phosphate isomerase; (2) PMI: mannose-6-phosphate isomerase; (3) PMM: phosphomannomutase; (4) GMP: GDP-mannose pyrophosphorylase; (5) GME: GDP-mannose-3′,5′-epimerase; (6) GGP: GDP-L-galactose phosphorylase; (7) GPP: L-galactose-1-phosphate phosphatase; (8) GalDH: L-galactose dehydrogenase; (9) GalLDH: L-galactono-1,4-lactone dehydrogenase.
Figure 2.Phylogenetic analyses of plant GGP genes using the Bayesian method. The phylogenetic tree was constructed through the Bayesian method under the GTR+I+G model. Posterior probabilities are labelled near the nodes. The accession number of the GGP gene is listed after the name of the species. Red arrows indicate shared WGDs. Red asterisks (*) indicate lineage-specific WGDs. Black star (★) indicates the branch is identified under episodic diversifying selection by branch-site model. Double black stars (★★) indicate that the branch is still under positive selection after Bonferroni correction.
Figure 3.Phylogenetic relationships, gene structures and conserved protein motifs of plant GGP genes. (A) The Bayesian phylogenetic tree of plant GGP genes. (B) Exon number of corresponding GGP genes. (C) Exon–intron structure of plant GGP genes. (D) The conserved motif composition and distribution of plant GGP proteins. The conserved motifs are displayed in different coloured boxes, and the sequence information for each motif is displayed in the form of seqlog.
PAML branch model analyses to test the variable selective pressure among branches and after gene duplication. aNp: number of estimated parameters. blnL: log-likelihood scores. cdf: degree of freedom. d−2ΔlnL: twice the log-likelihood difference of the models being compared. *P < 0.05; **P < 0.01.
| Model | Npa | lnLb | Parameter estimates | Models compared | dfc | −2ΔlnLd |
|
|---|---|---|---|---|---|---|---|
| A: One-ratio model M0 | 297 | −49065.194200 | ω 0 = 0.09302 | ||||
| B: Two ratios (angiosperm) | 298 | −49063.900889 | ω 0 = 0.09333, ω angiosperm = 0.04308 | B vs. A | 1 | 2.586622 | 0.1078 |
| C: Two ratios (angiosperm I) | 298 | −49063.877071 | ω 0 = 0.09272, ω angiosperm I = 949.49270 | C vs. A | 1 | 2.634258 | 0.1046 |
| D: Two ratios (angiosperm II) | 298 | −49064.939231 | ω 0 = 0.09317, ω angiosperm II = 0.06067 | D vs. A | 1 | 0.509938 | 0.4752 |
| E: Two ratios (eudicots I) | 298 | −49061.885830 | ω 0 = 0.09241, ω eudicots I = 1.65002 | E vs. A | 1 | 6.61674* | 0.0101 |
| F: Two ratios (monocots I) | 298 | −49063.571050 | ω 0 = 0.09264, ω monocots I = 0.20233 | F vs. A | 1 | 3.2463 | 0.0716 |
| G: Two ratios (eudicots II) | 298 | −49065.190239 | ω 0 = 0.09304, ω eudicots II = 0.08908 | G vs. A | 1 | 0.007922 | 0.9291 |
| H: Two ratios (monocots II) | 298 | −49060.105366 | ω 0 = 0.09359, ω monocots II = 0.02125 | H vs. A | 1 | 10.177668** | 0.0014 |