| Literature DB >> 26380674 |
Haiyan Sun1, Kai Guo2, Shengqiu Feng3, Weihua Zou3, Ying Li3, Chunfen Fan3, Liangcai Peng4.
Abstract
Lignin and flavonoids play a vital role in the adaption of plants to a terrestrial environment. 4-Coumarate: coenzyme A ligase (4CL) is a key enzyme of general phenylpropanoid metabolism which provides the precursors for both lignin and flavonoids biosynthesis. However, very little is known about how such essential enzymatic functions evolve and diversify. Here, we analyze 4CL sequence variation patterns in a phylogenetic framework to further identify the evolutionary forces that lead to functional divergence. The results reveal that lignin-biosynthetic 4CLs are under positive selection. The majority of the positively selected sites are located in the substrate-binding pocket and the catalytic center, indicating that nonsynonymous substitutions might contribute to the functional evolution of 4CLs for lignin biosynthesis. The evolution of 4CLs involved in flavonoid biosynthesis is constrained by purifying selection and maintains the ancestral role of the protein in response to biotic and abiotic factors. Overall, our results demonstrate that protein sequence evolution via positive selection is an important evolutionary force driving adaptive diversification in 4CL proteins in angiosperms. This diversification is associated with adaption to a terrestrial environment.Entities:
Keywords: 4-Coumarate: coenzyme A ligase; diversification; evolution; phylogeny; positive selection
Year: 2015 PMID: 26380674 PMCID: PMC4569036 DOI: 10.1002/ece3.1613
Source DB: PubMed Journal: Ecol Evol ISSN: 2045-7758 Impact factor: 2.912
Figure 1Phylogenetic relationship between 4CL genes from angiosperms based on the neighbor-joining method. The branch lengths are proportional to distances, and the values at the interior nodes are the bootstrap percentages derived from 1000 replicates. The six branches potentially under positive selection are indicated as a, b, c, d, e, and f, respectively.
Log-likelihood values and parameter estimates for the 4CL genes
| Model | p | lnL | Parameter estimates | Positively selected sites |
|---|---|---|---|---|
| M0: one ratio | 1 | −28951.48 | None | |
| Branch specific models | ||||
| Two ratios (branch a) | 2 | −28949.50 | ||
| Two ratios (fixed | 1 | −28949.58 | ||
| Two ratios (branch b) | 2 | −28950.10 | ||
| Two ratios (fix | 1 | −28950.49 | ||
| Two ratios (branch c) | 2 | −28948.26 | ||
| Two ratios (fixed | 1 | −28948.29 | ||
| Two ratios (branch d) | 2 | −28948.76 | ||
| Two ratios (fixed | 1 | −28948.79 | ||
| Two ratios (branch e) | 2 | −28950.04 | ||
| Two ratios (fixed | 1 | −28950.17 | ||
| Two ratios (branch f) | 2 | −28939.38 | ||
| Two ratios (fixed | 1 | −28940.20 | ||
| Sites-specific models | ||||
| M1:neutral (K = 2) | 1 | −28731.92 | Not allowed | |
| M2: selection (K = 3) | 3 | −28731.92 | None | |
| ( | ||||
| M3: discrete (K = 2) | 3 | −28260.60 | None | |
| M3: discrete (K = 3) | 5 | −28136.31 | None | |
| M7: beta | 2 | −28118.00 | Not allowed | |
| M8: beta and | 4 | −28116.60 | None | |
| Branch-site model A | ||||
| Model a | 3 | −28731.43 | None | |
| ( | ||||
| Model fixed | 2 | −28731.55 | Not allowed | |
| ( | ||||
| Model c | 3 | −28719.14 | 82C 291S (at | |
| ( | 379M 423T (at | |||
| Model fixed | 2 | −28724.93 | Not allowed | |
| ( | ||||
| Model d | 3 | −28723.32 | 79V 202S 211S (at | |
| ( | ||||
| Model fixed | 2 | −28726.61 | Not allowed | |
| ( | ||||
| Model f | 3 | −28716.01 | 65L 69E 181I 223L | |
| ( | 234K 239K (at | |||
| Model fixed | 2 | −28720.97 | Not allowed | |
| ( | ||||
Likelihood ratio statistics (2ΔlnL) for testing branch hypothesis
| M0 (one ratio) | Fixed | Fixed | Fixed | Fixed | Fixed | Fixed | |
|---|---|---|---|---|---|---|---|
| 3.96 | 0.16 | ||||||
| 2.76 | 0.78 | ||||||
| 6.44 | 0.06 | ||||||
| 5.44 | 0.06 | ||||||
| 2.88 | 0.26 | ||||||
| 24.2 | 1.64 |
Significant (P < 0.05, χ2 = 3.84).
Extremely significant (P < 0.01, χ2 = 6.63).
Likelihood ratio statistics (2ΔlnL) for testing branch-site hypothesis
| M1a | Branch-site MAa ( | Branch-site MAc ( | Branch-site MAd ( | Branch-site MAf ( | |
|---|---|---|---|---|---|
| MAa | 0.98 (0.61) | 0.24 (0.89) | |||
| MAc | 25.56 (2.82E-07) | 11.58 (6.67E-04) | |||
| MAd | 17.2 (1.84E-04) | 6.58 (1.03E-02) | |||
| MAf | 31.82 (1.23E-07) | 9.92 (7.01E-03) |
Significant (P < 0.05).
Extremely significant (P < 0.01).
Figure 2The deducted amino acid sequence for Arabidopsis At4CL1 referred to in this article. The residues involved in hydroxycinnamate binding are indicated with stars, while those involved in enzymatic functions are labeled with triangles (Hu et al. 2010). The bold-type letters indicate conserved motifs (Gulick 2009), while those on a gray background indicate positively selected sites.