| Literature DB >> 35336617 |
Aidi Zhang1,2, Yuhong Xiong1,2,3, Jing Fang1,2,3, Xiaohan Jiang1,2,3, Tengfei Wang1,2,3, Kangchen Liu1,2,3, Huixiang Peng1,2,3, Xiujun Zhang1,2.
Abstract
Terpenes are organic compounds and play important roles in plant development and stress response. Terpene synthases (TPSs) are the key enzymes for the biosynthesis of terpenes. For Rosaceae species, terpene composition represents a critical quality attribute, but limited information is available regarding the evolution and expansion occurring in the terpene synthases gene family. Here, we selected eight Rosaceae species with sequenced and annotated genomes for the identification of TPSs, including three Prunoideae, three Maloideae, and two Rosoideae species. Our data showed that the TPS gene family in the Rosaceae species displayed a diversity of family numbers and functions among different subfamilies. Lineage and species-specific expansion of the TPSs accompanied by frequent domain loss was widely observed within different TPS clades, which might have contributed to speciation or environmental adaptation in Rosaceae. In contrast to Maloideae and Rosoideae species, Prunoideae species owned less TPSs, with the evolution of Prunoideae species, TPSs were expanded in modern peach. Both tandem and segmental duplication significantly contributed to TPSs expansion. Ka/Ks calculations revealed that TPSs genes mainly evolved under purifying selection except for several pairs, where the divergent time indicated TPS-e clade was diverged relatively anciently. Gene function classification of TPSs further demonstrated the function diversity among clades and species. Moreover, based on already published RNA-Seq data from NCBI, the expression of most TPSs in Malus domestica, Prunus persica, and Fragaria vesca displayed tissue specificity and distinct expression patterns either in tissues or expression abundance between species and TPS clades. Certain putative TPS-like proteins lacking both domains were detected to be highly expressed, indicating the underlying functional or regulatory potentials. The result provided insight into the TPS family evolution and genetic information that would help to improve Rosaceae species quality.Entities:
Keywords: Rosaceae; evolution; expansion; terpene synthases
Year: 2022 PMID: 35336617 PMCID: PMC8953233 DOI: 10.3390/plants11060736
Source DB: PubMed Journal: Plants (Basel) ISSN: 2223-7747
Summary of genome information and TPSs of sequenced Rosaceae species used in this study.
| Subfamily | Species | Release Version * | The Prefix of Gene Symbol | Total Genes | Numbers of TPSs | Numbers of TPSs with Both Domains | Percent of TPSs with Both Domains | Numbers of Already Know TPSs |
|---|---|---|---|---|---|---|---|---|
|
|
| GDR, v2.0 | Pr.pLo | 26,873 | 45 | 38 | 84.44% | 38 (both domains) [ |
|
| GDR, v1.0 | Pr.mir | 26,958 | 10 | 9 | 90.00% | ||
|
| NCBI, v1.0 | Pr.mum | 28,638 | 30 | 27 | 90.00% | ||
|
|
| GDR, v1.0 | Py.bet | 59,552 | 48 | 37 | 77.08% | |
|
| GDR, HFTH1 v1.0 | Ma.dom | 44,677 | 56 | 36 | 64.29% | 55 (all) [ | |
|
| CNGB, v1.0 | Ma.bac | 45,931 | 48 | 33 | 68.75% | ||
|
|
| GDR, v4.0 | Fr.ves | 35,914 | 65 | 43 | 66.15% | |
|
| GDR, v1.0 | Ro.chi | 39,669 | 76 | 57 | 75.00% |
* GDR, Genome Database for Rosaceae; CNGB, China National GeneBank DataBase; NCBI, National Center for Biotechnology Information. Species used for the phylogenetic tree are highlighted in bold.
Figure 1Phylogenetic relationship and distribution of motif/domain of TPSs in three Rosaceae species (P. persica, M. domestica, F. vesca). The phylogenetic tree is shown on the left panel, while conserved motifs, conserved domains, and Pfam domains are shown on the right three panels. The phylogenetic tree from full-length amino acid sequences was constructed using the MEGA with maximum likelihood (ML) method. The conserved motifs were assessed using the online MEME software. The conserved domain was annotated based on the conserved domain database in NCBI, whereas the gene structure and domains were annotated by using the PfamScan tool. The conserved motifs and domains are shaded in different colors. The root nodes of TPS-a, TPS-g, TPS-b, TPS-c, TPS-e, and TPS-f clades are indicated by blue, green, yellow-green, red, benzo, and purple, respectively.
Figure 2The unrooted phylogenetic tree of TPSs and motifs comparison between different TPS clades. (A) The maximum-likelihood phylogenetic tree of the TPS proteins in three Rosaceae species (P. persica, M. domestica, F. vesca). The TPS-a, TPS-g, TPS-b, TPS-c, TPS-e, and TPS-f clades are shaded in blue, green, yellow-green, red, benzo, purple, respectively. (B) The frequency of different motifs among different TPS clades. (C) The seqLogo of ‘DDxxD’ motif in the C-terminal domain of different TPS clades, the bit score represents the information content for each position in the sequence.
Numbers of complete TPSs with both domains from Rosaceae species that were used for the phylogenetic tree of Figure 3.
| TPSs Clade | TPS-a | TPS-b | TPS-g | TPS-c | TPS-d | TPS-e | TPS-f | Total |
|---|---|---|---|---|---|---|---|---|
| Motif | DDXYD | DDVYD | DDIFD | DIDDT | DXDD, DXXD | DDFFD | DDFFD | |
|
| 24 | 5 | 2 | 1 | 0 | 3 | 3 | 38 |
|
| 0 | 0 | 4 | 1 | 0 | 3 | 1 | 9 |
|
| 13 | 5 | 4 | 1 | 0 | 3 | 1 | 27 |
|
| 20 | 3 | 8 | 2 | 0 | 1 | 3 | 37 |
|
| 17 | 4 | 6 | 3 | 0 | 3 | 3 | 36 |
|
| 37 | 9 | 5 | 2 | 0 | 3 | 1 | 57 |
|
| 26 | 7 | 4 | 4 | 0 | 2 | 0 | 43 |
Figure 3Phylogenetic tree of complete TPSs from six representative Rosaceae species. The TPS members were identified from six species including two Prunoideae species (P. persica, P. mume), two Maloideae species (M. domestica, P. betulifolia), and two Rosoideae species (F. vesca, R. chinensis); only those TPSs containing both PF01397 and PF03936 domains were used. The phylogenetic tree from full-length amino acid sequences was constructed using the MEGA with maximum likelihood (ML) method. Representative sequences of TPSs from Vitis vinifera were used as outgroups. The branches of TPS-a, TPS-g, TPS-b, TPS-c, TPS-e, and TPS-f clades are indicated by blue, green, yellow-green, red, benzo, and purple colors, respectively. The lineage-specific expansion of TPSs in Prunoideae, Maloideae, Rosoideae are indicated by yellow squares, grey diamonds, red triangles, respectively. The enlarged phylogenetic tree with bootstrap values is shown in Supplementary Figure S2.
Figure 4Inter-chromosomal relationships of TPSs in P. persica (A), M. domestica (B), and F. vesca (C). Grey lines in the circle indicate the collinear blocks in the P. persica or M. domestica genome. The red lines in the circle highlight the segmental duplicated TPS gene pairs, while the red curve outside the circle indicates the tandemly duplicated TPS gene pairs produced by MCSscanX.
Figure 5Synteny analysis of TPSs among Rosaceae species. Grey lines in the background indicate the collinear blocks between different genomes, while the yellow lines highlight the syntenic TPS gene pairs. The chromosome is indicated by different colored boxes and labeled by Pr.pLo (P. persica), Pr. Mir (P. mira), Ma.dom (M. domestica), Fr.ves (F. vesca), Ro.chi (R. chinensis). Collinear relationships between P. persica-P. mira, P. persica-M. domestica, P. persica-F. vesca, P. persica-R. chinensis, F. vesca-R. chinensis are shown in (A–E), respectively.
Putative functions of TPSs in six Rosaceae species.
| TPSs Clade | TPS-a | TPS-b | TPS-g | TPS-c | TPS-e | TPS-f |
|---|---|---|---|---|---|---|
| Catalytic type | ||||||
|
| a1 (19) *; | b3 (5); | b2 (2); | c2 (1); | c1 (3); | a2 (3); |
|
| a1 (6); | b3 (5); | b2 (4); | c2 (1); | c1 (3); | a2 (1); |
|
| b1 (1); | / | b2 (4); | c2 (1); | c1 (3); | a2 (1); |
|
| a1 (8); | a3 (6); | b2 (4); | c2 (3), c3 (1); | c1 (2); | / |
|
| a1 (10); | a1 (2), a3 (2); | a1 (3), a2 (1); a3 (1); | b1 (2); | c1 (3); | b1 (1); |
|
| a1 (8), a2 (3); | b1 (3); | a1 (2), a3 (1); | a1(1), a3 (1); | c1 (1); | a3 (1); |
|
| a1 (4); | a3 (2); | a1 (1); | b1 (1), b2 (1); | a1 (1); | a3 (1); |
Notes: *, a1 represents the catalytic type of TPS listed in the second row; number in the bracket “(19)” represents family number of this TPS in the corresponding species.
Figure 6Expression pattern of expressed TPSs in the three Rosaceae species (P. persica, M. domestica, F. vesca). Heat mapping of TPSs gene expression in three Rosaceae species. The x-axis represents different samples (ripe fruit, immature fruit, and leaf), the y-axis represents TPSs. There are two replicates for each tissue. The rows and columns were clustered based on row-scale normalized expression values. TPSs clades were shown on the right by different colors; putative TPSs without both domains are indicated by black stars.
Figure 7The expression level of TPSs from clades b, c, g, e, and f in the three Rosaceae species (P. persica, M. domestica, F. vesca). Tissues (immature fruit, leaf) are indicated by different colors.