| Literature DB >> 33986115 |
Wenbing Su1, Yi Jing2, Shoukai Lin3, Zhen Yue2, Xianghui Yang1, Jiabao Xu2, Jincheng Wu3, Zhike Zhang1, Rui Xia1, Jiaojiao Zhu4, Ning An4, Haixin Chen2, Yanping Hong1, Yuan Yuan1, Ting Long1, Ling Zhang1, Yuanyuan Jiang1, Zongli Liu1, Hailan Zhang1, Yongshun Gao1, Yuexue Liu1, Hailan Lin3, Huicong Wang1, Levi Yant5, Shunquan Lin6, Zhenhua Liu7.
Abstract
Whole-genome duplication (WGD) plays important roles in plant evolution and function, yet little is known about how WGD underlies metabolic diversification of natural products that bear significant medicinal properties, especially in nonmodel trees. Here, we reveal how WGD laid the foundation for co-option and differentiation of medicinally important ursane triterpene pathway duplicates, generating distinct chemotypes between species and between developmental stages in the apple tribe. After generating chromosome-level assemblies of a widely cultivated loquat variety and Gillenia trifoliata, we define differentially evolved, duplicated gene pathways and date the WGD in the apple tribe at 13.5 to 27.1 Mya, much more recent than previously thought. We then functionally characterize contrasting metabolic pathways responsible for major triterpene biosynthesis in G. trifoliata and loquat, which pre- and postdate the Maleae WGD, respectively. Our work mechanistically details the metabolic diversity that arose post-WGD and provides insights into the genomic basis of medicinal properties of loquat, which has been used in both traditional and modern medicines.Entities:
Keywords: apple tribe; comparative genomics; evolution; polyploidy; triterpene biosynthesis
Mesh:
Substances:
Year: 2021 PMID: 33986115 PMCID: PMC8157987 DOI: 10.1073/pnas.2101767118
Source DB: PubMed Journal: Proc Natl Acad Sci U S A ISSN: 0027-8424 Impact factor: 11.205
Fig. 1.Comparative analysis including newly assembled genomes of loquat and G. trifoliata identified a single WGD at the base of Maleae ∼13.5 to 27.1 Mya. (A) Phylogenetic relationships of loquat and other Rosaceae species. The ML phylogeny was inferred by 661 single-copy orthologs. Species split time based on molecular clock analysis is indicated on each branching node. (B) Intergenomic syntenic analysis between peach, G. trifoliata, apple, and loquat. Genomic regions in peach and G. trifoliata could be aligned with up to two regions in loquat and apple. Two examples in chromosomes 4 and 8 in G. trifoliata are highlighted by colors. (C) Ks distributions for gene pairs from syntenic blocks of G. trifoliata, grape, apple, and loquat. The two predicted WGD events are indicated by dashed lines. Ks around 1.5 indicates the gamma WGD occurred at the base of the eudicots. Ks around 0.16 indicates the more recent WGD specifically occurred in the apple tribe.
Fig. 2.Loquat leaves accumulate exceptionally high levels of ursane-type triterpenes. (A) GC chromatograms show major pentacyclic triterpenes in young and mature leaves of loquat, apple, G. trifoliata, and peach. IS, internal standard (Coprostanol); BA, beta-amyrin; AA, alpha-amyrin; OA, oleanolic acid; UA, ursolic acid; MA, maslinic acid; and CA, corosolic acid. (B) Triterpene content in mature leaves of loquat, apple, G. trifoliata, and peach. Error bars present means ± SE, with three biologically independent replicates; *** presents significant difference at P < 0.001 by two-sided Student’s t test. (C) GC chromatograms show major triterpene profiles in loquat leaf developmental stages. (D) Major triterpene content in mature loquat leaves (n = 3). Error bars present means ± SE with three biologically independent samples.
Fig. 3.Characterization of candidate genes encoding biosynthesis of major triterpenes. (A) Phylogenetic relationship and gene expression profile of OSCs from available Maleae and outgroup species. The ML tree was inferred with OSC proteins. Tree node support (>80%) is indicated by pink dots on branches (1,000 bootstrap replicates). The main subclade with higher gene expression levels was indicated by an arrow. Notably, gene expression levels are comparable within species only. Gene expression levels based on transcript per million values are indicated as a heatmap. (B) EjOSC centric coexpression networks. The top 30 correlated genes were shown in the network. The genes in dashed circles were predicted to encode enzymes. Detailed gene list can be found in Dataset S2. (C) GC-MS chromatograms of major triterpenes extracted from N. benthamiana leaves transiently expressed with candidate genes. (D) The pathway for biosynthesis of major triterpenes in loquat. The thick arrows present dominant flux toward ursane-type triterpenes due to promiscuous functions of OSC, CYP716A, and CYP716C enzymes.
Fig. 4.Syntenic analysis of genes encoding for biosynthesis of major triterpenes in the apple tribe. (A–C) Microsynteny across peach, G. trifoliata, loquat, apple, and pear for EjOSC1 and EjOSC2 in A, for EjCYP716C1 and EjCYP716C2 in B, and for EjCYP716A1 in C. Syntenic blocks are connected by lines. (D) Macrosynteny across peach, G. trifoliata, loquat, apple, and pear for EjCYP716A1. Gt00001404 and EjCYP716A2 formed a monophyletic branch in Fig. 3, indicated by arrows.
Fig. 5.WGD-associated diversification of the metabolic pathway for biosynthesis of major triterpenes before and after the emergence of the apple tribe. Pathway genes in the OSC-centric coexpression network are indicated by colors. Numbers represent correlation rankings based on WGCNA weights to targeted OSCs (reference Datasets S2 and S4). IS, internal standard; AA, alpha-amyrin; UA, ursolic acid; and CA, corosolic acid.