| Literature DB >> 32245969 |
Yi-Cun Chen1,2, Zhen Li3,4, Yun-Xiao Zhao1,2, Ming Gao1,2, Jie-Yu Wang5,6, Ke-Wei Liu7,8,9, Xue Wang1,2, Li-Wen Wu1,2, Yu-Lian Jiao1,2, Zi-Long Xu1,2, Wen-Guang He1,2, Qi-Yan Zhang1,2, Chieh-Kai Liang10, Yu-Yun Hsiao11, Di-Yang Zhang5, Si-Ren Lan5, Laiqiang Huang7,8,9, Wei Xu12, Wen-Chieh Tsai13,14,15, Zhong-Jian Liu16,17,18,19,20, Yves Van de Peer21,22,23,24, Yang-Dong Wang25,26.
Abstract
The laurel family within the Magnoliids has attracted attentions owing to its scents, variable inflorescences, and controversial phylogenetic position. Here, we present a chromosome-level assembly of the Litsea cubeba genome, together with low-coverage genomic and transcriptomic data for many other Lauraceae. Phylogenomic analyses show phylogenetic discordance at the position of Magnoliids, suggesting incomplete lineage sorting during the divergence of monocots, eudicots, and Magnoliids. An ancient whole-genome duplication (WGD) event occurred just before the divergence of Laurales and Magnoliales; subsequently, independent WGDs occurred almost simultaneously in the three Lauralean lineages. The phylogenetic relationships within Lauraceae correspond to the divergence of inflorescences, as evidenced by the phylogeny of FUWA, a conserved gene involved in determining panicle architecture in Lauraceae. Monoterpene synthases responsible for production of specific volatile compounds in Lauraceae are functionally verified. Our work sheds light on the evolution of the Lauraceae, the genetic basis for floral evolution and specific scents.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32245969 PMCID: PMC7125107 DOI: 10.1038/s41467-020-15493-5
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Concatenated- and ASTRAL-based phylogenetic trees.
a Phylogenetic trees based on the concatenated (left) and multi-species coalescent (MSC) methods (right) using nucleotide sequences. Magnoliids are indicated with a gray background. Red stars with labels I, II, III, and IV refer to the discussions on phylogenetic discordances (see text). b Estimated proportions of the 160 single-copy gene trees with different topologies based on nucleotide alignments. The x-axis labels q1, q2, and q3 refer to the quartet support for the main topology (red), the first alternative (blue), and the second alternative (yellow), respectively. The dashed line refers to a proportion of 0.33. c Phylogenetic trees based on the concatenated (left) and MSC methods (right) using amino acid sequences. Interpretation is as in a. d Estimated proportions of the 160 single-copy gene trees based on amino acid sequences. Interpretation is same as b. Source data underlying (a) and (c) are provided as a Source Data file.
Fig. 2Whole-genome duplications in Laurales.
a K age distribution for the whole paranome of L. cubeba. Two K peaks are shown by dotted lines at KS ≈ 0.5 and 0.8 falling in two K ranges highlighted by two gray rectangles in the background from 0.3 to 0.645 and from 0.645 to 1.1, respectively. b K age distributions for anchor pairs of L. cubeba (dark gray histogram and line; peaks represent WGD events) and for one-to-one orthologs between L. cubeba and selected Lauralean species and V. vinifera (colored filled curves of kernel-density estimates; a peak represents a species divergence event). The arrows in different colors indicate under- (to the left) and overestimations (to the right) of the divergence events and point to the K values after corrections of different substitution rates in the three comparisons based on that in L. cubeba (see Methods). c The phylogeny of Laurales and Magnoliales with branch lengths in K units (left) and in absolute divergence time (right). The tree topology and absolute divergence time were retrieved. The K ages and their 95% confidence intervals (CIs) for WGDs identified from the whole paranome (Supplementary Fig. 9) are shown in dots and rectangles, respectively, on the left phylogeny. Without considering species with only one K peak in their K distributions of paralogs, the blue and green rectangles highlight the ranges of 95% CIs of WGDs for the older peaks and the younger peaks, respectively. The red, light blue, and yellow rectangles show the ranges of 95% CIs for three independent WGD events in different lineages of Laurales. Correspondingly, the red, light blue, and yellow bars illustrate the independent WGD events on the right phylogenetic tree with absolute divergence time. The three WGD events occurred at about the same time of the radiation of Lauralean species within ~3 million years. In addition, the green bar denotes a lineage-specific WGD event in C. filiformis and the blue bar denotes the WGD event before the divergence of Laurales and Magnoliales.
Fig. 3The evolution of floral structures in Lauraceae.
a Phylogeny of Lauraceae based on a concatenated sequence alignment of 275 single-copy gene families for 22 species in the Lauraceae. b The variable panicles in Lauraceae (from bottom to up): spikes in Cassytha, spikelike panicles in the Cryptocarya group, cymoses panicles in the Alseodaphne-Phoebe clade and Cinnamomum, pseudo-umbel in Sassafras, and umbels in the Laurus-Litsea clade. c Perianth tube turbinate or suburceolate present in Cryptocarya group and Cassytha. Caryodaphnopsis and Alseodaphne-Phoebe clade appear broadly conical and short perianth tube. Perianth tubes are campanulate, short to nearly absent in the Cinnamomum-Litsea clade. d The Cinnamomum-Litsea clade has unisexual flowers and the other species in Lauraceae have bio-sexual flowers. e The phylogenetic tree of FUWA homologs in different Lauraceae species. f PTL expression in the flower buds of Lauraceae species. The PTL expression level was noted as being consistent with the variation of perianth morphology in Lauraceae. PTL exhibited a higher level of expression in the flower buds of the basic group lineage (Cryptocarya group), which presented an abscission of the perianth tube from the perianth tube encapsulated in fruits. PTL had a lower level of expression in the Litsea-Cinnamomum clade, where the fruit receptacle developed from the perianth tube. g TGACG motif-binding protein family member TGA10 has higher transcriptional expression level in male flowers than that in female flowers from eight unisexual species representing four genera, and TGA10 also has higher expression lever in male flowers comparing with that in bisexual flowers from nine bisexual species representing six genera of Lauraceae. Source data underlying a, e–g are provided as a Source Data file.
Fig. 4Scent biosynthesis in Lauraceae.
Tissue-specific relative expression profiles (red–blue scale) of genes implicated in terpenoid biosynthesis (heat map). Intermediates are shown in black, and the enzymes (Supplementary Table 34) involved at each step are shown in gray. The genes involved in the MEP pathway exhibit a high level of fruit-development-specific expression, which may contribute to the biosynthesis of large amounts of monoterpenes. SesquiTPSs, or the responsible sesquiterpene biosynthesis of flowers, involve the gene expansion of 17 members (full amino acid length >200 aa). MonoTPSs involved in the production of monoterpenes in fruits also show signs of family expansion for 27 members (full amino acid length >200 aa) of L. cubeba. LcuTPSs form a gene cluster in chromosome 8 (Supplementary Fig. 5). MVA pathway mevalonate pathway, MEP pathway mevalonate-independent (deoxyxylulose phosphate) pathway, R root, S stem, L leaves, FL flower, F1 fruit 40 days after full bloom, F2 fruit 70 days after full bloom, F3 fruit 100 days after full bloom, F4 fruit 140 days after full bloom. Source data are provided as a Source Data file.
Fig. 5Phylogeny and functional verification of LcuTPSs.
a Phylogeny of TPSs. Putative full-length TPS proteins (>200 amino acids in length) identified in L. cubeba (Supplementary Table 32) and 10 other sequenced plant genomes (Supplementary Table 33) were subjected to phylogenetic analysis. TPS subfamilies are shown along the circumference of the circle. b Transient overexpression of LcuTPSs in Nicotiana benthamiana leaves. After infiltration, the plants were grown for 2 days and the presence or concentration of the monoterpenes was detected using GC-MS analysis. Data represent the mean ± SDs of three biological replicates. c Identification of enzymatic products after incubating recombinant LcuTPSs proteins with geranyl diphosphate. The recombined enzyme expressed in Escherichia coli was purified by Ni2+ affinity. The volatile terpenes were further analyzed by GC-MS analysis comparison with authentic standards (Supplementary Fig. 18). Source data underling a, b are provided as a Source Data file.