| Literature DB >> 32637140 |
Jungeun Kim1, Sang-Ho Kang2, Sin-Gi Park3, Tae-Jin Yang4, Yi Lee5, Ok Tae Kim6, Oksung Chung7, Jungho Lee8, Jae-Pil Choi1, Soo-Jin Kwon2, Keunpyo Lee2, Byoung-Ohg Ahn2, Dong Jin Lee3, Seung-Il Yoo3, In-Gang Shin3, Yurry Um6, Dae Young Lee6, Geum-Soog Kim6, Chang Pyo Hong3, Jong Bhak1,7,9,10, Chang-Kug Kim2.
Abstract
Triterpenoid saponins (TSs) are common plant defense phytochemicals with potential pharmaceutical properties. Platycodon grandiflorus (Campanulaceae) has been traditionally used to treat bronchitis and asthma in East Asia. The oleanane-type TSs, platycosides, are a major component of the P. grandiflorus root extract. Recent studies show that platycosides exhibit anti-inflammatory, antiobesity, anticancer, antiviral, and antiallergy properties. However, the evolutionary history of platycoside biosynthesis genes remains unknown. In this study, we sequenced the genome of P. grandiflorus and investigated the genes involved in platycoside biosynthesis. The draft genome of P. grandiflorus is 680.1 Mb long and contains 40,017 protein-coding genes. Genomic analysis revealed that the CYP716 family genes play a major role in platycoside oxidation. The CYP716 gene family of P. grandiflorus was much larger than that of other Asterid species. Orthologous gene annotation also revealed the expansion of β-amyrin synthases (bASs) in P. grandiflorus, which was confirmed by tissue-specific gene expression. In these expanded gene families, we identified key genes showing preferential expression in roots and association with platycoside biosynthesis. In addition, whole-genome bisulfite sequencing showed that CYP716 and bAS genes are hypomethylated in P. grandiflorus, suggesting that epigenetic modification of these two gene families affects platycoside biosynthesis. Thus whole-genome, transcriptome, and methylome data of P. grandiflorus provide novel insights into the regulation of platycoside biosynthesis by CYP716 and bAS gene families.Entities:
Keywords: Plant molecular biology; Secondary metabolism
Year: 2020 PMID: 32637140 PMCID: PMC7327020 DOI: 10.1038/s41438-020-0329-x
Source DB: PubMed Journal: Hortic Res ISSN: 2052-7276 Impact factor: 6.793
Genome assembly and gene prediction of Platycodon grandiflorus
| Parameters | Value |
|---|---|
| Genome assembly | |
| Scaffold number | 4815 |
| Total scaffold length | 680.1 Mb |
| Scaffold N50 value | 277,181 bp |
| Longest scaffold | 1,387,349 bp |
| GC content | 36.2% |
| Gene prediction | |
| Gene number | 40,017 |
| Gene number supported by RNA-Seq | 39,188 |
| Mean gene length | 5051 bp |
| Total length of gene models | 224.8 Mb |
| Exons | |
| Exon number | 212,565 |
| Average exon number per gene | 4.77 |
| Average exon length | 221 bp |
| Introns | |
| Intron number | 168,058 |
| Average intron number per gene | 3.77 |
| Average intron length | 1044 bp |
Fig. 1Evolution of the CYP716 family in P. grandiflorus.
a Expansion and contraction of the CYP716 family in P. grandiflorus based on a time-calibrated phylogeny of six Asterid species. Branch numbers indicate the number of expanded (blue) and contracted (red) gene families after the split from the most recent common ancestor (MRCA). The table (right) shows the number of CYP450 families identified in Asterids analyzed on the left. In the table, the gray shaded regions indicate expansions of the CYP450 family in the MRCA, and red-color numbers indicate expansions specific to the corresponding species. b Heatmap showing the abundance of CYP716 proteins in six Asterids. Asterisks (*) indicate CYP716 proteins that modify triterpene scaffolds. c β-Amyrin modifications in P. grandiflorus. The structure of PubChem (ID: 73145) was modified to use the β-amyrin scaffold. Different reactions involved in the modification of β-amyrin are indicated in red, blue, and green in d. d Heatmap showing the expression of CYP450 genes and known reactions on platycoside scaffolds. Gene expression is presented as log2 fragments per kilobase of transcript per million mapped reads (FPKM) of RNA-Seq data from eight different tissues. Asterisks (*) indicate genes with significantly higher expression in roots than in other tissues (p < 0.001; Fisher’s exact test)
Fig. 2Expansion and diverged expression patterns of triterpenoid saponin biosynthesis (TBS)-related genes in P. grandiflorus.
a, b Phylogeny of the GGPS (a) and bAS (b) gene families in P. grandiflorus. Phylogenetic trees were constructed using the maximum likelihood (ML) method. c Expression profiles of TBS-related genes in P. grandiflorus. Data represent log2 FPKM values of RNA-Seq data generated from eight different tissues, including root, leaf, stem, seed, and four flowers, of plants treated with methyl jasmonate (MJ) for 12, 24, and 48 h (MJ 12 h, MJ 24 h, and MJ 48 h, respectively); CT 12 h indicates the control
Fig. 3DNA methylation landscape of CYP450 family and other TSB-related genes in P. grandiflorus genome.
a Methylation status of the CYP450 family genes in the CG, CHG, and CHH contexts. Based on the distribution of methylation levels of all protein-coding genes (including gene bodies and ~2-kb upstream and downstream regions) in the control sample (CT 12 h_AllGenes; black line), the methylation levels of CYP450 genes were measured in the control treatment (CT 12 h; red) and at three time points in the MJ treatment (MJ 12 h [blue], MJ 24 h [green], and MJ 48 h [orange]). b Density of differentially methylated cytosines (DMCs) in all three contexts in CYP450 and other TSB-related genes between the control and MJ treatments (CT vs. MJ 12 h, CT vs. MJ 24 h, and CT vs. MJ 48 h). c Abundance of CG-DMCs in the top 30% CYP450 and TSB-related genes in the MJ treatment. d Relative abundance of hypermethylated and hypomethylated cytosines (hyper-DMCs and hypo-DMCs, respectively) in the CG context of CYP716s, CYP72s, CYP76s, CYP71s, GGPSs, and bAS in the MJ treatments. The ratio was transformed to the log2 scale. e Relative abundance of hyper-DMCs and hypo-DMCs in the CG context within ~2-kb upstream regions and coding regions of the primary CYP450 and TSB-related genes. f Identification of genes showing inverse correlation between CG methylation levels and gene expression in the MJ treatment. Asterisk (*) indicates a gene with similar DMC distribution in the ~2-kb upstream region and coding region