| Literature DB >> 33208749 |
Sang-Ho Kang1, Ramesh Prasad Pandey2,3, Chang-Muk Lee4, Joon-Soo Sim4, Jin-Tae Jeong5, Beom-Soon Choi6, Myunghee Jung7, Daniel Ginzburg8, Kangmei Zhao8, So Youn Won9, Tae-Jin Oh2, Yeisoo Yu6,10, Nam-Hoon Kim6, Ok Ran Lee11, Tae-Ho Lee9, Puspalata Bashyal2, Tae-Su Kim2, Woo-Haeng Lee2, Charles Hawkins8, Chang-Kug Kim9, Jung Sun Kim9, Byoung Ohg Ahn9, Seung Yon Rhee12, Jae Kyung Sohng13.
Abstract
Senna tora is a widely used medicinal plant. Its health benefits have been attributed to the large quantity of anthraquinones, but how they are made in plants remains a mystery. To identify the genes responsible for plant anthraquinone biosynthesis, we reveal the genome sequence of S. tora at the chromosome level with 526 Mb (96%) assembled into 13 chromosomes. Comparison among related plant species shows that a chalcone synthase-like (CHS-L) gene family has lineage-specifically and rapidly expanded in S. tora. Combining genomics, transcriptomics, metabolomics, and biochemistry, we identify a CHS-L gene contributing to the biosynthesis of anthraquinones. The S. tora reference genome will accelerate the discovery of biologically active anthraquinone biosynthesis pathways in medicinal plants.Entities:
Mesh:
Substances:
Year: 2020 PMID: 33208749 PMCID: PMC7674472 DOI: 10.1038/s41467-020-19681-1
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Summary of genome assembly and protein-coding genes in S. tora.
| Contigs | Superscaffolds | |
|---|---|---|
| Numbers | 721 | 13 |
| Total length | 526.3 Mb | 502.6 Mb |
| N50 | 4.03 Mb | 41.7 Mb |
| Longest | 14.9 Mb | 52.7 Mb |
| GC content | 35.45% | 35.45% |
| No. of genes | 45,268 | |
| Mean gene length | 3162 bp | |
| Mean exon length | 217 bp | |
| Mean intron length | 656 bp | |
| lncRNA | 3278 | |
| rRNA | 752 | |
| tRNA | 839 | |
Fig. 1The S. tora genome and comparative genomic analysis.
a The landscape of genome assembly (~502 Mb) and annotation of S. tora. Tracks (from outside) correspond to chromosomes (Chr01–Chr13 on a Mb scale), gene density, repeat density, rRNA density, tRNA density, GC content, and GC skew. Tracks are drawn in nonoverlapping 100-kb sliding windows. The red bars in the rRNA and tRNA tracks represent the maximum density of copies on the scale. b An overview of orthologous and paralogous genes among S. tora, related legumes, and V. vinifera. “Uncertain” indicates homologous genes obtained from BLAST but not found using OrthoMCL. “Species-specific” genes do not have any similarity to genes in the other species based on BLAST and OrthoMCL. c Significantly enriched biological process GO and KEGG categories (specialized metabolism) of expanded gene families in S. tora. d Lineage-specific expansion of the CHS-L gene family in S. tora and four other legumes. The 15 tandemly duplicated gene clusters are ordered and shown on chromosome 7, as well as one gene on chromosome 2.
Fig. 2Analysis of anthraquinone contents and CHS-L gene expression during S. tora seed development.
a Developmental progression of S. tora seeds (Stage 1–Stage 7). b Concentrations of ten anthraquinones during the seven developmental stages of S. tora seeds (mean ± SD, n = 3). Dots represent individual values. c, d Scaled transcript expression profiles (in transcripts per million, TPM) of cluster 3 (554 genes) and cluster 6 (1,513 genes) during seed development and enriched metabolic domains within these two clusters. e Expression analysis of CHS-L genes during seed development. Heatmap represents normalized transcripts per million (TPM) from two biological replicates. S1–S7 represents the seed-development stages of S. tora. The source data underlying Fig. 2b are provided as a Source Data file.
Fig. 3Enzyme assays and MS analysis of anthraquinones.
a (i) Extracted ion chromatograms (EIC) for the compound with the mass 319.08 Da in reaction mixtures containing malonyl-CoA as substrate and chalcone synthase-like (CHS-L9) (STO07G228250), chalcone synthase (CHS) (STO03G058250), or heat-denatured CHS-L9. (ii) EIC for the mass 335.13 Da in reaction mixtures containing 13C3-malonyl-CoA containing CHS-L9 or heat-denatured CHS-L9 enzyme. Dots in the structure represent 13C-labeled carbons. b (i) EIC for the mass 301.07 Da in reaction mixtures containing malonyl-CoA as substrate containing CHS-L9, CHS, or heat-denatured CHS-L9. The inset shows a zoomed region of the EIC chromatogram. (ii) EIC for the mass 317.12 Da in reaction mixtures containing 13C3-malonyl-CoA containing CHS-L9 or heat-denatured CHS-L9 enzyme. c PKS-mediated biosynthetic pathway of anthraquinones. Two pathway intermediates, atrochrysone carboxylic acid and endocrocin anthrone, were produced in the CHS-L9-catalyzed reaction mixture.