| Literature DB >> 29604169 |
Nam-Hoon Kim1, Murukarthick Jayakodi1, Sang-Choon Lee1, Beom-Soon Choi2, Woojong Jang1, Junki Lee1, Hyun Hee Kim3, Nomar E Waminal1,3, Meiyappan Lakshmanan4, Binh van Nguyen1, Yun Sun Lee1, Hyun-Seung Park1, Hyun Jo Koo1, Jee Young Park1, Sampath Perumal1, Ho Jun Joh1, Hana Lee1, Jinkyung Kim1, In Seo Kim1, Kyunghee Kim1, Lokanand Koduru5, Kyo Bin Kang6, Sang Hyun Sung6, Yeisoo Yu2, Daniel S Park7, Doil Choi1, Eunyoung Seo1, Seungill Kim1, Young-Chang Kim8, Dong Yun Hyun9, Youn-Il Park10, Changsoo Kim11, Tae-Ho Lee12, Hyun Uk Kim13, Moon Soo Soh14, Yi Lee15, Jun Gyo In16, Heui-Soo Kim17, Yong-Min Kim18, Deok-Chun Yang19, Rod A Wing20, Dong-Yup Lee4,5, Andrew H Paterson21, Tae-Jin Yang1.
Abstract
Panax ginseng C. A. Meyer, reputed as the king of medicinal herbs, has slow growth, long generation time, low seed production and complicated genome structure that hamper its study. Here, we unveil the genomic architecture of tetraploid P. ginseng by de novo genome assembly, representing 2.98 Gbp with 59 352 annotated genes. Resequencing data indicated that diploid Panax species diverged in association with global warming in Southern Asia, and two North American species evolved via two intercontinental migrations. Two whole genome duplications (WGD) occurred in the family Araliaceae (including Panax) after divergence with the Apiaceae, the more recent one contributing to the ability of P. ginseng to overwinter, enabling it to spread broadly through the Northern Hemisphere. Functional and evolutionary analyses suggest that production of pharmacologically important dammarane-type ginsenosides originated in Panax and are produced largely in shoot tissues and transported to roots; that newly evolved P. ginseng fatty acid desaturases increase freezing tolerance; and that unprecedented retention of chlorophyll a/b binding protein genes enables efficient photosynthesis under low light. A genome-scale metabolic network provides a holistic view of Panax ginsenoside biosynthesis. This study provides valuable resources for improving medicinal values of ginseng either through genomics-assisted breeding or metabolic engineering.Entities:
Keywords: zzm321990Panax ginsengzzm321990; adaptation; evolution; ginsenosides; metabolic network
Mesh:
Substances:
Year: 2018 PMID: 29604169 PMCID: PMC6181221 DOI: 10.1111/pbi.12926
Source DB: PubMed Journal: Plant Biotechnol J ISSN: 1467-7644 Impact factor: 9.803
Panax ginseng genome assembly and gene annotation parameters
| Genome assembly | |
| Number of scaffolds | 9845 |
| Total length of scaffolds (bp) | 2 984 993 682 |
| N50 of scaffold (bp) | 569 017 |
| Longest scaffold (bp) | 3 641 815 |
| GC content (%) | 32 |
| Gene annotation | |
| Number of genes | 59 352 |
| Total coding sequence length (bp) | 66 481 566 |
| Mean gene length (bp) | 4394 |
| Mean number of exon per gene | 5 |
| Mean exon length (bp) | 242 |
| Average CDS length (bp) | 1120 |
| Maximum gene length (bp) | 93 383 |
| Average intergenic region length (bp) | 37 601 |
| Number of long noncoding RNAs (lncRNAs) | 19 495 |
| Number of conserved miRNAs | 451 |
Figure 1Panax ginseng genome structure and evolution. (a) Relationship between four paralogous blocks resulting from two WGD events. Block 1 composed of Pg_scaffold0266 and Pg_scaffold2259, Block 2 contained Pg_scaffold0762 and Pg_scaffold0978, Block 3 had just one scaffold, Pg_scaffold0701, and Block 4 comprised reverse of Pg_scaffold2798 and Pg_scaffold1958. (b) FISH analysis to confirm the chromosomal locations of scaffolds inferred to be adjacent by zigzag alignment from counterpart scaffolds. FISH probes designed to validate adjacent scaffolds, and applied to pachytene chromosomes indicated that Pg_scaffold0266 and Pg_scaffold2259 (i, ii) were adjacent but Pg_scaffold0762 and Pg_scaffold0978 were on different chromosomes (iii). (c) Evolutionary history of P. ginseng. The pale blue triangles signify species number in the Araliaceae (1500) and Apiaceae (3700). (d) Construction of 18 virtual superscaffolds based on Daucus carota. The artificial counterpart superscaffolds of P. ginseng were twice the number of the corresponding D. carota superscaffolds, because of Pg‐α WGD. (e) Syntenic analysis between P. ginseng and D. carota. The seven scaffolds described illustrated chromosomal rearrangements relative to two D. carota regions. (f) Circular map of 18 virtual superscaffolds of P. ginseng and distribution of SNPs with cv. YuP (A), repeats (B) and genes (C). Total identified repeats (red lines) and major LTR‐RT family, PgDel (blue lines).
Figure 2Evolutionary model for the Panax genus. (a) Phylogenetic tree based on complete chloroplast genome sequences. Estimated divergence times (MYA) are at the roots of branch extensions for P. stipuleanatus (Ps), P. notoginseng (Pn), P. vietnamensis (Pv), P. japonicas (Pj), P. trifolius (Pt), P. ginseng (Pg), P. quinquefolius (Pq) and Aralia elata (Ae), respectively. (b) Bar charts for genome size and estimated genome proportions of five major repeats. Estimated genome sizes depict the predicted amounts of PgDel (red), PgTat (blue), PgAthila (yellow), PgTork (green) and PgSire (orange) LTR‐RT families (c) Homeologous scaffolds between P. ginseng and P. notoginseng. A total of nine P. notoginseng scaffolds matched two counterpart P. ginseng scaffolds. The listed P. notoginseng are scaffold11410, scaffold32646, scaffold10403, scaffold1534, scaffold27642, scaffold10246, scaffold1534, scaffold31849 and scaffold10465 in an order from left to right (d) Ks distribution of orthologous genes between P. ginseng and P. notoginseng. The peak at 0.035 suggests 2.62 MYA divergence time between these species, similar to chloroplast genome‐based estimation (Figure 2a). (e) FISH analysis of Ty3/gypsy (PgDel1 and PgTat1) and Ty1/copia (PgTork) LTR retrotransposons to show their differential abundance. (f) Two rounds of intercontinental species migration. Solid lines indicate current habitats, whereas dotted lines indicate inferred past habitats and migrated vestiges. Blue lines indicate first migration of diploid Panax, and yellow lines indicate migration of tetraploid Panax.
Figure 3Characterization of Pg harbouring extraordinary Pg167TR in P. ginseng. (a) Idiogram of three Pg elements showing two transposase domains, transposase 21 (T21) and transposase 24 (T24), and different Pg167TR copy numbers (blue arrows). The Pg167TR sequences were often inserted into the last intron of T24. Subterminal repeats were longer at the 3’ end (red arrows), and 31‐bp TIRs were highly conserved. (b) Quantification of genomic Pg167TR within Panax species. Genome representation (GR) of Pg167TR showing the abundance in tetraploid Panax species. (c) FISH analysis with Pg domain (i, iii) and Pg167TR (ii, iv) in P. notoginseng (i, ii) and P. ginseng (iii, iv). Pg gene regions showed clear signals in both diploid and tetraploid species (i, iii), while Pg167TR showed very faint signals in P. notoginseng (ii) but showed highly abundant and distinct signals in P. ginseng (iv).
Figure 4Ginsenoside biosynthesis model and related genes in P. ginseng. (a) Overview of the ginsenoside biosynthetic pathway in P. ginseng. The blue coloured uridine 5′‐diphospho‐glucuronosyltransferases (UDP‐glucuronosyltransferase, UGTs) are unknown enzymes involved in the glycosylation of ginsenosides. Reaction and metabolite abbreviations can be found in Data S1. (b) A phylogenetic tree of oxidosqualene cyclases (OSCs) in P. ginseng. OSC genes, including dammarenediol synthase (DDS), β‐amyrin synthase (β‐AS), lanosterol synthase (LSS) and cycloartenol synthase (CAS), were identified from P. ginseng (red), D. carota (black), S. lycopersicum (green), A. thaliana (blue) and V. vinifera (cyan) by KEGG and BLASTP searches. (c) Heatmap shows TMM normalized expression values of putative downstream genes involved in ginsenosides biosynthesis. Expression in above‐ground tissue (S1: immature fruit, S2: mature fruit, S3: flower, S4: 1‐year‐old leaves, S5: 5‐year‐old leaves, S6: 6‐year‐old stem) and subterranean parts (R1: 1‐year‐old main body roots, R2: 6‐year‐old main body roots, R3: 6‐year‐old lateral roots, R4: 6‐year‐old rhizomes, R5: 6‐year‐old dormant roots) are depicted. Similarly, expression of downstream genes is shown between adventitious roots of P. ginseng cultivars, CS, SH and SU.
Figure 5Classification and expression of FAD genes. (a) Phylogenetic analysis of FAD genes. (b) Mapping coverage for coding (CDS) genes using 10× coverage WGS reads from tetraploid P. ginseng cv. YuP (white bars) and diploid P. vietnamensis (red bars). Orthologous FADs in diploid P. notoginseng (PN) denoted as * under the bar graphs. (c) TMM normalized expression of each gene under control (CTR), drought, salt and cold stress conditions. The FAD subclass is represented at the top to show expansion of specific subclasses and its role in abiotic stress responses.