| Literature DB >> 30109190 |
Yang Chu1, Shuiming Xiao1, He Su1,2, Baosheng Liao1, Jingjing Zhang1,3, Jiang Xu1, Shilin Chen1.
Abstract
Ginseng (Panax ginseng C.A. Meyer) is one of the best-selling herbal medicines, with ginsenosides as its main pharmacologically active constituents. Although extensive chemical and pharmaceutical studies of these compounds have been performed, genome-wide studies of the basic helix-loop-helix (bHLH) transcription factors of ginseng are still limited. The bHLH transcription factor family is one of the largest transcription factor families found in eukaryotic organisms, and these proteins are involved in a myriad of regulatory processes. In our study, 169 bHLH transcription factor genes were identified in the genome of P. ginseng, and phylogenetic analysis indicated that these PGbHLHs could be classified into 24 subfamilies. A total of 21 RNA-seq data sets, including two sequencing libraries for jasmonate (JA)-responsive and 19 reported libraries for organ-specific expression analyses were constructed. Through a combination of gene-specific expression patterns and chemical contents, 6 PGbHLH genes from 4 subfamilies were revealed to be potentially involved in the regulation of ginsenoside biosynthesis. These 6 PGbHLHs, which had distinct target genes, were further divided into two groups depending on the absence of MYC-N structure. Our results would provide a foundation for understanding the molecular basis and regulatory mechanisms of bHLH transcription factor action in P. ginseng.Entities:
Keywords: Genome-wide characterization; Ginsenosides; Panax ginseng; Phylogenetic analysis; bHLH transcription factors
Year: 2018 PMID: 30109190 PMCID: PMC6089850 DOI: 10.1016/j.apsb.2018.04.004
Source DB: PubMed Journal: Acta Pharm Sin B ISSN: 2211-3835 Impact factor: 11.413
Number of bHLH proteins and genome feature analysis.
| Species | Number of bHLHs | Genome size |
|---|---|---|
| 153 | 125 Mb | |
| 108 | 245 Mb | |
| 150 | 473 Mb | |
| 128 | 641 Mb | |
| 124 | 1.35 Gb | |
| 169 | 3.43 Gb | |
| 109 | 3.48 Gb |
Figure 1bHLH protein length distribution in P. ginseng and the other six sequenced species.
Figure 2Unrooted phylogenetic tree of the bHLH protein family in P. ginseng. Only the bHLH domains were used for phylogenetic tree construction. The JTT model was selected as the best-fitting amino acid substitution model with an estimated proportion of invariable sites (I) and an estimated g-distribution parameter (G). The phylogenetic tree was constructed using the neighbour-joining method with 1000 bootstrap replicates and Poisson correction distance. Roman numerals correspond to the bHLH subfamily. The different colours on the outer two rounds represent organ-specific expression patterns. The blue bars (up-regulated) and red bars (down-regulated) on the outermost round show the changes after MeJA treatment in callus of P. ginseng.
Exon distribution analysis of 169 PGbHLHs in each subfamily.
| Subfamily | Number of PgbHLHs | Number of AtbHLHs | Relative frequency of bHLHs | bHLHs with specific exon number | AVERAGE | Intron phase | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | 5 | 6 | >7 | 0 | 1 | 2 | |||||
| Ia | 15 | 10 | 1.50 | 4 | 8 | 3 | 2.93 | 29 | 0 | 0 | ||||
| Ib1 | 2 | 2 | 1.00 | 1 | 1 | 3.50 | 5 | 0 | 0 | |||||
| Ib1 | 3 | 12 | 0.25 | 1 | 1 | 1 | 3.00 | 6 | 0 | 0 | ||||
| II | 3 | 4 | 0.75 | 1 | 1 | 1 | 4.00 | 8 | 0 | 1 | ||||
| IIIa | 2 | 1 | 2.00 | 1 | 1 | 3.50 | 5 | 0 | 0 | |||||
| IIIb | 14 | 5 | 2.80 | 1 | 1 | 7 | 2 | 1 | 2 | 4.43 | 42 | 4 | 1 | |
| IIIc | 2 | 3 | 0.67 | 1 | 1 | 3.00 | 3 | 1 | 0 | |||||
| III(d+e) | 26 | 9 | 2.89 | 20 | 3 | 2 | 1 | 1.46 | 9 | 0 | 3 | |||
| IIIf | 2 | 2 | 1.00 | 2 | 8.00 | 8 | 5 | 0 | ||||||
| IVa | 9 | 4 | 2.25 | 4 | 1 | 2 | 1 | 1 | 4.89 | 31 | 0 | 2 | ||
| IVb | 3 | 2 | 1.50 | 3 | 6.00 | 6 | 1 | 5 | ||||||
| IVc | 2 | 4 | 0.50 | 2 | 6.00 | 4 | 6 | 0 | ||||||
| IVd | 2 | 1 | 2.00 | 1 | 1 | 5.50 | 8 | 1 | 0 | |||||
| Va | 4 | 6 | 0.67 | 1 | 3 | 7.25 | 14 | 6 | 3 | |||||
| Vb | 9 | 3 | 3.00 | 6 | 1 | 1 | 1 | 3.78 | 21 | 1 | 3 | |||
| VIIa | 6 | 6 | 1.00 | 1 | 5 | 9.17 | 35 | 4 | 0 | |||||
| VIIb | 6 | 8 | 0.75 | 2 | 2 | 1 | 1 | 5.67 | 25 | 2 | 1 | |||
| VIIa | 2 | 7 | 0.29 | 1 | 1 | 3.50 | 4 | 1 | 0 | |||||
| VIIIb | 9 | 3 | 3.00 | 6 | 2 | 1 | 1.44 | 3 | 1 | 0 | ||||
| VIIIc | 1 | 5 | 0.20 | 1 | 5.00 | 3 | 0 | 1 | ||||||
| IX | 8 | 7 | 1.14 | 1 | 4 | 3 | 6.63 | 34 | 2 | 8 | ||||
| XI | 7 | 5 | 1.40 | 1 | 1 | 5 | 7.86 | 49 | 1 | 1 | ||||
| XII | 25 | 17 | 1.47 | 1 | 1 | 9 | 14 | 7.96 | 157 | 7 | 5 | |||
| XV | 7 | 6 | 1.17 | 7 | 2.00 | 7 | 0 | 0 | ||||||
| Total | 169 | 132 | 1.28 | 27 | 25 | 22 | 20 | 13 | 24 | 38 | 4.64 | 516 | 43 | 34 |
Figure 3The structural features of the III (d+e) and VII a bHLH gene subfamilies. The exons and UTRs are represented by green and pink round-cornered rectangles, respectively, with black connecting lines as the introns. The bHLH domains are emphasized by yellow colour. The numbers above the rectangles correspond to the intron phase.
Figure 4Alignment of the bHLH domains of P. ginseng proteins. (A) Alignment of the bHLH domain of PGbHLHs in the III(d+e) subfamily. The shaded boxes above indicate the positions of the DNA-binding basic region, the two α-helixes, and the variable loop region. (B) Highly conserved amino acid residues in the bHLH domain across all PGbHLHs. The overall height of each stack represents the conservation of the sequence at that position. The numbering of the amino acid follows.
Figure 5Conserved motifs in each P. ginseng bHLH protein subfamily. A representation of a typical member of each bHLH subfamily is shown, with the bHLH domain and other conserved motifs drawn as coloured boxes. The sequences of each motif in the individual proteins are included in Supplementary Information Table S4.
Figure 6Heatmaps representing partial expression profiles of P. ginseng bHLH genes. (A) Clustering and differential expression analysis of 169 PGbHLH genes in various organs. (B) Clustering and differential expression analysis of 44 PGbHLHs genes with root-specific expression in the cortex, periderm and stele. The FPKM values were transformed to log2(value +1). The colour scale is shown at the right, and higher expression levels are shown in red. Genes with FPKM values of 0 in all samples are not shown.
Pearson's correlation coefficients of selected bHLH transcription factor expression levels and major ginsenoside contents.
| ID | Rg1 | Re | Rf | Rb1 | Rb2 | Rc | Rd | Total |
|---|---|---|---|---|---|---|---|---|
| PG40693 | 0.8789 | 0.9731 | 0.9209 | 0.9465 | 0.9749 | 0.9825 | 0.9678 | 0.9669 |
| PG26994 | 0.8646 | 0.9112 | 0.9787 | 0.9232 | 0.9280 | 0.8997 | 0.9232 | 0.9249 |
| PG19462 | 0.8203 | 0.8947 | 0.8492 | 0.8797 | 0.8901 | 0.8850 | 0.8501 | 0.8852 |
| PG29620 | 0.8629 | 0.8789 | 0.7424 | 0.8562 | 0.8477 | 0.8620 | 0.7749 | 0.8580 |
| PG06701 | 0.7917 | 0.8762 | 0.8961 | 0.8556 | 0.8773 | 0.8494 | 0.8403 | 0.8708 |
| PG07173 | 0.6911 | 0.8371 | 0.8227 | 0.8061 | 0.8650 | 0.8868 | 0.9309 | 0.8426 |
Figure 7Biosynthesis pathway for ginsenosides with the Pearson's correlation coefficients of six candidate bHLH transcription factors. β-AS, β-amyrin synthase; AACT, acetyl-CoA C-acetyltransferase; CAS, cycloartenol synthase; CDP-ME, methylerythritol cytidyl diphosphate; CDP-MEP, 4-diphosphocytidyl-2-C-methyl-d-erythritol-2-phosphate; DDS, dammarenediol synthase; DMAPP, dimethylallyl diphosphate; DXP, 1-deoxy-d-xylulose-5-phosphate; DXR, DXP reductoisomerase; DXS, 1-deoxy-d-xylulose-5-phosphate synthase; FPP, farnesyl diphosphate; FPS, farnesyl diphosphate synthase; HMBPP, 4-hydroxy-3-methyl-butenyl-1-diphosphate; HMG-CoA, 3-hydroxy-3-methylglutaryl-CoA; HMGR, 3-hydroxy-3-methylglutaryl-CoA reductase; HMGS, 3-hydroxy-3-methylglutaryl-CoA synthase; IDI, isopentenyl-diphosphate delta-isomerase; IPP, isopentenyl diphosphate; IspD, CDP-ME synthetase; IspE, 4-diphosphocytidyl-2-C-methyl-d-erythritol kinase; IspF, 2-C-methyl-d-erythritol 2,4-cyclodiphosphate synthase; IspG, (E)-4-hydroxy-3-methylbut-2-enyl-diphosphate synthase; IspH, 4-hydroxy-3-methylbut-2-en-1-yl diphosphate reductase; LAS, lanosterol synthase; MEcPP, 2-C-methyl-d-erythritol-2,4-cyclodiphosphate; MEP, 2-C-methyl-d-erythritol 4-phosphate; MVD, mevalonate diphosphate decarboxylase; MVK, mevalonate kinase; MVP, mevalonate phosphate; MVPP, diphosphomevalonate; OAS, oleanolic acid synthase; PMK, phosphomevalonate kinase; PPDS, protopanaxadiol synthase; PPTS, protopanaxatriol synthase; SE, squalene epoxidase; SS, squalene synthase.