| Literature DB >> 35432473 |
Quanfang Zhang1, Min Li2, Xueyan Chen1, Guoxia Liu1, Zhe Zhang2, Qingqing Tan1, Yue Hu1, Yangyang Fan1, Yanyan Liu1, Tongshan Zhu2, Xue Yang1, Mingming Yue1, Xun Bu1, Yongqing Zhang2.
Abstract
Bupleurum chinense DC is a plant widely used in Chinese traditional medicine. Saikosaponins are the major bioactive constituents of B. chinense DC. Saikosaponins biosynthesis in Bupleurum has been more intensively studied than any other metabolic processes or bioactive constituents. However, whole-genome sequencing and chromosome-level assembly for Bupleurum genus have not been reported yet. Here, we report a high-quality chromosome-level genome of B. chinense DC. through the integration of PacBio long-read sequencing, Illumina short-read sequencing, and Hi-C sequencing. The genome was phased into haplotype 0 (621.27 Mb with a contig N50 of 16.86 Mb and a scaffold N50 of 92.25 Mb) and haplotype 1 (600.48 Mb with a contig N50 of 23.90 Mb and a scaffold N50 of 102.68 Mb). A total of 45,909 and 35,805 protein-coding genes were predicted in haplotypes 0 and 1, respectively. The enrichment analyses suggested that the gene families that expanded during the evolution of B. chinense DC are involved in the biosynthesis of isoquinoline alkaloid, tyrosine, and anthocyanin. Furthermore, we analyzed the genes involved in saikosaponin biosynthesis and determined the candidate P450 and UGT genes in the third stage of saikosaponins biosynthetic, which provided new insight into the saikosaponins biosynthetic. The genomic data provide a valuable resource for future investigations of the molecular mechanisms, biological functions, and evolutionary adaptations of B. chinense DC.Entities:
Keywords: Bupleurum chinense; chromosome-level genome; evolution; gene families; genome assembly; haplotype; saikosaponin biosynthesis
Year: 2022 PMID: 35432473 PMCID: PMC9008701 DOI: 10.3389/fgene.2022.878431
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
FIGURE 1Morphological characteristics of the B. chinense. Mature plants in (A) and (B), The above-ground plant (C) and leaves in the stem (D), root system(E), and flowers (F) are shown.
FIGURE 2Karyotype pictures of B. chinense DC using (A) DAPI staining, (B) fluorescence in situ hybridization (FISH) with a telomere specific repeat probe, and (C) an 18 s rDNA repeat sequence probe. Bar = 5 μm.
Genome assembly and annotation statistics for B. chinense DC.
| Haplotypes | Hap0 | Hap1 |
|---|---|---|
| Length of the scaffolds (bp) | 621,267,366 | 600,371,836 |
| Length of the contigs (bp) | 621,420,202 | 600,482,800 |
| Number of scaffolds | 2,912 | 1,775 |
| Number of contigs | 1,382 | 664 |
| Scaffolds N50 (bp) | 92,254,985 | 102,676,127 |
| Contigs N50 (bp) | 16,864,651 | 23,895,762 |
| Scaffolds N90 (bp) | 73,672,175 | 74,133,959 |
| Contigs N90 (bp) | 39,561 | 85,203 |
| GC Content (%) | 33.5 | 33.52 |
| N Content (%) | 0.024 | 0.018 |
| Gene number | 45,909 | 35,805 |
| Gene density (gene_number/100 kb) | 7.39 | 5.96 |
| Gene average length (bp) | 1,145.8 | 1,211.9 |
| Exon number per gene | 4.78 | 5.30 |
| Intron number per gene | 3.78 | 4.30 |
| Exon average length (bp) | 239.6 | 228.5 |
| Intron average length (bp) | 535.6 | 583.2 |
| Genome GC (%) | 33.51 | 33.53 |
| Exon GC (%) | 42.10 | 42.03 |
| Intergenic region average length (bp) | 7,288.0 | 9,055.0 |
FIGURE 3Schematic representation of the genomic characteristics and Hi-C contact map of haplotypes 0 and 1. (A) Schematic representation of the genomic characteristics of haplotypes 0 and 1. From the outer-ring: Track 1: the 6 pseudo chromosomes comprising haplotypes 0 and 1. genome; Track 2: Protein-coding genes present in each of the chromosomes. Long and short bars represent genes on the forward and reverse strands, respectively;Track 3: The distribution of the gene density using 1-Mb sliding windows. Higher densities are shown in darker red. Track 4: The distribution of the GC content. Track 5: The distribution of the repeat density using 1-Mb sliding windows. Track 6: Gene synteny between haplotype 0 and 1 assemblies was analyzed using MCscanX. The Hi-C contact map of the haplotype 0 (B) and haplotype 1 (C) genome. The color bar shows the contact density from white (low) to red (high).
FIGURE 4Genome evolution analysis of B. chinense DC. (A) Venn diagram of the orthologous genes shared between Lactuca sativa, Cynara cardunculus, Mikania micrantha, Daucus carota, and Bupleurum chinense DC. (B) The phylogenetic relationship between B. chinense DC and 16 other species. The number of gene gains (+) and losses (-) is shown on each branch, and are also displayed as pie charts:green, red, and blue corresponding to gene gain, loss, and neither gain nor loss, respectively. The divergence times between the different species are displayed below the tree.
FIGURE 5Identification and analysis of the genes involved in the biosynthesis of saikosaponins. (A) The SS biosynthetic pathway. (B) A neighbor-joining phylogenetic tree including the P450 protein sequences. Each color represents a P450 family or subfamily. (C) Heat map showing the expression levels of the P450s, UGTs, and other genes involved in the biosynthesis of saikosaponins based on the RNA-Seq data from the root, stem, leaf, flowers, and seeds.
Genes involved in the biosynthesis of saikosaponins and their respective copy numbers in the B. chinense DC genome and D. carota genome.
| Stage | Gene |
|
|
|---|---|---|---|
| Stage 1 | ACAT | 2 | 3 |
| HMGC | 2 | 1 | |
| HMGR | 5 | 5 | |
| MK | 3 | 1 | |
| PMK | 2 | 3 | |
| MDD | 4 | 1 | |
| IPPI | 1 | 1 | |
| Stage 2 | FPS | 9 | 9 |
| SS | 3 | 3 | |
| SE | 9 | 7 | |
|
| 5 | 4 | |
| Stage 3 | P450s | 266 | 207 |
| UGTs | 80 | 190 |