| Literature DB >> 35043180 |
Xiaohong Shang1, Xinxin Yi2, Liang Xiao1, Yansheng Zhang3, Ding Huang4, Zhengbao Xia2, Kunpeng Ou1, Ruhong Ming4, Wendan Zeng1, Dongqing Wu2, Sheng Cao1, Liuyin Lu1, Huabing Yan1.
Abstract
Pueraria lobata var. thomsonii (hereinafter abbreviated as Podalirius thomsonii), a member of legumes, is one of the important traditional Chinese herbal medicines, and its puerarin extraction is widely used in health and pharmaceutical industry. Here, we assembled a high-quality genome of P. thomsonii using long-read single-molecule sequencing and Hi-C technologies. The genome assembly is approximately 1.37 Gb in size and consists of 5145 contigs with a contig N50 of 593.70 Kb, further clustered into 11 pseudochromosomes. The genome structural annotation resulted in about 869.33 Mb (about 62.70% of the genome) repeat regions and 45 270 protein-coding genes. Genome evolution analysis revealed that P. thomsonii is most closely related to soybean and underwent two ancient whole-genome duplication events, one was in the common ancestor shared by legume species, the other occurred independently at around 7.2 million years ago after its specification. A total of 2373 gene families were found unique in P. thomsonii comparing to five other legume species. Genes and metabolites related to puerarin content in tuberous tissues were characterized. A total of 572 genes upregulated in the puerarin biosynthesis pathway were identified, and 235 candidate genes were further enriched by omics data. Furthermore, we identified 6 8-C-glucosyltransferase (8-C-GT) candidate genes significantly involved in puerarin metabolism. Our study filled in a key genomic gap in legume family, and provided valuable multi-omic resources for the genetic improvement of P. thomsonii.Entities:
Keywords: P. lobata var. thomsonii; chromosomal-level; genome; metabolome; puerarin; transcriptome
Year: 2022 PMID: 35043180 PMCID: PMC8881381 DOI: 10.1093/hr/uhab035
Source DB: PubMed Journal: Hortic Res ISSN: 2052-7276 Impact factor: 6.793
Figure 1Genome landscape of assembly, annotation, and gene family clusters of P. thomsonii. a Overview of the P. thomsonii genomic features. Track I is the circular representation of the pseudomolecule. Tracks II–V are repetitive sequence density (bin = 1 Mb), gene density (bin = 1 Mb), GC content (bin = 1 Mb), and genome collinearity, respectively. b Distribution of single-copy, multiple-copy, unique, other orthologous and unclustered genes in the 17 plant species used in this study: Oryza sativa, Vitis vinifera, Pisum sativum, Phaseolus vulgaris, Vigna unguiculata, Vigna angularis, Glycine max, Pueraria lobata, Malus domestica, Cucumis melo, Momordica charantia, Populus alba, Salix brachista, Arabidopsis thaliana, Corchorus capsularis, Theobroma cacao, and Solanum lycopersicum. c Number of overlapped gene families among the six legume species visualized in the UpSet plot. PSAT, P. sativum; VANG, V. angularis, PVUL, P. vulgaris, GMAX, G. max; PTHO, P. thomsonii; VUNG, V. unguiculata.
Figure 2Evolution and WGD in P. thomsonii and 16 other species. a Phylogenetic trees and divergence times of the 17 species. The red stars on the branches represent occurrences of WGD events. b Distribution of Ks values of A. thaliana, P. thomsonii, G. max and O. sativa.
Figure 3General overview of metabolome and transcriptome. a PCA of metabolites. b Volcano plot of differential metabolites between the roots of ZG-19 and ZG-39. c KEGG enrichment analysis of the DMs between the roots of ZG-19 and ZG-39. d Volcano plot of DEGs between the roots of ZG-19 and ZG-39. e KEGG enrichment pathway with both DMs and DEGs. The red bars are the negative log10 and the green color represents the enrichment P-value of DMs, which is expressed as −log(P-value), and higher vertical coordinates represent stronger enrichment. f Canonical correlation analysis between the DMs and DEGs in the isoflavonoid biosynthesis pathway. Gene IDs are in green and metabolite names are in purple; within the same region, the further from the origin and the closer to each other the higher the correlation.
Figure 4Expression profile of genes encoding enzymes of the puerarin biosynthesis pathway in the roots of ZG-19 and ZG-39 genotypes. Gene expression values (FPKM) were scaled by log2(FPKM+1). Red and blue asterisks represent significant up- and downregulation of gene expression in root tissue of ZG-19 relative to ZG-39, respectively. Expression profile of all members of the glucosyltransferase (GT) gene family in P. lobata var. thomsonii are presented; genes in red and orange are homologous to 8-C-GT genes. Moreover, genes in red are the homologous genes of PIUGT43, which was previously found to catalyze the C-glucosylation of daidzein to puerarin [16]. PAL, phenylalanine ammonialyase; C4H, cinnamate-4-hydroxylase; 4CL, 4-coumarate-CoA ligase; CHS, chalcone synthase; CHR, chalcone reductase; CHI, chalcone isomerase; IFS, 2-hydroxyisoflavanonesynthase; 8-C-GT, 8-C-glucosyltransferase; HID, 2-hydroxyisoflavanone dehydratase.