| Literature DB >> 32714378 |
Ying Liang1, Shanshan Chen2, Kunhua Wei1, Zijiang Yang3, Shengchang Duan4, Yuan Du4, Peng Qu1, Jianhua Miao1, Wei Chen5,6,7, Yang Dong1,3,5,7.
Abstract
Andrographis paniculata (Chinese name: Chuanxinlian) is an annual dicotyledonous medicinal plant widely grown in China and Southeast Asia. The dried plant has a highly acclaimed usage in the traditional Chinese medicine for its antipyretic, anti-inflammatory, and analgesic effects. In order to help delineate the biosynthetic pathways of various secondary metabolites, we report in this study a high-quality reference genome for A. paniculata. With the help of both PacBio single molecule real time sequencing and Illumina sequencing reads for error correction, the A. paniculata genome was assembled into a total size of 284 Mb with a contig N50 size of 5.14 Mb. The contigs were further assembled into 24 pseudo-chromosomes by the Hi-C technique. We also analyzed the gene families (e.g., KSL, and CYP450) whose protein products are essential for synthesizing bioactive compounds in A. paniculata. In conclusion, the high-quality A. paniculata genome assembly builds the foundation for decoding the biosynthetic pathways of various medicinal compounds.Entities:
Keywords: Andrographis paniculata; Hi-C; PacBio sequencing; genome assembly; medicinal plant
Year: 2020 PMID: 32714378 PMCID: PMC7340177 DOI: 10.3389/fgene.2020.00701
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
FIGURE 1Evaluation of the genome size of A. paniculata. (A) Photo of a single A. paniculata plant with close-up image of the flower. (B) The 21-mer analysis of the A. paniculata genome using GenomeScope. (C) Flow cytometry analysis of A. paniculata genome size comparing with Zea mays.
The library information and data statistics for the A. paniculata assembly.
| Assembly size (Mb) Number of N50 contig | 284.3 22 |
| N50 contig length (bp) | 5,149,272 |
| Number of N90 contigs | 51 |
| N90 contig length (bp) | 2,610,185 |
| Transposable elements content | 57.35% |
| Total number of protein-coding genes | 24,015 |
| Total exon number | 136,156 |
| Average exon number per gene | 5.67 |
| Average exon size (bp) | 227.14 |
| Total intro length (bp) | 453,323 |
| Total number of non-protein-coding genes | 6,591 |
FIGURE 2Collinear analysis of the assemblies of A. paniculata genome.
FIGURE 3Hi-C clustering heatmap for pseudomolecule construction in A. paniculata. In all, 24 pseudomolecules are assembled and indicated by blue boxes.
FIGURE 4Repeat analysis (A) Neighbor-joining (NJ) trees were built from RT domain sequence similarities among different lineage-specific copies identified in the A. paniculata genome. Deep branching revealed ancient expansion while flat branching is consistent with a recent burst of insertion activity. Red branches correspond to outgroup sequences. (B) The average age of TEs was revealed for the different lineages by the branching distribution in the NJ trees built from RT (light blue). (C) The density of different TE lineages inferred from the detection of their protein-coding domains along pseudomolecules.
FIGURE 5Comparative analyses of the A. paniculata genome. (A) Major groups of orthologous genes in eight plant genomes. (B) Venn diagram of shared orthologous gene families among A. paniculata, S. indicum, S. miltiorrhiza, and O. sativa genomes. (C) Estimation of the time points of divergence (time range shown in parentheses) between A. paniculata and seven other plant species based on orthologous single-copy gene pairs. (D) Expansion and contraction of gene families in eight plant genomes. Pie diagram on each branch and node corresponds to combined change across lineages.
FIGURE 6Genomic analysis of putative TPS genes in the A. paniculata genome. (A) Schematic biosynthetic pathway of andrographolide and neoandrographolide. (B) Phylogenetic analysis of TPS genes in A. paniculata and other plants. Red branches indicate putative TPS genes in A. paniculata. (C) Expression levels of putative KS/KSL genes in the leaf and root tissue of A. paniculata. (D) Expression levels of putative CPS genes in the leaf and root tissue of A. paniculata.
FIGURE 7Genomic analysis of putative CYP450 genes in the A. paniculata genome. (A) Phylogenetic analysis of CYP450 genes in A. paniculata and other plants. Red branches indicate putative CYP450 genes in A. paniculata. (B) The CYP450 gene clusters on the contigs. (C) Differentially expressed putative CYP450 genes in A. paniculata.