| Literature DB >> 33800612 |
Kunyuan Guo1, Jie Chen2, Yan Niu2, Xianming Lin1.
Abstract
One of the most commonly utilized medicinal plants in China is Fritillaria hupehensis (Hsiao et K.C. Hsia). However, due to a lack of genomic resources, little is known about the biosynthesis of relevant compounds, particularly the flavonoid biosynthesis pathway. A PacBio RS II sequencing generated a total of 342,044 reads from the bulb, leaf, root, and stem, of which 316,438 were full-length (FL) non-redundant reads with an average length of 1365 bp and a N50 of 1888 bp. There were also 38,607 long non-coding RNAs and 7914 simple sequence repeats detected. To improve our understanding of processes implicated in regulating secondary metabolite biosynthesis in F. hupehensis tissues, we evaluated potential metabolic pathways. Overall, this study provides a repertoire of FL transcripts in F. hupehensis for the first time, and it will be a valuable resource for marker-assisted breeding and research into bioactive compounds for medicinal and pharmacological applications.Entities:
Keywords: genomic analysis; herbal medicine; medicinal plant; third generation sequencing
Year: 2021 PMID: 33800612 PMCID: PMC8066755 DOI: 10.3390/life11040287
Source DB: PubMed Journal: Life (Basel) ISSN: 2075-1729
Figure 1Fritillaria hupehensis plant in the field. (A) Maturing F. hupehensis bulbs and leaves. (B) Matured F. hupehensis bulbs and leaves.
Figure 2F. hupehensis unigenes homology search and isoform detection. (A) Venn diagram of number of unigenes with an E-value threshold of 10−5 against the protein databases. The numbers in circles indicate the number of individual unigenes annotated by single or multiple databases. (B) Percentage of annotated unigenes in Nr database that match the top 11 species using BLASTx.
PacBio transcriptome sequencing summary of F. Hupehensis tissues.
| Library | Number of Reads | Number of Subreads | Number of FL Transcripts | Number of FLNC | Assembly Length (Mb) | Average Transcript Length (bp) | N50 (bp) |
|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
FL and FLNC refer to full-length and full-length non-chimeric transcripts, respectively.
Profiles of simple sequence repeats (SSRs) detected in F. hupehensis transcriptome.
| SSR | Number of SSR |
|---|---|
| Total SSRs | 7914 |
| Total SSR length | 13387 |
| Relative abundance (SSR/Mb) | 143 |
| Relative density (bp/Mb) | 243 |
| SSR containing sequences | 5973 |
| Sequences containing more than 1 SSR | 1311 |
Profiles of long non-coding RNA (LncRNAs) identified in F. hupehensis transcriptome
| LncRNA Length | Number |
|---|---|
| 200–400 | 1899 |
| 400–600 | 5279 |
| 600–800 | 7986 |
| 800–1000 | 7180 |
| 1000–1200 | 5058 |
| 1200–1400 | 3382 |
| 1400–1600 | 2113 |
| 1600–1800 | 1438 |
| 1800–2000 | 909 |
| 2000–2200 | 723 |
| 2200–2400 | 529 |
| 2400–2600 | 391 |
| 2600–2800 | 314 |
| 2800–3000 | 270 |
| 3000–3200 | 221 |
| 3200–3400 | 185 |
| 3400–3600 | 140 |
| 3600–3800 | 118 |
| 3800–4000 | 96 |
| 4000–4200 | 70 |
| 4200–4400 | 78 |
| 4400–4600 | 49 |
| 4600–4800 | 39 |
| 4800–5000 | 27 |
| 5000–5200 | 26 |
| 5200–5400 | 13 |
| 5400–5600 | 20 |
| 5600–5800 | 15 |
| 5800–6000 | 8 |
| >6000 | 31 |
Figure 3Phylogenetic associations among flavonoid biosynthesis genes in F. hupehensis (Hsiao et K.C. Hsia), Solanum lycopersicum (Linn), and Arabidopsis thaliana (Linn). Purple dot stands for S. lycopersicum, green diamond dot is A. thaliana, and red dot is F. hupehensis. MEGA 10.0 was used to create the phylogenetic tree based on the neighbor-joining method. Clade 1, coumaroylquinate (coumaroyl shikimate) 3′, 4′-monooxygenase genes (C3′H); clade 2 (C4′H); clade 3, O-methyltransferases (OMT); clade 4, ladanein (LAD); and clade 5, shikimate O-hydroxycinnamoyltransferase genes (HCT). On the basis of their clustering patterns, we named the flavonoid biosynthesis genes in F. hupehensis.