| Literature DB >> 35126662 |
Mingwei Sun1, Yilian Zhao2, Xiaobin Shao1, Jintao Ge1, Xueyan Tang1, Pengbo Zhu1, Jiangying Wang1, Tongli Zhao1.
Abstract
The fast advancement and deployment of sequencing technologies after the Human Genome Project have greatly increased our knowledge of the eukaryotic genome sequences. However, due to technological concerns, high-quality genomic data has been confined to a few key organisms. Moreover, our understanding of which portions of genomes make up genes and which transcript isoforms synthesize these genes is scarce. Therefore, the current study has been designed to explore the reliability of the tiger lily (Lilium lancifolium Thunb) transcriptome. The PacBio-SMRT was used for attaining the complete transcriptomic profile. We obtained a total of 815,624 CCS (Circular Consensus Sequence) reads with an average length of 1295 bp. The tiger lily transcriptome has been sequenced for the first time using third-generation long-read technology. Furthermore, unigenes (38,707), lncRNAs (6852), and TF members (768) were determined based on the transcriptome data, followed by evaluating SSRs (3319). It has also been revealed that 105 out of 128 primer pairs effectively amplified PCR products. Around 15,608 transcripts were allocated to 25 distinct KOG Clusters, and 10,706 unigenes were grouped into 52 functional categories in the annotated transcripts. Until now, no tiger lily lncRNAs have been discovered. Results of this study may serve as an extensive set of reference transcripts and help us learn more about the transcriptomes of tiger lilies and pave the path for further research.Entities:
Year: 2022 PMID: 35126662 PMCID: PMC8816598 DOI: 10.1155/2022/7641048
Source DB: PubMed Journal: Appl Bionics Biomech ISSN: 1176-2322 Impact factor: 1.781
Representation of Iso-Seq from the sepal of Lilium davidii var. unicolor Salisb by PacBio Sequel system.
| SMRT cells | Subread base (G) | Reads of CCS | Non-full-length reads | Full-length reads | FLNC reads | Consensus transcripts | Mean length (bp) | Unigenes |
|---|---|---|---|---|---|---|---|---|
| 54.86 | 815,624 | 353,880 | 461,744 | 459,322 | 38,707 | 1574 | 38,707 |
Figure 1The transcript (38,707) and unigene (15,608) length distribution in tiger lily sepals.
Summary of the unigenes mapped to KEGG functional pathways.
| Number | Pathway ID | Pathway name | Unigene number |
|---|---|---|---|
| 1 | ko03010 | Genetic information processing, translation, ribosome | 434 |
| 2 | ko01200 | Metabolism, global and overview maps, carbon metabolism | 345 |
| 3 | ko01230 | Metabolism, global and overview maps, biosynthesis of amino acids | 307 |
| 4 | ko04141 | Genetic information processing; folding, sorting, and degradation; protein processing in the endoplasmic reticulum | 253 |
| 5 | ko03040 | Genetic information processing, transcription, spliceosome | 236 |
| 6 | ko00190 | Metabolism, energy metabolism, oxidative phosphorylation | 207 |
| 7 | ko05169 | Human diseases; infectious diseases: viral; Epstein-Barr virus infection | 180 |
| 8 | ko04144 | Cellular processes, transport and catabolism, endocytosis | 179 |
| 9 | ko00010 | Metabolism, carbohydrate metabolism, glycolysis/gluconeogenesis | 175 |
| 10 | ko03013 | Genetic information processing, translation, RNA transport | 166 |
| 11 | ko05016 | Human diseases, neurodegenerative diseases, Huntington's disease | 161 |
| 12 | ko00230 | Metabolism, nucleotide metabolism, purine metabolism | 156 |
| 13 | ko04075 | Environmental information processing, signal transduction, plant hormone signal transduction | 152 |
| 14 | ko00620 | Metabolism, carbohydrate metabolism, pyruvate metabolism | 146 |
| 15 | ko05418 | Human diseases, cardiovascular diseases, fluid shear stress and atherosclerosis | 135 |
Figure 2Transcript families and their corresponding numbers of TFs in the tiger lily.
Figure 3Prediction of lncRNAs using CNCI, CPC, CPAT, and Pfam analysis tools, as indicated in the Venn diagram.
Figure 4Frequency and distribution of SSRs on the basis of the motif.
Figure 5Venn maps of the unigenes among NR, GO, KEGG, Pfam, KOG, NT, and Swiss-Prot databases.
Figure 6Homologous species distribution of tiger lily transcripts annotated in the NR database.
Figure 7GO classification of tiger lily transcripts. BP, CC, and MFs are indicated by red, green, and blue colors, accordingly.
Figure 8The KOG classification of tiger lily transcripts. The x-axis and y-axis represent the subcategories and the number of transcripts, accordingly.