| Literature DB >> 33805478 |
Qunlu Liu1, Fiza Liaquat2, Yefeng He1, Muhammad Farooq Hussain Munis3, Chunying Zhang4.
Abstract
Rhododendronsimsii is one of the top ten famous flowers in China. Due to its historical value and high aesthetic, it is widely popular among Chinese people. Various colors are important breeding objectives in Rhododendron L. The understanding of the molecular mechanism of flower color formation can provide a theoretical basis for the improvement of flower color in Rhododendron L. To generate the R.simsii transcriptome, PacBio sequencing technology has been used. A total of 833,137 full-length non-chimeric reads were obtained and 726,846 high-quality full-length transcripts were found. Moreover, 40,556 total open reading frames were obtained; of which 36,018 were complete. In gene annotation analyses, 39,411, 18,565, 16,102 and 17,450 transcriptions were allocated to GO, Nr, KEGG and COG databases, correspondingly. To identify long non-coding RNAs (lncRNAs), we utilized four computational methods associated with Protein families (Pfam), Cooperative Data Classification (CPC), Coding Assessing Potential Tool (CPAT) and Coding Non Coding Index (CNCI) databases and observed 6170, 2265, 4084 and 1240 lncRNAs, respectively. Based on the results, most genes were enriched in the flavonoid biosynthetic pathway. The eight key genes on the anthocyanin biosynthetic pathway were further selected and analyzed by qRT-PCR. The F3'H and ANS showed an upward trend in the developmental stages of R. simsii. The highest expression of F3'5'H and FLS in the petal color formation of R. simsii was observed. This research provided a huge number of full-length transcripts, which will help to proceed genetic analyses of R.simsii. native, which is a semi-deciduous shrub.Entities:
Keywords: Rhododendron simsii; alternative splicing; full length transcript; transcriptome
Year: 2021 PMID: 33805478 PMCID: PMC8065783 DOI: 10.3390/plants10040649
Source DB: PubMed Journal: Plants (Basel) ISSN: 2223-7747
Figure 1Three developmental stages of R. simsii flower.
The primers used for qRT-PCR analysis.
| Primer ID | Gene ID | Forward Primer | Reverse Primer |
|---|---|---|---|
|
| F01_transcript/61502 | GCTTACCGTCGAGACCGTGG | AACAATGGGCCTCACCAGCC |
|
| F01_transcript/68926 | GCCGCGTTGGAGCAAATTGT | CCCCCTGGTTCCACCCAAAT |
|
| F01_transcript/64242 | CTCACTCTCGGCCTCAAGCG | TGCCTCAGGGGCTGGATTCT |
|
| F01_transcript/64345 | GGCTTCATCGGCTCATGGCT | TCCGCTTTGGGCAACTCCAG |
|
| F01_transcript/40426 | CTCCTTCTCCTCCACCGGCT | GGGGGTTGTTCCAGGTGGTG |
|
| F01_transcript/54870 | CGATCCCCACCACTCCATC | AAGAACTGTGCGGCAACCGA |
|
| F01_transcript/50526 | TGCTGACTACGGCCCGAGAT | GGTATCACCACTGCCTCGCC |
|
| F01_transcript/63169 | ACTCGAGCTGTAGCCCACCA | CGTTCGGAAGACGAGCCTCC |
|
| F01_transcript/66405 | GGCAACTGGAGGTCTCTGCC | CCGGCAGCCTTCCTGCTATG |
|
| F01_transcript/65059 | CCCTTTCGGCTATGCTGCGA | CCCTTTCGGCTATGCTGCGA |
|
| F01_transcript/50357 | CTTGGTTGCTTCGGCCTCCA | ACCTCACTCCCTCTCGCCTC |
Summary of reads of inserts from single-molecule long-read sequencing.
| Samples | cDNA Size | CCS Number | Read Bases of CCS | Mean Read Length of CCS | Mean Number of Passes |
|---|---|---|---|---|---|
| F01 | 1–6 k | 833,137 | 2,298,485,998 | 2758 | 19 |
Figure 2Distribution of predicted CDS encoded protein length.
Figure 3Venn diagram of long non-coding RNAs (lncRNAs) CPC: coding potential calculator; CNCI: coding-noncoding index; CPAT: coding potential assessment tool; Pfam: s protein families.
Figure 4Distribution of transcription factor types.
Summary of functional annotation of S. superba transcriptome.
| Annotated Databases | Isoform Number |
|---|---|
| COG | 16,102 |
| GO | 18,565 |
| KEGG | 17,450 |
| KOG | 25,787 |
| Pfam | 34,156 |
| Eggnog | 29,498 |
| Swiss-Prot | 38,933 |
| Nr | 39,411 |
| All | 39,521 |
Figure 5The classification statistics of Nr-annotated species in transcripts.
Summary of functional annotation for the non-redundant unigenes of R. simsii.
| Databases | Unigene Number | 300~1000 bp | ≥1000 bp | ||
|---|---|---|---|---|---|
| Number | Percentage | Number | Percentage | ||
| NR | 39,411 | 2171 | 5.51% | 37,230 | 94.47% |
| GO | 18,565 | 1521 | 8.19% | 17,040 | 91.79% |
| COG | 16,102 | 800 | 4.97% | 15,300 | 95.02% |
| eggNOG | 38,933 | 2115 | 5.43% | 36,811 | 94.55% |
| KEGG | 17,450 | 1109 | 6.36% | 16,336 | 93.62% |
| KOG | 25,787 | 1249 | 4.84% | 24,534 | 95.14% |
| Pfam | 34,156 | 1699 | 4.97% | 32,454 | 95.02% |
| Swiss-Prot | 29,498 | 1521 | 5.16% | 27,971 | 94.82% |
| All | 39,521 | 2201 | 5.57% | 37,310 | 94.41% |
Figure 6Transcript GO annotation classification statistics graph.
Figure 7COG annotation classification statistics of transcripts.
The best 15 pathways annotated by the KEGG database.
| No. | Name of Pathway | Pathway ID | No of Transcripts (%) | |
|---|---|---|---|---|
| 1. | Carbon metabolism | ko01200 | 605 | (3.47%) |
| 2. | Protein processing in endoplasmic reticulum | ko04141 | 608 | (3.48%) |
| 3. | Biosynthesis of amino acid | ko01230 | 525 | (3.01%) |
| 4. | Spliceosome | ko03040 | 700 | (4.01%) |
| 5. | Ribosome | ko03010 | 491 | (2.81%) |
| 6. | RNA transport | ko03013 | 558 | (3.20%) |
| 7. | Starch and sucrose metabolism | ko00500 | 377 | (2.16%) |
| 8. | Plant hormone signal transduction | ko04075 | 470 | (2.69%) |
| 9. | Oxidative phosphorylation | ko00190 | 280 | (1.60%) |
| 10. | Glycolysis/gluconeogenesis | ko00010 | 322 | (1.85%) |
| 11. | Plant pathogen interaction | ko04626 | 324 | (1.86%) |
| 12. | mRNA surveillance pathway | ko03015 | 469 | (2.69%) |
| 13. | Ubiquitin mediated proteolysis | ko04120 | 321 | (1.84%) |
| 14. | Amino sugar and nucleotide sugar metabolism | ko00520 | 271 | (1.55%) |
| 15. | Endocytosis | ko04144 | 251 | (1.44%) |
Figure 8(a,b) Comparison of expression profiles of eight representative genes measured by qRT-PCR.