| Literature DB >> 32854620 |
Danfeng Liu1,2, Li Chen1,2, Chao Chen1,2, Xingkui An3, Yongjun Zhang3, Yi Wang4,5, Qingjun Li1,2.
Abstract
BACKGROUND: Phytolaccaceae species in China are not only ornamental plants but also perennial herbs that are closely related to human health. However, both large-scale full-length cDNA sequencing and reference gene validation of Phytolaccaceae members are still lacking. Therefore, single-molecule real-time sequencing technology was employed to generate full-length transcriptome in invasive Phytolacca americana and non-invasive exotic P. icosandra. Based on the transcriptome data, RT-qPCR was employed to evaluate the gene expression stability in the two plant species and another indigenous congener P. acinosa.Entities:
Keywords: Full-length transcriptome analysis; Phytolaccaceae; RT-qPCR; Reference gene evaluation; SMRT sequencing
Mesh:
Year: 2020 PMID: 32854620 PMCID: PMC7450566 DOI: 10.1186/s12870-020-02608-9
Source DB: PubMed Journal: BMC Plant Biol ISSN: 1471-2229 Impact factor: 4.215
Summary of full-length transcriptome sequencing
| Clean reads (Gb) | 19.96 | 19.75 |
| CCS | 235,097 | 233,203 |
| FLNC | 200,857 | 208,865 |
| FLNC (%) (FLNC/CCS) | 85.44 | 89.56 |
| Consensus isoform | 89,082 | 98,448 |
| High quality consensus isoform | 86,989 | 96,764 |
| Transcripts | 46,369 | 50,220 |
| Alternative splicing | 1675 | 1908 |
| SSR | 25,641 | 31,800 |
| Complete coding sequences | 34,971 | 36,841 |
| LncRNA | 3574 | 3833 |
| Annotated transcripts | 41,676 | 45,050 |
SSRs obtained from transcripts with more than 500 bp
| Searching item | ||
|---|---|---|
| Total number of sequences examined | 43,942 | 48,070 |
| Total size of examined sequences (bp) | 100,469,039 | 117,904,499 |
| Total number of identified SSRs | 25,641 | 31,800 |
| Number of SSR containing sequences | 17,231 | 20,155 |
| Number of sequences containing more than 1 SSR | 5641 | 7229 |
| Number of SSRs present in compound formation | 2635 | 3571 |
| Number of mono nucleotide SSR | 12,699 | 16,366 |
| Number of di nucleotide SSR | 7182 | 8065 |
| Number of tri nucleotide SSR | 5266 | 6593 |
| Number of tetra nucleotide SSR | 192 | 283 |
| Number of penta nucleotide SSR | 82 | 121 |
| Number of hexa nucleotide SSR | 220 | 372 |
Number of proteins annotated via differential protein database
| Databases | ||||||
|---|---|---|---|---|---|---|
| Annotated Number | 300 ≤ length < 1000 | Length ≥ 1000 | Annotated Number | 300 ≤ length < 1000 | Length ≥ 1000 | |
| COG | 16,701 | 1494 | 15,197 | 17,249 | 1496 | 15,742 |
| GO | 31,055 | 4171 | 26,838 | 32,733 | 4157 | 28,535 |
| KEGG | 17,017 | 2191 | 14,798 | 18,130 | 2211 | 15,893 |
| KOG | 26,396 | 2853 | 23,508 | 28,948 | 2933 | 25,979 |
| Pfam | 32,822 | 3571 | 29,239 | 34,684 | 3708 | 30,963 |
| Swiss-Prot | 29,608 | 3503 | 26,055 | 31,485 | 3524 | 27,920 |
| eggNOG | 40,363 | 5251 | 35,043 | 43,423 | 5262 | 38,109 |
| Nr | 41,485 | 5612 | 35,784 | 44,692 | 5590 | 39,026 |
| All | 41,676 | 5699 | 35,884 | 45,050 | 5705 | 39,267 |
Fig. 1Homologous species distribution of P. americana and P. icosandra annotated based on the Nr database. a, P. americana; b, P. icosandra
Fig. 2Classification of the transcripts annotated by the Gene Ontology (GO)
Fig. 3Venn diagram of the number of lncRNAs predicted by CPC, CNCI, CPAT and Pfam. a, P. americana; b, P. icosandra
Fig. 4Classification of predicted transcription factors
Primers for RT-qPCR analysis
| Gene name | Gene description | Primer sequence (5′-3′) | Length (bp) | PCR Efficiency (%) | R2 |
|---|---|---|---|---|---|
| Tubulin 8 | F: GTAAGGAAGCCGAGAATTG | 181 | 103 | 0.984 | |
| R: TCAACAACAGTGTCAGAGA | |||||
| Elongation factor 1-alpha | F: TGAAGAAGGTCGGATACAAT | 194 | 95 | 0.993 | |
| R: GTAGACATCCTGGAGTGG | |||||
| Glyceraldehyde-3-phosphate dehydrogenase | F: TGGTGCTAAGAAGGTTATTATC | 198 | 104 | 0.986 | |
| R: GAGTGAACGGTGGTCATA | |||||
| Elongation factor 2 | F: GTATCACCATCAAGTCAACTG | 194 | 99 | 0.999 | |
| R: ACAATCAACCACAACAAGG | |||||
| 18S rRNA | F: ACTTCCTCTTCTCGTATCATT | 193 | 99 | 0.993 | |
| R: TGTTCAGCATAGACTGTGA | |||||
| Actin-7 | F: ATGCTATCCTTCGTCTGG | 180 | 105 | 0.996 | |
| R: TACTCTTGGCTGTCTCTG | |||||
| 28S rRNA | F: TACGATTGGTTACGGACAT | 191 | 102 | 0.983 | |
| R: TTCTCATCAACAACAGCATAT |
R2, linear regression coefficient
Fig. 5RNA transcription levels of seven candidate reference genes in P. americana, P.icosandra and P. acinosa. The expression level of candidate reference genes in total samples (n = 99) was presented as cycle threshold number (Ct-value) and explained by box and whisker plots. The asterisks represented the minimum and maximum Ct value. The squares indicated the 25th and 75th percentiles, and the median was represented by a bar across the square
Fig. 6Pairwise variation analyzed by geNorm to determine the optimal number of reference genes for accurate normalization. A threshold value of 0.15 was suggested for valid normalization. If the value of Vn/n + 1 (pairwise variation) is less than 0.15, then n reference genes in combination are recommended for gene normalization. If the value of Vn/n + 1 is more than 0.15, then Vn + 1/n + 2 should be taken into account. Pam: P. americana; Pic: P. icosandra; Pac: P. acinosa; LSRF: different tissues of leaves, stems, roots and flowers; GS: germinating seeds of these three plant species; F: flowers of these three plant species; LSR: different tissues of leaves, stems and roots; I: infested by S. litura of third instar