| Literature DB >> 31641161 |
Yanting Chang1, Tao Hu1, Wenbo Zhang1, Lin Zhou2, Yan Wang3, Zehui Jiang4.
Abstract
Tree peony (Paeonia suffruticosa Andrew) is a popular ornamental plant due to its large, fragrant and colorful flowers. The floral development is the most important event in its lifecycle. To explore the mechanism that regulate flower development, we sequenced the flower bud transcriptomes of 'High Noon', a reblooming cultivar of P. suffruticosa × P. lutea, using both full-length isoform-sequencing (ISO-seq) and RNA-seq were sequenced. A total of 15.94 Gb raw data were generated in full-length transcriptome sequencing of the 3 floral developmental stages, resulting 0.11 M protein-coding transcripts. Over 457.0 million reads were obtained by RNA-seq in the 3 floral buds. Here, we openly released the full-length transcriptome database of 'High Noon' and RNA-seq database of floral development. These databases can provide a fundamental genetic information of tree peony to investigate its transcript structure, variants and evolution. Data will facilitate to deep analyses of the transcriptome for flower development.Entities:
Mesh:
Year: 2019 PMID: 31641161 PMCID: PMC6805890 DOI: 10.1038/s41597-019-0240-1
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Fig. 1Overview of the experimental design and analysis pipeline. The RNA of ‘High Noon’ bud in its three developmental stages of was collected for PacBio Iso-sequencing and Illumina RNA-seq. The raw data of PacBio Iso-seq were classified and corrected to generate the high-quality transcript sequence. Then these sequences were used to predict the gene sequence and annotate the gene function. The raw data of Illumina RNA-seq were filtered using ng_QC package. Then the clean data were mapped to the full-length transcript sequences by RSEM and used to calculate the reads coverage and gene abundance with package DESeq R package and DESeq2, respectively. The sample stage was confirmed through the parallel samples observed through microscope. Stage I represents the vegetative stage. The stage II represents the bud developmental stage during which the floral organ differentiated. Stage III represents the stage during which the floral organ were formed and the floral bud was identified. Bar = 0.5 mm.
Summary of sequencing strategies in this study.
| Samples | Sequencing Strategy | Library Layout | Platform | Instrument model | Tissue | OD260/280 | OD260/230 | 25S/18S | RIN |
|---|---|---|---|---|---|---|---|---|---|
| S1-1 | RNA-seq | paired | Illumina | Illumina HiSeq 2000 | floral bud | 2.194 | 1.388 | 2.1 | 8.8 |
| S1-2 | RNA-seq | paired | Illumina | Illumina HiSeq 2000 | floral bud | 2.326 | 0.877 | 1.4 | 8.3 |
| S1-3 | RNA-seq | paired | Illumina | Illumina HiSeq 2000 | floral bud | 2 | 1.053 | 1.6 | 9.1 |
| S2-1 | RNA-seq | paired | Illumina | Illumina HiSeq 2000 | floral bud | 2.115 | 1.743 | 1.8 | 10 |
| S2-2 | RNA-seq | paired | Illumina | Illumina HiSeq 2000 | floral bud | 2.069 | 2.011 | 1.8 | 9.3 |
| S2-3 | RNA-seq | paired | Illumina | Illumina HiSeq 2000 | floral bud | 2.152 | 2.073 | 1.9 | 9.8 |
| S3-1 | RNA-seq | paired | Illumina | Illumina HiSeq 2000 | floral bud | 2.161 | 1.463 | 1.8 | 9.6 |
| S3-2 | RNA-seq | paired | Illumina | Illumina HiSeq 2000 | floral bud | 2.153 | 1.859 | 2 | 10 |
| S3-3 | RNA-seq | paired | Illumina | Illumina HiSeq 2000 | floral bud | 2.092 | 0.823 | 1.8 | 9.3 |
| HN | RNA-seq | single | PACBIO_SMART | PacBio RS II | floral bud | mixed | mixed | mixed | mixed |
Statistic of ISO-sequencing in this study.
| Type | Total_nucleotides (bp) | Total_reads_number | Average length (bp) |
|---|---|---|---|
| subreads | 15,936,030,572 | 15,654,254 | 1,018 |
| CCS | 1,236,332,390 | 714,643 | 1,730 |
| FLNC | 760,275,054 | 441,507 | 1,722 |
| Consensus | 397,594,800 | 214,800 | 1,851 |
| polished consensus | 387,499,200 | 214,800 | 1,804 |
| Before_correct | 387,469,755 | 214,800 | 1,804 |
| After_correct | 388,005,257 | 214,800 | 1,807 |
| CD-HIT | 237,754,844 | 115,439 | 2,060 |
Fig. 2The reads and alignment quality in RNA-seq. (a) Per sequence quality scores. The x-axis represents the mean sequence quality of RNA-seq reads. The y-axis represents the count of reads with specified Phred score. (b) Per base sequence quality. The x-axis represents the position of reads in RNA-seq. The y-axis represents the Phred scores of each base. (c) Per sequence GC content. The x-axis represents the percentage of GC of each read. The y-axis represents the count of the reads with. (d) Distribution of insert size after alignment. The x-axis represents the insert size of library after alignment. The y-axis represents the frequency of each size.
Fig. 3The statistic results of gene annotation and predicted CDS length distribution. (a) The transcripts were annotated in 7 databases. The y-axis represents the annotated gene number. (b) The CDS length distribution of transcripts in ISO-seq. The x-axis represents CDS length. The y-axis in left represent the frequency of different CDS length. The right y-axis represents the percentage of different CDS length. (c) The completeness of transcripts in ISO-seq. The completeness of transcripts was assessed by benchmarking universal single-copy ortholog (BUSCO). The x-axis represents the percentage of detected BUSCOs. The light blue diamond represents the complete (C) and single-copy (S) genes; the dark blue represent complete and duplicated (D) genes; the yellow diamond represents fragmented (F) genes; the red diamond represents the missing (M) genes. Total number of core genes queried was 1440.
Summary of Illumina RNA-sequencing.
| Sample | Raw Reads | Clean Reads | Clean Bases (G) | Total mapped reads | Mapping rate (%) |
|---|---|---|---|---|---|
| S1_1 | 49,022,606 | 48,474,096 | 7.27 | 41,483,460 | 85.58 |
| S1_2 | 48,375,304 | 47,857,452 | 7.18 | 40,130,002 | 83.85 |
| S1_3 | 50,839,914 | 49,433,588 | 7.42 | 41,993,190 | 84.95 |
| S2_1 | 58,645,596 | 58,018,926 | 8.7 | 50,391,040 | 86.85 |
| S2_2 | 55,158,816 | 54,515,864 | 8.18 | 47,983,332 | 88.02 |
| S2_3 | 46,277,122 | 45,839,116 | 6.88 | 40,461,148 | 88.27 |
| S3_1 | 47,068,706 | 45,955,414 | 6.89 | 40,193,166 | 87.46 |
| S3_2 | 48,714,736 | 47,383,326 | 7.11 | 41,088,574 | 86.72 |
| S3_3 | 52,899,086 | 51,759,904 | 7.76 | 45,315,526 | 87.55 |
Fig. 4Summary of the sample clustering. (a) Principal component analysis of the 9 samples based on expression levels. (b) The person correlation between the 9 samples based on expression levels. (c) The cluster analysis of the 9 samples based on expression levels.
DEGs relative to floral development.
| Gene ID | Gene ID in model plant | Gene Symbol | Iidentity (%) | Alignment length (bp) | Pathway |
|---|---|---|---|---|---|
| i1_HQ_HNbud_c49771/f2p0/1159 | AT1G24260.1 | SEP3 | 54.47 | 246 | floral developmental processes |
| i1_LQ_HNbud_c231904/f1p0/1102 | AT1G24260.1 | SEP3 | 74.49 | 247 | floral developmental processes |
| i2_HQ_HNbud_c9324/f2p0/2669 | AT4G32980.1 | ATH1 | 56.04 | 323 | floral developmental processes |
| i2_LQ_HNbud_c47156/f1p3/2045 | AT3G03090.1 | ATVGT1 | 58.49 | 477 | floral developmental processes |
| i1_HQ_HNbud_c27914/f7p7/1955 | AT3G03090.1 | ATVGT1 | 58.07 | 477 | floral developmental processes |
| i1_LQ_HNbud_c91035/f1p1/1218 | AT5G60910.1 | FUL | 68.45 | 206 | floral developmental processes |
| i4_HQ_HNbud_c4367/f6p1/4832 | AT5G67100.1 | ICU2 | 60.67 | 1556 | floral developmental processes |
| i1_LQ_HNbud_c105452/f1p0/1482 | AT5G61850.1 | LFY | 65.64 | 390 | floral developmental processes |
| i1_LQ_HNbud_c42466/f1p1/1326 | AT1G69490.1 | NAP | 63.58 | 162 | floral developmental processes |
| i3_LQ_HNbud_c3353/f1p4/3804 | AT4G04885.1 | PCFS4 | 60.67 | 239 | floral developmental processes |
| i3_LQ_HNbud_c2041/f1p5/3526 | AT4G04890.1 | PDF2 | 74.9 | 741 | floral developmental processes |
| i0_HQ_HNbud_c9787/f2p4/869 | AT5G20240.1 | PI | 56.13 | 212 | floral developmental processes |
| i0_LQ_HNbud_c42964/f1p0/955 | AT5G20240.1 | PI | 57.35 | 211 | floral developmental processes |
| i0_LQ_HNbud_c22315/f1p0/746 | AT5G37055.1 | SEF | 70.91 | 165 | floral developmental processes |
| i0_LQ_HNbud_c30233/f1p0/911 | AT3G02310.1 | SEPALLATA2 | 68.8 | 250 | floral developmental processes |
| i1_LQ_HNbud_c213752/f1p14/1085 | AT1G14400.1 | UBC1 | 94.74 | 152 | floral developmental processes |
| i1_LQ_HNbud_c10259/f1p0/1559 | AT1G30950.1 | UFO | 62.14 | 420 | floral developmental processes |
| i1_LQ_HNbud_c105095/f1p0/1458 | AT1G30950.1 | UFO | 56.9 | 420 | floral developmental processes |
| i0_LQ_HNbud_c79621/f1p2/890 | AT1G65480.1 | FT | 79.29 | 169 | floral integrator |
| i1_LQ_HNbud_c105115/f1p1/1458 | AT5G13790.1 | AGL15 | 59.32 | 236 | floral repressor |
| i1_LQ_HNbud_c133723/f1p1/1875 | AT5G13790.1 | AGL15 | 60.17 | 236 | floral repressor |
| i1_LQ_HNbud_c49765/f1p3/1453 | AT1G68840.1 | TEM2 | 63.32 | 368 | floral repressor |
| i1_HQ_HNbud_c33701/f2p5/1226 | AT2G01290.1 | RPI2 | 75.31 | 243 | flower meristem identity |
| i1_LQ_HNbud_c51781/f2p3/1522 | AT2G01290.1 | RPI2 | 73.68 | 266 | flower meristem identity |
| i1_LQ_HNbud_c128329/f1p0/1130 | AT1G69120.1 | AP1 | 69.14 | 256 | flower meristem identity |
| i1_LQ_HNbud_c51214/f1p0/1256 | AT1G69120.1 | AP1 | 69.66 | 178 | flower meristem identity |
| Measurement(s) | transcription profiling assay • full-length isoform |
| Technology Type(s) | RNA sequencing • isoform sequencing |
| Factor Type(s) | developmental stage |
| Sample Characteristic - Organism | Paeonia suffruticosa |