Eszter Virág, Géza Hegedűs, Barbara Kutasy, Kincső Decsi.
Abstract
The dynamics of flower development are a key agronomic characteristic affecting soybean yield. This paper reports an RNA-seq dataset of field-cultivated soybean flowers at four developmental stages: flower buds, and early, mature, and overblown flowers. Gene Expression (Gex) library construction and Illumina NextSeq550 sequencing were carried out to produce 86 bp long forward reads. Reads were preprocessed and deposited in the National Center for Biotechnology Information Sequence Read Archive (NCBI SRA) database under the BioProject accession PRJNA807844. A reference transcriptome dataset was assembled de novo from these SRA reads. Annotation, differential expression, and gene set enrichment analyses were performed, and the results were deposited in Mendeley Data.
Keywords: Development; Flower; Glycine max; Soybean; Transcriptome
Year: 2022 PMID: 35818355 PMCID: PMC9270202 DOI: 10.1016/j.dib.2022.108426
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Fig. 1. Floral samples were collected from soybean plants at all flowering stages. RNA-seq data are presented for flower buds (Glycine max flower: stage 0; A), early flowers (Glycine max flower: stage 1; B), mature flowers (Glycine max flower: stage 2; C) and overblown flowers (Glycine max flower: stage 3; D).
Samples used to create the CountTable and to determine DEGs.
| NCBI accession number | Sample | Raw library size | Maturation stage | Group |
|---|---|---|---|---|
| SRR16927693 | Glycine max 525-1 leaf | 3,635,514 | 0 | leaf |
| SRR18059506 | Glycine max flower: stage 0 | 5,541,655 | 1 | flower |
| SRR18059505 | Glycine max flower: stage 1 | 4,308,133 | 2 | flower |
| SRR18059504 | Glycine max flower: stage 2 | 4,112,204 | 3 | flower |
| SRR18059503 | Glycine max flower: stage 3 | 5,672,747 | 4 | flower |
Fig. 2. MDS plot of vegetative and generative samples. The plot shows the similarity between the samples; distances correspond to the leading log-fold change between each pair of samples. The leading log-fold change is the root-mean-square average of the largest absolute log-fold changes between each sample pair.
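The leading log-fold change distance behind the MDS plot in Fig. 2 can be illustrated with a minimal Python sketch. It assumes the common convention of selecting, for each sample pair, the genes with the largest absolute log2 fold changes and taking their root-mean-square. The function name `leading_logfc_distance`, the `top` parameter and the toy matrix are hypothetical and are not taken from the dataset or from the authors' pipeline.

```python
import numpy as np


def leading_logfc_distance(log_expr, top=500):
    """Pairwise distances from a genes x samples matrix of log2 expression.

    For each sample pair, the `top` genes with the largest absolute
    log2 fold change are selected and their root-mean-square is returned.
    """
    log_expr = np.asarray(log_expr, dtype=float)
    n_genes, n_samples = log_expr.shape
    top = min(top, n_genes)  # cannot use more genes than are available
    dist = np.zeros((n_samples, n_samples))
    for i in range(n_samples):
        for j in range(i + 1, n_samples):
            squared_lfc = (log_expr[:, i] - log_expr[:, j]) ** 2
            largest = np.sort(squared_lfc)[-top:]  # top squared log-fold changes
            dist[i, j] = dist[j, i] = np.sqrt(largest.mean())
    return dist


# Toy values for illustration only (3 genes x 2 samples), not from the dataset.
toy_log_expr = np.array([[0.0, 3.0],
                         [0.0, 4.0],
                         [0.0, 0.0]])
toy_dist = leading_logfc_distance(toy_log_expr, top=2)
```

A distance matrix of this kind could then be passed to any classical multidimensional scaling routine to obtain two-dimensional coordinates for plotting.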
Fig. 3. Heatmap of differentially expressed genes, with vegetative tissue (leaves) set as the reference and flower stage 1 as the test condition. Flower Stage 0-3 are the Glycine max flower: stage 0-3 samples, and vegetative tissue (leaves) corresponds to the Glycine max 525-1 leaf sample. For the annotation of transcript IDs, see the AnnotationTable (DOI: 10.17632/pv2vn2v6bd.2).
Fig. 4. Workflow of the methodology used to produce the presented dataset. The flowchart includes the investigated samples and the experimental steps, with the accessibility of the output data.
| Subject | Plant Science: Plant Physiology |
| Specific subject area | Genome-wide expression profiling was performed and differentially expressed genes were determined during the floral development of soy plants ( |
| Type of data | Table; Database record; Figure |
| How the data were acquired | Floral samples were collected from field-cultivated soybean plants during the period 10-16 June 2021 in Tata, Hungary. Approximately 50 mg of plant tissue was used to prepare Next Generation Sequencing (NGS) libraries. NextSeq550 sequencing was performed to produce approximately 15-16M 86 bp long reads per sample. Reads were pre-processed and assembled. A transcriptome dataset was reconstructed, and genome-wide expression profiles were determined using both the combined read set and the separate read sets of each sample. Pairwise differential expression analysis with gene set enrichment analysis (GSEA) was performed, and differentially expressed genes (DEGs) were annotated with gene ontology (GO) terms. |
| Data format | Raw; Analyzed; Filtered |
| Description of data collection | Flowers at four developmental stages (flower buds, and early, mature and overblown flowers) were collected from field populations of soybean plants during the period 10-16 June 2021 in Tata, Hungary. Plant materials were stored in DNA/RNA Shield (Zymo Research) at -25 °C until sequencing. |
| Data source location | EduCoMat Ltd., Keszthely, Hungary |
| Data accessibility | The BioProject and sequence reads are available in the National Center for Biotechnology Information (NCBI) database under the accessions: Repository name: Glycine max flowers raw sequence reads; Data identification number: