| Literature DB >> 29124087 |
Daniel Lepe-Soltero1, Alma Armenta-Medina1, Daoquan Xiang2, Raju Datla2, C Stewart Gillmor1, Cei Abreu-Goodger1.
Abstract
The genome annotation for the model plant Arabidopsis thaliana does not include the primary transcripts from which MIRNAs are processed. Here we present and analyze the raw mRNA sequencing data from wild type and serrate-1 globular stage embryos of A. thaliana, ecotype Columbia. Because SERRATE is required for pri-miRNA processing, these precursors accumulate in serrate-1 mutants, facilitating their detection using standard RNA-Seq protocols. We first use the mapping of the RNA-Seq reads to the reference genome to annotate the potential primary transcripts of MIRNAs expressed in the embryo. We then quantify these pri-miRNAs in wild type and serrate-1 mutants. Finally, we use differential expression analysis to determine which are up-regulated in serrate-1 compared to wild type, to select the best candidates for bona fide pri-miRNAs expressed in the globular stage embryos. In addition, we analyze a previously published RNA-Seq dataset of wild type and dicer-like 1 mutant embryos at the globular stage [1]. Our data are interpreted and discussed in a separate article [2].Entities:
Year: 2017 PMID: 29124087 PMCID: PMC5671514 DOI: 10.1016/j.dib.2017.10.019
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
RNA sequences obtained by Illumina sequencing.
| WT1 | 20,751,215 | 18,894,161 (91.05%) |
| WT2 | 21,623,459 | 19,362,653 (89.54%) |
| se1 | 20,708,775 | 18,580,516 (89.72%) |
| se2 | 22,984,350 | 20,882,004 (90.85%) |
RNA sequences from GSE25404.
| WT (Col-0) (SRR074122) | 21,413,867 | 5,153,766 (24.06%) |
| dlc1-5 (SRR074123) | 22,578,840 | 4,047,267 (17.92%) |
Groups of miRNAs according to their position relative to other annotation in the Arabidopsis genome.
| G1 | Intergenic region | 222 | 28 | 166 | 2 | 194 |
| G2 | Closest protein-coding gene is 1–400 bp away | 39 | 7 | 19 | 0 | 26 |
| G3A | Overlap of pre-miRNA with protein-coding gene | 56 | 0 | 54 | 3 | 54 |
| G3B | Overlap of pri-miRNA with protein-coding gene | 0 | 37 | 0 | 1 | 37 |
| G4 | Overlaps with non-coding gene | 8 | 5 | 2 | 1 | 7 |
The overlapping protein gene model was used instead of the pre-miRNA coordinates.
Common pri-miRNAs detected with at least 4 reads in the se-1 and dcl1-5 experiments.
| 117 | |
| 90 |
Behavior of differentially expressed pri-miRNAs in se-1 and dcl1-5.
| 117 common pri-miRNAs (no FDR filter) | |||
|---|---|---|---|
| 47 | 39 | ||
| 11 | 20 | ||
Behavior of differentially expressed pri-miRNAs in se-1 (FDR < 0.05) and dcl1-5 (no FDR filter).
| 90 common pri-miRNAs (FDR < 0.05 for | |||
|---|---|---|---|
| 37 | 31 | ||
| 7 | 15 | ||
| Subject area | |
| More specific subject area | |
| Type of data | Tables |
| How data was acquired | RNA-Seq from an Illumina HiSeq 2000 and previously published |
| Data format | |
| Experimental factors | Total RNA was extracted from pools of ~80 globular stage (32–64 cell) embryos isolated at 72 hours after pollination |
| Experimental features | |
| Data source location | |
| Data accessibility |