| Literature DB >> 29876460 |
Patricia M Aguilera1, Humberto J Debat2, Mauro Grabiele1.
Abstract
This contribution contains data associated to the research article entitled "Exploring the genes of yerba mate (Ilex paraguariensis A. St.-Hil.) by NGS and de novo transcriptome assembly" (Debat et al., 2014) [1]. By means of a bioinformatic approach involving extensive NGS data analyses, we provide a resource encompassing the full transcriptome assembly of yerba mate, the first available reference for the Ilex L. genus. This dataset (Supplementary files 1 and 2) consolidates the transcriptome-wide assembled sequences of I. paraguariensis with further comprehensive annotation of the protein coding genes of yerba mate via the integration of Arabidopsis thaliana databases. The generated data is pivotal for the characterization of agronomical relevant genes in the tree crop yerba mate -a non-model species- and related taxa in Ilex. The raw sequencing data dissected here is available at DDBJ/ENA/GenBank (NCBI Resource Coordinators, 2016) [2] Sequence Read Archive (SRA) under the accession SRP043293 and the assembled sequences have been deposited at the Transcriptome Shotgun Assembly Sequence Database (TSA) under the accession GFHV00000000.Entities:
Year: 2018 PMID: 29876460 PMCID: PMC5988406 DOI: 10.1016/j.dib.2018.02.015
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
32,480 BlastX annotated transcripts classified according to e-value ranges and mean parameters.
| 6,068 | 0 | 874.7 | 80.5 | 71.5 | 625.1 |
| 1,729 | 1.19e−180 to 9.42e−150 | 476.1 | 76.0 | 68.6 | 366.4 |
| 2,316 | 1.01e−149 to 9.99e−120 | 393.8 | 74.5 | 68.1 | 308.6 |
| 3,084 | 1.01e−119 to 9.87e−90 | 314.8 | 73.9 | 66.8 | 261.3 |
| 4,485 | 1.01e−89 to 9.98e−60 | 230.8 | 74.4 | 66.4 | 193.0 |
| 7,327 | 1.03e−59 to 9.92e−30 | 148.0 | 76.8 | 65.0 | 128.7 |
| 7,471 | 1.00e−29 to 9.87e−06 | 76.1 | 63.8 | 55.0 | 84.5 |
Fig. 132,480 BlastX annotated transcripts classified according to e-value ranges (A) and direct and reverse (R) frames (B).
30,476 tBlastn annotated transcripts classified according to e-value ranges and mean parameters.
| 8,046 | 0 | 851.6 | 93.2 | 69.7 | 623.2 |
| 2,426 | 1.05e−180 to 8.96e−150 | 475.0 | 89.8 | 63.3 | 412.3 |
| 2,979 | 1.03e−149 to 9.93e−120 | 393.4 | 86.3 | 62.1 | 355.1 |
| 3,618 | 1.04e−119 to 9.81e−90 | 314.7 | 82.7 | 59.4 | 311.0 |
| 4,055 | 1.10e−89 to 9.33e−60 | 232.7 | 77.3 | 58.1 | 246.2 |
| 4,776 | 1.00e−59 to 9.84e−30 | 152.1 | 69.1 | 52.6 | 190.7 |
| 4,576 | 1.00e−29 to 9.91e−06 | 77.8 | 55.7 | 43.3 | 133.7 |
Fig. 230,476 tBlastn annotated transcripts classified according to e-value ranges (A) and direct and reverse frames (B).
9,437 RBH sensu stricto annotated pairs classified according to e-value ranges and mean parameters.
| 6,748 | 0 | 876.1 | 87.1 | 71.5 | 623.0 |
| 1,563 | 1.16e−180 to 9.42e−150 | 472.1 | 82.7 | 68.1 | 366.0 |
| 1,933 | 1.01e−149 to 9.93e−120 | 389.1 | 80.6 | 67.7 | 306.0 |
| 2,280 | 1.01e−119 to 9.74e−90 | 310.2 | 77.5 | 65.9 | 258.0 |
| 2,541 | 1.03e−89 to 9.98e−60 | 228.4 | 73.8 | 64.5 | 199.0 |
| 2,525 | 1.00e−59 to 9.92e−30 | 150.0 | 68.4 | 61.6 | 143.0 |
| 1,284 | 1.03e−29 to 9.91e−06 | 80.4 | 59.2 | 52.7 | 99.0 |
| Subject area | Agronomy-Biology |
| More specific subject area | Transcriptomics |
| Type of data | Assembly of reads and sequence annotation |
| How data was acquired | Bioinformatics approach |
| Data format | Analyzed |
| Experimental factors | RNA, used for library construction and Illumina sequencing, was isolated from leaf samples at emerging, young, fully expanded, and early and late senescent stages from |
| Experimental features | Paired-end 100 nt Raw reads were filtered and |
| Data source location | Misiones, Argentina |
| Data accessibility | Data are within this article and at DDBJ/ENA/GenBank under the accessions |