Deden Derajat Matra1, Arya Widura Ritonga1, Azis Natawijaya2, Roedhy Poerwanto1, Winarso Drajad Widodo1, Eiichi Inoue3.
Abstract
Baccaurea motleyana Müll. Arg. (rambai) is an underutilized fruit species native to Indonesia, Thailand, and the Malay Peninsula, and it is mostly cultivated on Java island (Lim, 2012) [1]. The edible parts of the fruit are the white and reddish arillodes, which taste sweet to acid-sweet. However, nucleotide and transcriptome information for this species is still scarce, and no sequences have been deposited in GenBank. In this data article, we report the first de novo transcriptome assembly for this species, generated from paired-end Illumina reads. Contigs were assembled with Trinity; after filtering and clustering, 37,077 contigs remained. Contig lengths ranged from 201 to 4972 bp, with an N50 of 696 bp. The contigs were annotated against several databases: Swiss-Prot, TrEMBL, and the NCBI nr and nt databases. The raw reads were deposited in DDBJ under DRA accession number DRA007358. The assembled contigs are deposited in the DDBJ TSA under accession numbers IADP01000001-IADP01037077 and can also be accessed at http://rujakbase.id.
Year: 2018 PMID: 30596128 PMCID: PMC6307336 DOI: 10.1016/j.dib.2018.12.031
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Read and assembly statistics of rambai (Baccaurea motleyana) arillode.
| Features | Number |
|---|---|
| Number of reads/total bases (bp) | 60,245,320/9,036,798,000 |
| Number/total bases (bp) of transcripts | 53,219/26,754,820 |
| Number/total bases (bp) of unigenes | 40,966/19,489,602 |
| Number/total bases (bp) of contigs | 37,077/19,675,275 |
| Length range, average, and N50 of transcripts (bp) | 201–4972/502.73/654 |
| Length range, average, and N50 of unigenes (bp) | 201–4972/475.75/609 |
| Length range, average, and N50 of contigs (bp) | 201–4972/530.66/696 |
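
The N50 and average-length columns above can be related to the raw counts with a short calculation. The sketch below is illustrative only and is not the authors' pipeline: it assumes the common definition of N50 (the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the total bases), the `n50` helper and the toy lengths are hypothetical, and only the count/total-base pairs are taken from the table.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths (common definition)."""
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        # The first length at which the running sum reaches half the total is the N50.
        if running >= half_total:
            return length
    raise ValueError("lengths must not be empty")


# Toy lengths (hypothetical, NOT from the rambai assembly) to show the definition.
toy_lengths = [2, 2, 2, 3, 3, 4, 8, 8]
toy_n50 = n50(toy_lengths)

# (number of sequences, total bases) as reported in the table above.
reported = {
    "transcripts": (53_219, 26_754_820),
    "unigenes": (40_966, 19_489_602),
    "contigs": (37_077, 19_675_275),
}

# Average length = total bases / number of sequences, rounded as in the table.
mean_lengths = {
    name: round(total / count, 2) for name, (count, total) in reported.items()
}

# Bases per read implied by the reported read and base totals.
bases_per_read = 9_036_798_000 / 60_245_320

if __name__ == "__main__":
    print("toy N50:", toy_n50)
    print("mean lengths:", mean_lengths)
    print("bases per read:", bases_per_read)
```

Computing the real N50 values (654, 609, and 696 bp) would require the individual sequence lengths from the deposited assembly; the totals in the table are enough only to reproduce the averages.
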
Functional annotation of rambai (Baccaurea motleyana) contigs.
| Database source | Number of contigs (%) |
|---|---|
| Contig number | 37,077 |
| Non-redundant protein (nr) NCBI | 25,647 (69.17%) |
| Non-redundant nucleotide (nt) NCBI | 22,712 (61.26%) |
| Swiss-Prot UniProt | 17,316 (46.70%) |
| TrEMBL UniProt | 26,299 (70.93%) |
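
The percentages in the annotation table are each database's annotated contig count divided by the 37,077 total contigs. A minimal sketch of that arithmetic follows; the variable names are illustrative and the counts are copied from the table.

```python
# Total contigs and per-database annotated counts, as reported in the table above.
total_contigs = 37_077
annotated = {
    "nr": 25_647,
    "nt": 22_712,
    "Swiss-Prot": 17_316,
    "TrEMBL": 26_299,
}

# Share of contigs with a hit in each database, in percent, rounded to two decimals.
percentages = {
    db: round(100 * hits / total_contigs, 2) for db, hits in annotated.items()
}

if __name__ == "__main__":
    for db, pct in percentages.items():
        print(f"{db}: {annotated[db]:,} contigs ({pct:.2f}%)")
```
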
| Subject area | Agricultural and Biological Sciences |
| More specific subject area | Horticulture |
| Type of data | RNA sequencing data |
| How data were acquired | Illumina HiSeq X Ten |
| Data format | Raw sequencing reads and assembled contigs |
| Experimental factors | RNA sequencing was performed using the Illumina HiSeq X Ten |
| Experimental features | RNA Sequencing of arillode tissue with reddish color at ripening stage |
| Data source location | Cileungsi, Bogor, West Java, Indonesia (6°24′50.1′′S 106°59′05.7′′E) |
| Data accessibility | The raw data have been deposited in the DNA Data Bank of Japan (DDBJ) under the DRA accession number, DRA007358 and the assembled contigs of transcriptome have been deposited in the DDBJ TSA repository with accession number, IADP01000001-IADP01037077 and also can be accessed at |
| Related research article | Lim T.K., |
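
As a consistency check on the data accessibility entry, the TSA accession range IADP01000001-IADP01037077 should contain one accession per assembled contig. The sketch below assumes the accessions consist of a six-character prefix (`IADP01`) followed by a zero-padded serial number; the `expand_accessions` helper is hypothetical and not part of any DDBJ tool.

```python
def expand_accessions(first, last, prefix_len=6):
    """Expand an accession range into individual IDs.

    Assumes both accessions share a prefix of `prefix_len` characters
    followed by a zero-padded serial number of equal width.
    """
    prefix, first_num = first[:prefix_len], first[prefix_len:]
    last_prefix, last_num = last[:prefix_len], last[prefix_len:]
    if prefix != last_prefix or len(first_num) != len(last_num):
        raise ValueError("accessions do not share the same prefix and width")
    width = len(first_num)
    return [
        f"{prefix}{serial:0{width}d}"
        for serial in range(int(first_num), int(last_num) + 1)
    ]


accessions = expand_accessions("IADP01000001", "IADP01037077")
n_accessions = len(accessions)

if __name__ == "__main__":
    print(n_accessions, accessions[0], accessions[-1])
```

Under that assumption, the range holds 37,077 accessions, which matches the number of contigs reported in the assembly statistics.
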