Literature DB >> 29188226

Data on RNA-seq analysis of Garcinia mangostana L. seed development.

Othman Mazlan1, Azhani Abdul-Rahman1, Hoe-Han Goh1, Wan Mohd Aizat1, Normah Mohd Noor1.   

Abstract

Mangosteen (Garcinia mangostana L.) has exceptional potential for commercial and pharmaceutical applications due to its delicious fruit and medicinal properties. Nevertheless, the molecular mechanism of mangosteen seed development is poorly understood. In this study, we performed transcriptomic analysis of four seed developmental stages; eight, ten, twelve and fourteen weeks after anthesis. Illumina HiSeq™ 4000 sequencer was used to generate raw data of approximately 68 Gb in size. From 451,495,326 raw reads, 406,143,756 clean reads were obtained. The raw data were uploaded to SRA database and the BioProject ID is PRJNA395504. These data provide the basis for further exploration and understanding of the molecular mechanism in mangosteen seed development.

Entities:  

Keywords:  Mangosteen fruit; RNA sequencing; Seed development; Transcriptomics

Year:  2017        PMID: 29188226      PMCID: PMC5694954          DOI: 10.1016/j.dib.2017.11.001

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Specifications Table Value of the data The data obtained using Illumina sequencer is the first report on RNA-seq of mangosteen seed at different developmental stages (eight, ten, twelve and fourteen weeks after anthesis). This permits the identification of differentially expressed genes that may play an important role in mangosteen seed development. Transcriptomics analysis provides the foundation in elucidating the molecular regulation during mangosteen seed development. Data obtained will be valuable for further investigation on putative genes and proteins discovery in mangosteen seed development.

Data

This dataset are raw reads for mangosteen seed at four different developmental stages; eight, ten, twelve and fourteen weeks after anthesis. Consequently, the data were de novo assembled into full-length transcriptome.

Experimental design, materials and methods

Plant materials

Mangosteen fruit were obtained from mangosteen plots at Universiti Kebangsaan Malaysia, Bangi (2°55′09.0″N 101°47′04.8″E). Flowers of mangosteen were labelled at anthesis during its flowering season (March – April 2014). During fruiting period (June – August 2014), fruits were harvested for seeds at eight, ten, twelve and fourteen weeks after anthesis denoting different developmental stages. Seed samples were stored at −80 °C and grounded to fine powder prior analysis.

Total RNA extraction and quality control, library preparation and transcriptomic service

Extraction of seed total RNA was done [1], [2] via modifying the CTAB method [3]. For quality control, NanoDrop spectrophotometer (Thermo Fisher Scientific Inc., USA) and Agilent 2100 Bioanalyzer (Agilent Technologies, USA) were used to determine the total RNA quantity, quality and reliability. Samples with RNA integrity number (RIN) of around 8.0 or higher were selected for library preparation and sequencing. The mRNA library preparation employed was SureSelect Strand-Specific RNA Library Prep for Illumina Multiplexed Sequencing (protocol version E0, March 2017). Consequently, RNA-seq was performed using Illumina HiSeq™ 4000 (Theragene Etex, South Korea), generating 150 bp of paired end reads (Table 1).
Table 1

SRA accession numbers and links for raw data of mangosteen seed development transcriptome. Mangosteen seed developmental stages; eight (W8), ten (W10), twelve (12) and fourteen (W14) weeks after anthesis.

StageReplicateAccession numberAccession links
W8GmW8-ASRX3066480http://www.ncbi.nlm.nih.gov/sra/SRX3066480
GmW8-BSRX3066479http://www.ncbi.nlm.nih.gov/sra/SRX3066479
W10GmW10-ASRX3066478http://www.ncbi.nlm.nih.gov/sra/SRX3066478
GmW10-BSRX3066477http://www.ncbi.nlm.nih.gov/sra/SRX3066477
W12GmW12-ASRX3066481http://www.ncbi.nlm.nih.gov/sra/SRX3066481
GmW12-BSRX3066475http://www.ncbi.nlm.nih.gov/sra/SRX3066475
W14GmW14-ASRX3066474http://www.ncbi.nlm.nih.gov/sra/SRX3066474
GmW14-BSRX3066476http://www.ncbi.nlm.nih.gov/sra/SRX3066476
SRA accession numbers and links for raw data of mangosteen seed development transcriptome. Mangosteen seed developmental stages; eight (W8), ten (W10), twelve (12) and fourteen (W14) weeks after anthesis.

De novo transcriptome assembly

Quality control of raw reads were tested via FastQC version 0.11.2 [4]. Then, high quality reads were obtained by trimming adapters and other unwanted sequences sequence using cutadapt version 1.9.1 [5] and filtering the reads using in-house script by Theragen Etex Bio Institute, Republic of Korea (Table 2). Trinity version 2.1.1 [6] was used to assemble the reads de novo [7] with default configuration while TIGR Gene Indices clustering tools version 2.1 (Identity; 0.94) [8] was used to omit redundant sequences and cluster them into non-redundant unigenes set. A total of 101,384 unigenes were found and their average length is 784 bp (Table 3).
Table 2

Statistics of raw and clean reads and bases of mangosteen seed development transcriptome. Mangosteen seed developmental stages; eight (W8), ten (W10), twelve (12) and fourteen (W14) weeks after anthesis.

StageReplicateRaw
Clean
ReadsBases (bp)ReadsBases (bp)
W8GmW8-A61,372,7669,267,287,66658,496,8647,896,242,284
GmW8-B57,393,2608,666,382,26054,766,2267,410,657,947
W10GmW10-A55,824,3568,429,477,75653,424,6087,198,009,302
GmW10-B47,977,5467,244,609,44645,998,7286,149,885,945
W12GmW12-A52,078,7607,863,892,76031,555,0524,113,212,390
GmW12-B49,468,8747,469,799,97446,959,9546,170,160,791
W14GmW14-A64,984,5669,812,669,46662,069,5387,964,059,718
GmW14-B62,395,1989,421,674,89852,872,7866,933,874,925
Table 3

Statistics of mangosteen seed development transcriptome assembly.

AttributesValue
Pre-assembly
Total raw reads (bases, bp)451,495,326 (68,175,794,226 bp)
Total clean reads (bases, bp)406,143,756 (53,836,103,302 bp)
Post-assembly
Number of unigenes (bases, bp)101,384 (79,466,965 bp)
Average length of unigenes784 bp
N50
Statistics of raw and clean reads and bases of mangosteen seed development transcriptome. Mangosteen seed developmental stages; eight (W8), ten (W10), twelve (12) and fourteen (W14) weeks after anthesis. Statistics of mangosteen seed development transcriptome assembly.
Subject areaBiology
More specific subject areaTranscriptomics
Type of dataTables, text file
How data was acquiredIllumina HiSeq™ 4000
Data formatRaw (FASTQ)
Experimental factorsMangosteen seed development; eight, ten, twelve and fourteen weeks after anthesis.
Experimental featuresTranscriptome of mangosteen seed development
Data source locationUKM Bangi, Malaysia (2°55′09.0″N 101°47′04.8″E)
Data accessibilityData can be accessed from NCBI SRA (BioProject ID: PRJNA395504) (https://www.ncbi.nlm.nih.gov/bioproject/PRJNA395504)
  5 in total

1.  TIGR Gene Indices clustering tools (TGICL): a software system for fast clustering of large EST datasets.

Authors:  Geo Pertea; Xiaoqiu Huang; Feng Liang; Valentin Antonescu; Razvan Sultana; Svetlana Karamycheva; Yuandan Lee; Joseph White; Foo Cheung; Babak Parvizi; Jennifer Tsai; John Quackenbush
Journal:  Bioinformatics       Date:  2003-03-22       Impact factor: 6.937

2.  Rapid and reliable method of extracting DNA and RNA from sweetpotato, Ipomoea batatas (L). Lam.

Authors:  Sun-Hyung Kim; Tatsuro Hamada
Journal:  Biotechnol Lett       Date:  2005-12       Impact factor: 2.461

3.  De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis.

Authors:  Brian J Haas; Alexie Papanicolaou; Moran Yassour; Manfred Grabherr; Philip D Blood; Joshua Bowden; Matthew Brian Couger; David Eccles; Bo Li; Matthias Lieber; Matthew D MacManes; Michael Ott; Joshua Orvis; Nathalie Pochet; Francesco Strozzi; Nathan Weeks; Rick Westerman; Thomas William; Colin N Dewey; Robert Henschel; Richard D LeDuc; Nir Friedman; Aviv Regev
Journal:  Nat Protoc       Date:  2013-07-11       Impact factor: 13.491

4.  Full-length transcriptome assembly from RNA-Seq data without a reference genome.

Authors:  Manfred G Grabherr; Brian J Haas; Moran Yassour; Joshua Z Levin; Dawn A Thompson; Ido Amit; Xian Adiconis; Lin Fan; Raktima Raychowdhury; Qiandong Zeng; Zehua Chen; Evan Mauceli; Nir Hacohen; Andreas Gnirke; Nicholas Rhind; Federica di Palma; Bruce W Birren; Chad Nusbaum; Kerstin Lindblad-Toh; Nir Friedman; Aviv Regev
Journal:  Nat Biotechnol       Date:  2011-05-15       Impact factor: 54.908

5.  RNA-seq analysis of mangosteen (Garcinia mangostana L.) fruit ripening.

Authors:  Azhani Abdul-Rahman; Hoe-Han Goh; Kok-Keong Loke; Normah Mohd Noor; Wan Mohd Aizat
Journal:  Genom Data       Date:  2017-05-13
  5 in total
  2 in total

1.  Recent updates on metabolite composition and medicinal benefits of mangosteen plant.

Authors:  Wan Mohd Aizat; Ili Nadhirah Jamil; Faridda Hannim Ahmad-Hashim; Normah Mohd Noor
Journal:  PeerJ       Date:  2019-01-31       Impact factor: 2.984

2.  Mass spectrometry dataset for LC-MS metabolomics analysis of Garcinia mangostana L. seed development.

Authors:  Othman Mazlan; Wan Mohd Aizat; Syarul Nataqain Baharum; Kamalrul Azlan Azizan; Normah Mohd Noor
Journal:  Data Brief       Date:  2018-10-12
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.