Literature DB >> 31667253

The dataset of de novo transcriptome assembly of Falcataria moluccana cambium from gall-rust (Uromycladium falcatarium) infected and non-infected tree.

Hasyyati Shabrina1, Ulfah J Siregar2, Deden D Matra3, Iskandar Z Siregar2.   

Abstract

Sengon (Falcataria moluccana), formerly known as Albizia falcataria or Paraserianthes falcataria, is an essential tree species for the development of community-based timber plantation, especially in Indonesia. The plantations nowadays are facing a significant disease threat, namely infection of gall-rust fungi (Uromycladium falcatarium). However, a clear understanding of the molecular mechanisms of the tree response against the disease is still unknown. We carried out transcriptome assembly using BGISEQ-500 technology to provide assembled de novo transcriptome dataset generated from gall-rust infected and non-infected trees. The construction of assembled transcriptome was conducted using Trinity v.2.3.2 The raw reads are available in the DDBJ platform with accession number, DRA007983.
© 2019 The Authors.

Entities:  

Keywords:  De novo; Falcataria moluccana; Gall-rust (Uromycladium falcatarium) infected; Next-generation sequencing (NGS); RNA; Transcriptome

Year:  2019        PMID: 31667253      PMCID: PMC6811874          DOI: 10.1016/j.dib.2019.104489

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Specifications Table This data provides the assembled-transcriptome of Sengon (Falcataria moluccana) from cambium tissue This data will be of practical use to develop genetic markers as a tool for assisting gall-rust resistant Sengon tree improvement program This data will further be advantageous in analyzing differential gene expression to understand molecular mechanism regarding infection of gall rust disease in Sengon

Data

This data presented the de novo transcriptome assembly of Falcataria moluccana (Miq.) Barneby & JW Grimes (Indonesia: Sengon, Malaysia: Batai, Hawaii: Albizia, East Timor: Madre de Cacao, Philippines: Falcata). The Sengon tree is cultivated widely in Indonesia as a material source for veneer, light construction, and pellet in bioenergy. However, the plantations are facing the most damaging threat that damaging the wood and lowering the productivity, gall-rust (Uromycladium falcatarium) fungi infection. Altough the fungi could infect any tissue, the wood or stem, which is the main product of the plantation, was the part that damaged badly. After the fungi entered the bark, the gall-rust fungi will wounded the cell in cambium, causing death in some cells and grow itself in cambium tissues by blocking the nutrient. The fungi also secretes growth hormones that caused the undifferentiated cell in cambium to grow rapidly and created wood deformity. The transcriptome of Sengon was derived from the RNA collected from the cambium part of the stem. The sequencing process was generated using BGISEQ-500 platform. The properties of the reads and the assembled sequences are shown in Table 1. Overview of the transcriptome data is presented in Table 2. The KEGG pathway analysis generated pathways with high number of contigs which involved in gall-rust infection incidence (Table 3). Among the microsatellite motifs from the merged sequences (Table 4), mono- (27,917; 39.83%) and dinucleotide (8,242; 11.76%) were the most common, and the three most frequent motifs were A/T (26,157; 64.33%), AG/CT (4,300; 10.58%), and AT/AT (2,195; 5.40%). The most frequent trinucleotide, tetranucleotide, pentanucleotide, and hexanucleotide were AAG/CTT (1,378; 3.39%), AAAG/CTTT (83; 0.2%), AAAAG/CTTTT (8; 0.02%), and AAACCC/GGGTTT (7; 0.02%) respectively.
Table 1

The properties of raw reads and assembled sequences of Sengon cambium.

FeaturesNumbers
infectednon-infectedMerged
Raw Reads
 Numbers79,054,11279,037,484158,090,000
 Bases7,905,411,2007,903,748,40015,810,000,000
Unique Transcripts
 Numbers380,032118,171400,633
 Bases278,860,152114,550,451325,278,365
 Length range (bp)201-17,824201-8,098201-12,634
 Average (bp)733.78969.36811.91
 N50 (bp)132414391521
 GC contents (%)40.140.639.8
Contigs
 Numbers72,96841,33270,089
 Bases100,689,84757,231,623115,068,922
 Length range (bp)201-21,986201-8,098181-15,997
 Average (bp)733.7813851641
 N50 (bp)107412431382
 GC contents (%)41.140.840.6
Table 2

Functional annotation of Sengon contigs.

SourceNumber (percentage)
InfectedNon-infectedMerged
Contig Number72,96841,33270,089
Contigs:
 Nr NCBI68,927 (98.34)
 Nt NCBI61,213 (83.89)37,986 (91.90)62,880 (89.71)
 SwissProt40,694 (55.76)25,633 (62.02)42,783 (61.04)
 TrEMBL58,593 (83.59)
 Gene ontology17,134 (24.45)
 KEGG3,256 (4.65)
Table 3

Top 20 KEGG pathway with the highest contig numbers of merged sequence.

PathwayPathway ID#Enzymes in pathway#Contigs of Enzyme
Purine metabolismmap00230754
Thiamine metabolismmap00730244
Biosynthesis of antibioticsmap01130831
Pyrimidine metabolismmap00240817
Folate biosynthesismap00790216
One carbon pool by folatemap00670316
beta-Lactam resistancemap01501113
Drug metabolism - other enzymesmap00983213
Penicillin and cephalosporin biosynthesismap00311113
Terpenoid backbone biosynthesismap00900112
Aminoacyl-tRNA biosynthesismap0097039
Alanine, aspartate and glutamate metabolismmap0025038
Cysteine and methionine metabolismmap0027038
Porphyrin and chlorophyll metabolismmap0086036
Galactose metabolismmap0005216
Oxidative phosphorylationmap0019015
Glutathione metabolismmap0048024
Pantothenate and CoA biosynthesismap0077034
Cyanoamino acid metabolismmap0046013
Glycine, serine and threonine metabolismmap0026023
Phenylalanine, tyrosine and tryptophan biosynthesismap0040023
Table 4

The number and motif of microsatellite of Sengon contigs.

MotifsNumber of contigs (percentage)
InfectedNon-infectedMerged
Mononucleotide15,979 (21.90)18,786 (45.45)27,917 (39.83)
Dinucleotide7,157 (9,81)3,879 (9.38)8,242 (11.76)
Trinucleotide3,676 (5.04)2,258 (5.46)4,172 (5.95)
Tetranucleotide216 (0.30)94 (0.23)255 (0.36)
Pentanucleotide31 (0.04)6 (0.01)28 (0.04)
Hexanucleotide41 (0.06)22 (0.05)48 (0.07)
The properties of raw reads and assembled sequences of Sengon cambium. Functional annotation of Sengon contigs. Top 20 KEGG pathway with the highest contig numbers of merged sequence. The number and motif of microsatellite of Sengon contigs.

Experimental design, materials, and methods

The cambiums of Sengon tree were collected from a private plantation in Bogor, West Java, Indonesia (6°32′38.9″S 106°44′24.5″E). The RNA was extracted from the cambium part originated from gall-rust infected and gall-rust free tree, each type of tree consist of one individual and no replication. The RNA extracted by BGI from 80 mg tissue sample using established CTAB-pBIOZOL method. The integrity and quantity of isolated-RNA were monitored and quantified by a NanoDrop ND-1000 spectrophotometer and Agilent 2100 Bioanalyzer. The non-infected sample was treated with Ribozero due to ribosomal RNA contamination. The RNA sequencing was carried out using the BGISEQ-500 (BGI, Hong Kong) and produced pre-processing reads to discard the adaptors and low-quality reads. The pre-processed reads were de novo assembled by Trinity v.2.3.2 [1], and the high-quality contigs were obtained [2], [3]. These contigs were annotated using BLAST + program [4] against the NCBI non-redundant (Nr), Nucleotide sequences (Nt) and protein sequence database of UniProt (SwissProt and TrEMBL). The annotated-contigs from Nr database were performed using Blast2Go software [5] to obtain functional annotation of Gene Ontology and KEGG pathway. Identification of contigs containing microsatellites was performed using MISA program [6], with minimum repeats were: 10 for 1 base, 6 for 2 bases, and 5 for 3, 4, 5, and 6 bases; and the interruptions (the maximum difference between microsatellites) were 100 bases.

Specifications Table

SubjectAgricultural and Biological Sciences: Forestry
Specific subject areaMolecular study in Forestry
Type of dataRNA Sequencing Data
How data were acquiredThe data acquired by Next-generation Sequencing technology using BGISEQ-500 platform
Data formatRaw Sequencing reads and assembled contigs
Parameters for data collectionThe cambium was collected each from outer sapwood of gall-rust infected and non-infected Sengon tree.
Description of data collectionThe RNA was sequenced with Next Generation Sequencing method using BGISEQ-500 platform in Beijing Genomic Institute, Hongkong.
Data source locationCity/Region: Bogor/West JavaCountry: IndonesiaLatitude and longitude (and GPS coordinates) for collected samples/data: 6°32′38.9″S 106°44′24.5″E
Data accessibilityRepository name: DDBJ (DNA Data Bank of Japan)Data identification number: DRA007983Direct URL to data: https://ddbj.nig.ac.jp/DRASearch/submission?acc=DRA007983
Related research articleN.E. Lelana, S. Wiyanto, Gianto, I.Z. Siregar.Genetic diversity of Falcataria moluccana and its relationship to the resistance of gall rust disease.Biodiversitas (2018): 12–17. DOI: https://doi.org/10.13057/biodiv/d190102D. Rahmawati, N. Khumaida, U.J. Siregar.Morphological and phytochemical characterization of susceptible and resistant Sengon (Falcataria moluccana) tree to gall rust disease. Biodiversitas 20 (2019): 907–913. DOI: https://doi.org/10.13057/biodiv/d200340
Value of the data

This data provides the assembled-transcriptome of Sengon (Falcataria moluccana) from cambium tissue

This data will be of practical use to develop genetic markers as a tool for assisting gall-rust resistant Sengon tree improvement program

This data will further be advantageous in analyzing differential gene expression to understand molecular mechanism regarding infection of gall rust disease in Sengon

  6 in total

1.  Basic local alignment search tool.

Authors:  S F Altschul; W Gish; W Miller; E W Myers; D J Lipman
Journal:  J Mol Biol       Date:  1990-10-05       Impact factor: 5.469

2.  Exploiting EST databases for the development and characterization of gene-derived SSR-markers in barley (Hordeum vulgare L.).

Authors:  T Thiel; W Michalek; R K Varshney; A Graner
Journal:  Theor Appl Genet       Date:  2002-09-14       Impact factor: 5.699

3.  Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research.

Authors:  Ana Conesa; Stefan Götz; Juan Miguel García-Gómez; Javier Terol; Manuel Talón; Montserrat Robles
Journal:  Bioinformatics       Date:  2005-08-04       Impact factor: 6.937

4.  Dataset from de novo transcriptome assembly of Nephelium lappaceum aril.

Authors:  Deden Derajat Matra; Arya Widura Ritonga; Azis Natawijaya; Roedhy Poerwanto; Winarso Drajad Widodo; Eiichi Inoue
Journal:  Data Brief       Date:  2018-12-14

5.  Dataset of the first de novo transcriptome assembly of the arillode of Baccaurea motleyana.

Authors:  Deden Derajat Matra; Arya Widura Ritonga; Azis Natawijaya; Roedhy Poerwanto; Winarso Drajad Widodo; Eiichi Inoue
Journal:  Data Brief       Date:  2018-12-13

6.  Full-length transcriptome assembly from RNA-Seq data without a reference genome.

Authors:  Manfred G Grabherr; Brian J Haas; Moran Yassour; Joshua Z Levin; Dawn A Thompson; Ido Amit; Xian Adiconis; Lin Fan; Raktima Raychowdhury; Qiandong Zeng; Zehua Chen; Evan Mauceli; Nir Hacohen; Andreas Gnirke; Nicholas Rhind; Federica di Palma; Bruce W Birren; Chad Nusbaum; Kerstin Lindblad-Toh; Nir Friedman; Aviv Regev
Journal:  Nat Biotechnol       Date:  2011-05-15       Impact factor: 54.908

  6 in total
  2 in total

1.  De novo assembly of transcriptome dataset from leaves of Dryobalanops aromatica (Syn. Dryobalanops sumatrensis) seedlings grown in two contrasting potting media.

Authors:  Iskandar Zulkarnaen Siregar; Fifi Gus Dwiyanti; Ulfah Juniarti Siregar; Deden Derajat Matra
Journal:  BMC Res Notes       Date:  2020-08-28

2.  De novo transcriptome assembly data for sengon (Falcataria moluccana) trees displaying resistance and susceptibility to boktor stem borers (Xystrocera festiva Pascoe).

Authors:  Ulfah J Siregar; Aditya Nugroho; Hasyyati Shabrina; Fitri Indriani; Apriliya Damayanti; Deden D Matra
Journal:  BMC Res Notes       Date:  2021-07-07
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.