Literature DB >> 30229114

De novo transcriptome resources of the lichens, Dirinaria sp. UKM-J1 and UKM-K1 collected from Jerantut and Klang, Malaysia.

Izwan Bharudin1, Siti NurhaniAbdul Abdul Rahim1, Mohd FaizalAbu Abu Bakar2, Siti Norsaidah Ibrahim1, Shazilah Kamaruddin1, Mohd Talib Latif3, Mohd Wahid Samsudin4, Abdul Munir Abdul Murad1, Farah DibaAbu Abu Bakar1.   

Abstract

Lichen is a symbiotic organism that exists as a single composite body consisting of a mycobiont (fungus) and a photobiont (algae or a cyanobacterium). Many lichen species are considered as extremophiles due to their tolerance to radiation, desiccation, temperature and pollution. However, not all lichen species are tolerant to harsh environmental conditions as several species are sensitive for example to nitrogen, sulphur, acidity, heavy metals, halogens (e.g. fluoride) and ozone. Thus, to better understand why some lichens can withstand exposure to pollutants as opposed to those that are susceptible, we focused on the lichen species of Dirinaria known for their wide distribution in the tropics, subtropics and pantropical, and moderate tolerance to air pollution. Their moderate tolerance to air pollution affords them to thrive in good air quality environments as well as polluted air environments. Lichen samples of Dirinaria sp., UKM-J1 and UKM-K1, were respectively collected from two areas with different levels of air quality based on Air Pollutant Index or API (with index pollutant criteria of PM10, carbon monoxide, ozone, nitrogen dioxide and sulfur dioxide) in the outskirt of Jerantut (UKM-J1), a rural area in the middle of Peninsular Malaysia and the township of Klang (UKM-K1), in a busy area of the Klang Valley, Malaysia. API was monitored throughout 2012-2013 whereby the sample collection site in Klang showed markedly higher concentrations of pollutants in all the index pollutant criteria as compared to that of Jerantut. We performed transcriptome sequencing using Illumina RNA-seq technology and de novo assembly of the transcripts from the lichen samples. Raw reads from both libraries were deposited in the NCBI database with the accession number SRP138994.

Entities:  

Keywords:  Air pollution; De novo assembly; Dirinaria sp.; Gene discovery; RNA-seq; Symbiosis

Year:  2018        PMID: 30229114      PMCID: PMC6141494          DOI: 10.1016/j.dib.2018.07.020

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Specifications Table

Value of the data

The data obtained using Illumina sequencer is the first source of Dirinaria sp. RNA-seq. These data provide a glimpse into the molecular nature of the lichen Dirinaria sp. and contribute further to the understanding of lichen symbiosis. The data presented here can be used for gene discovery in examining the tolerance of this lichen towards air pollution. The data can also be used in unravelling genes and pathways involved in the synthesis of unique secondary metabolites.

Data

Two lichen samples collected from Jerantut (good air quality), Pahang and Klang (polluted air quality), Selangor, Malaysia (Table 1), were identified as Dirinaria sp. UKM-J1 and K1, respectively, through Polymerase Chain Reaction (PCR) and rRNA Internal Transcribed Spacer (ITS) sequencing [1]. Transcriptome data were generated from the total RNA extracted from these two lichen samples. Details of the experimental procedure and sequence analyses are described in the next section.
Table 1

Comparison of the average concentrations of five air pollutants throughout 2012–2013 between sample collection sites of Jerantut and Klang, Malaysia.

JerantutKlang
Carbon monoxide, CO (ppm)0.120.61
Sulphur dioxide, SO2 (ppb)0.704.72
Nitrogen dioxide, NO2 (ppb)2.2121.10
Ozone, O3 (ppb)22.6128.81
Suspended particulates, PM10 (µg/m3)33.9462.87
Comparison of the average concentrations of five air pollutants throughout 2012–2013 between sample collection sites of Jerantut and Klang, Malaysia.

Experimental design, materials and methods

Sample preparation

Lichen sample collection, RNA extraction procedure and library construction for data production were as previously described [1].

Assembly and RNA-seq analysis

The raw RNA-seq data from Dirinaria sp. were trimmed and filtered with SolexaQA++ [2] to acquire high-quality reads. Phred quality value of Q20 and reads longer than 50 bp were used as parameters. Paired-end reads were determined using Perl script select_paired.pl [3]. De novo assembly of high-quality reads was carried out using Trinity RNA-Seq v2.4.0 [4], with default parameters. Table 2 shows the RNA-seq statistics whereas the assembly statistics are as shown as in Table 3.
Table 2

Statistics of the RNA-seq generated from two different libraries.

JerantutKlang
Raw reads61,101,76646,951,030
Clean reads51,051,34439,379,978
Read counts for transcriptome assembly (paired-end reads)46,457,80636,013,638
Average read length (bp)8990
Total base pair (bp)4,125,748,9023,233,294,069
Table 3

Assembly statistics using Trinity RNA-Seq v2.4.0.

AttributesValue
Number of transcripts379,310
Total residues (bp)353,841,575
Average length (bp)935
N50 transcript1902
Largest transcript (bp)14,535
Smallest transcript (bp)201
Statistics of the RNA-seq generated from two different libraries. Assembly statistics using Trinity RNA-Seq v2.4.0.
Subject areaBiology
More specific subject areaEnvironmental microbiology
Type of dataTranscriptome data
How data was acquiredPaired-end transcriptome of Dirinaria sp. was sequenced using Illumina HiSeq. 2000 at Malaysia Genome Institute. De novo transcriptome assembly was performed using Trinity RNA-Seq v2.4.0
Data formatRaw sequences (FASTQ)
Experimental factorsSamples were collected from two different sites with different air qualities based on Air Pollution Index (polluted air quality and good air quality)
Experimental featuresLichen Dirinaria sp. samples were collected from tree barks in Klang and Jerantut, Malaysia.
Data source locationJerantut and Klang, Malaysia
Data accessibilityRaw FASTQ files were deposited in NCBI SRA database with the accession numberSRP138994 (https://www.ncbi.nlm.nih.gov/sra/?term=SRP138994)
  3 in total

1.  SolexaQA: At-a-glance quality assessment of Illumina second-generation sequencing data.

Authors:  Murray P Cox; Daniel A Peterson; Patrick J Biggs
Journal:  BMC Bioinformatics       Date:  2010-09-27       Impact factor: 3.169

2.  Transcriptome datasets of oil palm pathogen Ganoderma boninense.

Authors:  Irene Liza Isaac; A W C Y Walter; Mohd Faizal Abu Bakar; A S Idris; Farah Diba Abu Bakar; Izwan Bharudin; Abdul Munir Abdul Murad
Journal:  Data Brief       Date:  2018-02-15

3.  Full-length transcriptome assembly from RNA-Seq data without a reference genome.

Authors:  Manfred G Grabherr; Brian J Haas; Moran Yassour; Joshua Z Levin; Dawn A Thompson; Ido Amit; Xian Adiconis; Lin Fan; Raktima Raychowdhury; Qiandong Zeng; Zehua Chen; Evan Mauceli; Nir Hacohen; Andreas Gnirke; Nicholas Rhind; Federica di Palma; Bruce W Birren; Chad Nusbaum; Kerstin Lindblad-Toh; Nir Friedman; Aviv Regev
Journal:  Nat Biotechnol       Date:  2011-05-15       Impact factor: 54.908

  3 in total
  1 in total

Review 1.  Review Update on the Life Cycle, Plant-Microbe Interaction, Genomics, Detection and Control Strategies of the Oil Palm Pathogen Ganoderma boninense.

Authors:  Izwan Bharudin; Anis Farhan Fatimi Ab Wahab; Muhammad Asyraff Abd Samad; Ng Xin Yie; Madihah Ahmad Zairun; Farah Diba Abu Bakar; Abdul Munir Abdul Murad
Journal:  Biology (Basel)       Date:  2022-02-06
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.