Literature DB >> 26981350

RNA-seq analysis for secondary metabolite pathway gene discovery in Polygonum minus.

Kok-Keong Loke1, Reyhaneh Rahnamaie-Tajadod1, Chean-Chean Yeoh2, Hoe-Han Goh1, Zeti-Azura Mohamed-Hussein3, Normah Mohd Noor1, Zamri Zainal3, Ismanizan Ismail3.   

Abstract

Polygonum minus plant is rich in secondary metabolites, especially terpenoids and flavonoids. Present study generates transcriptome resource for P. minus to decipher its secondary metabolite biosynthesis pathways. Raw reads and the transcriptome assembly project have been deposited at GenBank under the accessions SRX313492 (root) and SRX669305 (leaf) respectively.

Entities:  

Keywords:  Hybrid assembly; Kesum; Secondary metabolites; Transcriptome

Year:  2015        PMID: 26981350      PMCID: PMC4778588          DOI: 10.1016/j.gdata.2015.11.003

Source DB:  PubMed          Journal:  Genom Data        ISSN: 2213-5960


Value of the data

Current transcriptome datasets greatly improve the previous EST study in P. minus [1]. P. minus is a non-model medicinal plant rich in terpenoids bioactive compounds [2]. Improved transcript repository with increased KEGG pathways coverage provide extensive genetic resource to integrate research between gene expression and metabolite compounds in P. minus. This data will add to the Polygonum transcriptome resource for understanding secondary metabolite production in this genus.

Data

To profile the leaf and root transcriptomes of P. minus, RNA-seq short reads were generated from the polyA-enriched cDNA libraries prepared from the total RNAs extracted from the leaf and root tissues. The short reads were filtered, processed, assembled and analyzed as described below. The raw data and assembly project have been deposited at GenBank under the accessions SRX313492 (http://www.ncbi.nlm.nih.gov/sra/SRX313492) and SRX669305 (http://www.ncbi.nlm.nih.gov/sra/SRX669305) for the root and leaf tissues respectively.

Experimental design, materials and methods

Plant materials

Sampling of P. minus root and leaf tissues were done from the experimental plot (3° 16′14.63″ N, 101° 41′ 11.32″ E) at Universiti Kebangsaan Malaysia, Bangi. Collected samples were rinsed with distilled water and frozen in liquid nitrogen before stored under − 80 °C.

Total RNA extraction, quality control, library preparation and RNA-seq

Total RNA was extracted accordingly to protocol reported by Lopez-Gomez [3]. 250 ng of poly(A) RNA was prepared from P. minus root sample using PolyATract mRNA isolation kit (Promega, USA) and used as starting material in Roche 454 GS FLX pyrosequencing platform at Malaysia Genome Institute, Malaysia. PCR emulsion was done with long fragment Lib-emPCR amplification for amplicons that are 550 bp or greater in length. The conditions used are as follows: 94 °C for 4 min, 50 cycles of 94 °C for 30 s and 60 °C for 10 min. Two biological replicates of P. minus leaf samples were sequenced using the Illumina HiSeq 2000 sequencing platform. Paired end reads with 90 bp was generated through the standard library preparation protocol implemented by BGI-Shenzhen, P. R. China.

Transcriptome de novo assembly, annotation and classification

Raw reads were filtered to remove adapter sequences with sequence pre-processing tools, Cutadapt [4] and Trimmomatic [5]. High quality Illumina raw reads with phred score ≥ 25 were kept for assembly. Root 454 reads were clipped to pseudo reads equivalent to that of leaf Illumina short reads of 90 bp with 5 bp overlap using an in-house PHP script (http://gitlab.inbiosis.ws/open-source/rnaseq-utils). These reads were then digital normalized with Khmer protocol (http://khmer.readthedocs.org/en/v1.0/). De novo hybrid assembly of these processed reads was performed with Trinity (release r20140717) [6]. Statistics of the hybrid assembly is showed in Table 1.
Table 1

Statistics of P. minus hybrid assembly.

AttributesValue
Pre-assembly
Total raw reads48,615,711
Total processed reads34,365,872



Post-assembly
Number of unigenes108,541
Number of unique transcripts188,735
N50 (bp)1009
Size range (bp)201–12,106
Protein coding sequences of unique transcripts were analyzed via Transdecoder which was embedded as a utility script in Trinity pipeline. Standard Trinotate (release r20140708) annotation pipeline (https://trinotate.github.io/) was carried out to annotate the assembled unique transcripts against Swissprot [7], Pfam [8], eggNOG [9], Gene Ontology [10], SignalP [11], and Rnammer [12]. Summary of the annotation is showed in Table 2. Annotated Gene Ontology terms from Trinotate were associated with EC2GO database [13] for KEGG Pathway mapping via KEGG Search & Color Mapper API [14] (Table 3).
Table 2

Functional annotation of P. minus unique transcripts.

Annotation/toolsNumber of unique transcripts
Total Transdecoder Peptides86,295
BLASTX-SwissProt17,307
BLASTP-SwissProt29,283
PFAM-TMHMM13,617
eggNOG29,004
Gene Ontology (GO)52,796
SignalP3715
RNAMMER9
Table 3

Statistics of EC2GO mapped enzymes and KEGG pathway mapping.

Mapping resourcesTotal mapping entities
GO2EC482 unique enzymes
KEGG search & color7037 unique KO, 376 KEGG pathways

Conflict of interest

All the authors have approved submission and there are no conflicts of interest.
Specifications
Subject areaBiology, Plant Molecular Biology
Type of dataTranscriptome sequences
Organism/Cell line/tissuePolygonum minus (leaf and root)
Sequencer typeIllumina HiSeq™ 2000 (leaf), Roche 454 GS-FLX (root)
Data formatRaw and processed
Experimental factorsControlled growth chamber (leaf), experimental plot (root)
Experimental featuresRNA-seq dataset for gene discovery in plant
Sample source locationMalaysia
Data accessibilityGenBank accessionSRX669305 (http://www.ncbi.nlm.nih.gov/sra/SRX669305) and SRX313492 (http://www.ncbi.nlm.nih.gov/sra/SRX313492)
  11 in total

1.  The Gene Ontology (GO) database and informatics resource.

Authors:  M A Harris; J Clark; A Ireland; J Lomax; M Ashburner; R Foulger; K Eilbeck; S Lewis; B Marshall; C Mungall; J Richter; G M Rubin; J A Blake; C Bult; M Dolan; H Drabkin; J T Eppig; D P Hill; L Ni; M Ringwald; R Balakrishnan; J M Cherry; K R Christie; M C Costanzo; S S Dwight; S Engel; D G Fisk; J E Hirschman; E L Hong; R S Nash; A Sethuraman; C L Theesfeld; D Botstein; K Dolinski; B Feierbach; T Berardini; S Mundodi; S Y Rhee; R Apweiler; D Barrell; E Camon; E Dimmer; V Lee; R Chisholm; P Gaudet; W Kibbe; R Kishore; E M Schwarz; P Sternberg; M Gwinn; L Hannick; J Wortman; M Berriman; V Wood; N de la Cruz; P Tonellato; P Jaiswal; T Seigfried; R White
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

2.  SignalP 4.0: discriminating signal peptides from transmembrane regions.

Authors:  Thomas Nordahl Petersen; Søren Brunak; Gunnar von Heijne; Henrik Nielsen
Journal:  Nat Methods       Date:  2011-09-29       Impact factor: 28.547

3.  De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis.

Authors:  Brian J Haas; Alexie Papanicolaou; Moran Yassour; Manfred Grabherr; Philip D Blood; Joshua Bowden; Matthew Brian Couger; David Eccles; Bo Li; Matthias Lieber; Matthew D MacManes; Michael Ott; Joshua Orvis; Nathalie Pochet; Francesco Strozzi; Nathan Weeks; Rick Westerman; Thomas William; Colin N Dewey; Robert Henschel; Richard D LeDuc; Nir Friedman; Aviv Regev
Journal:  Nat Protoc       Date:  2013-07-11       Impact factor: 13.491

4.  Flavonoid biosynthesis genes putatively identified in the aromatic plant Polygonum minus via Expressed Sequences Tag (EST) analysis.

Authors:  Nur Diyana Roslan; Jastina Mat Yusop; Syarul Nataqain Baharum; Roohaida Othman; Zeti-Azura Mohamed-Hussein; Ismanizan Ismail; Normah Mohd Noor; Zamri Zainal
Journal:  Int J Mol Sci       Date:  2012-02-28       Impact factor: 6.208

5.  The Gene Ontology Annotation (GOA) Project--Application of GO in SWISS-PROT, TrEMBL and InterPro.

Authors:  Evelyn Camon; Daniel Barrell; Catherine Brooksbank; Michele Magrane; Rolf Apweiler
Journal:  Comp Funct Genomics       Date:  2003

6.  eggNOG v4.0: nested orthology inference across 3686 organisms.

Authors:  Sean Powell; Kristoffer Forslund; Damian Szklarczyk; Kalliopi Trachana; Alexander Roth; Jaime Huerta-Cepas; Toni Gabaldón; Thomas Rattei; Chris Creevey; Michael Kuhn; Lars J Jensen; Christian von Mering; Peer Bork
Journal:  Nucleic Acids Res       Date:  2013-12-01       Impact factor: 16.971

7.  Analysis of the chemical composition of the essential oil of Polygonum minus Huds. using two-dimensional gas chromatography-time-of-flight mass spectrometry (GC-TOF MS).

Authors:  Syarul Nataqain Baharum; Hamidun Bunawan; Ma'aruf Abd Ghani; Wan Aida Wan Mustapha; Normah Mohd Noor
Journal:  Molecules       Date:  2010-10-12       Impact factor: 4.411

8.  RNAmmer: consistent and rapid annotation of ribosomal RNA genes.

Authors:  Karin Lagesen; Peter Hallin; Einar Andreas Rødland; Hans-Henrik Staerfeldt; Torbjørn Rognes; David W Ussery
Journal:  Nucleic Acids Res       Date:  2007-04-22       Impact factor: 16.971

9.  Pfam: the protein families database.

Authors:  Robert D Finn; Alex Bateman; Jody Clements; Penelope Coggill; Ruth Y Eberhardt; Sean R Eddy; Andreas Heger; Kirstie Hetherington; Liisa Holm; Jaina Mistry; Erik L L Sonnhammer; John Tate; Marco Punta
Journal:  Nucleic Acids Res       Date:  2013-11-27       Impact factor: 16.971

10.  Trimmomatic: a flexible trimmer for Illumina sequence data.

Authors:  Anthony M Bolger; Marc Lohse; Bjoern Usadel
Journal:  Bioinformatics       Date:  2014-04-01       Impact factor: 6.937

View more
  11 in total

Review 1.  Renaissance in phytomedicines: promising implications of NGS technologies.

Authors:  Sonal Sharma; Neeta Shrivastava
Journal:  Planta       Date:  2016-03-22       Impact factor: 4.116

2.  Deep sequencing and in silico analysis of small RNA library reveals novel miRNA from leaf Persicaria minor transcriptome.

Authors:  Abdul Fatah A Samad; Nazaruddin Nazaruddin; Abdul Munir Abdul Murad; Jaeyres Jani; Zamri Zainal; Ismanizan Ismail
Journal:  3 Biotech       Date:  2018-02-15       Impact factor: 2.406

3.  Enrichment of genomic resources and identification of simple sequence repeats from medicinally important Clausena excavata.

Authors:  Doo Young Bae; Sang Mi Eum; Sang Woo Lee; Jin-Hyub Paik; Soo-Yong Kim; Mihyun Park; Changyoung Lee; The Bach Tran; Van Hai Do; Jae-Yun Heo; Eun-Soo Seong; Il-Seop Kim; Ki-Young Choi; Jin Sung Hong; Rahul Vasudeo Ramekar; Sangho Choi; Jong-Kuk Na
Journal:  3 Biotech       Date:  2018-02-15       Impact factor: 2.406

4.  Transcriptome analysis of Polygonum minus reveals candidate genes involved in important secondary metabolic pathways of phenylpropanoids and flavonoids.

Authors:  Kok-Keong Loke; Reyhaneh Rahnamaie-Tajadod; Chean-Chean Yeoh; Hoe-Han Goh; Zeti-Azura Mohamed-Hussein; Zamri Zainal; Ismanizan Ismail; Normah Mohd Noor
Journal:  PeerJ       Date:  2017-02-28       Impact factor: 2.984

5.  Differential Gene Expression Analysis in Polygonum minus Leaf upon 24 h of Methyl Jasmonate Elicitation.

Authors:  Reyhaneh Rahnamaie-Tajadod; Kok-Keong Loke; Hoe-Han Goh; Normah M Noor
Journal:  Front Plant Sci       Date:  2017-02-06       Impact factor: 5.753

6.  Extensive mass spectrometry proteomics data of Persicaria minor herb upon methyl jasmonate treatment.

Authors:  Wan Mohd Aizat; Sarah Ibrahim; Reyhaneh Rahnamaie-Tajadod; Kok-Keong Loke; Hoe-Han Goh; Normah Mohd Noor
Journal:  Data Brief       Date:  2018-02-08

7.  Functional Characterisation of New Sesquiterpene Synthase from the Malaysian Herbal Plant, Polygonum Minus.

Authors:  Nor Azizun Rusdi; Hoe-Han Goh; Suriana Sabri; Ahmad Bazli Ramzi; Normah Mohd Noor; Syarul Nataqain Baharum
Journal:  Molecules       Date:  2018-06-06       Impact factor: 4.411

8.  Regulation of terpenoid biosynthesis by miRNA in Persicaria minor induced by Fusarium oxysporum.

Authors:  Abdul Fatah A Samad; Reyhaneh Rahnamaie-Tajadod; Muhammad Sajad; Jaeyres Jani; Abdul Munir Abdul Murad; Normah Mohd Noor; Ismanizan Ismail
Journal:  BMC Genomics       Date:  2019-07-16       Impact factor: 3.969

Review 9.  Proteomic Contributions to Medicinal Plant Research: From Plant Metabolism to Pharmacological Action.

Authors:  Akiko Hashiguchi; Jingkui Tian; Setsuko Komatsu
Journal:  Proteomes       Date:  2017-12-07

10.  Proteomics (SWATH-MS) informed by transcriptomics approach of tropical herb Persicaria minor leaves upon methyl jasmonate elicitation.

Authors:  Wan Mohd Aizat; Sarah Ibrahim; Reyhaneh Rahnamaie-Tajadod; Kok-Keong Loke; Hoe-Han Goh; Normah Mohd Noor
Journal:  PeerJ       Date:  2018-08-28       Impact factor: 2.984

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.