Literature DB >> 27625990

Comparative analysis of transcriptomes in aerial stems and roots of Ephedra sinica based on high-throughput mRNA sequencing.

Taketo Okada1, Hironobu Takahashi2, Yutaka Suzuki3, Sumio Sugano4, Masaaki Noji2, Hiromichi Kenmoku2, Masao Toyota2, Shigehiko Kanaya5, Nobuo Kawahara6, Yoshinori Asakawa2, Setsuko Sekita1.   

Abstract

Ephedra plants are taxonomically classified as gymnosperms, and are medicinally important as the botanical origin of crude drugs and as bioresources that contain pharmacologically active chemicals. Here we show a comparative analysis of the transcriptomes of aerial stems and roots of Ephedra sinica based on high-throughput mRNA sequencing by RNA-Seq. De novo assembly of short cDNA sequence reads generated 23,358, 13,373, and 28,579 contigs longer than 200 bases from aerial stems, roots, or both aerial stems and roots, respectively. The presumed functions encoded by these contig sequences were annotated by BLAST (blastx). Subsequently, these contigs were classified based on gene ontology slims, Enzyme Commission numbers, and the InterPro database. Furthermore, comparative gene expression analysis was performed between aerial stems and roots. These transcriptome analyses revealed differences and similarities between the transcriptomes of aerial stems and roots in E. sinica. Deep transcriptome sequencing of Ephedra should open the door to molecular biological studies based on the entire transcriptome, tissue- or organ-specific transcriptomes, or targeted genes of interest.

Entities:  

Keywords:  Comparative transcriptome analysis; EC, Enzyme Commission; Ephedra sinica; Es_R, E. sinica roots; Es_S, E. sinica aerial stems; Es_SR, E. sinica combined aerial stems and roots; GO, gene ontology; High-throughput mRNA sequencing; IPR, InterPro; RNA-Seq

Year:  2016        PMID: 27625990      PMCID: PMC5011178          DOI: 10.1016/j.gdata.2016.08.003

Source DB:  PubMed          Journal:  Genom Data        ISSN: 2213-5960


Introduction

Ephedra is one of the oldest medicinal plant genera known to mankind [1], [2], [3]. This genus belongs to the Ephedraceae family of gymnosperms, and about 50 Ephedra species are indigenous to areas in Asia, Europe, North Africa, and the Americas. The aerial stems of Ephedra plants have been utilized as a crude drug preparation known as ephedra herb (Ephedrae Herba), used mainly for treatment of bronchitis and bronchial asthma, or to induce perspiration and blood pressure elevation. Ephedra herb is particularly used in traditional Oriental medicines; it is well known as má huáng in traditional Chinese medicine (often abbreviated to TCM), and is frequently used in Japanese Kampo medicine, often as one component of a combined drug formulation. The ingredients mainly associated with the unique pharmacological and biological effects of ephedra herb are ephedrine alkaloids [e.g. (−)-ephedrine; (−)-N-methylephedrine] [1]. Since the first isolation of an ephedrine alkaloid in 1887 by Professor Nagayoshi Nagai, the founder of pharmacy in Japan, these alkaloids have been studied around the world. Ephedrine alkaloids are primarily localized in the aerial stems of several Ephedra species as their principal metabolites (e.g., E. sinica, E. intermedia, E. equisetina) [4], [5], [6]. Pharmacologically, ephedrine alkaloids are a sympathomimetic agonist at α/β-adrenergic receptors, resulting in bronchodilation (β2), enhanced cardiac rate and contractility (β1), and peripheral vasoconstriction (α1). The biosynthetic pathway of these alkaloids has been studied; the route primarily from l-phenylalanine has been chemically and biochemically summarized, although several of the reaction steps have been predicted in hypothetical pathways [7], [8], [9], [10], [11], [12], [13], [14], [15], [16]. The underground roots of Ephedra plants have also been utilized as a crude drug preparation known as ephedra root (Ephedrae Radix). Interestingly, it is well known that ephedra root has hypotensive activity, which is the opposite pharmacological effect of ephedra herb. This hypotensive property is thought to be derived from several unique metabolites contained in Ephedra roots: ephedradines A–D [17], [18], [19], [20]; ephedrannin A [21]; mahuannin A–D [22], [23], [24]; and feruloylhistamine [25], which were isolated by monitoring the hypotensive activity of Ephedra root extract. The hypotensive activities of ephedradine B and feruloylhistamine analogues have been a particular focus of pharmacological study [26], [27]. In addition, maokonine [28], ephedrannin B [29], and mahuannin E [29] have also been isolated from Ephedra roots. Although maokonine displays weak hypertensive activity, the primary pharmacological effect of ephedra root is still hypotensive. In this way, due to the importance of Ephedra plants as medicinal resources, our understanding of their biological, pharmacological, chemical, and taxonomic properties has progressed through interdisciplinary studies. The genetic and genomic features of Ephedra species, from the viewpoint of molecular biology, have been elucidated gradually. For example, during studies of ephedrine alkaloid biosynthesis, a pal gene of E. sinica involved in the primary step of the biosynthetic pathway was cloned and characterized [14]. In a further study, mRNA in aerial stems of E. sinica (Es_S) was comprehensively sequenced and the gene candidates potentially involved in biosynthesis of amphetamine-type alkaloids including ephedrines were profiled [7]. Based on this study, two aromatic aminotransferases of E. sinica were characterized [30]. In other studies, the sequences of internal transcribed spacer 1 region of the nuclear ribosomal DNA, 18S ribosomal RNA gene, and chloroplast DNA were used to describe the taxonomy of Ephedra plants (e.g., [31], [32], [33]). Furthermore, the chloroplast genomic sequences of E. foeminea was totally analyzed, and new plastid markers for phylogenetic purposes were suggested by comparison with the sequences of E. equisetina [34]. Thus, RNA and DNA sequences of Ephedra species have been effectively used for targeted studies. In this study, the comparative analysis between two transcriptomes in Es_S and roots of E. sinica (Es_R) by a high-throughput mRNA sequencing using a Genome Analyzer IIx (Illumina, CA, USA) is mainly presented. The mRNAs of Es_S and Es_R were separately sequenced and the sequence data were comprehensively analyzed using bioinformatics approaches. Our comparative transcriptome analysis of Es_S and Es_R focused in particular on molecular biological annotation of de novo sequences and quantitation of gene expression levels. Namely, this comparative study was performed to more comprehensively understand an Ephedra plant as a biological system by deep transcriptome analysis.

Materials and methods

High-throughput mRNA sequencing

The seeds of E. sinica were germinated in moistened vermiculite, sand, and small stones (5:5:1) in daylight at ca. 25 °C/10 °C in a greenhouse, improving upon the methods previously reported by our group [14]. E. sinica was grown until the plan had generated aerial stems with 4–5 joints. Es_S and Es_R were collected separately and their mRNAs were sequenced individually. Total RNAs were extracted using RNeasy Plant Mini Kit (Qiagen, Hilden, Germany) and the quality of samples for high-throughput mRNA sequencing were confirmed using the Agilent 2100 Bioanalyzer (Agilent Technologies, CA, USA) with the Agilent RNA 6000 Pico Kit (Agilent Technologies) (Fig. S1). The sequencing samples were prepared using the mRNA-Seq Sample Preparation Kit (Illumina, CA, USA) and PE adaptors were ligated onto cDNA ends. The single read-cDNA clusters on a flow cell for sequencing were generated using cBot (Illumina). Sequencing was performed using a Genome Analyzer IIx (Illumina) with the single-read method using 36-cycle sequencing. Sequencing of each Es_S and Es_R sample was performed twice. The short sequence reads obtained from these RNA-Seq experiments were registered in the DDBJ BioProject database (PRJDB3343).

Bioinformatics analysis

The RNA-Seq reads in fastq format were assembled using the Rnnotator program [35] and contig sequences were output in fasta format. Searches by blastx query with an E-value cutoff of 1E-6, GO mapping, and annotation by EC and IPR numbers were performed for Es_S, Es_R, and combined Es_S and Es_R (Es_SR) contigs continuously using the Blast2GO program [36], [37], [38]. The method for quantitation of gene expression levels in the aerial stems and roots is summarized in Fig. 1. In this expression analysis, mapping of short sequence reads in fastq format of Es_S and Es_R to Es_SR contigs was performed using TopHat [39]. The gene expression levels in the Es_S and Es_R transcriptomes were quantified by using Cufflinks software, and the abundances of expressed genes were calculated as expected fragments per kilobase of transcript per million fragments mapped (FPKM) [40]. The differential gene expression levels of the Es_SR combined transcriptomes in Es_S and Es_R were quantified using Cuffdiff in the Cufflinks program [41]. The significance of the abundance of an expressed gene was determined by the false discovery rate < 5% (q value < 0.05).
Fig. 1

Scheme for analysis of differential gene expression to compare transcriptomes of Es_S and Es_R.

Results

High-throughput sequencing of mRNA from Es_S and Es_R and de novo assembly

Total mRNA from both Es_S and Es_R was sequenced using a Genome Analyzer IIx (Illumina) for RNA-Seq [42], [43] (Table 1). Two independent technical replicates were performed for sequencing both Es_S and Es_R. A total of 6.4 × 107 reads from Es_S and 6.3 × 107 reads from Es_R were acquired. De novo assembly was performed using Rnnotator software [35] and cDNA contigs were generated from Es_S, Es_R, and Es_SR. The cDNA contigs over 200 bases that we identified included a total of 23,358 contigs from Es_S, 13,373 contigs from Es_R, and 28,579 contigs from Es_SR.
Table 1

High-throughput sequencing of mRNAs from Es_S and Es_R by RNA-Seq.

Sequenced plant's partExperimentLength of SRSaClusters (passed filter/tile)Total number of clustersbNumber of contigs (≥ 200 bases)
Es_S1st35 bases213,15625,578,72023,35828,579c
2nd324,76638,971,920
Total537,92264,550,640
Es_R1st219,99926,399,88013,373
2nd310,33937,240,680
Total530,33863,640,560

Short-read sequencing.

120 Tiles/Experiment.

Number of Es_SR contigs.

BLAST searches of contig sequences

To find amino acid sequences encoded by mRNA of E. sinica similar to those of other sequences, cDNA contigs longer than 200 bases from Es_S, Es_R, and Es_SR were analyzed using blastx program, which compares a nucleotide query sequence translated in all reading frames to a protein sequence database. A blastx search was performed against the public protein database Swiss-Prot, which consists of manually annotated and reviewed proteins and amino acid sequences in the UniProt Knowledgebase (UniProtKB; http://www.uniprot.org/uniprot/). As a result, 49.8% (11,643), 55.5% (7428), and 48.7% (13,925) of the Es_S, Es_R, and Es_SR contigs were annotated with known gene functions, respectively. The minimum E-values (Table S1) and the percentages of mean similarity (Table S2) distributions of the Es_SR contigs were summarized and displayed in a single figure (Fig. S2). Over 80% of the Es_SR contigs were concentrated in the ranges of E-values not over 8.67E-14 and similarity over 55%. The species of the sequences highest hits by blastx search are also statistically summarized (Table 2). Indeed, as one might expect, approximately half of the highest matches annotating the Es_SR contigs were genes from Arabidopsis thaliana (51.69%), and the percentages of species annotating the other contigs were < 7.16%.
Table 2

Species distribution of sequences matching Es_SR contigs by blastx search.

SpeciesCommon nameNumber of contigsPercentage (%)
Arabidopsis thalianaMouse-ear cress719851.69
Oryza sativa subsp. japonicaRice9977.16
Homo sapiensHuman5944.27
Mus musculusMouse4243.04
Dictyostelium discoideumSlime mold3912.81
Schizosaccharomyces pombe(Strain 972/ATCC 24843)Fission yeast2341.68
Nicotiana tabacumCommon tobacco1411.01
Bos taurusBovine1370.98
Zea maysMaize1340.96
Danio rerioZebrafish1320.95
Solanum lycopersicumTomato1260.9
Rattus norvegicusRat1240.89
Oryza sativa subsp. indicaRice1120.8
Solanum tuberosumPotato1040.75
Xenopus laevisAfrican clawed frog1000.72
Pinus taedaLoblolly pine950.68
Glycine maxSoybean940.68
Others278820.02

Classification of contigs by gene ontology

The contigs annotated by blastx search were then classified by gene ontology (GO) covering the three functional categories of molecular function, biological processes, and cellular component [44]. All GO terms annotating the gene products of these contigs were remapped using ‘GO slims’ [45], which are smaller and more manageable subsets of GO, to reduce the large numbers of original GO terms assigned to these contig sequences. As a result, 95.7% (11,138), 97.0% (7198), and 95.8% (13,334) of Es_S, Es_R, and Es_SR contigs, respectively, that had been annotated by blastx search could also be classified by GO terms (Table 3). Comparison of results for Es_S and Es_R contigs classified based on three GO categories are also shown in Table 3. In the transcriptome of E. sinica, there is little difference in the percentages of GO terms assigned to contigs of Es_S or Es_R.
Table 3

Distribution of Es_S, Es_R, and Es_SR contigs annotated by GO slims.

GO functional categoriesNumber of Es_SR contigs(%)Number of Es_S contigs(%)Number of Es_R contigs(%)
Cellular Component23,06010019,90710013,889100
 Cell12225.39924.987005.04
 Cell wall6752.935402.714623.33
 Cytoplasm21429.2918539.3112028.65
 Cytoskeleton4181.813671.841961.41
 Cytosol16507.1614997.5310687.69
 Endoplasmic reticulum7003.046043.034413.18
 Endosome2150.931750.881210.87
 External encapsulating structure30.0150.0310.01
 Extracellular region5042.194032.023322.39
 Extracellular space550.24530.27330.24
 Golgi apparatus5142.234502.262651.91
 Intracellular12785.5410365.26694.82
 Lysosome440.19460.23200.14
 Membrane233110.1119739.91143610.34
 Mitochondrion13245.7411925.998826.35
 Nuclear envelope1200.52990.5750.54
 Nucleolus6382.775722.873972.86
 Nucleoplasm5692.475212.622902.09
 Nucleus232210.07199710.0313219.51
 Peroxisome2270.982161.091891.36
 Plasma membrane262211.37218410.97161011.59
 Plastid20508.8918559.3212218.79
 Proteinaceous extracellular matrix100.04110.0640.03
 Ribosome3281.423201.612872.07
 Thylakoid3321.443121.571941.4
 Vacuole7673.336323.174733.41
Molecular Function20,41410017,48810012,019100
 Binding234911.51198711.36147912.31
 Carbohydrate binding1100.54900.51530.44
 Catalytic activity229911.26190310.88145812.13
 Chromatin binding870.43890.51280.23
 DNA binding5002.454382.52642.2
 Enzyme regulator activity2361.161991.141321.1
 Hydrolase activity223510.95189610.84120210
 Kinase activity11065.429325.335704.74
 Lipid binding1320.651020.58850.71
 Motor activity620.3550.3160.05
 Nuclease activity1270.621100.63570.47
 Nucleic acid binding1670.821360.78760.63
 Nucleotide binding18308.9616289.3111369.45
 Oxygen binding570.28400.23340.28
 Protein binding472523.15414623.71275922.96
 Receptor activity1990.971510.861030.86
 Receptor binding900.44730.42520.43
 RNA binding5692.795693.254413.67
 Sequence-specific DNA binding transcription factor activity4462.183782.162522.1
 Signal transducer activity1640.81410.81960.8
 Structural molecule activity3321.633191.822602.16
 Transferase activity14186.9511956.837706.41
 Translation factor activity, nucleic acid binding1170.571140.651110.92
 Translation regulator activity180.09190.11150.12
 Transporter activity10395.097784.455804.83
Biological Process41,13310034,88510023,848100
 Abscission160.04110.0380.03
 Anatomical structure morphogenesis13583.311243.227142.99
 Behavior1130.27920.26600.25
 Biological process2020.0110
 Biosynthetic process22405.4518645.3413665.73
 Carbohydrate metabolic process8372.037432.135742.41
 Catabolic process12433.0210913.138603.61
 Cell communication1960.481510.431100.46
 Cell cycle7931.936751.933831.61
 Cell death3870.943250.932230.94
 Cell differentiation10272.58342.395512.31
 Cell growth5981.454931.413301.38
 Cell-cell signaling810.2710.2570.24
 Cellular component organization24305.9121136.0612855.39
 Cellular homeostasis1810.441580.45990.42
 Cellular process501612.19431212.36288312.09
 Cellular protein modification process12843.1210703.076732.82
 Death40.0150.0160.03
 DNA metabolic process4221.033541.011840.77
 Embryo development8482.067332.14611.93
 Flower development4861.184021.152551.07
 Fruit ripening50.0130.0120.01
 Generation of precursor metabolites and energy3790.922970.853151.32
 Growth4541.13991.143051.28
 Lipid metabolic process8582.097532.164782
 Metabolic process13963.3911393.278423.53
 Multicellular organismal development20104.8916694.7811114.66
 Nucleobase-containing compound metabolic process12162.9611193.217463.13
 Photosynthesis1460.351300.37840.35
 Pollen-pistil interaction190.0580.0280.03
 Pollination2590.632170.621280.54
 Post-embryonic development12152.95104736822.86
 Protein metabolic process7101.736341.824932.07
 Regulation of gene expression, epigenetic1970.481630.47700.29
 Reproduction11582.8210272.946392.68
 Response to abiotic stimulus16964.121394410404.36
 Response to biotic stimulus10122.468532.456022.52
 Response to endogenous stimulus12663.0810202.927092.97
 Response to external stimulus4191.023591.032431.02
 Response to extracellular stimulus2260.551930.551310.55
 Response to stress24886.0520285.8114496.08
 Secondary metabolic process5541.354241.223291.38
 Signal transduction13583.311683.357443.12
 Translation5281.285351.534111.72
 Transport18774.5615744.5111534.83
 Tropism1250.31090.31510.21

Classification of proteins and domains encoded by contigs based on enzyme commission (EC) numbers and the InterPro database

EC numbers comprehensively categorize catalytic enzymes based on the six main classes (EC 1–6) of similar enzymatic reactions [46]. In the present study, the amino acid sequences encoded by the Es_S, Es_R, and Es_SR contigs were annotated with EC numbers. As a result, EC numbers were assigned to 14.7% (3444), 18.5% (2470), and 14.2% (4053) of Es_S, Es_R, and Es_SR contigs, respectively. The protein domains encoded by Es_S, Es_R, and Es_SR contigs were also classified using information from the InterPro (IPR) database (The European Molecular Biology Laboratory-European Bioinformatics Institute) organized by the several institutions that make up the consortium [47]. Protein domain predictions were performed using InterProScan [48]. Consequently, 77.0% (17,984), 81.0% (10,830) and 76.0% (21,732) of Es_S, Es_R, and Es_SR contigs, respectively, were characterized by IPR database. Specifically, 57.3% (10,308), 61.2% (6625), and 57.7% (12,533) of the Es_S, Es_R, and Es_SR contigs, respectively, classified by IPR database were annotated with IPR numbers.

Comparative expression analysis of transcriptomes in Es_S and Es_R based on gene functions

Differential gene expression analysis was performed using sequences of genes expressed in Es_S and Es_R to compare these transcriptomes (Fig. 1). The sequence reads from Es_S and Es_R were mapped onto Es_SR contigs using the TopHat program [39]. Subsequently, gene expression levels of Es_S and Es_R were quantified using the Cufflinks program [40], and the differential levels of gene expression in Es_S and Es_R were quantified using Cuffdiff in the Cufflinks program [41]. We found that 4.1% (1170) and 3.8% (1085) of the 28,579 contigs from Es_SR were significantly expressed in Es_S and Es_R, respectively (Fig. 2). To characterize these significantly expressed genes, the enzymatic functions of the encoded proteins were classified based using EC (Fig. 3) and IPR (Table 4) numbers annotated to contigs.
Fig. 2

Percentage of significantly expressed genes in Es_S and Es_R.

Fig. 3

Comparison of EC numbers annotated with amino acid sequences encoded by differentially expressed genes in Es_S and Es_R.

A, Summary of comparison results; B–F, distribution of EC numbers (EC1, 3, and 5) according to Es_S or Es_R.

Table 4

IPR numbers assigned to Es_SR contigs of genes significantly expressed in Es_S and Es_R.

Plant organRankingIPR numberNumber of contigsAnnotation
Es_S specific1IPR0017637Rhodanese-like domain (D)
IPR005150Cellulose synthase (F)
IPR008030NmrA-like domain (D)
IPR013026Tetratricopeptide repeat-containing domain (D)
5IPR0136016FAE1/Type III polyketide synthase-like protein (D)
IPR016038Thiolase-like, subgroup (D)
IPR016039Thiolase-like (D)
IPR023329Chlorophyll a/b binding protein domain (D)
9IPR0013055Heat shock protein DnaJ, cysteine-rich domain (D)
IPR002937Amine oxidase (D)
IPR005746Thioredoxin (F)
IPR013766Thioredoxin domain (D)
IPR022796Chlorophyll A-B binding protein (F)
Es_R specific1IPR00146113Aspartic peptidase (F)
IPR021109Aspartic peptidase domain (D)
3IPR0041587Protein of unknown function DUF247, plant (F)
IPR010987Glutathione S-transferase, C-terminal-like (D)
5IPR0014806Bulb-type lectin domain (D)
IPR004045Glutathione S-transferase, N-terminal (D)
IPR004046Glutathione S-transferase, C-terminal (D)
8IPR0017505NADH:ubiquinone/plastoquinone oxidoreductase (D)
IPR003445Cation transporter (F)
IPR006094FAD linked oxidase, N-terminal (D)
IPR016166FAD-binding, type 2 (D)
Es_S and Es_R1IPR00112850Cytochrome P450 (F)
2IPR00221327UDP-glucuronosyl/UDP-glucosyltransferase (F)
3IPR00240126Cytochrome P450, E-class, group I (F)
IPR016040NAD(P)-binding domain (D)
5IPR01100919Protein kinase-like domain (D)
6IPR02321318Chloramphenicol acetyltransferase-like domain (D)
7IPR00071917Protein kinase domain (D)
IPR003480Transferase (F)
IPR017972Cytochrome P450, conserved site (S)
10IPR01785316Glycoside hydrolase, superfamily (D)

D, Domain; F, Family; S, Conserved site. (It should be noted that IPR numbers are revised occasionally upon InterPro database updates.)

The numbers of EC numbers annotated to differentially expressed genes from Es_S and Es_R were roughly the same (219 and 229, respectively) (Fig. 3A). Genes (69 contigs) encoding EC 3 (hydrolases) were highly expressed in Es_S compared to Es_R (38 contigs) (a 1.8-fold difference) (Fig. 3A–C). In particular, genes encoding the EC 3.1.3.x enzymes (phosphoric monoester hydrolases) were characteristically expressed in Es_S. For example, for x = 2, the enzyme is acid phosphatase; if x = 4, the enzyme is phosphatidate phosphatase; if x = 11, the enzyme is fructose-bisphosphatase; if x = 37, the enzyme is sedoheptulose-bisphosphatase; and if x = 46, the enzyme is fructose-2,6-bisphosphate 2-phosphatase. EC 3.1.3.11, EC 3.1.3.37 and EC 3.1.3.46 are involved in saccharide metabolism, and EC 3.1.3.11 and EC 3.1.3.37 are related to the metabolic pathway for carbon fixation by photosynthesis in aerial parts. Moreover, the genes encoding EC 5 (isomerases) (9 contigs) were highly expressed in Es_S, including: EC 5.2.1.8, peptidylprolyl isomerase; EC 5.3.3.2, isopentenyl-diphosphate Δ-isomerase; EC 5.4.99.7, lanosterol synthase; and EC 5.4.99.8, cycloartenol synthase (Fig. 3A, D). On the other hand, genes encoding EC 1 (oxidoreductases) enzymes (108 contigs) were highly expressed in Es_R compared to Es_S (58 contigs) (a 1.9-fold difference) (Fig. 3A, E, F). The number of contigs encoding EC 1.11.1.7 (peroxidase) was particularly elevated in Es_R (4.4-fold) compared to Es_S. IPR functional terms, which are coordinated with IPR numbers, were also assigned to Es_SR contigs, and 574 and 475 terms were annotated to the contigs of genes significantly expressed in Es_S and Es_R, respectively. Additionally, 426 and 216 terms were specifically annotated to Es_S and Es_R, respectively, and 180 terms were annotated to both Es_S and Es_R. The top-10 ranking of IPR functional terms according to the number of annotated contigs is listed in Table 4.

Discussion

High-throughput mRNA sequencing by RNA-Seq technique has enabled deep transcriptome analysis of many kinds of organisms. In this study, transcripts from E. sinica were comprehensively sequenced and the transcriptomes of aerial stems and roots were comparatively analyzed. Es_SR contigs longer than 200 bases totaled about 28,000, and were generated by de novo assembly of short sequence reads from both Es_S and Es_R (Table 1). Comparing contigs from both types of plant parts, there were 1.7-fold more Es_S contigs than Es_R contigs (23,358, and 13,373 contigs, respectively). This result suggests more active metabolism in aerial stems than in roots (e.g., photosynthesis). In a blastx search against the Swiss-Prot database, ca. 50% of contigs were annotated by various encoded protein functions. BLAST results were statistically analyzed (Table 2, S1, S2, and Fig. S2) and most of these contigs could be classified using GO slims (Table 3). Interestingly, the percentages of assigned GO slims were similar between Es_S and Es_R contigs. This result suggested that although gene expression in aerial stems was relatively more active than that in roots, the overall diversity of functions expressed in each organ was very similar in a view of the broader functional categorization achieved using GO. Actually, only about 8% (Fig. 2) of genes exhibited a significant difference in expression level between Es_S and Es_R. Thus, the metabolic diversity and differences between these plant parts might be controlled by the expression of relatively few genes specific to each plant organ. In the present study, differences in categories of expressed genes could be considered in detail using bioinformatics analysis of sequence reads (Fig. 1). The encoded protein functions of genes expressed in Es_S and Es_R were assigned to contigs according to EC and IPR numbers (Fig. 3, Table 4). For example, contigs encoding chlorophyll a/b binding proteins (IPR023329 and IPR022796) were specifically identified from among Es_S contigs (Table 4). The chlorophyll a/b binding protein is part of the light-harvesting complex, a light receptor that captures and delivers excitation energy to photosystems I and II via chlorophylls a/b [49], [50]. This result was closely related to the result from comparing Es_S and Es_R using EC numbers, which specifically identified EC3.1.3.11 and EC3.1.3.37, which are involved in photosynthesis, in Es_S (Fig. 3B). Interestingly, the contigs encoding thiolase-like domains (IPR016038 and IPR 016039) were identified in Es_S contigs (Table 4). In the biosynthetic pathway of ephedrine alkaloids, a thiolase is presumed to catalyze the biosynthesis of benzoyl-CoA from 3-oxo-3-phenylpropionyl-CoA in a β-oxidative CoA-dependent route [7], [12], [14]. This assumption about the biosynthetic route agrees with the accumulation of ephedrine alkaloids in aerial stems of Ephedra plants.

Conclusions

In conclusion, the transcriptome of an Ephedra plant is analyzed using deep RNA-Seq and bioinformatics, focusing on a comparative analysis of gene expression in aerial stems and roots. The results of the present study will form a molecular biological basis for other research, such as evaluating various qualities of medicinal resources, distinguishing species and cultivars, and biosynthesizing specific accumulated metabolites. It is hoped that this study and further research will contribute to the useful and sustainable application and efficient cultivation of Ephedra plants as medicinal bioresources, and also promote their survival in their natural settings.

Transparency document

Transparency document.
  33 in total

Review 1.  Redox regulation of thylakoid protein phosphorylation.

Authors:  Eva-Mari Aro; Itzhak Ohad
Journal:  Antioxid Redox Signal       Date:  2003-02       Impact factor: 8.401

2.  Characterization of aromatic aminotransferases from Ephedra sinica Stapf.

Authors:  Korey Kilpatrick; Agnieszka Pajak; Jillian M Hagel; Mark W Sumarah; Efraim Lewinsohn; Peter J Facchini; Frédéric Marsolais
Journal:  Amino Acids       Date:  2016-02-01       Impact factor: 3.520

3.  Genetic diversity of Ephedra plants in mongolia inferred from internal transcribed spacer sequence of nuclear ribosomal DNA.

Authors:  Yuki Kitani; Shu Zhu; Javzan Batkhuu; Chinbat Sanchir; Katsuko Komatsu
Journal:  Biol Pharm Bull       Date:  2011       Impact factor: 2.233

4.  Hypotensive actions of ephedradines, macrocyclic spermine alkaloids of Ephedra roots.

Authors:  H Hikino; K Ogata; C Konno; S Sato
Journal:  Planta Med       Date:  1983-08       Impact factor: 3.352

5.  Dimeric proanthocyanidins from the roots of Ephedra sinica.

Authors:  Huaming Tao; Lishu Wang; Zhanchen Cui; Daqing Zhao; Yonghong Liu
Journal:  Planta Med       Date:  2008-10-30       Impact factor: 3.352

Review 6.  RNA-Seq: a revolutionary tool for transcriptomics.

Authors:  Zhong Wang; Mark Gerstein; Michael Snyder
Journal:  Nat Rev Genet       Date:  2009-01       Impact factor: 53.242

7.  Rnnotator: an automated de novo transcriptome assembly pipeline from stranded RNA-Seq reads.

Authors:  Jeffrey Martin; Vincent M Bruno; Zhide Fang; Xiandong Meng; Matthew Blow; Tao Zhang; Gavin Sherlock; Michael Snyder; Zhong Wang
Journal:  BMC Genomics       Date:  2010-11-24       Impact factor: 3.969

8.  InterProScan: protein domains identifier.

Authors:  E Quevillon; V Silventoinen; S Pillai; N Harte; N Mulder; R Apweiler; R Lopez
Journal:  Nucleic Acids Res       Date:  2005-07-01       Impact factor: 16.971

9.  Transcriptome profiling of khat (Catha edulis) and Ephedra sinica reveals gene candidates potentially involved in amphetamine-type alkaloid biosynthesis.

Authors:  Ryan A Groves; Jillian M Hagel; Ye Zhang; Korey Kilpatrick; Asaf Levy; Frédéric Marsolais; Efraim Lewinsohn; Christoph W Sensen; Peter J Facchini
Journal:  PLoS One       Date:  2015-03-25       Impact factor: 3.240

10.  High-throughput functional annotation and data mining with the Blast2GO suite.

Authors:  Stefan Götz; Juan Miguel García-Gómez; Javier Terol; Tim D Williams; Shivashankar H Nagaraj; María José Nueda; Montserrat Robles; Manuel Talón; Joaquín Dopazo; Ana Conesa
Journal:  Nucleic Acids Res       Date:  2008-04-29       Impact factor: 16.971

View more
  3 in total

1.  De Novo RNA Sequencing and Transcriptome Analysis of Monascus purpureus and Analysis of Key Genes Involved in Monacolin K Biosynthesis.

Authors:  Chan Zhang; Jian Liang; Le Yang; Baoguo Sun; Chengtao Wang
Journal:  PLoS One       Date:  2017-01-23       Impact factor: 3.240

Review 2.  Proteomic Contributions to Medicinal Plant Research: From Plant Metabolism to Pharmacological Action.

Authors:  Akiko Hashiguchi; Jingkui Tian; Setsuko Komatsu
Journal:  Proteomes       Date:  2017-12-07

Review 3.  Researches on Transcriptome Sequencing in the Study of Traditional Chinese Medicine.

Authors:  Jie Xin; Rong-Chao Zhang; Lei Wang; Yong-Qing Zhang
Journal:  Evid Based Complement Alternat Med       Date:  2017-08-16       Impact factor: 2.629

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.