Literature DB >> 29743016

Transcriptome sequencing and analysis during seed growth and development in Euryale ferox Salisb.

Xian Liu1, Zhen He1, Yulai Yin2, Xu Xu1, Weiwen Wu1, Liangjun Li3.   

Abstract

BACKGROUND: Euryale ferox Salisb., an annual aquatic plant, is the only species in the genus Euryale in the Nymphaeaceae. Seeds of E. ferox are a nutritious food and also used in traditional Chinese medicine (Qian Shi in Mandarin). The molecular events that occurred during seed development in E. ferox have not yet been characterized. In this study, we performed transcriptomic analysis of four developmental stages (T1, T2, T3, and T4) in E. ferox seeds with three biological replicates per developmental stage to understand the physiological and biochemical processes during E. ferox seeds development.
RESULTS: 313,844,425 clean reads were assembled into 160,107 transcripts and 85,006 unigenes with N50 lengths of 2052 bp and 1399 bp, respectively. The unigenes were annotated using five public databases (NR, COG, Swiss-Prot, KEGG, and GO). In the KEGG database, all of the unigenes were assigned to 127 pathways, of which phenylpropanoid biosynthesis was associated with the synthesis of secondary metabolites during E. ferox seed growth and development. Phenylalanine ammonia-lyase (PAL) as the first key enzyme catalyzed the conversion of phenylalanine to trans-cinnamic acid, then was related to the synthesis of flavonoids, lignins and alkaloid. The expression of PAL1 reached its peak at T3 stage, followed by a slight decrease at T4 stage. Cytochrome P450 (P450), encoded by CYP84A1 (which also called ferulate-5-hydroxylase (F5H) in Arabidopsis), was mainly involved in the biosynthesis of lignins.
CONCLUSIONS: Our study provides a transcriptomic analysis to better understand the morphological changes and the accumulation of medicinal components during E. ferox seed development. The increasing expression of PAL and P450 encoded genes in phenylpropanoid biosynthesis may promote the maturation of E. ferox seed including size, color, hardness and accumulation of medicinal components.

Entities:  

Keywords:  Cytochrome P450; Euryale ferox Salisb.; Phenylalanine ammonia-lyase; Phenylpropanoid biosynthesis; Seed development; Transcriptome

Mesh:

Substances:

Year:  2018        PMID: 29743016      PMCID: PMC5944168          DOI: 10.1186/s12864-018-4707-9

Source DB:  PubMed          Journal:  BMC Genomics        ISSN: 1471-2164            Impact factor:   3.969


Background

Euryale ferox Salisb., an annual aquatic herbaceous plant, is the only species in the genus Euryale in the botanical family Nymphaeaceae [1, 2]. E. ferox is known commonly as fox nut or gorgon nut [3], and widely distributed in tropical and subtropical regions of east and southeast Asia [4]. E. ferox is usually grown in lakes, ponds, reservoirs, and other shallow bodies of water [4, 5]. Unlike most other aquatic crops, the floating leaves of E. ferox are comparatively large, reaching up to 1.5 m in diameter [5] (Fig. 1). The seeds of E. ferox are rich in starch, proteins, vitamins, minerals and many other nutritional ingredients [6, 7]. The “Compendium of Materia Medica”, a book of Chinese traditional medicine written during the Ming dynasty (1368–1644), describes the many medicinal uses of E. ferox, which included its use as a kidney tonic, for nourishing the spleen, as a treatment for diarrhea, as well as for eliminating dampness [8]. Seeds of E. ferox are also a significant component of contemporary Chinese herbal medicine and are used to treat a variety of diseases, such as kidney failure, chronic diarrhea, excessive leucorrhea, and hypofunction of the spleen [9, 10]. At present, studies of E. ferox are focused on seed dormancy and germination traits [4], genetic diversity [11], and the antioxidant activity in the seeds [3, 10, 12, 13]. However, there are few reports describing the stages of seed development in E. ferox in the literature. It is particularly important for seeds to explore the morphological changes and the accumulation of medicinal components.
Fig. 1

The leaves and flower of Euryale ferox Salisb. in shallow body of water

The leaves and flower of Euryale ferox Salisb. in shallow body of water Phenylpropanoid biosynthesis involved in numerous important biological processes, such as detoxification of xenobiotics and synthesis of secondary metabolites [14, 15], plays an important role in development of many plant. Phenylalanine ammonia-lyase (PAL) [14] and cytochrome P450 (P450) [16] both encoded by a multi-gene family with different functions [17-22] are key enzymes in the proceed of phenylpropanoid biosynthesis. The expression of PALs possesses tissue-specific in Arabidopsis thaliana [23] and Salvia miltiorrhiza [14], among which the highest expression of PAL1 and PAL2 in root and mature flower of Arabidopsis thaliana [23], PAL1, PAL2 and PAL3 with highest expression in root, stem and leaf of Salvia miltiorrhiza, respectively [14]. The expression of FaPAL6 increased during the strawberry fruit riping along with the anthocyanin accumulation affecting fruit color [24]. For P450 genes (cytochrome P450, CYP), the expression of CaCYP94B1 was up-regulated, whereas the expressions of CaCYP72A15, CaCYP701A3 and CaCYP71A25 were down-regulated in the perisperm at three developmental stages of Coffea Arabica [25]. In addition, TaCYP78A3 was specifically expressed in reproductive organs and influenced the size of wheat seed [26]. E. ferox seed, as an important medicinal component, was closely related to many secondary metabolites, such as flavonoids, lignins and alkaloid [14]. Phenylpropanoid biosynthesis involved in the synthesis of secondary metabolites was selected to analyse the relationship with E. ferox seed development, and PAL and P450 are the vital differential genes in this pathway that we found. In this study, we performed a transcriptomic analysis of E. ferox seed at four developmental stages (10, 20, 30, and 40 days after anthesis) on the basis of morphological characteristics [27, 28]. Quantitative real-time PCR on the basis of six genes associated with nutritional and functional substances of E. ferox seed validated the transcriptome data. Two unigenes –PAL and P450 encoded genes associated with phenylpropanoid biosynthesis during E. ferox seed development were analysed to investigating the relationship with morphological changes and the accumulation of medicinal components.

Methods

Plant material and sample collection

Euryale ferox plants were collected from Suzhou of Jiangsu Province, China. Specifically, seeds of E. ferox begin to swell at 10 days after anthesis, and grow rapidly from 20 to 30 days after anthesis, while reach physiological maturity at 40 days after anthesis (Fig. 2). So the seed samples used for cDNA library construction were collected on 10, 20, 30, and 40 days after anthesis (stages T1, T2, T3, and T4, respectively), with three biological replicates per developmental stage. All samples were frozen in liquid nitrogen immediately upon collection and stored at − 80 °C.
Fig. 2

The fruit, the longitudinal section of fruit, seed and kernel of Euryale ferox Salisb. during four development stages (T1, T2, T3 and T4). a Fruit; (b) The longitudinal section of fruit; (c) Seed; (d) Kernel

The fruit, the longitudinal section of fruit, seed and kernel of Euryale ferox Salisb. during four development stages (T1, T2, T3 and T4). a Fruit; (b) The longitudinal section of fruit; (c) Seed; (d) Kernel

RNA preparation and cDNA library construction

Total RNA was extracted from the 12 samples separately. RNA contamination and degradation was detected by electrophoresis on 1% agarose gels. RNA purity was measured with a NanoPhotometer® spectrophotometer (IMPLEN, CA, USA), and RNA concentration was determined with a Qubit®2.0 Flurometer and the Qubit® RNA Assay Kit (Life Technologies, CA, USA). RNA integrity was evaluated using the Nano 6000 Assay Kit on the Agilent Bioanalyzer 2100 system (Agilent Technologies, CA, USA). To construct the cDNA libraries, we used 3 μg total RNA per sample as the input material. The NEBNext® Ultra™ RNA Library Prep Kit for Illumina® (NEB, USA) was used to generate a series of corresponding libraries, and index codes were applied to every sample. Briefly, mRNA was purified from total RNA using oligo dT magnetic beads. RNA fragmentation was carried out at high temperature in the presence of divalent cations in NEBNext First Strand Synthesis Reaction Buffer (5X). First-strand cDNA was synthesized using random hexamer primers and M-MLV Reverse Transcriptase (RNase H−). Second strand was synthesized using DNA Polymerase I and RNase H. The exonuclease/polymerase activities were used to convert the remaining single-stranded overhangs to blunt ends. After adenylation of the 3′ ends of the DNA fragments, the NEBNext Adaptor containing a hairpin loop structure was added. cDNA fragments between 150 and 200 bp were purified using the AMPure XP system (Beckman Coulter, Beverly, USA), and the cDNA library was non-strand specific. Afterwards, the 3 μl USER Enzyme (NEB, USA) was made use of picking size, adaptor-ligated cDNA at 37 °C for 15 min followed by 5 min at 95 °C before PCR. PCR was performed using universal PCR primers, Index (X) Primer and Phusion High-Fidelity DNA polymerase. The Agilent Bioanalyzer 2100 system was used to purify the PCR products and to evaluate the quality of the cDNA libraries. Paired-end DNA sequencing data for the 12 libraries was generated on an Illumina Hiseq 2000 instrument.

De novo cDNA assembly and functional annotation

To obtain clean reads, sequencing adaptors and low-quality reads were removed from each library. For increasing the amount of date to reduce the unigene redundancy, and ensuring the low expression genes assembled more completely, the remaining clean reads were adopted the method of the merged assembly of Trinity software [29]. Specifically, the clean reads in each library were first assembled into contigs, then the contigs were clustered to form unigenes. The clean reads were mapped back to the homologous contigs based on the paired-end reads, and the different contigs from their transcripts and the distance were also distinguished by the methods of the De Bruijn and sequencing information. The unigene sequences were annotated using the following public databases: NCBI non-redundant protein sequences (NR) [30], Clusters of Orthologous Groups (COG) [31], Swiss-Prot [32], Kyoto Encyclopedia of Genes and Genomes (KEGG) [33], and Gene Ontology (GO) [34]. The unigene sequences were aligned using BLASTX with an E-value of < 10− 5. If results from the different databases were mutually contradictory, a priority order of NR, Swiss-Prot, KEGG, and COG was followed to decide the annotation. Nevertheless, if a unigene did not align to any of the databases, ESTScan software was chosen to infer the direction of transcription [35].

Differential gene expression analysis

Each of the 12 E. ferox seed cDNA libraries was aligned separately to the transcriptome assemblies using Bowtie [36]. The counting of alignments was estimated using the RSEM package [37]. The RPKM method was then used to calculate the numbers of differentially expressed genes (DEGs) from pairwise comparisons of the four seed developmental stages [38]. From the biological replicates in this study, DESeq [39] implemented in R package was used to analyze the differential expression between two groups. P values were adjusted using the Benjamini-Hochberg approach to control the false discovery rate (FDR). Genes with an adjusted P-value < 0.05 found by DESeq were usually identified as DEGs. However, genes with FDR < 0.01 and FC (Fold Change) ≥2 served as standards in the screening, and FC represents the specific value between two samples. In addition, the phylogenetic relationships of P450 and beta-glucosidase genes were assessed using the neighbor-joining (NJ) method in MEGA version 7.0 [40].

Quantitative real-time PCR analysis

Total RNA was extracted in triplicate from the four developmental stages of E. ferox seeds (total of 12 samples) using Trizol® Reagent (Invitrogen, USA), and then reversed transcribed into cDNA using a PrimeScript® RT reagent Kit (Takara, Dalian, China). Six differentially expressed genes associated with nutritional and functional substances in E. ferox seed were selected to validate the transcriptome data using quantitative real-time PCR. The gene-specific primers for the six unigene sequences were designed with Primer Premier 5 software [41], and the β-Actin gene was used as an internal gene expression control: the gene was amplified with forward primer 5′-GACTCTGGTGATGGTGT-3′ and reverse primer 5′-CACTTCATGATGGAGTTGT-3′. The amplifications were performed in 20 μl reactions consisting of 10 μl 2× SYBR® Premix EX Taqll (Tli RNaseH Plus) (TaKaRa, Dalian, China), 2.0 μl of a mixture of the forward and reverse primers, 2.0 μl cDNA template, and 6.0 μl sterile dH2O. The amplifications were performed on a CFX-96 real-time PCR system (Bio-Rad) using the following quantitative real-time PCR program: 95 °C for 3 min, followed by 39 cycles of 95 °C for 30 s and Tm for 30 s. Each amplification reaction was performed in triplicate.

Results

Collection of Euryale ferox seed

Transcriptome sequencing was used to analyze the complex physiological and biochemical processes that occurred during four developmental stages of Euryale ferox seeds. The first stage was designated as 10 days after anthesis with obvious small size fruit and seeds. The other three stages were 20, 30, and 40 days after anthesis. The fruit and seeds gradually became larger along with the color deepening and mechanical hardening of seeds (Fig. 2). Because three biological replicates were carried out for each developmental stage, a total of 12 samples were used in the analysis of seed development. Specific information of 12 samples is shown in Table 1.
Table 1

Summary of sample information and transcriptome sequencing output statistics for the E. ferox seed RNA-seq libraries

SampleDate of CollectionTotal Clean ReadsTotal Clean Nucleotides (nt)GC Percent (%)N Percent (%)Q30 Percent (%)
1–120, August, 201622,421,4286,651,313,47249.27091.98
1–220, August, 201628,358,3068,449,902,68849.06091.69
1–320, August, 201621,297,8396,334,599,48649.00091.71
2–130, August, 201624,816,3217,393,759,06248.80091.18
2–230, August, 201625,714,2397,656,559,22249.68092.62
2–330, August, 201623,400,0936,976,232,18249.94091.10
3–109,September, 201622,236,7476,619,274,59450.96091.87
3–209,September, 201624,450,7927,274,216,70050.79092.11
3–309,September, 201627,901,4208,307,711,47449.20092.09
4–119,September, 201628,587,6018,504,756,01050.28092.56
4–219,September, 201629,452,3928,769,841,38250.83092.16
4–319,September, 201635,207,24710,466,146,78050.73092.42
Summary of sample information and transcriptome sequencing output statistics for the E. ferox seed RNA-seq libraries

RNA-seq and de novo transcriptome assembly

In order to obtain comprehensive and effective transcriptomic information for E. ferox seed growth and development, 12 cDNA libraries were constructed and generated paired-end sequence reads using the Illumina Hiseq 2000 platform. Detailed information about the transcriptome sequencing is given in Table 1. A total of 313,844,425 clean reads originated from the 12 E. ferox cDNA libraries were obtained following removal of adaptors and low-quality reads from the libraries (the transcriptome data is being submitted to SRA database). The percent G + C, percentage of Ns (unknown bases), and fraction of bases with quality scores of Q30 for the 12 libraries averaged 48.8%, 0%, and 91.1%, respectively. These numbers showed that the data was of high quality. Trinity software was then used to assemble the clean reads into 160,107 transcripts with N50 length of 2052 bp and 85,006 unigenes with N50 length of 1399 bp. The relative lengths of the assembled sequences is one standard by which a transcriptome assembly can be assessed. For the present study, the summary statistics of the assembled transcripts and unigenes are shown in Table 2. Of these, assembled sequences < 200 bp were not taken into account. In the final assemblies there were 60,836 transcripts ranging from 200 to 500 bp in length, which accounted for 38% of the total. There were 32,751 (20.46%) and 66,519 (41.55%) transcripts from 500 to 1000 bp and > 1000 bp in length, respectively. Compared with the assembled transcripts, the assembled unigenes in the above three length ranges accounted for 59.91%, 19.16%, and 20.94% of the total (Table 2). In addition, the assembled sequences were also annotated by BLASTX against Arabidopsis thaliana TAIR10 (Additional file 3: Table S1) and rice 7.0 proteomes (Additional file 4: Table S2).
Table 2

Summary statistics for the assembled transcripts and unigenes

Length rangeTranscriptsTranscripts percent (%)UnigenesUnigenes percent (%)
200–30034,00621.24%30,87136.32%
300–50026,83016.76%20,05123.59%
500–100032,75120.46%16,28719.16%
1000–200035,97522.47%10,15411.95%
2000+30,54419.08%76428.99%
Total number160,10785,006
Total length193,603,84666,000,704
N50 length20521399
Mean length1209.22776.42
Summary statistics for the assembled transcripts and unigenes

Annotation and functional classification

The assembled sequences were used as queries in BLAST searches (E ≤ 10− 5) against the NR, COG, Swiss-Prot, KEGG, and GO databases, and the results are shown in Fig. 3. The majority of the unigenes (27,546; 32.4%) were annotated from the NR database. In contrast, only 11,304 annotated unigenes were matched to the COG database. As shown in Fig. 3, a total of 4850 unigenes were annotated with all five databases.
Fig. 3

Venn diagram of Euryale ferox Salisb. seed unigenes. Functional annotation was done using five public databases; NCBI non-redundant protein sequences (NR), Swiss-Prot, COG, Gene Ontology (GO), and Kyoto Encyclopedia of Genes and Genomes (KEGG). The shared unigenes are indicated in the intersections

Venn diagram of Euryale ferox Salisb. seed unigenes. Functional annotation was done using five public databases; NCBI non-redundant protein sequences (NR), Swiss-Prot, COG, Gene Ontology (GO), and Kyoto Encyclopedia of Genes and Genomes (KEGG). The shared unigenes are indicated in the intersections To further utilize the transcriptome data, the COG, GO, and KEGG databases were used to identify the molecular processes that occur during the growth and development of E. ferox seed. The annotated unigenes were initially classified into 25 COG categories with the largest group being “General function prediction only” (2644; 23.39%) (Fig. 4). Genes in “Carbohydrate transport and metabolism” (870, 7.70%) and “Secondary metabolites biosynthesis, transport and catabolism” (417, 3.69%) are involved in biosynthesis of the nutritional and functional substances which benefit human health. The Gene Ontology (GO), which is an internationally standardized classification system, was used to assign 15,827 unigenes from the four seed developmental stages to the three principal GO domains; “cellular component”, “molecular function”, and “biological process” (Fig. 5). In the “cellular component” category, “cell” and “cell part” were the most abundant terms. For the category of “molecular function”, “catalytic activity” and “binding” were the most prominent. “Metabolic process” and “cellular process” were the most abundant terms in the “biological process” category.
Fig. 4

Histogram of clusters of orthologous groups (COG) classification. A total of 11,304 unigenes with lengths more than 300 bp were divided into 25 COG categories

Fig. 5

Functional annotation of assembled sequences based on gene ontology (GO) categorization. 15,827 unigenes were grouped into the three main GO domains: “cellular component”, “molecular function”, and “biological process”

Histogram of clusters of orthologous groups (COG) classification. A total of 11,304 unigenes with lengths more than 300 bp were divided into 25 COG categories Functional annotation of assembled sequences based on gene ontology (GO) categorization. 15,827 unigenes were grouped into the three main GO domains: “cellular component”, “molecular function”, and “biological process”

Differential gene expression and pathway enrichment analysis

Transcriptome sequencing is a powerful and effective approach that is often used to analyze gene expression at the whole genome level without a reference genome sequence, and this method can also be taken to identify a variety of metabolic processes [42-44]. In this study, a total of 4897 DEGs were confirmed among the four developmental stages of E. ferox seed. Specific information describing the up- and down-regulated DEGs identified in pairwise comparisons between the four seed developmental stages is shown in Additional file 5: Table S3. The KEGG database is used to systematically map unigenes to specific metabolic pathways and to describe the function of the unigenes. In the present study, all of the unigenes were assigned to 127 pathways which were divided into five groups: cellular processes, environmental information processing, genetic information processing, metabolism, and organismal systems (Additional file 1: Figure S1). Of the five groups, the metabolism group contained the largest number of pathways. The most abundant pathway was “ribosome” which had 539 unigenes, followed by “carbon metabolism” with 515 genes, and “biosynthesis of amino acids” with 470 genes.

Phenylpropanoid pathway

In the phenylpropanoid pathway of E. ferox seed development, PAL is the first key enzyme that catalyzed the conversion of phenylalanine to trans-cinnamic acid, then experienced a series of complex channels to finally synthesizes some important secondary metabolites including flavonoids, lignins and alkaloid (Fig. 6). PAL in E. ferox seed was encoded by c55946.graph_c2, which shared highest identities (79%) with PAL1 gene of Myrtaceae in GenBank. During E. ferox seed growth and development, the expression of c55946.graph_c2 reached its peak at T3 stage, followed by a slight decrease at T4 stage. Another key enzyme is P450 encoded by c17722.graph_c0, which shared highest identities (70%) with CYP84A1 gene of Ananas comosus in GenBank. And the CYP84A1 gene was also called ferulate-5-hydroxylase (F5H) in Arabidopsis [45]. In the phenylpropanoid pathway, F5H mainly involved in the biosynthesis of 5-Hydroxy-guaiacyl lignin and syringyl lignin (Fig. 6). Phylogenetic analysis based on P450 gene CYP84A1 of angiosperms showed that E. ferox clustered as the boundary of monocots and eudicots (Fig. 7). The P450 gene sequences of Zea mays, Sorghum bicolor, Ananas comosus, Phoenix dactylifera, Elaeis guineensis, Setaria italica, Oryza sativa, Vitis vinifera, Theobroma cacao, Glycine max, Prunus avium, Cucurbita maxima, Durio zibethinus, Arabidopsis, Nelumbo nucifera etc. in angiosperms together with Ginkgo biloba in gymnosperm as outgroup were used in this study. Consistently with P450 gene, the NJ tree based on beta-glucosidase gene showed similar result in Additional file 2: Figure S2.
Fig. 6

Simplified scheme of phenylpropanoid biosynthesis. Two key enzymes are PAL (phenylalanine ammonia-lyase) and F5H (ferulate-5-hydroxylase) which called the CYP84A1 gene of cytochrome P450

Fig. 7

A phylogenetic tree depicting the relationships among the CYP84A1s in angiosperm. Forty-two CYP84A1 genes from angiosperms with Ginkgo biloba as a outgroup were used in this study

Simplified scheme of phenylpropanoid biosynthesis. Two key enzymes are PAL (phenylalanine ammonia-lyase) and F5H (ferulate-5-hydroxylase) which called the CYP84A1 gene of cytochrome P450 A phylogenetic tree depicting the relationships among the CYP84A1s in angiosperm. Forty-two CYP84A1 genes from angiosperms with Ginkgo biloba as a outgroup were used in this study

Verification of differential gene expression by quantitative real-time PCR

To validate the gene expression results derived from the transcriptome data, six DEGs related to nutritional and functional compounds produced during the growth and development of E. ferox seed were selected for qRT-PCR. Detailed information about the six DEGs is given in Table 3. These DEGs are mainly involved in “starch and sucrose metabolism” (pectinesterase and beta-glucosidase), “phenylpropanoid biosynthesis” (cytochrome P450 and phenylalanine ammonia-lyase), and also “carotenoid biosynthesis” (9-cis-epoxycarotenoid dioxygenase and phytoene synthase). As shown in Fig. 8, the six DEGs had very similar expression patterns based on the transcriptome data and qRT-PCR results, which indicates that the results of the transcriptomic analysis are reliable (Additional file 6: Table S4).
Table 3

Oligonucleotide primers used in gene-specific qRT-PCR assays for six DEGs

Gene identifierGene descriptionFunctionForward primer (5′-3′)Reverse primer (5′-3′)Tm (°C)Product (bp)
c44497.graph_c0PectinesteraseStarch and sucrose metabolismCGCTGAAGTTTTGGGAGATGCCGACCTCCATTAGAAAACG54.9113
c50120.graph_c0Beta-glucosidaseStarch and sucrose metabolismGAACCCTCTTGAGCCTACTTGGTCCATCCACCGCACTGATAACC60.6185
c17722.graph_c0Cytochrome P450Phenylpropanoid biosynthesisTCTCGGTGATGAGTCCTCTGCTGTTCCGCCATTGCCCATTCTAT60.6157
c55946.graph_c2Phenylalanine ammonia-lyasePhenylpropanoid biosynthesisCTTCTCTGGCATTCGGTTCGACGATAGCGGCACCAAGTCC60.6121
c41810.graph_c19-cis-epoxycarotenoid dioxygenaseCarotenoid biosynthesisGTCATCCGAAAGCCTTACCTCCGTGGAAGCAGAAGCAGTCAGGG60.6293
c53300.graph_c0Phytoene synthaseCarotenoid biosynthesisCCTGTTGACATCCAGCCATTTCATCAGACCGACAGTTCCA55.6129
Fig. 8

Relative expression levels of six DEGs in four developmental stages of Euryale ferox Salisb. seed. We compared expression levels determined by RNA-seq and qRT-PCR analyses for each DEG

Oligonucleotide primers used in gene-specific qRT-PCR assays for six DEGs Relative expression levels of six DEGs in four developmental stages of Euryale ferox Salisb. seed. We compared expression levels determined by RNA-seq and qRT-PCR analyses for each DEG

Discussion

In recent years, gene expression in an increasing number of horticultural plants has been studied by transcriptomic analysis including cucumber, tomato, water chestnut, lettuce, and peanut [27, 46–49]. With the rapid development of sequencing technology, transcriptomic analysis has become more efficient and cost-effective. Liu et al. [48] reported that 51,792 unigenes with an average length of 849 bp and an N50 length of 1448 bp were obtained from lettuce seed using Trinity assembly software. Yin et al. [49] assembled 59,236 unigenes with a mean length of 751 bp and an N50 length of 1130 bp from transcriptome sequencing of peanut seed. In the present study, the N50 length from our E. ferox seed transcriptome assemblies was greater than in peanut seed but less than in lettuce seed, and all three were generated on the Illumina Hiseq 2000 platform [48, 49]. Undoubtedly, the transcriptome data differs between the three horticultural plant species. In addition, being edible seeds, over 1500 unigenes participate in lipid metabolism in the seven pathways that are related to the accumulation of oil in peanut seed during development [49]. We identified 313 unigenes in the four pathways involved in lipid metabolism in E. ferox seed development. This difference must be due to individual crop attributes; in this case, peanut is an oil crop, and the seeds are rich in oil, but E. ferox seed is mainly composed of carbohydrates. PAL is the key enzyme encoded by a multi-gene family in the biosynthesis of many valuable secondary metabolites, including flavonoid, alkaloid and lignin. There are four members in Arabidopsis [18] and tobacco [19], seven members in cucumber [17], and more than twenty copies in tomato [20]. In this study, gene encoding PAL of E. ferox seed is grouped into PAL1. PAL1 has been extensively studied in other plants in order to observe tissue-specific expression and understand secondary metabolites [14, 17–19]. The expression of PAL1 in E. ferox seed reached its peak at T3 period, followed by a slight decrease at T4 period. The increasing expression patterns during the maturity of seed influenced the increase of relevant secondary metabolites in phenylpropanoid pathway, which may explain the morphological changes and the accumulation of medicinal substances in E. ferox seed. Similar, in the development of the apple (variety ‘Jonathan’) and strawberry, PAL was related to the anthocyanin accumulation, and anthocyanin is a kind of flavonoid which has lots of health care functions as well as an effect in modification of color [24, 50]. As shown in Fig. 2, the color of E. ferox seed gradually deepened during the growth and development. This phenotypic change may be regulated by the associated secondary metabolite. Whereas CcPAL1, CcPAL2 and CcPAL3 were verified with the highest expression in the early stage of bean and pericarp in Coffea canephora [51]. The most likely reason was that the synthetic metabolites were different in the growth and development of plants. In fact, the types of metabolite were very rich due to the complexity of the metabolic pathway. Two PALs in Populus tremuloides, PtPAL1 was associated with condensed tannin metabolism and PtPAL2 was involved in monolignol biosynthesis, of which, tannin has remarkable biological and pharmacological activities [52, 53]. In coffee bean, CcPAL1 and CcPAL3 were linked with the accumulation of cholorogenic acids (CGA), whereas CcPAL2 may contribute more importantly to flavonoid accumulation [51]. As a traditional herb, E. ferox seeds are available for human health, so the accumulation of secondary metabolites is closely related to the growth and development of E. ferox seeds. Lignin, as another vital secondary metabolite, is quite crucial for structural integrity of the cell wall and strength and stiffness of the stem [54]. All of the AtPAL1, AtPAL2 and AtPAL4 were associated with lignin biosynthesis in Arabidopsis thaliana [18]. As shown in Fig. 6, there are multiple methods to synthesize lignin during the E. ferox seeds development. In addition to PAL, CYP84A1 (F5H) involved in biosynthetic pathway of lignin. The Fig. 2 of four stages in E. ferox seeds shown that, the stiffness of seeds was enhanced. This phenotypic change may be closely related to the accumulation of lignin. During the seed development in wheat, silencing of TaCYP78A3 led to the decrease in wheat seed size and cell proliferation in wheat seed coats, so TaCYP78A3 gene was proved that it can influence the size of seed [26]. The fruit and seeds of E. ferox became obviously larger at the four development stages (Fig. 2). The example in wheat seed can help understand the cellular basis of genes influencing the E. ferox seed development. For all of the DEGs in E. ferox seed development, the number of DEGs between any two adjacent developmental stages (T1 vs. T2, T2 vs. T3, and T3 vs. T4) was far less than the number found in the T1 vs. T3, T1 vs. T4, and T2 vs. T4 comparisons (Additional file 5: Table S3). This clearly shows constant development in E. ferox seeds. In addition, there were only 21 DEGs in the T3 vs. T4 comparison (Additional file 5: Table S3), which indicated that accumulation of the substances tended to saturate at the later stages of seed development. Above all, the pattern of DEGs was coincident with the stages of E. ferox seed growth and development.

Conclusions

In summary, we generated 313,844,425 clean reads from the 12 E. ferox seed cDNA libraries using the Illumina Hiseq 2000 platform. 85,006 unigenes with an N50 length of 1399 bp were obtained; of these, 17,769 unigenes were longer than 1 kb. Phenylpropanoid biosynthesis (129 genes annotated with 16 DEGs) involved in the synthesis of important secondary metabolites including flavonoids, lignins and alkaloid. PAL and P450 encoded genes related to this pathway played a vital role in the morphological changes and the accumulation of medicinal components. In the mature stage of E. ferox seed, 21 DEGs indicated that accumulation of the substances tended to saturate. And this pattern is consistent with the growth and development of E. ferox seed. Figure S1. KEGG taxonomy of differentially expressed genes. All of the unigenes were assigned to 127 pathways which were divided into five groups: cellular processes, environmental information processing, genetic information processing, metabolism, and organismal systems. (JPG 653 kb) Figure S2. A phylogenetic tree depicting the relationships among the beta-glucosidase genes in angiosperm. Forty-nine beta-glucosidase genes from angiosperms with Pinus contorta as a outgroup were used in this study. (JPG 2894 kb) Table S1. The assembled sequences of Euryale ferox were annotated by BLASTX against Arabidopsis thaliana TAIR10. (XLS 490703 kb) Table S2. The assembled sequences of Euryale ferox were annotated by BLASTX against rice 7.0 proteomes. (XLS 587887 kb) Tables S3. Differentially expressed genes (DEGs) identified in pairwise comparisons of developmental stages in E. ferox seeds. (DOCX 15 kb) Table S4. The FDR of differentially expressed genes (DEGs) identified in pairwise comparisons of developmental stages in E. ferox seeds. (DOCX 15 kb)
  42 in total

1.  Unpuréeing the tomato: layers of information revealed by microdissection and high-throughput transcriptome sequencing.

Authors:  Jennifer Mach
Journal:  Plant Cell       Date:  2011-11-09       Impact factor: 11.277

2.  Expression of TaCYP78A3, a gene encoding cytochrome P450 CYP78A3 protein in wheat (Triticum aestivum L.), affects seed size.

Authors:  Meng Ma; Qian Wang; Zhanjie Li; Huihui Cheng; Zhaojie Li; Xiangli Liu; Weining Song; Rudi Appels; Huixian Zhao
Journal:  Plant J       Date:  2015-07       Impact factor: 6.417

3.  EBSeq: an empirical Bayes hierarchical model for inference in RNA-seq experiments.

Authors:  Ning Leng; John A Dawson; James A Thomson; Victor Ruotti; Anna I Rissman; Bart M G Smits; Jill D Haag; Michael N Gould; Ron M Stewart; Christina Kendziorski
Journal:  Bioinformatics       Date:  2013-02-21       Impact factor: 6.937

4.  Polymorphic microsatellite markers in Euryale ferox Salisb. (Nymphaeaceae).

Authors:  Zhiwu Quan; Lei Pan; Weidong Ke; Yi Ding
Journal:  Mol Ecol Resour       Date:  2009-01       Impact factor: 7.090

5.  The cytochrome p450 homepage.

Authors:  David R Nelson
Journal:  Hum Genomics       Date:  2009-10       Impact factor: 4.639

6.  Regulation of ferulate-5-hydroxylase expression in Arabidopsis in the context of sinapate ester biosynthesis.

Authors:  M Ruegger; K Meyer; J C Cusumano; C Chapple
Journal:  Plant Physiol       Date:  1999-01       Impact factor: 8.340

Review 7.  Lignin biosynthesis.

Authors:  Wout Boerjan; John Ralph; Marie Baucher
Journal:  Annu Rev Plant Biol       Date:  2003       Impact factor: 26.379

8.  Phenylalanine ammonia-lyase (PAL) from tobacco (Nicotiana tabacum): characterization of the four tobacco PAL genes and active heterotetrameric enzymes.

Authors:  Angelika I Reichert; Xian-Zhi He; Richard A Dixon
Journal:  Biochem J       Date:  2009-11-11       Impact factor: 3.857

9.  RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome.

Authors:  Bo Li; Colin N Dewey
Journal:  BMC Bioinformatics       Date:  2011-08-04       Impact factor: 3.307

10.  De novo assembly of the peanut (Arachis hypogaea L.) seed transcriptome revealed candidate unigenes for oil accumulation pathways.

Authors:  Dongmei Yin; Yun Wang; Xingguo Zhang; Hemin Li; Xiang Lu; Jinsong Zhang; Wanke Zhang; Shouyi Chen
Journal:  PLoS One       Date:  2013-09-10       Impact factor: 3.240

View more
  9 in total

1.  Multi-functional topology optimization of Victoria cruziana veins.

Authors:  Hui-Kai Zhang; Jingyi Zhou; Wei Fang; Huichan Zhao; Zi-Long Zhao; Xindong Chen; Hong-Ping Zhao; Xi-Qiao Feng
Journal:  J R Soc Interface       Date:  2022-06-15       Impact factor: 4.293

2.  Genome-Wide Analysis of NBS-LRR Genes From an Early-Diverging Angiosperm Euryale ferox.

Authors:  Lan-Hua Qian; Jia-Yi Wu; Yue Wang; Xin Zou; Guang-Can Zhou; Xiao-Qin Sun
Journal:  Front Genet       Date:  2022-05-13       Impact factor: 4.772

3.  Bulk segregant analysis identifies SSR markers associated with leaf- and seed-related traits in Perilla crop (Perilla frutescens L.).

Authors:  Su Eun Lim; Kyu Jin Sa; Ju Kyong Lee
Journal:  Genes Genomics       Date:  2021-02-04       Impact factor: 1.839

4.  Small Auxin Up RNAs influence the distribution of indole-3-acetic acid and play a potential role in increasing seed size in Euryale ferox Salisb.

Authors:  Zhiheng Huang; Ke Bao; Zonghui Jing; Qian Wang; Huifang Duan; Yaying Zhu; Sen Zhang; Qinan Wu
Journal:  BMC Plant Biol       Date:  2020-07-03       Impact factor: 4.215

5.  Transcriptome analysis reveals important candidate genes involved in grain-size formation at the stage of grain enlargement in common wheat cultivar "Bainong 4199".

Authors:  Yuanyuan Guan; Gan Li; Zongli Chu; Zhengang Ru; Xiaoling Jiang; Zhaopu Wen; Guang Zhang; Yuquan Wang; Yang Zhang; Wenhui Wei
Journal:  PLoS One       Date:  2019-03-25       Impact factor: 3.240

6.  Characterization the complete chloroplast genome of Euryale ferox (Nymphaeaceae), an medicinal plant species in China.

Authors:  Zhiyong Guo; Jie Min
Journal:  Mitochondrial DNA B Resour       Date:  2020-06-08       Impact factor: 0.658

7.  EfABI4 Transcription Factor Is Involved in the Regulation of Starch Biosynthesis in Euryale ferox Salisb Seeds.

Authors:  Peng Wu; Yue Zhu; Ailian Liu; Yuhao Wang; Shuping Zhao; Kai Feng; Liangjun Li
Journal:  Int J Mol Sci       Date:  2022-07-08       Impact factor: 6.208

8.  Euryale Small Auxin Up RNA62 promotes cell elongation and seed size by altering the distribution of indole-3-acetic acid under the light.

Authors:  Zhi-Heng Huang; Ke Bao; Zong-Hui Jing; Qian Wang; Hui-Fang Duan; Sen Zhang; Wei-Wei Tao; Qi-Nan Wu
Journal:  Front Plant Sci       Date:  2022-09-09       Impact factor: 6.627

9.  Transcriptome Analysis of Jojoba (Simmondsia chinensis) during Seed Development and Liquid Wax Ester Biosynthesis.

Authors:  Saqer S Alotaibi; Mona M Elseehy; Bandar S Aljuaid; Ahmed M El-Shehawi
Journal:  Plants (Basel)       Date:  2020-05-04
  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.