Shengzhi Liu1, Zhuoheng Zhong2, Zijian Sun1, Jingkui Tian3, Kaisa Sulaiman4, Eman Shawky5, Hongwei Fu2, Wei Zhu3. 1. College of Biomedical Engineering and Instrument Science, Zhejiang University, Hangzhou, Zhejiang 310027, China. 2. College of Life Sciences and Medicine, Zhejiang Sci-Tech University, Hangzhou 310018, China. 3. The Cancer Hospital of the University of Chinese Academy of Sciences (Zhejiang Cancer Hospital), Institute of Basic Medicine and Cancer (IBMC), Chinese Academy of Sciences, Hangzhou 310002, China. 4. The Xinjiang Uygur Autonomous Region National Institute of Traditional Chinese Medicine, Urumchi, Xinjiang 830092, China. 5. Department of Pharmacognosy, Faculty of Pharmacy, Alexandria University, Alexandria 21521, Egypt.
Abstract
Moracins, a kind of 2-phenyl-benzofuran compound from Moraceae, serve as phytoalexins with antimicrobial, anti-inflammatory, antitumor, and antidiabetes activities and respond to biotic and abiotic stresses, while their biosynthetic pathway and regulatory mechanism remain unclear. Here, we report a de novo transcriptome sequencing for different tissues of seedlings, as well as leaves under different stresses, in M. alba L. A total of 88 282 unigenes were assembled with an average length of 937 bp, and 82.2% of them were annotated. On the basis of the differential expression analysis and enzymatic activity assays in vitro, moracins were traced to the phenylpropanoid pathway, and a putative biosynthetic pathway of moracins was proposed. Unigenes coding key enzymes in the pathway were identified and their expression levels were verified by real-time quantitative reverse transcription PCR (qRT-PCR). Particularly, a p-coumaroyl CoA 2'-hydroxylase was presumed to be involved in the biosynthesis of stilbenes and deoxychalcones in mulberry. Additionally, the transcription factors that might participate in the regulation of moracin biosynthesis were obtained by coexpression analysis. These results shed light on the putative biosynthetic pathway of moracins, providing a basis for further investigation in functional characterization and transcriptional regulation of moracin biosynthesis in mulberry.
Moracins, a kind of 2-phenyl-benzofuran compound from Moraceae, serve as phytoalexins with antimicrobial, anti-inflammatory, antitumor, and antidiabetes activities and respond to biotic and abiotic stresses, while their biosynthetic pathway and regulatory mechanism remain unclear. Here, we report a de novo transcriptome sequencing for different tissues of seedlings, as well as leaves under different stresses, in M. alba L. A total of 88 282 unigenes were assembled with an average length of 937 bp, and 82.2% of them were annotated. On the basis of the differential expression analysis and enzymatic activity assays in vitro, moracins were traced to the phenylpropanoid pathway, and a putative biosynthetic pathway of moracins was proposed. Unigenes coding key enzymes in the pathway were identified and their expression levels were verified by real-time quantitative reverse transcription PCR (qRT-PCR). Particularly, a p-coumaroyl CoA 2'-hydroxylase was presumed to be involved in the biosynthesis of stilbenes and deoxychalcones in mulberry. Additionally, the transcription factors that might participate in the regulation of moracin biosynthesis were obtained by coexpression analysis. These results shed light on the putative biosynthetic pathway of moracins, providing a basis for further investigation in functional characterization and transcriptional regulation of moracin biosynthesis in mulberry.
Morus, a genus of flowering plants in the family
Moraceae, has been used as traditional herb and silkworm fodder for
thousands of years in China, bringing about important economic and
medicinal values. Many parts of mulberry, including leaves, fruits,
branches, and root barks, have a variety of biological and pharmacological
activities that are antioxidant, antimicrobial, antihyperglycemic,
glucosidase-inhibiting, hypolipidemic, antiobesity, antiatherosclerosis,
and antineoplastic, which are attributed to the bioactive compounds
enriched in tissues.[1,2] Until now, several kinds of active
ingredients, including flavonoids, alkaloids, stilbenoids, coumarins,
and phenolic acids, have been identified from mulberry.[1] Furthermore, mulberry is also recognized as a
rich source of benzofuran derivatives and Diels–Alder adducts.[3]Moracins, the derivatives of 2-phenyl-benzofuran,
are mainly isolated
from Moraceae, especially Morus. Structurally, they
all contain a scaffold of 2-phenyl substituted benzo[b]furan-fused-ring. Twenty-six moracins (A–Z) and their derivatives,
modified with hydroxylation, methylation, prenylation, cyclization,
glycosylation, and Diels–Alder reaction,[4] have been identified from Moraceae, and some of them showed
valuable biological and pharmacological activities. For example, moracin
M was found to inhibit lipopolysaccharide-induced inflammatory responses
in nucleus pulposus cells;[5] moracin S,
with a prenyl moiety, showed the highest BACE1 inhibitory activity
compared with the other moracins, which provided valuable information
for the design of anti-Alzheimer’s disease drugs;[6] Gao et al.[7] reported
moracin N induced autophagy and apoptosis through ROS generation in
lung cancer. In addition, moracins were found to possess antimicrobial,
antioxidant, antidiabetes, and phosphodiesterase inhibition properties.[4] However, only some of them have been synthesized
chemically.[8−10] Considering the difficulties in extraction and isolation
of moracins from Morus, the development of their
medicinal potential is largely restricted. Therefore, there is an
emergency need to elucidate their biosynthetic pathway.Transcriptomics
has been an efficient tool for identifying biosynthetic
pathways of nature products in the past decade. In contrast to microbes,
the biosynthesis genes of plant-derived compounds are usually not
enriched in a cluster on chromosomes, making it hard to screen them
directly in the genome. Nevertheless, on the metabolic level, metabolites
differentially accumulate in some specific tissues and are affected
by biotic or abiotic stress, which is thought to result from the differential
expression of biosynthesis genes. Besides, biosynthetic pathway genes
in plants are often coordinately regulated, so it is possible to identify
candidate genes through coexpression analysis with the dedicated pathway
gene to be used as a bait. A few studies have been done on the basis
of this method, including identification of genes related to biosynthesis
of etoposide aglycone,[11] seco-iridoid,[12] protolimonoid,[13] triterpenoid
saponin,[14] and colchicine.[15]To date, there is little information regarding the
biosynthetic
pathway of moracins. In this study, we performed de novo RNA sequencing toward different tissues of M. alba L. seedlings, as well as leaves, under ultraviolet-B (UV-B) treatment
and Botrytis cinerea infection. Differential expression
analysis and coexpression analysis were used to identify candidate
genes involved in moracin biosynthesis and transcriptional regulation,
and a putative biosynthetic pathway was proposed. Our results will
provide a foundation for further biological synthetic study.
Results and Discussion
Identification of Compounds
in the Moracin
Pathway from Different Samples
Several main metabolites in
the moracin biosynthetic pathway were identified by high-performance
liquid chromatography (HPLC) and found to be differentially distributed
in different tissues (Figure a) and leaves under different treatments (Figure b), and their relative contents
were quantified and shown in Table S2.
In untreated seedlings, compounds in the moracin pathway were mainly
enriched in roots, especially in fibrous roots (FR), while they were
undetectable in untreated leaves (CKL) and stems (S). In addition,
leaves infected by B. cinerea (BCL) accumulated all
interesting compounds in moracin pathway, including moracin M (standard
3), moracin C (standard 4), moracin N (standard 5), as well as oxyresveratrol
(standard 2) and chalcomoracin (standard 7), the up- and downstream
compound of moracins, respectively. However, in UV-B-treated leaves
(UVL), the accumulation of these metabolites was relatively lower,
and moracin M was even undetectable. Besides, mulberroside A (standard
1), the diglucoside of oxyresveratrol, was found to be enriched in
taproots (TR) and FR significantly, but was rarely present in other
groups, which was not inconsistent with the expression of stilbenoid
pathway and could be explained by the expression levels of the glycosyltransferases
responsible for mulberroside A biosynthesis in different groups.
Figure 1
HPLC analysis
of the metabolites involved in moracin biosynthetic
pathway. (a) Metabolites in different tissues of mulberry. FR, fibrous
roots; S, stems; CKL, untreated leaves; TR, taproots. (b) Metabolites
in different treated mulberry leaves. UVL, UV-B-treated leaves; BCL, B. cinerea-infected leaves. Standards 1–7 represent
mulberroside A, oxyresveratrol, moracin M, moracin C, moracin N, morachalcone
A, and chalcomoracin, respectively.
HPLC analysis
of the metabolites involved in moracin biosynthetic
pathway. (a) Metabolites in different tissues of mulberry. FR, fibrous
roots; S, stems; CKL, untreated leaves; TR, taproots. (b) Metabolites
in different treated mulberry leaves. UVL, UV-B-treated leaves; BCL, B. cinerea-infected leaves. Standards 1–7 represent
mulberroside A, oxyresveratrol, moracin M, moracin C, moracin N, morachalcone
A, and chalcomoracin, respectively.As indicated by previous studies, moracins are unique and vital
compounds mostly isolated from mulberry barks and root barks, as well
as diseased shoots.[4] In our study, HPLC
analysis showed consistent results that moracins were only detected
in the roots of mulberry seedlings but accumulated in leaves under
biotic infection and UV-B radiation to different degrees, suggesting
their function as phytoalexins.
Enzymatic
Assays of the Key Step for Biosynthesis
of Moracins in Mulberry
Moracins share the structures of
2-phenyl-benzofuran with two C–O bonds at the meta positions
of the benzene and another one at the C-6 position of benzofuran.
They are mainly isolated from Moraceae plants, and most of their derivatives
are modified by hydroxylation, methylation, prenylation, and cyclization.
Among them, moracin M is the simplest one in structure and recognized
as the precursor for the other moracins (Table S1). Therefore, the biosynthesis of moracin M is important
for the biosynthesis of moracins in mulberry. In the chromatographic
profile of BCL, a strong peak eluted at 14.8 min was considered as
oxyresveratrol because of the same retention time and ultraviolet
absorption with the standard 2 (Figure ). It is worth noting that oxyresveratrol has a similar
structure to moracin M, especially for the hydroxy position on benzene
ring, which suggests that oxyresveratrol participates in the response
to biotic stress along with moracin pathway.Enzymatic activity
assays in vitro showed that oxyresveratrol could
be converted into moracin M under the catalyzation of crude enzymes
from fibrous roots of mulberry seedlings (Figure S1). Therefore, it was proved that the formation of benzofuran
ring of moracins was enzymatic, and the enzyme(s) responsible were
named as moracin M synthase(s) (MMS). The MMS activity from other
groups was also analyzed. The results showed that the MMS activity
of fibrous roots was the highest, followed by taproots and stems;
however, no MMS activity was detected in untreated leaves and UV-B-treated
leaves (Figure ).
Meanwhile, in B. cinerea-infected leaves, the MMS
activity was activated to a detectable level. The combined enzymatic
assays and metabolic level (Figure ) indicated that the biosynthesis of moracins was enriched
in roots, not in leaves, and could be significantly activated under B. cinerea infection rather than UV-B radiation.
Figure 2
Enzymatic activity
of moracin M synthase in different groups. FR,
fibrous roots; TR, taproots; S, stems; CKL, untreated leaves; UVL,
UV-B-treated leaves; BCL, B. cinerea-infected leaves.
Enzymatic activity
of moracin M synthase in different groups. FR,
fibrous roots; TR, taproots; S, stems; CKL, untreated leaves; UVL,
UV-B-treated leaves; BCL, B. cinerea-infected leaves.Mulberry contains high levels of stilbenoids, including
oxyresveratrol
and its diglucoside mulberroside A (Figure ).[16] Stilbenoids
are phenylpropanoid phytoalexins produced by plants in response to
biotic and abiotic stress.[17] Therefore,
the MMS relates moracin biosynthesis to stilbenoid and phenylpropanoid
pathways (Figure ).
Figure 4
Putative phenylpropanoid pathway in M. alba. The
expression levels of unigenes coding the enzymes are shown in heatmap.
PAL, phenylalanine ammonia-lyase; C4H, trans-cinnamate
4-hydroxylase; 4CL, 4-coumarate-CoA ligase; C2′H, p-coumaroyl CoA 2′-hydroxylase; CHS, chalcone synthase; CHR,
chalcone reductase; STS, stilbene synthase; COSY, coumarin synthase.
FR, fibrous roots; TR, taproots; S, stems; CKL, untreated leaves;
UVL, UV-B-treated leaves; BCL, B. cinerea-infected
leaves. The solid and broken lines represent enzymatic reactions demonstrated
in other species and putative in M. alba, respectively.
Transcriptome Sequencing, De Novo Assembly and Functional Annotation
Transcriptome sequencing
of 6 groups containing 17 samples of M. alba L. generated
388.3 million raw reads (Table S3). After
filtering, 382.3 million clean reads and 115.0 GB clean bases were de novo assembled using Trinity, yielding a total of 192 111
transcripts and 88 282 unigenes. The average length and N50
value of unigenes were 937 bp and 1421 bp, respectively (Table S4). Functional annotation showed that
61 514 (69.7%), 62 154 (70.4%), 21 626 (24.5%),
43 170 (48.9%), 37 643 (42.6%), 37 639 (42.6%),
and 11 748 (13.3%) unigenes were annotated in NR (nonredundant
protein database), NT (nonredundant nucleotide database), KO (Kyoto
Encyclopedia of Genes and Genomes Orthology), SwissProt (annotated
protein sequence database), Pfam (protein family), GO (Gene Ontology),
and KOG (EuKaryotic Orthologous Groups) databases, respectively (Table S5).
Differential
Expression Analysis and Kyoto
Encyclopedia of Genes and Genomes (KEGG) Enrichment
To identify
differentially expressed genes (DEGs) in different tissues, as well
as leaves under UV-B and B. cinerea treatments, read
counts of all unigenes were normalized through DESeq method, and then
hypothesis testing was conducted in pairwise groups (FR vs TR, FR
vs S, FR vs CKL, UVL vs CKL, and BCL vs CKL). The FR vs TR comparison
revealed 8176 DEGs, with 4982 unigenes up-regulated and 3194 unigenes
down-regulated. For the FR versus S comparison, 10 447 DEGs
were identified, including 5608 up-regulated and 4839 down-regulated
unigenes. Furthermore, the most DEGs, 8077 up-regulated and 6893 down-regulated,
were identified in the FR versus CKL comparison, suggesting a large
biological difference between roots and leaves. In addition, compared
to untreated leaves, 13 972 unigenes were differentially expressed
in UVL, including 6734 up-regulated unigenes and 7238 down-regulated
unigenes, while in BCL the numbers were 8629, 4593, and 4036, respectively.These DEGs were then enriched in KEGG pathways, and the top 20
enrichment pathways in each comparison are shown in Figure . Interestingly, we found that
up-regulated DEGs in FR compared to the other tissues (TR, S, and
CKL) were mostly enriched in “phenylpropanoid biosynthesis,”
“ubiquinone and other terpenoid-quinone biosynthesis,”
and “flavonoid biosynthesis.” Besides, compared with
CKL, the up-regulated DEGs in UVL and BCL were also enriched in “flavonoid
biosynthesis;” “phenylalanine, tyrosine and tryptophan
biosynthesis;” and “phenylpropanoid biosynthesis”
pathways. All of these pathways had a close relation to the biosynthesis
of phenylpropanoids. In addition to flavonoids, moracins were also
presumed to be derived from phenylpropanoid pathway due to the discovery
of MMS. These results were consistent with the enrichment of moracin
biosynthesis in FR, UVL, and BCL at the level of transcription. However,
the down-regulated DEGs in UVL and BCL were mainly enriched in “photosynthesis,”
“porphyrin and chlorophyll metabolism,” “photosynthesis-antenna
proteins,” and “carbon fixation in photosynthetic organisms”
pathways, indicating the damage of UV-B radiation and B. cinerea infection on photosynthesis in mulberry leaves.
Figure 3
KEGG enrichment of top
20 pathways of DEGs in different comparisons.
FR, fibrous roots; TR, taproots; S, stems; CKL, untreated leaves;
UVL, UV-B-treated leaves; BCL, B. cinerea-infected
leaves. Rectangles represent the major enriched pathways.
KEGG enrichment of top
20 pathways of DEGs in different comparisons.
FR, fibrous roots; TR, taproots; S, stems; CKL, untreated leaves;
UVL, UV-B-treated leaves; BCL, B. cinerea-infected
leaves. Rectangles represent the major enriched pathways.
Identification and Expression Analysis of
Known Genes Involved in the Moracin Biosynthetic Pathway
The confirmation of enzymatically converting oxyresveratrol to moracin
M enabled us to integrate the biosynthesis of moracins with the downstream
of stilbenoids and phenylpropanoid pathways and correlate them legitimately.
By analyzing their expressions in different groups, homologous genes
involved in putative pathway were filtered and summarized (Table S6). Unigenes coding phenylalanine ammonia-lyase
(PAL, EC 4.3.1.24), trans-cinnamate 4-hydroxylase
(C4H, EC 1.14.14.91), and 4-coumarate-CoA ligase (4CL, EC 6.2.1.12),
common and conserved enzymes of phenylpropanoid pathway, were examined
and visualized (Figure ). Three, two, and three nonredundant unigenes
were annotated as PAL, C4H, and 4CL (Table S6), respectively. Chalcone reductase (CHR), combined with chalcone
synthase (CHS, EC 2.3.1.70), catalyzed three molecules of malonyl-CoA
and one molecule of p-coumaroyl CoA to form deoxychalcone.[18] Six and one unigenes were identified to code
CHS and CHR, respectively. Another two unigenes were taken as coding
stilbene synthase (STS, EC 2.3.1.74), which utilized the same substrates
as CHS but produced the resveratrol instead.[19−21] Based on the
fragments per kilobase million (FPKM) values, most of these unigenes
had the highest expression level in FR, followed by TR and S, with
relatively low expression in CKL. In addition, most of these unigenes
were up-regulated in UVL and BCL compared to CKL (Figure and Table S6).Putative phenylpropanoid pathway in M. alba. The
expression levels of unigenes coding the enzymes are shown in heatmap.
PAL, phenylalanine ammonia-lyase; C4H, trans-cinnamate
4-hydroxylase; 4CL, 4-coumarate-CoA ligase; C2′H, p-coumaroyl CoA 2′-hydroxylase; CHS, chalcone synthase; CHR,
chalcone reductase; STS, stilbene synthase; COSY, coumarin synthase.
FR, fibrous roots; TR, taproots; S, stems; CKL, untreated leaves;
UVL, UV-B-treated leaves; BCL, B. cinerea-infected
leaves. The solid and broken lines represent enzymatic reactions demonstrated
in other species and putative in M. alba, respectively.It was interesting that a unigene annotated as p-coumaroyl CoA 2′-hydroxylase (C2′H) showed
a high
similar expression pattern to the aforementioned unigenes in phenylpropanoid
pathway (Figure and Table S6). C2′H was a 2-oxoglutarate-dependent
dioxygenase that catalyzed the ortho-hydroxylation
of p-coumaroyl CoA and feruloyl CoA, produced umbelliferone
and scopoletin in final via a spontaneous or enzymatic cyclization,
respectively, and participated in the biosynthesis of coumarins.[22,23] We further determined the transcription levels of genes involved
in the biosynthesis of simple coumarins, since no furan- or pyran-
coumarins were reported in mulberry.[1] Coumarins
were also derived from the phenylpropanoid pathway as flavonoids and
stilbenoids, and the subsequent efficient trans–cis isomerization and lactonization catalyzed by coumarin
synthase (COSY) formed the coumarin skeleton (Figure ).[24] As expected,
COSY showed a lower expression level in our transcriptome compared
with C2′H (Figure and Table S6), making us reexamine
the roles of C2′H in secondary metabolism of mulberry.In the biosynthesis of moracins, we noticed a pair of similar stilbenes,
resveratrol and oxyresveratrol, and the latter had an extra hydroxyl
at the C-2’ position. It was similar to the difference between p-coumaroyl CoA and 2′,4′-dihydroxycinnamoyl
CoA (Figure and S2). STS catalyzed three molecules of malonyl-CoA
and one molecule of p-coumaroyl CoA to produce resveratrol
in peanut,[19] grape,[20] and mulberry.[21] STS converted
cinnamoyl-CoA and malonyl-CoA to pinosylvin, a stilbene similar to
resveratrol and oxyresveratrol, while in scots pine (Figure S2).[25] Therefore, oxyresveratrol
was reasonably supposed to derive from 2′,4′-dihydroxycinnamoyl
CoA and malonyl-CoA under the catalysis of STS. In other words, C2′H
might play an important role in the biosynthesis of stilbenoids and
moracins in mulberry.Meanwhile, 2,4,2′,4′-tetrahydroxychalcone
was determined
in structure by reasoning about the prenylation catalyzed by M. alba isoliquiritigenin 3′-dimethylallyltransferase
(MaIDT) in formation of morachalcone A (Figure ).[26] This compound
had an extra hydroxyl at the C-2 position compared to isoliquiritigenin,
a deoxychalcone formed by malonyl-CoA and p-coumaroyl
CoA in the presence of CHS-CHR complexes and involved in the biosynthesis
of deoxyflavonoids (Figure S2).[18] Similar to oxyresveratrol, 2,4,2′,4′-tetrahydroxychalcone
was presumed to be biosynthesized from 2′,4′-dihydroxycinnamoyl
CoA and malonyl CoA, which was supported by the identification of
a highly expressed unigene coding CHR (Figure and Table S6).
In summary, by structure comparison, we assigned C2′H into
the biosynthesis of stilbenoids and flavonoids in mulberry, which
was supported by our transcriptome data.
Figure 5
Putative biosynthetic
pathway for moracins and their downstream
Diels–Alder adducts in M. alba. The expression
levels of unigenes coding the enzymes are shown in heatmap. MMS, moracin
M synthase(s); PT, prenyltransferase; P450, cytochrome P450s; MaIDT, M. alba isoliquiritigenin 3′-dimethylallyltransferase;
MaMO, M. alba moracin C oxidase; MaDA, M.
alba Diels–Alderase. FR, fibrous roots; TR, taproots;
S, stems; CKL, untreated leaves; UVL, UV-B-treated leaves; BCL, B. cinerea infected leaves. The solid and broken lines (curves)
represent verified and putative enzymatic reactions in M.
alba, respectively.
Putative biosynthetic
pathway for moracins and their downstream
Diels–Alder adducts in M. alba. The expression
levels of unigenes coding the enzymes are shown in heatmap. MMS, moracin
M synthase(s); PT, prenyltransferase; P450, cytochrome P450s; MaIDT, M. alba isoliquiritigenin 3′-dimethylallyltransferase;
MaMO, M. alba moracin C oxidase; MaDA, M.
alba Diels–Alderase. FR, fibrous roots; TR, taproots;
S, stems; CKL, untreated leaves; UVL, UV-B-treated leaves; BCL, B. cinerea infected leaves. The solid and broken lines (curves)
represent verified and putative enzymatic reactions in M.
alba, respectively.Candidate unigenes coding the other known enzymes in moracin and
the downstream pathway were identified and summarized (Figure and Table S6). Three unigenes were annotated as MaIDT, M. alba moracin C oxidase (MaMO), and M. alba Diels–Alderase
(MaDA), respectively, and the levels of expression were consistent
with that of the former unigenes.
Identification
of Unknown Candidate Genes
Involved in the Moracin Biosynthetic Pathway
Transcriptomic
analyses also indicated that the transcriptional levels of genes involved
in the moracin biosynthetic pathway were highly consistent with the
metabolic level, inspiring us to screen the unknown candidate genes
according to the following rules. First, the up-regulated DEGs in
FR versus TR, FR versus S, FR versus CKL, and BCL versus CKL comparisons
were gathered and analyzed, and 717 DEGs were commonly up-regulated
in all four comparisons (Figure ). Most of the inconsistent unigenes were filtered
at the first step while the unigenes with high expression levels in
FR group and activated by B. cinerea infection were
reserved. Second, since moracin biosynthetic pathway was silent in
untreated leaves, another 97 DEGs were excluded because their average
FPKM values were higher than 10 in the CKL group. The remaining 620
DEGs were used for the further screening of the candidates coding
specific enzymes in moracin biosynthetic pathway such as MMS, prenyltransferases
(PTs), and cytochrome P450s.
Figure 6
Venn diagram of up-regulated DEGs in different
comparisons. FR,
fibrous roots; TR, taproots; S, stems; CKL, untreated leaves; UVL,
UV-B-treated leaves; BCL, B. cinerea-infected leaves.
Venn diagram of up-regulated DEGs in different
comparisons. FR,
fibrous roots; TR, taproots; S, stems; CKL, untreated leaves; UVL,
UV-B-treated leaves; BCL, B. cinerea-infected leaves.Homologous BLAST (basic local alignment search
tool) was improper
for identification of candidate unigenes for MMS since the reaction
type had not been determined. Therefore, a further filter was conducted
on the basis of the enzymatic level. DEGs with the average FPKM values
≥10 in UVL group or <10 in FR group were eliminated and
118 unigenes were retained, of which 64 unigenes possessed open reading
frames (ORFs) of more than 200 amino acids (Table S7). According to the Pfam annotation and the type of reaction
that we could hypothesize, nine candidate unigenes for MMS were proposed,
including five cytochrome P450s, as well as one for berberine bridge
enzymelike protein, multicopper oxidase (laccase), polyphenol oxidase,
and 2-oxoglutarate-dependent oxygenase (Table S7).Prenylations and oxidative cyclizations were the
most common and
significant modifications in the process of moracins biosynthesis
(Table S1).[4] Studies showed that PTs could catalyze the prenylation of various
substrates, including flavonoids,[27] coumarins,[28] stilbenoids,[29] and
other phenolic compounds.[30] Two PT sequences
have previously been reported from mulberry. One was MaIDT, which
catalyzed the C-3′ and C-6 prenylation on chalcones and flavones,
respectively,[26] and the other was MaOGT
catalyzing the prenylation of oxyresveratrol.[31] Most PTs had a strong substrate specificity and only accepted one
or several similar prenyl acceptors. However, for moracins, prenylations
occurred at different sites on similar substrates. Therefore, there
might be several different PTs catalyzing the prenylation of moracins.
The identification of PTs involved in moracin pathway was carried
out based on the annotation in Pfam database. The Pfam term “UbiA
prenyltransferase family” (Pfam ID: PF01040) was used to screen
the 620 DEGs, and 4 candidates were obtained (Figure and Table S6).Similar to furanocoumarins and pyranocoumarins, some moracins also
had a furan or pyran ring coupled to the benzenes in structure, such
as moracins D, E, K, O, P, and W (Table S1). It has been reported that the furan or pyran ring was formed by
the cyclization of isopentenyl and ortho-hydroxyl
under the catalysis of cytochrome P450s.[32,33] To identify the P450s responsible for the formation of furan and
pyran rings in moracins, the same method for PTs screening was used
for P450s, and 17 candidate unigenes with a Pfam annotation of “Cytochrome
P450” (Pfam ID: PF00067) were acquired (Figure and Table S6).
Identification of Transcription Factors (TFs)
Related to Moracin Biosynthetic Pathway
TFs are a group of
specific DNA-binding proteins that can regulate gene expression. In
our transcriptome, a total of 2564 unigenes were identified as putative
TFs, and most of them were annotated to MYB (216, 8%), AP2/ERF (171,
7%), C2H2 (132, 5%), bHLH (119, 5%), WRKY (107, 4%), GRAS (101, 4%),
and C2C2 (100, 4%) families (Table ). To identify TFs regulating moracin biosynthesis,
the unigene “Cluster_14943.34115,” annotated as STS
with an intact ORF of 391 amino acids and serving as the specific
structural gene in moracin pathway, was used as the query for coexpression
analysis of putative TF-coding unigenes. There were 69 TF-coding unigenes
that showed high coexpression (Pearson’s r > 0.80 or < −0.80) with Cluster_14943.34115 (Table S8), including 66 positive ones and 3 negative
ones. Furthermore, these coexpressed TFs mostly consisted of AP2/ERF
family (16, 23%), WRKY family (11, 16%), and NAC family (9, 13%) (Table ). Although no study
about TFs regulating the biosynthesis of moracins or stilbenoids in
mulberry was reported, several TFs involved in stilbenoids biosynthesis
have been cloned and characterized in Vitis vinifera. These TFs belong to MYB, AP2/ERF, WRKY, and bZIP families and regulate
the expression of STS directly or indirectly.[34−37] Therefore, the regulation of
stilbenoids and moracins in mulberry might be complicated and interlaced,
and much work remains to be done.
Table 1
Type and Number of
Transcription Factor
(TF) Families
coexpressed
TFs
TF family
total TFs
positive
negative
MYB superfamily
216
4
0
AP2/ERF
171
16
0
C2H2
132
4
0
bHLH
119
4
0
WRKY
107
11
0
GRAS
101
3
0
C2C2
100
1
0
HB
90
1
0
C3H
89
2
0
NAC
88
9
0
bZIP
82
1
0
B3 superfamily
78
1
2
AUX/IAA
54
0
1
mTERF
51
0
0
FAR1
50
0
0
Trihelix
49
0
0
GARP
45
2
0
NF-Y
42
0
0
LOB
39
0
0
others
861
7
0
total
2564
66
3
Real-Time Quantitative
Reverse Transcription
PCR (qRT-PCR) Validation of DEGs Involved in the Moracin Biosynthetic
Pathway
The expression levels of nine DEGs involved in the
proposed moracin biosynthetic pathway were verified by qRT-PCR analysis.
Most of unigenes showed a high expression trend in FR, low in CKL,
and up-regulated in UVL and BCL groups (Figure ). The results were consistent with the transcriptomic
data.
Figure 7
Validation for DEGs by qRT-PCR. The relative expression of qRT-PCR
was indicated on the left y-axis and the FPKM normalized
expression level (log2FPKM) of RNA-Seq was indicated on the right y-axis. FR, fibrous roots; TR, taproots; S, stems; CKL,
untreated leaves; UVL, UV-B-treated leaves; BCL, B. cinerea-infected leaves.
Validation for DEGs by qRT-PCR. The relative expression of qRT-PCR
was indicated on the left y-axis and the FPKM normalized
expression level (log2FPKM) of RNA-Seq was indicated on the right y-axis. FR, fibrous roots; TR, taproots; S, stems; CKL,
untreated leaves; UVL, UV-B-treated leaves; BCL, B. cinerea-infected leaves.
Conclusions
On the basis of differential expression analysis and enzymatic
assay in vitro, a putative biosynthesis pathway of
moracins was proposed in M. alba L. In addition,
a total of 51 structural unigenes and 69 TFs were putatively identified
to participate in moracin biosynthesis and could be further confirmed
by functional validation and DNA-binding assays.[38] We provided evidence that moracins are derived from stilbene
and phenylpropanoid pathway, and there could be two novel pathways
for stilbene and deoxychalcone biosynthesis in mulberry. The presented
work provides a basis for further investigation in functional characterization
and transcriptional regulation of moracin biosynthesis in mulberry.
Methods
Plant Materials and Treatments
Mature
seeds of M. alba L. were collected from the experimental
plot in Zijingang Campus of Zhejiang University (Hangzhou, China),
washed with running water, and sown directly in soil for germination.
Mulberry seedlings were grown in a green house at 25 ± 1 °C
with a 16 h light/8 h dark cycle for 5 to 6 weeks. The TR, FR, and
S tissues were harvested. At the same time, the middle 3∼4
pieces of leaves were detached, followed by wrapping petioles with
wet cotton balls, and used for either of the following treatments:
(a) the detached leaves were treated with UV-B (290–320 nm)
radiation according to our previous method,[39] transferred to covered Petri dishes to retain moisture, and incubated
at 25 °C in darkness for 36 h; (b) each mulberry leaf was inoculated
with 44 mm diameter mycelium disks taken from 10-day-old B.
cinerea grown on potato dextrose agar medium, placed in a
covered Petri dish, and incubated at 25 °C with a 16 h light/8
h dark cycle for 48 h. The CKL, UVL, and BCL samples were harvested
at the given points in time. All samples were immediately snap-frozen
in liquid nitrogen and stored at −80 °C for future use.
Tissues from each of the four plants were pooled into one biological
replicate, and the same samples were used for metabolite analysis,
RNA sequencing (RNA-seq), and qRT-PCR.
Metabolite
Extraction and HPLC Analysis
Mulberry samples were grinded
into powder in liquid nitrogen and
lyophilized overnight. A portion (0.5 mL) of absolute methanol was
added to a 1.5 mL EP tube containing 10 mg of dried sample, followed
by sonication for 60 min and centrifugation at 12 000g for 20 min to precipitate plant debris. The supernatants
were then filtered through a 0.22 μm nylon filter and used for
HPLC analysis. The standards of mulberroside A, oxyresveratrol, and
moracin M were purchased from Yuanye Bio-Technology (Shanghai, China),
and the others (moracin C, moracin N, morachalcone A, and chalcomoracin)
were isolated and identified by our lab.[40]A 10 μL aliquot of extracted samples and the mixed standard
solution (10 μM for each compound in methanol) was used for
HPLC analysis on an Agilent Series 1260 liquid chromatograph (Agilent)
with a C18 column (250 mm × 4.6 mm, inner dimension: 5 mm, Agilent).
Acetonitrile (solvent A) and 0.1% formic acid in water (solvent B)
were used as the mobile phase with a flow rate of 1.0 mL min–1 at 40 °C. The gradient conditions were optimized as follows:
0–25 min, 10∼50% A; 25–40 min, 50∼95%
A; 40–45 min, 95% A; 45–47 min, 95∼100% A; 47–52
min, 100% A. Spectra were measured at a wavelength of 320 nm.
Enzymatic Assay of Moracin M Synthase In Vitro
To extract crude enzymes from mulberry
samples, a portion (0.1 g) of fresh tissue was ground with 1 mL of
100 mM sodium phosphate buffer (pH = 7.5) on ice, added with 1% PVPP.
The mixture was then centrifuged at 12 000 g at 4 °C for 20 min, and the supernatant was collected as crude
enzymes and tested for protein concentration using the Bradford method.[41] The enzymatic reaction was conducted in the
mixture (100 μL) that contained 100 mM sodium phosphate buffer
(pH = 7.5), 500 μM oxyresveratrol as substrate, and 50 μg
crude enzymes. The reactions were incubated at 30 °C for 3 h,
then terminated by the addition of 100 μL of methanol and centrifuged
at 15 000g for 20 min. The crude enzymes without
substrate and boiled crude enzymes were used as controls. Supernatants
were analyzed by HPLC using the same method as metabolite analysis,
and the peak area of moracin M was calculated to evaluate the activity
of MMS.
RNA Extraction and cDNA Library Preparation
Total RNA was extracted using the RNAprep Pure Plant Plus Kit (TIANGEN
BIOTECH, Beijing, China) from FR, S, CKL, UVL, BCL (3 biological replicates),
and TR (2 biological replicates). RNA integrity was assessed by 2100
Bioanalyzer (Agilent, Santa Clara, CA, USA). Strand-specific cDNA
libraries were prepared using the NEBNext Ultra Directional RNA Library
Prep Kit for Illumina (NEB, Ipswich, MA, USA) according to the manufacturer’s
instructions. The quality and insert size of cDNAs were determined
by 2100 Bioanalyzer (Agilent, Santa Clara, CA, USA).
Transcriptome Sequencing, De Novo Assembly,
and Annotation
Libraries were sequenced (PE150)
on an illumina Novaseq 6000 sequencer. Raw reads of FASTQ files were
trimmed using fastp,[42] and clean reads
were obtained by removing reads containing adapter, reads containing
ploy-N, and reads of low quality. Clean reads from all libraries were
then combined and assembled into de novo transcriptomes
using Trinity (v2.4.0) in strand-specific mode,[43,44] and unigenes were finally obtained. ORFs were identified by BLAST
in NR and SwissProt protein database or predicted by Estscan (v3.0.3)
for those unmapped or mapped with no predicted sequences.[45] Unigenes were annotated in NR, NT, SwissProt,
KEGG, KOG, Pfam, and GO databases. The KEGG pathway identification
and enrichment were conducted by KOBAS (v2.0.12).[46] The ORFs of unigenes were mapped to the plant TF database
(PlantTFDB) by Hmmsearch (v3.0) using iTAK software (v1.2)[47] to identify the potential TFs.
Analysis of DEGs
Data sets of samples
from same tissue or treatment were considered as a group and differential
expression analysis of two groups was performed using the DESeq2 R
package (v1.6.3).[48] In this study, the
resulting P values were adjusted by the Benjamini–Hochberg
approach to control the false discovery rate. Finally, an adjusted
p-value <0.05 and |log2(FoldChange)| > 1 were used as the threshold
to determine DEGs between two groups.
Expression
Levels and Coexpression Analysis
Clean reads were mapped
to the Trinity unigenes using RSEM (v1.2.15)
with the bowtie2 parameter set at 0 mismatch.[49] The number of mapped clean reads for each unigene was then counted
and normalized into number of FPKM to estimate the expression level
of the unigenes.[50] Pearson correlation
analysis was carried out to identify genes coexpressed with the gene
of interest using the stats package available in R (v1.3.1093) based
on FPKM values.
Quantitative Real-Time
PCR Validation
The expression levels of nine unigenes related
to moracin biosynthesis
were validated by qRT-PCR. Total RNA (1.0 μg) was utilized for
cDNA synthesis using 5X All-In-One RT MasterMix Kit (Applied Biological
Materials Inc., Richmond, British Columbia, Canada) following the
manufacturer’s instructions. The specific qRT-PCR primers were
designed using NCBI Primer-BLAST (https://www.ncbi.nlm.nih.gov/tools/primer-blast/index.cgi?LINK_LOC=BlastHome; Table S9). The qRT-PCR analysis was
performed using BlasTaq 2X qPCR MasterMix (Applied Biological Materials
Inc., Richmond, British Columbia, Canada) on the BIO-RAD CFX Connect
Real-Time system (Bio-Rad, Hercules, CA, USA). Each 20 μL reaction
contained 100 μg cDNA, 10 μL BlasTaq 2X qPCR MM, and 0.25
μM primers. The reaction conditions were as follows: 95 °C
for 3 min, followed by 40 cycles of 95 °C for 15 s and 60 °C
for 60 s. The experiment was conducted in three technical replicates
each, using beta actin gene (XM_010112836.2) as an internal reference
control. The relative gene expression and fold change were calculated
with the 2–ΔΔCt method.[51]
Authors: Brian J Haas; Alexie Papanicolaou; Moran Yassour; Manfred Grabherr; Philip D Blood; Joshua Bowden; Matthew Brian Couger; David Eccles; Bo Li; Matthias Lieber; Matthew D MacManes; Michael Ott; Joshua Orvis; Nathalie Pochet; Francesco Strozzi; Nathan Weeks; Rick Westerman; Thomas William; Colin N Dewey; Robert Henschel; Richard D LeDuc; Nir Friedman; Aviv Regev Journal: Nat Protoc Date: 2013-07-11 Impact factor: 13.491
Authors: Cole Trapnell; Brian A Williams; Geo Pertea; Ali Mortazavi; Gordon Kwan; Marijke J van Baren; Steven L Salzberg; Barbara J Wold; Lior Pachter Journal: Nat Biotechnol Date: 2010-05-02 Impact factor: 54.908