Ramaraj Sathasivam1, Sun Ju Bong1, Chang Ha Park1, Ji Hyun Kim2, Jae Kwang Kim2, Sang Un Park1,3. 1. Department of Crop Science, Chungnam National University, 99 Daehak-ro, Yuseong-gu, Daejeon 34134, Republic of Korea. 2. Division of Life Sciences, College of Life Sciences and Bioengineering, Incheon National University, 119 Academy-ro, Yeonsu-gu, Incheon 22012, Republic of Korea. 3. Department of Smart Agriculture Systems, Chungnam National University, 99 Daehak-ro, Yuseong-gu, Daejeon 34134, Republic of Korea.
Abstract
Watercress (Nasturtium officinale R. Br.) is an important aquatic herb species belonging to the Brassicaceae family. It has various medicinal properties and has been utilized for the treatment of cancer and other diseases; however, currently available genomic information regarding this species is limited. Here, we performed the first comprehensive analysis of the carotenoid biosynthesis pathway (CBP) genes of N. officinale, which were identified from next-generation sequencing data. We identified and characterized 11 putative carotenoid pathway genes; among these, nine full and two partial open reading frames were determined. These genes were closely related to CBP genes of the other higher plants in the phylogenetic tree. Three-dimensional structure analysis and multiple alignments revealed several distinct conserved motifs, including aspartate or glutamate residues, carotene-binding motifs, and dinucleotide-binding motifs. Quantitative reverse transcription-polymerase chain reaction results showed that the CBP was expressed in a tissue-specific manner: expression levels of NoPSY, NoPDS, NoZDS-p, NoCrtISO, NoLCYE, NoCHXE-p, and NoCCD were highest in the flower, whereas NoLCYB, NoCHXB, NoZEP, and NoNCED were highest in the leaves. Stems, roots, and seeds did not show a significant change in the expression compared to the leaves and flowers. High-performance liquid chromatography analysis of the same organs showed the presence of seven distinct carotenoid compounds. The total carotenoid content was highest in the leaves followed by flowers, seeds, stems, and roots. Among the seven individual carotenoids, the levels of six carotenoids (i.e., 13-Z-β-carotene, 9-Z-β-carotene, E-β-carotene, lutein, violaxanthin, and β-cryptoxanthin) were highest in the leaves. The highest content was observed for lutein, followed by E-β-carotene, and 9-Z-β-carotene; these carotenoids were much higher in the leaves compared to the other organs. The results will be useful references for further molecular genetics and functional studies involving this species and other closely related species.
Watercress (Nasturtium officinale R. Br.) is an important aquatic herb species belonging to the Brassicaceae family. It has various medicinal properties and has been utilized for the treatment of cancer and other diseases; however, currently available genomic information regarding this species is limited. Here, we performed the first comprehensive analysis of the carotenoid biosynthesis pathway (CBP) genes of N. officinale, which were identified from next-generation sequencing data. We identified and characterized 11 putative carotenoid pathway genes; among these, nine full and two partial open reading frames were determined. These genes were closely related to CBP genes of the other higher plants in the phylogenetic tree. Three-dimensional structure analysis and multiple alignments revealed several distinct conserved motifs, including aspartate or glutamate residues, carotene-binding motifs, and dinucleotide-binding motifs. Quantitative reverse transcription-polymerase chain reaction results showed that the CBP was expressed in a tissue-specific manner: expression levels of NoPSY, NoPDS, NoZDS-p, NoCrtISO, NoLCYE, NoCHXE-p, and NoCCD were highest in the flower, whereas NoLCYB, NoCHXB, NoZEP, and NoNCED were highest in the leaves. Stems, roots, and seeds did not show a significant change in the expression compared to the leaves and flowers. High-performance liquid chromatography analysis of the same organs showed the presence of seven distinct carotenoid compounds. The total carotenoid content was highest in the leaves followed by flowers, seeds, stems, and roots. Among the seven individual carotenoids, the levels of six carotenoids (i.e., 13-Z-β-carotene, 9-Z-β-carotene, E-β-carotene, lutein, violaxanthin, and β-cryptoxanthin) were highest in the leaves. The highest content was observed for lutein, followed by E-β-carotene, and 9-Z-β-carotene; these carotenoids were much higher in the leaves compared to the other organs. The results will be useful references for further molecular genetics and functional studies involving this species and other closely related species.
Watercress
(Nasturtium officinale R. Br.) belongs
to the Brassicaceae family, and
it is cultivated worldwide in regions of Australia, Europe, India,
North America, southern Africa, and sub-Saharan Africa, as a perennial
herb or as an edible aquatic plant because of its nutraceutical properties.[1] This raw leafy vegetable can be eaten in the
form of salads, or it can be cooked and consumed in the same manner
as other vegetables and used as ingredients in soups as well as a
garnish in other dishes.[2]N. officinale is rich in vitamins A, B, C, and E;
in addition, it is also rich in carotenoids, flavonoids, folic acid,
glucosinolates, phenolics, protein, and minerals such as iodine, iron,
calcium, and sulfur compounds.[3,4] Recently, several studies
have reported its antidiabetic, anticancer, anti-inflammatory, and
antioxidant properties.[4,5]Among these components,
carotenoids are a natural pigment that
plays an important role in the photosynthetic organelles of algae,
mosses, ferns, and other higher plants.[6] In addition, they are also found in the membranes of photosynthetic
bacteria such as phototropic bacteria and cyanobacteria.[7] Carotenoids have received great attention because
of their various health benefits and their ability to protect the
skin against damaging free radicals, age-related macular degeneration,
cancer, and cardiovascular diseases, as well as boosting immune system
function.[8,9] Recently, several reviews have summarized
genes involved in the transcriptional regulation of the carotenoid
biosynthesis pathway (CBP) in plants.[7,10] The regulation
of the CBP genes at the transcriptional level is critically important
for the syntheses of photosynthetic pigments and plant hormones. The
primary steps in the CBP are the conversion of two geranylgeranyl
pyrophosphate (GGPP) molecules by the enzyme phytoene synthase (PSY)
to generate phytoene by the condensation process. This step is the
most important rate-limiting step in the CBP.[8] Following these steps, the enzymes of phytoene desaturase (PDS),
ζ-carotene isomerase (Z-ISO), ζ-carotene desaturase (ZDS),
and carotenoid isomerase (CrtISO) carry out consecutive desaturations
and result in the production of all-trans-lycopene.
Furthermore, lycopene ε-cyclase and lycopene β-cyclase
carry out cyclization, which results in the production of α-carotene
and β-carotene; this step is a critical branch-point in the
CBP.[8] Both α-carotene and β-carotene
undergo sequential hydroxylation by β-ring carotene hydroxylase
(CHXB) and ε-ring carotene hydroxylase (CHXE), which results
in the synthesis of lutein and zeaxanthin, respectively. β-cryptoxanthin
was produced as an intermediate product during the hydroxylation of
zeaxanthin. After this step, zeaxanthin is converted to antheraxanthin
and violaxanthin by two successive epoxidations with the same enzyme
zeaxanthin epoxidase (ZEP), and violaxanthin can be converted back
to antheraxanthin and zeaxanthin by deepoxidations with the enzyme
violaxanthin de-epoxidase. The last step in the CBP is the conversion
of violaxanthin into 9-cis-neoxanthin via the enzyme
neoxanthin synthase (NXS).[7] In one branch,
conversion of antheraxanthin to capsanthin occurred, and in the next
step, the conversion of violaxanthin to capsorubin was transformed
with the help of enzyme capsanthin-capsorubin synthase.[7] In the other branch of the pathway, β-carotene,
β-cryptoxanthin zeaxanthin, violaxanthin, and 9-cis-neoxanthin are catabolized into volatile and nonvolatile apocarotenoids
by substrate cleavage enzymes, such as carotenoid cleavage dioxygenases
(CCDs) and 9-cis-epoxycarotenoid dioxygenase (NCED)
(Figure ).
Figure 1
Schematic view
of the CBP in Nasturtium officinale. Enzymes are shown in blue, and the pink asterisk represents the
gene identified and characterized in this study. Solid black arrows
denote biosynthesis, and dotted black arrows denote degradation of
carotenoids. IPI-Isopentenyl pyrophosphate isomerase. The pathway
scheme is adapted and modified from the study by Sathasivam et al.[7]
Schematic view
of the CBP in Nasturtium officinale. Enzymes are shown in blue, and the pink asterisk represents the
gene identified and characterized in this study. Solid black arrows
denote biosynthesis, and dotted black arrows denote degradation of
carotenoids. IPI-Isopentenyl pyrophosphate isomerase. The pathway
scheme is adapted and modified from the study by Sathasivam et al.[7]These CBP genes have
been identified and characterized in several
plants such as Arabidopsis, Chinese
cabbage, citrus, Ixeris dentate, papaya, Scutellaria baicalensis, strawberry, and wolfberry.[11−19] To date, there have only been a few studies regarding the molecular
biology of N. officinale,[1,20,21] and there are no published reports
regarding the characterization and gene expression of CBP genes in N. officinale. In our laboratory, a cDNA library
was constructed from the N. officinale seedling, and the transcript sequences were assembled using the
trinity package (http://trinityrnaseq.github.io) and evaluated by Transrate S/W (http://trinityrnaseq.github.io). The raw read transcriptome sequences were submitted to the National
Center for Biotechnology Information (NCBI), sequence read archive
(SRA) database under the accession number SRR3490957.[20] This study aimed to use these transcriptomic data to identify
CBP genes in N. officinale.This
is the first report to identify and characterize the CBP genes
(NoPSY, NoPDS, NoZDS-p, NoCrtISO, NoLCYB, NoLCYE, NoCHXB, NoCHXE-p, NoZEP, NoCCD, and NoNCED) in N. officinale. To validate the spatial distribution
of the transcripts of CBP genes, we examined gene expression in various
organs of N. officinale using quantitative
reverse transcription-polymerase chain reaction (qRT-PCR). In addition,
we analyzed the distribution of seven carotenoid compounds in the
various organs of N. officinale using
high-performance liquid chromatography (HPLC). The present study would
improve our understanding of the differential carotenoid accumulation
and biosynthetic pathway genes in various organs of N. officinale that are beneficial for human health.
Our results can then increase our understanding of CBP genes and allow
us to explore strategies that could improve the anticarcinogenic properties
of N. officinale.
Results
and Discussion
In Silico Identification,
Protein Nomenclature,
and Sequence Analysis of CBP Genes
CBP genes were identified
from the transcriptomic data of N. officinale. Conserved regions of the sequences from previously identified and
classified CBP genes from higher plants were used as queries to search
against N. officinale transcript databases
using the NCBI BLASTN (Basic Local Alignment Search Tool Nucleotide)
program. The specific genes were identified and were subjected to
NCBI’s open reading frame (ORF) finder program to identify
whether the gene possesses the full ORF with a maximum nucleotide
length. The full ORF with the highest number of nucleotides was then
retrieved and subjected to structural and functional characterization
by searching for conserved regions; moreover, the CBP gene that did
not possess the full ORF was also taken for the characterization studies.
In total, nine full ORF (NoPSY, NoPDS, NoCrtISO, NoLCYB, NoLCYE, NoCHXB, NoZEP, NoCCD, and NoNCED) CBP genes were determined, whereas NoZDS-p and NoCHXE-p showed partial ORFs
in the cDNA library used in this study. All were submitted to GenBank
(Figure S1 and Table ). The GGPPS and Z-ISO genes were identified from the transcriptomic data;
however, they possess a short ORF so these two genes were not taken
for further analysis. The expression levels of these genes in the
transcriptomic data are shown in Table S1. The predicted molecular weights and their estimated isoelectric
points are shown in Table . The predicted molecular weights of some N.
officinale CBP proteins are in accordance with those
previously reported from other higher plant species such as Brassica napus,[22]Chelidonium majus,[23]Lycium chinenses,[14]S. baicalensis,[18,24] and I. dentate.[15] Signal IP
analyses showed that the maximum values of the original shearing site
(C score) were seen in NoCCD followed by NoZDS-p, NoCHXB, NoPDS, NoLCYE, NoPSY, NoNCED, NoZEP, NoCrtISO, NoLCYB, and then NoCHXE-p (P450 type), whereas maximum
values of the synthesized shearing site (Y score) were highest in NoCCD followed by NoZDS-p, NoPSY, NoLCYE, NoCHXB, NoNCED, NoPDS, NoCrtISO, NoLCYB, NoZEP, and NoCHXE-p. The maximum
values of the signal peptide (S score) were found in NoZDS-p followed by NoPSY, NoLCYE, NoCCDs, NoNCED, NoCHXB, NoCrtISO, NoPDS, NoLCYB, NoZEP, and NoCHXE-p (Table S2). No transmembrane region was detected
in identified CBP genes of N. officinale. A similar result indicating that the CBP genes do not possess any
transmembrane region was found in the green algae Tetraselmis
suecica(25) and higher plants C. majus.[23] Similarly,
some CBP genes in the higher plants such as Banana (MaPSY1 and MaPSY3), wheat (TaPSY3), and Brassica napus (BnCCD) also do not
possess any transmembrane region.[26−28] Homology analysis using
CDD showed that the CBP amino acid sequences had high similarity with
other higher plant species, including the sequences of amino acids
66–407 for NoPSY, 4–564 for NoPDS, 1–137 for NoCrtISO, 56–501
for NoLCYB, 1–526 for NoLCYE, 1–303 for NoCHXB, 1–249 for NoZEP, 1–170 for NoCCD, 5–527
for NoNCED, 34–225 for NoCHXE-p, and 11–318 for NoZDS-p. In green algae, Dunaliella salina homology analysis of CBP genes by CCD
showed high homology with other microalgae and higher plants.[27,29,30] From these results, it can be
inferred that N. officinale CBP genes
are highly conserved when compared to genes of other higher plants
and green algae.
Table 1
Molecular Characterization of CBP
Genes in Nasturtium officinalea
To investigate molecular evolutionary
relationships between N. officinale CBP proteins and other higher CBP sequences,
a neighbor-joining phylogenetic tree was constructed using all 11
CBP sequences and a set consisting of each specific CBP sequence.
Phylogenetic analysis of each sequence showed that the N. officinale CBP proteins formed a cluster with
other higher plants, whereas bacteria, chlorophyte, dinoflagellates,
and heterokonts formed separate clades (Figure S2). Similar results were obtained from several studies; the
phylogenetic analysis of plant CBP sequences with other species showed
that they formed a separate cluster with higher plants.[24,31,32] In addition, we constructed a
phylogenetic tree with 11 CBP protein sequences shared among the eight
species that show a high bootstrap value which indicated that the
CBP genes are highly conserved. Hence, these identified 11 CBP proteins
in N. officinale have a similar putative
conserved function to that of the other higher plants, especially Arabidopsis thaliana (Figure ). This supports the previous study result
that the N. officinale transcriptomic
data showed the highest similarity and annotation ratio to A. thaliana.[20] Pairwise
identity matrix of all the CBP sequences shared sequence identities
with the A. thaliana amino acid sequence
(Figure and Table S3). In addition, bacteria, chlorophyta,
dinoflagellates, and heterokonts showed less sequence identity when
compared to N. officinale CBP amino
acid sequences (Table S3). Previous studies
reported that the CBP amino acid sequences of C. majus, S. baicalensis, and I. dentata showed high similarity with those in higher
plant species.[15,16,23,24] These results clearly showed that CBP genes
may share higher sequence identities with higher plants, indicating
that in higher plants, these identified CBP proteins are highly conserved
in sequences and have a similar function to that of the other higher
plants especially A. thaliana.
Figure 2
Phylogenetic
analysis based on the concatenated amino acid sequences
of 11 CBP genes. The NJ phylogenetic tree was drawn with the Poisson-correction
distance. The number at each node denotes the percentage in the bootstrap
analysis (1000 replicates), whereas the numbers below the branch points
represent bootstrap values. The outgroup is Eutrema
salsugineum.
Phylogenetic
analysis based on the concatenated amino acid sequences
of 11 CBP genes. The NJ phylogenetic tree was drawn with the Poisson-correction
distance. The number at each node denotes the percentage in the bootstrap
analysis (1000 replicates), whereas the numbers below the branch points
represent bootstrap values. The outgroup is Eutrema
salsugineum.
Multiple Alignments, Tertiary Structure, and
Protein Localization Analysis
Multiple alignments and predicted
three-dimensional (3D) structures of N. officinale CBP proteins showed highly conserved domains compared to the other
higher plants[15,19,33] and microalgae[34,35] (Figures , 4,
and S3). It is well known that protein
function mainly depends on its 3D structure and its stability.[36] The results showed that the conformations of
α and β secondary structural elements and substrate-binding
pockets were similar to those of A. thaliana, C. reinhardtii, and D. salina (data not shown). However, we found slight
structural differences in the CBP proteins in the variable loop regions
of the models; this might be due to their sequence identities being
relatively low.[37] This supports the multiple
alignment and percent identity results of this study (Figure S3 and Table S3).
Figure 3
Predicted 3D structure
of upstream CBP genes of Nasturtium officinale. (A) NoPSY, (B) NoPDS, (C) NoZDS-p (partial
ORF), and (D) NoCrtISO structures were generated
using Chimera 1.14 software.[63] The amino
(NH2) and carboxyl (COOH) terminals are presented in blue
and dark red, respectively. In these 3D structures, α-helices
and β-strands are shown in light sea green and hot pink, respectively.
For the sequence alignment of each gene, see Φιγυρε Σ3.
Figure 4
Predicted 3D structure
of downstream CBP genes of Nasturtium officinale. (A) NoLCYB, (B) NoLCYE,(C) NoCHXB-p, (D) NoZEP, (E) NoCCD, and (F) NoNCED structures were generated using
Chimera 1.14 software.[63] The amino (NH2) and carboxyl (COOH)
terminals are shown in blue and dark red, respectively. In these 3D
structures, α-helices and β-strands are shown in light
sea green and hot pink, respectively. For sequence alignment of each
gene, see Φιγυρε Σ3.
Predicted 3D structure
of upstream CBP genes of Nasturtium officinale. (A) NoPSY, (B) NoPDS, (C) NoZDS-p (partial
ORF), and (D) NoCrtISO structures were generated
using Chimera 1.14 software.[63] The amino
(NH2) and carboxyl (COOH) terminals are presented in blue
and dark red, respectively. In these 3D structures, α-helices
and β-strands are shown in light sea green and hot pink, respectively.
For the sequence alignment of each gene, see Φιγυρε Σ3.Predicted 3D structure
of downstream CBP genes of Nasturtium officinale. (A) NoLCYB, (B) NoLCYE,(C) NoCHXB-p, (D) NoZEP, (E) NoCCD, and (F) NoNCED structures were generated using
Chimera 1.14 software.[63] The amino (NH2) and carboxyl (COOH)
terminals are shown in blue and dark red, respectively. In these 3D
structures, α-helices and β-strands are shown in light
sea green and hot pink, respectively. For sequence alignment of each
gene, see Φιγυρε Σ3.The predicted 3D structure
of N. officinale CBP genes possesses
a central hydrophobic substrate-binding pocket
which was folded by α-helices and β-sheet strands; notably,
the binding pocket was almost covered within the core of the α-helices.
In addition, other domains including the aspartate-rich domain (ARD),
carotene-binding domain (CBD), and dinucleotide-binding domain (DBD)
signature motifs were found near the cavity, which may be required
for enzyme activity.[29] In detail, the key
upstream pathway enzyme NoPSY possesses a conserved trans-isoprenyl
diphosphate synthase domain and ARD in its structure. This result
agreed with previous studies that showed the presence of these conserved
domains in higher plants such as I. dentate and S. baicalensis.[15,18] The second important gene in the CBP is NoPDS, which possesses both
the CBD and DBD in its structure, whereas NoZDS-p shares similar identical features to NoPDS, which consists of a
CBD in the C-terminal region and a DBD in the N-terminal region. This
result was consistent with a previous study which showed that higher
plants (Carica papaya, C. majus, I. dentate, and S. baicalensis) and marine green
algae (D. salina) had PDS and ZDS consisting
of these domains in their structures.[15,18,23,35]Moving on to
the downstream pathway genes, both NoLCYB and NoLCYE contain a DBD that is found in all lycopene
cyclases and helps to bind flavin adenine dinucleotide (FAD). Moreover,
a plant β-conserved region was also found in plant-type cyclases
(CrtL) but not in bacterial CrtYm, and this may play a crucial role
in the specific interaction between the cyclase and components of
the membrane-associated enzymes.[38] In addition,
three well-conserved regions, namely, cyclase motif 1, 2, and charged
regions, were also found, and these are potentially involved in substrate
binding and catalysis.[11] Similar LCYB and
LCYE conserved domains were also found in some higher plant species
(Arabidopsis, Capsicum
annuum, and C. majus) and green algae (Haematococcus pluvialis).[11,23,38,39] The common gene responsible for both the upstream
and downstream branches of the CBP is NoCHXB, which
consists of four histidine domains that may be involved in Fe2+ adhesion while hydroxylation takes place.[15] Similarly, NoCCD and NoNCED consist of four highly conserved histidine residues (Figures , 4, and S3); this result was similar to
the structural result obtained from Citrus CCD4a, CCD4b1, and CCD4c.[40] Previous studies reported that these four histidine residues are
involved in coordinating the Fe2+ cofactor required for
activity and the aspartate or glutamate moieties that fix the positions
of the histidines.[41,42] From the multiple alignments
and 3D structure analysis results, it can be inferred that most of
the N. officinale CBP genes are highly
conserved and that the genes are generally closely related to higher
plants and algae. However, further detailed studies are required to
understand the functions of the N. officinale CBP proteins identified in this study.CBP sequences of N. officinale were
analyzed using CELLO, ChloroP 1.1, TargetP 1.1, and WoLF PSORT web-based
programs to predict the subcellular location of these proteins. Most
of the N. officinale CBP proteins were,
through consensus, predicted to be targeted to the chloroplast, whereas
some of the CBP proteins were targeted to the cytoplasm or to the
mitochondrion (Table ). In A. thaliana, transgenic sweet
potato, and in some other plants, most of the CBP genes were localized
within the chloroplast, which show results similar to the results
of this study.[23,30,43] From these, we found that all the N. officinale CBP proteins share a highly conserved region with higher plants,
as such subcellular location prediction also showed similar results
to those obtained from the higher plants.
Table 2
Subcellular-Localization
Predictions
of N. officinale CBP Genesa
Comparison of N. officinale CBP Gene Sequences with Other Plant
CBP Sequences and Its Impact
on Carotenoid Accumulation
In several crop species, PSY is
one of the most important and rate-limiting enzymes in the CBP. NoPSY genes consist of five distinctive PSY motifs (Figure S3), and they possess
a putative PSY active site (Figure ). When compared to A. thaliana the highest level of sequence difference was found at the chloroplast
transit peptide (TP) in the N-terminal region (Figure S3). This result was similar to the previous study
result observed in Brassica and other
PSY sequences.[22] Because of high-sequence
variability at the N-termini, TPs are well recognized and bound by
protein import complexes (translocons) in the Toc (translocon at the
outer envelope membrane of chloroplasts) and Tic (translocon at the
inner envelope membrane of chloroplasts),[22] which efficiently targets the nuclear-encoded proteins to plastids.[44] Various types of translocons are drawn together
in plastids of diverse tissue types (e.g., photosynthetic vs nonphotosynthetic)
and developmental stages.[45] However, no
functional prediction of TP was performed still, but, it is now possible
merely based on TP sequence data. In addition, a previous study reported
that in tomatoes, a mutation in the 192 position of the PSY1 gene (P192L) causes a change in the proline to leucine substitution
which leads to delayed accumulation of carotenoids in the fruit; this
might be due to a decrease of PSY enzymatic activity. In addition,
they found that in the P192L mutant the accumulation of phytoene,
lycopene, and β-carotene was much lower when compared to the
nonmutated line.[46] Comparison of NoPSY with other PSY genes showed that
there was no mutation that occurs at the 192 position of the NoPSY
genes (Figure ), which
leads to accumulation of carotenoids in N. officinale (Figure ). The second
most important enzyme in the CBP is PDS. NoPDS contained GXGX2GX3AX2LX3GX6EX5GG, a secondary structure
consisting of a β sheet-α helix-β sheet configuration
which is called DBD fold, which shares high similarity with A. thaliana (Figures S2 and S3). However, the structure-functional association of PDS remains unclear,
and in the future, it needs to be characterized.[47]LCYB and LCYE are the
two major types of cyclase in plants. A catalytically active domain
(involved in the FAD-binding site, along with the substrate-binding
site) and a transmembrane domain were discovered through the structure
analysis of NoLCYB. In addition, the NoLCYB protein also possesses the FAD domain at 77–467 amino acid
residues (Figure S3). Previous studies
reported that lycopene cyclase uses FAD as a cofactor.[48] Moise et al.[49] reported
that most of the enzymes involved in the transformation of lycopene
and carotene are membrane-bound. Similarly, in NoLCYB the binding site is close to the membrane helix and the FAD-binding
domain is present near the enzyme (data not shown). Functional analysis
of LCY-B2 in the yellow-fleshed Kapoho variety reaveled
an A/C sequence polymorphism at 607 positions that could result in
the amino acid change and lead to a decrease in the enzymatic function
in red-fleshed papaya.[12] However, the sequence
analysis of NoLCYB showed the presence of nucleotide
‘A’ at the 607 positions (Figure S4), indicating that NoLCYB gene encodes a
fully functional enzyme in N. officinale, leading to the highest accumulation of carotenoid content (Figure ). The CCD belongs to a family of oxygenases, which particularly cleaves the
carotenoids into apocarotenoids (ABA and strigolactones). The NoCCDs contain hydrophobic patches in their structures (Figure S3), which might allow them to interact
with the nonpolar lipids of plastoglobule, where the CCD is localized.[50] In addition, several
amino acids were identified in the hydrophobic patches, and these
might represent interacting structural elements.[33] The comparison of N. officinale CBP gene sequences with other CBP gene sequences showed high sequence
similarity with higher plants. However, the structure-functional relationship
in most of the CBP genes remains unclear. Subsequent functional analysis
will improve our understanding of particular gene functions and interactions.
Figure 5
Comparison
of tomato and watercress PSY nucleotide sequences. The
tomato PSY sequence was retrieved from the previous manuscript published
by Gady et al.[46] (A) Multiple alignments
of the SlPSY and NoPSY protein sequence
were performed with the BioEdit program. The yellow highlighted represents
the change in the amino acid sequence. Predicted 3D structures, (B) SlPSY, and (C) NoPSY. The amino (NH2) and carboxyl (COOH) termini are presented in blue and dark
red, respectively. In these structures, α-helices and β-strands
are shown in light sea green and hot pink, respectively. The changes
in the amino acid sequence in the structural region are indicated.
Figure 6
Relative gene expression profiles of 11 CBP genes of Nasturtium officinale. (A) Transcriptional levels
of CBP genes were analyzed in different tissues such as leaf, stem,
root, flower, and seed using qRT-PCR analysis. The relative gene expression
was calculated using ubiquitin-conjugating enzyme 9 (UBC9). Results are given as the means of triplicates
± SD. Letters a–e denote significant differences (p < 0.05). (B) Heat map showing the expression profiles
of CBP genes in five different tissues namely leaf, stem, root, flower,
and seed. The heat map was generated using fold change values obtained
from qRT-PCR. The tree view of hierarchical clustering was used to
show the organ-specific expression of CBP genes. A gradient color
bar at the top is used to illustrate whether the CBP genes are upregulated
(red) or downregulated (green).
Comparison
of tomato and watercress PSY nucleotide sequences. The
tomato PSY sequence was retrieved from the previous manuscript published
by Gady et al.[46] (A) Multiple alignments
of the SlPSY and NoPSY protein sequence
were performed with the BioEdit program. The yellow highlighted represents
the change in the amino acid sequence. Predicted 3D structures, (B) SlPSY, and (C) NoPSY. The amino (NH2) and carboxyl (COOH) termini are presented in blue and dark
red, respectively. In these structures, α-helices and β-strands
are shown in light sea green and hot pink, respectively. The changes
in the amino acid sequence in the structural region are indicated.Relative gene expression profiles of 11 CBP genes of Nasturtium officinale. (A) Transcriptional levels
of CBP genes were analyzed in different tissues such as leaf, stem,
root, flower, and seed using qRT-PCR analysis. The relative gene expression
was calculated using ubiquitin-conjugating enzyme 9 (UBC9). Results are given as the means of triplicates
± SD. Letters a–e denote significant differences (p < 0.05). (B) Heat map showing the expression profiles
of CBP genes in five different tissues namely leaf, stem, root, flower,
and seed. The heat map was generated using fold change values obtained
from qRT-PCR. The tree view of hierarchical clustering was used to
show the organ-specific expression of CBP genes. A gradient color
bar at the top is used to illustrate whether the CBP genes are upregulated
(red) or downregulated (green).
Expression Levels of CBP Genes in Different
Organs of N. officinale
qRT-PCR
analysis was used to determine the expression patterns of CBP genes
in the different plant organs such as leaves, stems, roots, flowers,
and seeds of N. officinale (Figure ). The result showed
that the CBP genes were constitutively expressed in N. officinale. Among the identified CBP genes, NoCHXB showed the highest expression levels. The key enzyme NoPSY was highly expressed in leaves and flowers, whereas
the lowest levels of expression were observed in the stems, roots,
and seeds. The upstream pathway genes, NoPDS and NoZDS-p, exhibited similar expression patterns to that of NoPSY, which is strongly high in the leaves and flowers,
whereas it was relatively low in the other organs. Among the downstream
CBP genes, NoCrtISO, NoLCYE, NoCHXE-p, and NoCCD were highly expressed in the flowers,
whereas lower levels of expression were observed in the leaves, stems,
roots, and seeds. Expression levels of NoLCYB, NoCHXB, NoZEP, and NoNCED were highest in the leaves, whereas
the lowest expression levels were observed in stems, roots, flowers,
and seeds. This result was consistent with a previous study that showed
that in Brassica rapa, most of the CBP genes are
highly expressed in the flower and leaves.[51] From the gene expression analysis results, most of the N. officinale CBP genes played a similar role to
their orthologs in other species. For instance, in A. thaliana, it was reported that AtPSY, AtPDS, AtZDS, and AtZEP genes play important roles in the CBP.[52] In these studies, most of the CBP genes were significantly expressed
in flowers (NoPSY, NoPDS, NoZDS-p, NoCrtISO, NoLCYE, NoCHXE-p, and NoCCD) and leaves
(NoLCYB, NoCHXB, NoZEP, and NoNCED). Expression profiles of CBP genes
accrued in this study will help contribute to future genetic research
in N. officinale and enhance the carotenoid
content through metabolic engineering.
Analysis
of Carotenoid Content in Different
Organs of N. officinale
Carotenoids
were analyzed and identified through coelution with 20 authentic standards.
Among these, only seven different carotenoids were detected from the
different N. officinale organs (Figure ). The total carotenoid
content in the different organs of N. officinale varied significantly, ranging from 3.44 to 2383.27 μg/g of
dry weight. The highest total carotenoid level was noted in the leaves
followed by stems, flowers, seeds, and roots. Total carotenoid contents
in N. officinale leaves were found
to be 7.57, 10.83, 360.01, and 692.81 times higher than those found
in roots, stems, seeds, and flowers, respectively. Among the seven
carotenoids, levels of six carotenoids, namely, 13-Z-β-carotene, 9-Z-β-carotene, E-β-carotene, lutein, violaxanthin, and β-cryptoxanthin,
were highest in the leaves, whereas in leaves the α-carotene
was not detected. Notably, 9-Z-β-carotene, E-β-carotene, and lutein levels were much higher compared
to the other individual carotenoids.
Figure 7
Carotenoid content in the different tissues
of Nasturtium
officinale. For HPLC analysis, samples were harvested
from 2-month-old plants. Results are given as the means of triplicates
± SD. Letters a–e denote significant differences (p < 0.05).
Carotenoid content in the different tissues
of Nasturtium
officinale. For HPLC analysis, samples were harvested
from 2-month-old plants. Results are given as the means of triplicates
± SD. Letters a–e denote significant differences (p < 0.05).Considering all the watercress
organs, lutein levels in the leaves
were 425.95, 252.97, 12.13, and 6.87 times higher than those in the
root, seed, flower, and stem, respectively, while E-β-carotene content was 1578.74, 74.70, 11.0, and 8.14 times
higher than that in the root, seed, flower, and stem, respectively.
Likewise, 9-Z-β-carotene contents in the leaves
were 1272.2, 454.36, 9.31, and 7.87 times higher than those in the
root, seed, flower, and stem, respectively. In the root, only 9-Z-β-carotene, E-β-carotene,
and lutein were detected. 13-Z-β-carotene levels
were mainly accumulated in the leaf, with its content reaching 8.88
and 7.69 times higher than the levels in the stem and flower, respectively,
whereas it was not detected in root and seed. Interestingly, the violaxanthin
and α-carotene were detected only in leaves and seed, respectively.
The levels of β-cryptoxanthin were slightly higher in the leaves
compared to the flower, whereas it was not detected in stem, roots,
and seed. In terms of overall individual carotenoid content, α-carotene
levels showed the lowest accumulation when compared to other carotenoids
detected in this study (Figure ). This finding is consistent with previous studies conducted
in Allium sativum,[53]B. rapa,[17]C. majus,[23] and M. charantia,[54] wherein it was found that the contents of carotenoids
were significantly high in the leaves compared to the other plant
organs.
Relationship between Carotenoid Content and
Gene Expression
Except for α-carotene, most of the
other individual carotenoids were significantly accumulated in leaves
(Figures and 8); however, the enhanced transcription of few CBP
genes (NoPSY, NoPDS, NoZDS-p, NoCrtISO, NoLCYE, NoCHXE-p, and NoCCD) was observed in N. officinale flowers (Figures and 8). This showed that most of the CBP
gene expression and carotenoid accumulation patterns were not correlated
and that the highest gene expression does not always lead to a significant
accumulation of carotenoids.[24,29] From another point
of view, it can be explained that the CBP is regulated at multiple
levels, not only at the transcriptional level but also at the translational
level.[55] In addition, the CBP gene expression
and carotenoid content are controlled by the combination of cis-regulatory
elements in the upstream promoter region and untranslated regions.[56] Moreover, protein modifications may be one of
the main reasons behind the mismatched accumulation patterns of carotenoid
accumulation and CBP gene expression.[57] Previously, several studies reported that soluble carbohydrates
play a crucial role in carotenoid metabolism.[58] There was strong coordination between the photosynthetic machinery
and carotenoids; this might be due to the hypothesis that carotenoid
pathway gene expression was repressed in leaves in response to high
glucose levels.[58] To support this hypothesis,
Mortain-Bertrand et al.[59] conducted a study
in tomatoes and found that the genes related to the carotenoid and
MEP pathway were repressed because of high glucose levels. In contrast,
few studies have reported that accumulation of sugar in leaves inhibits
photoinhibition which will trigger carotenoid accumulation in leaves.[60,61] This contradictory result was obtained in leaves of N. officinale, which might be due to the high soluble
sugar level present in leaves. These are all the possible reasons
for the mismatch accumulation of carotenoid content and gene expression
in N. officinale.
Figure 8
Overview of carotenoid
pathway gene expression and carotenoid accumulation
changes in different plant organs of N. officinale. Each colored box (left to right) under each gene and compound represents
F-Flower; L-Leaf; R-Root; S-Seed; St-Stem. The scale bar indicates
the transformed average value of gene expression level and metabolites,
and the colored square boxes (gene expression level (green to red)
and carotenoid content (dark blue to dark red)) represent the relative
gene expression level and metabolite abundance in different plant
organs.
Overview of carotenoid
pathway gene expression and carotenoid accumulation
changes in different plant organs of N. officinale. Each colored box (left to right) under each gene and compound represents
F-Flower; L-Leaf; R-Root; S-Seed; St-Stem. The scale bar indicates
the transformed average value of gene expression level and metabolites,
and the colored square boxes (gene expression level (green to red)
and carotenoid content (dark blue to dark red)) represent the relative
gene expression level and metabolite abundance in different plant
organs.
Conclusions
In conclusion, carotenoid metabolism has been widely studied in
plants because of its importance to plants and humans. Here, we present
for the first time comprehensive molecular characterization and analysis
of CBP genes in N. officinale. This
study will therefore improve our understanding of the molecular mechanisms
regulating carotenoid accumulation in N. officinale, and this can subsequently serve as a valuable resource for genetic
manipulation. Interestingly, by in silico analysis, we predict that
most CBP genes were localized in the chloroplast. However, in the
future, further studies are necessary to perform an in vivo localization
study to confirm using an organelle (e.g., chloroplast) specific marker
to validate it. In addition, how each CBP gene contributes to the
dynamic assembly and association of the multifaceted complex carotenoid
metabolons that must form in the suborganellar location also needs
to be studied. The knowledge may facilitate fine modification of the
carotenoids intended for specific organelles and increase the nutritional
content of edible tissues for human benefits.
Experimental
Section
Plant Material
Seeds of N. officinale (Lot No. 3056631) were acquired from
Asia Seed Co, Ltd., Seoul, Republic of Korea. Seeds were sowed in
a plastic pot (size: 11 × 11 cm) filled with commercial perlite.
The pots were kept in the greenhouse of Chungnam National University
(Daejeon, Korea), and the seeds were allowed to grow for 2 months.
Plants were subjected to irrigation following a 2-day interval. Three
biological replicates of the samples were collected from different
plant organs, namely, the leaves, stems, roots, flowers, and seeds.
Harvested samples were flash-frozen in liquid nitrogen and then stored
at −80 °C until RNA extraction and HPLC analysis.
In Silico Identification and Sequence Analysis
of CBP Genes
CBP gene sequences were retrieved from the N. officinale transcriptomic data obtained in our
laboratory. An Illumina NextSeq500 platform was used to analyze the
cDNA using the commercial service of Seeders, Inc. (Daejeon, South
Korea). Raw reads of the transcriptome sequence are available on the
NCBI SRA database under the accession number SRR3490957. Obtained
sequences were then subjected to in silico BLAST on the NCBI database.
Meanwhile, sequences were also analyzed using online servers and public
databases, including the PFAM (http://pfam.xfam.org/search) and NCBI Conserved Domain Database
(http://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi) databases, to predict putative protein signature motifs. Secondary
structure and signal peptide analyses were performed using the SOPMA
program (https://npsa-prabi.ibcp.fr/cgi-bin/npsa_automat.pl?page=/NPSA/npsa_sopma.html) and the SignalP 4.0 Server (http://www.cbs.dtu.dk/services/SignalP-4.0/), respectively. Predicted subcellular locations of the CBP proteins
were identified using the CELLO (http://cello.life.nctu.edu.tw/), ChloroP 1.1 (http://www.cbs.dtu.dk/services/ChloroP/), TargetP 1.1 (http://www.cbs.dtu.dk/services/TargetP-1.1/index.php), and WoLF PSORT (https://wolfpsort.hgc.jp/) tools. The theoretical pI (isoelectric point)/molecular weight
was then calculated using the compute pI/molecular weight tool on
the ExPASy platform (https://web.expasy.org/compute_pi/).
Structural
Analysis of CBP Gene
Multiple
sequence alignment was performed using BioEdit 7.2.5 (Therapeutics,
Carlsbad, CA, USA).[62] CBP protein sequences
were submitted to the Phyre2 online web server (www.sbg.bio.ic.ac.uk/phyre2/html/page.cgi) for homology modeling and for 3D structure analysis. 3D structures
were predicted using Chimera 1.14 software (https://www.cgl.ucsf.edu/chimera/).[63] Conserved signature motifs among
the CBP genes were found using the Multiple Expectation maximizations
for Motif Elicitation tool (http://meme.nbcr.net/).
Phylogenetic Analysis and Percent Identity
Matrix
The phylogenetic tree was constructed using the MEGA7
software.[64] Neighbor-joining (NJ) phylogenetic
trees[65] were constructed using the Poisson
model. Robustness of the trees was estimated by performing 1000 bootstrap
replicates.[66] The percent identity matrix
between the CBP amino acid sequences was calculated using Clustal
Omega (https://www.ebi.ac.uk/Tools/msa/clustalo/), and identities were calculated from the pairwise multiple sequence
alignment.[67]
RNA Extraction
and cDNA Synthesis
Total RNA was extracted from the leaves,
stem, root, flower, and
seed of the plant. Each sample was ground into a fine powder using
liquid nitrogen. Then, 100 mg of each sample was transferred to a
fresh 1.5-mL microcentrifuge tube. Total RNA was extracted using the
Plant Total RNA Mini Kit (Geneaid, Taiwan), according to the manufacturer’s
protocols. RNA concentration and quality were determined using a NanoVue
Plus spectrophotometer (GE Health Care Life Sciences, USA) and through
1% agarose gel electrophoresis, respectively. Extracted total RNAs
were reverse transcribed to cDNA using the ReverTra Ace-α-kit
(Toyobo Co. Ltd., Osaka, Japan), according to the manufacturer’s
protocols; afterward, the cDNAs templates were diluted 20-fold with
nuclease-free water for downstream experiments.
CBP Gene Expression
For qRT-PCR,
the ubiquitin-conjugating enzyme 9 (UBC9) gene was used as an internal control. Specific primers for the N. officinale CBP and UBC9 genes
were designed using the online Primer3 software.[68] Primers used in this study are shown in Table S4. Relative gene expression was calculated using UBC9. qRT-PCR conditions used in our study followed the
protocol described by Tuan et al.[18] For
calculating the gene expression, the ΔCt method was used.[69] The visualization and expression of CBP genes
in the heatmap and hierarchical clustering were analyzed using heatmapper
software.[70] Three biological replicates
were used for all the PCR reactions.
Extraction
of Carotenoids and HPLC Analysis
Carotenoids were extracted
and analyzed using HPLC following the
protocol reported by Ha et al.[71] For HPLC
analysis, 300 mg of fine powder samples were mixed with 3 mL of ethanol
containing 0.1% ascorbic acid (w/v), and the mixture was vortexed
and incubated for 10 min at 85 °C in a water bath. For saponification,
potassium hydroxide (120 μL, 80% w/v) was added, and then the
samples were immediately placed on ice for 5 min to terminate the
reaction. Then, 1.5 mL of ice-cold deionized water and 0.05 mL of
internal standard β-apo-8′-carotenal (1.25 μg)
were added to this mixture. Carotenoids were extracted thrice using
hexane (1.5 mL) and centrifuged at 140 × g for
5 min at 4 °C. The combined extracts were dried under nitrogen
gas and were redissolved in 0.25 mL of 50:50 (v/v) dichloromethane/methanol.
These mixtures were filtered through a 0.50 μm PTFE filter (Advantec,
Tokyo, Japan) into brown screw cap vials (Thermo Fisher Scientific,
USA). The carotenoids were separated using a HPLC Agilent 1100 system
(Massy, France) equipped with a photodiode array detector using a
C30 YMC column (250 × 4.6 mm, 3 μm, Water Corporation,
MA, USA), and the chromatogram is obtained at 450 nm. HPLC conditions
and gradient programs used followed the protocol described by Ha et
al.[71] The concentrations of individual
carotenoids were quantified using the retention time and their co-elution
with β-apo-8′-carotenal, an internal standard, and were
quantitated with reference to the corresponding calibration curves
of standards. All carotenoid standards were purchased from CaroteNature
(Lupsingen, Switzerland).
Statistical Analysis
In this study,
all results are expressed as the mean ± standard deviation (SD)
of three independent biological replicates. Data were all analyzed
by analysis of variance with Duncan’s multiple range tests
to compare means with a significant level of p <
0.05 using the Statistical Analysis System version 9.2 (SAS Institute
Inc., Cary, NC, USA, 2009).
Authors: Jin Jeon; Sun Ju Bong; Jong Seok Park; Young-Kyu Park; Mariadhas Valan Arasu; Naif Abdullah Al-Dhabi; Sang Un Park Journal: BMC Genomics Date: 2017-05-23 Impact factor: 3.969
Authors: Pham Anh Tuan; Yeon Bok Kim; Jae Kwang Kim; Mariadhas Valan Arasu; Naif Abdullah Al-Dhabi; Sang Un Park Journal: EXCLI J Date: 2014-11-03 Impact factor: 4.068
Authors: Sasha Babicki; David Arndt; Ana Marcu; Yongjie Liang; Jason R Grant; Adam Maciejewski; David S Wishart Journal: Nucleic Acids Res Date: 2016-05-17 Impact factor: 16.971
Authors: Ramaraj Sathasivam; Nam Su Kim; Minsol Choi; Haejin Kwon; Bao Van Nguyen; Jae Kwang Kim; Dae Hui Jeong; Eung Jun Park; Hong Woo Park; Sang Un Park Journal: Int J Mol Sci Date: 2022-04-27 Impact factor: 6.208