Fruit ripening is a highly complicated process, which is modulated by phytohormones, signal regulators and environmental factors playing in an intricate network that regulates ripening-related genes expression. Although transcriptomics is an effective tool to predict protein levels, protein abundances are also extensively affected by post-transcriptional and post-translational regulations. Here, we used RNA sequencing (RNA-seq) and tandem mass tag (TMT)-based quantitative proteomics to study the comprehensive mRNA and protein expression changes during fruit development and ripening in watermelon, a non-climacteric fruit. A total of 6,226 proteins were quantified, and the large number of quantitative proteins is comparable to proteomic studies in model organisms such as Oryza sativa L. and Arabidopsis. Base on our proteome methodology, integrative analysis of the transcriptome and proteome showed that the mRNA and protein levels were poorly correlated, and the correlation coefficients decreased during fruit ripening. Proteomic results showed that proteins involved in alternative splicing and the ubiquitin proteasome pathway were dynamically expressed during ripening. Furthermore, the spliceosome and proteasome were significantly enriched by Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis, suggesting that post-transcriptional and post-translational mechanisms might play important roles in regulation of fruit ripening-associated genes expression, which might account for the poor correlation between mRNAs and proteins during fruit ripening. Our comprehensive transcriptomic and proteomic data offer a valuable resource for watermelon research, and provide new insights into the molecular mechanisms underlying the complex regulatory networks of fruit ripening.
Fruit ripening is a highly complicated process, which is modulated by phytohormones, signal regulators and environmental factors playing in an intricate network that regulates ripening-related genes expression. Although transcriptomics is an effective tool to predict protein levels, protein abundances are also extensively affected by post-transcriptional and post-translational regulations. Here, we used RNA sequencing (RNA-seq) and tandem mass tag (TMT)-based quantitative proteomics to study the comprehensive mRNA and protein expression changes during fruit development and ripening in watermelon, a non-climacteric fruit. A total of 6,226 proteins were quantified, and the large number of quantitative proteins is comparable to proteomic studies in model organisms such as Oryza sativa L. and Arabidopsis. Base on our proteome methodology, integrative analysis of the transcriptome and proteome showed that the mRNA and protein levels were poorly correlated, and the correlation coefficients decreased during fruit ripening. Proteomic results showed that proteins involved in alternative splicing and the ubiquitin proteasome pathway were dynamically expressed during ripening. Furthermore, the spliceosome and proteasome were significantly enriched by Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis, suggesting that post-transcriptional and post-translational mechanisms might play important roles in regulation of fruit ripening-associated genes expression, which might account for the poor correlation between mRNAs and proteins during fruit ripening. Our comprehensive transcriptomic and proteomic data offer a valuable resource for watermelon research, and provide new insights into the molecular mechanisms underlying the complex regulatory networks of fruit ripening.
Watermelon (Citrullus lanatus, 2n = 2× = 22) belongs to the Citrullus genus of the Cucurbitaceae family, originated in Africa and is an important cucurbit crop grown throughout the world. Watermelon has emerged as an important experimental model to study the molecular mechanisms of non-climacteric fruits ripening (Erickson et al., 2005). This reflects its economic value and many favorable genetic characteristics, such as a relatively short life cycle, smaller genome size (359.8 Mb), diploid cultivar and stable genetic transformation (Guo et al., 2013). Ripening of fleshy fruits is a highly complex process that involves dramatic changes in sugar content, fruit color, fruit texture, flavor, and aroma (Giovannoni, 2001; Seymour et al., 2013). Depending on the ethylene system II, fleshy fruits are physiologically classified as climacteric fruits and non-climacteric fruits. System I is known as ethylene autoinhibitory and ethylene is produced in vegetative tissues including young fruit, while system II produces greater ethylene during climacteric fruit ripening and is autocatalytic. Climacteric fruits contain System I and System II, whereas non-climacteric fruits only produce ethylene by System I (Liu et al., 2015; Yue et al., 2020). It has long been thought that the ripening of climacteric and non-climacteric fruits is regulated by ethylene and abscisic acid (ABA), respectively (Alexander and Grierson, 2002; Li et al., 2011; Pech et al., 2012; Jia et al., 2013a; Leng et al., 2014; Wang et al., 2017b). In the last 20 years, much progress has been made toward understanding the complicated molecular mechanisms for ethylene-modulated ripening in climacteric fruits, and some key signaling components, including ethylene receptors [LeETHYLENE RECEPTOR 1 (LeETR1), LeETR2, NEVER RIPE (NR), LeETR4, LeETR5, LeETR6, and LeETR7] and several transcription factors [RIPENING INHIBITOR (RIN), COLORLESS NON-RIPENING (CNR), and NON-RIPENING (NOR)], have been identified, which significantly deepen our understanding of the molecular mechanisms of ripening from primary signal perception events to downstream genes expression (Vrebalov et al., 2002; Adams-Phillips et al., 2004; Manning et al., 2006; Klee and Giovannoni, 2011; Yuan et al., 2016; Quinet et al., 2019; Chen et al., 2020). Over the past decade, some ABA signaling components have been reported to be involved in regulation of ripening in non-climacteric fruits (Chai et al., 2011; Jia et al., 2011, 2013b; Han et al., 2015). However, the complex mechanisms and regulatory networks of non-climacteric fruits ripening are still largely unknown.Due to rapid advances in high-throughput sequencing technologies, multi-omic studies have dramatically promoted the dissection of the molecular mechanisms of fruit ripening, such as transcriptomics, proteomics, and metabolomics (Belouah et al., 2019; Umer et al., 2020). Wu et al. (2014a), Li et al. (2015), and Wang et al. (2017a) conducted transcriptomic and proteomic analyses of mango, pear, and citrus during fruit development and ripening, and 2,754, 1,810, and 4,648 proteins were identified or quantified, respectively (Wu et al., 2014a; Li et al., 2015; Wang et al., 2017a). Wu et al. (2014b) performed an integrative analysis of the transcriptome and proteome in the pulp of a spontaneous late-ripening sweet orange mutant and its wild type. A total of 1,839 proteins were identified, and they found that a number of genes displayed inconsistency at the transcript and protein levels (Wu et al., 2014b). Belouah et al. (2019) conducted transcriptomic and proteomic analysesof nine developmental stages in tomato, and a total of 2,375 proteins were quantified. Furthermore, they found that mRNAs and proteins had a poor correlation during ripening (Belouah et al., 2019). In addition, comparisons of mRNA levels, protein abundances and enzymatic activities have revealed low correlations between the metabolome and transcriptome, indicating that transcriptomics is not sufficient to understand protein dynamics (Gibon et al., 2006; Wienkoop et al., 2008). However, a more direct correlation is expected for proteins and metabolites (Wienkoop et al., 2008), making quantitative proteomics to be a powerful approach for establishing functional correlations between phenotypes and genotypes, and characterizing biochemical networks (Palma et al., 2011; Li et al., 2015).Although previous studies have conducted comparative analyses of the transcriptome and proteome during fruit development and ripening, most studies have the phenomenon of smaller amounts of quantified proteins or samples or biological replicates. In this study, we carried out high-throughput RNA sequencing (RNA-seq) and quantitative proteomics of four critical developmental stages in watermelon (three repeats for each stage), and quantified 18,856 genes and 6,226 proteins. The inclusion of more quantitative mRNAs and proteins is more conducive to conduct comparative analyses, which can better reflect the dynamic changes in mRNA-protein correlations during fruit ripening.
Materials and Methods
Plant Materials and Sugar Content Measurement
Watermelon C. lanatus (Thunb.) Matsum. & Nakai subsp. vulgaris cv 97103 was used in this study. In order to make the fruit grow uniformly, we keep only one watermelon per plant. Flowers were hand-pollinated at 3–5 nodes and tagged. Transcriptomic and proteomic analyses were performed on three biological replicates of watermelon center flesh samples collected at four developmental stages [10,18, 26, and 34 DAP (days after pollination), Figures 1A–D], each replicate resulting from the pooling of at least 10 fruits. Samples were rapidly frozen in liquid nitrogen and stored at –80°C. Degrees Brix were measured using a pocket refractometer (model pal-1; ATAGO) from a sample of juice collected from the center of each watermelon.
FIGURE 1
Overview and reproducibility of the proteome and transcriptome. Fruits of the cultivated watermelon 97103 at four different stages of ripening. (A) White [10 days after pollination (10 DAP)], (B) white-pink (18 DAP), (C) red flesh (26 DAP), (D) red-ripe (34 DAP). Sugar content of center flesh (E) and weight (F) statistics of four stages. (G) Venn diagram of identifiable and quantifiable mRNAs and proteins in this study. Heatmaps of Pearson correlation values of the TMT-labeled protein (I) and mRNA (H) abundances of each of the replicates compared to every other replicate. (J) Hierarchical clustering analysis of differentially expressed proteins (DEPs). Principal component analysis (PCA) was used to evaluate the reproducibility of the samples from proteome (L) and transcriptome (K).
Overview and reproducibility of the proteome and transcriptome. Fruits of the cultivated watermelon 97103 at four different stages of ripening. (A) White [10 days after pollination (10 DAP)], (B) white-pink (18 DAP), (C) red flesh (26 DAP), (D) red-ripe (34 DAP). Sugar content of center flesh (E) and weight (F) statistics of four stages. (G) Venn diagram of identifiable and quantifiable mRNAs and proteins in this study. Heatmaps of Pearson correlation values of the TMT-labeled protein (I) and mRNA (H) abundances of each of the replicates compared to every other replicate. (J) Hierarchical clustering analysis of differentially expressed proteins (DEPs). Principal component analysis (PCA) was used to evaluate the reproducibility of the samples from proteome (L) and transcriptome (K).
Total RNA Extraction, RNA-Sequencing Library Construction, Sequencing, and Data Analysis
Total RNA was isolated with a Total RNA Rapid Extraction Kit (Huayueyang Biotechnologies Co. Ltd., Beijing, China). Three biological replicates were carried out for RNA-seq. The RNA-seq libraries were generated using the NEBNext Ultra™ RNA Library Prep Kit, and were sequenced on the Illumina NovaSeq 6000 system using 150 PE mode. The clean reads were aligned to watermelon reference genome[1] using Tophat v2.0.12 (Trapnell et al., 2009). HTSeq v0.6.125 was used to count the read numbers mapped to each gene (Anders et al., 2015). The fragments per kilobase of exon per million mapped fragments (FPKM) was calculated based on the gene length and read count mapped to that gene. Genes with FPKM > 0.3 were considered as expressed genes. The DESeq package (1.18.0) was used to analyze the DEGs (Anders and Huber, 2010). Genes with padj < 0.05 and log2FoldChange > 1 were considered as differentially expressed genes (DEGs). P-values were adjusted using the Benjamini–Hochberg’s approach for controlling the false discovery rate (Benjamini and Hochberg, 1995).
Reverse Transcription-Quantitative PCR
Two micrograms of total RNA was subjected to first-strand cDNA synthesis using a Roche Transcriptor First Strand cDNA Synthesis Kit and an oligo (dT18) primer. Reverse transcription-quantitative PCR (RT-qPCR) was performed with SYBR Premix Ex Taq (Takara, Dalian, China) using a BioRad Real-Time System CFX96TM C1000 Thermal Cycler (Singapore). The ACTIN7 gene was used as an internal control. The primers for RT-qPCR are listed in Supplementary Table 1. For all RT-qPCR analyses, the assays were repeated three times, and the means of four biological experiments were used to estimate gene expression.
Protein Extraction, Trypsin Digestion, and Tandem Mass Tag Labeling
Samples were ground in liquid nitrogen, and then the powder was transferred to 5 mL clean tubes and sonicated by an ultrasonic processor (Ningbo Scientz Biotechnology Co., Ltd., Ningbo, China) in lysis buffer [5 mM DTT, 0.5% Triton X-100, 50 μM PR-619, 3 μM TSA, 50 mM NAM, 2 mM EDTA, 0.5 mM PMSF, 1X protease inhibitor (Roche, Mannheim, Germany)] on ice. Then the same volume of tris-saturated phenol (pH 8.0) was added to the sonicated samples, and vortexed for 5 min. After centrifugation (4°C, 20 min, 8,000 rpm), the upper phase was transferred to a new tube. Five volumes of ammonium sulfate-saturated methanol were added to the samples, which were then mixed well and stored at –20°C for 8 h. After centrifugation (4°C, 20 min, 8,000 rpm), the precipitate was washed once with ice-cold methanol and twice with acetone. We used 8 M urea to redissolve the proteins, and the protein concentration was determined by a BCA kit (Thermo Fisher Scientific, Waltham, MA, United States). Trypsin digestion and TMT labeling were conducted according to a previously published method (Wang et al., 2019). Briefly, the same amount of protein from each sample was used for digestion. An appropriate amount of the standard protein was added, and the volume was adjusted to the same with lysis buffer. TCA was added slowly to a final concentration of 20%, mixed with vortex, and precipitated at 4°C for 2 h. After centrifugation (4°C, 5 min, 4,500 g), discarded the supernatant and washed the precipitate 2–3 times with precooled acetone. After the precipitate was dried, TEAB was added to a final concentration of 200 mM. The precipitate was dispersed by ultrasound. Trypsin was added to the samples at the ratio of 1:50 (trypsin/protein, m/m) for digestion overnight. Samples were reduced with 5 mM DTT for 30 min at 56°C. After that, Iodoacetamide (IAA) was added to a final concentration of 11 mM, and incubated at room temperature for 15 min in the darkness.
HPLC Fractionation, LC-MS/MS Analysis, and Database Search
The tryptic peptides were fractionated through high pH reverse-phase HPLC using Thermo Betasil C18 column (5 μm particles, 10 mm ID, 250 mm length). Then the peptides were separated into 60 fractions by a gradient of acetonitrile (8–32%, pH 9.0). Finally, the peptides were combined into 6 fractions and dried by a vacuum-type centrifuge. LC-MS/MS Analysis and Database Search were conducted according to the previously published method (Huang et al., 2020). Briefly, we subjected the peptides to an NSI source, followed by MS/MS in Q ExactiveTM (Thermo Fisher Scientific, Waltham, MA, United States) coupled with UPLC online. Maxquant search engine (v.1.5.2.8) was used to process the resulting MS/MS data. Tandem mass spectra was searched with the uniprot database. Trypsin/P was specified as cleavage enzyme, and we allowed up to four missing cleavages. About 20 ppm was set for the mass tolerance for precursor ions in First search, and 5 ppm was used in Main search. About 0.02 Da is set for the mass tolerance for fragment ions. FDR was adjusted to <1%.
Data Analysis
The definition of up-regulated differentially expressed protein (DEP) in this study is P value < 0.05 and fold change is more than 1.3, whereas those with a (FC) < 1/1.3 (P value < 0.05) were considered as down-regulated proteins. The Gene Ontology (GO) annotations for the proteome were derived from the UniProt-GOA database.[2] Identified protein IDs were converted to UniProt IDs, and then mapped to GO IDs. If the identified proteins did not exist in the UniProt-GOA database, the alignment method (InterProScan soft, United States) was employed for the GO functional annotation of the protein. Proteins were classified into three categories by GO annotation: Cellular Compartment (CC), Molecular Function (MF), and Biological Process (BP). For each category, a two-tailed Fisher’s exact test was used to test the enrichment of the DEPs. A corrected P value < 0.05 was considered significant for GO analysis. The Kyoto Encyclopedia of Genes and Genomes (KEGG) database was used to identify enriched pathways. KEGG pathways were classified into hierarchical categories according to the KEGG website.For further hierarchical clustering based on DEPs functional classification (such as GO, Pathway, Domain, and Complex), we first collated all the categories obtained after enrichment along with their P values. Then we included the categories that were enriched in at least one of the clusters with a P value < 0.05, and the function x = –log10 (P value) was used to transform the filtered P value matrix. Finally, these x values were z-transformed for each functional category. One-way hierarchical clustering (Euclidean distance, average linkage clustering) was used to cluster z scores in Genesis. Cluster memberships were visualized using the heatmap function of the “gplots” R-package.For correlation coefficient calculation, we first combined the expression levels of proteins and mRNAs based on the corresponding relationship between protein and mRNA IDs, then we obtained the corresponding expression levels of co-quantified mRNAs and proteins. With these data, scatter plots of protein and mRNA expression levels could be drawn. To unify the definition of expression level, protein and transcript expression was log2 transformed before drawing the scatter plot, and the mean value were subtracted. The transformed mean value of the repeated sample was used to draw the scatter plot.For Gene Set Enrichment Analysis (GSEA), we first calculated the Pearson correlation coefficients of each protein and its corresponding transcript, and tabulated these Pearson correlation coefficients. KEGG annotations were used as a set of genes with known functions, and GSEA was conducted to study the main KEGG pathways of different regulatory relationships using the above correlation coefficients according to a previously published method. The pathways with NOM P value < 0.05 were screened to draw the GSEA map (Subramanian et al., 2005; Mertins et al., 2016).
Results
Transcriptomic and Proteomic Profiles During Watermelon Fruit Development and Ripening
Samples were collected at four key developmental stages [10 days after pollination (DAP), 18 DAP, 26 DAP, and 34 DAP] (Figures 1A–D), and the sugar content of the center flesh and fruit weight statistics are shown in Figures 1E,F. For RNA-seq, a total of 81.5 million high-quality, clean and mapped reads were obtained (NovaSeq 6000, PE150), and each sample had at least 5.59 million reads. For quantitative proteomics, a total of 1,080,654 spectrums were generated from the TMT experiments. These high-quality transcriptomic and proteomic datasets provided a solid foundation for our comparative analysis. The repeatability analysis of samples from the proteome and transcriptome showed that the results of mRNA and protein quantification were reliable (Figures 1H–L). A total of 18,856 [83.4%, Fragments per kilobase of exon per million mapped fragments (FPKM > 0.3)] genes were quantified in the flesh, and the number of genes covered by our transcriptomic dataset was comparable to that in other studies (Grassi et al., 2013; Guo et al., 2015). Genes with padj < 0.05 and log2FoldChange > 1 were considered as DEGs. The differences of transcriptome dynamics at four developmental stages were shown in Figures 2A–C (Supplementary Data 1). We also conducted aRT-qPCR assay to validate our RNA-seq results (Supplementary Figure 1). Additionally, a total of 6,410 and 6,226 proteins were identified and quantified by our proteomic data, respectively (Supplementary Data 2). Proteins with a fold change (FC) > 1.3 (P value < 0.05) were assigned as up-regulated proteins, whereas those with a (FC) < 1/1.3 (P value < 0.05) were considered as down-regulated proteins. On the basis of these criteria, 623 proteins were up-regulated and 1,245 proteins were down-regulated in 18 DAP vs 10 DAP group, 202 proteins were up-regulated and 243 proteins were down-regulated in 26 DAP vs 18 DAP group, and 603 proteins were up-regulated and 705 proteins were down-regulated in 34 DAP vs 26 DAP group (Figure 2D and Supplementary Data 2). A total of 2,729 DEPs were found in three groups, and 6 and 14 proteins were co-up-regulated and co-down-regulated in three groups, respectively (Figures 2E,F). Using the Fuzzy c-means algorithm, 1,235 DEPs were grouped into six clusters (clusters 1-6), which represented 45.25% of all DEPs (Figure 3 and Supplementary Data 3). Furthermore, the top two Domain, GO, and KEGG pathway enrichment results for the six clusters were shown on the right sideof Figure 3.
FIGURE 2
RNA-seq and TMT-labeled quantitative proteomics analyses of watermelon fruit development and ripening. (A,D) Differentially expressed genes and proteins statistics of three groups (18 DAP vs 10 DAP, 26 DAP vs 18 DAP, and 34 DAP vs 26 DAP). Venn diagrams of up-regulated mRNAs (B), down-regulated mRNAs (C), up-regulated proteins (E), down-regulated proteins (F) in three groups. (G,H) Hierarchical clustering analysis of KEGG pathways and GO terms (cellular components) in three groups.
FIGURE 3
Fuzzy c-means algorithm and enrichment analysis of DEPs. A total of 1,235 DEPs were divided into six clusters by the Fuzzy c-means algorithm, and the top two enriched items from GO/KEGG/Domain enrichment analysis were displayed on the right side of the corresponding clusters.
RNA-seq and TMT-labeled quantitative proteomics analyses of watermelon fruit development and ripening. (A,D) Differentially expressed genes and proteins statistics of three groups (18 DAP vs 10 DAP, 26 DAP vs 18 DAP, and 34 DAP vs 26 DAP). Venn diagrams of up-regulated mRNAs (B), down-regulated mRNAs (C), up-regulated proteins (E), down-regulated proteins (F) in three groups. (G,H) Hierarchical clustering analysis of KEGG pathways and GO terms (cellular components) in three groups.Fuzzy c-means algorithm and enrichment analysis of DEPs. A total of 1,235 DEPs were divided into six clusters by the Fuzzy c-means algorithm, and the top two enriched items from GO/KEGG/Domain enrichment analysis were displayed on the right side of the corresponding clusters.We further conducted GO and KEGG pathway enrichment analyses of all DEPs. The up-regulated biological pathways in 18 DAP vs 10 DAP group included starch and sucrose metabolism, glycolysis/gluconeogenesis, citrate cycle (TCA cycle), phenylpropanoid biosynthesis and so on. The down-regulated biological pathways in 18 DAP vs 10 DAP group included cell wall, spliceosome, ribosome and so on (Figures 2G,H and Supplementary Datas 4, 5). The up-regulated biological pathways in 26 DAP vs 18 DAP group were glycoside metabolic process, fatty acid degradation, protein processing in endoplasmic reticulum and so on. The down-regulated biological pathways in 26 DAP vs 18 DAP group were cell wall, plant-type cell wall, ribosome, and so on (Figures 2G,H and Supplementary Datas 6, 7). The up-regulated biological pathways in 34 DAP vs 26 DAP group included starch and sucrose metabolism, proteasome, protein processing in endoplasmic reticulum and so on. The down-regulated biological pathways in 34 DAP vs 26 DAP group contained ribosome, suberine and wax biosynthesis, cytoplasmic mRNA processing body assembly and so on (Figures 2G,H and Supplementary Datas 8, 9).
Integrative Analysis of the Proteome and Transcriptome
Considering that protein abundances are not only determined by transcript levels, we conducted a correlation analysis between the transcriptome and proteome during fruit ripening. Firstly, we calculated Pearson correlation coefficients of overlapping mRNAs and proteins. In total, 70.9% of genes were positively correlated, and the mean Pearson correlation coefficient was 0.325 (Figure 4A and Supplementary Data 10). However, we used correlation coefficient to evaluate the correlation between mRNAs and proteins. Secondly, we drew a scatter plot for each period using the log-transformed expression of mRNAs and proteins, and found that correlation coefficients (R2) decreased during fruit development, indicating that the relationship between mRNAs and proteins weakened during ripening (Figure 4B and Supplementary Data 11). Additionally, Venn diagram of identifiable and quantifiable mRNAs and proteins showed that 99% of the quantifiable proteins overlapped with quantifiable mRNAs (Figure 1G and Supplementary Datas 1, 2). However, we found that many up-regulated and down-regulated proteins were not up-regulated and down-regulated at the mRNA level, respectively (Supplementary Figure 2). These results suggested that post-transcriptional and post-translational mechanisms might play critical roles in regulation of fruit ripening-associated genes expression. Thirdly, we conducted GSEA to study the main KEGG enrichment pathways of different regulatory relationships using the above correlation coefficients. The positively correlated KEGG pathways included phenylpropanoid biosynthesis, fructose and mannose metabolism, galactose metabolism, glycolysis/gluconeogenesis and so on. The negatively correlated KEGG pathways were spliceosome, proteasome, protein export, aminoacyl-tRNA biosynthesis and basal transcription factors (Figure 4C and Supplementary Data 12). Finally, we subjected the transcriptomic and proteomic datasets to hierarchical clustering based on Ward’s minimum variance method (Murtagh and Legendre, 2014; Harvald et al., 2017). A total of 2,365 proteins and their corresponding mRNAs were grouped into six clusters, which further illustrated the correlation between mRNAs and proteins (Figure 5A and Supplementary Data 13). In addition, we conducted KEGG pathway enrichment analysis of the six clusters, and the corresponding enrichment pathways were shown in Figure 5B (Supplementary Data 14).
FIGURE 4
Integrative analysis of the proteome and transcriptome. (A) Pearson correlation analysis of overlapping differentially expressed proteins/genes. In total, 70.9% of genes were positively correlated, and the mean Pearson correlation coefficient was 0.325. (B) Correlation plots obtained at each of the four stages, and data were log-transformed before analysis. Correlation coefficients (R2) were given on top of each plot. (C) GSEA of global mRNA-to-protein correlation and enrichment analysis of the corresponding gene set.
FIGURE 5
Hierarchical and KEGG pathway enrichment analyses of DEGs and DEPs. (A) Hierarchical clustering analysis of 2,365 proteins and their corresponding mRNAs. The expression of proteins and corresponding mRNAs were transformed by log2 and subtracted from the mean value, the hclust “ward.D” method was used for cluster analysis. (B) Dot plots of enriched KEGG pathways in the six clusters shown in panel (A). The x-axis shows the fold enrichment of each KEGG pathway, where the color denotes the P value and the size of the dot denotes the number of IDs assigned to each KEGG pathway.
Integrative analysis of the proteome and transcriptome. (A) Pearson correlation analysis of overlapping differentially expressed proteins/genes. In total, 70.9% of genes were positively correlated, and the mean Pearson correlation coefficient was 0.325. (B) Correlation plots obtained at each of the four stages, and data were log-transformed before analysis. Correlation coefficients (R2) were given on top of each plot. (C) GSEA of global mRNA-to-protein correlation and enrichment analysis of the corresponding gene set.Hierarchical and KEGG pathway enrichment analyses of DEGs and DEPs. (A) Hierarchical clustering analysis of 2,365 proteins and their corresponding mRNAs. The expression of proteins and corresponding mRNAs were transformed by log2 and subtracted from the mean value, the hclust “ward.D” method was used for cluster analysis. (B) Dot plots of enriched KEGG pathways in the six clusters shown in panel (A). The x-axis shows the fold enrichment of each KEGG pathway, where the color denotes the P value and the size of the dot denotes the number of IDs assigned to each KEGG pathway.
mRNA and Protein Expression of a Subset of Ripening-Associated Genes
Previous genome-wide association studies (GWAS) and quantitative trait locus (QTL) studies have identified some important candidate genes, which are associated with watermelon development and ripening (Guo et al., 2015, 2019; Branham et al., 2017; Umer et al., 2020). Here, we examined the expression levels of these genes and those of other genes that might modulate fruit ripening at both mRNA and protein levels, such as ABA and ethylene biosynthesis and signaling-related genes, spliceosome-associated genes and proteasome-related genes.Sugar content is determined by both phloem unloading and metabolism in the flesh of cultivated watermelon (Guo et al., 2015). The main sugars of cucurbit plants are stachyose, sucrose and raffinose, which are transported from leaves to fruits in the phloem (Mitchell et al., 1992; Chen et al., 2001). α-galactosidase 2 (AGA2), vacuolar sugar transporter 1 (VST1), tonoplast sugar transporter 2 (TST2), trehalose-phosphate synthase (TPS), sucrose phosphate synthase (SPS), sucrose synthase (SUS), insoluble acid invertase (IAI), raffinose synthase (RFS), and UDP-Gal/Glc PPase (UGGP) were reported to be involved in sugar metabolism and accumulation during fruit ripening (Eastmond et al., 2002; Jia et al., 2013a; Ren et al., 2018, 2020, 2021; Durán-Soria et al., 2020). Our RNA-seq and proteomic results showed that most of these genes were consistently expressed at mRNA and protein levels during fruit ripening except for AGA2 and TPS (Cla97C01G023840) (Figure 6 and Supplementary Datas 1, 2).
FIGURE 6
Heatmap for protein and mRNA expression of a subset of fruit ripening-associated genes. The Stat.mean values represent the averaged magnitude, and direction of fold changes at the gene set level corresponds to the color-coded up-regulated (red) and down-regulated (blue) changes. AGA2, α-galactosidase 2; VST1, vacuolar sugar transporter 1; TST2, tonoplast sugar transporter 2; TPS, trehalose-phosphate synthase; SPS, sucrose phosphate synthase; SUS, sucrose synthase; IAI, insoluble acid invertase; RFS, raffinose synthase; UGGP, UDP-Gal/Glc PPase; PSY, phytoene synthase; ZDS, zeta-carotene desaturase; PHT4;2, phosphate transporter; CCD, carotenoid cleavage dioxygenase; PME, pectin methylesterase; 4CL, 4-coumarate:coenzyme A ligase; PE, pectinesterase; PL, pectate lyase; PG, polygalacturonase; PGI, polygalacturonase inhibitor; XET, xyloglucan endotransglucosylase/hydrolase; MANA, α-mannosidase; ZEP, zeaxanthin epoxidase; NCED, 9-cis-epoxy-carotenoid dioxygenase; PYL, PYR1-like; ACO, ACC oxidase; ETR, ethylene receptor; ERF, ethylene responsive factor; SRSF, serine/arginine-rich splicing factor; PSMD, 26S proteasome non-ATPase regulatory subunit.
Heatmap for protein and mRNA expression of a subset of fruit ripening-associated genes. The Stat.mean values represent the averaged magnitude, and direction of fold changes at the gene set level corresponds to the color-coded up-regulated (red) and down-regulated (blue) changes. AGA2, α-galactosidase 2; VST1, vacuolar sugar transporter 1; TST2, tonoplast sugar transporter 2; TPS, trehalose-phosphate synthase; SPS, sucrose phosphate synthase; SUS, sucrose synthase; IAI, insoluble acid invertase; RFS, raffinose synthase; UGGP, UDP-Gal/Glc PPase; PSY, phytoene synthase; ZDS, zeta-carotene desaturase; PHT4;2, phosphate transporter; CCD, carotenoid cleavage dioxygenase; PME, pectin methylesterase; 4CL, 4-coumarate:coenzyme A ligase; PE, pectinesterase; PL, pectate lyase; PG, polygalacturonase; PGI, polygalacturonase inhibitor; XET, xyloglucan endotransglucosylase/hydrolase; MANA, α-mannosidase; ZEP, zeaxanthin epoxidase; NCED, 9-cis-epoxy-carotenoid dioxygenase; PYL, PYR1-like; ACO, ACC oxidase; ETR, ethylene receptor; ERF, ethylene responsive factor; SRSF, serine/arginine-rich splicing factor; PSMD, 26S proteasome non-ATPase regulatory subunit.Lycopene predominantly accumulates in the red flesh of watermelon (Holden et al., 1999). The contents of lycopene and β-carotene gradually increase during fruit ripening in cultivated watermelon (Guo et al., 2015). Phytoene synthase (PSY), 9-cis-epoxy-carotenoid dioxygenase (NCED), Zeta-carotene desaturase (ZDS), Carotenoid cleavage dioxygenase (CCD), and ClPHT4;2 were reported to play crucial roles in carotenoid biosynthesis (Zhang et al., 2017; Sun et al., 2018; McQuinn et al., 2020). These five genes exhibited an overall up-regulated expression pattern at both mRNA and protein levels (Figure 6 and Supplementary Datas 1, 2).The fruit primary cell wall includes pectin, hemicellulose, cellulose and some structural proteins (Giovannoni, 2001). Fruit softening and textural changes are mainly caused by cell wall depolymerization and solubilization (Giovannoni, 2001; Guo et al., 2015). Pectin methylesterase (PME), 4-coumarate:coenzyme A ligase (4CL), pectinesterase (PE), polygalacturonase (PG), polygalacturonase inhibitor (PGI), pectate lyase (PL), xyloglucan endotransglucosylase/hydrolase (XET), and α-mannosidase (MANA) have been widely reported to participate in fruit softening (Giovannoni, 2001; Yun et al., 2019). We found that most of these genes were highly expressed at 10 DAP, and gradually decreased at both mRNA and protein levels during fruit ripening (Figure 6 and Supplementary Datas 1, 2). Consistently, down-regulated cell wall pathway was enriched by GO enrichment analysis in both18 DAP vs 10 DAP group and 26 DAP vs 18 DAP group.Abscisic acid and ethylene have been regarded as the ripening hormones in non-climacteric and climacteric fruits, respectively (Jia et al., 2011; Chen et al., 2020). However, increasing evidence indicates that ethylene is also involved in regulation of non-climacteric fruits ripening, such as strawberry and citrus (Katz et al., 2004; Trainotti et al., 2005; Iannetta et al., 2006). Our proteomic results showed that two ABA biosynthesis-related genes [zeaxanthin epoxidase (ZEP) and 9-cis-epoxy-carotenoid dioxygenase (NCED)] and two ABA receptors [PYR1-like 8 (PYL8) and PYL9] exhibited an upregulation pattern at both mRNA and protein levels (Figure 6). In addition, we found that one ethylene biosynthesis-related gene ACC oxidase (ACO, Cla97C04G079350), two ethylene receptors (ETRs) and one ethylene responsive factor (ERF, Cla97C05G098050) displayed up-regulated expression pattern, while two ACOs (Cla97C02G043890 and Cla97C09G173910) were down-regulated, and one ERF (Cla97C06G109360) showed an inconsistent expression pattern at the mRNA and protein levels during fruit ripening (Figure 6).Post-transcriptional and post-translational regulations play important roles in gene expression (Kwak et al., 2011; Robles and Quesada, 2019). Post-transcriptional regulation is commonly mediated by many factors, such as RNA-binding proteins (RBPs), small RNAs, and spliceosome (Xue et al., 2021). Serine/arginine-rich (SR) proteins are important parts of the spliceosome and play key roles in alternative splicing (AS) (Sperling, 2019). We found that several SR proteins and other components of spliceosome changed significantly at the protein level during fruit ripening (Figure 6 and Supplementary Table 2 and Supplementary Data 5). Consistently, the spliceosome was enriched by KEGG pathway enrichment analysis in 18 DAP vs 10 DAP group (Figure 2G and Supplementary Data 5). Ubiquitin-Proteasome System (UPS) is usually known as a protein-degrading machine at the post-translational level. In addition, it also regulates gene transcription at many stages, such as regulation of transcriptional elongation and recruitment of activators to the promoter region of genes (Kwak et al., 2011). Our proteomic results showed that UPS-related proteins were differentially expressed during ripening, such as E3 ubiquitin ligases and 26S proteasome non-ATPase regulatory subunit (PSMD) (Figure 6 and Supplementary Tables 3, 4 and Supplementary Datas 2, 4, 9). Consistently, the up-regulated proteasome pathway was significantly enriched by GO and KEGG pathway enrichment analysis (Figures 2G,H and Supplementary Datas 4, 9).
Discussion
Watermelon is an important fruit crop and becomes an ideal model plant for studies on non-climacteric fruit ripening. However, the molecular mechanisms of non-climacteric fruit ripening remain largely unknown. Using the high-throughput RNA-seq and quantitative proteome technology, we quantified 18,856 mRNAs and 6,226 proteins from four key fruit development stages in watermelon. Our comparative analysis of the transcriptome and proteome provides a better understanding of the complex regulatory networks of fruit ripening in watermelon.Gene Ontology enrichment analysis of down-regulated proteins in three groups showed that down-regulated cell wall process was significantly enriched in 18 DAP vs 10 DAP and 26 DAP vs 18 DAP groups but not 34 DAP vs 26 DAP group (Figure 2H and Supplementary Datas 4, 6, 8). This is consistent with the softening process of watermelon flesh. Similar results were also reported in citrus and tomato (Giovannoni, 2004; Wang et al., 2017a), they found that the pectinesterases were highly expressed prior to fruit maturation, and were gradually down-regulated during fruit ripening. These results suggested that fruit softening-related proteins played crucial roles in the early stages of fruit ripening. KEGG pathway enrichment analysis of up-regulated proteins in three groups revealed that the up-regulated starch and sucrose metabolism pathway was enriched in 18 DAP vs 10 DAP and 34 DAP vs 26 DAP groups (Supplementary Datas 5, 9). The up-regulation of starch and sucrose metabolism was consistent with the increase of sugar content during fruit ripening (Figure 1E). Phenolic compounds are produced by the phenylpropanoid pathway and contribute to fruit pigmentation and the disease resistance response in many fleshy fruits (Singh et al., 2010). Consistent with the previous study in pear (Li et al., 2015), our KEGG pathway enrichment analysis also demonstrated that phenylpropanoid biosynthesis was enriched at the early developmental stages of fruit ripening (Figure 2G and Supplementary Data 5).Abscisic acid is a major endogenous factor and the primary signal that regulates the ripening of non-climacteric fruits (Jia et al., 2011; Leng et al., 2014; Wang et al., 2017b; Fenn and Giovannoni, 2021; Kou et al., 2021). Previous studies also reported that ABA played a direct or indirect role in the induction of phenylpropanoid biosynthesis during fruit ripening (Jaakola, 2013; Vighi et al., 2019). We found that two ABA biosynthesis-related genes (ZEP and NCED) and two ABA receptors (PYL8 and PYL9) displayed an up-regulated expression pattern at both mRNA and protein levels during ripening (Figure 6 and Supplementary Data 2), suggesting that ABA signal transduction was enhanced during fruit ripening. The enhanced ABA signaling might account for up-regulated phenylpropanoid biosynthesis at the early stages of watermelon fruit ripening (Supplementary Data 5). Additionally, NCED and PYL9 were reported to be involved in regulation of tomato fruit ripening (Sun et al., 2012; Kai et al., 2019). Taken together, our and previous studies indicated that ABA might play key roles in fruit ripening, especially for non-climacteric fruits. Ethylene has long been known to play a major role in the ripening process of climacteric fruits (Klee, 2002; Chen et al., 2020; Fenn and Giovannoni, 2021). Increasing evidences have indicated that ethylene is also involved in regulation of non-climacteric fruits ripening (Chervin et al., 2004; Trainotti et al., 2005). Our proteomic results showed that ethylene biosynthesis-related genes, ethylene receptors and ethylene signaling regulators were differentially expressed during fruit ripening (Figure 6 and Supplementary Data 2), implying possible roles of ethylene in regulation of watermelon fruit ripening.Fruit set initiation is modulated by the combined accumulation of auxin and GA in many fleshy fruits (Srivastava and Handa, 2005; Fenn and Giovannoni, 2021). In addition, auxin signaling and auxin-GA interaction are key elements influencing fruit growth, such as regulation of cell division and expansion during fruit ripening (Fenn and Giovannoni, 2021). Our GO enrichment analysis showed that auxin and GA biosynthesis/metabolism-related proteins were significantly up-regulated in 18 DAP vs 10 DAP (Supplementary Table 5 and Supplementary Data 4), implying these proteins might play important roles in early stage of watermelon fruit development. Future work will study the potential mechanisms of these proteins in regulation of fruit development, especially whether these proteins affect fruit size.Base on our proteome methodology, integrative analysis of the proteome and transcriptome showed that smaller changes occurred in protein abundances than in mRNA abundances during fruit ripening (Supplementary Datas 1, 2, 11), and the correlation coefficients (R2) decreased during fruit ripening, indicating that the relationship between mRNAs and proteins weakened during the ripening process (Figure 4B). These results were consistent with studies in tomato (Belouah et al., 2019). Taken together, these consistent and solid results in watermelon and tomato indicated that post-transcriptional and post-translational mechanisms might play important roles in the regulation of fruit-related genes expression. Our proteomic results showed that alternative splicing-associated proteins were differentially expressed during fruit ripening, such as SR proteins and RNPs (Figure 6 and Supplementary Datas 2, 5). Furthermore, KEGG pathway enrichment analysis showed that spliceosome-related proteins were down-regulated in 18 DAP vs 10 DAP group, but up-regulated in 26 DAP vs 18 DAP group (Supplementary Table 2 and Supplementary Datas 5, 7). The dynamic expression of these alternative splicing-associated and spliceosome-related proteins indicated that abundant post-transcriptional regulatory events occurred during fruit ripening. Our proteomic results also showed that there were more down-regulated proteins than up-regulated proteins in three groups, and more proteins were down-regulated in 18 DAP vs 10 DAP and 34 DAP vs 26 DAP groups (Figure 2D and Supplementary Data 2). Consistently, GO and KEGG pathway enrichment analysesshowed that proteasome-associated and ubiquitin mediated proteolysis-related proteins were significantly up-regulated in 18 DAP vs 10 DAP and 34 DAP vs 26 DAP groups but not 26 DAP vs 18 DAP group (Supplementary Tables 3, 4 and Supplementary Datas 4, 9), implying that these down-regulated proteins in 18 DAP vs 10 DAP and 34 DAP vs 26 DAP groups might be regulated by proteasomal degradation at the post-translational level, especially for those down-regulated proteins whose expression is not down-regulated at the mRNA level (Supplementary Figure 2). Interestingly, GSEA showed that both spliceosome and proteasome displayed a negative correlation during ripening, indicating that these proteins themselves might be regulated by post-transcriptional and post-translational mechanisms (Figure 4C). Additionally, the weak correlation of spliceosome and proteasome at the mRNA and protein levels was also confirmed by hierarchical clustering analysis (Figure 5). Taken together, the results of our GO and KEGG pathway enrichment analyses, hierarchical clustering analysis and comparative analysis between mRNAs and proteins suggested that post-transcriptional and post-translational mechanisms might play important roles in fruit ripening by regulating the expression of fruit ripening-related genes (Figures 2G, 4B,C, 5 and Supplementary Tables 2–4 and Supplementary Datas 4, 5, 7, 9). These post-transcriptional and post-translational regulations might account for the poor correlation between mRNAs and proteins during ripening. Indeed, we found that at least 6,266 genes underwent AS during watermelon fruit ripening (data not shown). Most recently, MaMYB16L was reported to regulate starch degradation by alternative splicing in banana fruit during ripening (Jiang et al., 2021). These results support the potential post-transcriptional and post-translational mechanisms in regulation of fruit ripening, which have also been described in other fruits, such as tomato, sweet orange and citrus (Osorio et al., 2011; Pan et al., 2012; Wu et al., 2014b; Belouah et al., 2019). We will focus on post-transcriptional and post-translational regulation of watermelon development and ripening in future work.
Data Availability Statement
Our RNA-seq data were submitted to the SRA of NCBI under the accession number PRJNA718123. Proteomic data were uploaded to the Pride_EBI database (http://www.ebi.ac.uk/pride), and are available via ProteomeXchange with identifier PXD024490.
Author Contributions
YY and YX conceived and designed the study and wrote the manuscript with contributions from all co-authors. SG and HS provided good advice for data analysis. YR, JZ, and ML collected and grew the plant material. JW, YZ, YC, GG, and HZ provided critical comments and edited the manuscript. ST revised the manuscript and gave good advice for the revised manuscript. All authors approved the manuscript.
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s Note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.