Literature DB >> 32546133

Gene coexpression network analysis and tissue-specific profiling of gene expression in jute (Corchorus capsularis L.).

Zemao Yang1, Zhigang Dai2, Xiaojun Chen2, Dongwei Xie2, Qing Tang2, Chaohua Cheng2, Ying Xu2, Canhui Deng2, Chan Liu2, Jiquan Chen2, Jianguang Su3.   

Abstract

BACKGROUND: Jute (Corchorus spp.), belonging to the Malvaceae family, is an important natural fiber crop, second only to cotton, and a multipurpose economic crop. Corchorus capsularis L. is one of the only two commercially cultivated species of jute. Gene expression is spatiotemporal and is influenced by many factors. Therefore, to understand the molecular mechanisms of tissue development, it is necessary to study tissue-specific gene expression and regulation. We used weighted gene coexpression network analysis, to predict the functional roles of gene coexpression modules and individual genes, including those underlying the development of different tissue types. Although several transcriptome studies have been conducted on C. capsularis, there have not yet been any systematic and comprehensive transcriptome analyses for this species.
RESULTS: There was significant variation in gene expression between plant tissues. Comparative transcriptome analysis and weighted gene coexpression network analysis were performed for different C. capsularis tissues at different developmental stages. We identified numerous tissue-specific differentially expressed genes for each tissue, and 12 coexpression modules, comprising 126 to 4203 genes, associated with the development of various tissues. There was high consistency between the genes in modules related to tissues, and the candidate upregulated genes for each tissue. Further, a gene network including 21 genes directly regulated by transcription factor OMO55970.1 was discovered. Some of the genes, such as OMO55970.1, OMO51203.1, OMO50871.1, and OMO87663.1, directly involved in the development of stem bast tissue.
CONCLUSION: We identified genes that were differentially expressed between tissues of the same developmental stage. Some genes were consistently up- or downregulated, depending on the developmental stage of each tissue. Further, we identified numerous coexpression modules and genes associated with the development of various tissues. These findings elucidate the molecular mechanisms underlying the development of each tissue, and will promote multipurpose molecular breeding in jute and other fiber crops.

Entities:  

Keywords:  Comparative transcriptome analysis; Fiber crop; Jute; RNA-seq; WGCNA

Year:  2020        PMID: 32546133      PMCID: PMC7298812          DOI: 10.1186/s12864-020-06805-6

Source DB:  PubMed          Journal:  BMC Genomics        ISSN: 1471-2164            Impact factor:   3.969


Background

Jute (Corchorus spp.), belonging to the Malvaceae family, is an important natural fiber crop, second only to cotton [1]. Among more than 50 Corchorus species [2], only two (C. capsularis L. and C. olitorius L.) are grown commercially in subtropical and tropical regions [3]. Jute fibers have advantages such as good moisture absorption, fast water dispersion, corrosion resistance, and are mainly used in the textile industry to make clothes, decorations, packaging materials, and other products [3, 4]. Jute is a multipurpose economic crop, and each tissue has its particular usage. For example, jute stalks can be used to make paper and to provide fuel, activated carbon, environmental protection materials, and building composite materials [3]. The leaves can be used as green vegetables and animal feed, and to produce skin care products and herbal medicine [5]. The seeds can be used to extract industrial oil and other products [6]. The versatility of jute in the marketplace will continue to expand, especially in environmental protection, vegetable production, and facial mask manufacturing [7]. These many benefits derive from the different chemical, physical, and biological properties of its various tissues, which are under tissue-specific gene expression control. Understanding the expression and regulation of genes in different tissues will help us to elucidate the molecular mechanisms underlying the development of these tissues [8]. With the rapid development of high-throughput sequencing technology and bioinformatics, tissue-specific gene expression and regulation analyses have been carried out on many crops [9-11]. In particular, weighted gene coexpression network analysis (WGCNA) has recently been widely used to predict the functional roles of gene coexpression modules and individual genes underlying the differences between tissues [12-14]. For example, WGCNA or comparative transcriptomic analysis revealed coexpression modules and dynamics in gene expression involved in stress response [12], seed development [13], and floral bud development [14] in various tissues of crop plants, etc.. WGCNA has become a fascinating integrated and systematic genome-wide approach, focusing on elucidating biological networks and gene function [15]. Recently, high-throughput sequencing technology has greatly promoted the study of jute molecular biology and genetics. For C. capsularis, many molecular markers including single nucleotide polymorphisms and simple sequence repeats have been developed through high-throughput sequencing [4, 16–18]. Several transcriptome studies have been reported in C. capsularis. These uncovered numerous differentially expressed unigenes involved in vegetative growth and development [19], abiotic stress [20] and bast fiber development [21]. However, a systematic and comprehensive transcriptome analysis has not yet been reported for C. capsularis. In this study, we performed a transcriptome analysis of different C. capsularis tissues in two different developmental stages. Our objective was to understand the molecular mechanisms underpinning the development of different tissue types, and to promote multipurpose molecular breeding in jute and other fiber crops.

Results

Transcriptome sequencing

We sequenced 19 RNA samples from jute (Yueyuan5hao) stem bast, leaf, fruit, and flower tissues during differential developmental stages. A total of 943.45 million high-quality reads were generated. Because jute flowers are very small, many flowers were required in order to obtain enough RNA for sequencing studies. Therefore, we combined many flowers for sequencing, whereas the other tissues were sequenced using three biological replicates. The smallest amount of sequencing data was obtained for flower tissue (54.22 million clean reads). We obtained clean reads for all other tissues, ranging from 145.26 to 152.42 million reads. We mapped the clean reads to the C. capsularis reference genome (CCACVL1_1.0); most clean reads from each tissue (> 92.45%) were aligned uniquely to the reference genome (Table 1).
Table 1

RNA sequencing statistics for tissues during two developmental stages in jute

Sample nameClean reads (Millions)Clean bases(Gb)Q20(%)Uniquely mapped (Millions)Uniquely mapped rate (%)
BVGP151.277.7097.1547.9393.48
BVGP247.767.1697.2844.7593.70
BVGP351.177.6896.8147.9493.70
Total of BVGP150.2022.5497.08140.6393.62
BFP150.297.5497.1047.4094.27
BFP245.466.8297.0242.1892.79
BFP353.748.0697.3550.6694.27
Total of BFP149.4822.4297.15140.2493.82
LVGP145.886.8896.9942.8293.33
LVGP246.006.9096.6842.4092.17
LVGP353.388.0096.6649.0891.94
Total of LVGP145.2621.7896.78134.2992.45
LFP152.767.9297.1348.9192.69
LFP248.527.2897.1145.1993.14
LFP351.157.6897.1047.2692.40
Total of LFP152.4322.8897.11141.3692.74
FT1_152.537.8897.1349.6494.48
FT1_247.487.1296.8843.9992.66
FT1_345.506.8296.8742.2492.83
Total of FT1145.5121.8296.96135.8793.37
FT2_152.587.8896.9549.1093.38
FT2_246.096.9297.6042.9693.21
FT2_347.697.1697.5044.2192.70
Total of FT2146.3621.9697.35136.2793.10
MF54.228.1497.2950.9894.03
Total943.45141.54879.63

MF mature flowers, LVGP leaf tissues of vegetative growth period, LFP leaf tissues of flowering period, FT1 Fruits < 0.8 cm in diameter, FT2 Fruits > 0.8 cm in diameter, BVGP bast of vegetative growth period, BFP bast of flowering period. Each tissue with three biological replicates

RNA sequencing statistics for tissues during two developmental stages in jute MF mature flowers, LVGP leaf tissues of vegetative growth period, LFP leaf tissues of flowering period, FT1 Fruits < 0.8 cm in diameter, FT2 Fruits > 0.8 cm in diameter, BVGP bast of vegetative growth period, BFP bast of flowering period. Each tissue with three biological replicates

Global transcriptome analysis of jute

To assess the number of genes expressed in the various tissue types at different stages, we analyzed the reads per kilobase of exon model per million reads (RPKM) of all 29,605 genes identified in this study. An RPKM value greater than one was set as the criteria for gene expression. In stem bast tissues, there was transcriptional activity for 18,320 and 18,268 genes, during the vegetative growth period (“bast tissue during the vegetative growth period”, BVGP) and flowering period (“bast tissue during the flowering period”, BFP), respectively (Additional files 1 and 2: Tables S1–2); in leaf tissues, 17,480 and 18,126 genes were expressed in these two stages, respectively (Additional files 3 and 4: Tables S3–4). During the flowering period, fruits were categorized into two developmental stages (diameter < 0.8 cm, hereafter “FF1”; or diameter > 0.8 cm, hereafter “FF2”), and were used for RNA sequencing; 19,396 and 19,509 genes, respectively, were expressed in these developmental stages (Additional files 5 and 6: Tables S5–6). Furthermore, 17,842 expressed genes (Additional file 7: Table S7) were identified in flowers. In total, 14,943 genes (Additional file 8: Table S8) were expressed across all tissues during the vegetative growth period and flowering period. Differences in gene expression between tissues were visualized using hierarchical cluster analysis based on the RPKM values of the 29,605 genes (Fig. 1).
Fig. 1

Hierarchical cluster analysis of the jute genes that we analyzed. The analysis is based on reads per kilobase of transcript per million mapped reads (RPKM). MF, mature flowers; LVGP, leaf tissues of vegetative growth period; LFP, leaf tissues of flowering period; FT1, Fruits < 0.8 cm in diameter; FT2, Fruits > 0.8 cm in diameter; BVGP, bast of vegetative growth period; BFP, bast of flowering period

Hierarchical cluster analysis of the jute genes that we analyzed. The analysis is based on reads per kilobase of transcript per million mapped reads (RPKM). MF, mature flowers; LVGP, leaf tissues of vegetative growth period; LFP, leaf tissues of flowering period; FT1, Fruits < 0.8 cm in diameter; FT2, Fruits > 0.8 cm in diameter; BVGP, bast of vegetative growth period; BFP, bast of flowering period

Comparative transcriptome analysis of the different tissues and developmental stages

We identified the candidate differentially expressed genes (DEGs) for each tissue by comparison with other tissues at the same developmental stage. Relative to leaf tissues (“leaf tissues during the vegetative growth period”, LVGP), we identified 2035 upregulated and 2231 downregulated genes in BVGP (Additional files 9 and 10: Tables S9–10). In FF1, there were 7108 upregulated and 6059 downregulated genes, relative to BFP; 7782 upregulated and 7074 downregulated genes, relative to leaf tissues during the flowering period (LFP); 281 upregulated and 1438 downregulated genes, relative to flowers; and 192 upregulated and 219 downregulated genes, relative to all other tissue types (Fig. 2a and b). In FF2, there were 5988 upregulated and 5067 downregulated genes, relative to BFP; 6897 upregulated and 6272 downregulated genes, relative to LFP; 256 upregulated and 1376 downregulated genes, relative to flowers; and 210 upregulated and 181 downregulated genes, relative to all other tissue types (Fig. 2c, d). In total, 94 upregulated and 133 downregulated genes were identified in fruit during the FF1 and FF2 developmental stages (Fig. 2e and f). In BFP, we identified 6059 upregulated and 7108 downregulated genes, relative to FF1; 5067 upregulated and 5988 downregulated genes, relative to FF2; 5328 upregulated and 5896 downregulated genes, relative to LFP; and 261 upregulated and 1648 downregulated genes, relative to flowers. In total, 103 upregulated and 184 downregulated genes were identified in stem bastduring the vegetative growth period and flowering period (Fig. 2g and h). In total, 275 upregulated and 207 downregulated genes were identified by comparing leaf tissues with other organ tissues during the vegetative growth period and flowering period (Fig. 2i and j). The fewest DEGs (< 3000 in total) were found in flower tissues relative to other tissues (Fig. 2k and l).
Fig. 2

Venn diagram of differential gene expression between various jute tissues. a-d Upregulated (a and c) and downregulated (b and d) genes in fruit tissues (FF1 and FF2) compared with other tissues at each developmental stage. e, f Genes identified as upregulated (e) and downregulated (e) in fruit tissue (FF1 and FF2) compared with all other tissues at each developmental stage. g, h Upregulated (g) and downregulated (h) genes in stem bast tissues compared with other tissues of the same developmental stage. i, j Upregulated (i) and downregulated (j) genes in leaf tissues compared with other tissues at the same developmental stage. k, l Upregulated and downregulated genes in flower tissues compared with other tissues at the same stage. MF, mature flowers; LVGP, leaf tissues of vegetative growth period; LFP, leaf tissues of flowering period; FT1, fruits < 0.8 cm in diameter; FT2, fruits > 0.8 cm in diameter; BVGP, bast of vegetative growth period; BFP, bast of flowering period

Venn diagram of differential gene expression between various jute tissues. a-d Upregulated (a and c) and downregulated (b and d) genes in fruit tissues (FF1 and FF2) compared with other tissues at each developmental stage. e, f Genes identified as upregulated (e) and downregulated (e) in fruit tissue (FF1 and FF2) compared with all other tissues at each developmental stage. g, h Upregulated (g) and downregulated (h) genes in stem bast tissues compared with other tissues of the same developmental stage. i, j Upregulated (i) and downregulated (j) genes in leaf tissues compared with other tissues at the same developmental stage. k, l Upregulated and downregulated genes in flower tissues compared with other tissues at the same stage. MF, mature flowers; LVGP, leaf tissues of vegetative growth period; LFP, leaf tissues of flowering period; FT1, fruits < 0.8 cm in diameter; FT2, fruits > 0.8 cm in diameter; BVGP, bast of vegetative growth period; BFP, bast of flowering period

Identification of gene coexpression modules

To identify genes and coexpression modules with similar expression profiles related to the development of different tissues, we carried out a WGCNA. To avoid spurious results, low-expression genes (average RPKM< 1) were excluded. In total, 20,012 genes were used in this analysis, and 12 coexpression modules comprising 126 to 4203 genes were identified; and there is a higher correlation among genes in modules (Fig. 3). Further, we investigated the associations between each module and each tissue at different developmental stages, using correlation analysis. Only one module was related to LFP (related module: blue), FF1 (related module: turquoise), and BFP (related module: brown); two modules were related to BVGP (related module: pink and purple), FF2 (related module: turquoise and magenta), LVGP (related module: black and greenyellow), and flowers (related module: greenyellow and red) (Fig. 4a). The turquoise module correlated with both FF1 and FF2. By comparing the genes in the modules related to particular traits to the candidate upregulated genes for each tissue type (defined as comparison group), we found that the candidate upregulated genes and the genes in each module were highly consistent for each comparison group. The ratio of overlapping genes between each comparison group was greater than 20% for almost all combinations, except the combination of flowers and the greenyellow module (2%) (Fig. 4b).
Fig. 3

The twelve coexpression modules, comprising 126 to 4203 jute genes. The modules were identified using weighted gene coexpression network analysis (WGCNA). The heatmap depicted adjacencies or topological overlaps, with light colors denoting higher adjacency (correlation), with red colors denoting low adjacency (correlation). The gene dendrograms and module colors are plotted along the top and left side of the heatmap. Each color represents a module

Fig. 4

Coexpression module and gene comparison analyses. We compared the genes in modules related to traits to the candidate upregulated genes. a Correlations between the modules and tissues at two different developmental stages. b Overlap between genes in modules related to traits and candidate upregulated genes. MF, mature flowers; LVGP, leaf tissues of vegetative growth period; LFP, leaf tissues of flowering period; FT1, fruits < 0.8 cm in diameter; FT2, fruits > 0.8 cm in diameter; BVGP, bast of vegetative growth period; BFP, bast of flowering period

The twelve coexpression modules, comprising 126 to 4203 jute genes. The modules were identified using weighted gene coexpression network analysis (WGCNA). The heatmap depicted adjacencies or topological overlaps, with light colors denoting higher adjacency (correlation), with red colors denoting low adjacency (correlation). The gene dendrograms and module colors are plotted along the top and left side of the heatmap. Each color represents a module Coexpression module and gene comparison analyses. We compared the genes in modules related to traits to the candidate upregulated genes. a Correlations between the modules and tissues at two different developmental stages. b Overlap between genes in modules related to traits and candidate upregulated genes. MF, mature flowers; LVGP, leaf tissues of vegetative growth period; LFP, leaf tissues of flowering period; FT1, fruits < 0.8 cm in diameter; FT2, fruits > 0.8 cm in diameter; BVGP, bast of vegetative growth period; BFP, bast of flowering period We performed Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis for the overlapping genes for each comparison group. The terms ‘protein processing in endoplasmic reticulum’, ‘sesquiterpenoid and triterpenoid biosynthesis’, ‘plant hormone signal transduction’, and ‘glycolysis/gluconeogenesis’, were enriched in stem bast tissues. In addition to other terms, the terms ‘phenylpropanoid biosynthesis’, ‘biosynthesis of secondary metabolites’, and ‘flavonoid biosynthesis’ were enriched in the fruit; ‘pentose and glucuronate interconversions’ and ‘phenylalanine metabolism’ were enriched in the flowers; and ‘metabolic pathways’ and ‘photosynthesis’ were enriched in the leaf tissues (Additional files 11 and 12: Fig. S1–2).

Identification of genes in coexpression modules associated with fiber tissues

The vegetative growth period is the period of jute fiber development and rapid thickening of stem bast. Based on the WGCNA results, the pink module related with BVGP. We evaluated the correlation between the expression of genes and stem bast tissues, and define this value as the Gene Significance (GS) score. We also assessed the correlation of the pink module with the gene expression profiles, based on this correlation, we defined module membership (MM) in the pink module. GS and MM were closely correlated (cor = 0.85, p < 1e− 200) in the pink module for stem bast tissue (Fig. 5), reflecting the strong correlation between stem bast tissue and the pink module genes. We identified 253 upregulated genes in stem bast that also occurred within the pink module, during the vegetative growth period. We further analyzed and constructed a coexpression network for these genes. We focused on a transcription factor gene (OMO55970.1), which was directly linked to 21 other genes (Fig. 6). Fourteen of these genes were included among the 253 common genes (Table 2). Some of these 14 genes were involved in the development of stem bast and fiber, with very high GS.BVGP, and MM.pink values. For example, OMO50871.1 is an epidermal patterning factor, OMO51203.1 is related to glucose metabolism, and OMO87663.1 is a wall-associated receptor kinase.
Fig. 5

Scatterplot of Gene Significance (GS) score versus module membership (MM) in the pink module. This is for stem bast tissue during the vegetative growth period

Fig. 6

A network of 21 genes directly linked to transcription factor OMO55970.1. This transcription factor is associated with genes in the pink module that code for fiber tissues

Table 2

Gene significance and membership of the pink module, for 15 jute genes including OMO55970.1 and genes directly linked to it

Gene nameGS.BVGPP.GS.BVGPMM.pinkP.MM.pinklog2FCP-adjAnnotation
OMO55970.10.470.04480.590.00791.840.0052sp|Q39237|TGA1_ARATH Transcription factor TGA1
Novel023460.640.00340.680.00142.520.0005sp|P04793|HSP13_SOYBN 17.5 kDa class I heat shock protein
OMO50871.10.780.00010.870.00001.860.0332sp|Q9T068|EPFL2_ARATH EPIDERMAL PATTERNING FACTOR-like protein
OMO51203.10.840.00000.890.00001.820.0056sp|Q9LTA3|U91C1_ARATH UDP-glycosyltransferase 91C1
OMO55771.10.590.00810.600.00672.760.0000sp|Q94CH6|EXL3_ARATH GDSL esterase/lipase EXL3
OMO59515.10.770.00010.800.00003.240.0000sp|Q84TH5|AB25G_ARATH ABC transporter G family member 25
OMO63886.10.560.01180.700.00105.240.0497
OMO67249.10.770.00010.830.00001.600.0162
OMO69537.10.700.00100.700.00081.670.0271sp|Q01JD1|SPL7_ORYSI Squamosa promoter-binding-like protein 7
OMO76397.10.600.00720.730.00042.140.0008sp|Q9FKW0|RNG1A_ARATH Putative E3 ubiquitin-protein ligase RING1a
OMO79017.10.570.01040.590.00764.700.0317
OMO80742.10.870.00000.900.00003.330.0001
OMO87663.10.870.00000.940.00002.100.0010sp|Q8RY67|WAKLO_ARATH Wall-associated receptor kinase-like 14
OMO94989.10.850.00000.830.00002.060.0038sp|O04212|Y2090_ARATH Putative ABC1 protein
OMP03057.10.630.00380.680.00143.670.0000

These genes were differentially expressed between the jute bast and other tissues

Scatterplot of Gene Significance (GS) score versus module membership (MM) in the pink module. This is for stem bast tissue during the vegetative growth period A network of 21 genes directly linked to transcription factor OMO55970.1. This transcription factor is associated with genes in the pink module that code for fiber tissues Gene significance and membership of the pink module, for 15 jute genes including OMO55970.1 and genes directly linked to it These genes were differentially expressed between the jute bast and other tissues

Validation of the differential gene expression results

To validate the RNA-seq results, qRT-PCR analysis was performed for 12 genes during the vegetative growth period. The genes showed differential expression when comparing stem bast with leaf tissue, consistent with the results obtained by the RNA-seq analysis (Additional file 13: Fig. S3). In addition, we compared DEGs identified in bast tissue during the vegetative growth period in our study with the DEGs identified in fibre cells which were included in bast tissue by comparing fibre cells with seedling reported by Islam et al. A total of 714 upregulated and 837 downregulated genes were discovered in the both studies (Additional files 14 and 15: Tables S11–12), accounting for approximately 35% (714/2035) and 38% (837/2231) of upregulated and downregulated genes identified in bast tissue during the vegetative growth period in our study.

Discussion

Knowing how genes are expressed and regulated in various tissues is the basis of studying gene function, and is required for molecular breeding. The study of gene expression profiles of various tissues during different developmental stages has been reported for many crops [22, 23]. However, a systematic and comprehensive transcriptome has not yet been reported for C. capsularis, and transcription information for various tissues is lacking. In the study, we compared the transcriptomes of four tissue types during two growth stages, revealing many genes and marked distinctions in gene expression. Tissues were clustered together according to their gene expression signatures, consistent with previous reports [23, 24]. Comparative transcriptome analysis is a useful and conventional method for investigating spatiotemporal pattern of gene expression [23, 25–27]. However, it is difficult to examine the relationships between DEGs and tissues using differential gene expression analysis. Therefore, the systematic analytical methods are called for comprehensively extracting information from transcriptome data. WGCNA is a popular and useful approach for revealing meaningful relationships between gene modules and biological processes, and between genes and traits [13, 14]. In this study, we first identified the genes that were differentially expressed in each tissue compared with other tissues in the same developmental period, then identified the genes that were consistently up- or down regulated in different periods for each tissue. These findings will help us to understand the development of different tissues. In particular, genes that are consistently up- and downregulated might play crucial roles in the metabolism, growth, and development of each tissue; these should be the focus of future studies. We then identified 12 coexpression modules comprising 126 to 4203 genes, that were associated with the development of various tissues, using WGCNA. We compared the genes in modules related to tissues, to the candidate upregulated genes in each tissue, and found that they were highly consistent (Fig. 4b). This corroborated our analysis of the correlation between the coexpression modules and the gene expression profiles of each tissue. Furthermore, the KEGG analysis for the overlapping genes for each comparison group showed that the various tissue types have different enrichment pathways. This provides evidence that the genes that code for the pathways have variable spatiotemporal pattern of expression in the tissues. Some similar results have been found in previous studies on jute pathway enrichment, for example, ‘glycolysis/gluconeogenesis’ was enriched in stem bast tissue, which associated with bast fibre synthesis [19, 28]; We verified that the WGCNA was suitable for the analysis of gene expression modules in the various tissues that we studied. Additionally, 253 genes that were upregulated in stem bast also occurred within the pink module, during the vegetative growth period. Among these genes, transcription factor OMO55970.1 was directly linked to 21 other genes, some of them showing very high GS.BVGP and MM.pink correlation and directly involved in the development of stem bast and fiber according to gene annotation [28]. In particular, OMO55970.1, a TGA1 transcription factor, is a candidate transcription factor for fiber differentiation [29]. OMO50871.1 is an epidermal patterning factor; OMO51203.1 is related to glucose metabolism; and OMO87663.1 is a wall-associated receptor kinase which play a crucial role in correct cell-wall synthesis [30]. Our results suggest that these genes are of great significance for the formation of stem bast and fiber, and that they should be the focus of future research.

Conclusion

This is the first systematic and comprehensive transcriptome analysis for Corchorus capsularis. To describe the transcriptomes associated with various tissue types and developmental stages, we performed a comparative transcriptome and coexpression modules analysis. We compared the genes in modules related to tissues to the candidate upregulated genes for each tissue, and found that they were highly consistent. These results constituted a systematic and complete database of gene expression of jute various tissue types and tissue development. We identified a gene network of 21 genes that was directly regulated by transcription factor OMO55970.1. Some of these genes, such as OMO55970.1, OMO51203.1, OMO50871.1, and OMO87663.1, were immediately involved in the development of stem bast and fiber. These genes should be the focus of future research. In summary, these findings help to elucidate the molecular mechanisms of tissue development in jute, and to promote multipurpose molecular breeding in jute and other fiber crops.

Methods

Plant growth and transcriptome sequencing

The C. capsularis used in this study was a variety (Yueyuan5hao). It originally came from Guangdong Province, China, and stored in the National Bast Fiber Germplasm Middle-term Storage of China (our lab, in Changsha, Hunan province, China). In total, 20 plants were planted in a well-ventilated greenhouse, in 10 pots with two plants per pot. During the entire growth period, the environmental conditions of the soil and fertilizer in each pot were kept consistent, to ensure minimal differences in plant growth. When the plants had grown to approximately 1 m (vegetative growth period), stem bast and leaf tissues were collected for RNA extraction. During the flowering and fruiting period, tissues including leaves, stem bast, fruits with diameter < 0.8 cm, fruits with diameter > 0.8 cm, and flowers were collected for RNA extraction. Three replicates of each tissue were collected, except of flowers. Owing to the small size of the flowers, we sampled enough amount for mixed sequencing. Total RNA was extracted from 19 tissue samples using Trizol (Invitrogen, Santa Clara, CA, USA) following the manufacturer’s protocol. RNA quality assessment and construction of sequencing libraries were performed according to Yang et al. [31]. In total, 19 sequencing libraries were generated, using NEBNext Ultra RNA Library Prep Kit for Illumina (NEB, USA). RNA sequencing was carried out on an Illumina HiSeq 2500 System.

Read mapping and differential gene expression analysis

Firstly, raw reads were processed in Perl using in-house scripts, to obtain clean, high quality reads by removing adapter sequences and low-quality reads. All downstream analyses were performed using the clean reads. We downloaded the reference genome of C. capsularis (CCACVL1_1.0) from the NCBI database [28]. We then used Bowtie v2.0.6 [32] to build the reference genome index, TopHat v2.0.9 [33] to map clean reads to the reference genome, and HTSeq v0.6.1 to count the number of reads aligned to each gene [34]. To calculate gene expression levels, we assessed the RPKM of each gene based on the number of reads mapped to the gene and the length of the gene. Finally, differential gene expression analysis was performed using a model based on the negative binomial distribution, using the R package DESeq (1.10.1) [35]. Genes with an adjusted P-value < 0.05 were considered to be differentially expressed. P-values were adjusted to reduce the false discovery rate using the Benjamini-Hochberg procedure [35]. To identify gene clusters associated with various tissue types, we performed a WGCNA using the R package WGCNA [36], using RPKM values from the 19 tissue samples. First, we clustered all samples to exclude any obvious outliers (cutHeight = 3000). We then performed network construction and module detection using an automatic 1-step network construction and module detection function. We constructed a weighted gene network using thresholding power β (β = 1 to 30) to calculate adjacency among the genes. Further, we chose the soft thresholding power (β = 16) to construct a network based on the criterion of approximate scale-free topology, using mergeCutHeight = 0.25 and minModuleSize = 30. We estimated the association between genes and various tissues using the Gene Significance (GS) score (whereby each tissue is considered a quality trait), the correlation of the module and the gene expression profile (MM) and the correlation of module with traits (module eigengene). This enabled us to identify the expression modules and genes that are closely related to each tissue. Lastly, the WGCNA results were exported to Cytoscape software to construct a gene coexpression network [37].

Pathway enrichment analysis of differentially expressed genes

To examine the metabolic pathways associated with the development of different tissues, we first identified the genes that were both included in modules related to specific traits, and were upregulated in each tissue relative to the other tissues at the same stage. Second, we performed a KEGG enrichment analysis, using KOBAS software, for the genes that were identified both in each tissue and each module [38, 39]. Third, we selected the 20 most significantly enriched pathways to visualize in the maps. When fewer than 20 pathways were significantly enriched, all were displayed.

Validation of differential gene expression

To validate the RNA-seq results, we compared a set of genes expression in this study with the report by Islam et al. [28], and analyzed the expression of 12 genes (7 that were directly regulated by transcription factor OMO55970.1, and five randomly selected from among the DEGs expressed in leaves and stem bast tissues during the vegetative growth period) by qRT-PCR, using three independent biological and three technological replicates. We used the jute elongation factor-alpha (ELF) gene as an endogenous control. The primer sequences of the DEGs and ELF are listed in Additional file 16: Table S13. The qRT-PCR was performed according to Yang et al. [20]. Additional file 1: Table S1. Transcriptional genes that were identified in stem bast tissue during the vegetative growth period (BVGP). Additional file 2: Table S2. Transcriptional genes that were identified in bast tissue during the flowering period (BFP). Additional file 3: Table S3. Transcriptional genes that were identified in leaf tissues during the vegetative growth period (LVGP). Additional file 4: Table S4. Transcriptional genes that were identified in leaf tissues during the flowering period (LFP). Additional file 5: Table S5. Transcriptional genes that were identified in fruit < 0.8 cm in diameter (FF1). Additional file 6: Table S6. Transcriptional genes that were identified in fruit > 0.8 cm in diameter (FF2). Additional file 7: Table S7. Transcriptional genes that were identified in flower tissues. Additional file 8: Table S8. Transcriptional genes that were identified in all tissues during differential stages. Additional file 9: Table S9. Upregulated genes that were identified by comparing bast tissue during the vegetative growth period (BVGP) with leaf tissue during the vegetative growth period (LVGP). Additional file 10: Table S10. Downregulated genes that were identified by comparing bast tissue during the vegetative growth period (BVGP) with leaf tissue during the vegetative growth period (LVGP). Additional file 11: Figure S1. Enriched terms for the genes that overlapped between the gene modules related to traits and the candidate upregulated genes, obtained using KEGG enrichment analysis. Additional file 12: Figure S2. Enriched terms for the genes that overlapped between the gene modules related to traits and the candidate upregulated genes, obtained using KEGG enrichment analysis. Additional file 13: Figure S3. RNA-seq and qRT-PCR analysis results for the genes directly regulated by transcription factor (OMO55970.1), and for the five genes that were randomly selected from among the genes that were differentially expressed between the bast tissueand the leaf tissue during the vegetative growth period. Additional file 14: Table S11. Common upregulated genes identified in bast tissue during the vegetative growth period in our study and in fibre cells which were included bast tissue identified by comparing fibre cells with seedling reported by Islam et al. [28]. Additional file 15: Table S12. Common downregulated genes identified in bast tissue during the vegetative growth period in our study and in fibre cells which were included bast tissue identified by comparing fibre cells with seedling reported by Islam et al. [28]. Additional file 16: Table S13. The primer sequences used for ELF, for the seven genes that were directly regulated by transcription factor (OMO55970.1), and for the five genes that were randomly selected from among the differentially expressed genes.
  30 in total

1.  ePlant: Visualizing and Exploring Multiple Levels of Data for Hypothesis Generation in Plant Biology.

Authors:  Jamie Waese; Jim Fan; Asher Pasha; Hans Yu; Geoffrey Fucile; Ruian Shi; Matthew Cumming; Lawrence A Kelley; Michael J Sternberg; Vivek Krishnakumar; Erik Ferlanti; Jason Miller; Chris Town; Wolfgang Stuerzlinger; Nicholas J Provart
Journal:  Plant Cell       Date:  2017-08-14       Impact factor: 11.277

2.  A global view of transcriptome dynamics during flower development in chickpea by deep sequencing.

Authors:  Vikash K Singh; Rohini Garg; Mukesh Jain
Journal:  Plant Biotechnol J       Date:  2013-04-01       Impact factor: 9.803

3.  Fast gapped-read alignment with Bowtie 2.

Authors:  Ben Langmead; Steven L Salzberg
Journal:  Nat Methods       Date:  2012-03-04       Impact factor: 28.547

4.  Development of an InDel polymorphism database for jute via comparative transcriptome analysis.

Authors:  Zemao Yang; Zhigang Dai; Dongwei Xie; Jiquan Chen; Qing Tang; Chaohua Cheng; Ying Xu; Tingzhang Wang; Jianguang Su
Journal:  Genome       Date:  2018-02-08       Impact factor: 2.166

5.  Comparative genomics of two jute species and insight into fibre biogenesis.

Authors:  Md Shahidul Islam; Jennifer A Saito; Emdadul Mannan Emdad; Borhan Ahmed; Mohammad Moinul Islam; Abdul Halim; Quazi Md Mosaddeque Hossen; Md Zakir Hossain; Rasel Ahmed; Md Sabbir Hossain; Shah Md Tamim Kabir; Md Sarwar Alam Khan; Md Mursalin Khan; Rajnee Hasan; Nasima Aktar; Ummay Honi; Rahin Islam; Md Mamunur Rashid; Xuehua Wan; Shaobin Hou; Taslima Haque; Muhammad Shafiul Azam; Mahdi Muhammad Moosa; Sabrina M Elias; A M Mahedi Hasan; Niaz Mahmood; Md Shafiuddin; Saima Shahid; Nusrat Sharmeen Shommu; Sharmin Jahan; Saroj Roy; Amlan Chowdhury; Ashikul Islam Akhand; Golam Morshad Nisho; Khaled Salah Uddin; Taposhi Rabeya; S M Ekramul Hoque; Afsana Rahman Snigdha; Sarowar Mortoza; Syed Abdul Matin; Md Kamrul Islam; M Z H Lashkar; Mahboob Zaman; Anton Yuryev; Md Kamal Uddin; Md Sharifur Rahman; Md Samiul Haque; Md Monjurul Alam; Haseena Khan; Maqsudul Alam
Journal:  Nat Plants       Date:  2017-01-30       Impact factor: 15.793

6.  Global transcript profiling of primary stems from Arabidopsis thaliana identifies candidate genes for missing links in lignin biosynthesis and transcriptional regulators of fiber differentiation.

Authors:  Jürgen Ehlting; Nathalie Mattheus; Dana S Aeschliman; Eryang Li; Britta Hamberger; Ian F Cullis; Jun Zhuang; Minako Kaneda; Shawn D Mansfield; Lacey Samuels; Kermit Ritland; Brian E Ellis; Jörg Bohlmann; Carl J Douglas
Journal:  Plant J       Date:  2005-06       Impact factor: 6.417

7.  Transcriptome Analysis of Two Species of Jute in Response to Polyethylene Glycol (PEG)- induced Drought Stress.

Authors:  Zemao Yang; Zhigang Dai; Ruike Lu; Bibo Wu; Qing Tang; Ying Xu; Chaohua Cheng; Jianguang Su
Journal:  Sci Rep       Date:  2017-11-29       Impact factor: 4.379

8.  Transcriptomics and co-expression networks reveal tissue-specific responses and regulatory hubs under mild and severe drought in papaya (Carica papaya L.).

Authors:  Samuel David Gamboa-Tuz; Alejandro Pereira-Santana; Jesús Alejandro Zamora-Briseño; Enrique Castano; Francisco Espadas-Gil; Jorge Tonatiuh Ayala-Sumuano; Miguel Ángel Keb-Llanes; Felipe Sanchez-Teyer; Luis Carlos Rodríguez-Zapata
Journal:  Sci Rep       Date:  2018-09-28       Impact factor: 4.379

9.  TiGER: a database for tissue-specific gene expression and regulation.

Authors:  Xiong Liu; Xueping Yu; Donald J Zack; Heng Zhu; Jiang Qian
Journal:  BMC Bioinformatics       Date:  2008-06-09       Impact factor: 3.169

10.  Comprehensive metabolic and transcriptomic profiling of various tissues provide insights for saponin biosynthesis in the medicinally important Asparagus racemosus.

Authors:  Prabhakar Lal Srivastava; Anurag Shukla; Raviraj M Kalunke
Journal:  Sci Rep       Date:  2018-06-14       Impact factor: 4.379

View more
  2 in total

1.  Revealing Genetic Differences in Fiber Elongation between the Offspring of Sea Island Cotton and Upland Cotton Backcross Populations Based on Transcriptome and Weighted Gene Coexpression Networks.

Authors:  Shengmei Li; Shiwei Geng; Bo Pang; Jieyin Zhao; Yajie Huang; Cun Rui; Jinxin Cui; Yang Jiao; Ru Zhang; Wenwei Gao
Journal:  Genes (Basel)       Date:  2022-05-26       Impact factor: 4.141

2.  Comparative Transcriptome Analyses of Different Rheum officinale Tissues Reveal Differentially Expressed Genes Associated with Anthraquinone, Catechin, and Gallic Acid Biosynthesis.

Authors:  Lipan Zhou; Jiangyan Sun; Tianyi Zhang; Yadi Tang; Jie Liu; Chenxi Gao; Yunyan Zhai; Yanbing Guo; Li Feng; Xinxin Zhang; Tao Zhou; Xumei Wang
Journal:  Genes (Basel)       Date:  2022-09-05       Impact factor: 4.141

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.