| Literature DB >> 34206279 |
Elizaveta Grigoreva1,2,3, Alexander Tkachenko1, Serafima Arkhimandritova4, Aleksandar Beatovic1, Pavel Ulianich5, Vladimir Volkov2,3, Dmitry Karzhaev3, Cécile Ben6, Laurent Gentzbittel6, Elena Potokina2,3.
Abstract
Guar (Cyamopsis tetragonoloba (L.) Taub.) is an annual legume crop native to India and Pakistan. Seeds of the plant serve as a source of galactomannan polysaccharide (guar gum) used in the food industry as a stabilizer (E412) and as a gelling agent in oil and gas fracturing fluids. There were several attempts to introduce this crop to countries of more northern latitudes. However, guar is a plant of a short photoperiod, therefore, its introduction, for example, to Russia is complicated by a long day length during the growing season. Breeding of new guar varieties insensitive to photoperiod slowed down due to the lack of information on functional molecular markers, which, in turn, requires information on guar genome. Modern breeding strategies, e.g., genomic predictions, benefit from integration of multi-omics approaches such as transcriptome, proteome and metabolome assays. Here we present an attempt to use transcriptome-metabolome integration to understand the genetic determination of flowering time variation among guar plants that differ in their photoperiod sensitivity. This study was performed on nine early- and six delayed-flowering guar varieties with the goal to find a connection between 63 metabolites and 1,067 differentially expressed transcripts using Shiny GAM approach. For the key biomarker of flowering in guar myo-inositol we also evaluated the KEGG biochemical pathway maps available for Arabidopsis thaliana. We found that the phosphatidylinositol signaling pathway is initiated in guar plants that are ready for flowering through the activation of the phospholipase C (PLC) gene, resulting in an exponential increase in the amount of myo-inositol in its free form observed on GC-MS chromatograms. The signaling pathway is performed by suppression of myo-inositol phosphate kinases (phosphorylation) and alternative overexpression of phosphatases (dephosphorylation). Our study suggests that metabolome and transcriptome information taken together, provide valuable information about biomarkers that can be used as a tool for marker-assisted breeding, metabolomics and functional genomics of this important legume crop.Entities:
Keywords: differentially expressed genes; gene network analysis; systems biology; transcriptome-metabolome integration
Mesh:
Substances:
Year: 2021 PMID: 34206279 PMCID: PMC8303896 DOI: 10.3390/genes12070952
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Guar lines with different photoperiod sensitivity that showed consistent early or late flowering time in experiments of 2018 and 2019 under the long day conditions.
| Plant ID | Early or Delayed Flowering on Long Day Conditions | Line ID | VIR Cat. Number | Origin | Accession |
|---|---|---|---|---|---|
| 34_1 | Early | 34 | 82 | India | landrace |
| 34_2 | Early | 34 | 82 | India | landrace |
| 34_3 | Early | 34 | 82 | India | landrace |
| 97_1 | Early | 97 | 52,586 | USA | cv. Lewis |
| 97_2 | Early | 97 | 52,586 | USA | cv. Lewis |
| 97_3 | Early | 97 | 52,586 | USA | cv.Lewis |
| 97_4 | Early | 97 | 52,586 | USA | cv.Lewis |
| 69_1 | Early | 69 | 52,585 | USA | cv.Kinman |
| 69_2 | Early | 69 | 52,585 | USA | cv.Kinman |
| 28_1 | Delayed | 28 | 550 | India | landrace |
| 28_2 | Delayed | 28 | 550 | India | landrace |
| 28_3 | Delayed | 28 | 550 | India | landrace |
| 75_1 | Delayed | 75 | 52,580 | Pakistan | landrace |
| 75_2 | Delayed | 75 | 52,580 | Pakistan | landrace |
| 75_3 | Delayed | 75 | 52,580 | Pakistan | landrace |
VIR: Vavilov Institute of Plant Genetic Resources.
Results reflecting assemblies’ parameters obtained by Transrate.
| Metrics | Results | |||
|---|---|---|---|---|
| rnaSPAdes | Trinity | Trinity | Trinity Genome-Guided | |
| Number of contigs | 132,825 | 102,909 | 112,788 | 102,539 |
| Shortest contigs (bp) | 131 | 197 | 201 | 186 |
| Longest contigs (bp) | 40,377 | 59,908 | 59,908 | 14,406 |
| Number of bases (bp) | 78,453,910 | 91,396,199 | 103,168,619 | 96,000,969 |
| Mean length (bp) | 539 | 888 | 914 | 936 |
| Number of contigs over 1000 bp | 22,143 | 307 | 32,499 | 32,819 |
| Number of contigs over 10,000 bp | 95 | 29,453 | 229 | 3 |
| Mean ORF % | 56 | 54 | 54 | 54 |
| N90 (bp) | 317 | 324 | 343 | 357 |
| N70 (bp) | 792 | 862 | 901 | 980 |
| N50 (bp) | 1394 | 1586 | 1615 | 1661 |
| N30 (bp) | 2243 | 2427 | 2404 | 2359 |
| N10 (bp) | 16,753 | 4939 | 4231 | 3634 |
| GC% | 40 | 41 | 40 | 39 |
| Mean of overall alignment rate of the transcripts against assembly (%) | 96 | 97 | 99 | 96 |
Comparative statistics of transcriptome assemblies available for guar.
| Assembly Metrics | Assembly (Genome-Guided Trinity, | Assembly Tanwar et al., 2017 [ | Assembly |
|---|---|---|---|
| N50 | 1661 | 1035 | 2552 |
| Total unigenes | 79,863 | 61,508 | 62,146 |
| Average transcript length (bp) | 936 | 679 | 1045 |
Figure 1PCA biplot showing coordinates of early and delayed flowering guar lines according to the expression level of 78,015 transcripts which corresponds to the longest isoforms. For each line, three different plants were assessed for gene expression level. Plant ID (Table 1) is prefixed with E (Early) or D (Delayed).
Figure 2Clusterprofiler GO enrichment results showing activation or suppression of genes in Early flowering guar genotypes in comparison with Late flowering genotypes for the functional categories: (a) Biological Process (BP); (b) Molecular Function (MF); (c) Cellular Component (CC).
Figure 3Transcriptome-metabolome network obtained with Shiny GAM. Green edges correspond to genes downregulated in Early flowering plants, red edges correspond to overexpressed genes. Green nodes correspond to metabolites whose negative log2fc value means lower concentrations in EFl plants compared to DFl plants, red nodes are metabolites ‘overexpressed’ in EFl. The nodes with increased size correspond to metabolites with significant p-value in 2019. Dark grey small nodes indicate metabolites with p-value > 0.05.
Description of significant metabolites-transcripts connection presented in Shiny GAM network.
| Metabolite Name (Shiny GAM) | Metabolite log2fc (2019) | Metabolite | Metabolite | Connected Genes in Shiny GAM | Gene log2fc | Gene |
|---|---|---|---|---|---|---|
| C00243 | −1.726 | 0.001 | nd |
| −0.711 | 0.001 |
| C00137 | 0.670 | 1.55 × 10−6 | 1.12 × 10−7 |
| −3.017 | 2.6 × 10−10 |
| C00811 | 2.243 | 5.62 × 10−4 | 6.53 × 10−7 |
| −1.034 | 2.36 × 10−6 |
| C00258 | 0.415 | 0.043 | 0.006 |
| −0.303 | 0.039 |
| C00158 | 1.119 | 1.36 × 10−4 | 3.45 × 10−5 |
| 0.497 | 0.001 |
| C00149 | 0.467 | 9.87 × 10−5 | 0.006 |
| −0.220; | 0.042; |
| C01595 | 0.493 | 0.003 | nd |
| −0.9 | 0.055 |
| C00049 | 0.621 | 0.009 | nd |
| −0.386; | 0.097; |
| C00065 | 0.393 | 0.008 | 0.001 |
| −1.448 | 5.492 × 10−5 |
Figure 4Putative transcripts involved in the phosphatidylinositol signaling pathway in guar based on homology with Arabidopsis thaliana on KEGG map obtained by Сlusterprofiler (pathway ath04070). The red-green scale is the mean value of log2fc. Green labels correspond to down-regulation in early flowering genotypes, red labels to up-regulated transcripts, grey labels to transcripts with no significant difference between conditions. The target metabolite is unphosphorylated myo-inositol marked by a circle.
Description of differentially expressed guar transcripts involved in the phosphatidylinositol signaling pathway based on homology with Arabidopsis thaliana on KEGG map obtained by Сlusterprofiler (pathway ath04070).
| Enzyme | Gene Name | Definition | Gene log2fc | Gene | MetaCyc Database | |
|---|---|---|---|---|---|---|
| 3.1.3.25 |
| myo-inositol 1-phosphate monophosphatase |
| 0.1375 | 0.3832 | 1D-myo-inositol 3-monophosphate + H2O → myo-inositol + phosphate |
| 3.1.3.25 |
| inositol-phosphate phosphatase |
| 0.3604 | 0.0600 | 1D-myo-inositol 3-monophosphate + H2O → myo-inositol + phosphate |
| 3.1.3.25 |
| L-galactose-1-phosphate phosphatase |
| 0.3508 | 0.0522 | β-L-galactose 1-phosphate + H2O → L-galactopyranose |
|
|
| phosphatidylinositol-3,4,5-trisphosphate 3-phosphatase |
| 0.1911 | 0.1132 | 1-phosphatidyl-1D-myo-inositol 3-phosphate + H2O → a 1-phosphatidyl-1D-myo-inositol |
|
|
| Inositol polyphosphate 1-phosphatase |
| 0.2393 | 0.0582 | D-myo-inositol (1,4)-bisphosphate + H2O → 1D-myo-inositol 4-monophosphate |
|
|
| Phosphoinositide-specific phospholipase C family protein |
| 1.9714 | 0.10466 | 1-phosphatidyl-1D-myo-inositol 4,5-bisphosphate + H2O → a 1,2-diacyl-sn-glycerol + D-myo-inositol (1,4,5)-trisphosphate + H+ |
| PIKFYVE |
| 1-phosphatidylinositol 3-phosphate 5-kinase |
| −0.6137 | 0.0185 | 1-phosphatidyl-1D-myo-inositol 3-phosphate + ATP → 1-phosphatidyl-1D-myo-inositol 3,5-bisphosphate + ADP + H+ |
|
|
| Phosphatidylinositol 4-kinase alpha 1 |
| −0.2559 | 0.0727 | 1-phosphatidyl-1D-myo-inositol + ATP 1-phosphatidyl-1D-myo-inositol 4-phosphate + |
|
|
| Inositol 1,3,4-trisphosphate 5/6-kinase family protein |
| −0.4706 | 0.0392 | D-myo-inositol (1,3,4)-trisphosphate + ATP→ D-myo-inositol (1,3,4,5)-tetrakisphosphate + ADP + H+ |
|
|
| diacylglycerol kinase 1 |
| −0.8194 | 0.00438 | ATP + 1,2-diacyl-sn-glycerol → a 1,2-diacyl-sn-glycerol 3-phosphate + |
|
|
| diphosphoinositol-pentakisphosphate kinase; |
| −0.9682 | 0.1076 | ATP + phytate → 1D-myo-inositol 1-diphosphate 2,3,4,5,6-pentakisphosphate |
| PIP5K |
| phosphatidylinositol-4-phosphate 5-kinase 1 |
| −0.5195 | 0.00026 | 1-phosphatidyl-1D-myo-inositol 4-phosphate + ATP → a 1-phosphatidyl-1D-myo-inositol 4,5-bisphosphate + ADP + H+ |