| Literature DB >> 24957996 |
Eva Collakova1, Delasa Aghamirzaie2, Yihui Fang3, Curtis Klumas4, Farzaneh Tabataba5, Akshay Kakumanu6, Elijah Myers7, Lenwood S Heath8, Ruth Grene9.
Abstract
Soybean (Glycine max) seeds are an important source of seed storage compounds, including protein, oil, and sugar used for food, feed, chemical, and biofuel production. We assessed detailed temporal transcriptional and metabolic changes in developing soybean embryos to gain a systems biology view of developmental and metabolic changes and to identify potential targets for metabolic engineering. Two major developmental and metabolic transitions were captured enabling identification of potential metabolic engineering targets specific to seed filling and to desiccation. The first transition involved a switch between different types of metabolism in dividing and elongating cells. The second transition involved the onset of maturation and desiccation tolerance during seed filling and a switch from photoheterotrophic to heterotrophic metabolism. Clustering analyses of metabolite and transcript data revealed clusters of functionally related metabolites and transcripts active in these different developmental and metabolic programs. The gene clusters provide a resource to generate predictions about the associations and interactions of unknown regulators with their targets based on "guilt-by-association" relationships. The inferred regulators also represent potential targets for future metabolic engineering of relevant pathways and steps in central carbon and nitrogen metabolism in soybean embryos and drought and desiccation tolerance in plants.Entities:
Year: 2013 PMID: 24957996 PMCID: PMC3901275 DOI: 10.3390/metabo3020347
Source DB: PubMed Journal: Metabolites ISSN: 2218-1989
Figure 1Changes in fatty acid and protein levels in developing soybean embryos. Fatty acids from hydrolyzed lipids and proteins were analyzed by GC-FID as fatty acid methylesters and by a fluorescent hydrophobic protein assay, respectively, as described in the Experimental Section. Young 5-day-old embryos already accumulate seed storage compounds and the levels of these compounds gradually increased in developing soybean embryos from day 5 to 25.
Fatty acid levels in developing soybean embryos. Fatty acids from hydrolyzed lipids were analyzed by GC-FID as fatty acid methyl esters as described in the Experimental Section. Averages ± SD (μg mg−1 dry weight) of three biological replicates are shown for each day in the time course.
| Fatty Acids | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Day | 16:0 | 18:0 | 18:1 Δ9 | 18:1 Δ12 | 18:2 Δ9,12 | 18:3 Δ9,12,15 | 20:0 | 20:1 Δ11 | 22:0 |
|
| 8.0 ± 2.2 | 2.2 ± 0.4 | 6.1 ± 2.3 | 0.45 ± 0.19 | 14 ± 4 | 9.4 ± 2.5 | 0.40 ± 0.06 | 0.12 ± 0.04 | 0.34 ± 0.10 |
|
| 18 ± 2 | 6.1 ± 1.0 | 29 ± 9 | 1.9 ± 0.3 | 57 ± 8 | 16 ± 1 | 0.80 ± 0.14 | 0.31 ± 0.05 | 0.47 ± 0.07 |
|
| 29 ± 9 | 10 ± 3 | 52 ± 28 | 3.5 ± 0.9 | 98 ± 24 | 24 ± 2 | 1.1 ± 0.4 | 0.47 ± 0.23 | 0.77 ± 0.26 |
|
| 29 ± 2 | 11 ± 1 | 69 ± 7 | 3.8 ± 0.3 | 120 ± 9 | 20 ± 1 | 1.2 ± 0.1 | 0.56 ± 0.06 | 0.77 ± 0.08 |
|
| 38 ± 5 | 16 ± 2 | 73 ± 10 | 4.2 ± 0.6 | 183 ± 23 | 25 ± 4 | 1.5 ± 0.2 | 0.72 ± 0.09 | 1.2 ± 0.1 |
|
| 35 ± 2 | 14 ± 1 | 81 ± 11 | 4.2 ± 0.6 | 161 ± 16 | 22 ± 1 | 1.4 ± 0.0 | 0.76 ± 0.07 | 1.1 ± 0.1 |
|
| 33 ± 2 | 14 ± 0 | 66 ± 3 | 3.7 ± 0.4 | 162 ± 20 | 22 ± 2 | 1.3 ± 0.1 | 0.71 ± 0.05 | 1.0 ± 0.0 |
|
| 37 ± 0 | 15 ± 1 | 81 ± 10 | 4.2 ± 0.4 | 176 ± 10 | 22 ± 2 | 1.5 ± 0.1 | 0.83 ± 0.10 | 1.2 ± 0.0 |
|
| 29 ± 2 | 11 ± 0 | 65 ± 4 | 3.2 ± 0.4 | 132 ± 13 | 16 ± 1 | 1.2 ± 0.0 | 0.67 ± 0.06 | 0.93 ± 0.01 |
|
| 25 ± 1 | 9.0 ± 0.7 | 54 ± 6 | 2.8 ± 0.2 | 112 ± 10 | 14 ± 1 | 0.95 ± 0.10 | 0.52 ± 0.01 | 0.78 ± 0.07 |
Figure 2Principal Component Analysis on Metabolite Level Correlations. Score plots (A and B) and loading plots (C and D) are shown for combinations of PC1, 2, and 3. PCA was performed on combined non-redundant data involving three replicates of the relative or absolute levels of free metabolites, individual fatty acids from hydrolyzed lipids, and hydrophobic protein in developing soybean embryos by using JMP Pro 10 software (SAS, Cary, NC, USA). The color-coded numbers in A represent the corresponding age of the embryos (days 5 through 55), and each dot of the same color represents replicate samples. The color-coding is retained for B. The ovals highlight the basic clustering/separation of similar/dissimilar samples. In the loading plots C and D, the eigenvectors represented by red arrows show how (the direction) and how much (the length) each metabolite contributes to the individual correlations represented by PC1, 2, and 3.
Figure 3Changes in metabolite levels in developing soybean embryos. Relative metabolite levels obtained from GC-MS-FID and UPLC analyses and protein levels were subjected to Limma computational analysis to obtain trends as described in the Experimental Section. Only metabolites that showed statistically significant changes (p-value < 0.05) for at least one time point were subjected to SplineCluster analysis resulting in four clusters (shown in the same scale). The levels of homoserine, aconitate, mucic acid, ornithine, and Met did not change significantly over embryo development. Cluster 1 contained 10 metabolites (citrate and nine lipid-derived fatty acids that were detected by GC-FID (Table 1)) and total protein. Cluster 2 contained 10 metabolites (sucrose, oxalate, mannitol, lactobionate, melibiose, galactinol, galactinol-like, raffinose, Arg, 2-aminobutyrate). Cluster 3 contained 16 metabolites (phosphate, malate, pinitol, inositol, ethanolamine, myo-inositol, maltose, Gly, Asp, Glu, Thr, Ala, γ-aminobutyrate, Pro, Lys, Tyr). Cluster 4 contained 22 metabolites (fructose, glucose, glycolate, malonate, succinate, glycerate, fumarate, beta-cyanoAla, pyroGlu, 4-guanidinobutyrate, putrescine, dehydroascorbate, saccharate, His, Asn, Ser, Gln, citrulline, Val, Ileu, Leu, Phe).
Figure 4Major five gene expression trends (A–E) representing processes of interest observed in developing soybean embryos.
Figure 5RNA sequencing, differential expression, and coexpression pipeline. Computational tools are shown in color.
Statistical results from RNA sequencing analysis (number and percentage of raw, filtered, and aligned reads). Three replicates (a, b, and c) are shown for each time point (days 5–55).
| sample | raw | % | filtered | % | aligned | % |
|---|---|---|---|---|---|---|
| d5a | 50,277,367 | 100 | 48,847,168 | 97.2 | 45,036,342 | 89.6 |
| d5b | 62,438,141 | 100 | 60,618,219 | 97.1 | 55,916,016 | 89.6 |
| d5c | 42,789,850 | 100 | 41,586,412 | 97.2 | 38,479,330 | 89.9 |
| d10a | 72,420,225 | 100 | 69,364,427 | 95.8 | 61,431,613 | 84.8 |
| d10b | 46,641,758 | 100 | 44,850,498 | 96.2 | 39,812,742 | 85.4 |
| d10c | 85,142,664 | 100 | 81,774,580 | 96.0 | 71,198,971 | 83.6 |
| d15a | 42,701,829 | 100 | 39,342,420 | 92.1 | 33,742,148 | 79.0 |
| d15b | 55,919,488 | 100 | 51,665,677 | 92.4 | 44,507,297 | 79.6 |
| d15c | 98,613,720 | 100 | 91,253,129 | 92.5 | 78,873,912 | 80.0 |
| d20a | 71,059,794 | 100 | 65,828,827 | 92.6 | 52,933,436 | 74.5 |
| d20b | 44,455,535 | 100 | 41,248,905 | 92.8 | 33,195,196 | 74.7 |
| d20c | 54,423,534 | 100 | 50,618,077 | 93.0 | 40,743,929 | 74.9 |
| d25a | 61,500,744 | 100 | 56,010,032 | 91.1 | 46,786,403 | 76.1 |
| d25c | 83,670,143 | 100 | 76,387,754 | 91.3 | 63,785,353 | 76.2 |
| d30a | 74,112,923 | 100 | 68,885,120 | 92.9 | 59,572,944 | 80.4 |
| d30b | 77,985,515 | 100 | 72,428,008 | 92.9 | 61,711,555 | 79.1 |
| d30c | 100,671,683 | 100 | 93,523,791 | 92.9 | 81,026,533 | 80.5 |
| d35a | 67,498,080 | 100 | 62,861,610 | 93.1 | 54,691,875 | 81.0 |
| d35b | 87,943,727 | 100 | 82,007,125 | 93.2 | 71,352,895 | 81.1 |
| d35c | 99,568,258 | 100 | 92,761,463 | 93.2 | 80,856,912 | 81.2 |
| d40a | 135,660,811 | 100 | 124,626,322 | 91.9 | 107,084,232 | 78.9 |
| d40b | 54,135,432 | 100 | 49,929,189 | 92.2 | 43,065,040 | 79.6 |
| d40c | 55,241,634 | 100 | 51,064,159 | 92.4 | 44,214,195 | 80.0 |
| d45a | 67,554,585 | 100 | 63,069,990 | 93.4 | 53,773,223 | 79.6 |
| d45b | 62,187,743 | 100 | 58,293,335 | 93.7 | 50,756,256 | 81.6 |
| d45c | 74,873,986 | 100 | 70,329,261 | 93.9 | 60,877,029 | 81.3 |
| d55a | 74,383,353 | 100 | 68,110,555 | 91.6 | 56,475,935 | 75.9 |
| d55b | 86,213,964 | 100 | 79,106,644 | 91.8 | 65,108,144 | 75.5 |
| d55c | 47,160,359 | 100 | 43,338,248 | 91.9 | 35,914,805 | 76.2 |
Soybean genes expressed during the dormancy induction phase in clusters D and E (see Figure 4). Arabidopsis homologs of soybean genes expressed during dormancy induction were categorized by search terms present in GO biological process categories. Each cluster group has the number of Arabidopsis homologs with GO annotations and the percentage of genes matching the search term in the left most column. Gene percentages are calculated using the number of search term matches in the cluster divided by the total genes in the cluster. ‘All clusters’ include both cluster D and E. Each cluster contains unique genes with no overlap, but a gene may be present in more than one category and the total number of genes represents non-redundant genes. All gene percentages > 10% are highlighted in bold.
| GO Search Terms | Trend D | Trend E | Both Trends (D + E) | |||
|---|---|---|---|---|---|---|
| Number | % | Number | % | Number | % | |
| abscisic acid | 19 | 7.36 | 16 | 8.56 | 35 | 8.05 |
| ethylene | 9 | 3.49 | 11 | 5.88 | 20 | 4.60 |
| jasmonic acid | 16 | 6.20 | 13 | 6.95 | 29 | 6.67 |
| salicylic acid | 14 | 5.43 | 13 | 6.95 | 27 | 6.21 |
| chloroplast | 8 | 3.10 | 1 | 0.53 | 9 | 2.07 |
| redox | 4 | 1.55 | 3 | 1.60 | 7 | 1.61 |
| germination | 7 | 2.71 | 1 | 0.53 | 8 | 1.84 |
| flowering | 5 | 1.94 | 1 | 0.53 | 6 | 1.38 |
| dormancy | 9 | 3.49 | 6 | 3.21 | 15 | 3.45 |
| transcription | 37 |
| 28 |
| 65 |
|
| signaling | 28 |
| 21 |
| 49 |
|
| metal | 5 | 1.94 | 4 | 2.14 | 9 | 2.07 |
| iron | 8 | 3.10 | 2 | 1.07 | 10 | 2.30 |
| trehalose | 0 | 0.00 | 2 | 1.07 | 2 | 0.46 |
| stress | 36 |
| 24 |
| 60 |
|
| oxidative stress | 11 | 4.26 | 9 | 4.81 | 20 | 4.60 |
| salt stress | 20 | 7.75 | 9 | 4.81 | 29 | 6.67 |
| osmotic stress | 6 | 2.33 | 4 | 2.14 | 10 | 2.30 |
| biotic stimulus | 2 | 0.78 | 3 | 1.60 | 5 | 1.15 |
| defense response | 24 | 9.30 | 34 |
| 58 |
|
| water deprivation | 17 | 6.59 | 13 | 6.95 | 30 | 6.90 |
| Total number of genes | 258 | 187 | 435 | |||