| Literature DB >> 27760136 |
Si Wu1, Saleh Alseekh1, Álvaro Cuadros-Inostroza1,2, Corina M Fusari1, Marek Mutwil1, Rik Kooke3, Joost B Keurentjes3, Alisdair R Fernie1, Lothar Willmitzer1, Yariv Brotman1,4.
Abstract
Plant primary metabolism is a highly coordinated, central, and complex network of biochemical processes regulated at both the genetic and post-translational levels. The genetic basis of this network can be explored by analyzing the metabolic composition of genetically diverse genotypes in a given plant species. Here, we report an integrative strategy combining quantitative genetic mapping and metabolite‒transcript correlation networks to identify functional associations between genes and primary metabolites in Arabidopsis thaliana. Genome-wide association study (GWAS) was used to identify metabolic quantitative trait loci (mQTL). Correlation networks built using metabolite and transcript data derived from a previously published time-course stress study yielded metabolite‒transcript correlations identified by covariation. Finally, results obtained in this study were compared with mQTL previously described. We applied a statistical framework to test and compare the performance of different single methods (network approach and quantitative genetics methods, representing the two orthogonal approaches combined in our strategy) with that of the combined strategy. We show that the combined strategy has improved performance manifested by increased sensitivity and accuracy. This combined strategy allowed the identification of 92 candidate associations between structural genes and primary metabolites, which not only included previously well-characterized gene‒metabolite associations, but also revealed novel associations. Using loss-of-function mutants, we validated two of the novel associations with genes involved in tyrosine degradation and in β-alanine metabolism. In conclusion, we demonstrate that applying our integrative strategy to the largely untapped resource of metabolite-transcript associations can facilitate the discovery of novel metabolite-related genes. This integrative strategy is not limited to A. thaliana, but generally applicable to other plant species.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27760136 PMCID: PMC5070769 DOI: 10.1371/journal.pgen.1006363
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Fig 4Association between tyrosine and HGO.
(a) Profiles of tyrosine and HGO across 23 time points in four conditions (4-D, 21-D, 21-LL, and 32-D). Red and blue lines represent metabolite intensity and gene expression, respectively. (b) Merged network by combining the four condition-specific networks (4-D, 21-D, 21-LL, and 32-D). Node colors in the network stand for the functional classes to which transcripts belong. The width of edge in the network represents the number of conditions that a certain association between two nodes (corresponding metabolite and transcript) shares. In order to represent the most robust associations with tyrosine, only transcripts that are connected with tyrosine in all four conditions, and metabolites that are connected with HGO in at least two conditions, are displayed in this zoom-in merged transcript‒metabolite correlation network. (c) Box-plot of tyrosine intensity in wide-type (Col-0) plants and HGO mutant plants under normal and stress conditions (32-D). Tyrosine intensity is log2 transformed.
Common loci verified by GWAS, RIL, and IL datasets (using GWAS LOD ≥ 5.3)
| Trait | Chr | Left border of the locus (bp) | Right border of the locus (bp) | Number of genes in the locus | Confirmed by GWAS | Confirmed by RIL | Confirmed by IL |
|---|---|---|---|---|---|---|---|
| Nicotinic acid | 5 | 4746332 | 4787839 | 10 | T | T | T |
| Fructose | 2 | 16970258 | 17012067 | 12 | T | T | F |
| Leucine | 4 | 8231017 | 8322201 | 30 | T | T | F |
| Glutamic acid | 1 | 2743761 | 2788447 | 11 | T | F | T |
| Gluconic acid | 3 | 9052982 | 9095537 | 10 | T | F | T |
| Lysine | 3 | 464279 | 521747 | 24 | T | F | T |
| 4-Aminobutyric acid | 5 | 26854022 | 26883430 | 9 | T | F | T |
Permutation test of F-measure for the combined strategy with 10,000 iterations
| Reference Gene List | LOD = 5.3 | LOD = 4.5 | ||||
|---|---|---|---|---|---|---|
| Actual | Permuted | Actual | Permuted | |||
| RGL1 | 0.494 | 0.301 ± 0.055 | 5.00E-04 | 0.433 | 0.272±0.049 | 0.0007 |
| RGL2 | 0.682 | 0.494 ± 0.052 | 3.00E-04 | 0.619 | 0.448±0.047 | 2.00E-04 |
| RGL3 | 0.776 | 0.606 ± 0.048 | 4.00E-04 | 0.722 | 0.551±0.043 | 1.00E-04 |
| RGL4 | 0.85 | 0.758 ± 0.039 | 0.008 | 0.788 | 0.678±0.036 | 0.002 |
Detailed information of candidate associations discussed in the text, selected by integrating the results from GWAS, network analysis, RIL, and IL
| Metabolite | Gene | LOD | GWAS check | Networks check | RIL check | IL check | PCC | Annotation | Reference | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 21L | 21D | 4L | 4D | 32L | 32D | 21LL | 21HL | |||||||||
| Glycine | AT1G62800 | 4.77 | T | T | T | T | -0.66 | 0 | -0.73 | 0 | 0 | 0 | 0 | 0 | aspartate aminotransferase 4 | [ |
| Nicotinic acid | AT5G14780 | 6.16 | T | T | T | T | 0 | 0.79 | 0 | 0 | 0 | 0 | 0 | 0 | formate dehydrogenase | [ |
| Nicotinic acid | AT5G14760 | 6.16 | T | F | T | T | NA | NA | NA | NA | NA | NA | NA | NA | L-aspartate oxidase | [ |
| beta-alanine | AT4G39660 | 5.16 | T | T | F | F | 0.85 | 0.82 | 0 | 0 | 0 | 0.61 | 0.65 | 0 | alanine:glyoxylate aminotransferase 2 | [ |
| beta-alanine | AT5G53120 | 4.51 | T | T | F | F | -0.61 | -0.75 | -0.79 | -0.87 | 0 | -0.64 | -0.86 | 0 | spermidine synthase 3 | [ |
| Homoserine | AT2G17265 | 5.30 | T | F | F | F | NA | NA | NA | NA | NA | NA | NA | NA | homoserine kinase | [ |
| Tyrosine | AT5G54080 | NA | F | T | T | F | 0 | 0.78 | NA | 0.85 | NA | 0.73 | 0.74 | NA | Encodes a homogentisate 1,2-dioxygenase that can convert homogentisate to malylacetoacetate and is likely to be involved in tyrosine catabolism | [ |
| Tyrosine | AT5G53970 | NA | F | T | T | F | -0.69 | 0 | 0.68 | 0 | 0 | 0.67 | -0.65 | -0.57 | encodes tyrosine aminotransferase, strongly induced upon aging and coronatine treatment | [ |
| Tyramine | AT4G28680 | 8.89 | T | F | F | F | NA | NA | NA | NA | NA | NA | NA | NA | encode a stress-induced tyrosine decarboxylase | [ |
Value zero in PCC means that the actual absolute correlations are below the statistical PCC threshold and are rounded to 0. NA means that this association is not found in network analysis.