| Literature DB >> 28338798 |
Dominic Knoch1, David Riewe1, Rhonda Christiane Meyer1, Anastassia Boudichevskaia2, Renate Schmidt2, Thomas Altmann1.
Abstract
To gain insight into genetic factors controlling seed metabolic composition and its relationship to major seed properties, an Arabidopsis recombinant inbred line (RIL) population, derived from accessions Col-0 and C24, was studied using an MS-based metabolic profiling approach. Relative intensities of 311 polar primary metabolites were used to identify associated genomic loci and to elucidate their interactions by quantitative trait locus (QTL) mapping. A total of 786 metabolic QTLs (mQTLs) were unequally distributed across the genome, forming several hotspots. For the branched-chain amino acid leucine, mQTLs and candidate genes were elucidated in detail. Correlation studies displayed links between metabolite levels, seed protein content, and seed weight. Principal component analysis revealed a clustering of samples, with PC1 mapping to a region on the short arm of chromosome IV. The overlap of this region with mQTL hotspots indicates the presence of a potential master regulatory locus of seed metabolism. As a result of database queries, a series of candidate regulatory genes, including bZIP10, were identified within this region. Depending on the search conditions, metabolic pathway-derived candidate genes for 40-61% of tested mQTLs could be determined, providing an extensive basis for further identification and characterization of hitherto unknown genes causal for natural variation of Arabidopsis seed metabolism.Entities:
Keywords: Arabidopsis thaliana; gas chromatography–mass spectrometry; metabolic quantitative trait locus; primary metabolism; recombinant inbred line; seed biology.
Mesh:
Year: 2017 PMID: 28338798 PMCID: PMC5444479 DOI: 10.1093/jxb/erx049
Source DB: PubMed Journal: J Exp Bot ISSN: 0022-0957 Impact factor: 6.992
Fig. 1.Distribution of mQTLs for metabolites of known chemical structure. Chromosomal locations of significant mQTLs for the 58 metabolites of known chemical structure and the seed protein content are indicated by boxes representing the 1.5-LOD QTL support intervals. Vertical black lines within the boxes indicate the apices of the corresponding LOD curves. The mQTLs are color-coded according to their significance [threshold at alpha of 0.05 (yellow), 0.01 (orange), 0.001 (red)] derived from permutation results of the genome-wide maximum LOD scores. Vertical lines represent marker positions. For a subset, their approximate distance in centiMorgans is indicated. Asterisks at the bottom correspond to the position of identified mQTL hotspots.
Comparison of detected mQTLs in seeds and leaf material
| Metabolite | Chromosome | Support interval |
| Support interval |
|
|---|---|---|---|---|---|
| Glycine | III | 17.27–23.28 Mbp | 5.42 | 16.24–17.78 Mbp | 8.00 |
| Malic acid | IV | 13.69–18.54 Mbp | 9.85 | 10.67–15.39 Mbp | 4.20 |
|
| I | 3.49–9.36 Mbp | 5.32 | 4.12–6.50 Mbp | 6.50 |
| Raffinose | III | 17.27–23.41 Mbp | 4.30 | 16.24–19.50 Mbp | 4.70 |
| Serine | II | 5.18–10.43 Mbp | 3.30 | 3.00–5.33 Mbp | 5.10 |
| Serine | III | 17.27–19.86 Mbp | 7.54 | 15.17–17.78 Mbp | 6.90 |
| Tyrosine | III | 14.30–23.41 Mbp | 3.36 | 11.77–17.78 Mbp | 4.20 |
| Tyrosine | V | 18.83–26.92 Mbp | 3.50 | 21.92–22.91 Mbp | 9.60 |
Estimated proportion of the phenotype variance explained by a QTL
Fig. 2.mQTL analysis and candidate gene identification for leucine. (A) LOD profiles were plotted for all five Arabidopsis chromosomes. Gray lines represent LOD profiles calculated with the ‘cim’ function (composite interval mapping). Gray dots indicate selected cofactors. The horizontal dashed gray line corresponds to a CIM alpha threshold of 0.05, estimated by 10 000 permutations. The solid black lines indicate LOD profiles calculated with the ‘stepwiseqtl’ function using a multiple QTL model. The positions of the QTL apices in centiMorgans are given above the curves. (B) A simplified genetic map with known and putative genes involved in leucine biosynthesis and degradation. Purple horizontal lines indicate the locations of genes, directly or indirectly involved in leucine metabolism. Leucine mQTLs were identified on chromosomes II, III, IV, and V. Support intervals are shown as red vertical lines beside the chromosomes. Leucine-related genes, located within the confidence intervals of the mQTLs, are indicated. Identified candidate genes for chromosome II are AT2G23170 (GH3.3), AT2G26800 (HML1), and AT2G31810, for chromosome III AT3G48560 (AHAS) and AT3G49680 (BCAT3), for chromosome IV AT4G27260 (GH3.5), and for chromosome V AT5G65780 (BCAT5). (C) Boxplots of normalized and median divided leucine abundances in seeds of RILs. Samples were subdivided into four groups according to the allelic state at the epistatically interacting loci on chromosomes IV and V. Significant differences between the groups are indicated by upper case letters (ANOVA with post-hoc Tukey HSD, Padj<0.001; number of individuals: nC24/C24=113, nCol-0/C24=20, nC24/Col-0 =82, nCol-0/Col-0=149). (D) Boxplots of normalized and median divided leucine abundances in seeds of parental and reciprocal F1 hybrid plants derived from an independent experiment. Significant differences between the groups are indicated by upper case letters (ANOVA with post-hoc Tukey HSD, Padj<0.05; number of individuals: nC24=7, nC24×Col-0=5, nCol-0×C24=5, nCol-0=5).
Summary of mQTL hotspots
| Chromosome | Marker | Position (kbp) | Position (cM) | Number of mQTLs |
|---|---|---|---|---|
| II | M2_4269 | 8410.151 | 32.48 | 16 |
| II | MASC02644 | 10 428.938 | 41.29 | 20 |
| II | MASC09222 | 14 375.406 | 58.38 | 34 |
| III | MASC09224 | 18 501.466 | 68.17 | 44 |
| III | MASC02788 | 20 744.711 | 78.77 | 32 |
| IV | MASC04123 | 301.329 | 4.15 | 27 |
| IV | MASC04725 | 1092.491 | 10.21 | 35 |
| IV | MASC05042 | 2188.362 | 12.90 | 44 |
| IV | MASC04685 | 5230.768 | 14.01 | 16 |
| V | MASC09209 | 7717.922 | 26.27 | 94 |
| V | MASC09211 | 25 579.812 | 92.79 | 15 |
Fig. 3.Principal component analysis of metabolite data. Score plot of the first two principal components PC1 and PC2 explaining 41% and 20% of variance of the data set, respectively. Samples were colored according to the genotype information on chromosome IV/marker: ‘MASC05042’ (12.90 cM). Black, red, and green circles correspond to Col-0, C24, and heterozygous alleles, respectively. Data were normalized, Pareto scaled, and mean centered prior to the calculation of the principal components.
Selection of transcription factor (TF) genes within the mQTL hotspot on chromosome IV expressed in seeds
| AGI locus identifier | TF family | General expression profile | Seed-specific expression profile |
|---|---|---|---|
|
| HB | Ubiquitous | Intermediate development |
|
| bZIP | Seed specific | Late development and mature seeds |
|
| WRKY | Preferentially in seeds | Late development |
|
| MYB-related | Seeds and other organs | Mature seeds |
|
| bHLH | Uubiquitous | Intermediate development |
|
| ABI3VP1 | Ubiquitous | Early and intermediate development |
|
| ABI3VP1 | Seeds and other organs | Intermediate development |
|
| SET | Ubiquitous | Early development |
|
|
|
|
|
|
| C2H2 | Seeds and other organs | Early development |
|
| ABI3VP1 | Seeds and other organs | Late development |
|
| PDF2 | Ubiquitous | Intermediate development |
According to Arabidopsis eFP Browser 2.0