| Literature DB >> 32978264 |
Sarah D Turner-Hissong1, Kevin A Bird1, Alexander E Lipka2, Elizabeth G King1, Timothy M Beissinger3,4, Ruthie Angelovici5.
Abstract
Plant growth, development, and nutritional quality depends upon amino acid homeostasis, especially in seeds. However, our understanding of the underlying genetics influencing amino acid content and composition remains limited, with only a few candidate genes and quantitative trait loci identified to date. Improved knowledge of the genetics and biological processes that determine amino acid levels will enable researchers to use this information for plant breeding and biological discovery. Toward this goal, we used genomic prediction to identify biological processes that are associated with, and therefore potentially influence, free amino acid (FAA) composition in seeds of the model plant Arabidopsis thaliana Markers were split into categories based on metabolic pathway annotations and fit using a genomic partitioning model to evaluate the influence of each pathway on heritability explained, model fit, and predictive ability. Selected pathways included processes known to influence FAA composition, albeit to an unknown degree, and spanned four categories: amino acid, core, specialized, and protein metabolism. Using this approach, we identified associations for pathways containing known variants for FAA traits, in addition to finding new trait-pathway associations. Markers related to amino acid metabolism, which are directly involved in FAA regulation, improved predictive ability for branched chain amino acids and histidine. The use of genomic partitioning also revealed patterns across biochemical families, in which serine-derived FAAs were associated with protein related annotations and aromatic FAAs were associated with specialized metabolic pathways. Taken together, these findings provide evidence that genomic partitioning is a viable strategy to uncover the relative contributions of biological processes to FAA traits in seeds, offering a promising framework to guide hypothesis testing and narrow the search space for candidate genes.Entities:
Keywords: Arabidopsis; GenPred; MultiBLUP; Shared Data Resources; amino acids; complex traits; genomic prediction
Mesh:
Substances:
Year: 2020 PMID: 32978264 PMCID: PMC7642941 DOI: 10.1534/g3.120.401240
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Summary of selected biological pathways
| Pathway | Number of genes | Number of SNPs | MapMan BINCODE |
|---|---|---|---|
| amino acid synthesis | 376 | 2084 | 13.1 |
| amino acid degradation | 160 | 1094 | 13.2 |
| amino acid transport | 144 | 939 | 34.3 |
| glycolysis | 148 | 858 | 4 |
| TCA cycle | 167 | 926 | 8 |
| ATP synthesis (alternative oxidase) | 10 | 66 | 9.4 |
| isoprenoids | 269 | 1788 | 16.1 |
| phenylpropanoids | 161 | 845 | 16.2 |
| nitrogen containing | 39 | 229 | 16.4 |
| sulfur containing | 113 | 733 | 16.5 |
| flavonoids | 171 | 1062 | 16.8 |
| amino acid activation | 203 | 1231 | 29.1 |
| protein synthesis | 1383 | 7290 | 29.2 |
| protein targeting | 624 | 3689 | 29.3 |
| protein postranslational modification | 1407 | 8794 | 29.4 |
| protein degradation | 996 | 6405 | 29.5 |
| ubiquitin | 2691 | 16000 | 29.5.11 |
| protein folding | 138 | 814 | 29.6 |
| protein glycolysis | 87 | 459 | 29.7 |
| protein assembly | 44 | 312 | 29.8 |
Pathways include genes and SNPs within a 2.5 kb buffer before the start and after the stop position of each gene.
Genomic prediction results for 65 free amino acid (FAA) traits using a GBLUP model
| Predictive ability (r) | Reliability ( | |||||||
|---|---|---|---|---|---|---|---|---|
| Trait type | Metabolic family | Trait | mean | SE | mean | SE | slope (bias) | RMSE |
| absolute | aspartate | asp | 0.341 | 0.028 | 0.157 | 0.020 | 0.936 | 0.055 |
| met | 0.384 | 0.027 | 0.224 | 0.024 | 1.051 | 0.0243 | ||
| thr | 0.099 | 0.022 | 0.075 | 0.013 | 0.756 | 0.1348 | ||
| AspFam | 0.327 | 0.031 | 0.159 | 0.022 | 1.042 | 0.0266 | ||
| BCAA_pyruvate | ala | 0.317 | 0.022 | 0.178 | 0.021 | 1.119 | 0.249 | |
| ile | 0.328 | 0.021 | 0.206 | 0.023 | 1.046 | 0.0583 | ||
| leu | 0.353 | 0.019 | 0.179 | 0.017 | 1.035 | 0.1261 | ||
| lys | 0.270 | 0.024 | 0.122 | 0.017 | 0.921 | 0.4597 | ||
| val | 0.397 | 0.019 | 0.242 | 0.022 | 1.027 | 0.1007 | ||
| BCAA | 0.388 | 0.019 | 0.252 | 0.023 | 1.042 | 0.101 | ||
| PyrFam | 0.351 | 0.021 | 0.249 | 0.026 | 1.108 | 0.12 | ||
| glutamate | arg | 0.323 | 0.029 | 0.146 | 0.023 | 0.923 | 0.0552 | |
| gln | 0.178 | 0.025 | 0.114 | 0.018 | 1.032 | 0.5348 | ||
| glu | 0.356 | 0.020 | 0.234 | 0.023 | 0.990 | 0.0072 | ||
| his | 0.359 | 0.020 | 0.149 | 0.014 | 0.880 | 1.0529 | ||
| pro | 0.310 | 0.021 | 0.182 | 0.021 | 1.010 | 0.0405 | ||
| GluFam | 0.389 | 0.020 | 0.199 | 0.018 | 0.916 | 0.0297 | ||
| serine | gly | 0.349 | 0.023 | 0.344 | 0.038 | 1.072 | 0.1086 | |
| ser | 0.241 | 0.021 | 0.142 | 0.019 | 1.078 | 0.0016 | ||
| SerFam | 0.320 | 0.022 | 0.247 | 0.028 | 1.030 | 0.0112 | ||
| aromatic | phe | 0.326 | 0.021 | 0.178 | 0.019 | 1.084 | 0.0202 | |
| trp | 0.317 | 0.018 | 0.209 | 0.021 | 1.019 | 0.0877 | ||
| tyr | 0.334 | 0.027 | 0.212 | 0.026 | 1.046 | 0.0266 | ||
| ShikFam | 0.411 | 0.015 | 0.229 | 0.016 | 1.023 | 0.0146 | ||
| Total | 0.392 | 0.022 | 0.193 | 0.018 | 1.015 | 0.019 | ||
| relative | aspartate | asp_t | 0.405 | 0.022 | 0.189 | 0.017 | 0.933 | 0.0386 |
| met_t | 0.313 | 0.025 | 0.157 | 0.017 | 1.021 | 0.0162 | ||
| BCAA_pyruvate | ala_t | 0.261 | 0.026 | 0.154 | 0.022 | 1.239 | 5.1795 | |
| ile_t | 0.218 | 0.021 | 0.127 | 0.017 | 1.085 | 0.0197 | ||
| leu_t | 0.296 | 0.022 | 0.122 | 0.015 | 1.093 | 0.0957 | ||
| lys_t | 0.196 | 0.023 | 0.125 | 0.019 | 1.112 | 0.8788 | ||
| val_t | 0.319 | 0.022 | 0.271 | 0.030 | 1.106 | 0.0138 | ||
| glutamate | arg_t | 0.276 | 0.037 | 0.220 | 0.042 | 1.056 | 0.0145 | |
| gln_t | 0.108 | 0.022 | 0.118 | 0.019 | 1.322 | 5.0853 | ||
| glu_t | 0.264 | 0.021 | 0.168 | 0.020 | 1.008 | 0.0355 | ||
| his_t | 0.259 | 0.024 | 0.123 | 0.016 | 1.076 | 37.6486 | ||
| pro_t | 0.253 | 0.019 | 0.134 | 0.017 | 1.022 | 0.0228 | ||
| serine | gly_t | 0.268 | 0.026 | 0.567 | 0.082 | 1.127 | 0.0155 | |
| ser_t | 0.076 | 0.023 | 0.118 | 0.019 | 1.452 | 0.0185 | ||
| aromatic | phe_t | 0.355 | 0.016 | 0.169 | 0.014 | 1.047 | 0.0181 | |
| trp_t | 0.205 | 0.024 | 0.110 | 0.015 | 0.940 | 0.0381 | ||
| tyr_t | 0.116 | 0.027 | 0.146 | 0.018 | 1.112 | 0.0118 | ||
| family | aspartate | asp_AspFam | 0.141 | 0.031 | 0.134 | 0.022 | 1.309 | 0.0782 |
| ile_AspFam | 0.165 | 0.029 | 0.139 | 0.021 | 1.197 | 0.0772 | ||
| lys_AspFam | 0.358 | 0.027 | 0.164 | 0.021 | 0.933 | 0.0189 | ||
| met_AspFam | 0.468 | 0.020 | 0.244 | 0.018 | 1.060 | 0.001 | ||
| thr_AspFam | 0.118 | 0.024 | 0.090 | 0.013 | 1.049 | 0.0552 | ||
| AspFam_Asp | 0.171 | 0.022 | 0.052 | 0.007 | 1.042 | 0.027 | ||
| BCAA_pyruvate | ala_PyrFam | 0.216 | 0.019 | 0.092 | 0.013 | 0.905 | 0.0222 | |
| ile_BCAA | 0.250 | 0.020 | 0.083 | 0.011 | 0.914 | 0.0348 | ||
| leu_BCAA | 0.251 | 0.024 | 0.091 | 0.012 | 1.076 | 0.075 | ||
| leu_PyrFam | 0.303 | 0.021 | 0.114 | 0.015 | 0.975 | 0.0244 | ||
| val_BCAA | 0.268 | 0.020 | 0.091 | 0.011 | 0.848 | 0.0224 | ||
| val_PyrFam | 0.298 | 0.019 | 0.232 | 0.024 | 0.858 | 0.0153 | ||
| glutamate | arg_GluFam | 0.205 | 0.032 | 0.193 | 0.029 | 1.243 | 0.0659 | |
| gln_GluFam | 0.167 | 0.034 | 0.195 | 0.039 | 1.076 | 0.0218 | ||
| glu_GluFam | 0.139 | 0.022 | 0.153 | 0.022 | 0.992 | 1.1693 | ||
| GluFam_glu | 0.203 | 0.034 | 0.202 | 0.030 | 0.881 | 0.0665 | ||
| his_GluFam | 0.186 | 0.023 | 0.102 | 0.015 | 1.012 | 24.2851 | ||
| pro_GluFam | 0.289 | 0.024 | 0.155 | 0.018 | 1.004 | 0.0329 | ||
| serine | gly_SerFam | 0.305 | 0.025 | 0.351 | 0.048 | 1.172 | 0.0634 | |
| ser_SerFam | 0.325 | 0.024 | 0.364 | 0.047 | 1.179 | 0.0461 | ||
| aromatic | phe_ShikFam | 0.218 | 0.028 | 0.149 | 0.021 | 1.097 | 0.0607 | |
| trp_ShikFam | 0.187 | 0.024 | 0.111 | 0.015 | 1.028 | 0.0658 | ||
| tyr_ShikFam | 0.257 | 0.028 | 0.144 | 0.022 | 0.945 | 0.0331 | ||
Traits are grouped by the type of trait (absolute level, relative to total FAA content, and family ratio) and metabolic family based on shared precursor. SE, standard error; RMSE, root mean squared error.
Figure 1Genomic prediction performed well for a higher proportion of absolute traits compared to relative and family-based ratio traits. Boxplots show free amino acid traits with predictive ability (r) > 0.3 based on genomic best linear unbiased prediction (GBLUP). Black triangles indicate the genomic heritability for each trait. Colors indicate whether the trait is an absolute level, relative level, or family-based ratio. Each point represents an individual cross-validation.
Figure 2Biological pathways explain significant variation and improve predictive ability for free amino acid traits when incorporated into a MultiBLUP model. (A) Venn diagram showing which trait-pathway combinations passed significance criteria (FDR adjusted P-value ≤ 0.10) for proportion of heritability explained (Prop. h2), likelihood ratio test statistic (LRT), and improved predictive ability for MultiBLUP compared to GBLUP. The bottom right corner indicates the number of combinations that did not pass any significance criteria. The Venn diagram was constructed using the ‘seqsetvis’ package in R (Boyd 2019). (B) Points indicate trait-pathway combinations that passed all three significance criteria. The diameter of each point is proportional to the amount of genomic variance explained by pathway SNPs in the MultiBLUP model. Traits are included on the y-axis and are grouped by metabolic family (aspartate, glutamate, pyruvate/BCAA, serine, aromatic). Pathways are included on the x-axis and separated into amino acid, core, specialized, and protein metabolism categories.
Free amino acid traits and pathway combinations for which the MultiBLUP model explained a significant proportion of heritability, improved model fit relative to random gene groups of approximately the same size, and increased predictive ability compared to GBLUP
| Prop. h2 explained | Likelihood ratio | Predictive ability | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Category | Pathway | Trait | Prop. h2 | FDR | LR | FDR | Δr | FDR | Δ | Δslope | ΔRMSE | |||
| amino acid | aa_degradation | his | 0.202 | 0.001 | 0.020 | 13.000 | <0.001 | <0.001 | 0.039 | <0.001 | <0.001 | 0.026 | −0.022 | −1.46E-02 |
| aa_degradation | ile_t | 0.412 | 0.003 | 0.060 | 5.689 | 0.007 | 0.100 | 0.039 | 0.006 | 0.037 | 0.042 | −0.045 | −1.60E-04 | |
| aa_degradation | met_t | 0.268 | 0.003 | 0.060 | 6.382 | 0.004 | 0.080 | 0.027 | 0.003 | 0.012 | 0.015 | −0.079 | −1.57E-04 | |
| aa_degradation | val_BCAA | 0.218 | 0.002 | 0.040 | 8.024 | 0.004 | 0.080 | 0.042 | 0.001 | 0.006 | 0.027 | 0.003 | −4.67E-04 | |
| aa_synthesis | val | 0.336 | 0.003 | 0.030 | 4.507 | 0.007 | 0.067 | 0.017 | 0.006 | 0.064 | 0.016 | −0.036 | −8.14E-04 | |
| aa_synthesis | BCAA | 0.439 | 0.002 | 0.040 | 8.823 | 0.001 | 0.020 | 0.044 | <0.001 | <0.001 | 0.050 | −0.064 | −2.16E-03 | |
| specialized | isoprenoids | ile_t | 0.429 | 0.008 | 0.080 | 4.575 | 0.010 | 0.100 | 0.044 | 0.001 | 0.005 | 0.032 | −0.078 | −2.04E-04 |
| S_containing | ShikFam | 0.301 | 0.001 | 0.020 | 10.810 | 0.001 | 0.020 | 0.032 | <0.001 | 0.001 | 0.036 | −0.024 | −2.39E-04 | |
| protein | ||||||||||||||
| protein_folding | his | 0.146 | 0.009 | 0.090 | 11.520 | 0.002 | 0.020 | 0.021 | 0.011 | 0.055 | 0.013 | −0.009 | −8.36E-03 | |
| protein_folding | SerFam | 0.303 | 0.005 | 0.100 | 6.487 | 0.001 | 0.020 | 0.032 | 0.001 | 0.008 | 0.042 | −0.033 | −1.11E-04 | |
| protein_synthesis | total | 0.535 | 0.002 | 0.040 | 6.418 | <0.001 | <0.001 | 0.024 | 0.001 | 0.006 | 0.018 | −0.025 | −2.00E-04 | |
| protein_synthesis | PyrFam | 0.809 | 0.005 | 0.100 | 5.298 | 0.003 | 0.060 | 0.026 | 0.003 | 0.029 | 0.032 | −0.030 | −1.50E-03 | |
Bolded rows indicate trait and pathway combinations that increased predictive ability by more than 5% compared to a GBLUP model. The difference in slope between the MultiBLUP and GBLUP models was computed as . RMSE, root mean squared error.
Figure 3Pathway size influences the proportion of heritability explained and predictive ability when using a MultiBLUP model. (A) Spearman’s rank correlations between off-diagonal elements of the kinship matrices for each pathway and the remaining genomic SNPs. Pathways are sorted from top to bottom by increasing size (number of SNPs). (B) Difference in predictive ability between the MultiBLUP and GBLUP models compared to the proportion of heritability explained by each pathway for all 1300 trait-pathway combinations (65 traits, 20 pathways). The diameter of the points is proportional to the number of SNPs in the pathway and color indicates whether or not a trait-pathway combination passed significance thresholds for proportion of heritability explained, likelihood ratio test statistic, and predictive ability.