| Literature DB >> 29116144 |
Joseph L Gage1, Diego Jarquin2, Cinta Romay3, Aaron Lorenz4, Edward S Buckler3,5, Shawn Kaeppler1, Naser Alkhalifah1,6,7, Martin Bohn8, Darwin A Campbell6,7, Jode Edwards9, David Ertl10, Sherry Flint-Garcia11, Jack Gardiner12, Byron Good13, Candice N Hirsch4, Jim Holland14, David C Hooker15, Joseph Knoll16, Judith Kolkman17, Greg Kruger18, Nick Lauter9, Carolyn J Lawrence-Dill6,7, Elizabeth Lee13, Jonathan Lynch19, Seth C Murray20, Rebecca Nelson17,21, Jane Petzoldt1, Torbert Rocheford22, James Schnable2, Patrick S Schnable7, Brian Scully23, Margaret Smith21, Nathan M Springer24, Srikant Srinivasan25, Renee Walton6,7, Teclemariam Weldekidan26, Randall J Wisser26, Wenwei Xu27, Jianming Yu7, Natalia de Leon28.
Abstract
Remarkable productivity has been achieved in crop species through artificial selection and adaptation to modern agronomic practices. Whether intensive selection has changed the ability of improved cultivars to maintain high productivity across variable environments is unknown. Understanding the genetic control of phenotypic plasticity and genotype by environment (G × E) interaction will enhance crop performance predictions across diverse environments. Here we use data generated from the Genomes to Fields (G2F) Maize G × E project to assess the effect of selection on G × E variation and characterize polymorphisms associated with plasticity. Genomic regions putatively selected during modern temperate maize breeding explain less variability for yield G × E than unselected regions, indicating that improvement by breeding may have reduced G × E of modern temperate cultivars. Trends in genomic position of variants associated with stability reveal fewer genic associations and enrichment of variants 0-5000 base pairs upstream of genes, hypothetically due to control of plasticity by short-range regulatory elements.Entities:
Mesh:
Year: 2017 PMID: 29116144 PMCID: PMC5677005 DOI: 10.1038/s41467-017-01450-2
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Flowchart of experimental analyses. To investigate how putatively selected regions influence variation for G × E (a), 30 temperate and 30 tropical inbreds were used to calculate FST in 20 base pair sliding windows across the genome. Windows with mean FST > 0.5 were categorized as “high” FST, and windows with mean FST < 0.15 were categorized as “low” FST. SNPs from the Maize G × E project hybrids that were within high or low FST windows were categorized as high or low FST SNPs, and used to estimate G × E variances attributable to high and low FST regions of the genome. To investigate location of variants associated with G × E (b), hybrid phenotypes were regressed on the means of common hybrids at each location. The slope and mean squared errors from each hybrid’s regression were used as response variables in GWAS, and the 50 most significant SNPs from each GWAS were evaluated for their position relative to the nearest gene
Fig. 2MDS of genotypes used to selected 60 extreme individuals. Unique inbred individuals (n = 916) from the HapMap 3.1 visualized by multi-dimensional scaling (MDS). The temperate materials are bound by the blue box (coordinate 1 < −0.5, coordinate 2 > 0), and the tropical materials are bound by the green box (coordinate 1 > .5, coordinate 2 > 0). Two sets of 30 individuals were chosen from each box based on pedigree, genetic distance from others in the group (identity by state < 0.95), and quantity of missing SNP data
Fig. 3Comparisons of high and low FST SNPs. a Allele frequencies within the 30 temperate and 30 tropical inbred lines from Hapmap 3.1 for 736 high FST SNPs that overlap between Hapmap 3.1 and the G × E hybrid lines. Some SNPs with FST < 0.5 were designated as high FST because they lie in a window with mean FST > 0.5. b Histograms of FST distributions of 1248 high FST SNPs and 263,243 low FST SNPs from the G × E hybrid data set. FST values represent means of 20-SNP windows. c Distributions of the minor allele frequencies (MAF) in the G × E hybrid data set for 1248 high FST SNPs, 263,243 low FST SNPs, and the entire set of 372,273 polymorphic SNPs
Fig. 4Empirical distribution of estimated variance components for high and low FST G × E interaction. Distributions of G × E variance attributable to high and low FST SNPs for grain yield (a) and plant height (b) from 1000 replicated model fittings. 1248 high FST SNPs were included, while each model fitting used a subsample of 1248 low FST SNPs chosen randomly from the full set of 263,243 low FST SNPs. Proportion variance explained represents non-environmental model variance, i.e., was calculated using only genotype, G × E for high and low FST SNPs, and residual variances
Fig. 5Patterns of functional variation and classification of SNPs based on their distance to the nearest gene model. Proportions of 250 slope(type II stability)-associated, 250 MSE(type III stability)-associated, and 250 phenotype per-se-associated SNPs in genic, gene-proximal (0–5000 base pairs from nearest gene), and intergenic (>5000 base pairs from nearest gene) regions compared to a null distribution of proportions derived from all 413,796 SNPs. Text above each bar indicates sample sizes for each bin and two tailed p-values from an exact binomial test for the null hypothesis of underlying proportion equal to the null distribution. For , the Bonferroni multiple testing threshold is