| Literature DB >> 24023610 |
Sophie Bouchet1, Bertrand Servin, Pascal Bertin, Delphine Madur, Valérie Combes, Fabrice Dumas, Dominique Brunel, Jacques Laborde, Alain Charcosset, Stéphane Nicolas.
Abstract
The migration of maize from tropical to temperate climates was accompanied by a dramatic evolution in flowering time. To gain insight into the genetic architecture of this adaptive trait, we conducted a 50K SNP-based genome-wide association and diversity investigation on a panel of tropical and temperate American and European representatives. Eighteen genomic regions were associated with flowering time. The number of early alleles cumulated along these regions was highly correlated with flowering time. Polymorphism in the vicinity of the ZCN8 gene, which is the closest maize homologue to Arabidopsis major flowering time (FT) gene, had the strongest effect. This polymorphism is in the vicinity of the causal factor of Vgt2 QTL. Diversity was lower, whereas differentiation and LD were higher for associated loci compared to the rest of the genome, which is consistent with selection acting on flowering time during maize migration. Selection tests also revealed supplementary loci that were highly differentiated among groups and not associated with flowering time in our panel, whereas they were in other linkage-based studies. This suggests that allele fixation led to a lack of statistical power when structure and relatedness were taken into account in a linear mixed model. Complementary designs and analysis methods are necessary to unravel the architecture of complex traits. Based on linkage disequilibrium (LD) estimates corrected for population structure, we concluded that the number of SNPs genotyped should be at least doubled to capture all QTLs contributing to the genetic architecture of polygenic traits in this panel. These results show that maize flowering time is controlled by numerous QTLs of small additive effect and that strong polygenic selection occurred under cool climatic conditions. They should contribute to more efficient genomic predictions of flowering time and facilitate the dissemination of diverse maize genetic resources under a wide range of environments.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24023610 PMCID: PMC3758321 DOI: 10.1371/journal.pone.0071377
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Diversity statistics computed using 29911 Panzea SNP markers over the whole panel and five main genetic groups.
| Group | Abb | Nq
| Nq>0.8 | Mean Sim |
| Polym |
| Northern Flint | NF | 57 | 30 | 0.73 | 0.276 | 0.84 |
| European Flint | EF | 62 | 39 | 0.70 | 0.299 | 0.89 |
| Iowa Stiff Stalk | SS | 26 | 9 | 0.80 | 0.214 | 0.52 |
| Corn Belt Dent | CBD | 115 | 109 | 0.65 | 0.349 | 0.98 |
| Tropical | Trop | 76 | 55 | 0.67 | 0.339 | 0.98 |
| Whole panel | S1P9 | 336 | 242 | 0.64 | 0.36 | - |
Abb: abbreviation describing the group,
Nq = ∑i δiq with i the index of lines and q the index of groups, δiq the assignment proportion of line i to group q according to STRUCTURE software [40], [41] using 55 SSRs, Nq the number of lines in group q;
number of lines assigned to group q with a genome proportion above 0.8;
mean similarity within each group, calculated with IBS (identity by state);
expected heterozygosity inside each group computed with r-Hierfstat [46];
f percentage of polymorphic loci within each group,
statistics were computed considering lines with assignment to one group above 0.8.
Extent of linkage disequilibrium and number of markers needed to reach average r = 0.1 for individual chromosomes and the whole genome.
| Chromosome |
|
| #Markers |
| 1 | 111.11 (2711) | 78.61 (3832) | 6892 |
| 2 | 230.50 (1028) | 183.25 (1293) | 4963 |
| 3 | 191.37 (1213) | 146.75 (1582) | 4938 |
| 4 | 286.24 (843) | 233.92 (1031) | 4784 |
| 5 | 144.89 (1503) | 112.97 (1927) | 4711 |
| 6 | 156.89 (1078) | 121.36 (1393) | 3456 |
| 7 | 199.92 (881) | 154.89 (1137) | 3562 |
| 8 | 303.90 (578) | 211.58 (831) | 3719 |
| 9 | 228.62 (685) | 176.52 (887) | 3129 |
| 10 | 289.97 (518) | 237.90 (631) | 3070 |
| Whole genome | 197.75 (10402) | 152.37 (13500) | 43224 |
LD extent was computed with 43224 SNPs having MAF>0.05.
Physical distance to reach r or r equal to 0.1 estimated using the non-linear regression implemented in r-nls obtained by fitting Hill and Weir model [53]. r and r obtained with Plink software [50] and r-LDcorSV [34], respectively. The number reported in brackets indicates the number of equidistant markers that would be needed to reach an average r of 0.1 between adjacent markers;
number of markers used to estimate LD extent for each chromosome and the whole genome.
Figure 1Genome-wide linkage disequilibrium between all loci within and between chromosomes.
The upper triangle corresponds to the LD r measure calculated with Plink [50], the lower triangle to the LD r measure [34] corresponding to r corrected for structure, using the Q matrix obtained with STRUCTURE software [40], [41] and 55 SSRs. Values above 0.2 are highlighted by colored dots.
Figure 2Manhattan plot for female flowering (FFLW8) associations across the whole genome.
P-values were obtained with the mixed model including the structure matrix obtained using STRUCTURE software and 55 SSRs and the kinship matrix obtained with 94 SSRs and IBS measure. Horizontal dashed line indicates Bonferroni-corrected 5% significance threshold.
Figure 3Distribution of SNP effects (GDD) according to the early allele frequency.
In this figure, 673 markers with P-value<10−3 are represented. (A) For all inbred lines (rows) and SNPs (columns), red and blue colors correspond to the presence of late and early alleles, respectively. (B) The absolute SNP effect versus the frequency of the early allele for significant associations. Different colors correspond to different P-value thresholds.
Regions associated with FFLW8 (P-value below 10−5)) and/or under selection (log) above 0.5).
| Nb | Marker Name | Cent | Chr | Pos |
| Effect |
|
|
| PO |
| Meta QTL | Gene ID | Gene Descr | Gene Dist (bp) |
| 1 | PZE-101106079 | 0 | 1 | 109 394 965 | 1.39E-01 | −19.41 | 1.00E-16 | 0.02 | 0.07 | 0.77 | 0.51 | 1_5 | GRMZM2G041613 | uncharacterized | 0 |
| 2 | SYN13483 | 0 | 1 | 249 324 652 | 6.64E-06 | −82.29 | 1.05E-05 | 0.03 | 0.11 | −1.69 | 0.00 | 1_10 | GRMZM2G103843 | ribokinase activity | 0 |
| 3 | SYN19258 | 0 | 1 | 278 034 859 | 7.75E-01 | −3.98 | 4.56E-02 | 0.00 | 0.02 | 1.49 | 0.59 | - | GRMZM2G328224 | transferase activity on hexosyl groups | 0 |
| 4 | PZE-102079486 | 0 | 2 | 62 516 828 | 3.43E-06 | −70.49 | 4.42E-12 | 0.04 | 0.08 | −2.09 | 0.13 | 2_4 | GRMZM2G084462 | tRNA isopentenyltransferase activity | 0 |
| 5 | PZE-102086452 | 1 | 2 | 78 699 684 | 3.87E-08 | 76.55 | 1.00E-16 | 0.06 | 0.23 | −1.62 | 0.34 | 2_4 | GRMZM2G474153 | protein kinase activity | 0 |
| 6 | SYN12061 | 0 | 2 | 199 457 622 | 7.10E-06 | 51.01 | 5.64E-07 | 0.04 | 0.09 | −1.45 | 0.00 | 2_5 | GRMZM2G400173 | transporter activity | 0 |
| 7 | PZE-103099066 | 0 | 3 | 159 526 813 | 1.E-06 | 62.64 | 1.E-16 | 0.06 | 0.14 | −2.22 | 0.16 | - | GRMZM5G891247 | enzyme inhibitor activity | 47 249 |
| 8 | PZE-103098863 | 0 | 3 | 159 170 263 | 6.48E-01 | −7.92 | 7.53E-14 | 0.00 | 0.06 | 1.41 | 0.68 | - | GRMZM2G082608 | uncharacterized | 0 |
| 9 | PZE-103145047 | 0 | 3 | 199 953 486 | 2.49E-07 | −61.64 | 1.00E-16 | 0.04 | 0.13 | −2.08 | 0.37 | - | GRMZM5G886883 | uncharacterized | 0 |
| 10 | PZE-104010434 | 0 | 4 | 7 567 350 | 3.03E-01 | 14.95 | 3.56E-02 | 0.00 | 0.00 | 1.05 | 0.55 | - | GRMZM2G012821 | uncharacterized | 49 496 |
| 11 | PZE-104013311 | 0 | 4 | 11 576 332 | 3.94E-07 | 65.35 | 7.35E-05 | 0.05 | 0.17 | −2.02 | 0.22 | - | GRMZM2G113241 | uncharacterized | 50 532 |
| 12 | PZE-104080388 | 0 | 4 | 154 689 801 | 2.77E-08 | −85.74 | 3.33E-16 | 0.03 | 0.00 | −1.92 | 0.20 | - | GRMZM2G138407 | transcription factor activity | 106 768 |
| 13 | PZE-105049333 | 0 | 5 | 40 625 928 | 3.77E-01 | 12.11 | 5.02E-01 | 0.01 | 0.02 | 1.14 | 0.58 | 5_4 | GRMZM2G150893 | peroxidase activity | 119 669 |
| 14 | PZE-105082545 | 0 | 5 | 97 872 011 | 1.28E-07 | −66.39 | 5.43E-12 | 0.04 | 0.10 | −1.75 | 0.50 | - | GRMZM2G018962 | exostosin-like protein | 351 398 |
| 15 | PZE-105130917 | 0 | 5 | 187 848 427 | 4.93E-06 | −63.67 | 4.10E-13 | 0.03 | 0.07 | −2.18 | 0.15 | 5_6 | GRMZM2G159918 | transferase activity on hexosyl groups | 0 |
| 16 | PZE-106032161 | 0 | 6 | 74 959 244 | 2.01E-06 | 59.17 | 1.11E-16 | 0.03 | 0.08 | −2.15 | 0.34 | 6_1 | GRMZM2G005499 | transmembrane transport | 0 |
| 17 | SYN38078 | 0 | 6 | 158 225 752 | 4.43E-06 | −61.61 | 5.16E-06 | 0.03 | 0.02 | −2.00 | 0.17 | 6_3 | GRMZM2G074792 | intracellular zinc ion binding | 0 |
| 18 | SYN18613 | 0 | 7 | 150 876 445 | 5.64E-01 | 8.61 | 9.32E-01 | 0.00 | 0.00 | 3.10 | 0.67 | - | GRMZM2G038449 | uncharacterized | 11 523 |
| 19 | PZE-108038931 | 1 | 8 | 45 270 084 | 9.38E-08 | −69.31 | 8.99E-15 | 0.03 | 0.03 | −1.81 | 0.37 | - | GRMZM2G142870 | ATPase activity | 0 |
| 20 | PZE-108035509 | 1 | 8 | 52 084 948 | 3.05E-02 | 35.04 | 9.56E-09 | 0.01 | 0.12 | 2.85 | 0.61 | - | GRMZM2G006585 | zinc ion binding | 5783 |
| 21 | PZE-108053909 | 0 | 8 | 95 953 461 | 5.50E-01 | 8.45 | 6.87E-02 | 0.01 | 0.03 | 1.32 | 0.58 | 8_3 | GRMZM2G074645 | transcription factor activity | 174 238 |
| 22 | SYN12978 | 0 | 8 | 120 970 129 | 6.93E-01 | 5.07 | 1.00E-16 | 0.01 | 0.00 | 0.63 | 0.36 | - | GRMZM2G473111 | uncharacterized | 0 |
| 23 | PZE-108070380 | 0 | 8 | 123 506 141 | 5.88E-13 | 107.83 | 1.00E-16 | 0.09 | 0.24 | 3.10 | 0.71 | - | GRMZM2G179264 | ZCN8 protein | 5056 |
| 24 | PZE-108081330 | 0 | 8 | 138 262 446 | 2.E-07 | 81.15 | 1.E-16 | 0.06 | 0.34 | −1.58 | 0.34 | - | GRMZM2G164341 | transcription factor activity | 0 |
| 25 | PZE-108089589 | 0 | 8 | 146 555 253 | 2.32E-01 | −18.36 | 1.11E-03 | 0.00 | 0.00 | 1.02 | 0.58 | - | GRMZM2G094165 | carbonate dehydratase activity, binding | 0 |
| 26 | PZE-109008014 | 0 | 9 | 8 944 436 | 9.45E-08 | 67.29 | 7.12E-12 | 0.03 | 0.02 | −2.07 | 0.20 | - | GRMZM2G142072 | transcription elongation regulator activity | 0 |
| 27 | PZD00049.4 | 0 | 9 | 17 019 800 | 2.E-06 | 72.63 | 1.E-16 | 0.04 | 0.10 | −1.70 | 0.32 | - | AC197699.3 | transcription factor activity | 0 |
| 28 | PZE-109038470 | 0 | 9 | 57 186 011 | 1.70E-07 | 94.50 | 1E-16 | 0.04 | 0.17 | −2.10 | 0.19 | - | GRMZM2G076272 | transcription factor activity | 0 |
Illumina marker name;
if marker in centromeric region according to maize GDB and. [66] flanking markers, then 1, else 0;
physical chromosome;
physical position in RefGen_v2;
marker P-value for female flowering in growing degree days base temperature 8 (FFLW8) according to model Q55SSRs+KIBS(94SSRs);
marker effect on FFLW8 according to model Q55SSRs+KIBS(94SSRs);
marker P-value for FFLW8 according to naïve model;
percentage of variance explained by the marker in a linear model Q55SSRs;
percentage of variance explained by the marker in a linear mixed model Q55SSRs+KIBS(94SSRs);
log(posterior odds) for selection test obtained with BayeScan [47], significant (highly) when >0 (>1);
differentiation index calculated with r-Hierfstat [46] using non-admixed lines;
meta QTL name (adapted from [10]): meta QTL X_Y with X the chromosome number et Y the QTL number;
name of closest annotated gene according to RefGen_v2;
closest annotated gene function;
distance between marker and closest annotated gene.