| Literature DB >> 29176988 |
Jorge Urrestarazu1,2,3, Hélène Muranty1, Caroline Denancé1, Diane Leforestier1, Elisa Ravon1, Arnaud Guyader1, Rémi Guisnel1, Laurence Feugey1, Sébastien Aubourg1, Jean-Marc Celton1, Nicolas Daccord1, Luca Dondini2, Roberto Gregori2, Marc Lateur4, Patrick Houben4, Matthew Ordidge5, Frantisek Paprstein6, Jiri Sedlak6, Hilde Nybom7, Larisa Garkava-Gustavsson8, Michela Troggio9, Luca Bianco9, Riccardo Velasco9, Charles Poncet10, Anthony Théron10, Shigeki Moriya1,11, Marco C A M Bink12,13, François Laurens1, Stefano Tartarini2, Charles-Eric Durel1.
Abstract
Deciphering the genetic control of flowering and ripening periods in apple is essential for breeding cultivars adapted to their growing environments. We implemented a large Genome-Wide Association Study (GWAS) at the European level using an association panel of 1,168 different apple genotypes distributed over six locations and phenotyped for these phenological traits. The panel was genotyped at a high-density of SNPs using the Axiom®Apple 480 K SNP array. We ran GWAS with a multi-locus mixed model (MLMM), which handles the putatively confounding effect of significant SNPs elsewhere on the genome. Genomic regions were further investigated to reveal candidate genes responsible for the phenotypic variation. At the whole population level, GWAS retained two SNPs as cofactors on chromosome 9 for flowering period, and six for ripening period (four on chromosome 3, one on chromosome 10 and one on chromosome 16) which, together accounted for 8.9 and 17.2% of the phenotypic variance, respectively. For both traits, SNPs in weak linkage disequilibrium were detected nearby, thus suggesting the existence of allelic heterogeneity. The geographic origins and relationships of apple cultivars accounted for large parts of the phenotypic variation. Variation in genotypic frequency of the SNPs associated with the two traits was connected to the geographic origin of the genotypes (grouped as North+East, West and South Europe), and indicated differential selection in different growing environments. Genes encoding transcription factors containing either NAC or MADS domains were identified as major candidates within the small confidence intervals computed for the associated genomic regions. A strong microsynteny between apple and peach was revealed in all the four confidence interval regions. This study shows how association genetics can unravel the genetic control of important horticultural traits in apple, as well as reduce the confidence intervals of the associated regions identified by linkage mapping approaches. Our findings can be used for the improvement of apple through marker-assisted breeding strategies that take advantage of the accumulating additive effects of the identified SNPs.Entities:
Keywords: GWAS; Malus × domestica Borkh.; SNP; adaptive traits; association genetics; germplasm collection; microsynteny; quantitative trait loci
Year: 2017 PMID: 29176988 PMCID: PMC5686452 DOI: 10.3389/fpls.2017.01923
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Averages and ranges for the genotypic means for flowering and ripening periods.
| Whole population | 29 (1985–2014) | 5.45 | 0.82 | 4.79 | 1.15 | 1.73–9.24 |
| INRA | 4 (2009–2012) | 3.00 | 0.88 | 5.58 | 1.45 | 2.56–9.52 |
| UNIBO | 6 (1987–1992) | 7.60 | 0.84 | 5.26 | 0.99 | 2.57–7.81 |
| CRA-W | 19 (1985–2007) | 4.90 | 0.88 | 3.97 | 1.03 | 1.25–7.25 |
| RBIPH | 13 (1995–2010) | 5.00 | 0.85 | 4.42 | 0.83 | 3.03–8.61 |
| NFC | 1 | – | – | 4.91 | 0.92 | 2.00–9.00 |
| SLU | 3 (2012–2014) | 3.00 | 0.81 | 3.66 | 0.85 | 1.99–6.56 |
| Whole population | 22 (1987–2014) | 5.37 | 0.95 | 5.43 | 2.05 | 0.54–9.95 |
| INRA | 10 (2002–2014) | 4.86 | 0.95 | 6.89 | 1.77 | 1.62–9.26 |
| UNIBO | 13 (1987–2014) | 7.83 | 0.96 | 6.51 | 1.87 | 0.98–9.19 |
| CRA-W | 10 (1987–2008) | 4.37 | 0.87 | 4.90 | 1.15 | 1.12–8.38 |
| RBIPH | 5 (2006–2010) | 5.07 | 0.92 | 5.00 | 1.56 | 1.00–7.60 |
| NFC | 3 (1999–2013) | 2.93 | 0.87 | 5.94 | 1.77 | 2.00–8.33 |
| SLU | 3 (2012–2014) | 2.91 | 0.98 | 3.75 | 1.44 | 1.00–7.00 |
A single average value was available at NFC, assessed over 10 years (different years according to the cultivars).
Figure 1(A) Distribution of the genotypes according to ranges of genotypic means on flowering period at two different levels: (A1) Whole population; (A2) Geographic groups. The three geographic groups are depicted using the following color codes: Blue = North+East group; Green = West group; Red = South group. (B) Distribution of the genotypes according to ranges of genotypic means on ripening period at two different levels: (B1) Whole population; (B2) Geographic groups. The three geographic groups are depicted using the following color codes: Blue = North+East group; Green = West group; Red = South group.
Figure 2Scatter plot of the first two dimensions of the Principal Component Analysis (PCA) performed on the 1,168 apple genotypes based on 275,223 SNPs. The geographic groups are depicted using the following color codes: Blue = North+East group; Green = West group; Red = South group; Black = Other.
Figure 3LD decay according to the physical distance between SNPs. Both the usual r2 and the r2 after correcting for relatedness and population structure () are given.
Summary of trait associations at the optimal models according to the EBIC criterion.
| Whole population | 1,126 | 0.78 | 0.75 | 50 | 2 | 0.09 |
| INRA | 251 | 0.93 | 0.90 | 29 | 1 | 0.13 |
| UNIBO | 166 | 0.74 | 0.74 | 0 | 0 | 0.00 |
| CRA-W | 221 | 0.72 | 0.72 | 0 | 0 | 0.00 |
| RBIPH | 177 | 0.79 | 0.58 | 1 | 2 | 0.27 |
| NFC | 288 | 0.78 | 0.77 | 2 | 4 | 0.33 |
| SLU | 159 | 0.80 | 0.80 | 0 | 0 | 0.00 |
| Whole population | 1,149 | 0.84 | 0.85 | 82 | 6 | 0.17 |
| INRA | 260 | 0.84 | 0.84 | 12 | 1 | 0.13 |
| UNIBO | 178 | 0.88 | 0.86 | 2 | 1 | 0.16 |
| CRA-W | 217 | 0.70 | 0.65 | 1 | 1 | 0.12 |
| RBIPH | 176 | 0.80 | 0.78 | 2 | 2 | 0.18 |
| NFC | 293 | 0.97 | 0.92 | 38 | 1 | 0.22 |
| SLU | 160 | 0.94 | 0.89 | 8 | 4 | 0.28 |
Part of Variance Explained (PVE) by population structure, cofactors and kinship, the number of associations without cofactors, the number of significant cofactors, the PVE by cofactors, and the ratio PVE by kinship/PVE by cofactors and kinship are showed. Data obtained for individual collections and the whole populations are provided.
Summary of associations identified by Multi-Locus Mixed Model (MLMM) at the optimal models according to the EBIC criterion for flowering and ripening periods in the whole population and in the six individual collections.
| Whole population | FB_AFFY_0496090 | SNP.9-1 | 1 | 9 | 530,386 | 1.33E-08 | 0.11 | |
| Whole population | FB_AFFY_0495650 | SNP.9-2 | 2 | 9 | 557,419 | 6.81E-08 | 0.13 | |
| INRA | FB_AFFY_0495650 | SNP.9-2 | 1 | 9 | 557,419 | 1.06E-12 | 0.18 | |
| RBIPH | FB_AFFY_6830175 | SNP.4-1 | 1 | 4 | 968,334 | C/ | 3.16E-09 | 0.01 |
| RBIPH | FB_AFFY_1629518 | SNP.9-3 | 2 | 9 | 925,476 | A/ | 8.01E-08 | 0.14 |
| NFC | FB_AFFY_6873601 | SNP.4-2 | 4 | 4 | 7,719,622 | A/ | 3.94E-07 | 0.24 |
| NFC | FB_AFFY_7355751 | SNP.9-4 | 2 | 9 | 1,938,744 | 3.15E-08 | 0.11 | |
| NFC | FB_AFFY_2782466 | SNP.11-1 | 1 | 11 | 12,422,656 | A/ | 8.30E-09 | 0.02 |
| NFC | FB_AFFY_9818101 | SNP.12-1 | 3 | 12 | 14,536,815 | C/ | 5.35E-08 | 0.15 |
| Whole population | FB_AFFY_6730867 | SNP.3-3 | 4 | 3 | 30,430,113 | 6.76E-15 | 0.10 | |
| Whole population | FB_AFFY_7541229 | SNP.3-4 | 5 | 3 | 30,465,002 | C/ | 8.51E-10 | 0.09 |
| Whole population | FB_AFFY_4981462 | SNP.3-6 | 2 | 3 | 30,700,183 | C/ | 4.39E-19 | 0.18 |
| Whole population | FB_AFFY_1209620 | SNP.3-7 | 1 | 3 | 30,726,252 | A/ | 1.28E-13 | 0.41 |
| Whole population | FB_AFFY_3795860 | SNP.10-1 | 6 | 10 | 38,390,484 | 1.76E-08 | 0.23 | |
| Whole population | FB_AFFY_6370928 | SNP.16-1 | 3 | 16 | 9,146,297 | C/ | 5.16E-12 | 0.14 |
| INRA | FB_AFFY_1253936 | SNP.3-5 | 1 | 3 | 30,590,166 | A/ | 3.03E-14 | 0.08 |
| UNIBO | FB_AFFY_1253936 | SNP.3-5 | 1 | 3 | 30,590,166 | A/ | 6.75E-09 | 0.06 |
| CRA-W | FB_AFFY_4741632 | SNP.3-2 | 1 | 3 | 30,318,639 | 6.71E-08 | 0.11 | |
| RBIPH | FB_AFFY_4981462 | SNP.3-6 | 1 | 3 | 30,700,183 | C/ | 3.60E-10 | 0.20 |
| RBIPH | FB_AFFY_4836781 | SNP.15-1 | 2 | 15 | 10,377,731 | 3.18E-08 | 0.37 | |
| NFC | FB_AFFY_4981462 | SNP.3-6 | 1 | 3 | 30,700,183 | C/ | 1.37E-18 | 0.14 |
| SLU | FB_AFFY_0899559 | SNP.3-1 | 4 | 3 | 24,220,838 | 7.21E-07 | 0.10 | |
| SLU | FB_AFFY_1209620 | SNP.3-7 | 1 | 3 | 30,726,252 | A/ | 7.51E-15 | 0.33 |
| SLU | FB_AFFY_6239519 | SNP.13-1 | 3 | 13 | 1,889,560 | 1.41E-07 | 0.24 | |
| SLU | FB_AFFY_3879540 | SNP.16-2 | 2 | 16 | 10,298,660 | 2.34E-07 | 0.27 | |
Order of inclusion of the SNPs at the optimal model in MLMM according to the EBIC criterion.
The allele associated with an early flowering period is highlighted in bold. The alternative allele is thus associated with a late flowering period.
The allele associated with an early ripening period is highlighted in bold. The alternative allele is thus associated with a late ripening period.
The allele found in SLU with the lowest frequency was the opposite to the one that appeared in the lowest frequency in the whole population and the other five individual collections.
Figure 4Partition of variance at the optimal models according to EBIC for the whole population and the six individual collections for flowering period (A) and ripening period (B). Gray: part of variance explained by structure; Blue: part of variance explained by SNPs retained as cofactors; Green: part of variance explained by kinship; Red: residual variance.
(A) Pairwise LD between the two SNPs associated with flowering period in the whole population (A1) and in the three geographic groups: North+East (A2), West (A3), and South (A4). (B) Pairwise LD between the four SNPs associated with ripening period on chromosome 3 in the whole population (B1) and the three geographic groups: North+East (B2), West (B3), and South (B4).
| SNPs as cofactors | SNP.9-1 | SNP.9-2 | MAF | SNPs as cofactors | SNP.3-3 | SNP.3-4 | SNP.3-6 | SNP.3-7 | MAF |
| SNP.9-1 | 1.00 | 0.12 | 0.11 | SNP.3-3 | 1.00 | 0.55 | 0.31 | 0.02 | 0.10 |
| SNP.9-2 | 0.27 | 1.00 | 0.13 | SNP.3-4 | 0.71 | 1.00 | 0.27 | 0.06 | 0.09 |
| SNP.3-6 | 0.56 | 0.54 | 1.00 | 0.22 | 0.18 | ||||
| SNP.3-7 | 0.11 | 0.06 | 0.22 | 1.00 | 0.41 | ||||
| SNPs as cofactors | SNP.9-1 | SNP.9-2 | MAF | ||||||
| SNP.9-1 | 1.00 | 2.0E-03 | 0.09 | ||||||
| SNP.9-2 | 4.2E-04 | 1.00 | 0.11 | SNPs as cofactors | SNP.3-3 | SNP.3-4 | SNP.3-6 | SNP.3-7 | MAF |
| SNP.3-3 | 1.00 | 0.79 | 0.36 | 0.09 | 0.28 | ||||
| SNP.3-4 | 0.83 | 1.00 | 0.32 | 0.11 | 0.27 | ||||
| SNPs as cofactors | SNP.9-1 | SNP.9-2 | MAF | SNP.3-6 | 0.48 | 0.45 | 1.00 | 0.33 | 0.45 |
| SNP.9-1 | 1.00 | 0.22 | 0.13 | SNP.3-7 | 0.16 | 0.21 | 0.28 | 1.00 | 0.33 (0.67) |
| SNP.9-2 | 0.41 | 1.00 | 0.14 | ||||||
| SNPs as cofactors | SNP.3-3 | SNP.3-4 | SNP.3-6 | SNP.3-7 | MAF | ||||
| SNPs as cofactors | SNP.9-1 | SNP.9-2 | MAF | SNP.3-3 | 1.00 | 0.69 | 0.34 | 0.04 | 0.06 |
| SNP.9-1 | 1.00 | 0.07 | 0.09 | SNP.3-4 | 0.77 | 1.00 | 0.37 | 0.07 | 0.06 |
| SNP.9-2 | 0.06 | 1.00 | 0.05 | SNP.3-6 | 0.49 | 0.55 | 1.00 | 0.21 | 0.12 |
| SNP.3-7 | 0.09 | 0.12 | 0.21 | 1.00 | 0.39 | ||||
| SNPs as cofactors | SNP.3-3 | SNP.3-4 | SNP.3-6 | SNP.3-7 | MAF | ||||
| SNP.3-3 | 1.00 | 0.02 | 0.41 | 0.02 | 0.11 | ||||
| SNP.3-4 | 0.01 | 1.00 | 0.04 | 4.7E-03 | 0.01 | ||||
| SNP.3-6 | 0.43 | 0.07 | 1.00 | 0.03 | 0.13 | ||||
| SNP.3-7 | 0.03 | 0.02 | 0.04 | 1.00 | 0.17 | ||||
Values below the diagonal line refer to the usual r.
The allele found in the North-East group with the lowest frequency at the SNP.3-7 was the opposite to the one that appeared in the lowest frequency in the whole population and the other two geographic groups.
Pairwise LD between SNPs are highlighted using the following color scale: red (r.
Test of dominance and epistatic effects among the SNPs selected as cofactors in the GWAS of the whole population for flowering and ripening periods.
| Flowering period | Additive | 2 | 79.8 | 4.3E-33 | 9.1 |
| Dominance | 2 | 5.9 | 2.7E-03 | 0.7 | |
| Epistatic | 4 | 9.0 | 3.7E-07 | 2.0 | |
| Ripening period | Additive | 6 | 106.6 | 3.4E-106 | 17.4 |
| Dominance | 6 | 1.1 | 3.6E-01 | 0.2 | |
| Dominance of SNP.3-4 | 1 | 4.8 | 2.8E-02 | 0.1 | |
| Dominance of SNP.3-6 | 1 | 4.5 | 3.3E-02 | 0.1 | |
| Epistatic | 41 | 1.2 | 2.3E-01 | 1.3 |
Some combinations of SNP genotypes did not exist in the whole population, which reduced the df for all interactions between 6 SNPs.
Joint effect of the two SNPs associated with flowering period in the whole population and of the ten most frequent genetic variants defined by the four SNPs on chromosome 3 associated with ripening period in the whole population.
| Variant 1 | 760 | 0.67 | 4.65 | 4.73 | 0.95 | 1.73 | 8.87 | a | |
| Variant 2 | 126 | 0.11 | 5.83 | 5.73 | 1.32 | 2.73 | 8.88 | b | |
| Variant 3 | 121 | 0.11 | 4.43 | 4.47 | 1.13 | 1.73 | 8.24 | a | |
| Variant 4 | 89 | 0.08 | 4.49 | 4.67 | 1.00 | 2.34 | 7.37 | a | |
| Variant 5 | TT/GG | 11 | 0.01 | 8.38 | 7.82 | 1.08 | 5.73 | 9.24 | c |
| Variant 1 | GG/CC/CC/AA | 397 | 0.35 | 6.75 | 6.80 | 1.43 | 2.21 | 9.84 | a |
| Variant 2 | GG/CC/CC/A | 336 | 0.29 | 5.69 | 5.63 | 1.35 | 0.88 | 9.50 | b |
| Variant 3 | GG/CC/CC/ | 73 | 0.06 | 4.72 | 4.81 | 1.61 | 1.21 | 8.48 | c |
| Variant 4 | GG/CC/C | 61 | 0.05 | 3.75 | 3.74 | 1.73 | 0.88 | 8.77 | d |
| Variant 5 | 59 | 0.05 | 3.75 | 3.85 | 1.35 | 1.21 | 7.15 | d | |
| Variant 6 | 44 | 0.04 | 2.89 | 2.54 | 1.32 | 0.54 | 6.85 | d | |
| Variant 7 | GG/CC/C | 39 | 0.03 | 3.49 | 3.21 | 1.61 | 0.55 | 6.66 | d |
| Variant 8 | 29 | 0.03 | 7.44 | 7.67 | 1.51 | 4.82 | 9.80 | a | |
| Variant 9 | 28 | 0.02 | 2.01 | 2.09 | 0.62 | 1.16 | 4.14 | e | |
| Variant 10 | 23 | 0.02 | 2.11 | 2.21 | 1.10 | 0.54 | 4.42 | e | |
The allele associated with an early flowering period is highlighted in bold; order of SNPs is as follows: SNP.9-1/SNP.9-2.
The allele associated with an early ripening period is highlighted in bold; order of SNPs is as follows: SNP.3-3/SNP.3-4/SNP.3-6/SNP.3-7.