| Literature DB >> 27005772 |
Stéphane D Nicolas1,2, Jean-Pierre Péros1, Thierry Lacombe1, Amandine Launay1, Marie-Christine Le Paslier3, Aurélie Bérard3, Brigitte Mangin4, Sophie Valière5, Frédéric Martins5,6, Loïc Le Cunff7, Valérie Laucou1, Roberto Bacilieri1, Alexis Dereeper1,8, Philippe Chatelet1, Patrice This1, Agnès Doligez9.
Abstract
BACKGROUND: As for many crops, new high-quality grapevine varieties requiring less pesticide and adapted to climate change are needed. In perennial species, breeding is a long process which can be speeded up by gaining knowledge about quantitative trait loci linked to agronomic traits variation. However, due to the long juvenile period of these species, establishing numerous highly recombinant populations for high resolution mapping is both costly and time-consuming. Genome wide association studies in germplasm panels is an alternative method of choice, since it allows identifying the main quantitative trait loci with high resolution by exploiting past recombination events between cultivars. Such studies require adequate panel design to represent most of the available genetic and phenotypic diversity. Assessing linkage disequilibrium extent and panel power is also needed to determine the marker density required for association studies.Entities:
Keywords: Association panel; Genome-wide association studies; Haplotype; Kinship; Linkage disequilibrium; Power; SNP; SSR; Vassal collection; Vitis; sylvestris
Mesh:
Substances:
Year: 2016 PMID: 27005772 PMCID: PMC4802926 DOI: 10.1186/s12870-016-0754-z
Source DB: PubMed Journal: BMC Plant Biol ISSN: 1471-2229 Impact factor: 4.215
Fig. 1Schematic representation of the method used to design the association panel. WW: wine West, WE: wine East, TE: table East
Number of sequenced amplicons and genotyped SNPs
| Regiona | Number of sequenced amplicons | Number of final ampliconsb | Mean number of sequenced bp aligned per final amplicon | Total number of SNPs selected for genotyping | Total number of SNPs successfully genotyped |
|---|---|---|---|---|---|
| chr8:14529243..16762721 | 55 | 41 | 633 | 144 | 86 |
| chr9:3040957..5046544 | 60 | 43 | 629 | 153 | 97 |
| chr12:18728014..20687449 | 60 | 33 | 566 | 147 | 80 |
| chr17:5195037..7207967 | 55 | 48 | 650 | 150 | 109 |
| Distributed over the genome | 169 | 129 | 657 | 174 | 129 |
| Total | 399 | 294 | – | 768 | 501 |
aPosition in bp on grapevine reference sequence assembly version 12X.0 [69]. Study regions were covered by a single scaffold on chromosomes 8, 9 and 17, by two scaffolds on chromosome 12
bNumber of amplicons containing successfully genotyped SNPs
Fig. 2PCA analysis based on 20 SSRs for comparing the association panel with the whole Vassal collection. Other cultivars: the Vassal collection but the association panel
Fig. 3Distribution of five phenotypic traits in Vassal collection and the association panel. WW: wine West, WE: wine East, TE: table East. Others: the Vassal collection but the association panel
Fig. 4Genotypic LD (r 2 V) modeled as a function of physical distance according to Hill and Weir [82]. LD was modeled separately in each subgroup of the association panel and in the wild panel, for each of the four genomic regions
LD extent (r 2 V) in each of four subgroups and four genomic regions. Expected LD threshold was 0.2. WE (wine East), WW (wine West) and TE (table East) are the three subgroups of the association panel
| Study region | Linkage disequilibrium extent | Genetic size of the region (cM)a | |||
|---|---|---|---|---|---|
| WW | WE | TE | wild panel | ||
| chr 8 | 120 Kb (72) | 59 Kb (79) | 31 Kb (56) | 49 Kb (50) | 11 |
| chr 9 | 86 Kb (75) | 71 Kb (73) | 33 Kb (78) | 40 Kb (57) | 15 |
| chr 12 | 25 Kb (59) | 9 Kb (63) | 10 Kb (62) | 31 Kb (46) | 3 |
| chr 17 | 295 Kb (80) | 458 Kb (67) | 210 Kb (79) | 127 Kb (77) | 12 |
aEstimated from the composite map of Doligez et al. [40]
The number of SNPs with MAF ≥ 5 % is given in parentheses
Fig. 5Heatmaps of genotypic LD (r 2 VS) in four genomic regions in the whole association panel
Fig. 6Mean local LD (r 2 V) in a 300 Kb-sliding window along the chromosome 17 genomic region. Local LD was computed separately in each subgroup of the association panel and in the wild panel. Only mean LD values based on at least ten marker pairs are plotted. Vertical lines on the x-axis indicate SNP positions
Fig. 7Variation of power distribution within four genomic regions (at a total of 372 SNPs). We used a family wise error rate of 5 %, AIS kinship estimation and various heritability and QTL effect values
Power of association tests at markers linked to causal polymorphims according to LD extent in the association panel, heritability and effect of causal polymorphism (% of trait variance explained)
|
| Mean power at the marker linked to causal polymorphismb | |||||
|---|---|---|---|---|---|---|
|
|
|
| ||||
| effect = 10 % | effect = 25 % | effect = 10 % | effect = 25 % | effect = 10 % | effect = 25 % | |
| 1 | 0.9 | 6.0 | 3.4 | 22.4 | 22.9 | 69.6 |
| 0.1 | 0.1 | 0.6 | 0.3 | 2.2 | 2.3 | 7.0 |
| 0.15 | 0.1 | 0.9 | 0.5 | 3.4 | 3.4 | 10.4 |
| 0.2 | 0.2 | 1.2 | 0.7 | 4.5 | 4.6 | 13.9 |
| 0.25 | 0.2 | 1.5 | 0.9 | 5.6 | 5.7 | 17.4 |
| 0.3 | 0.3 | 1.8 | 1.0 | 6.7 | 6.9 | 20.9 |
| 0.35 | 0.3 | 2.1 | 1.2 | 7.8 | 8.0 | 24.4 |
| 0.4 | 0.4 | 2.4 | 1.4 | 8.9 | 9.2 | 27.8 |
| 0.45 | 0.4 | 2.7 | 1.6 | 10.1 | 10.3 | 31.3 |
aSquared correlation between the causal polymorphism and the linked marker, corrected by kinship and structure
bMean power over the 314 SNPs with MAF > 5 % in the four genomic regions, assuming a family wise error rate of 5 % and using AIS kinship estimation, calculated as the power at the causal locus multiplied by the corrected LD between the causal locus and the marker