| Literature DB >> 26111146 |
Carolina Font i Forcada1, Nnadozie Oraguzie2, Sebastian Reyes-Chin-Wo1, Maria Teresa Espiau3, Rafael Socias i Company3, Angel Fernández i Martí4.
Abstract
To design an appropriate association study, we need to understand population structure and the structure of linkage disequilibrium within and among populations as well as in different regions of the genome in an organism. In this study, we have used a total of 98 almond accessions, from five continents located and maintained at the Centro de Investigación y Tecnología Agroalimentaria de Aragón (CITA; Spain), and 40 microsatellite markers. Population structure analysis performed in 'Structure' grouped the accessions into two principal groups; the Mediterranean (Western-Europe) and the non-Mediterranean, with K = 3, being the best fit for our data. There was a strong subpopulation structure with linkage disequilibrium decaying with increasing genetic distance resulting in lower levels of linkage disequilibrium between more distant markers. A significant impact of population structure on linkage disequilibrium in the almond cultivar groups was observed. The mean r2 value for all intra-chromosomal loci pairs was 0.040, whereas, the r2 for the inter-chromosomal loci pairs was 0.036. For analysis of association between the markers and phenotypic traits, five models comprising both general linear models and mixed linear models were selected to test the marker trait associations. The mixed linear model (MLM) approach using co-ancestry values from population structure and kinship estimates (K model) as covariates identified a maximum of 16 significant associations for chemical traits and 12 for physical traits. This study reports for the first time the use of association mapping for determining marker-locus trait associations in a world-wide almond germplasm collection. It is likely that association mapping will have the most immediate and largest impact on the tier of crops such as almond with the greatest economic value.Entities:
Mesh:
Year: 2015 PMID: 26111146 PMCID: PMC4482440 DOI: 10.1371/journal.pone.0127656
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Units, minimum, maximum, mean and standard deviation values for the chemical and physical traits evaluated in the almond germplasm.
| Trait | Units | Minimum | Maximum | Mean | Standard deviation |
|---|---|---|---|---|---|
| α-Tocopherol | mg kg-1 oil | 331 | 585 | 457 | 5.4 |
| δ-Tocopherol | mg kg-1 oil | 0.1 | 3.1 | 0.9 | 0.1 |
| γ-Tocopherol | mg kg-1 oil | 5 | 57 | 17 | 2.7 |
| Oleic acid | % of total oil content | 63 | 80 | 72 | 3.7 |
| Linoleic acid | % of total oil content | 11 | 27 | 19 | 3.2 |
| Stearic acid | % of total oil content | 1.5 | 3.5 | 2.1 | 0.2 |
| Palmitic acid | % of total oil content | 5.1 | 7.2 | 6.1 | 0.5 |
| Palmitoleic acid | % of total oil content | 0.3 | 0.6 | 0.4 | 0.1 |
| Oil content | % of kernel dry weight | 51 | 67 | 60 | 3.2 |
| Protein content | % of kernel dry weight | 10 | 30 | 20 | 4.6 |
| Nut width | mm | 16 | 29 | 23 | 2.4 |
| Nut thickness | mm | 8.9 | 20 | 16.2 | 1.6 |
| Nut length | mm | 25 | 47 | 33 | 4.3 |
| Nut weight | gr | 1.6 | 9.9 | 4 | 1.4 |
| Nut T/L ratio | — | 1.6 | 3.0 | 2.1 | 0.7 |
| Kernel doubles | % | 1 | 50 | 10 | 2.8 |
| Kernel width | mm | 11 | 19 | 14 | 1.5 |
| Kernel thickness | mm | 6 | 12 | 8 | 0.9 |
| Kernel length | mm | 19 | 30 | 24 | 2.5 |
| Kernel weight | g | 0.6 | 3.5 | 1.5 | 0.5 |
| Kernel T/L ratio | — | 2 | 4.5 | 3.3 | 0.5 |
Genetic parameters of 98 almond cultivars based on 40 SSR loci.
| SSR |
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|
| BPPCT011 | 19 | 7.7 | 0.78 | 0.87 | 0.10 | 23.13 | 0.87 |
| CPPCT053 | 23 | 12.5 | 0.83 | 0.92 | 0.09 | 27.12 | 0.92 |
| EPDCU5100 | 2 | 1.3 | 0.25 | 0.22 | -0.14 | 0.37 | 0.18 |
| BPPCT001 | 15 | 4.5 | 0.39 | 0.78 | 0.50 | 19.95 | 0.79 |
| BPPCT030 | 4 | 1.3 | 0.24 | 0.22 | -0.12 | 0.44 | 0.20 |
| CPPCT044 | 18 | 4.8 | 0.70 | 0.79 | 0.11 | 20.79 | 0.69 |
| CPSCT021 | 14 | 4.8 | 0.69 | 0.79 | 0.13 | 19.45 | 0.92 |
| PceGA34 | 13 | 2.4 | 0.56 | 0.58 | 0.03 | 13.77 | 0.61 |
| BPPCT007 | 16 | 8.3 | 0.83 | 0.88 | 0.05 | 23.27 | 0.88 |
| BPPCT039 | 15 | 11.1 | 0.67 | 0.91 | 0.24 | 24.58 | 0.90 |
| CPDCT025 | 21 | 11.1 | 0.81 | 0.91 | 0.11 | 26.47 | 0.93 |
| EPDCU0532 | 9 | 4.8 | 0.78 | 0.79 | 0.01 | 17.14 | 0.79 |
| UDP96-008 | 5 | 2.2 | 0.53 | 0.55 | 0.03 | 0.97 | 0.43 |
| BPPCT010 | 16 | 7.1 | 0.41 | 0.86 | 0.52 | 22.42 | 0.93 |
| CPDCT045 | 17 | 10.0 | 0.90 | 0.90 | -0.02 | 24.89 | 0.92 |
| CPPCT005 | 20 | 10.0 | 0.87 | 0.90 | 0.03 | 25.27 | 0.95 |
| EPPCU6216 | 16 | 5.9 | 0.75 | 0.83 | 0.09 | 21.37 | 0.84 |
| EPPCU9168 | 12 | 4.3 | 0.75 | 0.77 | 0.02 | 19.07 | 0.77 |
| PMS40 | 18 | 5.3 | 0.75 | 0.81 | 0.07 | 20.72 | 0.81 |
| PS12e2 | 16 | 8.3 | 0.85 | 0.88 | 0.03 | 23.40 | 0.89 |
| UDP96-003 | 21 | 10.0 | 0.76 | 0.90 | 0.14 | 24.93 | 0.93 |
| UDP97-401 | 17 | 9.1 | 0.67 | 0.89 | 0.24 | 24.04 | 0.90 |
| BPPCT038 | 11 | 5.6 | 0.78 | 0.82 | 0.03 | 19.25 | 0.81 |
| CPPCT009 | 9 | 3.2 | 0.57 | 0.69 | 0.16 | 16.01 | 0.68 |
| CPPCT040 | 20 | 12.5 | 0.94 | 0.92 | -0.03 | 26.69 | 0.92 |
| CPSCT006 | 7 | 2.6 | 0.64 | 0.62 | -0.03 | 11.82 | 0.62 |
| CPSCT022 | 8 | 1.9 | 0.42 | 0.48 | 0.13 | 0.96 | 0.47 |
| PceGA25 | 18 | 6.2 | 0.75 | 0.84 | 0.11 | 21.73 | 0.84 |
| BPPCT025 | 16 | 8.3 | 0.78 | 0.88 | 0.11 | 23.07 | 0.90 |
| CPPCT008 | 7 | 4.0 | 0.71 | 0.75 | 0.04 | 15.12 | 0.72 |
| CPPCT021 | 17 | 3.1 | 0.42 | 0.68 | 0.38 | 16.61 | 0.68 |
| CPPCT047 | 19 | 6.7 | 0.71 | 0.85 | 0.17 | 23.34 | 0.90 |
| CPSCT012 | 16 | 10.0 | 0.72 | 0.90 | 0.20 | 24.60 | 0.90 |
| MA040 | 10 | 4.8 | 0.68 | 0.79 | 0.13 | 18.18 | 0.79 |
| EPDCU3392 | 10 | 5.6 | 0.48 | 0.82 | 0.42 | 19.33 | 0.82 |
| CPPCT022 | 19 | 5.3 | 0.59 | 0.81 | 0.27 | 21.23 | 0.81 |
| EPPCU7340 | 16 | 8.3 | 0.67 | 0.88 | 0.24 | 22.82 | 0.88 |
| PMS02 | 4 | 1.6 | 0.32 | 0.39 | 0.18 | 0.79 | 0.35 |
| CPPCT006 | 21 | 11.1 | 0.92 | 0.91 | -0.02 | 26.35 | 0.96 |
| CPSCT018 | 2 | 1.7 | 0.55 | 0.41 | -0.38 | 0.59 | 0.40 |
| Mean | 13.9 | 4.2 | 0.66 | 0.76 | 0.11 | 18.20 | 0.76 |
A observed number of alleles per locus, A effective number of alleles per locus, H observed heterozygosity, H expected heterozygosity, F Wright’s fixation index, I Shannon’s information index, PD power of discrimination
Fig 1A phenogram based on Neighbor-joining analysis showing the genetic relationships of 98 CITA almond collection using 40 SSR markers.
Local accessions from Spain are shown in a blue branch line; cultivars from other Mediterranean countries are shown in a red branch line while cultivars from other regions are shown in a green branch line.
Fig 2The log likelihood for each K, Ln P (D) = L (K) probability [21].
Fig 3Grouping of 98 almond accessions genotyped at 40 SSR loci based on STRUCTURE analysis.
Green and blue represent individuals within the subpopulations. Any blue or green bar that is not completely filled indicates admixture.
LD based on r2, averaged for map distance classes and germplasm groups based on population structure analysis in the STRUCTURE.
| Genomic región (cM) | N | All accessions (r2) | Meditteranean accessions (r2) | Non-Mediterranean accessions (r2) |
|---|---|---|---|---|
| 0–10 | 20 | 0.034 | 0.061 | 0.058 |
| 10–20 | 24 | 0.079 | 0.087 | 0.079 |
| 20–30 | 21 | 0.036 | 0.045 | 0.039 |
| >30 | 23 | 0.028 | 0.027 | 0.032 |
| Intra-chromosomal | 88 | 0.040 | 0.091 | 0.073 |
| Inter-chromosomal | 692 | 0.036 | 0.082 | 0.062 |
* Number of marker pairs included in each class. The analysis of linkage disequilibrium (LD) revealed a high level of LD up to 20 cM, with LD decaying after 20 cM.
Fig 4Linkage disequilibrium plot based on 40 SSR markers screened on 98 almond accessions.
Statistical significance of the p values and associations observed between markers and chemical traits in 98 almond cultivars.
| SSR | LG | %var (r2) (
| α-T | δ-T | γ -T | Oleic | Linoleic | Stearic | Palmitic | Oil | Protein |
|---|---|---|---|---|---|---|---|---|---|---|---|
| BPPCT011 | 1 | 70.1 |
(
|
(
|
(
|
(
| |||||
| CPDCT025 | 3 | 92.9 |
|
|
|
|
|
|
| ||
| UDP96-003 | 4 | 66.7 |
(
|
|
|
| |||||
| EPDCU3392 | 7 | 66.4 |
(
|
For multiple testing of genotypes, Bonferroni correction [29] was applied. The p values for associations are considered when at least one allele is associated with the SSR.
*p<0.00001
**p = 0.00001–0.0001
***p = 0.0001–0.0012, (K-model)
(a)Associations observed in the same regions where QTLs had previously been identified [3, 30]
(b)Percentage of the phenotypic variation (r2) explained by each marker
Statistical significance of the p values and associations observed between markers and nut and kernel physical traits of almond.
| Nut | Kernel | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| SSR | LG | %var (r2) (
| W | T | L | Weight | T/L | T | L | Weight | T/L | Size |
| BPPCT011 | 1 | 66.2 |
(
|
(
| ||||||||
| EPDCT0532 | 3 | 52.2 |
|
|
|
| ||||||
| CPSCT006 | 5 | 55.4 |
(
|
(
|
(
| |||||||
| CPPCT021 | 6 | 61.8 |
(
|
(
|
(
| |||||||
For multiple testing of genotypes, Bonferroni correction [29] was applied. The p values for associations are considered when at least one allele is associated with the SSR. Abbreviations: W, width; T, thickness, L, length
*p<0.00001
**p = 0.00001–0.0001
***p = 0.0001–0.0012, (K-model)
(a)Associations observed in the same regions where QTLs had previously been identified [3, 30]
(b) Percentage of the phenotypic variation (r2) explained by each marker
Fig 5Comparison of different genome wide association study (GWAS) models.
Cumulative distribution of P-values was computed from the DNA markers and phenotypes for the different association models.
Fig 6Principal component analysis (PCA) plot showing the grouping of the almond accessions.