| Literature DB >> 31015545 |
Mohammad Mehdi Arab1, Annarita Marrano2, Rostam Abdollahi-Arpanahi3, Charles A Leslie2, Hossein Askari4, David B Neale2, Kourosh Vahdati5.
Abstract
Persian plateau (including Iran) is considered as one of the primary centers of origin of walnut. Sampling walnut trees originating from this arena and exploiting the capabilities of next-generation sequencing (NGS) can provide new insights into the degree of genetic variation across the walnut genome. The present study aimed to explore the population structure and genomic variation of an Iranian collection of Persian walnut (Juglans regia L.) and identify loci underlying the variation in nut and kernel related traits using the new Axiom J. regia 700K SNP genotyping array. We genotyped a diversity panel including 95 walnut genotypes from eight Iranian provinces with a variety of climate zones. A majority of the SNPs (323,273, 53.03%) fell into the "Poly High Resolution" class of polymorphisms, which includes the highest quality variants. Genetic structure assessment, using several approaches, divided the Iranian walnut panel into four principal clusters, reflecting their geographic partitioning. We observed high genetic variation across all of the populations (HO = 0.34 and HE = 0.38). The overall level of genetic differentiation among populations was moderate (FST = 0.07). However, the Semnan population showed high divergence from the other Iranian populations (on average FST = 0.12), most likely due to its geographical isolation. Based on parentage analysis, the level of relatedness was very low among the Iranian walnuts examined, reflecting the geographical distance between the Iranian provinces considered in our study. Finally, we performed a genome-wide association study (GWAS), identifying 55 SNPs significantly associated with nut and kernel-related traits. In conclusion, by applying the novel Axiom J. regia 700K SNP array we uncovered new unexplored genetic diversity and identified significant marker-trait associations for nut-related traits in Persian walnut that will be useful for future breeding programs in Iran and other countries.Entities:
Mesh:
Year: 2019 PMID: 31015545 PMCID: PMC6478883 DOI: 10.1038/s41598-019-42940-1
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Geographical and ecological data of the walnut populations studied.
| Country | Province | Region | Sample size | Altitude (m) | Longitude(E) | Latitude (N) | Annual rainfall (mm) | Annual avg. temp. (C) |
|---|---|---|---|---|---|---|---|---|
| Iran | Kerman | Baft-Gugher | 13 | 2763 | 56°23′ | 29°31′ | 247.55 | 15.33 |
| Iran | Kerman | Rabor | 16 | 2730 | 56°57′ | 29°24′ | 267 | 15 |
| Iran | Kerman | Rabor-Hanza | 6 | 2850 | 57°12′ | 29°19′ | 267 | 15 |
| Iran | Kerman | Bardsir | 6 | 2823 | 56°29′ | 29°37′ | 72.5 | 14.6 |
| Iran | Fars | Eqlid | 15 | 2167 | 52°47′ | 30°54′ | 305.35 | 12.98 |
| Iran | Fars | Bavanat | 5 | 2407 | 53°31′ | 30°26′ | 209.5 | 15.85 |
| Iran | Semnan | Shahmirzad | 9 | 1999 | 53°33′ | 35°55′ | 206.24 | 12.91 |
| Iran | Ilam | Ilam | 6 | 1387 | 46°30′ | 33°40′ | 560.54 | 16.93 |
| Iran | Ilam | Eyvan | 6 | 1140 | 46°15′ | 33°52′ | 691 | 17.1 |
| Iran | Yazd | Taft | 6 | 2450 | 53°48′ | 31°45′ | 57.05 | 19.48 |
| Iran | Markazi | Delijan-Jasb | 4 | 1998 | 50°48′ | 34°06′ | 171.66 | 17.59 |
| Iran | West Azerbaijan | Khoy | 2 | 1215 | 44°58′ | 38°35′ | 285 | 12.57 |
| Iran | Hamadan | Nahavand | 1 | 1644 | 48°25′ | 34°15′ | 375 | 14.58 |
|
| ||||||||
| USA | California | Davis | 1 | 9 | 121°28′ W | 38°33′ | 508 | 16.1 |
Fruit traits utilized in the studied walnut genotypes.
| No. | Trait | Abbr. | Unit | Min. | Max. | Mean | SD | CV (%) |
|---|---|---|---|---|---|---|---|---|
| 1 | Nut length | NuLe | mm | 27.69 | 50.38 | 38.68 | 5.29 | 13.67 |
| 2 | Nut width | NuWi | mm | 26.008 | 40.71 | 32.24 | 2.68 | 8.30 |
| 3 | Nut thickness | NuTh | mm | 25.54 | 40.48 | 31.68 | 2.77 | 8.75 |
| 4 | Nut weight | NuWe | g | 7.71 | 20.11 | 12.99 | 2.39 | 18.35 |
| 5 | Kernel percentage | KePe | % | 38.31 | 67.36 | 50.46 | 5.23 | 10.36 |
| 6 | Shape index | ShIn | 91.73 | 161.84 | 121.34 | 14.87 | 12.25 | |
| 7 | Size index | SiIn | 27.32 | 41.91 | 34.20 | 3.06 | 8.94 | |
| 8 | Round index | RoIn | 0.62 | 1.09 | 0.84 | 0.10 | 12.07 | |
| 9 | Nut shape | NuSh | Code (1–9) | 1.33 | 7.92 | 4.65 | 1.94 | 41.81 |
| 10 | Shell thickness | SheTh | mm | 0.98 | 2.63 | 1.70 | 0.34 | 19.84 |
| 11 | Shell color | SheCo | Code (1–9) | 1.17 | 8.25 | 4.19 | 1.52 | 36.30 |
| 12 | Shell texture | SheTe | Code (1–9) | 1.08 | 8.42 | 4.93 | 1.81 | 36.58 |
| 13 | Shell seal | SheSe | Code (1–9) | 1.67 | 7.67 | 5.32 | 1.28 | 24.11 |
| 14 | Shell strength | SheSt | Code (1–9) | 1.33 | 8.08 | 4.93 | 1.30 | 26.44 |
| 15 | Packing tissue thickness | PaTiTh | Code (1–7) | 1.08 | 5.92 | 2.38 | 1.01 | 42.56 |
| 16 | Kernel weight | KeWe | g | 3.83 | 8.97 | 6.50 | 1.04 | 16.09 |
| 17 | Kernel color | KeCo | Code (1–9) | 1 | 8.17 | 3.60 | 1.41 | 39.34 |
| 18 | Kernel plumpnes | KePl | Code (1–7) | 2.25 | 6.42 | 4.03 | 1.09 | 27.04 |
| 19 | Kernel shrivel | KeSh | Code (1–7) | 1.17 | 6.33 | 2.73 | 1.00 | 36.61 |
| 20 | Kernel vein | KeVe | Code (1–7) | 1.08 | 6.42 | 3.35 | 1.35 | 40.23 |
| 21 | Kernel filled | KeFi | Code (1–7) | 2.75 | 6.42 | 4.20 | 0.98 | 23.32 |
| 22 | Ease of kernel removal from nuts | EKeNu | Code (1–9) | 1 | 7.67 | 2.75 | 1.29 | 46.88 |
aSD is an abbreviation of standard deviation, which was calculated based on the measured values of twelve seeds.
bCV is an abbreviation of coefficient of variation, which was estimated as the ratio of the standard deviation to the mean of all genotypes.
Correlations among the fruits traits in the studied genotypes of walnut.
| Character | NuLe | NuWi | NuTh | NuWe | KePe | SiIn | RoIn | SheTh | SheSe | KeWe | KeCo | KePl | KeVe | KeFi |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| NuLe | 1.00 | |||||||||||||
| NuWi | 0.54** | 1.00 | ||||||||||||
| NuTh | 0.41** | 0.94** | 1.00 | |||||||||||
| NuWe | 0.72** | 0.65** | 0.56** | 1.00 | ||||||||||
| KePe | −0.56** | −0.42** | −0.27** | −0.46** | 1.00 | |||||||||
| SiIn | 0.85** | 0.89** | 0.81** | 0.77** | −0.53** | 1.00 | ||||||||
| RoIn | −0.80** | 0.04 ns | 0.21* | −0.38** | 0.42** | −0.39** | 1.00 | |||||||
| SheTh | 0.11 ns | 0.20* | 0.22* | 0.30** | 0.12 ns | 0.19 ns | 0.05 ns | 1.00 | ||||||
| SheSe | 0.29** | 0.16 ns | 0.09 ns | 0.25* | −0.14 ns | 0.24* | −0.26* | 0.08 ns | 1.00 | |||||
| KeWe | 0.45** | 0.46** | 0.47** | 0.83** | 0.1 ns | 0.54** | −0.18 ns | 0.40** | 0.20* | 1.00 | ||||
| KeCo | −0.06 ns | −0.005 ns | 0.03 ns | −0.06 ns | 0.02 ns | −0.03 ns | 0.07 ns | 0.19 ns | −0.07 ns | −0.05 ns | 1.00 | |||
| KePl | −0.18 ns | −0.24* | −0.18 ns | −0.15 ns | 0.52** | −0.23* | 0.07 ns | −0.03 ns | −0.006 ns | 0.16 ns | −0.12 ns | 1.00 | ||
| KeVe | 0.29** | 0.13 ns | 0.01 ns | −0.02 ns | −0.29** | 0.21* | −0.32** | 0.02 ns | 0.18* | −0.20* | 0.33** | −0.36** | 1.00 | |
| KeFi | −0.34** | −0.25* | −0.20* | −0.25* | 0.54** | −0.33** | 0.23* | −0.08 ns | −0.015 ns | 0.06 ns | −0.25* | 0.66** | −0.45** | 1.00 |
**Correlation is significant at the 0.01 level.
*Correlation is significant at the 0.05 level.
nsCorrelation is no significant.
Traits abbreviations are explained in detail in the material and method section.
Summary of SNP data generated in walnut populations using Axiom J. regia 700K SNP array.
| Category | Number of Markers | % of Markers |
|---|---|---|
| PolyHighResolution | 323,273 | 53.03% |
| NoMinorHom | 78,476 | 12.87% |
| MonoHighResolution | 43,904 | 7.20% |
| CallRateBelowThreshold | 37,869 | 6.21% |
| OffTargetVariant | 33,468 | 5.49% |
| Other | 67,721 | 11.11% |
| AAvarianceX | 2,587 | 0.42% |
| AAvarianceY | 2,944 | 0.48% |
| ABvarianceX | 4,471 | 0.73% |
| ABvarianceY | 7,024 | 1.15% |
| BBvarianceX | 3,178 | 0.52% |
| BBvarianceY | 4,727 | 0.78% |
| HomHomResolution | 16 | 0.00% |
| Total | 609,658 | 100.00% |
The SNPs classes are explained in detail in the results section.
Figure 1(a) Relationships among Persian walnut populations samples as represented by principal component analysis (PCA) using 33,336 genome-wide SNPs profiles (PRO_ID: different provinces of Iran); (b) Geographical distribution of the studied walnut samples across Iran; and (c) Admixture proportions of 95 Iranian walnut accessions as assigned using fastSTRUCTURE and the admixture option for K = 2 to K = 5. Each vertical bar exemplifies a sample (95 Iranian walnut genotypes sampled in eight provinces of Iran).
Figure 2Manhattan plots (left) and quantile-quantile plots (right) of association analysis using the Q + K model for traits including; (a) Shape Index (ShIn); (b) Round index (RoIn) and (c) principal component analysis (PC). The y-axis of Manhattan plots shows the −log10 (P values) of SNP association. Each dot represents a SNP. The horizontal dashed green line represents the Bonferroni-corrected significance threshold. The threshold value was calculated by Bonferroni correction based on the tested number of SNP markers (P < 0.05/99449).
Basic descriptive population genetic parameters for each Iranian walnut population across all loci (313,657 SNPs).
| Province | Sample Size | Ar | Ho | He | UHE | FIS |
|---|---|---|---|---|---|---|
| Kerman | 41 | 1.61 | 0.33 | 0.34 | 0.34 | 0.03 |
| Fars | 20 | 1.58 | 0.31 | 0.32 | 0.33 | 0.03 |
| Ilam | 12 | 1.62 | 0.34 | 0.34 | 0.35 | 0.00 |
| Semnan | 9 | 1.51 | 0.30 | 0.29 | 0.31 | −0.04 |
| Yazd | 6 | 1.54 | 0.32 | 0.30 | 0.32 | −0.09 |
| Markazi | 4 | 1.51 | 0.30 | 0.29 | 0.33 | −0.04 |
| West Azerbaijan | 2 | 1.44 | 0.33 | 0.26 | 0.35 | −0.29 |
The parameters calculated are; N = Number of individuals per population sample genotyped per locus, Ar = Allelic richness per population, Ho = observed heterozygosity per locus, He = expected heterozygosity per locus, UHE = unbiased expected heterozygosity, FIS = inbreeding coefficient per population.
Pairwise values of Wright’s fixation index (FST) between populations of Iranian walnut across all loci (313,657 SNPs).
| Kerman | Fars | Ilam | Semnan | Yazd | Markazi | West Azerbaijan | |
|---|---|---|---|---|---|---|---|
| Kerman | — | ||||||
| Fars | 0.06 | — | |||||
| Ilam | 0.05 | 0.06 | — | ||||
| Semnan | 0.10 | 0.12 | 0.07 | — | |||
| Yazd | 0.07 | 0.04 | 0.07 | 0.14 | — | ||
| Markazi | 0.06 | 0.05 | 0.03 | 0.09 | 0.07 | — | |
| West Azerbaijan | 0.04 | 0.00 | 0.02 | 0.11 | 0.05 | 0.01 | — |
Parentage analysis and relationship categories assignment (RCA) for Iranian walnut genotyps obtained by SNP allelic profiles using PLINK.
| FID1 | IID1 | FID2 | IID2 | Z0a | Z1b | Z2c | PI_HATd |
|---|---|---|---|---|---|---|---|
|
| |||||||
| Kerman | KBG9 | Kerman | KBG12 | 0 | 0.0045 | 0.9955 | 0.9977 |
|
| |||||||
| Semnan | SeSh4 | Semnan | SeSh7 | 0.1243 | 0.6722 | 0.2036 | 0.5397 |
| Fars | FaEq13 | Fars | FaEq15 | 0.3379 | 0.2866 | 0.3756 | 0.5188 |
| Kerman | KBG3 | Kerman | KBG6 | 0.1294 | 0.7442 | 0.1264 | 0.4985 |
| Yazd | YT3 | Yazd | YT4 | 0.1298 | 0.7485 | 0.1217 | 0.496 |
| Kerman | KBG12 | Kerman | KBG13 | 0.1344 | 0.7559 | 0.1097 | 0.4876 |
| Kerman | KBG9 | Kerman | KBG13 | 0.1336 | 0.7582 | 0.1082 | 0.4873 |
| Kerman | KR10 | Kerman | KB6 | 0.1407 | 0.748 | 0.1112 | 0.4853 |
| Kerman | KR11 | Kerman | KBG11 | 0.1327 | 0.78 | 0.0873 | 0.4773 |
| Kerman | KR9 | Kerman | KB5 | 0.1414 | 0.7736 | 0.085 | 0.4718 |
|
| |||||||
| Ilam | IlEy3 | USA | Chandler | 0.2026 | 0.7449 | 0.0525 | 0.425 |
| Ilam | IlEy2 | USA | Chandler | 0.1966 | 0.7573 | 0.046 | 0.4247 |
| Ilam | IlIl2 | Ilam | IlIl3 | 0.5151 | 0.2654 | 0.2195 | 0.3522 |
| Fars | FaEq3 | Fars | FaEq7 | 0.512 | 0.3239 | 0.1641 | 0.326 |
| Fars | FaEq6 | Fars | FaEq14 | 0.5607 | 0.2846 | 0.1546 | 0.2969 |
| Fars | FaEq5 | Fars | FaEq7 | 0.5596 | 0.2999 | 0.1405 | 0.2905 |
| Kerman | KBG2 | Kerman | KBG8 | 0.5493 | 0.3593 | 0.0915 | 0.2711 |
| Fars | FaEq3 | Fars | FaEq5 | 0.6523 | 0.2024 | 0.1453 | 0.2465 |
| Ilam | IlEy2 | Ilam | IlEy3 | 0.6236 | 0.2753 | 0.1012 | 0.2388 |
| Kerman | KBG1 | Kerman | KBG6 | 0.6501 | 0.2468 | 0.1031 | 0.2265 |
| Kerman | KBG2 | Kerman | KBG6 | 0.7058 | 0.1462 | 0.148 | 0.2211 |
FID1 = Family ID for the first sample; IID1 = Individual ID for the first sample; FID2 = Family ID for the second sample, IID2 = Individual ID for the second sample; aprobability to share zero IBD allele; bprobability to share one IBD allele; cprobability to share two IBD allele; drelatedness measure.