| Literature DB >> 35783289 |
Muhammad Massub Tehseen1, Fatma Aykut Tonk1, Muzaffer Tosun1, Deniz Istipliler1, Ahmed Amri2, Carolina P Sansaloni3, Ezgi Kurtulus4, Muhammad Salman Mubarik5, Kumarse Nazari4.
Abstract
Landraces are considered a valuable source of potential genetic diversity that could be used in the selection process in any plant breeding program. Here, we assembled a population of 600 bread wheat landraces collected from eight different countries, conserved at the ICARDA's genebank, and evaluated the genetic diversity and the population structure of the landraces using single nucleotide polymorphism (SNP) markers. A total of 11,830 high-quality SNPs distributed across the genomes A (40.5%), B (45.9%), and D (13.6%) were used for the final analysis. The population structure analysis was evaluated using the model-based method (STRUCTURE) and distance-based methods [discriminant analysis of principal components (DAPC) and principal component analysis (PCA)]. The STRUCTURE method grouped the landraces into two major clusters, with the landraces from Syria and Turkey forming two clusters with high proportions of admixture, whereas the DAPC and PCA analysis grouped the population into three subpopulations mostly according to the geographical information of the landraces, i.e., Syria, Iran, and Turkey with admixture. The analysis of molecular variance revealed that the majority of the variation was due to genetic differences within the populations as compared with between subpopulations, and it was the same for both the cluster-based and distance-based methods. Genetic distance analysis was also studied to estimate the differences between the landraces from different countries, and it was observed that the maximum genetic distance (0.389) was between the landraces from Spain and Palestine, whereas the minimum genetic distance (0.013) was observed between the landraces from Syria and Turkey. It was concluded from the study that the model-based methods (DAPC and PCA) could dissect the population structure more precisely when compared with the STRUCTURE method. The population structure and genetic diversity analysis of the bread wheat landraces presented here highlight the complex genetic architecture of the landraces native to the Fertile Crescent region. The results of this study provide useful information for the genetic improvement of hexaploid wheat and facilitate the use of landraces in wheat breeding programs.Entities:
Keywords: SNP markers; analysis of molecular variance (AMOVA); genetic diversity; population structure; wheat landraces
Year: 2022 PMID: 35783289 PMCID: PMC9240388 DOI: 10.3389/fgene.2022.900572
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.772
FIGURE 1Distribution of 11,830 SNPs across 21 chromosomes of bread wheat landraces from 8 countries.
FIGURE 2(A) Population structure of 600 bread wheat landraces for k = 2, 3, and 4. Different colors represent the subpopulations, and each bar represents the estimated membership of a single genotype. The horizontal line under the figure indicates the geographic origins of the landraces. (B) The scree plot of ΔK against the proportion of explained variance states the optimal subpopulation number in DAPC analysis. (C) The plot of K against ΔK to determine the optimum K value for STRUCTURE analysis. (D) The PCA of 600 bread wheat landraces.
The STRUCTURE results of 600 bread wheat landraces for the fixation index (Fst), average distances (expected heterozygosity/He), gene flow (Nm), and the number of genotypes assigned to each subpopulation.
| Population | Inferred clusters | Mean Fst | Exp. Het | Nm | No. of genotypes |
|---|---|---|---|---|---|
| Pop1 | 0.619 | 0.2307 | 0.31 | 0.833 | 362 |
| Pop2 | 0.381 | 0.2078 | 0.3455 | 0.953 | 238 |
FIGURE 3Inference of the subpopulations by DAPC analysis grouping landraces from different countries together.
FIGURE 4Minimum spanning network (MSN) of 600 bread wheat landraces.
FIGURE 5Neighbor-joining clustering of 600 bread wheat landraces.
Analysis of molecular variance (AMOVA) revealing genetic diversity in bread wheat landraces.
| Method | Source | df | SS | MS | Est. var | % |
|---|---|---|---|---|---|---|
| Model based (STRUCTURE) | Among Pops | 1 | 26356.26 | 26356.26 | 80.95107 | 2.5 |
| Within Pops | 598 | 1858696 | 3108.187 | 3108.187 | 97.4 | |
| Total | 599 | 1885052 | 3189.138 | 1 | ||
| Distance based (Cluster, DAPC) | Among Pops | 2 | 59508.97 | 29754.49 | 133.9296 | 4.2 |
| Within Pops | 597 | 1825543 | 3057.861 | 3057.861 | 95.8 | |
| Total | 599 | 1885052 | 3191.791 | 1 | ||
| Based on origin | Among Pops | 7 | 147767.9 | 21109.71 | 328.5178 | 5.9 |
| Within Pops | 592 | 3622336 | 6118.811 | 6118.811 | 94.1 | |
| Total | 599 | 3770104 | 6293.997 | 6447.329 | 1 |
df; degrees of freedom, SS; Sum of squares, MS; Mean square, Est. var; Estimated variance.
Nei’s genetic identity (above diagonal) and genetic distance (below diagonal).
| Model | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Model based (STRUCTURE) |
|
| |||||||
|
| 0.987 | ||||||||
|
| 0.013 | ||||||||
| Distance based (Cluster, DAPC) |
|
|
| ||||||
|
| 0.977 | 0.969 | |||||||
|
| 0.023 | 0.985 | |||||||
|
| 0.032 | 0.015 | |||||||
| Based on origin |
|
|
|
|
|
|
|
| |
|
| 0.987 | 0.948 | 0.915 | 0.908 | 0.870 | 0.818 | 0.889 | ||
|
| 0.013 | 0.954 | 0.916 | 0.920 | 0.872 | 0.819 | 0.897 | ||
|
| 0.053 | 0.047 | 0.937 | 0.912 | 0.862 | 0.855 | 0.848 | ||
|
| 0.089 | 0.087 | 0.065 | 0.878 | 0.833 | 0.804 | 0.819 | ||
|
| 0.096 | 0.083 | 0.092 | 0.130 | 0.841 | 0.803 | 0.847 | ||
|
| 0.139 | 0.137 | 0.148 | 0.182 | 0.173 | 0.782 | 0.815 | ||
|
| 0.201 | 0.200 | 0.157 | 0.219 | 0.219 | 0.246 | 0.749 | ||
|
| 0.118 | 0.109 | 0.165 | 0.199 | 0.165 | 0.205 | 0.289 |
Mean of different genetic parameters: number of different alleles (Na), number of effective alleles (Ne), Shannon’s index (I), diversity index (He), unbiased diversity index (uHe), and percentage of polymorphic loci (PPL) in each of the two subpopulations.
| Method | Pop | Na | Ne | I | He | uHe | PPL (%) |
|---|---|---|---|---|---|---|---|
| Model based (STRUCTURE) |
| 1.988 | 1.472 | 0.441 | 0.286 | 0.286 | 99.38 |
|
| 1.997 | 1.479 | 0.445 | 0.289 | 0.290 | 99.84 | |
|
| 1.993 | 1.476 | 0.443 | 0.288 | 0.288 | 99.61 | |
| Distance based (Cluster, DAPC) | |||||||
|
| 1.970 | 1.434 | 0.410 | 0.264 | 0.266 | 98.35 | |
|
| 1.987 | 1.483 | 0.449 | 0.292 | 0.293 | 99.32 | |
|
| 1.992 | 1.474 | 0.440 | 0.286 | 0.287 | 99.49 | |
|
| 1.983 | 1.464 | 0.433 | 0.281 | 0.282 | 99.05 | |
| Based on origin | |||||||
|
| 1.988 | 1.474 | 0.442 | 0.287 | 0.287 | 99.40 | |
|
| 1.987 | 1.472 | 0.440 | 0.285 | 0.287 | 99.22 | |
|
| 1.762 | 1.412 | 0.381 | 0.248 | 0.254 | 85.63 | |
|
| 1.242 | 1.320 | 0.282 | 0.188 | 0.209 | 54.02 | |
|
| 1.021 | 1.253 | 0.222 | 0.148 | 0.166 | 42.47 | |
|
| 0.614 | 1.137 | 0.117 | 0.080 | 0.107 | 19.39 | |
|
| 0.297 | 1.000 | 0.000 | 0.000 | 0.000 | 0.00 | |
|
| 0.738 | 1.177 | 0.154 | 0.104 | 0.129 | 26.79 | |
|
| 1.887 | 1.447 | 0.416 | 0.270 | 0.273 | 53.36 |
FIGURE 6Estimated population membership probability of 600 bread wheat landraces from eight countries where each bar represents a landrace.