| Literature DB >> 35903232 |
Mohsin Ali1,2, Shan Danting1,2, Jiankang Wang1, Hafsa Sadiq3, Awais Rasheed1,3, Zhonghu He1, Huihui Li1,2.
Abstract
Synthetic hexaploid wheats and their derived advanced lines were subject to empirical selection in developing genetically superior cultivars. To investigate genetic diversity, patterns of nucleotide diversity, population structure, and selection signatures during wheat breeding, we tested 422 wheat accessions, including 145 synthetic-derived wheats, 128 spring wheat cultivars, and 149 advanced breeding lines from Pakistan. A total of 18,589 high-quality GBS-SNPs were identified that were distributed across the A (40%), B (49%), and D (11%) genomes. Values of population diversity parameters were estimated across chromosomes and genomes. Genome-wide average values of genetic diversity and polymorphic information content were estimated to be 0.30 and 0.25, respectively. Neighbor-joining (NJ) tree, principal component analysis (PCA), and kinship analyses revealed that synthetic-derived wheats and advanced breeding lines were genetically diverse. The 422 accessions were not separated into distinct groups by NJ analysis and confirmed using the PCA. This conclusion was validated with both relative kinship and Rogers' genetic distance analyses. EigenGWAS analysis revealed that 32 unique genome regions had undergone selection. We found that 50% of the selected regions were located in the B-genome, 29% in the D-genome, and 21% in the A-genome. Previously known functional genes or QTL were found within the selection regions associated with phenology-related traits such as vernalization, adaptability, disease resistance, and yield-related traits. The selection signatures identified in the present investigation will be useful for understanding the targets of modern wheat breeding in Pakistan.Entities:
Keywords: EigenGWAS; bread wheat; gene annotation; genetic diversity; genotyping-by-sequencing; selection signatures
Year: 2022 PMID: 35903232 PMCID: PMC9315363 DOI: 10.3389/fpls.2022.877496
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 6.627
The summary statistics of GBS-SNPs across chromosomes and genomes.
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|
| 1A | 909 | 5% | 1145442 | 593501692 | 593.50 | 1.53 |
| 1B | 1,100 | 6% | 1430915 | 688327586 | 688.33 | 1.60 |
| 1D | 313 | 2% | 78777 | 493978993 | 493.98 | 0.63 |
| 2A | 1,186 | 6% | 626007 | 780652409 | 780.65 | 1.52 |
| 2B | 1,575 | 8% | 19097 | 800780364 | 800.78 | 1.97 |
| 2D | 317 | 2% | 2593153 | 649073688 | 649.07 | 0.49 |
| 3A | 966 | 5% | 607725 | 750500626 | 750.50 | 1.29 |
| 3B | 1,355 | 7% | 198860 | 829742765 | 829.74 | 1.63 |
| 3D | 429 | 2% | 344069 | 615061869 | 615.06 | 0.70 |
| 4A | 1,003 | 5% | 2013324 | 743825197 | 743.83 | 1.35 |
| 4B | 723 | 4% | 586028 | 673071478 | 673.07 | 1.07 |
| 4D | 128 | 1% | 1187858 | 509798252 | 509.80 | 0.25 |
| 5A | 903 | 5% | 1213423 | 709755448 | 709.76 | 1.27 |
| 5B | 1304 | 7% | 218401 | 712940770 | 712.94 | 1.83 |
| 5D | 196 | 1% | 2214591 | 564899608 | 564.90 | 0.35 |
| 6A | 944 | 5% | 684328 | 617838760 | 617.84 | 1.53 |
| 6B | 1,438 | 8% | 195536 | 720519123 | 720.52 | 2.00 |
| 6D | 226 | 1% | 70342 | 473049509 | 473.05 | 0.48 |
| 7A | 1,512 | 8% | 289461 | 736572283 | 736.57 | 2.05 |
| 7B | 1,540 | 8% | 88786 | 750602636 | 750.60 | 2.05 |
| 7D | 522 | 3% | 1570012 | 638541382 | 638.54 | 0.82 |
| A | 7,423 | 40% | 289461 | 780652409 | 4,932.65 | 1.51 |
| B | 9,035 | 49% | 19097 | 829742765 | 5,175.98 | 1.74 |
| D | 2,131 | 11% | 70342 | 649073688 | 3,944.40 | 0.53 |
| The whole genome | 18,589 | 100% | 19097 | 829742765 | 14,053.03 | 1.26 |
Figure 1Distribution of minor allele frequency (A), heterozygosity (B), genetic diversity (C), and polymorphic information content (D) across 422 accessions (WP), for 145 synthetic-derived wheats (SYN-DER), 128 Pakistan cultivars (PC), and 149 advanced lines (AL) based on 18,589 SNPs.
Transition (Ts) and transversion (Tv) SNPs identified using genotyping-by-sequencing.
|
|
|
| ||||
|---|---|---|---|---|---|---|
|
|
|
|
|
|
| |
| Number of allelic sites | 6,773 | 6,841 | 683 | 1,220 | 1,172 | 1,900 |
| Frequencies (%) | 36.44% | 36.80% | 3.67% | 6.56% | 6.30% | 10.22% |
| Total (%) | 73.24% | 26.76% | ||||
| TsTv ratio | 2.73 | |||||
Figure 2The Circos plot, from outside to inside, illustrates the patterns of nucleotide diversity (A) and Tajima'D (B) values in synthetic-derived wheats (SYN-DER), commercially released Pakistan cultivars (PC), and advanced breeding lines (AL), respectively. Nucleotide diversity and Tajima'D are plotted along reference chromosomes in sliding windows of 1,000 kb with a step size of 100 kb.
Figure 3Population structure and diversity analysis of 422 wheat accessions using 18,589 high-quality SNPs. (A) Neighbor-joining (NJ) tree, (B) principal component analysis (PCA) plot, and (C) heat map of pairwise kinship matrix with the tree shown on the top and left. The SYN-DER indicates synthetic-derived wheats; PC, commercially released Pakistan cultivars; AL, advanced breeding lines.
Results from analysis of molecular variance (AMOVA).
|
|
|
|
|
|
|---|---|---|---|---|
| Among subpopulations | 2 | 66796.64 | 99.49 | 3.41 |
| Among individuals within population | 419 | 2290148.43 | 2,647.50 | 90.74 |
| Within individuals | 422 | 72048 | 170.72 | 5.85 |
| Total | 843 | 2428993.06 | 2,917.73 |
df, degree of freedom.
“***” The source of variation was highly significant at P ≤ 0.001.
Pairwise fixation index (FST) between subpopulations SYN-DER (n = 145), PC (n = 128), and AL (n = 149).
|
|
|
| |
|---|---|---|---|
| SYN-DER | 0 |
|
|
| PC | 0.0601 | 0 |
|
| AL | 0.0492 | 0.075 | 0 |
SYN-DER, synthetic-derived wheats; PC, commercially released Pakistan cultivars; AL, advanced breeding lines.
Represents significant differences between two populations at P ≤ 0.001.
Figure 4The decay of linkage disequilibrium (LD) in the whole population (WP). Pairwise LD (r2) values plotted vs. corresponding pairwise physical distance (Mb) of GBS-SNPs. The trend line of non-linear regressions against physical distance is given by the red line. The horizontal and vertical lines represent the critical value of r2 (0.1) and LD decay values, respectively.
Top SNPs with a significance of –log10 5.0 between SYN-DER, PC, and AL accessions.
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|
| 2D_16962948 | 1 | 2D | 11.96 | 21.96 | 0.2259 | Rht8, Ppd-D1a.1, QKw + QSpl | Kernel width + Spike length | Tian et al., |
| 2D_463912950 | 1 | 2D | 458.91 | 468.91 | 0.2402 | RHT-8 | Plant height, | |
| 3D_572085282 | 1 | 3D | 567.09 | 577.09 | 0.3956 | Myb10-D1, QGy + QKw | Seed color, Grain yield + Kernel width, | Li F. et al., |
| 7A_644861968 | 1 | 7A | 639.86 | 649.86 | 0.2869 | |||
| 7D_53502441 | 1 | 7D | 48.5 | 58.5 | 0.2758 | QTkw, GS3-D1 | Thousand kernel weight, | Röder et al., |
| 1A_5100794 | 9 | 1A | 0 | 10.49 | 0.0848 | Glu-A3, QBy, QHd + Qgy+ QFll + QKw, | Gluten/End-use quality, Biological yield, grain yield + Flag leaf length + kernel width, | Liu et al., |
| 1A_45252081 | 2 | 1A | 17.66 | 50.83 | 0.1606 | QTkw, QKl + QKns, | Thousand kernel weight, Kernel length + kernel number per spike, | Kumar et al., |
| 2A_41224267 | 2 | 2A | 36.22 | 46.22 | 0.1318 | Ppd-A1 | Flowering time, QTkw, | Nishida et al., |
| 2B_37251497 | 2 | 2B | 32.25 | 42.25 | 0.1039 | |||
| 2B_38773853 | 2 | 2B | 33.77 | 43.77 | 0.117 | |||
| 2B_508214726 | 2 | 2B | 503.21 | 513.21 | 0.169 | |||
| 2B_775770973 | 2 | 2B | 770.76 | 783.47 | 0.135 | QGw | Grain weight, | Alqudah et al., |
| 2D_447834700 | 2 | 2D | 442.83 | 452.83 | 0.1191 | |||
| 2D_537115124 | 2 | 2D | 530.56 | 542.12 | 0.1295 | |||
| 3A_20558735 | 2 | 3A | 15.56 | 25.56 | 0.1513 | QGy + QBm, QFlw | Grain yield + Biomass weight, Flag leaf width, | Bhatta et al., |
| 3B_42330180 | 2 | 3B | 37.33 | 47.33 | 0.1 | |||
| 5A_564429086 | 2 | 5A | 558.36 | 569.43 | 0.3426 | QSn, QAl, QSl + QKw, | SN, Awn length, Spike length+ Kernel width, | Cuthbert et al., |
| 5B_527181268 | 2 | 5B | 515.08 | 532.18 | 0.2534 | QSpl, | Spike length, | Li F. et al., |
| 5D_441028318 | 2 | 5D | 436.03 | 446.03 | 0.1327 | QAl | Awn length | Bhatta et al., |
| 6B_4519965 | 9 | 6B | 0 | 9.52 | 0.1472 | QAl, QPh, | Awn length, Plant height, | Bhatta et al., |
| 6B_14544540 | 2 | 6B | 7.74 | 22.69 | 0.3018 | QFlw | Flag leaf width | Bhatta et al., |
| 6D_24484257 | 2 | 6D | 19.48 | 29.96 | 0.1477 | QFlw | Flag leaf width | Bhatta et al., |
| 7B_573733013 | 2 | 7B | 457.45 | 467.45 | 0.225 | |||
| 7B_579773793 | 2 | 7B | 568.73 | 578.73 | 0.1138 | |||
| 7B_581047156 | 2 | 7B | 576.05 | 586.05 | 0.1242 | |||
| 7B_601123999 | 2 | 7B | 596.12 | 610.43 | 0.148 | |||
| 7B_616469496 | 2 | 7B | 611.47 | 621.47 | 0.1593 | |||
| 7B_670141587 | 2 | 7B | 665.14 | 675.14 | 0.1037 | |||
| 7D_559268072 | 2 | 7D | 554.27 | 564.27 | 0.1835 | QSn, Qfla | SN, Flag leaf area, | Li et al., |
| 3D_50888526 | 4 | 3D | 45.63 | 52.66 | 0.1203 | |||
| 7D_47657997 | 4 | 7D | 42.66 | 52.66 | 0.1532 | Lr34, QTkw | Leaf rust resistance, Thousand kernel weight | Bhatta et al., |
| 2B_769950981 | 8 | 2B | 769.95 | 774.97 | 0.1948 | QGnfs, | Grain number per fertile spikelets, | Alqudah et al., |
| 1A_42100559 | 2 | 1A | 535.86 | 545.86 | 0.1385 | QFla, QSdm, QFsps | Flag leaf area + Stem diameter, fertile spikelet per spike | Bhatta et al., |
| 2B_77484831 | 9 | 2B | 72.48 | 82.48 | 0.1472 | QTkw, QRl | Thousand kernel weight, root length | Quarrie et al., |
| 3B_40212954 | 9 | 3B | 35.21 | 45.21 | 0.0527 | |||
| 6A_6562879 | 9 | 6A | 0 | 11.56 | 0.1172 | QGw, QPh + QKl, | Grain weight, | Li F. et al., |
| 6B_276518336 | 2 | 6B | 271.52 | 281.52 | 0.1223 | |||
| 3B_59646004 | 10 | 3B | 54.65 | 64.65 | 0.4 | QSd, | Stem diameter, | Bhatta et al., |
Associated traits were obtained around 5 Mb physical nucleotide interval of significant SNP markers of each QTL.
EV, eigenvector;
CHR, Chromosome.
Figure 5Miami plot showing loci under selection identified by EigenGWAS (upper for PGC and lower for FST) for top 10 eigenvectors (EV) based on 422 wheat accessions. PGC is the p corrected by lamdaGC in EigenGWAS. EV1-EV10 were the first 10 eigenvectors, each of which was used as phenotype for EigenGWAS analysis.
Figure 6Pie charts depicting genomic annotation for whole-genome GBS-SNPs (18,859 SNPs) (A) and significant GBS SNPs identified by EigenGWAS (83 GBS-SNPs) (B).