| Literature DB >> 34221720 |
Vipin Tomar1,2,3, Guriqbal Singh Dhillon4, Daljit Singh5, Ravi Prakash Singh6, Jesse Poland7, Arun Kumar Joshi1,3,6, Budhi Sagar Tiwari2, Uttam Kumar1,3,6.
Abstract
Genetic diversity and population structure information are crucial for enhancing traits of interest and the development of superlative varieties for commercialization. The present study elucidated the population structure and genetic diversity of 141 advanced wheat breeding lines using single nucleotide polymorphism markers. A total of 14,563 high-quality identified genotyping-by-sequencing (GBS) markers were distributed covering 13.9 GB wheat genome, with a minimum of 1,026 SNPs on the homoeologous group four and a maximum of 2,838 SNPs on group seven. The average minor allele frequency was found 0.233, although the average polymorphism information content (PIC) and heterozygosity were 0.201 and 0.015, respectively. Principal component analyses (PCA) and population structure identified two major groups (sub-populations) based on SNPs information. The results indicated a substantial gene flow/exchange with many migrants (Nm = 86.428) and a considerable genetic diversity (number of different alleles, Na = 1.977; the number of effective alleles, Ne = 1.519; and Shannon's information index, I = 0.477) within the population, illustrating a good source for wheat improvement. The average PIC of 0.201 demonstrates moderate genetic diversity of the present evaluated advanced breeding panel. Analysis of molecular variance (AMOVA) detected 1% and 99% variance between and within subgroups. It is indicative of excessive gene traffic (less genetic differentiation) among the populations. These conclusions deliver important information with the potential to contribute new beneficial alleles using genome-wide association studies (GWAS) and marker-assisted selection to enhance genetic gain in South Asian wheat breeding programs. ©2021 Tomar et al.Entities:
Keywords: SNP; Analysis of molecular variance (AMOVA); Genetic diversity; Genotyping-by-sequencing (GBS); Population structure; Wheat
Year: 2021 PMID: 34221720 PMCID: PMC8231316 DOI: 10.7717/peerj.11593
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Chromosome wise distribution of SNP markers across the wheat genomes.
| 1A | 686 | 1.145 | 593.790 | 592.645 | 0.864 | 1.158 |
| 1B | 881 | 1.299 | 688.328 | 687.028 | 0.780 | 1.282 |
| 1D | 255 | 0.079 | 493.979 | 493.900 | 1.937 | 0.516 |
| 2A | 912 | 0.718 | 771.354 | 770.635 | 0.845 | 1.183 |
| 2B | 1486 | 0.019 | 800.870 | 800.851 | 0.539 | 1.856 |
| 2D | 254 | 8.791 | 649.074 | 640.282 | 2.521 | 0.397 |
| 3A | 832 | 0.608 | 750.539 | 749.931 | 0.901 | 1.109 |
| 3B | 1027 | 0.580 | 829.535 | 828.955 | 0.807 | 1.239 |
| 3D | 115 | 24.795 | 615.062 | 590.267 | 5.133 | 0.195 |
| 4A | 574 | 2.705 | 744.515 | 741.810 | 1.292 | 0.774 |
| 4B | 378 | 1.056 | 666.052 | 664.996 | 1.759 | 0.568 |
| 4D | 74 | 3.238 | 509.798 | 506.560 | 6.845 | 0.146 |
| 5A | 799 | 1.294 | 709.755 | 708.461 | 0.887 | 1.128 |
| 5B | 1084 | 8.072 | 712.941 | 704.869 | 0.650 | 1.538 |
| 5D | 106 | 62.140 | 555.015 | 492.875 | 4.650 | 0.215 |
| 6A | 825 | 0.684 | 617.839 | 617.155 | 0.748 | 1.337 |
| 6B | 1227 | 0.196 | 720.805 | 720.609 | 0.587 | 1.703 |
| 6D | 210 | 0.070 | 473.287 | 473.217 | 2.253 | 0.444 |
| 7A | 1293 | 2.083 | 736.572 | 734.489 | 0.568 | 1.760 |
| 7B | 1295 | 1.021 | 750.125 | 749.105 | 0.578 | 1.729 |
| 7D | 250 | 2.145 | 635.422 | 633.277 | 2.533 | 0.395 |
| Genome A | 5921 | 4915.126 | 0.830 | 0.090 | ||
| Genome B | 7378 | 5156.413 | 0.699 | 0.057 | ||
| Genome D | 1264 | 3830.378 | 3.030 | 0.030 | ||
| 14563 | 13901.92 | 1.048 | 0.955 |
Notes.
# Position and coverage of SNPs is in megabases (Mb), Distances between SNPs in Mb, Density = No of SNPs per MB
Figure 1Distribution of 14,563 SNPs based on chromosomal distribution and markers characteristics.
(A) Distribution of SNP markers across 21 chromosomes of bread wheat for 141 advanced wheat lines, (B) distribution of the percentage of heterozygosity (He), (C) distribution of the minor allele frequency (MAF), and (D) distribution of the polymorphic information content (PIC).
Figure 2Population structure analysis of 141 advanced bread wheat lines.
STRUCTURE results based on 14,563 GBS markers. (A) Sharp peak was observed at K = 2 with a maximum of ΔK determined by the Evanno method showing the stratification of the population into the minimum number of possible subgroups in the advanced wheat breeding lines. (B) Scatter plot of three principal components (PCs) of the SNP data with lines colored according to the two subgroups. (C) Structure plot for 141 advanced wheat breeding lines was stratified into two distinct clusters, where each color represents one subgroup representing (G1) and (G2).
Figure 3Dendrogram demonstrating the genetic relationships among 141 advanced bread wheat lines based on 14,563 GBS markers.
Dendrogram showing the relationship among advanced breeding lines. Phylogenetic network constructed using complete linkage hieratical clustering based on 14,563 SNPs for all 141 advanced breeding lines. Breeding lines labels were color-coded into two subgroups.
Analysis of Molecular Variance for K = 2 for within population and among-population variation.
| Among Pops | 1 | 3094.516 | 3094.516 | 14.292 | 1% | 86.428 | |
| Within Pops | 139 | 343401.995 | 2470.518 | 2470.518 | 99% | ||
| Total | 140 | 346496.511 | 2484.810 | 100% |
Notes.
#K = K value for sub-populations, Source = source of variation, df = degree of freedom, SS = sum of squares, MS = mean sum of squares, Est. Var.= estimated variation, % = percentage of variance explained, and Nm = haploid number of migrants.
Figure 4Proportion of Allelic patterns and diversity.
(A) The allelic patterns and diversity indexes for K = 2, each subpopulation based on number of different alleles (Na), number of effective alleles (Ne), Shannon’s information index(I), private alleles (PA), and diversity index (h) and (B) diversity among and within subgroups for K = 2.
Allelic patterns and diversity indexes for K = 2, for subpopulations and mean diversity among subpopulations for each K.
| Na | 1.954 (±0.002) | 2.000 (±0.000) | 1.977 |
| Na Freq. ≥ 5% | 1.870 (±0.003) | 1.973 (±0.001) | 1.921 |
| Ne | 1.512 (±0.003) | 1.525 (±0.002) | 1.519 |
| I | 0.468 (±0.002) | 0.487 (±0.001) | 0.477 |
| h | 0.307 (±0.001) | 0.318 (±0.001) | 0.313 |
| uh | 0.320 (±0.001) | 0.321 (±0.001) | 0.320 |
| PA | 6.8E−5 (±0.000) | 0.046 (±0.002) | 0.023 |
| Lines with PA | 22 | 90 | |
| Range of PA | 3–149 | 3–188 |
Notes.
# Na = number of different alleles, Ne= number of effective alleles, I = Shannon’s information index, PA = private alleles, h = diversity index, and uh = unbiased diversity index, ± represents standard error