| Literature DB >> 35464861 |
Shabbir Hussain1, Madiha Habib1, Zaheer Ahmed2, Bushra Sadia1, Amy Bernardo3, Paul St Amand3, Guihua Bai3, Nida Ghori3, Azeem I Khan2, Faisal S Awan1, Rizwana Maqbool2.
Abstract
Spring wheat (Triticum aestivum L.) is one of the most imperative staple food crops, with an annual production of 765 million tons globally to feed ∼40% world population. Genetic diversity in available germplasm is crucial for sustainable wheat improvement to ensure global food security. A diversity panel of 184 Pakistani wheat accessions was genotyped using 123,596 high-quality single nucleotide polymorphism (SNP) markers generated by genotyping-by-sequencing with 42% of the SNPs mapped on B, 36% on A, and 22% on D sub-genomes of wheat. Chromosome 2B contains the most SNPs (9,126), whereas 4D has the least (2,660) markers. The mean polymorphic information content, genetic diversity, and major allele frequency of the population were 0.157, 0.1844, and 0.87, respectively. Analysis of molecular variance revealed a higher genetic diversity (80%) within the sub-population than among the sub-populations (20%). The genome-wide linkage disequilibrium was 0.34 Mbp for the whole wheat genome. Among the three subgenomes, A has the highest LD decay value (0.29 Mbp), followed by B (0.2 Mbp) and D (0.07 Mbp) genomes, respectively. The results of population structure, principal coordinate analysis, phylogenetic tree, and kinship analysis also divided the whole population into three clusters comprising 31, 33, and 120 accessions in group 1, group 2, and group 3, respectively. All groups were dominated by the local wheat accessions. Estimation of genetic diversity will be a baseline for the selection of breeding parents for mutations and the genome-wide association and marker-assisted selection studies.Entities:
Keywords: Pakistan; Triticum aestivum L.; genetic diversity; genotyping-by-sequencing; polymorphic information content; population structure; single nucleotide polymorphisms
Year: 2022 PMID: 35464861 PMCID: PMC9019749 DOI: 10.3389/fgene.2022.772517
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.772
FIGURE 1(A) Sub-genomic distribution of 123,596 SNPs on three sub-genomes of spring wheat. (B) Chromosomal SNPs distribution on each chromosome of three sub-genomes (A, B, D), (C) Number of SNPs within 1 Mb window size.
FIGURE 2Population structure analysis, (A) Delta K line graph, (B) structure analysis combine bar chart, (C) PCoA analysis of wheat accessions, (D) Neighbor Joining phylogenetic tree analysis.
FIGURE 3(A) Origin of wheat accessions, (B) Classification of accessions in three different groups on the basis of structure analysis and phylogenetic tree analysis (C) Distribution of different accession in different sub-groups, the red color showed local genotypes, black color indicated CIMMYT accessions, blue color for wild type and purple color for unknown origin.
Mean major allele frequency, genetic diversity, heterozygosity, and polymorphic information content calculated using 123,596 genome-wide SNPs derived from the panel of 184 wheat genotypes.
| Chromosome | Sample size | Marker | MFa | GDb | HZc | PICd |
|---|---|---|---|---|---|---|
| 1A | 184 | 5,344 | 0.8557 | 0.2118 | 0.0763 | 0.1776 |
| 2A | 184 | 7,166 | 0.8628 | 0.2055 | 0.0750 | 0.1733 |
| 3A | 184 | 6,253 | 0.8720 | 0.1956 | 0.0713 | 0.1660 |
| 4A | 184 | 6,668 | 0.8713 | 0.1949 | 0.0731 | 0.1652 |
| 5A | 184 | 5,556 | 0.8730 | 0.1938 | 0.0642 | 0.1648 |
| 6A | 184 | 5,496 | 0.8696 | 0.1966 | 0.0704 | 0.1667 |
| 7A | 184 | 7,889 | 0.8641 | 0.2025 | 0.0679 | 0.1705 |
| Means |
|
|
|
| ||
| 1B | 184 | 7,062 | 0.8730 | 0.1954 | 0.0651 | 0.1664 |
| 2B | 184 | 8,146 | 0.8592 | 0.2114 | 0.0645 | 0.1783 |
| 3B | 184 | 8,513 | 0.8710 | 0.1957 | 0.0634 | 0.1660 |
| 4B | 184 | 4,700 | 0.8927 | 0.1721 | 0.0630 | 0.1491 |
| 5B | 184 | 7,086 | 0.8621 | 0.2067 | 0.0655 | 0.1744 |
| 6B | 184 | 7,549 | 0.8628 | 0.2046 | 0.0658 | 0.1723 |
| 7B | 184 | 8,218 | 0.8751 | 0.1900 | 0.0607 | 0.1614 |
| Means |
|
|
|
| ||
| 1D | 184 | 3,466 | 0.8844 | 0.1779 | 0.0902 | 0.1523 |
| 2D | 184 | 4,588 | 0.8873 | 0.1779 | 0.0862 | 0.1534 |
| 3D | 184 | 3,847 | 0.8995 | 0.1592 | 0.0839 | 0.1377 |
| 4D | 184 | 2,591 | 0.9018 | 0.1571 | 0.0868 | 0.1366 |
| 5D | 184 | 3,389 | 0.8983 | 0.1623 | 0.0836 | 0.1407 |
| 6D | 184 | 3,850 | 0.8869 | 0.1742 | 0.0902 | 0.1492 |
| 7D | 184 | 5,490 | 0.9005 | 0.1567 | 0.0798 | 0.1355 |
| Means |
|
|
|
| ||
| Genome-wide mean |
|
|
|
|
MF : Major allele frequency, GD : Genetic Diversity, HZ : Heterozygosity, PIC : Polymorphic Information Content.
Analysis of molecular variance in the panel of 184 genotypes.
| Source | df | SS | MS | Est. Var. | % |
|
|---|---|---|---|---|---|---|
| Among pops | 1 | 16.860 | 16.860 | 0.126 | 20 | 0.001 |
| Within pops | 182 | 91.502 | 0.497 | 0.497 | 80 | 0.001 |
| Total | 183 | 108.858 | 17.357 | 0.623 | 100 | 0.001 |
| Haploid (Nm) | 0.49 | |||||
| Fst | 0.67 |
Df: degree of freedom, SS: sum of squares, MS: mean sum of squares, %: percentage variation.
Genetic differentiation among and within two subpopulations has been estimated, along with Fst gene flow Nm with 9,999 permutations. AP: estimated variance among pops, WP: estimated variance within pops.
***p value <0.001 (based on 9,999 permutations).
Population Shannon information index (I).
| Source of information | Degrees freedom | Log-like Chi-Sq | Shannon inform | Percent of total | Diversity estimate | [0,1] scaled diversity | [0,1] scaled overlap | Estimated probability |
|---|---|---|---|---|---|---|---|---|
| DF | G-test |
| Inform | Exp( | D′ | O’ = 1-D′ |
| |
| Among pops | 1 | 155.425 | 0.425 | 8.604 | 1.529 | 0.816 | 0.184 | 0.923 |
| Within pops | 182 | 1,651.067 | 4.511 | 91.396 | 91.023 | 0.998 | 0.002 | 0.082 |
| Total | 183 | 1806.492 | 4.936 | 100.000 | 139.180 | 0.998 | 0.002 |
FIGURE 4Heat map of kinship matrix with the dendrogram shown on the top-left side based on 123,596 markers.
FIGURE 5Scatter plot showing the linkage disequilibrium (LD) decay across the genomes for 184 diverse Pakistani bread wheat accessions. The genetic distance in megabase pair (Mbp) plotted against the LD estimate (r 2) for pairs of SNPs. The solid red line showed the threshold LD value at r 2 = 0.1 and the solid green line showed the average LD decay point at which the LD curve intercepts the critical r 2. The LOESS LD decay curve was presented with the solid blue curve (A) Genome-wide average LD decay plot using all genomes (B) LD decay plot of the A genome (C) LD decay plot of the B genome; (D) LD decay plot of the D genome.
Chromosome-wise linkage disequilibrium (LD) of individual chromosome.
| A genome | LD (Mbp) | B genome | LD (Mbp) | D genome | LD (Mbp) | |
|---|---|---|---|---|---|---|
| 1 |
| 0.09 |
| 0.05 |
| 0.12 |
| 2 |
| 0.09 |
| 0.01 |
| 0.16 |
| 3 |
| 0.12 |
| 0.13 |
| 0.08 |
| 4 |
| 0.08 |
| 0.08 |
| 0.01 |
| 5 |
| 0.11 |
| 0.11 |
| 0.06 |
| 6 |
| 0.11 |
| 0.1 |
| 0.03 |
| 7 |
| 0.1 |
| 0.11 |
| 0.06 |