| Literature DB >> 32795254 |
Ahasanul Hoque1, Jason D Fiedler2, Mukhlesur Rahman3.
Abstract
BACKGROUND: A sustainable breeding program requires a minimum level of germplasm diversity to provide varied options for the selection of new breeding lines. To maximize genetic gain of the North Dakota State University (NDSU) flax breeding program, we aimed to increase the genetic diversity of its parental stocks by incorporating diverse genotypes. For this purpose, we analyzed the genetic diversity, linkage disequilibrium, and population sub-structure of 350 globally-distributed flax genotypes with 6200 SNP markers.Entities:
Keywords: Flax; GBS; Genetic diversity; Germplasm collection; Linkage disequilibrium; SNP
Mesh:
Year: 2020 PMID: 32795254 PMCID: PMC7430851 DOI: 10.1186/s12864-020-06922-2
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Distribution of SNPs
| Chromosome | No. of SNPs | % SNPs | Start position a | End position a | Length (Mb) | Density (Kb) |
|---|---|---|---|---|---|---|
| Lu1 | 550 | 8.87 | 48,002 | 28,940,544 | 28.89 | 52.53 |
| Lu2 | 402 | 6.48 | 343,539 | 25,278,102 | 24.93 | 62.03 |
| Lu3 | 485 | 7.82 | 56,610 | 26,551,417 | 26.49 | 54.63 |
| Lu4 | 299 | 4.82 | 20,788 | 19,857,012 | 19.83 | 66.34 |
| Lu5 | 424 | 6.84 | 58,098 | 17,649,206 | 17.59 | 41.49 |
| Lu6 | 366 | 5.90 | 29,887 | 17,856,972 | 17.82 | 48.71 |
| Lu7 | 389 | 6.27 | 3623 | 18,287,460 | 18.28 | 47.00 |
| Lu8 | 358 | 5.77 | 81,017 | 23,662,693 | 23.58 | 65.87 |
| Lu9 | 412 | 6.65 | 124,789 | 21,763,401 | 21.63 | 52.52 |
| Lu10 | 308 | 4.97 | 199,991 | 17,833,309 | 17.63 | 57.25 |
| Lu11 | 454 | 7.32 | 76,724 | 19,841,794 | 19.76 | 43.54 |
| Lu12 | 425 | 6.85 | 52,319 | 20,832,003 | 20.77 | 48.89 |
| Lu13 | 552 | 8.90 | 14,015 | 20,413,108 | 20.39 | 36.95 |
| Lu14 | 423 | 6.82 | 24,838 | 19,367,496 | 19.34 | 45.73 |
| Lu15 | 353 | 5.69 | 38,217 | 15,613,904 | 15.57 | 44.12 |
| Mean | 413.33 | 51.17 |
aposition given in bp
Transition and transversion SNPs across the genome
| SNP type | Model | No. of sites | Frequencies (%) | Total (percentage) |
|---|---|---|---|---|
| Transitions | A/G | 1758 | 28.35 | 3532 (56.97%) |
| C/T | 1774 | 28.61 | ||
| Transversions | A/T | 720 | 11.61 | 2668 (43.03%) |
| A/C | 645 | 10.40 | ||
| G/T | 593 | 9.56 | ||
| G/C | 710 | 11.45 |
Fig. 1Clustering of whole collection using 6200 SNP markers. a Graphical representation of Delta K for different number of sub-population determined by Evanno’s method. b Estimated population structure of 350 flax genotypes on K = 7 according to Delta K
Number of pure and admixed individuals per sub-population
| Sub-populations | Total no. of genotypes | 0.7 cutoff | 0.9 cutoff | ||
|---|---|---|---|---|---|
| No. of genotypes | % of from total | No. of genotypes | % of from total | ||
| P1 | 42 | 20 | 47.62 | 12 | 28.6 |
| P2 | 55 | 35 | 63.64 | 21 | 38.2 |
| P3 | 72 | 44 | 61.11 | 14 | 19.4 |
| P4 | 22 | 4 | 18.18 | 0 | 0.0 |
| P5 | 106 | 86 | 81.13 | 40 | 37.7 |
| P6 | 27 | 22 | 81.48 | 21 | 77.8 |
| P7 | 26 | 16 | 61.54 | 14 | 53.8 |
| Total | 350 | 227 | 64.86 | 122 | 34.9 |
Fig. 2Principal coordinate analysis of SNP diversity based on genetic distance. Colors represent sub-populations identified at K = 7 in Fig. 1
Fig. 3Neighbor-joining phylogenetic tree based on genetic distance matrix representing the clustering of 350 flax genotypes
Sub-population wise diversity parameters
| Sub-populations | Polymorphic loci (%) | Tajima’s D | |||||
|---|---|---|---|---|---|---|---|
| P1 | 87 | 1.87 | 1.39 | 0.37 | 0.24 | 0.24 | 0.75 |
| P2 | 87 | 1.87 | 1.33 | 0.33 | 0.21 | 0.21 | 0.40 |
| P3 | 98 | 1.98 | 1.46 | 0.44 | 0.28 | 0.29 | 1.50 |
| P4 | 78 | 1.78 | 1.41 | 0.37 | 0.24 | 0.25 | 0.79 |
| P5 | 88 | 1.88 | 1.33 | 0.32 | 0.20 | 0.20 | 0.69 |
| P6 | 62 | 1.62 | 1.18 | 0.21 | 0.12 | 0.13 | −0.85 |
| P7 | 80 | 1.80 | 1.35 | 0.34 | 0.22 | 0.23 | 0.35 |
| Mean | 83 | 1.83 | 1.35 | 0.34 | 0.22 | 0.22 | 0.52 |
a No. of different alleles, b No. of effective alleles, c Shannon’s information index
d Diversity, e Unbiased diversity
Mean pairwise relatedness (r) values within sub-population
| Sub-populations | P1 | P2 | P3 | P4 | P5 | P6 | P7 |
|---|---|---|---|---|---|---|---|
| Mean | 0.095 | 0.101 | 0.043 | 0.114 | 0.088 | 0.338 | 0.127 |
| Upper mark | 0.006 | 0.004 | 0.003 | 0.011 | 0.002 | 0.008 | 0.008 |
| Lower mark | −0.005 | − 0.004 | −0.003 | − 0.008 | −0.003 | − 0.007 | −0.007 |
| 0.001 | 0.001 | 0.001 | 0.001 | 0.001 | 0.001 | 0.001 |
Fig. 4Within sub-population pairwise mean relatedness (r) for flax collections
Summary of AMOVA
| Sources of variation | df | SS | MS | Est. Var. | % of variation | Fixation indices | Nm |
|---|---|---|---|---|---|---|---|
| Among sub-populations | 6 | 164,598.46 | 27,433.08 | 274.90 | 28 | Fst: 0.28 | 0.64 |
| Among individuals | 343 | 483,200.88 | 1408.75 | 704.37 | 72 | Fis: 1.00 | |
| Within individuals | 350 | 0.00 | 0.00 | 0.00 | 0 | Fit: 1.00 | |
| Total | 699 | 647,799.34 | 979.27 | 100 |
Genetic differentiation among sub-populations
| Sub-population pairwise | |||||||
|---|---|---|---|---|---|---|---|
| P1 | P2 | P3 | P4 | P5 | P6 | P7 | |
| P1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| P2 | 0.25 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| P3 | 0.13 | 0.21 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| P4 | 0.21 | 0.31 | 0.13 | 0.00 | 0.00 | 0.00 | 0.00 |
| P5 | 0.27 | 0.16 | 0.21 | 0.32 | 0.00 | 0.00 | 0.00 |
| P6 | 0.48 | 0.54 | 0.40 | 0.46 | 0.54 | 0.00 | 0.00 |
| P7 | 0.20 | 0.32 | 0.14 | 0.17 | 0.32 | 0.51 | 0.00 |
Below diagonal values are pairwise Fst comparison. Above diagonals depicts the P values.
Mantel test output showing genetic and geographic distance correlation
| Sub-populations | SSx a | SSy b | SPxy | Rxy c | |
|---|---|---|---|---|---|
| P1 | 45,800,457.07 | 5,538,022,460 | 93,859,398.73 | 0.19 | 0.05 |
| P2 | 134,861,866.3 | 41,553,712,025 | 246,361,305.5 | 0.10 | 0.13 |
| P3 | 120,655,500 | 8,473,759.059 | 3,098,570.347 | 0.10 | 0.09 |
| P4 | 26,416,786.96 | 1,683,893,955 | 52,933,834.97 | 0.25 | 0.01 |
| P5 | 318,721,174.9 | 85,673,146,302 | 1,827,968,297 | 0.35 | 0.01 |
| P6 | 92,879,654.84 | 3,062,447,732 | 179,000,333.5 | 0.34 | 0.07 |
| P7 | 19,812,140.31 | 479,364,655.5 | 35,378,695.93 | 0.36 | 0.05 |
| Whole collection | 8,016,027,762.51 | 1,165,284,415,054.30 | 28,882,623,823.41 | 0.30 | 0.001 |
a Genetic distance, b Geographic distance, c Correlation coefficient values
Linkage disequilibrium in the studied collection
| Sub-populations | Mean linked LD | Mean unlinked LD | Mean LD | Loci pairs in linked LD (%) | Loci pairs in unlinked LD (%) |
|---|---|---|---|---|---|
| Whole collection | 0.41 | 0.02 | 0.03 | 2.46 | 97.54 |
| P1 | 0.40 | 0.03 | 0.04 | 3.77 | 96.23 |
| P2 | 0.44 | 0.03 | 0.05 | 5.80 | 94.20 |
| P3 | 0.48 | 0.02 | 0.02 | 1.08 | 98.92 |
| P4 | 0.38 | 0.04 | 0.08 | 11.56 | 88.44 |
| P5 | 0.45 | 0.02 | 0.04 | 4.92 | 95.08 |
| P6 | 0.50 | 0.04 | 0.17 | 28.22 | 71.78 |
| P7 | 0.39 | 0.04 | 0.06 | 7.04 | 92.96 |
Fig. 5Linkage disequilibrium (LD) differences and decay pattern among sub-populations
Fig. 6Distribution of genotypes (The map was created using software ArcGIS10.5 trial version, https://www.arcgis.com/features/login/activation.html?activationcode=7c5ef1a364d6625b9482, Subscription ID: 3146087987, accessed on June 25, 2019)