| Literature DB >> 32625218 |
Do Yoon Hyun1, Raveendar Sebastin1, Kyung Jun Lee1, Gi-An Lee1, Myoung-Jae Shin1, Seong Hoon Kim1, Jung-Ro Lee1, Gyu-Taek Cho1.
Abstract
Wheat (Triticum spp.) has been an important staple food crop for mankind since the beginning of agriculture. The genus Triticum L. is composed of diploid, tetraploid, and hexaploid species, majority of which have not yet been discriminated clearly, and hence their phylogeny and classification remain unresolved. Genotyping-by-sequencing (GBS) is an easy and affordable method that allows us to generate genome-wide single nucleotide polymorphism (SNP) markers. In this study, we used GBS to obtain SNPs covering all seven chromosomes from 283 accessions of Triticum-related genera. After filtering low-quality and redundant SNPs based on haplotype information, the GBS assay provided 14,188 high-quality SNPs that were distributed across the A (71%), B (26%), and D (2.4%) genomes. Cluster analysis and discriminant analysis of principal components (DAPC) allowed us to distinguish six distinct groups that matched well with Triticum species complexity. We constructed a Bayesian phylogenetic tree using 14,188 SNPs, in which 17 Triticum species and subspecies were discriminated. Dendrogram analysis revealed that the polyploid wheat species could be divided into groups according to the presence of A, B, D, and G genomes with strong nodal support and provided new insight into the evolution of spelt wheat. A total of 2,692 species-specific SNPs were identified to discriminate the common (T. aestivum) and durum (T. turgidum) wheat cultivar and landraces. In principal component analysis grouping, the two wheat species formed individual clusters and the SNPs were able to distinguish up to nine groups of 10 subspecies. This study demonstrated that GBS-derived SNPs could be used efficiently in genebank management to classify Triticum species and subspecies that are very difficult to distinguish by their morphological characters.Entities:
Keywords: Triticum; genebank; genotyping-by-sequencing; phylogenetic analysis; species discrimination; spelt wheat; wheat
Year: 2020 PMID: 32625218 PMCID: PMC7311657 DOI: 10.3389/fpls.2020.00688
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Summary of SNPs used in the Triticum species differentiation based on three homologous wheat genomes with reference to Wheat IWGSC RefSeq v. 1.0.
| Allele | 1A | 2A | 3A | 4A | 5A | 6A | 7A | A genome | 1B | 2B | 3B | 4B | 5B | 6B | 7B | B genome | 1D | 2D | 3D | 4D | 5D | 6D | 7D | D genome |
| No. of SNPs | 1,277 | 1,648 | 1,511 | 1,098 | 1,563 | 1,025 | 2,004 | 10,126 | 476 | 618 | 568 | 323 | 593 | 559 | 584 | 3,721 | 42 | 71 | 59 | 23 | 48 | 40 | 58 | 341 |
| Density (SNP/Mbp) | 2.15 | 2.11 | 2.01 | 1.47 | 2.20 | 1.66 | 2.72 | 2.05 | 0.69 | 0.77 | 0.68 | 0.48 | 0.83 | 0.78 | 0.78 | 0.72 | 0.08 | 0.11 | 0.10 | 0.05 | 0.08 | 0.08 | 0.09 | 0.09 |
| Transition | 864 | 1,083 | 980 | 754 | 1,019 | 695 | 1,271 | 6,666 | 295 | 384 | 355 | 207 | 374 | 365 | 382 | 2,362 | 24 | 42 | 37 | 12 | 24 | 20 | 40 | 199 |
| A/G | 139 | 168 | 164 | 111 | 174 | 119 | 199 | 1,074 | 44 | 60 | 57 | 33 | 55 | 67 | 77 | 393 | 4 | 3 | 5 | 0 | 3 | 8 | 7 | 30 |
| C/T | 299 | 371 | 330 | 250 | 340 | 237 | 452 | 2,279 | 91 | 130 | 117 | 73 | 134 | 110 | 125 | 780 | 12 | 16 | 8 | 5 | 13 | 7 | 13 | 74 |
| T/C | 135 | 177 | 153 | 137 | 143 | 114 | 197 | 1,056 | 49 | 61 | 47 | 40 | 58 | 62 | 64 | 381 | 3 | 7 | 8 | 2 | 3 | 3 | 6 | 32 |
| G/A | 291 | 367 | 333 | 256 | 362 | 225 | 423 | 2,257 | 111 | 133 | 134 | 61 | 127 | 126 | 116 | 808 | 5 | 16 | 16 | 5 | 5 | 2 | 14 | 63 |
| Transversion | 413 | 565 | 531 | 344 | 544 | 330 | 733 | 3,460 | 181 | 234 | 213 | 116 | 219 | 194 | 202 | 1,359 | 18 | 29 | 22 | 11 | 24 | 20 | 18 | 142 |
| A/T | 24 | 38 | 33 | 23 | 40 | 17 | 57 | 232 | 8 | 12 | 15 | 8 | 10 | 10 | 9 | 72 | 1 | 2 | 2 | 1 | 1 | 2 | 1 | 10 |
| A/C | 40 | 39 | 39 | 26 | 40 | 16 | 52 | 252 | 11 | 21 | 12 | 6 | 17 | 19 | 21 | 107 | 1 | 0 | 0 | 0 | 1 | 3 | 3 | 8 |
| T/A | 34 | 40 | 33 | 24 | 43 | 23 | 36 | 233 | 19 | 15 | 9 | 6 | 17 | 14 | 6 | 86 | 1 | 1 | 1 | 0 | 2 | 3 | 0 | 8 |
| T/G | 27 | 55 | 46 | 22 | 32 | 30 | 53 | 265 | 9 | 21 | 13 | 7 | 16 | 17 | 15 | 98 | 0 | 1 | 1 | 0 | 1 | 1 | 1 | 5 |
| C/A | 75 | 92 | 84 | 59 | 76 | 59 | 118 | 563 | 29 | 40 | 41 | 20 | 29 | 26 | 36 | 221 | 5 | 4 | 3 | 1 | 1 | 1 | 1 | 16 |
| C/G | 66 | 96 | 106 | 67 | 96 | 67 | 149 | 647 | 46 | 45 | 49 | 25 | 37 | 42 | 42 | 286 | 5 | 6 | 4 | 3 | 7 | 6 | 6 | 37 |
| G/T | 72 | 111 | 79 | 60 | 106 | 54 | 125 | 607 | 25 | 31 | 27 | 18 | 35 | 26 | 36 | 198 | 2 | 5 | 5 | 2 | 4 | 4 | 2 | 24 |
| G/C | 75 | 94 | 111 | 63 | 111 | 64 | 143 | 661 | 34 | 49 | 47 | 26 | 58 | 40 | 37 | 291 | 3 | 10 | 6 | 4 | 7 | 0 | 4 | 34 |
| Ts% | 67.66 | 65.72 | 64.86 | 68.67 | 65.20 | 67.80 | 63.42 | 65.83 | 61.97 | 62.14 | 62.50 | 64.09 | 63.07 | 65.30 | 65.41 | 63.48 | 57.14 | 59.15 | 62.71 | 52.17 | 50.00 | 50.00 | 68.97 | 58.36 |
| Tv% | 32.34 | 34.28 | 35.14 | 31.33 | 34.80 | 32.20 | 36.58 | 34.17 | 38.03 | 37.86 | 37.50 | 35.91 | 36.93 | 34.70 | 34.59 | 36.52 | 42.86 | 40.85 | 37.29 | 47.83 | 50.00 | 50.00 | 31.03 | 41.64 |
| Ts/Tv ratio | 2.09 | 1.92 | 1.85 | 2.19 | 1.87 | 2.11 | 1.73 | 1.93 | 1.63 | 1.64 | 1.67 | 1.78 | 1.71 | 1.88 | 1.89 | 1.74 | 1.33 | 1.45 | 1.68 | 1.09 | 1.00 | 1.00 | 2.22 | 1.40 |
FIGURE 1ADMIXTURE results assuming five and eight ancestral populations. Colors represent ancestry components. Stacked bars represent samples. Samples are arranged according to taxonomy as indicated in the x-axis.
FIGURE 2Discriminant analysis of principal components (DAPC) for 283 Triticum accessions using a 52,186 SNP set. Total 30 first PCs and five discriminant eigenvalues were retained during analyses, to describe the relationship between the clusters. The axes represent the first two linear discriminants (LDs). Each circle represents a cluster and each dot represents an individual. Numbers represent the different subpopulations identified by DAPC analysis.
Analysis of molecular variance (AMOVA) within and among the groups of 283 wheat accessions identified by the DAPC clustering.
| SV | df | SS | MS | Est. var. | % | PhiPT |
| Among clusters | 5 | 6706.747 | 1341.349 | 43.6254 | 67.12598 | 0.671 |
| Within clusters | 277 | 5918.089 | 21.36494 | 21.36494 | 32.87402 | |
| Total | 282 | 12624.84 | 44.76892 | 64.99035 | 100 |
Pairwiese genetic differentiation values (PhiPT) between clusters of 283 Triticum accessions.
| Cluster | Cluster | PhiPT |
| 1 | 2 | 0.055 |
| 3 | 1 | 0.469 |
| 3 | 2 | 0.340 |
| 4 | 1 | 0.128 |
| 4 | 2 | 0.078 |
| 4 | 3 | 0.212 |
| 4 | 5 | 0.139 |
| 4 | 6 | 0.134 |
| 5 | 1 | 0.316 |
| 5 | 2 | 0.222 |
| 5 | 3 | 0.186 |
| 5 | 6 | 0.135 |
| 6 | 1 | 0.244 |
| 6 | 2 | 0.181 |
| 6 | 3 | 0.176 |
FIGURE 3Bayesian phylogenetic tree of 114 accessions of the Triticum species and subspecies using 14,188 SNPs (80% missing level) obtained by GBS. Numbers in nodes are Bayesian posterior probabilities (×100). Only values above 50% are included. When a value is not included, the corresponding node was either present with lower support or unresolved. The accessions grouping in the color bar is identical to admixture ancestry coefficient (K = 5) of 114 accessions. The outgroup taxon is Ae. triuncialis.
FIGURE 4Species-specific SNP-based clustering of common [T. aestivum (A)] and durum [T. turgidum (T)] wheat accessions. Each node represents the number of species-specific SNPs (Supplementary Table S3).
FIGURE 5The possible concept for the evolution of spelt wheat complex. The Asian and European spelt wheat are monophyletic in the D genome whereas they are polyphyletic in A and B genome.