| Literature DB >> 26466852 |
Mark O Winfield1, Alexandra M Allen1, Amanda J Burridge1, Gary L A Barker1, Harriet R Benbow1, Paul A Wilkinson1, Jane Coghill1, Christy Waterfall1, Alessandro Davassi2, Geoff Scopes2, Ali Pirani2, Teresa Webster2, Fiona Brew2, Claire Bloor2, Julie King3, Claire West4, Simon Griffiths4, Ian King3, Alison R Bentley5, Keith J Edwards1.
Abstract
In wheat, a lack of genetic diversity between breeding lines has been recognized as a significant block to future yield increases. Species belonging to bread wheat's secondary and tertiary gene pools harbour a much greater level of genetic variability, and are an important source of genes to broaden its genetic base. Introgression of novel genes from progenitors and related species has been widely employed to improve the agronomic characteristics of hexaploid wheat, but this approach has been hampered by a lack of markers that can be used to track introduced chromosome segments. Here, we describe the identification of a large number of single nucleotide polymorphisms that can be used to genotype hexaploid wheat and to identify and track introgressions from a variety of sources. We have validated these markers using an ultra-high-density Axiom(®) genotyping array to characterize a range of diploid, tetraploid and hexaploid wheat accessions and wheat relatives. To facilitate the use of these, both the markers and the associated sequence and genotype information have been made available through an interactive web site.Entities:
Keywords: genotyping array; next-generation sequencing; secondary and tertiary gene pools; single nucleotide polymorphism; wheat; wheat progenitors
Mesh:
Substances:
Year: 2015 PMID: 26466852 PMCID: PMC4950041 DOI: 10.1111/pbi.12485
Source DB: PubMed Journal: Plant Biotechnol J ISSN: 1467-7644 Impact factor: 9.803
Distribution of SNP probes across the twenty‐one hexaploid wheat chromosomes as determined using the Exonerate software
| Chromosome | Contigs per chromosome | SNPs assigned | SNP containing contigs | |||
|---|---|---|---|---|---|---|
| Number | Percentage | Number | Percentage | Number | Percentage | |
| 1A | 385 164 | 3.57 | 21 708 | 4.41 | 2577 | 4.24 |
| 1B | 380 769 | 3.53 | 20 955 | 4.26 | 2592 | 4.26 |
| 1D | 418 941 | 3.89 | 22 831 | 4.64 | 2495 | 4.10 |
| 2A | 586 072 | 5.44 | 28 131 | 5.72 | 3322 | 5.46 |
| 2B | 610 231 | 5.66 | 29 349 | 5.97 | 3688 | 6.06 |
| 2D | 753 346 | 6.99 | 29 467 | 5.99 | 3933 | 6.46 |
| 3A | 546 152 | 5.07 | 20 588 | 4.19 | 2842 | 4.67 |
| 3B | 546 922 | 5.08 | 28 508 | 5.80 | 3357 | 5.52 |
| 3D | 641 702 | 5.95 | 20 763 | 4.22 | 3078 | 5.06 |
| 4A | 663 964 | 6.16 | 22 968 | 4.67 | 2975 | 4.89 |
| 4B | 591 798 | 5.49 | 20 418 | 4.15 | 2769 | 4.55 |
| 4D | 572 506 | 5.31 | 22 277 | 4.53 | 2811 | 4.62 |
| 5A | 586 203 | 5.44 | 20 387 | 4.15 | 2567 | 4.22 |
| 5B | 573 553 | 5.32 | 29 635 | 6.03 | 3668 | 6.03 |
| 5D | 371 504 | 3.45 | 29 203 | 5.94 | 2876 | 4.73 |
| 6A | 456 255 | 4.23 | 19 395 | 3.94 | 2306 | 3.79 |
| 6B | 418 338 | 3.88 | 17 385 | 3.54 | 2454 | 4.03 |
| 6D | 292 347 | 2.71 | 22 473 | 4.57 | 2191 | 3.60 |
| 7A | 495 959 | 4.60 | 19 857 | 4.04 | 2702 | 4.44 |
| 7B | 507 514 | 4.71 | 18 679 | 3.80 | 2663 | 4.38 |
| 7D | 377 467 | 3.50 | 26 815 | 5.45 | 2975 | 4.89 |
| Total | 10 776 707 | 100.00 | 491 792 | 100.00 | 60 841 | 100.00 |
Figure 1Single nucleotide polymorphisms (SNP) probe distribution across the hexaploid genome. Distribution of SNP‐probes per IWGSC contigs compared to contig length. The number of SNP‐probes per IWGSC was determined using ‘Exonerate’. For each SNPs per contig grouping the mean (red line) or median (green line) size of the contigs in base pairs (bp) was determined by standard means as was the linear regression (dotted line) of the mean contig length.
Figure 2Examples of the six probe calling categories: (a) Poly High Resolution; (b) No Minor Hom; (c), Off‐Target Variants (OTV); (d) Mono High Resolution; (e) Call Rate Below Threshold; and (f) Other.
Figure 3Number of probes categorised as polymorphic and high quality for each of the different comparisons. Box colours highlight the number of polymorphisms within and between groups; green represents low numbers and red high numbers.
Physical location of the SNP probes. Physical location was determined using either the available nullisomic/tetrasomic lines or the Kansas deletion lines
| Chromosome | Exonerate | Nullitetra | Kansas |
|---|---|---|---|
| 1A | 21 708 | 5328 | 3769 |
| 1B | 20 955 | 7179 | 8786 |
| 1D | 22 831 | 8736 | 8150 |
| 2A | 28 131 | 1413 | 2684 |
| 2B | 29 349 | 12 814 | 4454 |
| 2D | 29 467 | 10 892 | 3384 |
| 3A | 20 588 | 13 760 | 11 518 |
| 3B | 28 508 | —– | 4535 |
| 3D | 20 763 | 14 790 | 3717 |
| 4A | 22 968 | 9792 | – |
| 4B | 20 418 | 1341 | 2186 |
| 4D | 22 277 | 8485 | 6735 |
| 5A | 20 387 | 11 121 | 5294 |
| 5B | 29 635 | 7863 | 2616 |
| 5D | 29 203 | 6540 | 4190 |
| 6A | 19 395 | 5198 | – |
| 6B | 17 385 | 4209 | 5581 |
| 6D | 22 473 | 8324 | – |
| 7A | 19 857 | 6125 | 6731 |
| 7B | 18 679 | 7037 | 37 482 |
| 7D | 26 815 | 10 922 | 6178 |
| Total | 491 792 | 161 869 | 127 990 |
We did not include a 3B nullisomic/tetrasomic line so could not map markers to this chromosome.
Deletion lines for these chromosomes were not included on the array.
Genetic location of the SNP probes determined using one of the three mapping populations Avalon × Cadenza (A × C), Savannah × Rialto (S × R) and Synthetic × Opata (S × O). Genetic maps are available in Tables S4 and S5
| Chromosome | A × C | S × R | S × O | Consensus |
|---|---|---|---|---|
| 1A | 988 | 1353 | 1371 | 2938 |
| 1B | 2379 | 885 | 2306 | 4303 |
| 1D | 896 | 471 | 979 | 2077 |
| 2A | 873 | 487 | 1285 | 2451 |
| 2B | 1709 | 2876 | 2368 | 5967 |
| 2D | 506 | 61 | 1168 | 1599 |
| 3A | 697 | 630 | 1144 | 2083 |
| 3B | 1051 | 1037 | 1787 | 3207 |
| 3D | 152 | 368 | 2010 | 2344 |
| 4A | 905 | 242 | 1518 | 2358 |
| 4B | 599 | 225 | 1240 | 1756 |
| 4D | 55 | 86 | 703 | 813 |
| 5A | 802 | 1048 | 1022 | 2348 |
| 5B | 1191 | 985 | 2138 | 3844 |
| 5D | 425 | 413 | 1519 | 2019 |
| 6A | 1459 | 1146 | 1565 | 3129 |
| 6B | 1557 | 1771 | 1739 | 4090 |
| 6D | 143 | 187 | 1192 | 1397 |
| 7A | 1161 | 965 | 1378 | 2851 |
| 7B | 1123 | 559 | 2110 | 3251 |
| 7D | 271 | 244 | 1266 | 1680 |
| Total | 18 942 | 16 039 | 31 808 | 56 505 |
Figure 4(a) Principal coordinate plot (multidimensional scaling) of all 167 lines (14 D genome, 8 AB genome tetraploids, 10 wild relatives, 108 ABD genome hexaploids and 27 Watkins lines) against 546 299 SNP‐markers. The wild relatives are: 1. Ae. caudata (Ae. markgrafii; C genome), 2. Ae. mutica (syn. Amblyopyrum muticum; T genome), 3. Ae. speltoides (closest living relative to the B genome progenitor), 4. S. cereale (R genome), 5. Th. bessarabicum (J genome), 6. Th. elongatum (E genome), 7. Th. intermedium (JJ sS), 8. Th. ponticum (JJJJ sJs genome), 9. T. timopheevii (GA genome), 10. T. urartu (syn. T. monococcum ssp. aegilopoides; A genome progenitor). The genomes, ploidy and synonyms of these species are given in Table S1. (b) PCO plot of the putative lines belonging to the D genome progenitor, Ae. tauschii. Two distinct clusters are formed; these essentially reflect subspecies (Ae. tauschii ssp. strangulata or Ae. tauschii ssp. tauschii) and geographical location of collection. The strangulata lines, which are indicated by a blue star, all come from northern Iran. (c) PCO plot of the T. turgidum accessions. The first coordinate separates the T. turgidum ssp. dicoccoides line (red dot) from all the other lines that belong to subspecies durum. (d) PCO plot of the hexaploid accessions; blue = winter wheats, green = spring wheats, red = Watkins collection. The numbered lines are those that carry the 1BS/1RS translocation: 1 = Bacanora, 2 = Bobwhite, 3 = Brompton, 4 = Gatsby, 5 = Humber, 6 = Kielder, 7 = Lynx, 8 = Relay, 9 = Rialto, 10 = Savannah. Please note that the accessions Lynx and Savannah (7 and 10, respectively) collocate on the PCO plot.
Figure 5Heatmaps of genotype scores of 104 hexaploid varieties for loci mapped to chromosome (a) 1B and (b) 7DL. The genotypes are organised horizontically by a dendrogram produced using hierarchical cluster analysis and vertically by centimorgan position along the chromosome according to the Synthetic × Opata genetic map. Genotype scores have been coded for each locus as: 1 = least common genotype score; 2 = second most common genotype score and 3 = most common genotype score, and have been coloured according to the legend shown. (a) The heatmap of chromosome 1B shows the distinct haplotypes between those lines carrying the 1RS/1BS substitution (accession names highlighted in red; 0–133 cM) and those lines that do not. This figure also displays the lines belonging to Cadenza derived accessions (accession names highlighted in blue) which have a distinct haplotype on 1B (97.8–198 cM). (b) The heatmap of 7DL highlights accessions carrying Ae. ventricosa introgressions (accession names highlighted in red, 456.8–556.8 cM; accession names highlighted in blue, 551.7–556.8 cM).