| Literature DB >> 28913866 |
Amanda J Burridge1, Paul A Wilkinson1, Mark O Winfield1, Gary L A Barker1, Alexandra M Allen1, Jane A Coghill1, Christy Waterfall1, Keith J Edwards1.
Abstract
Wheat breeders and academics alike use single nucleotide polymorphisms (SNPs) as molecular markers to characterize regions of interest within the hexaploid wheat genome. A number of SNP-based genotyping platforms are available, and their utility depends upon factors such as the available technologies, number of data points required, budgets and the technical expertise required. Unfortunately, markers can rarely be exchanged between existing and newly developed platforms, meaning that previously generated data cannot be compared, or combined, with more recently generated data sets. We predict that genotyping by sequencing will become the predominant genotyping technology within the next 5-10 years. With this in mind, to ensure that data generated from current genotyping platforms continues to be of use, we have designed and utilized SNP-based capture probes from several thousand existing and publicly available probes from Axiom® and KASP™ genotyping platforms. We have validated our capture probes in a targeted genotyping by sequencing protocol using 31 previously genotyped UK elite hexaploid wheat accessions. Data comparisons between targeted genotyping by sequencing, Axiom® array genotyping and KASP™ genotyping assays, identified a set of 3256 probes which reliably bring together targeted genotyping by sequencing data with the previously available marker data set. As such, these probes are likely to be of considerable value to the wheat community. The probe details, full probe sequences and a custom built analysis pipeline may be freely downloaded from the CerealsDB website (http://www.cerealsdb.uk.net/cerealgenomics/CerealsDB/sequence_capture.php).Entities:
Keywords: genotyping by sequencing; single nucleotide polymorphism; target capture; wheat
Mesh:
Substances:
Year: 2017 PMID: 28913866 PMCID: PMC5866950 DOI: 10.1111/pbi.12834
Source DB: PubMed Journal: Plant Biotechnol J ISSN: 1467-7644 Impact factor: 9.803
The number of sequencing reads and sequencing quality score for each variety. Adaptor trimming and quality check were carried out using Sickle version 1.33 (Joshi and Fass, 2011). The reads are available from NCBI sequencing read archive (Project accession: PRJNA349252)
| Variety | Total number of fastq reads | Total number of fastq reads after trimming (%) | Sequence quality before trimming | Sequence quality after trimming |
|---|---|---|---|---|
| Alchemy | 26 027 926 | 15 962 990 (61.33) | 34.56 | 35.29 |
| Apogee | 28 041 950 | 17 448 872 (62.22) | 34.60 | 35.30 |
| Avalon | 26 109 364 | 15 868 878 (60.78) | 34.58 | 35.29 |
| Battalion | 24 989 864 | 13 117 728 (52.49) | 34.42 | 35.27 |
| Cadenza | 21 962 466 | 13 266 852 (60.41) | 34.56 | 35.29 |
| Caphorn | 35 853 480 | 22 554 522 (62.91) | 34.61 | 35.30 |
| Chinese Spring | 26 889 740 | 16 967 472 (63.1) | 34.61 | 35.30 |
| Claire | 30 598 106 | 18 763 754 (61.32) | 34.57 | 35.30 |
| Cocoon | 26 623 364 | 16 586 542 (62.3) | 34.60 | 35.31 |
| Consort | 31 497 188 | 19 825 662 (62.94) | 34.61 | 35.30 |
| Cordiale | 23 419 158 | 14 324 164 (61.16) | 34.61 | 35.30 |
| Evolution | 23 395 860 | 14 035 648 (59.99) | 34.57 | 35.30 |
| Exsept | 25 121 678 | 16 083 460 (64.02) | 34.61 | 35.31 |
| Galahad | 33 196 828 | 19 799 898 (59.64) | 34.55 | 35.29 |
| Gallant | 26 343 408 | 15 973 848 (60.64) | 34.56 | 35.30 |
| Gatsby | 27 481 234 | 16 727 950 (60.87) | 34.58 | 35.29 |
| Glasgow | 32 678 758 | 19 101 072 (58.45) | 34.52 | 35.28 |
| Hereward | 24 876 424 | 15 346 846 (61.69) | 34.59 | 35.30 |
| Humber | 26 320 222 | 16 541 152 (62.85) | 34.61 | 35.30 |
| Keilder | 26 019 170 | 16 061 178 (61.73) | 34.58 | 35.30 |
| Mendel | 29 329 778 | 16 059 206 (54.75) | 34.46 | 35.28 |
| Opata | 25 953 102 | 15 725 162 (60.59) | 34.57 | 35.29 |
| Paragon | 26 175 096 | 16 860 030 (64.41) | 34.63 | 35.30 |
| Recital | 26 159 128 | 16 797 210 (64.21) | 34.63 | 35.30 |
| Reflection | 27 332 360 | 17 174 174 (62.83) | 34.61 | 35.30 |
| Rialto | 29 480 576 | 17 848 792 (60.54) | 34.56 | 35.29 |
| Robigus | 26 538 010 | 16 468 758 (62.06) | 34.59 | 35.30 |
| Savannah | 26 854 768 | 17 317 486 (64.49) | 34.63 | 35.30 |
| Skyfall | 26 245 230 | 15 728 342 (59.93) | 34.58 | 35.29 |
| Solstice | 29 430 650 | 18 067 354 (61.39) | 34.59 | 35.29 |
| Xi19 | 25 234 874 | 14 978 980 (59.36) | 34.54 | 35.29 |
Figure 1The genomic distribution of capture probes which captured a tri‐allelic single nucleotide polymorphisms in relation to the total number of capture probes used. Chromosome locations are based on IWGSC Whole Genome Assembly v0.4.
Figure 2Similarity of genotype between genotyping platforms using the same probe design. The number of probes is plotted against the percentage genotype similarity when compared to the varieties (a) Comparison between TGbyS and Axiom, a bin size of 31 was used to generate the histogram. (b) Comparison between TGbyS, Axiom and KASP, a bin size of 20 was used to generate the histogram.
Figure 3Principal co‐ordinates plot comparison of (a) Axiom genotype data and (b) TGbyS genotype data generated by NextSeq sequencing. Data generated for the 11 088 probes for which there was no missing data across 31 varieties. Clustered by squared Euclidean distance.
Summary of probe properties between the total 15, 167 probe set and the 3256 cross‐platform probe set
| Total probe set | Cross‐platform set | |
|---|---|---|
| %GC content | 49.39 | 45.52 |
| % Co‐dominant | 49.74 | 61.5 |
| % Dominant | 28.29 | 18.6 |
Figure 4The ratio of co‐dominant, dominant and partially co‐dominant probes between subsets. (a) Total probe set. (b) Probes identified as part of the cross‐platform set. (c) Probes which generated a genotype which correlated between TGbyS and Axiom platforms for fewer than 16 of the 31 varieties (<50% correlation).
Figure 5Distribution of capture probes throughout the wheat genome for full capture probe set and the cross‐platform subset. The full set represents 15 167 probes, and the cross‐platform set represents 3256 probes.