| Literature DB >> 29089957 |
Micha M Bayer1, Paulo Rapazote-Flores1, Martin Ganal2, Pete E Hedley1, Malcolm Macaulay1, Jörg Plieske2, Luke Ramsay1, Joanne Russell1, Paul D Shaw1, William Thomas1, Robbie Waugh1.
Abstract
High-throughput genotyping arrays continue to be an attractive, cost-effective alternative to sequencing based approaches. We have developed a new 50k Illumina Infinium iSelect genotyping array for barley, a cereal crop species of major international importance. The majority of SNPs on the array have been extracted from variants called in exome capture data of a wide range of European barley germplasm. We used the recently published barley pseudomolecule assembly to map the exome capture data, which allowed us to generate markers with accurate physical positions and detailed gene annotation. Markers from an existing and widely used barley 9k Infinium iSelect array were carried over onto the 50k chip for backward compatibility. The array design featured 49,267 SNP markers that converted into 44,040 working assays, of which 43,461 were scorable in GenomeStudio. Of the working assays, 6,251 are from the 9k iSelect platform. We validated the SNPs by comparing the genotype calls from the new array to legacy datasets. Rates of agreement averaged 98.1 and 93.9% respectively for the legacy 9k iSelect SNP set (Comadran et al., 2012) and the exome capture SNPs. To test the utility of the 50k chip for genetic mapping, we genotyped a segregating population derived from a Golden Promise × Morex cross (Liu et al., 2014) and mapped over 14,000 SNPs to genetic positions which showed a near exact correspondence to their known physical positions. Manual adjustment of the cluster files used by the interpreting software for genotype scoring improved results substantially, but migration of cluster files between sites led to a deterioration of results, suggesting that local adjustment of cluster files is required on a site-per-site basis. Information relating to the markers on the chip is available online at https://ics.hutton.ac.uk/50k.Entities:
Keywords: SNP; barley; exome capture; genotyping chip; iSelect
Year: 2017 PMID: 29089957 PMCID: PMC5651081 DOI: 10.3389/fpls.2017.01792
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Figure 1Principal Coordinate Analysis (PCoA) of 394 accessions genotyped with 6,917 polymorphic SNPs. The core set of 148 barley lines chosen are classified into 5 groups: (i) Central European old cultivars; (ii) spring 6 row; (iii) spring 2 row; (iv) winter 2 row and; (v) winter 6 row, and the remaining 246 accessions (classified as “rest”) are shown.
Composition of the 170 barley spring and winter germplasm lines chosen for the 50k SNP design.
| 1920–1940 | 1 | |||
| 1940–1960 | 4 | 1 | 0 | 1 |
| 1960–1980 | 14 | 2 | 1 | 2 |
| 1980–2000 | 66 | 3 | 18 | 6 |
| 2000–2010 | 24 | 5 | 12 | 3 |
| Unknown | 5 | 1 | 1 | |
| Total | 113 | 12 | 31 | 14 |
Figure 2Plot of 9k SNP marker genetic map positions vs. physical map positions for chromosome 1H.
Figure 3Physical distribution of the SNPs in the initial SNP set, the additional set targeted at markers in the distal chromosome regions, and the combination of these two sets (chromosome 1H).
Summary of SNP numbers during the selection process.
| Total number called in 170 sample set | 20,560,627 |
| After filtering | 528,439 |
| Submitted to Illumina ADT (chr 1H-7H only) | 519,742 |
| Passed ADT analysis | 295,642 |
| Max. gene coverage set | 16,957 |
| Max. chromo coverage + gene evenness set | 26,091 |
| Total number from exome capture pre redundancy check | 43,048 |
| Overlap between 9k and EXCAP set | 732 |
| Legacy markers in design carried forward from 9k chip | 6,951 |
| Exome capture markers in design post redundancy check | 42,316 |
| Final number for array design | 49,267 |
| Total number of working assays on 50k chip | 44,040 (6,251 from 9k + 37,789 from EXCAP) |
| Total number of scorable assays on 50k chip | 43,461 |
SNP effect annotation for the final set of SNPs on the 50k chip by type/region.
| Intron | 28.56 |
| Downstream | 21.93 |
| Upstream | 13.80 |
| UTR 3 prime | 11.39 |
| Synonymous coding | 8.00 |
| Non-synonymous coding | 7.48 |
| UTR 5 prime | 4.88 |
| Splice site region | 1.95 |
| Start gained | 0.92 |
| Intergenic | 0.80 |
| Stop gained | 0.12 |
| Stop lost | 0.06 |
| Splice site acceptor | 0.05 |
| Splice site donor | 0.04 |
| Synonymous stop | 0.02 |
| Start lost | 0.01 |
| Non-synonymous start | 0.00 |
Figure 4Boxplot of genotype call agreement rate (%) from the comparison of calls from the exome capture variant calling and the GenomeStudio calls for the accessions genotyped with the new 50k chip. The data shown is categorised by the provenance of the SNPs (9k = SNPs from the existing 9k chip, EXCAP SNPs = SNPs from exome capture data that are new and exclusive to the 50k chip), the source and type of cluster file (JH, James Hutton; TG, TraitGenetics, adjusted vs. unadjusted) and the source of the 50k genotype call set used (JH vs. TG). Bold horizontal lines represent the median, box boundaries upper and lower quartiles, whiskers maxima and minima, and open circles represent outliers.
Summary of the genetic map derived from the GP × Mo RIL population.
| 1H | 113.7 | 1,730 |
| 2H | 146.1 | 2,927 |
| 3H | 127.3 | 2,160 |
| 4H | 118.4 | 1,371 |
| 5H | 179.3 | 2,299 |
| 6H | 116.0 | 1,991 |
| 7H | 110.2 | 2,148 |
| Total | 914.0 | 14,626 |
Figure 5Comparison of genetic map and physical positions of SNPs on chromosome 6H mapped in GPxMo RIL population.
Figure 6Comparison of genetic map and physical position on chromosome 5H around the position of HvDEP1, the causal gene underlying the ari-e.GP mutant phenotype mapped in the GP × Mo RIL population. SNPs from the 50k are shown as black diamonds and the three GBS SNPs are shown as red circles.