| Literature DB >> 31105733 |
Benoit Darrier1,2, Joanne Russell3, Sara G Milner4, Pete E Hedley3, Paul D Shaw5, Malcolm Macaulay3, Luke D Ramsay3, Claire Halpin2, Martin Mascher4, Delphine L Fleury1, Peter Langridge1, Nils Stein4,6, Robbie Waugh1,2,3.
Abstract
We compared the performance of two commonly used genotyping platforms, genotyping-by-sequencing (GBS) and single nucleotide polymorphism-arrays (SNP), to investigate the extent and pattern of genetic variation within a collection of 1,000 diverse barley genotypes selected from the German Federal ex situ GenBank hosted at IPK Gatersleben. Each platform revealed equivalent numbers of robust bi-allelic SNPs (39,733 and 37,930 SNPs for the 50K SNP-array and GBS datasets respectively). A small overlap of 464 SNPs was common to both platforms, indicating that the methodologies we used selectively access informative polymorphism in different portions of the barley genome. Approximately half of the GBS dataset was comprised of SNPs with minor allele frequencies (MAFs) below 1%, illustrating the power of GBS to detect rare alleles in diverse germplasm collections. While desired for certain applications, the highly robust calling of alleles at the same SNPs across multiple populations is an advantage of the SNP-array, allowing direct comparisons of data from related or unrelated studies. Overall MAFs and diversity statistics (π) were higher for the SNP-array data, potentially reflecting the conscious removal of markers with a low MAF in the ascertainment population. A comparison of similarity matrices revealed a positive correlation between both approaches, supporting the validity of using either for entire GenBank characterization. To explore the potential of each dataset for focused genetic analyses we explored the outcomes of their use in genome-wide association scans for row type, growth habit and non-adhering hull, and discriminant analysis of principal components for the drivers of sub-population differentiation. Interpretation of the results from both types of analysis yielded broadly similar conclusions indicating that choice of platform used for such analyses should be determined by the research question being asked, group preferences and their capabilities to extract and interpret the different types of output data easily and quickly. Access to the requisite infrastructure for running, processing, analyzing, querying, storing, and displaying either datatype is an additional consideration. Our investigations reveal that for barley the cost per genotyping assay is less for SNP-arrays than GBS, which translates to a cost per informative datapoint being significantly lower for the SNP-array.Entities:
Keywords: GBS; GWAS; SNP-array; diversity; germplasm evaluation
Year: 2019 PMID: 31105733 PMCID: PMC6499090 DOI: 10.3389/fpls.2019.00544
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Single nucleotide polymorphism-arrays (SNPs) marker distribution.
| (A) Number of SNPs per chromosome according to assay platform. | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| 1H | 2H | 3H | 4H | 5H | 6H | 7H | Anchored | Un | Total | |
| 50K | 4,364 | 6,564 | 6,076 | 4,668 | 7,333 | 4,937 | 5,791 | 39,733 | 2,567 | 42,300 |
| GBS | 4,709 | 5,879 | 6,147 | 4,162 | 5,631 | 4,624 | 6,175 | 37,327 | 603 | 37,930 |
| Monomorphic | 26 | 24 | 1,617 | 11,849 | ||||||
| Polymorphic | 39,707 | 39,709 | 35,710 | 25,478 | ||||||
FIGURE 1Distribution of minor allele frequencies in GBS and 50K array data. SNP counts were aggregated in 2% bins.
FIGURE 2Minor allele frequencies according to physical location of markers along barley chromosomes. (A–G) 50K SNP-array data for chromosomes 1H–7H top to bottom respectively (left panel) and (H–N) GBS data chromosomes 1H–7H top to bottom respectively (right panel). SNPs are color coded according to MAF.
FIGURE 3Sliding window analysis of genetic diversity (π) (A–G) seven barley chromosomes 1H–7H respectively. Red, 50K data; Blue, GBS data.
FIGURE 4Principal coordinates analysis. (A,C,E) Shows diversity revealed by the 50K SNP-array and (B,D,F) by GBS. (A,B) Shows genotypes color coded according to geographical origin, (C,D) according to growth habit and (E,F) according to row-type.
FIGURE 5Genome-wide association scans (GWASs) of phenological traits segregating in the 1000 core population. (A,C,E) Use the 50K array data. (B,D,F) Use GBS data. (A,B) Data for row-type, (C,D) non-adhering hull, and (E,F) seasonal growth habit. Horizontal red line = –log10(5e-8), Horizontal Blue line = –log10(1e-5). The location of known genes associated with each trait is indicated. Y-axis; −log10(p) values.
Genetic resolution of GWAS for row-type, growth habit, and hull adherence.
| Traits | Associate SNP | Chr | Position | Technology | Gene | Minimum distance in bp |
|---|---|---|---|---|---|---|
| Row- type | JHI-Hv50k-2016-107445 | 2H | 651,372,755 | 50K | VRS1 | |
| 2:651372029 | 2H | 651,372,029 | GBS | |||
| JHI-Hv50k-2016-231001 | 4H | 17,377,068 | 50K | INT-C | ||
| 4:17598761 | 4H | 17,598,761 | GBS | |||
| Growth habit | JHI-Hv50k-2016-335893 | 5H | 598,787,735 | 50K | VrnH1 | |
| 5:648520473 | 5H | 648,520,473 | GBS | |||
| Grain Hull | JHI-Hv50k-2016-491472 | 7H | 546,632,335 | 50K | NUD | |
| 7:548419008 | 7H | 548,419,008 | GBS |