| Literature DB >> 22363718 |
David Chagné1, Ross N Crowhurst, Michela Troggio, Mark W Davey, Barbara Gilmore, Cindy Lawley, Stijn Vanderzande, Roger P Hellens, Satish Kumar, Alessandro Cestaro, Riccardo Velasco, Dorrie Main, Jasper D Rees, Amy Iezzoni, Todd Mockler, Larry Wilhelm, Eric Van de Weg, Susan E Gardiner, Nahla Bassil, Cameron Peace.
Abstract
As high-throughput genetic marker screening systems are essential for a range of genetics studies and plant breeding applications, the International RosBREED SNP Consortium (IRSC) has utilized the Illumina Infinium® II system to develop a medium- to high-throughput SNP screening tool for genome-wide evaluation of allelic variation in apple (Malus×domestica) breeding germplasm. For genome-wide SNP discovery, 27 apple cultivars were chosen to represent worldwide breeding germplasm and re-sequenced at low coverage with the Illumina Genome Analyzer II. Following alignment of these sequences to the whole genome sequence of 'Golden Delicious', SNPs were identified using SoapSNP. A total of 2,113,120 SNPs were detected, corresponding to one SNP to every 288 bp of the genome. The Illumina GoldenGate® assay was then used to validate a subset of 144 SNPs with a range of characteristics, using a set of 160 apple accessions. This validation assay enabled fine-tuning of the final subset of SNPs for the Illumina Infinium® II system. The set of stringent filtering criteria developed allowed choice of a set of SNPs that not only exhibited an even distribution across the apple genome and a range of minor allele frequencies to ensure utility across germplasm, but also were located in putative exonic regions to maximize genotyping success rate. A total of 7867 apple SNPs was established for the IRSC apple 8K SNP array v1, of which 5554 were polymorphic after evaluation in segregating families and a germplasm collection. This publicly available genomics resource will provide an unprecedented resolution of SNP haplotypes, which will enable marker-locus-trait association discovery, description of the genetic architecture of quantitative traits, investigation of genetic variation (neutral and functional), and genomic selection in apple.Entities:
Mesh:
Year: 2012 PMID: 22363718 PMCID: PMC3283661 DOI: 10.1371/journal.pone.0031745
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Workflow for single nucleotide polymorphism (SNP) detection, validation, and final choice employed for development of the IRSC apple 8K SNP array v1.
GDsnp: ‘Golden Delicious’-validated SNP; RosCOS: Rosaceae Conserved Orthologous Set; MAF: Minor Allele Frequency; ADT: Assay Design Tool.
Apple cultivars used for low coverage re-sequencing and subsequent single nucleotide polymorphism detection.
| Accession | Number of reads | Type of reads | Estimated genome coverage (X) | Location of sequencing |
| ‘Braeburn’ | 34,699,040 | Paired-end | 6.9 | PFR |
| ‘Co-op 15’ | 51,864,610 | Single-end | 5.2 | ARC |
| ‘Cox's Orange Pippin’ | 35,120,961 | Paired-end | 7.0 | PFR & ARC |
| ‘Crimson Crisp’ | 6,128,575 | Paired-end | 1.3 | RosBREED |
| ‘Cripps Pink’ | 56,738,286 | Single-end | 6.1 | ARC |
| ‘Delicious’ | 17,995,052 | Paired-end | 1.9 | RosBREED |
| ‘Dolgo’ | 59,863,282 | Single-end | 6.4 | ARC |
| ‘Duchess of Oldenburg’ | 17,762,643 | Paired-end | 3.8 | RosBREED |
| F226829-2-2 | 31,580,732 | Paired-end | 6.7 | RosBREED |
| ‘Frostbite’ | 44,740,552 | Single-end | 4.5 | ARC |
| ‘Fuji’ | 44,545,764 | Single-end | 4.8 | ARC |
| ‘Geneva’ | 31,977,525 | Paired-end | 6.4 | PFR |
| ‘Golden Delicious’ | 22,101,159 | Paired-end | 4.7 | RosBREED |
| ‘Granny Smith’ | 36,112,204 | Paired-end | 7.2 | PFR |
| ‘Haralson’ | 15,499,148 | Paired-end | 3.3 | RosBREED |
| ‘Honeycrisp’ | 20,887,451 | Paired-end | 4.5 | RosBREED |
| ‘Idared’ | 33,479,444 | Paired-end | 6.7 | PFR |
| ‘James Grieve’ | 26,005,304 | Paired-end | 5.2 | PFR |
| ‘Jonathan’ | 18,812,375 | Paired-end | 4.0 | RosBREED |
| ‘McIntosh’ | 59,934,507 | Paired-end | 12.8 | RosBREED |
|
| 23,500,375 | Paired-end | 4.7 | USDA-ARS |
| ‘Ralls Janet’ | 35,485,026 | Paired-end | 7.1 | PFR |
| ‘Red Delicious’ | 25,993,982 | Paired-end | 5.2 | PFR |
| ‘Red Dougherty’ | 32,718,666 | Paired-end | 6.5 | PFR |
| ‘Rome Beauty’ | 23,220,046 | Paired-end | 5.0 | RosBREED |
| ‘Splendour’ | 36,353,918 | Paired-end | 7.3 | PFR |
| ‘Zestar’ | 54,404,792 | Single-end | 5.4 | ARC |
An approximate genome coverage was estimated for each accession using a genome size of 750 Mb. The source of cultivars and numbers of Illumina GA II reads obtained are indicated. ARC: Agricultural Research Council, South Africa; PFR: The New Zealand Institute for Plant & Food Research Ltd; RosBREED: U.S.-based international project.
Figure 2Classification of the 144 apple single nucleotide polymorphisms (SNPs) used for validation using the Illumina GoldenGate® assay.
Single nucleotide polymorphisms (SNPs) detected across the apple genome by re-sequencing 27 apple accessions.
| Chromosome | Sequence used for SNP detection (bp) | SNPs examined from | SNPs passing “Stage 1” filtering | Average distance between SNPs (bp) | Exonic SNPs |
| 1 | 36,084,648 | 556,642 | 102,825 | 350 | 31,678 |
| 2 | 40,172,783 | 793,583 | 144,715 | 277 | 42,496 |
| 3 | 39,907,579 | 700,701 | 134,595 | 296 | 36,005 |
| 4 | 25,411,901 | 474,397 | 91,212 | 278 | 27,540 |
| 5 | 37,603,833 | 661,644 | 134,672 | 279 | 38,400 |
| 6 | 30,670,413 | 514,972 | 97,014 | 316 | 29,450 |
| 7 | 31,181,013 | 546,126 | 117,652 | 265 | 33,155 |
| 8 | 35,800,717 | 625,020 | 114,173 | 313 | 31,946 |
| 9 | 37,514,065 | 737,124 | 146,472 | 256 | 41,935 |
| 10 | 38,388,612 | 687,313 | 146,102 | 262 | 43,522 |
| 11 | 40,097,013 | 751,789 | 146,718 | 273 | 42,681 |
| 12 | 36,276,268 | 664,326 | 121,739 | 297 | 35,456 |
| 13 | 39,686,055 | 720,281 | 158,912 | 249 | 43,699 |
| 14 | 34,156,235 | 607,215 | 114,259 | 298 | 32,712 |
| 15 | 55,775,419 | 919,506 | 159,368 | 349 | 46,218 |
| 16 | 23,462,870 | 421,197 | 85,530 | 274 | 24,357 |
| 17 | 27,122,502 | 533,920 | 97,162 | 279 | 30,349 |
| Total | 609,311,926 | 10,915,756 | 2,113,120 | 288 | 611,599 |
Results from a GoldenGate® assay of 144 single nucleotide polymorphisms (SNPs) screened over 160 apple accessions (Table S1).
| Proportion of SNPs | |||||
| SNP type | Total | Failed | Mono-morphic | Polymorphic (MAF<0.05) | Polymorphic (MAF>0.05) |
| Evenly spread | 100 | 0.32 | 0.08 | 0.04 | 0.56 |
| Accession-specific | 28 | 0.25 | 0.11 | 0.39 | 0.25 |
| Candidate genes | 27 | 0.22 | 0.11 | 0.04 | 0.63 |
|
| 20 | 0.40 | 0.00 | 0.05 | 0.55 |
| GDsnp | 8 | 0.00 | 0.00 | 0.00 | 1.00 |
| ADT score <0.8 | 40 | 0.35 | 0.00 | 0.05 | 0.60 |
| ADT score 0.8–0.9 | 35 | 0.40 | 0.06 | 0.03 | 0.51 |
| ADT score <0.9 | 69 | 0.26 | 0.12 | 0.17 | 0.45 |
| Mean ADT score (s.d.) | 0.85 (0.13) | 0.83 (0.12) | 0.94 (0.04) | 0.92 (0.10) | 0.83 (0.14) |
| Total | 144 | 0.32 | 0.07 | 0.10 | 0.51 |
Figure 3Detailed view of a genomic region at the top of Linkage Group 1 showing SNPs chosen for the International RosBREED SNP Consortium (IRSC) apple 8K SNP array v1.
Physical map locations of SNPs (left; in megabases) are compared with known genetic map locations of SNP markers developed from ‘Golden Delicious’ (GDsnp in centiMorgans). SNP clusters around focal points are boxed, with green boxes denoting GDsnps, blue boxes denoting Rosaceae Conserved Orthologous Set markers (RosCOS), and orange boxes denoting candidate genes.
Apple Infinium® II array v1 content and evaluation in the Plant & Food Research dataset. The number of attempted SNPs and clusters is indicated per linkage group (LG), as well as their density based on the apple genome assembly in kilobases (Kb) and the ‘Golden Delicious’ genetic map (cM).
| Attempted SNPs on IRSC Apple Infinium II array v1 | Array evaluation in the Plant & Food Research dataset | ||||||||||
| LG | No. of SNPs | No. of clusters | Average physical distance between focal points (kb) | Average genetic distance between focal points (cM) | No. of successful beadtypes | No. of poly-morphic SNPs | % poly-morphic SNPs | No. of poly-morphic SNPs with MAF>0.05 | MAF>0.05, 50 GC>0.4, call rate >0.95 | No. of poly-morphic clusters | % poly-morphic clusters |
| 1 | 443 | 81 | 443.7 | 1.10 | 434 | 299 | 68.9 | 250 | 180 | 75 | 92.6 |
| 2 | 695 | 113 | 355.1 | 0.75 | 684 | 519 | 75.9 | 436 | 313 | 108 | 95.6 |
| 3 | 499 | 74 | 538.3 | 1.19 | 487 | 350 | 71.9 | 288 | 212 | 72 | 97.3 |
| 4 | 392 | 65 | 388.7 | 1.00 | 386 | 292 | 75.6 | 235 | 201 | 63 | 96.9 |
| 5 | 492 | 82 | 456.9 | 1.10 | 486 | 360 | 74.1 | 308 | 239 | 76 | 92.7 |
| 6 | 351 | 64 | 473.2 | 1.17 | 340 | 233 | 68.5 | 194 | 117 | 63 | 98.4 |
| 7 | 345 | 71 | 438.5 | 0.83 | 340 | 215 | 63.2 | 181 | 129 | 62 | 87.3 |
| 8 | 405 | 68 | 519.9 | 1.07 | 399 | 276 | 69.2 | 229 | 182 | 62 | 91.2 |
| 9 | 490 | 80 | 449.2 | 0.91 | 477 | 364 | 76.3 | 311 | 232 | 76 | 95.0 |
| 10 | 554 | 83 | 446.6 | 1.12 | 531 | 377 | 71.0 | 271 | 224 | 79 | 95.2 |
| 11 | 466 | 80 | 500.7 | 0.97 | 456 | 322 | 70.6 | 277 | 177 | 73 | 91.3 |
| 12 | 469 | 85 | 426.5 | 0.87 | 459 | 338 | 73.6 | 258 | 210 | 81 | 95.3 |
| 13 | 433 | 81 | 489.6 | 0.94 | 423 | 299 | 70.7 | 229 | 178 | 76 | 93.8 |
| 14 | 388 | 72 | 472.2 | 1.02 | 374 | 266 | 71.1 | 215 | 151 | 64 | 88.9 |
| 15 | 629 | 106 | 525.6 | 1.09 | 621 | 440 | 70.9 | 365 | 253 | 101 | 95.3 |
| 16 | 355 | 74 | 316.2 | 0.76 | 347 | 271 | 78.1 | 215 | 158 | 69 | 93.2 |
| 17 | 461 | 76 | 356.7 | 0.99 | 448 | 333 | 74.3 | 231 | 215 | 70 | 92.1 |
| Total | 7867 | 1355 | 446.2 | 1.00 | 7692 | 5554 | 72.2 | 4493 | 3371 | 1270 | 93.7 |
The Apple Infinium® array was evaluated by screening 1398 progeny in 8 segregating populations and 117 individuals from the Plant & Food Research apple germplasm collection.
Figure 4Genetic maps of linkage group (LG) 1 constructed using the ‘Royal Gala’בGranny Smith’ segregating population.
Marker names indicate the type of SNP, physical location on the ‘Golden Delicious’ genome assembly (in base pair), cluster marker, minor allelic frequency (MAF) in the 27 re-sequenced accessions, gene model, position in the predicted gene model, and NCBI dbSNP accession number (in parentheses).
Figure 5Example of the usefulness of the International RosBREED SNP Consortium (IRSC) apple 8K SNP array v1 for developing haplotypes.
SNP markers were the same as represented in the example of Figure 3. Haplotypes inferred using FlexQTL™ for each cluster of SNPs are numbered from H1 to H4). Haplotypes inherited by the progeny from the parents are boxed and colored coded for each parent. A putative recombination is indicated by a cross for the ‘(Royal) Gala’בSplendour’−>‘Sciros’ trio.