| Literature DB >> 29467780 |
Qian You1,2, Xiping Yang2, Ze Peng2, Liping Xu1, Jianping Wang2,3,4.
Abstract
Polypoid species play significant roles in agriculture and food production. Many crop species are polyploid, such as potato, wheat, strawberry, and sugarcane. Genotyping has been a daunting task for genetic studies of polyploid crops, which lags far behind the diploid crop species. Single nucleotide polymorphism (SNP) array is considered to be one of, high-throughput, relatively cost-efficient and automated genotyping approaches. However, there are significant challenges for SNP identification in complex, polyploid genomes, which has seriously slowed SNP discovery and array development in polyploid species. Ploidy is a significant factor impacting SNP qualities and validation rates of SNP markers in SNP arrays, which has been proven to be a very important tool for genetic studies and molecular breeding. In this review, we (1) discussed the pros and cons of SNP array in general for high throughput genotyping, (2) presented the challenges of and solutions to SNP calling in polyploid species, (3) summarized the SNP selection criteria and considerations of SNP array design for polyploid species, (4) illustrated SNP array applications in several different polyploid crop species, then (5) discussed challenges, available software, and their accuracy comparisons for genotype calling based on SNP array data in polyploids, and finally (6) provided a series of SNP array design and genotype calling recommendations. This review presents a complete overview of SNP array development and applications in polypoid crops, which will benefit the research in molecular breeding and genetics of crops with complex genomes.Entities:
Keywords: SNP; SNP array; genotype calling; high throughput genotyping; polyploidy
Year: 2018 PMID: 29467780 PMCID: PMC5808122 DOI: 10.3389/fpls.2018.00104
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Summary of SNP arrays developed in polyploid crops.
| Peanut (allo-tetraploid, 2 | 163K SNPs from DNA re-sequencing of 38 accessions and RNA-sequencing of 3 accessions | 58K (58,233) Axiom (Affymetrix) | 297 accessions from 48 countries, including 36 wide species | 44,424 (73.3%) polymorphism | Genetic diversity and genetic architecture | Pandey et al., |
| Potato tetraploid (auto-tetraploid, 2 | Two million SNPs from RNA-seq of 3 commercial cultivars and 8K SNPs from Sanger EST of 3 additional cultivars | 96 Illumina BeadXpress (Illumina) | 248 lines | 82 (85.4%) reliably scored | Genetic diversity analysis (population structure) | Hamilton et al., |
| Potato tetraploid (auto-tetraploid, 2 | 69K high confidence SNPs from previous identification (Hamilton et al., | 8K (8,303) Infinium (Illumina) | 184 progeny, 92 from population D84 and 92 from population DRH | Over 4,400 (53.0%) markers were mapped | Development of linkage maps | Felcher et al., |
| Potato tetraploid (auto-tetraploid, 2 | 20K SNPs from previous identification (Hamilton et al., | 18K (17,987) Infinium (Illumina) | 569 accessions, including 537 tetraploids and 32 diploids | 14,530 (80.8%) successfully scored with fitTetra | Reconstruction of the breeding history, shaping the genetic composition | Vos et al., |
| Cotton (allo-tetraploid, 2 | 50K SNPs from 9 intra-specific data sets, and 20K SNPs from 4 inter-specific data sets (11 previous studies and 2 unpublished studies) | 63K (63,058) Infinium (Illumina) | 1,156 individual samples | 38,822 (61.6%) polymorphic markers | Development of high density genetic map | Hulse-Kemp et al., |
| Alfalfa (auto-tetraploid, 2 | 900K SNPs from RNA-sequence of 27 alfalfa genotypes (including 23 tetraploid and 4 diploid) by previous reported (Li et al., | 9K (9,277) Infinium (Illumina) | 280 diverse genotypes including related species | 7,476 (81%) polymorphic markers | Evaluation population structure and linkage disequilibrium | Li et al., |
| Brassica (allo-tetraploid, 2 | 54K SNPs identified and evaluated by previous reports (Bus et al., | 52K (52,157) Infinium (Illumina) | 437 diverse genotypes (432 diverse genotypes were generated independently in two laboratories) | About 60% genome-specific markers | Genetic map generation | Clarke et al., |
| Wheat (allo-hexaploid, 2 | 25K SNPs from RNA-sequence of 26 hexaploid accessions | 9K (9,000) Infinium (Illumina) | 2,994 hexaploid accessions, including landraces and modern cultivars | 7,733 (85.9%) successfully genotyped | Genetic diversity and population structure, selection scans | Cavanagh et al., |
| Wheat (allo-hexaploid, 2 | 128K SNPs from RNA-sequencing of 19 hexaploid and 18 tetraploid accessions | 90K (91,829) Infinium (Illumina) | 646 accesions, including 55 tetraploid cultivars, 447 hexaploid cultivars, and 144 hexaploid landraces | 81,587 (89%) produced functional assays | Characterization of genomic diversity | Wang et al., |
| Wheat (allo-hexaploid, 2 | 921K SNPs from Exom sequencing of 14 diploid, 5 tetraploid, 23 hexaploid, and 1 decaploid accessions | 820K (819,571) Axiom (Affymetrix) | 475 accessions, including diploid, tetraploid, and hexaploid wheat accessions and wheat relatives | 546,299 (66.7%) polymorphic SNPs | Physical and genetic mapping, genetic characterization | Winfield et al., |
| Wheat (allo-hexaploid, 2 | 35K SNPs from previous study (Winfield et al., | 35K (35,143) Axiom (Affymetrix) | 1,843 DNA samples, including 1,779 unique hexaploid wheat accessions and 64 replicates | 33,326 (94.8%) polymorphic SNPs | Genetic mapping, and characterization of genetic diversity | Allen et al., |
| Oat (allo-hexaploid, 2 | 11K high-confidence SNPs from RNA-sequencing of 20 genotypes (Oliver et al., | 3K (3,072) GoldenGate (Illumina) | 390 recombinant inbred lines | 1,311 (42.7%, success rate) robust markers | Development of physically anchored consensus map | Oliver et al., |
| Oat (allo-hexaploid, 2 | 8K SNPs from 4 DNA sequence data sets. (Tinker et al., | 6K (5,743) Infinium (Illumina) | 1,110 hexaploid samples, including 109 diverse cultivars, 390 progeny, and 595 breeding lines | 4,975 (86.6%) SNPs produced successfully assays | SNP discovery and annotation, population genetic characteristics | Tinker et al., |
| Strawberry (allo-octploid, 2 | 160K di-allelic SNPs from DNA-sequencing of 19 octoploid and 6 diploid strawberry accessions | 90K (95,062) Axiom (Affymetrix) | 384 samples, including 357 octoploid accessions and cultivars, 4 diploid accessions, and 23 progeny. | 60,473 (64%) polymorphic SNPs, including 23,355 (24.6%, success rate) markers in | High density linkage maps, QTL identification | Bassil et al., |
| Sugarcane (auto-dodecaploid, 2 | 2.6M SNPs from target gene-rich regions sequencing of 16 lines | 345K (345,704) Axiom (Affymetrix) | 367 clones, including parental clones, cultivars, and unselected families | 48,802 (14.1%) validated polymorphic markers and 11,443 (3.3%, success rate) markers in | Association analysis of cane yield and sugar content, genetic mapping | Aitken et al., |
PHR, Poly High Resolution, which were polymorphic and passed all quality control (Bassil et al., .
Figure 1Flowcharts of genotype calling for Affymetrix and Illumina platforms. (A) The flowchart of executing genotype calling mainly based on Best Practices Genotyping Workflow on the Affymetrix Axiom SNP array platform. (B) The flowchart of executing genotype calling based on Illumina Infinium SNP array platform. GenCall score: a quality metric calculated for each genotype (data point), and ranges from 0 to 1; 10% GenCall score: 10th percentile GenCall score for all samples.