| Literature DB >> 32587600 |
Stefano Pavan1,2, Chiara Delvento1, Luigi Ricciardi1, Concetta Lotti3, Elena Ciani4, Nunzio D'Agostino5.
Abstract
High-throughput genotyping boosts genome-wide association studies (GWAS) in crop species, leading to the identification of single-nucleotide polymorphisms (SNPs) associated with economically important traits. Choosing a cost-effective genotyping method for crop GWAS requires careful examination of several aspects, namely, the purpose and the scale of the study, crop-specific genomic features, and technical and economic matters associated with each genotyping option. Once genotypic data have been obtained, quality control (QC) procedures must be applied to avoid bias and false signals in genotype-phenotype association tests. QC for human GWAS has been extensively reviewed; however, QC for crop GWAS may require different actions, depending on the GWAS population type. Here, we review most popular genotyping methods based on next-generation sequencing (NGS) and array hybridization, and report observations that should guide the investigator in the choice of the genotyping method for crop GWAS. We provide recommendations to perform QC in crop species, and deliver an overview of bioinformatics tools that can be used to accomplish all needed tasks. Overall, this work aims to provide guidelines to harmonize those procedures leading to SNP datasets ready for crop GWAS.Entities:
Keywords: GWAS; bioinformatics tools; crops; genotyping; quality control
Year: 2020 PMID: 32587600 PMCID: PMC7299185 DOI: 10.3389/fgene.2020.00447
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
List of some genomic and economic aspects that should be taken into consideration when planning GWAS in crops.
| 0.49 | 800 Kb ( | 980 (subgenome A) 143 (subgenome B) | 19.4 K | International Brassica SNP Consortium | Illumina Infinium BeadChip | 52K | ||||
| 0.90 | 665 Kb ( | 1353 | 36K | SolCAP Tomato 2013 | Illumina Infinium BeadChip | 9K | ||||
| Axiom Tomato Genotyping Array | Affymetrix Axiom | 52K | Unpublished | |||||||
| 0.84 | 1.5–0.6 Mb ( | 560–14,000 | 33.6K | SOLCAP Potato 2013 | Illumina Infinium BeadChip | 10K | ||||
| SolSTW array | Affymetrix Axiom | 20K | ||||||||
| 3.30 | 100 Kb ( | 33,000 | 132K | UCD TraitGenetics Pepper (Capsicum) Consortium | Illumina Infinium BeadChip | 19K | ||||
| Pepper (Capsicum) SNP Genotyping Array | Affymetrix Axiom | 640K | Unpublished | |||||||
| 0.35 | 24 Kb ( | 14,583 | 14K | – | Fluidigm | 35K | ||||
| 55–140.5 Kb ( | 6364–2491 | |||||||||
| 0.45 | 100 Kb ( | 4500 | 18K | |||||||
| 72–774 Kb ( | 6250–581 | |||||||||
| 0.59 | 1 Mb ( | 587 | 23.48K | BARCBean6K_1 | Illumina Infinium BeadChip | 5K | ||||
| 1.12 | 8.5–15.5 Mb ( | 131–72 | 44.6K | SoySNP50K | Illumina Infinium BeadChip | 6K | ||||
| 5.9–7 Mb ( | 189–159 | SoyaSNP180K Axiom | Affymetrix Axiom | 180K | ||||||
| 0.47 | 100–400 bp ( | 4,730,000–1,182,500 | 18.92K | |||||||
| 0.39 | 150 Kb ( | 2593 | 15.56K | RiceSNP50 | Illumina Infinium BeadChip | 50K | ||||
| RICE6K | Illumina Infinium BeadChip | 6K | ||||||||
| Axiom Rice Genotyping Array | Affymetrix Axiom | 50K | ||||||||
| 16.00 | 8 Mb ( | 2000 | 640K | US/Australia 9K Wheat Consortium | Illumina Infinium BeadChip | 9K | ||||
| Wheat 90K iSelect | Illumina Infinium BeadChip | 90K | ||||||||
| Axiom Wheat Breeders Genotyping Array | Affymetrix Axiom | 35K | ||||||||
| Axiom Wheat HD Genotyping Arrays | Affymetrix Axiom | 817K | ||||||||
| 2.50 | 6.34 Kb ( | 394,322 | 100K | MaizeSNP50 BeadChip | Illumina Infinium BeadChip | 50K | ||||
| 500 bp ( | 5,000,000 | Subset of MaizeSNP50 | Illumina Infinium BeadChip | 3K | ||||||
| 1.5 Kb ( | 1,666,667 | Axiom Maize Genotyping Array | Affymetrix Axiom | 600K | ||||||
| Maize 55K Axiom | Affymetrix Axiom | 55K | ||||||||
| 0.74 | 200 bp ( | 7,420,000 | 29.68K | RosBREED Apple | Illumina Infinium BeadChip | 8K | ||||
| Fruitbreedomics Apple20k | Illumina Infinium BeadChip | 20K | ||||||||
| Axiom Apple Genotyping Array | Affymetrix Axiom | 480K | ||||||||
| 0.27 | 1.2–3.2 Mb ( | 221–83 | 10.6K | RosBREEDPeach | Illumina Infinium BeadChip | 9K | ||||
| 0.48 | 43 Kb ( | 11047 | 19K | GrapeReSeq Consortium | Illumina Infinium BeadChip | 20K | ||||
| GeneChip | Applied Biosystems | 15K | Unpublished | |||||||
| 1.46 | 25 bp ( | 58,400,000 | 58.4K | |||||||
| 2.43 | 3.2–3.3 Mb ( | 759–736 | 97.2K | International Cotton SNP Consortium | Illumina Infinium BeadChip | 70K | ||||
| 900 Kb ( | 2700 | Axiom Cotton Genotyping Array | Affymetrix Axiom | 35K | Unpublished | |||||
FIGURE 1Overview of quality control procedures for crop GWAS. These include: filtering steps that are common to any GWAS experiment; filtering steps depending on the GWAS population structure (homozygous or heterozygous); the removal of duplicated samples; the characterization of ancestral relationships, starting from a SNP dataset pruned for markers in linkage disequilibrium.
FIGURE 2Frequency distribution analysis to define filtering solutions for (A) SNP call rate; (B) genotype call rate; (C) SNP inbreeding coefficient (FIT); (D) SNP proportion of heterozygosity. Dashed lines indicate possible filtering thresholds, based on classes occurring at suspiciously low (A,B) or high (D) frequency, and distribution gaps (C). Genotypic data used to build histograms are all relative to published genotyping-by-sequencing experiments, carried out in the self-pollinated crops Cicer arietinum L. (Pavan et al., 2017, A,C) and Lens culinaris Medik (Pavan et al., 2019, B), and the open-pollinated crop Cynara cardunculus L. (Pavan et al., 2018, D).