| Literature DB >> 31956447 |
Ahmad M Alqudah1, Ahmed Sallam2, P Stephen Baenziger3, Andreas Börner1.
Abstract
Understanding the genetic complexity of traits is an important objective of small grain temperate cereals yield and adaptation improvements. Bi-parental quantitative trait loci (QTL) linkage mapping is a powerful method to identify genetic regions that co-segregate in the trait of interest within the research population. However, recently, association or linkage disequilibrium (LD) mapping using a genome-wide association study (GWAS) became an approach for unraveling the molecular genetic basis underlying the natural phenotypic variation. Many causative allele(s)/loci have been identified using the power of this approach which had not been detected in QTL mapping populations. In barley (Hordeum vulgare L.), GWAS has been successfully applied to define the causative allele(s)/loci which can be used in the breeding crop for adaptation and yield improvement. This promising approach represents a tremendous step forward in genetic analysis and undoubtedly proved it is a valuable tool in the identification of candidate genes. In this review, we describe the recently used approach for genetic analyses (linkage mapping or association mapping), and then provide the basic genetic and statistical concepts of GWAS, and subsequently highlight the genetic discoveries using GWAS. The review explained how the candidate gene(s) can be detected using state-of-art bioinformatic tools.Entities:
Keywords: Association mapping; Barley breeding, GWAS; Gene identification; Hordeum vulgare L; QTL mapping
Year: 2019 PMID: 31956447 PMCID: PMC6961222 DOI: 10.1016/j.jare.2019.10.013
Source DB: PubMed Journal: J Adv Res ISSN: 2090-1224 Impact factor: 10.479
The main advantages and limitations of QTL mapping and GWAS mapping approach.
| Advantages | limitations | Advantages | limitations |
|---|---|---|---|
| Bi-parental crosses | Contrasting and crossable parents and multiple generations required to develop pedigrees | No parents or crossing | Population structure effect with spurious relatedness |
| Fewer markers required | A limited number of genotypes based on the success of crossing | Unlimited number of contrasting accessions | The high number of individuals are required |
| Low allele richness | Assumes dense markers with high allele richness | Low allele frequency | |
| Expecting the segregating trait(s) | Lower resolution based upon the number of recombination | Large numbers of phenotypic variation | Misleading natural variation |
| More robust in heterogeneity | Narrower genetic base | Highly dense map | Low heritability value |
| Less prone to false positives | Markers are usually sparse due to the recombination | Higher resolution and tests at marker positions | Many more markers required |
| Tests between markers | Type I or II error (false positive association) | ||
Fig. 1Visualization of population structure and number of subpopulations within the population. No clear population structure (a), whereas the population was well-structured (b). Log probability data as function of k (number of clusters/subpopulations) from the STRUCTURE run. No number of subpopulations (c), while two subpopulations are shown in (d). Each color in (a and b) represents a subgroup and each dot represents an accession/individual. PCA, principal component analysis.
Fig. 2The most important three stages for performing a successful GWAS experiment. Stage I: Phenotyping, stage II: Genotyping and stage III: Genome-wide association study including statistical models, multiple-testing analyses, and software/packages for QTL and gene identification.
Candidate-gene based GWAS which has been validated and cloned.
| Population | Sample size | Growth habit | Population structure | Model | Marker info | Phenotype | Software | Candidate gene | Validation | Ref. |
|---|---|---|---|---|---|---|---|---|---|---|
| Genobar | 224 | Spring | Row-type; photoperiod responses | MLM | 9 K SNP | Tillering, plant height | GENSTAT | Mutant analysis | ||
| Leaf area | Molecular, transcriptome, histological analyses | |||||||||
| Phase duration and development | The haplotype of specific gene-derived markers | |||||||||
| Agronomic traits | Map-based cloning | |||||||||
| European barley | 138 | Winter | Row-type | MLM | 9 K SNP | Leaf size | GAPIT-R | Histological analysis | ||
| UK cultivars | 500 | Spring, | Row-type; seasonal | MLM | 1.5 K SNP | Auricle, awn, spike, rachis, spikelet, and grain-related traits | GENSTAT | Re-sequencing & Cloning | ||
| Western Europe and North America | 190 | Row-type; seasonal | MLM | 2.5 K SNP | Spikelet fertility, spike architecture and tillering | GENSTAT | Re-sequencing & Cloning | |||
| UK cultivars | 401 | Spring | MLM | 5.2 K | Spike density-related traits | GENSTAT | Re-sequencing & Cloning | |||
| European barley | 804 | Spring, | seasonal | MLM | 9 K SNP | Heading date and agronomic traits | GENSTAT | Re-sequencing & Cloning | ||
| USDA | 2,671 | MLM | 9 K SNP | Salt Tolerance | GAPIT-R | Re-sequencing & Cloning |
Fig. 3The output results of GWAS. Manhattan plot (a). Horizontal-axis represents the position of markers over the barley chromosomes and vertical-axis represents -log10(P-values) of the marker-trait association. Each dot denotes marker. Horizontal blue-line represents threshold of -log10(0.001) and red-line represents the threshold of -log10(p-value) passing false-discovery rate (FDR). Quantile-quantile (QQ) plot of different GWAS models (b). The plot shows the expected vs. observed -log10(p-value) of each marker (dote). Red-line is the standered relationship among markers. General linear models (GLM), mixed linear models (MLM) and compressed MLM (CMLM). (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)
Fig. 4Summary of the most important genes distributed over barley chromosomes, which are involved in developmental and agronomic traits.
The most significant associated genomic regions with quantitative traits in barley using a GWAS approach.
| Reference panel | Sample size | Marker info | Phenotype | Chr. (pos. (cM)) | Software | Ref. |
|---|---|---|---|---|---|---|
| Genobar | 224 | 1.5 K SNP | Heading date, | 2H (41–52), 3H (8–9), 5H (100–108), 6H (28, 60, 125), 7H (1 0 4) | TASSEL | |
| 9 K SNP | Phase transition, developmental stages, tillering, plant height, leaf area | 1H (3–8, 95.9–97.9), 2H (50.9–56.4, 82–88, 141–147), 3H (56–64, 122–127), 5H (2.6–9.3, 21.3–24.6, 31.7–34.1, 83–86), 6H (16.9–24.6) | GENSTAT | |||
| Germination and seedling shoot and root architecture traits | 1H (76–48), 2H (112–115), 5H (44–45) | |||||
| European cultivars | 183 | 253 DArT & 22 SSR | TGW, glume fineness, extract and friability | 1H (116–123), | TASSEL | |
| German winter | 106 | 1,169 DArT | Grain yield, TGW, | 2H (47–48), 3H (51–53), 6H (46–47, 142–143), 7H (1–5) | TASSEL | |
| Barley Germplasm | 185 | 710 DArT, 61 SNP and 45 SSR | Drought tolerance related traits (Grain yield, TGW, peduncle, leaf, and spike) | 3H (1 5 3), 5H (139–150) | GENSTAT | |
| European cultivars | 174 | 839 DArT | Grain yield, TGW, | 1H (10–12, 94–96), 2H (133–136), 5H (13–15) | GENSTAT | |
| Worldwide | 206 | 408 DArT | Salinity tolerance | 2H (3.5), 4H (1 4 5), 5H (43.5) | TASSEL | |
| UK cultivars | 500 | 1.5 SNP | Auricle, awn, spike, rachis, spikelet and grain-related traits | 2H (82–90), 4H (106–119), 5H (100–113, 119–130) | GENSTAT | |
| Pan-European Barley Cultivar Collection | 379 | 9 K SNP | Grain yield-associated traits | 2H (110–115, 145–155), 3H (95–100), 5H (160–170) | GENSTAT | |
| Jordanian landraces | 150 | 9 K SNP | Harvest index & Spikelet number per spike | 2H (106–107), 7H (1.6–15) | TASSEL | |
| NAM | 1,420 | 9 K SNP | Leaf sheath hairiness. | 4H (111.3) | SAS | |
| Salinity tolerance | 2H (140–145) | |||||
| Heading time and yield related-traits | 4H (3–4, 110–114) | |||||
| Kazakhstan collection | 92 | 9 K SNP | Stem rust resistance | 3H (131–136), 6H (63–64) | TASSEL | |
| EcoSeed | 184 | 9 K SNP | Seed dormancy and pre-harvest sprouting | 1H (5), 3H (104.3, 135.6), 5H (169.4) | GENSTAT | |
| Modern European cultivars | 148 | 407 SSR | Spike length, plant height and grain number | 1H (64–65), 2H (3–4, 14–15), 3H (126–127), 5H (86–87, 130–131), 6H (44–45, 95–96) | TASSEL | |
| Drought tolerance collection | 109 | 5,153 DArT | Water use efficiency, water content and relative water content | 2H (118–119), 3H (24–25), 4H (49–55), 5H (48–49, 147–148) | TASSEL |