| Literature DB >> 27966535 |
A Maurer1, W Sannemann1, J Léon2, K Pillen1.
Abstract
The emergence of multiparental mapping populations enabled plant geneticists to gain deeper insights into the genetic architecture of major agronomic traits and to map quantitative trait loci (QTLs) controlling the expression of these traits. Although the investigated mapping populations are similar, one open question is whether genotype data should be modelled as identical by state (IBS) or identical by descent (IBD). Whereas IBS simply makes use of raw genotype scores to distinguish alleles, IBD data are derived from parental offspring information. We report on comparing IBS and IBD by applying two multiple regression models on four traits studied in the barley nested association mapping (NAM) population HEB-25. We observed that modelling parent-specific IBD genotypes produced a lower number of significant QTLs with increased prediction abilities compared with modelling IBS genotypes. However, at lower trait heritabilities the IBS model produced higher prediction abilities. We developed a method to estimate multiallelic QTL effects in multiparental populations from simple biallelic IBS data. This method is based on cumulating IBS-derived single-nucleotide polymorphism (SNP) effect estimates in a defined genetic region surrounding a QTL. Comparing the resulting parent-specific QTL effects with those obtained from IBD approaches revealed high accordance that could be confirmed through simulations. The method turned out to be also applicable to a barley multiparent advanced generation inter-cross (MAGIC) population. The 'cumulation method' represents a universal approach to differentiate parent-specific QTL effects in multiparental populations, even if no IBD information is available. In future, the method could further benefit from the availability of much denser SNP maps.Entities:
Mesh:
Year: 2016 PMID: 27966535 PMCID: PMC5520528 DOI: 10.1038/hdy.2016.121
Source DB: PubMed Journal: Heredity (Edinb) ISSN: 0018-067X Impact factor: 3.821
Figure 1Variation of number of significant SNPs (a) and variation of prediction ability (R2val) (b) in the barley NAM population. Box plots indicate the distribution across 100 cross-validation runs. Empty and filled boxes represent models ‘IBS-M’ and ‘IBD-M × F’, respectively. Traits are separated by columns.
Comparison of mean R 2 and mean number of significant SNPs across 100 cross-validation runs, calculated for two models and four traits
| HEA | 0.62 | 0.63 | 0.85 | 0.76 | 80 | 6 |
| TGW | 0.52 | 0.31 | 0.77 | 0.40 | 74 | 2 |
| THR | 0.78 | 0.82 | 0.89 | 0.86 | 54 | 3 |
| GrCol | 0.71 | 0.85 | 0.93 | 0.95 | 80 | 5 |
| Mean | 0.66 | 0.65 | 0.86 | 0.74 | 72 | 4 |
Abbreviations: GrCol, grain colour; HEA, heading; IBD, identical by descent; IBS, identical by state; R2train, model fit of the training set; R2val, prediction ability; SNP, single-nucleotide polymorphism; TGW, thousand grain weight; THR, threshability.
Traits studied: HEA, TGW, THR and GrCo; Models applied: only marker main effects based on IBS (‘IBS-M’) and only marker-by-family effects based on IBD genotypes (‘IBD-M × F’). The last row indicates the mean across traits.
Figure 2Comparison of detection rates of significant markers across the genome between models ‘IBS-M’ and ‘IBD-M × F’ in the barley NAM population. Each of the four rows represents a different trait. The height of the peaks indicates the number of significant effects detected per SNP marker out of 100 cross-validation runs, ordered by genetic position according to the map of Maurer . Orange dots represent IBS-M, and blue dots IBD-M × F. Major QTLs detected by IBD-M × F are indicated by vertical lines. Dashed vertical lines indicate the 26 cM window used for cumulation.
Cumulation precision (r) of major QTLs located in barley NAM and MAGIC populations
| NAM | HEA | 2H-23 cM | 0.40 | 4.8 | 0.25 |
| HEA | 2H-57 cM | 0.60 | 3.2 | 0.40 | |
| HEA | 3H-107.8 cM | 0.26 | 2.6 | 0.45 | |
| HEA | 4H-3.5 cM | 0.68 | 2.7 | 0.71 | |
| HEA | 4H-113.4 cM | 0.69 | 3.9 | 1.41 | |
| HEA | 5H-125.5 cM | 0.90 | 7.7 | 0.66 | |
| HEA | 7H-34.3 cM | 0.70 | 8.1 | 0.87 | |
| TGW | 4H-14.9 cM | 0.36 | 2.8 | 0.75 | |
| TGW | 6H-49.1 cM | 0.58 | 3.6 | 0.61 | |
| THR | 1H-97.9 cM | 0.93 | 10.8 | 0.32 | |
| THR | 2H-69.3 cM | 0.75 | 5.3 | 0.43 | |
| GrCol | 1H-116.8 cM | 0.96 | 24.1 | 2.85 | |
| Correlation with cumulation precision | 0.69 | 0.44 | |||
| MAGIC | FT | QFT.MAGIC.HA-2H.a | 0.81 | 2 | 6.35 |
| FT | QFT.MAGIC.HA-3H.a | 0.26 | 1 | 6.14 | |
| FT | QFT.MAGIC.HA-3H.b | 0.51 | 2 | 6.51 | |
| FT | QFT.MAGIC.HA-4H.a | 0.37 | 3 | 3.11 | |
| FT | QFT.MAGIC.HA-5H.a | 0.42 | 2 | 2.99 | |
| FT | QFT.MAGIC.HA-5H.b | 0.89 | 1 | 6.82 | |
| FT | QFT.MAGIC.HA-7H.a | 0.98 | 4 | 6.38 | |
| Correlation with cumulation precision | 0.32 | 0.53 | |||
Abbreviations: CV, coefficient of variation; FT, flowering time; GrCol, grain colour; HEA, heading; MAGIC, multiparent advanced generation inter-cross; NAM, nested association mapping; QTL, quantitative trait locus; SNP, single-nucleotide polymorphism; TGW, thousand grain weight; THR, threshability.
NAM: barley population HEB-25 (Maurer ); MAGIC of barley (Sannemann ).
Correlation coefficient of QTL effect, obtained from ‘IBD-M × F’ (NAM) and parental allelic means, obtained from the haplotype approach (MAGIC), respectively, versus QTL effect, estimated by cumulating nearby SNP effects from ‘IBS-M’. Means: 0.67 (NAM); 0.60 (MAGIC).
Number of significant SNPs that were cumulated in an interval surrounding the QTL (NAM: 26 cM, MAGIC: 20 cM). For NAM, the mean across 100 cross-validation runs is shown.
CV of all parent-specific QTL effects obtained from model ‘IBD-M × F’ (NAM) and the haplotype approach (MAGIC), respectively.
Pearson’s correlation coefficient (r) between cumulation precision for NAM and MAGIC population, respectively.
Figure 3Scatter plots of the cumulated SNP effects from model ‘IBS-M’ against the M × F effects obtained from model ‘IBD-M × F’ for 12 major QTLs in the barley NAM population. Each dot represents the estimated QTL effect of one exotic HEB donor. A linear regression line (in orange) and Pearson’s correlation coefficients (r), indicating the cumulation precision, are given for each QTL. Axis values represent the absolute difference between exotic and cultivated (that is, Barke) QTL alleles.
Quality parameters of IBS-M and IBD-M × F in different simulation scenarios
| IBS-M | QTL detection power | 1 QTL | 1.00 | 0.48 | 0.01 |
| 3 QTLs | 0.99 | 0.78 | 0.37 | ||
| 8 QTLs | 0.91 | 0.74 | 0.45 | ||
| Mean | 0.94 | 0.72 | 0.39 | ||
| False positives | 1 QTL | 0.63 | 0.75 | 0.83 | |
| 3 QTLs | 0.54 | 0.57 | 0.73 | ||
| 8 QTLs | 0.46 | 0.48 | 0.55 | ||
| Mean | 0.55 | 0.60 | 0.70 | ||
| Prediction ability | 1 QTL | 0.65 | 0.13 | 0.03 | |
| 3 QTLs | 0.64 | 0.26 | 0.08 | ||
| 8 QTLs | 0.55 | 0.28 | 0.08 | ||
| Mean | 0.61 | 0.22 | 0.07 | ||
| IBD-M × F | QTL detection power | 1 QTL | 1.00 | 1.00 | 0.07 |
| 3 QTLs | 1.00 | 1.00 | 0.34 | ||
| 8 QTLs | 1.00 | 0.96 | 0.00 | ||
| Mean | 1.00 | 0.98 | 0.09 | ||
| False positives | 1 QTL | 0.31 | 0.00 | 0.00 | |
| 3 QTLs | 0.22 | 0.00 | 0.00 | ||
| 8 QTLs | 0.07 | 0.00 | n/a | ||
| Mean | 0.20 | 0.00 | 0.00 | ||
| Prediction ability | 1 QTL | 0.97 | 0.35 | 0.00 | |
| 3 QTLs | 0.96 | 0.54 | 0.13 | ||
| 8 QTLs | 0.95 | 0.68 | 0.00 | ||
| Mean | 0.96 | 0.52 | 0.04 |
Abbreviations: IBD-M × F, identical-by-descent marker-by-family effects; IBS-M, identical-by-state marker main effects; QTL, quantitative trait locus.
All values are averaged across 100 simulated cross-validation runs.
QTL detection power is defined as the model’s ability to precisely detect the simulated QTL within a 5 cM window surrounding the true position.
False positive associations are all detected significant associations that were outside the interval of cumulated single-nucleotide polymorphisms (SNPs; 26 cM) surrounding a QTL.
Prediction ability represents the correlation coefficient of predicted phenotypes (based on SNP effects obtained in the training set) and observed phenotypes in the test set.
Figure 4Correlation of observed and predicted flowering time of the eight MAGIC founders. Observed phenotypes are presented in Sannemann , and predicted phenotypes are based on the effect estimates of all significant SNPs obtained in model ‘IBS-M’. Founder lines are abbreviated: AB, Ack. Bavaria; AD, Ack. Danubia; B, Barke; C, Criewener 403; HF, Heils Franken; HH, Heines Hanna; PI, Pflugs Intensiv; R, Ragusa.