| Literature DB >> 32788287 |
Marcus T Brock1, Matthew J Rubin2,3,4, Dean DellaPenna5, Cynthia Weinig2,3,5,6.
Abstract
Linkage and association mapping populations are crucial public resources that facilitate the characterization of trait genetic architecture in natural and agricultural systems. We define a large nested association mapping panel (NAM) from 14 publicly available recombinant inbred line populations (RILs) of Arabidopsis thaliana, which share a common recurrent parent (Col-0). Using a genotype-by-sequencing approach (GBS), we identified single nucleotide polymorphisms (SNPs; range 563-1525 per population) and subsequently built updated linkage maps in each of the 14 RIL sets. Simulations in individual RIL populations indicate that our GBS markers have improved power to detect small effect QTL and enhanced resolution of QTL support intervals in comparison to original linkage maps. Using these robust linkage maps, we imputed a common set of publicly available parental SNPs into each RIL linkage map, generating overlapping markers across all populations. Though ultimately depending on allele frequencies at causal loci, simulations of the NAM panel suggest that surveying between 4 to 7 of the 14 RIL populations provides high resolution of the genetic architecture of complex traits, relative to a single mapping population.Entities:
Keywords: Arabidopsis thaliana; GWAS; MPP; Multiparent Advanced Generation Inter-Cross (MAGIC); NAM population; multiparental populations; recombinant inbred lines
Mesh:
Year: 2020 PMID: 32788287 PMCID: PMC7534452 DOI: 10.1534/g3.120.401239
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Metadata and metrics for each of 14 Arabidopsis thaliana recombinant inbred line populations (RIL) that share a common recurrent parent (Col-0) and form a nested association mapping population described herein. RIL populations are available at Institut National de la Recherche Agronomique (INRA; Versailles, France; http://publiclines.versailles.inra.fr)
| INRA population | Alternate parent | Population size | Total Markers | GBS Markers | Overall Length (cM) | Avg. interval (cM) | Max interval (cM) | Avg. missing data GBS | Avg. missing data post-imputation |
|---|---|---|---|---|---|---|---|---|---|
| 2RV | bla1 | 141 | 641 | 563 | 443.1 | 0.7 | 13.9 | 27.1% | 3.2% |
| 3RV | tsu0 | 145 | 1254 | 1175 | 449.8 | 0.4 | 6.4 | 23.6% | 1.4% |
| 4RV | nok1 | 144 | 1243 | 1158 | 462.6 | 0.4 | 7.1 | 23.0% | 1.1% |
| 6RV | ri0 | 144 | 1503 | 1419 | 489.5 | 0.3 | 7.5 | 16.6% | 0.8% |
| 8RV | cvi0 | 147 | 1615 | 1525 | 634.5 | 0.4 | 11.7 | 22.7% | 1.0% |
| 13RV | sha | 148 | 796 | 711 | 417.4 | 0.5 | 7.9 | 20.2% | 1.3% |
| 17RV | ge0 | 149 | 1201 | 1119 | 485 | 0.4 | 5 | 23.6% | 1.2% |
| 19RV | can0 | 148 | 1561 | 1477 | 481.5 | 0.3 | 6.1 | 26.0% | 1.2% |
| 20RV | bur0 | 142 | 1344 | 1257 | 433 | 0.3 | 5.5 | 21.5% | 1.1% |
| 21RV | blh1 | 137 | 1074 | 1000 | 485.1 | 0.5 | 6.8 | 23.3% | 1.3% |
| 23RV | yo0 | 150 | 1351 | 1269 | 477.5 | 0.4 | 7.2 | 20.6% | 0.8% |
| 27RV | oy0 | 149 | 1349 | 1264 | 488.7 | 0.4 | 9.7 | 19.7% | 1.1% |
| 28RV | jea | 147 | 1435 | 1348 | 482.2 | 0.3 | 6.5 | 17.8% | 0.8% |
| 29RV | ita0 | 137 | 1183 | 1096 | 430.8 | 0.4 | 9 | 16.4% | 0.9% |
Regions of significant linkage disequilibrium and specific allelic combinations with distorted low frequency (REF and ALT indicate Col-0 and alternate parent alleles, respectively) in each of eight Arabidopsis thaliana recombinant inbred populations. Regions are denoted as chromosome @ nucleotide position (in MB)
| INRA population | Alternate parent | LD region 1 | low frequency allelic combination 1 | LD region 2 | low frequency allelic combination 2 |
|---|---|---|---|---|---|
| 3RV | tsu0 | 3@17.6MB x 4@ 1.2MB | 13.3% REF & REF | ||
| 4RV | nok1 | 1@13.6MB x 5@ 8.5MB | 0% REF & ALT | ||
| 6RV | ri0 | 1@ 2.2MB x 3@23.1MB | 11.1% REF & REF | ||
| 8RV | cvi0 | 1@ 6.6MB x 3@ 2.2MB | 2.3% REF & ALT | 1@27.1MB x 5@ 3.4MB | 0% REF & ALT |
| 13RV | sha | 4@13.2MB x 5@26.7MB | 3.7% REF & ALT | ||
| 21RV | blh1 | 4@ 0.1MB x 5@ 4.4MB | 8.5% REF & REF | ||
| 28RV | jea | 4@10.9MB x 5@ 4.0MB | 2.1% REF & ALT | 2@14.9MB x 4@12.2MB | 16.1% REF & ALT |
| 29RV | ita0 | 1@27.1MB x 5@ 3.9MB | 1.6% REF & ALT | 4@ 1.4MB x 5@ 3.1MB | 3.4% ALT & ALT |
Figure 1A) Linkage map for the Cvi-0 x Col-0 Arabidopsis thaliana RIL population (8RV: http://publiclines.versailles.inra.fr/rils/index) using 1615 total SNPs (both original INRA and new GBS identified SNPs) following conservative imputation. B) Linkage disequilibrium plot of intra- and inter-chromosomal LD in Cvi-0 x Col-0 as measured by SNP-SNP Pearson product moment correlation coefficients (r2; top right of diagonal) with associated significance tests (p-value; lower left of diagonal). See Fig. S1 for linkage maps and LD plots of remaining 13 Arabidopsis RIL populations referenced in Table 1.
Results of simulating small and large effect QTL in each of 14 Arabidopsis thaliana recombinant inbred populations illustrating increased power of detection and enhanced interval resolution of linkage maps with GBS SNPs in comparison to original INRA maps
| Small Effect QTL | |||||||
|---|---|---|---|---|---|---|---|
| INRA population | Alternate parent | Mean QTL PVE | Mean 1.5 LOD Interval (cM) | Perc. Reduction of QTL Interval | Perc. QTL detected by INRA | ||
| GBS | INRA | GBS | INRA | ||||
| 2RV | bla1 | 10 | 9.7 | 25.2 | 29.1 | −13.4 | 74.2 |
| 3RV | tsu0 | 10.1 | 9.6 | 23.3 | 28.1 | −17.1 | 75.7 |
| 4RV | nok1 | 10.5 | 9.9 | 21.3 | 28.5 | −25.3 | 75.9 |
| 6RV | ri0 | 10.3 | 9.5 | 24.5 | 34.1 | −28.2 | 79.1 |
| 8RV | cvi0 | 10.6 | 9.8 | 24.2 | 36.9 | −34.4 | 78 |
| 13RV | sha | 9.8 | 9.3 | 21 | 27.1 | −22.5 | 80.3 |
| 17RV | ge0 | 9.9 | 9.2 | 21.9 | 28.4 | −22.9 | 78.2 |
| 19RV | can0 | 10.1 | 9.5 | 22.6 | 30.6 | −26.1 | 75.7 |
| 20RV | bur0 | 10.1 | 9.5 | 23.8 | 32.8 | −27.4 | 75.5 |
| 21RV | blh1 | 10.7 | 10 | 24.7 | 31.6 | −21.8 | 68.7 |
| 23RV | yo0 | 10.2 | 9.4 | 21.1 | 29.9 | −29.4 | 74.4 |
| 27RV | oy0 | 10 | 9.4 | 24.5 | 29.2 | −16.1 | 74.9 |
| 28RV | jea | 10.1 | 9.4 | 22.3 | 28.6 | −22 | 78.6 |
| 29RV | ita0 | 10.7 | 10 | 22 | 27.5 | −20 | 76.9 |
Figure 2A) Linkage disequilibrium (LD; r2) as a function of distance across all five Arabidopsis thaliana chromosomes of combined and separate RIL populations. Black lines illustrate LD in all 14 RIL populations of the NAM panel described herein; gray lines illustrate LD in a subset of just 7 RIL populations, and colored best fit lines illustrate the range of LD in each of the 7 populations. B) LD plots illustrating inter-chromosomal associations that are typically weak (r2; top right of diagonal) but commonly significant (p-values; lower left of diagonal).
Figure 3Plots of the ability for increasing numbers of randomly combined Arabidopsis thaliana RIL populations to detect simulated QTL of variable effect sizes (see legend for average simulated percent variance explained). Simulations bracket the range of possible allele frequencies found across the 14 RIL populations, testing two scenarios: A) simulated QTL are shared among all alternate parents and so are found to vary in all RIL populations and B) simulated QTL vary within only a single RIL population.