| Literature DB >> 29150594 |
Patrick O Ongom1, Gebisa Ejeta2,3.
Abstract
Multi-parent advanced generation intercross (MAGIC) populations are powerful next-generation mapping resources. We describe here the mating design and structure of the first MAGIC population in sorghum, and test its utility for mapping. The population was developed by intercrossing 19 diverse founder lines through a series of paired crosses with a genetic male sterile (MS) source, followed by 10 generations of random mating. At the final stage of random mating, 1000 random fertile plants in the population were identified and subjected to six generations of selfing to produce 1000 immortal MAGIC inbred lines. The development of this sorghum MAGIC population took over 15 yr. Genotyping-by-sequencing (GBS) of a subset of 200 MAGIC lines identified 79,728 SNPs, spanning high gene-rich regions. Proportion of SNPs per chromosome ranged from 6 to 15%. Structure analyses produced no evidence of population stratification, portraying the desirability of this population for genome-wide association studies (GWAS). The 19 founders formed three clusters, each with considerable genetic diversity. Further analysis showed that 73% of founder alleles segregated in the MAGIC population. Linkage disequilibrium (LD) patterns depicted the MAGIC population to be highly recombined, with LD decaying to r2 [Formula: see text] 0.2 at 40 kb and down to r2 [Formula: see text] 0.1 at 220 kb. GWAS detected two known plant height genes, DWARF1 (chromosome 9) and DWARF3 (chromosome 7), and a potentially new plant height quantitative trait locus (QTL) (QTL-6) on chromosome 6. The MAGIC population was found to be rich in allelic content with high fragmentation of its genome, making it fit for both gene mapping and effective marker-assisted breeding.Entities:
Keywords: MPP; genotyping-by-sequencing (GBS); linkage disequilibrium (LD); multi-parent advanced generation intercrosses (MAGIC); population structure; single nucleotide polymorphisms (SNPs)
Mesh:
Year: 2018 PMID: 29150594 PMCID: PMC5765360 DOI: 10.1534/g3.117.300248
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Founder lines of the sorghum MAGIC population
| Founders | Agronomic Significance |
|---|---|
| K443 | Drought tolerance and early ∼67 d to 50% flowering |
| TX430 | Yield potential, wide adaptation, drought tolerant, and medium ∼71 d to 50% flowering |
| GSA1346 | Drought tolerance and medium ∼85 d to 50% flowering |
| K22/22 | Drought tolerance |
| MR750 | Yield, disease and insect resistance (ergot, midge, and Sugarcane aphid), and general combining ability |
| K22/35 | Drought tolerance |
| SUCR3680/70 | Yield potential and general combining ability |
| K1/4 | Drought tolerance |
| MR732 | Yield potential, medium ∼76 d to 50% flowering, grain quality, and general combining ability |
| M36200 | Yield potential, grain quality, and general combining ability |
| MR747 | Yield potential, grain quality, and general combining ability |
| P954063 | Yield potential, early ∼68 d to 50% flowering, wide adaptation, grain quality, and general combining ability |
| M91051 | Yield potential, grain quality, and general combining ability |
| K1597 | Drought tolerance, early ∼52 d to 50% flowering, general combining ability |
| PP619 | Drought tolerance and yield potential |
| M36031 | Yield potential and grain quality |
| MR727 | Yield potential and grain quality, general combining ability |
| CS3541-22 | Yield potential, medium ∼75 d to 50% flowering, grain mold resistance, grain quality, general combining ability, and wide adaptation |
| TX2737 | Drought tolerance, medium ∼70 d to 50% flowering, and general combining ability |
Figure 1Mating design of the first multi-parent advanced generation intercross (MAGIC) population of sorghum, developed with the aid of genetic male sterility. In this figure, ms refers to the male sterile sorghum lines used to introduce sterility in the MAGIC population, RM0 to RM9 refers to cycles of random mating, S0 to S7 refers to generations of selfing, and SSD refers to single-seed descent. Initial pairwise hybridization made between 19 founding parents and 10 random sterile plants. Progenies were bulked and random mated for nine cycles, 1000 random lines were generated, and a subset of 200 has been genotyped.
Figure 2Chromosome-wide distribution of single nucleotide polymorphisms (SNPs) and reference sequence genes. Each of the 10 sorghum chromosomes (chr) is presented with SNP density (in gray) and a corresponding gene density (in blue) plotted below it. Chromosomes 5–10 are shown in Figure S1 in File S1. A summary plot of the number of SNPs per chromosome is presented at the bottom right hand corner of this figure. Pos, position.
Figure 3Pattern of genetic features in a subset of sorghum multi-parent advanced generation intercross (MAGIC) population and founder lines. (a) Chromosome-wide distribution of mean minor allele frequency (MAF), mean proportion of heterozygotes, and missing data in the MAGIC subset. The embedded bar chart in (a) depicts the percentage proportion of MAGIC SNPs corresponding to bins or categories of MAF. (b) Chromosome-wide distribution of mean MAF, mean proportion of heterozygotes, and missing data among the 19 founder lines. In (a and b), summary statistics are presented in the embedded text boxes. Nr., number; Pos, position.
Chromosome-wide distribution of polymorphic founder alleles that segregated in the MAGIC population
| MT | FT | MOnly | FOnly | M ∩ F | *(M ∩ F)/FT | |
|---|---|---|---|---|---|---|
| Chr 1 | 7,279 | 7,280 | 1,939 | 1,940 | 5,340 | 0.73 |
| Chr 2 | 6,510 | 6,231 | 1,928 | 1,649 | 4,582 | 0.74 |
| Chr 3 | 6,315 | 6,396 | 1,632 | 1,713 | 4,683 | 0.73 |
| Chr 4 | 5,379 | 5,255 | 1,579 | 1,455 | 3,800 | 0.72 |
| Chr 5 | 3,718 | 4,180 | 770 | 1,232 | 2,948 | 0.71 |
| Chr 6 | 4,517 | 4,506 | 1,066 | 1,055 | 3,451 | 0.77 |
| Chr 7 | 3,070 | 3,129 | 813 | 872 | 2,257 | 0.72 |
| Chr 8 | 3,375 | 3,689 | 822 | 1,136 | 2,553 | 0.69 |
| Chr 9 | 3,993 | 3,942 | 991 | 940 | 3,002 | 0.76 |
| Chr 10 | 4,171 | 3,937 | 1,280 | 1,046 | 2,891 | 0.73 |
| Summary | 48,327 | 48,545 | 12,820 | 13,038 | 35,507 | 0.73 |
MT and FT refer to the respective total number of multi-parent advanced generation intercross (MAGIC) and founder alleles that were polymorphic at minor allele frequency > 0.01; MOnly and FOnly are polymorphic alleles that were unique to the MAGIC and founders, respectively; and M ∩ F are the alleles that are shared between the MAGIC and the founders. * indicates the ratio of shared alleles to total founder alleles, depicting the proportion of founder alleles that segregated in the MAGIC population. Chr, chromosome.
Figure 4PC analysis of the sorghum MAGIC population. (a) Scree plot of the PCs (x-axis) and their contribution to variance (the eigenvalues) y-axis). (b) Axes plot of the first two PCs for the MAGIC subset (black dots), including the founders (red dots labeled with founder names). The MAGIC subset appears to form a single group surrounded by three major clusters of founders (black circles around groups of red dots). chr, chromosome; MAGIC, multi-parent advanced generation intercross; PC, principal component.
Figure 6Genome-wide scan for plant height loci. (a) Manhattan plot based on Q-model shows a strong association signal in the plant height gene regions. (b–d) Highlight of peaks for the two known (DWARF1 and DWARF3) and one new (QTL-6) dwarfing genes: (b) QTL-6, (c) DWARF3, and (d) DWARF1. The vertical dashed lines show the physical positions of the height genes. (e–g) Q-Q plots for the three models: (e) naïve model, (f) Q-model, and (g) Q+K-model. Significant threshold (horizontal line) in (a) was determined based on the average linkage disequilibrium (LD) decay extent in the multi-parent advanced generation intercross (MAGIC) population as: Number of independent tests = reference genome size (730 Mb)/MAGIC LD extent (220 kb); the threshold is then obtained as [0.05/(730 Mb/0.22 Mb)] = 1.50685 × 10−5, equivalent to −log10(P) of 4.8.
Figure 5LD patterns. (a and b) LD heatmaps of a segment of the MAGIC genome based on (a) r2 and (b) D’, with r2 and D’ shown above diagonals while P-values are below diagonals. Each cell represents the comparison of two pairs of marker sites with the color codes shown for the presence of significant LD. (c) LD decay rate for each chromosome. The top right insert depicts the mean genome-wide decay rate. The vertical dashed lines intersect the decay curve at r2 = 0.2 and r2 = 0.1, corresponding to the distances of 40 and 220 kb, respectively, the LD values below which the pairwise linkage between loci dissipates. chr, chromosome; LD, linkage disequilibrium; MAF, minor allele frequency; MAGIC, multi-parent advanced generation intercross; SNP, single nucleotide polymorphism.
ANOVA depicting magnitude of genetic variation for plant height among the 200 subset of sorghum MAGIC population
| Mean Squares | Variance Components and Repeatability | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| 2013 | 2014 | Combined | Source | 2013 | 2014 | Combined | ||||
| Source of var. | d.f. | MS | d.f. | MS | d.f. | MS | ||||
| Genotype (G) | 199 | 760*** | 199 | 1358*** | 199 | 1849*** | σ2G | 325 | 582 | 395 |
| Replication | 1 | 75 | 1 | 812* | 1 | 690* | σ2Y | — | — | 12 |
| Year (Y) | — | — | — | — | 1 | 5187*** | σ2G × Y | — | — | 58 |
| G × Y | — | — | — | — | 199 | 269*** | σ2e | 110 | 193 | 152 |
| Residuals | 199 | 110 | 199 | 194 | 399 | 152 | σ2T | 435 | 776 | 618 |
| Total | 399 | 399 | 799 | R | 0.86 | 0.86 | 0.85 | |||
| Mean | 113 | 110 | 183 | |||||||
| Min | 113 | 110 | 111 | Proportion of Components (%) | ||||||
| Max | 230 | 250 | 235 | Source | 2013 | 2014 | Combined | |||
| 7.4 | 9.8 | 6.2 | σ2G | 75 | 75 | 64 | ||||
| — | — | 0.6 | σ2Y | — | — | 2 | ||||
| — | — | 8.7 | σ2G × Y | — | — | 9 | ||||
| 20.7 | 27.5 | 17.1 | σ2e | 25 | 25 | 25 | ||||
| — | — | 1.7 | σ2T | 100 | 100 | 100 | ||||
| — | — | 24.2 | ||||||||
| CV(%) | 9.3 | 12.7 | 6.7 | |||||||
Analysis results are presented for year 2014 and 2014 independently and in combination. *, **, and *** indicate significance at 0.05, 0.01, and 0.001 probability levels, respectively. σ2G, σ2Y, σ2G × Y, σ2e, and σ2T are the respective variance components for genotype, year, genotype by year interaction, error, and total. “R” is repeatability (analogous to broad sense heritability for estimates in single or few environments) and is a measure of predictability of trait measurements over time. MS, mean squares; d.f., degrees of freedom; Min, minimum; Max, maximum; LSD, least significant difference; CV, coefficient of variation.
The respective SEM for genotypes, year and the interaction between genotype and year.
The least significant difference for genotypes, year and the interaction between genotype and year respectively.