| Literature DB >> 29574570 |
Vemuri Hindu1,2, Natalia Palacios-Rojas3, Raman Babu1,4, Willy B Suwarno3,5, Zerka Rashid1, Rayalcheruvu Usha2, Gajanan R Saykhedkar1,6, Sudha K Nair7.
Abstract
KEY MESSAGE: Genome-wide association study (GWAS) on 923 maize lines and validation in bi-parental populations identified significant genomic regions for kernel-Zinc and-Iron in maize. Bio-fortification of maize with elevated Zinc (Zn) and Iron (Fe) holds considerable promise for alleviating under-nutrition among the world's poor. Bio-fortification through molecular breeding could be an economical strategy for developing nutritious maize, and hence in this study, we adopted GWAS to identify markers associated with high kernel-Zn and Fe in maize and subsequently validated marker-trait associations in independent bi-parental populations. For GWAS, we evaluated a diverse maize association mapping panel of 923 inbred lines across three environments and detected trait associations using high-density Single nucleotide polymorphism (SNPs) obtained through genotyping-by-sequencing. Phenotyping trials of the GWAS panel showed high heritability and moderate correlation between kernel-Zn and Fe concentrations. GWAS revealed a total of 46 SNPs (Zn-20 and Fe-26) significantly associated (P ≤ 5.03 × 10-05) with kernel-Zn and Fe concentrations with some of these associated SNPs located within previously reported QTL intervals for these traits. Three double-haploid (DH) populations were developed using lines identified from the panel that were contrasting for these micronutrients. The DH populations were phenotyped at two environments and were used for validating significant SNPs (P ≤ 1 × 10-03) based on single marker QTL analysis. Based on this analysis, 11 (Zn) and 11 (Fe) SNPs were found to have significant effect on the trait variance (P ≤ 0.01, R2 ≥ 0.05) in at least one bi-parental population. These findings are being pursued in the kernel-Zn and Fe breeding program, and could hold great value in functional analysis and possible cloning of high-value genes for these traits in maize.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29574570 PMCID: PMC6004279 DOI: 10.1007/s00122-018-3089-3
Source DB: PubMed Journal: Theor Appl Genet ISSN: 0040-5752 Impact factor: 5.699
Pedigree and summary statistics of DH populations employed in bi-parental mapping at two locations
| DH population | Pedigree | Population size | Trait | Celaya(CE) | Tlaltizapan(TZ) | ||
|---|---|---|---|---|---|---|---|
| Mean (µg/g) | Range (µg/g) | Mean (µg/g) | Range (µg/g) | ||||
| DHP1 | DTPYC9-F13-2-1-1-2-B-B/CML312 | 96 | Zn | 25.61 ± 0.40 | 17.85–37.45 | 26.07 ± 0.45 | 18.50–36.90 |
| Fe | 12.23 ± 0.28 | 7.10–24.50 | 15.14 ± 0.35 | 8.40–22.50 | |||
| DHP2 | CML503/CLWN201 | 112 | Zn | 25.32 ± 0.47 | 15.80–48.00 | 24.01 ± 0.38 | 16.15–35.35 |
| Fe | 13.67 ± 0.32 | 6.30–23.45 | 12.96 ± 0.30 | 7.30–22.00 | |||
| DHP3 | CML 465/CML451 | 143 | Zn | 27.96 ± 0.39 | 17.70–43.15 | 24.08 ± 0.33 | 15.60–37.80 |
| Fe | 13.91 ± 0.20 | 9.00–20.70 | 14.26 ± 0.18 | 8.20–19.00 | |||
Estimates of mean, variance components (across 3 locations) and broad-sense heritability for kernel-Zn and Fe concentrations in GWAS panel
| Trait | Environments | Zn (µg/g) | Fe (µg/g) |
|---|---|---|---|
| Mean | AF12A | 26.28 ± 2.61 | 16.29 ± 1.81 |
| CE12B | 25.26 ± 2.36 | 13.18 ± 1.68 | |
| AF13A | 29.6 ± 3.22 | 14.48 ± 2.04 | |
| Across | 27.04 ± 2.76 | 14.65 ± 1.85 | |
| Range | AF12A | 15.20–42.50 | 8.25–28.60 |
| CE12B | 14.76–50.15 | 5.65–25.19 | |
| AF13A | 17.10–39.80 | 7.12–27.60 | |
| Across | 17.11–43.69 | 8.19–25.65 | |
| LSD | AF12A | 5.11 | 3.54 |
| CE12B | 4.62 | 3.29 | |
| AF13A | 6.30 | 3.99 | |
| Across | 5.41 | 3.63 | |
| CV | AF12A | 9.91 | 11.10 |
| CE12B | 9.34 | 12.73 | |
| AF13A | 10.87 | 14.07 | |
| Across | 10.22 | 12.64 | |
| H2 | AF12A | 0.78 | 0.83 |
| CE12B | 0.83 | 0.78 | |
| AF13A | 0.76 | 0.77 | |
| Across | 0.86 | 0.83 | |
|
| 1.28*** | 4.65*** | |
|
| 2.12*** | 10.84*** |
AF12A Agua Fria 2012, CE12B Celaya 2012, AF13A Agua Fria 2013, H broad-sense heritability, LSD least significant difference, CV coefficient of variance, G genotype, E environment
***Indicates significance at 0.001
Fig. 1Quantile–quantile (Q–Q) plots showing inflation of estimated − log10 (P) values versus observed for traits Zn and Fe using uncorrected association model (U), Q (GLM) and Q + K (MLM). Q ten principal components (fixed), K kinship matrix (random), GLM general linear model, MLM mixed linear model
Fig. 2Manhattan plot from the Q + K (MLM) model for Zn and Fe, plotted with the individual SNPs of all chromosomes on the X-axis and − log10 P value of each SNP in the Y-axis. The different colors indicate the 10 chromosomes of maize. The black horizontal line shows the multiple testing threshold − log10 P value of 4.3 for this panel. The blue vertical lines show the associated SNPs validated in bi-parental populations
GWAS identified SNPs found within the previously reported QTLs for kernel-Zn and Fe
| Trait | GWAS-identified SNPs | Chr | Physical position (bp)a | QTL interval | QTL bin | QTL study |
|---|---|---|---|---|---|---|
| Zn | S3_40522792 | 3 | 40,522,792 | umc1504–umc1386a | 3.04 | Qin et al. ( |
| Zn | S4_161165956 | 4 | 161,165,956 | bnlg1621a–dupssr16 | 4.06 | Jin et al. ( |
| Zn | S5_100070727 | 5 | 100,070,727 | umc1110–bnlg1208 | 5.04 | |
| Zn | S9_151265550 | 9 | 151,265,550 | umc1310–bnlg128 | 9.06–9.07 | |
| Zn | S9_151265550 | 9 | 151,265,550 | dupssr29–bnlg619 | 9.07 | Qin et al. ( |
| Fe | S2_19265861 | 2 | 19,265,861 | umc1542–umc1042 | 2.07 | Qin et al. ( |
| Fe | S2_19265861 | 2 | 19,265,861 | bnlg1690–umc1890 | 2.04–2.07 | Jin et al. ( |
| Fe | S3_186200393 | 3 | 186,200,393 | mmp144a–umc1266 | 3.04–3.06 | |
| Fe | S4_167189737 | 4 | 167,189,737 | bnlg1621a–dupssr16 | 4.06 | |
| Fe | S5_5104719 | 5 | 5,104,719 | rz87, RZ87 | 5.1 | Lung’aho et al. ( |
| Fe | S8_164741044 | 8 | 164,741,044 | ZM0825 | 8.06 | Simic et al. ( |
| Fe | S8_164741133 | 8 | 164,741,133 | |||
| Fe | S8_164750150 | 8 | 164,750,150 |
The physical coordinates of the GWAS-identified SNPs and chromosomal bins of markers reported in earlier QTL mapping studies are obtained from B73 AGPV2 and maize GDB
aPhysical positions were retrieved from maize B73 AGPV2
GWAS identified markers validated in DH populations for kernel-Zn and Fe concentration
| Trait | SNP | Chr | Physical Position (bp)* | GWAS_ | DHP_ | Add. effect (µg/g)_DHP | ANOVA_adj | DH population | Effect_location | Gene model | Annotation |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Zn | S1_275780297 | 1 | 275,780,297 | 6.12E−04 | 7.01E−04 | − 1.53 | 10.3 | DHP2 | CE | GRMZM2G472171 | Protein_coding |
| S1_279913529 | 1 | 279,913,529 | 5.14E−04 | 4.78E−03 | − 1.57 | 10.7 | GRMZM2G400714 | Zinc ion binding and ZOS8-04—C2H2 zinc finger protein | |||
| S3_40522792 | 3 | 40,522,792 | 7.28E−05 | 9.63E−03 | − 0.82 | 5.1 | DHP3 | TZ | GRMZM2G089959 | PPR repeat containing protein | |
| S5_56793531 | 5 | 56,793,531 | 9.09E−04 | 4.72E−04 | 1.64 | 10.7 | DHP2 | CE and TZ | GRMZM2G116586 | UTP-glucose-1-phosphate uridylyltransferase | |
| S5_68423957 | 5 | 68,423,957 | 6.45E−04 | 2.79E−03 | 1.38 | 7.9 | GRMZM2G074896 | Glycosyltransferase protein | |||
| S5_71718466 | 5 | 71,718,466 | 7.35E−04 | 4.49E−06 | 1.81 | 13.1 | GRMZM2G144282 | Translation initiation factor eIF3 subunit | |||
| S5_100070727 | 5 | 100,070,727 | 6.40E−04 | 2.27E−06 | 1.60 | 13.3 | GRMZM2G079653 | Ice binding, Homoiothermy, | |||
| S7_173181688 | 7 | 173,181,688 | 5.44E−04 | 5.08E−09 | − 2.24 | 29.2 | DHP3 | GRMZM2G158162 | Zinc ion binding and B3 DNA binding domain containing protein | ||
| S7_173181689 | 7 | 173,181,689 | 6.75E−05 | 5.21E−09 | − 2.20 | 28.4 | |||||
| S8_125472630 | 8 | 125,472,630 | 3.95E−05 | 1.00E−02 | 1.33 | 7.4 | DHP2 | CE | GRMZM2G048200 | GDSL-like lipase,acylhydrolase | |
| S9_151265550 | 9 | 151,265,550 | 5.51E−05 | 1.68E−03 | 1.13 | 6.6 | DHP3 | GRMZM2G178775 | Putative ribonuclease P | ||
| Fe | S2_19265861 | 2 | 19,265,861 | 9.37E−04 | 1.45E−04 | − 0.78 | 11.0 | DHP3 | CE | GRMZM2G162333 | Pectinesterase activity |
| S3_186200393 | 3 | 186,200,393 | 1.78E−05 | 3.23E−03 | − 0.88 | 10.1 | DHP1 | GRMZM2G312201 | Regulation of transcription and pentatricopeptide | ||
| S4_236412442 | 4 | 236,412,442 | 9.06E−04 | 2.23E−04 | − 1.14 | 13.9 | DHP2 | TZ | GRMZM2G471479 | – | |
| S7_174289806 | 7 | 174,289,806 | 8.56E−04 | 1.97E−03 | − 0.90 | 11.0 | DHP1 | CE | GRMZM2G130950 | 5\′-AMP-activated protein kinase beta-2 subunit protein | |
| S8_164741044 | 8 | 164,741,044 | 2.89E−05 | 4.03E−04 | 1.69 | 27.4 | DHP2 | GRMZM2G472991 | Serine/threonine-protein kinase | ||
| S8_164741133 | 8 | 164,741,133 | 3.38E−05 | 2.26E−07 | 1.71 | 27.7 | |||||
| S8_164750150 | 8 | 164,750,150 | 6.59E−04 | 1.01E−04 | 1.19 | 14.8 | GRMZM2G172032 | Isoprenoid biosynthetic process | |||
| S8_167013673 | 8 | 167,013,673 | 9.99E−04 | 2.18E−07 | 1.80 | 34.1 | GRMZM2G074462 | Starch binding domain containing protein | |||
| S8_167086477 | 8 | 167,086,477 | 2.36E−05 | 4.01E−04 | 1.14 | 12.6 | GRMZM2G162329 | nsp1-like C-terminal region family protein | |||
| S8_167089929 | 8 | 167,089,929 | 4.97E−04 | 6.34E−04 | − 1.15 | 12.9 | GRMZM2G061043 | nsp1-like C-terminal region family protein | |||
| S8_172006274 | 8 | 172,006,274 | 1.38E−04 | 6.44E−06 | 1.47 | 19.2 | GRMZM2G004690 | Protein_coding |
The chromosomal locations along with effect estimates in different bi-parental populations, along with predicted gene models (B73 AGPV2) within which the SNPs were located are depicted
*Physical positions were retrieved from maize B73 AGPV2. CE and TZ represent the two environments Celaya and Tlaltizapan respectively
Fig. 3Box plots showing the phenotypic values of the different allele classes of major-effect SNPs validated in DH populations for Zn and Fe (upper and lower panel, respectively). The SNP names, P value, alleles and the specific DH population where the effect is witnessed are mentioned near each box. The black horizontal lines in the middle of the boxes are the median values for the Zn or Fe concentration in the respective allele classes. The vertical size of the boxes represents the inter-quantile range. The upper and lower whiskers represent the minimum and maximum values of data