| Literature DB >> 35487909 |
Nagenahalli Dharmegowda Rathan1, Hari Krishna1, Ranjith Kumar Ellur1, Deepmala Sehgal2, Velu Govindan3, Arvind Kumar Ahlawat1, Gopalareddy Krishnappa4, Jai Prakash Jaiswal5, Jang Bahadur Singh6, Saiprasad Sv6, Divya Ambati6, Sumit Kumar Singh1, Kriti Bajpai1, Anju Mahendru-Singh7.
Abstract
Malnutrition due to micronutrients and protein deficiency is recognized among the major global health issues. Genetic biofortification of wheat is a cost-effective and sustainable strategy to mitigate the global micronutrient and protein malnutrition. Genomic regions governing grain zinc concentration (GZnC), grain iron concentration (GFeC), grain protein content (GPC), test weight (TW), and thousand kernel weight (TKW) were investigated in a set of 184 diverse bread wheat genotypes through genome-wide association study (GWAS). The GWAS panel was genotyped using Breeders' 35 K Axiom Array and phenotyped in three different environments during 2019-2020. A total of 55 marker-trait associations (MTAs) were identified representing all three sub-genomes of wheat. The highest number of MTAs were identified for GPC (23), followed by TKW (15), TW (11), GFeC (4), and GZnC (2). Further, a stable SNP was identified for TKW, and also pleiotropic regions were identified for GPC and TKW. In silico analysis revealed important putative candidate genes underlying the identified genomic regions such as F-box-like domain superfamily, Zinc finger CCCH-type proteins, Serine-threonine/tyrosine-protein kinase, Histone deacetylase domain superfamily, and SANT/Myb domain superfamily proteins, etc. The identified novel MTAs will be validated to estimate their effects in different genetic backgrounds for subsequent use in marker-assisted selection.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35487909 PMCID: PMC9054743 DOI: 10.1038/s41598-022-10618-w
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Genetic parameters from the GWAS panel evaluated in IARI-New Delhi (E1), IARI-Indore (E2), GBPUAT-Pantnagar (E3), and across environments (E4).
| Trait | Env | GWAS panel | |||||
|---|---|---|---|---|---|---|---|
| Range | Mean ± SD | CV (%) | LSD | Genotype variance | |||
| GZnC (mg/kg) | E1 | 31.86–56.93 | 44.64 ± 5.16 | 11.55 | 5.58 | 0.88 | 30.78*** |
| E2 | 22.31–36.91 | 28.83 ± 2.37 | 8.22 | 3.86 | 0.75 | 7.61*** | |
| E3 | 17.56–32.90 | 23.56 ± 2.80 | 11.90 | 4.95 | 0.72 | 11.16*** | |
| E4 | 28.65–37.72 | 32.30 ± 1.70 | 5.26 | 4.73 | 0.50 | 5.77*** | |
| GFeC (mg/kg) | E1 | 31.87–39.74 | 35.17 ± 1.58 | 4.48 | 4.21 | 0.53 | 4.81* |
| E2 | 36.71–51.38 | 41.42 ± 2.29 | 5.53 | 4.85 | 0.64 | 8.37** | |
| E3 | 25.47–52.09 | 31.28 ± 3.31 | 10.59 | 4.71 | 0.80 | 13.96*** | |
| E4 | 34.12–40.05 | 35.94 ± 0.99 | 2.76 | 3.45 | 0.40 | 2.53*** | |
| GPC (%) | E1 | 11.56–15.81 | 12.96 ± 0.70 | 5.39 | 1.14 | 0.75 | 0.66*** |
| E2 | 8.77–12.39 | 10.31 ± 0.67 | 6.46 | 0.90 | 0.82 | 0.55*** | |
| E3 | 8.63–13.24 | 10.33 ± 0.83 | 8.07 | 1.31 | 0.77 | 0.92*** | |
| E4 | 10.38–13.00 | 11.23 ± 0.44 | 3.93 | 1.09 | 0.56 | 0.35*** | |
| TKW (g) | E1 | 27.79–52.69 | 38.75 ± 4.09 | 10.56 | 3.52 | 0.92 | 18.39*** |
| E2 | 31.23–57.18 | 41.92 ± 3.59 | 8.57 | 4.04 | 0.87 | 15.12*** | |
| E3 | 24.33–51.53 | 36.49 ± 4.38 | 11.99 | 6.11 | 0.81 | 24.20*** | |
| E4 | 29.54–49.80 | 38.99 ± 3.01 | 7.71 | 4.83 | 0.75 | 12.05*** | |
| TW (kg/hl) | E1 | 71.32–81.45 | 77.64 ± 1.88 | 2.43 | 2.61 | 0.81 | 4.47*** |
| E2 | 74.10–83.77 | 80.26 ± 1.66 | 2.07 | 1.42 | 0.92 | 3.04*** | |
| E3 | 64.48–75.43 | 71.99 ± 1.74 | 2.42 | 3.01 | 0.73 | 4.26*** | |
| E4 | 71.76–79.05 | 76.50 ± 1.31 | 1.71 | 2.22 | 0.73 | 2.34*** | |
E1, IARI, New Delhi; E2, IARI, Indore; E3, GBPUAT, Pantnagar; E4, across environments; SD, standard deviation; CV, coefficient of variance; LSD, least significant difference; h2, heritability.
*Significant at p < 0.05.
**Significant at p < 0.01.
***Significant at p < 0.001.
Figure 1Frequency distribution of GZnC, GFeC, GPC, TKW and TW in the GWAS panel evaluated in IARI-New Delhi (E1), IARI-Indore (E2), GBPUAT-Pantnagar (E3), and across environments (E4).
The highest-performing 10 lines for GFeC, GZnC, GPC, TW and TKW in the GWAS panel based on combined BLUPs across environments.
| Genotypes | GFeC (mg/kg) | Genotypes | GZnC (mg/kg) | Genotypes | GPC (%) | Genotypes | TW (g) | Genotypes | TKW (g) |
|---|---|---|---|---|---|---|---|---|---|
| Kundan | 40.05 | SHW (2.38) | 37.72 | Local collection 1c01 | 13.00 | C273 | 79.05 | Lokbold | 49.8 |
| UP2672 | 39.62 | C591 | 36.41 | IITR26 | 12.67 | HD3118 | 78.93 | DL1532 | 47.23 |
| NW1014 | 39.36 | NP852 | 36.06 | NP852 | 12.50 | HD3354 | 78.81 | Kundan | 46.2 |
| SHW (2.38) | 38.27 | Navrattan | 36.05 | Navrattan | 12.32 | HD2982 | 78.81 | CS5 | 45.94 |
| Hindi62 | 38.09 | QBP128 | 35.95 | VL829 | 12.29 | HI617 | 78.69 | Asocmap260 | 44.83 |
| Westonia | 38.04 | Kharchia65 | 35.88 | SHW (2.38) | 12.22 | HI1563 | 78.69 | UP2425 | 44.65 |
| Navrattan | 37.87 | C273 | 35.63 | NP770 | 12.11 | RAJ4120 | 78.69 | DBW187 | 44.52 |
| HD2932 | 37.81 | NP770 | 35.46 | DBW14 | 12.00 | K68 | 78.56 | PBW752 | 44.24 |
| Lokbold | 37.76 | Lokbold | 35.26 | HD2189 | 11.99 | QBP1210 | 78.56 | PBW689 | 43.91 |
| HD2189 | 37.74 | K68 | 34.98 | HI1605 | 11.98 | K1317 | 78.56 | DBW43 | 43.51 |
Pair-wise correlation coefficients among the traits in GWAS panel evaluated in IARI-New Delhi (E1), IARI-Indore (E2), GBPUAT-Pantnagar (E3), and across environments (E4).
| Traits | GZnC | GPC | TW | TKW | |
|---|---|---|---|---|---|
| IARI-New Delhi (E1) | GFeC | 0.60*** | 0.28*** | 0.18* | 0.20** |
| GZnC | 0.46*** | − 0.03 | 0.01 | ||
| GPC | 0.03 | 0.03 | |||
| TW | 0.50*** | ||||
| IARI-Indore (E2) | GFeC | 0.37*** | 0.22** | 0.12 | 0.17* |
| GZnC | 0.30*** | 0.09 | − 0.01 | ||
| GPC | − 0.19** | − 0.23** | |||
| TW | 0.23** | ||||
| GBPUAT-Pantnagar (E3) | GFeC | 0.25*** | 0.23** | 0.06 | 0 |
| GZnC | − 0.14 | 0.1 | − 0.04 | ||
| GPC | − 0.16* | − 0.15* | |||
| TW | 0.50*** | ||||
| Across Env (E4) | GFeC | 0.41*** | 0.39*** | 0.23** | 0.15* |
| GZnC | 0.30*** | 0.14 | − 0.04 | ||
| GPC | 0 | − 0.18* | |||
| TW | 0.34*** |
*Significant at p < 0.05.
**Significant at p < 0.01.
***Significant at p < 0.001.
Figure 2Scatter plots showing the correlation of grain zinc (GZnC) and iron (GFeC) concentration in GWAS panel evaluated in IARI-New Delhi (E1), IARI-Indore (E2), GBPUAT-Pantnagar (E3), and across environments (E4).
The sub-genome-wise distribution of SNP markers in the GWAS panel.
| GWAS panel | ||||||||
|---|---|---|---|---|---|---|---|---|
| Genome↓/chromosome → | 1 | 2 | 3 | 4 | 5 | 6 | 7 | Total |
| A | 496 | 498 | 402 | 315 | 466 | 336 | 466 | 2979 |
| B | 675 | 653 | 478 | 266 | 588 | 540 | 446 | 3646 |
| D | 610 | 597 | 391 | 170 | 436 | 320 | 354 | 2878 |
Figure 3Scatterplot showing linkage disequilibrium (LD) decay estimated by plotting (r2) against genetic distance (bp) in 184 diverse bread wheat accessions. The green line indicates the threshold point where LD dropped to 50% of its maximum value. LD decay value is at cut off point is indicated on the x-axis with the green font.
Figure 4Three-dimensional principal component analysis plot (a), Neighbor-joining tree (b) inferring the population structure.
Marker trait associations (MTAs) detected for GFeC, GZnC, GPC, TKW, and TW in IARI-New Delhi (E1), IARI-Indore (E2), GBPUAT-Pantnagar (E3), and across environments (E4).
| Environment | Marker | Chromosome | Position (bp) | PV (%) | |
|---|---|---|---|---|---|
| E1 | 2B | 458622067 | 0.0001 | 9.26 | |
| E3 | 6A | 585439024 | 1.25E−05 | 11.14 | |
| 3A | 637962272 | 0.0001 | 12.62 | ||
| E4 | 3B | 473694803 | 2.26E−05 | 8.82 | |
| E2 | 7B | 488410704 | 9.81E−05 | 7.60 | |
| E3 | 1A | 544723357 | 9.94E−05 | 6.35 | |
| E1 | 4D | 482944373 | 1.23E−06 | 18.19 | |
| 3B | 745289825 | 1.70E−06 | 2.67 | ||
| 1A | 398138141 | 2.16E−06 | 13.71 | ||
| 1D | 492534106 | 3.42E−06 | 4.63 | ||
| 2B | 790751851 | 1.94E−05 | 6.51 | ||
| 1A | 354950241 | 4.57E−05 | 12.91 | ||
| 1B | 302518719 | 5.34E−05 | 6.69 | ||
| 2B | 9731087 | 7.80E−05 | 5.63 | ||
| E2 | 5B | 536516629 | 0.0001 | 6.59 | |
| E3 | 3B | 377864897 | 7.14E−08 | 3.27 | |
| 1D | 10720570 | 6.06E−07 | 3.82 | ||
| 2A | 24060452 | 6.93E−07 | 5.26 | ||
| 1D | 58575112 | 9.85E−07 | 2.60 | ||
| 2D | 640213277 | 1.27E−05 | 1.91 | ||
| 7D | 6696023 | 1.71E−05 | 7.07 | ||
| 2D | 620510699 | 5.02E−05 | 4.65 | ||
| E4 | 5B | 689223068 | 5.04E−08 | 13.87 | |
| 3D | 22767467 | 1.33E−06 | 4.08 | ||
| 6A | 307984014 | 2.00E−06 | 2.65 | ||
| 3D | 130928915 | 3.72E−06 | 5.01 | ||
| 1B | 137447851 | 6.01E−05 | 3.01 | ||
| 7A | 21353471 | 7.77E−05 | 7.20 | ||
| 1D | 315034709 | 7.77E−05 | 6.26 | ||
| E1 | 7A | 731882551 | 3.86E−07 | 12.07 | |
| 2B | 15684332 | 8.25E−07 | 10.83 | ||
| 7B | 652928863 | 7.09E−06 | 4.84 | ||
| 2A | 16066165 | 2.54E−05 | 3.71 | ||
| 1A | 403028587 | 2.97E−05 | 1.82 | ||
| 1B | 455211699 | 0.0001 | 2.10 | ||
| 3B | 694325462 | 0.0001 | 9.49 | ||
| 4D | 12599553 | 0.0001 | 7.48 | ||
| E2 | 2B | 779856091 | 8.21E−05 | 1.17 | |
| E3 | 7A | 731882551 | 2.68E−06 | 10.13 | |
| 2D | 590677321 | 3.93E−05 | 3.72 | ||
| 5D | 546679712 | 6.22E−05 | 2.90 | ||
| 1D | 138208001 | 6.95E−05 | 5.27 | ||
| 3A | 21223605 | 0.0001 | 5.60 | ||
| E4 | 7A | 731882551 | 7.25E−05 | 8.09 | |
| E2 | 6A | 425861645 | 2.14E−09 | 7.98 | |
| 5B | 444085704 | 4.68E−07 | 2.05 | ||
| 5D | 84317542 | 1.80E−06 | 7.39 | ||
| 6D | 458362043 | 5.99E−06 | 7.65 | ||
| 7B | 435815231 | 9.48E−06 | 7.23 | ||
| 5A | 440264163 | 2.84E−05 | 2.98 | ||
| 4B | 527009135 | 6.14E−05 | 2.20 | ||
| 3A | 147285088 | 7.58E−05 | 1.65 | ||
| 1B | 667825172 | 9.84E−05 | 2.81 | ||
| E3 | 6A | 3851364 | 1.05E−05 | 13.79 | |
| E4 | 5D | 46312169 | 0.0001 | 4.21 | |
Figure 5Manhattan and QQ plots for GFeC, GZnC, GPC, TKW and TW from IARI-New Delhi (E1), IARI-Indore (E2), GBPUAT-Pantnagar (E3), and across environments (E4).
Figure 6Allelic differences of the significant MTAs identified for GFeC and GZnC in IARI-New Delhi (E1), IARI-Indore (E2), GBPUAT-Pantnagar (E3), and across environments (E4).
Putative candidate genes identified for GZnC and GFeC.
| Trait | SNP ID | Position (Mb) | Chr | TraesID | Putative candidate genes | Molecular function |
|---|---|---|---|---|---|---|
| GFeC | AX-94761251 | 458.62 | 2B | TraesCS2B02G321500 | Domain of unknown function DUF3475 Domain of unknown function DUF668 | Positive regulation of growth |
| TraesCS2B02G321600 | Zinc finger, GRF-type | Zinc ion binding | ||||
| AX-94850629 | 473.69 | 3B | TraesCS3B02G295000 | Serine-threonine/tyrosine-protein kinase Wall-associated receptor kinase | Protein kinase activity Polysaccharide binding, ATP binding | |
| TraesCS3B02G294600 | Rossmann-like alpha/beta/alpha sandwich fold Cytidyltransferase-like domain | Nicotinamide-nucleotide adenylyltransferase activity Catalytic activity ATPase | ||||
| TraesCS3B02G294700 | Reticulon-like protein | Response to bacterium | ||||
| TraesCS3B02G294900 | Glycoside hydrolase family 18 Serine-threonine/tyrosine-protein kinase | Hydrolase activity, Hydrolyzing O-glycosyl compounds, Protein kinase activity, ATP binding | ||||
| TraesCS3B02G294800 | Bifunctional inhibitor/plant lipid transfer protein/seed storage helical domain | Lipid transport | ||||
| TraesCS3B02G295300 | Wall-associated receptor kinase, C-terminal | Polysaccharide binding | ||||
| AX-94715803 | 585.43 | 6A | TraesCS6A02G353900 | Zinc finger CCCH-type, G-patch domain | Nucleic acid binding, Metal ion binding | |
| TraesCS6A02G353700 | Pheophorbide a oxygenase Rieske [2Fe-2S] iron-sulphur domain | Chlorophyllide a oxygenase [overall] activity, 2 iron, 2 sulfur cluster binding | ||||
| TraesCS6A02G353800 | Serine incorporator/TMS membrane protein | Integral component of membrane | ||||
| TraesCS6A02G354100 | Peptidase family M49, NUDIX hydrolase domain | Hydrolase activity | ||||
| AX-95002032 | 637.96 | 3A | TraesCS3A02G389000 | F-box-like domain superfamily | Protein binding | |
| TraesCS3A02G389100 | Spindle assembly checkpoint component Mad1 | Mitotic spindle assembly checkpoint signaling | ||||
| TraesCS3A02G389200 | Domain of unknown function DUF4094, Glycosyltransferase, family 31 | Galactosyltransferase activity Hexosyltransferase activity | ||||
| TraesCS3A02G389400 | Pentatricopeptide repeat | Protein binding | ||||
| TraesCS3A02G389300 | Protein kinase-like domain superfamily | Protein kinase activity | ||||
| GZnC | AX-94422893 | 488.41 | 7B | TraesCS7B02G266000 | Histone deacetylase domain superfamily Ureohydrolase domain superfamily | Histone deacetylase activity Hydrolase activity |
| TraesCS7B02G265800 | Cytochrome P450 | Heme binding Metal ion binding | ||||
| TraesCS7B02G265900 | Myc-type, basic helix-loop-helix (bHLH) domain Helix-loop-helix DNA-binding domain superfamily | Protein dimerization activity | ||||
| TraesCS7B02G266100 | Nucleotide-diphospho-sugar transferases | – | ||||
| TraesCS7B02G266200 | PPM-type phosphatase domain | Phosphatase activity | ||||
| AX-94651424 | 544.72 | 1A | TraesCS1A02G365900 | SANT/Myb domain, Homeobox-like domain superfamily | DNA binding | |
| TKW | AX-94939463 | 731.88 | 7A | TraesCS7A02G560100 | Polysaccharide biosynthesis domain superfamily | Xylan biosynthetic process |
| TraesCS7A02G560000 | Dehydrin | Response to abscisic acid Response to water Cold acclimation |