| Literature DB >> 35608947 |
Leandro Tonello Zuffo1,2,3, Rodrigo Oliveira DeLima2, Thomas Lübberstedt3.
Abstract
The identification of genomic regions associated with root traits and the genomic prediction of untested genotypes can increase the rate of genetic gain in maize breeding programs targeting roots traits. Here, we combined two maize association panels with different genetic backgrounds to identify single nucleotide polymorphisms (SNPs) associated with root traits, and used a genome-wide association study (GWAS) and to assess the potential of genomic prediction for these traits in maize. For this, we evaluated 377 lines from the Ames panel and 302 from the Backcrossed Germplasm Enhancement of Maize (BGEM) panel in a combined panel of 679 lines. The lines were genotyped with 232 460 SNPs, and four root traits were collected from 14-day-old seedlings. We identified 30 SNPs significantly associated with root traits in the combined panel, whereas only two and six SNPs were detected in the Ames and BGEM panels, respectively. Those 38 SNPs were in linkage disequilibrium with 35 candidate genes. In addition, we found higher prediction accuracy in the combined panel than in the Ames or BGEM panel. We conclude that combining association panels appears to be a useful strategy to identify candidate genes associated with root traits in maize and improve the efficiency of genomic prediction.Entities:
Keywords: zzm321990 Zea mays L; Association mapping; candidate genes; genomic selection; inbred line panel; linkage disequilibrium; population structure
Mesh:
Year: 2022 PMID: 35608947 PMCID: PMC9467658 DOI: 10.1093/jxb/erac236
Source DB: PubMed Journal: J Exp Bot ISSN: 0022-0957 Impact factor: 7.298
Summary of the 11 scenarios used for training and validation dataset for genome wide selection analysis
| Scenario | SS BGEM | NSS BGEM | SS Ames | NSS Ames | Training | Validation |
|---|---|---|---|---|---|---|
| Ames | — | — | 39 | 188 | 227 | 150 |
| BGEM | 101 | 79 | — | — | 180 | 122 |
| Ames→BGEM | — | — | 65 | 314 | 379 | 300 |
| BGEM→Ames | 169 | 131 | — | — | 300 | 379 |
| NSS Ames→NSS BGEM | — | — | — | 314 | 314 | 131 |
| NSS BGEM→NSS Ames | — | 131 | — | — | 131 | 314 |
| SS Ames→SS BGEM | — | — | 65 | — | 65 | 169 |
| SS BGEM→SS Ames | 169 | — | — | — | 169 | 65 |
| NSS Combined | — | 79 | — | 188 | 267 | 178 |
| SS Combined | 101 | — | 39 | — | 140 | 94 |
| Combined | 101 | 79 | 39 | 188 | 407 | 272 |
Training set before the arrow, and validation set after the arrow.
Number of lines used on the training set in each subpopulation from fastStructure in each scenario. SS and NS refer to Stiff Stalk and non-Stiff Stalk subpopulation, respectively.
Total number of lines used as training set in each scenario.
Total number of lines used as validation set in each scenario.
Best linear unbiased prediction estimates of means, standard deviation (SD), minimum and maximum, and estimates of broad-sense heritability for primary (PRL, cm), lateral (LRL, cm) and total (TRL, cm) root length, and total number of roots (TRN) measured in 302 and 377 maize inbred lines from BGEM and Ames panels, respectively
| Trait | BGEM panel | Ames panel | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Mean | SD | Min | Max |
| Mean | SD | Min | Max |
| |
| PRL | 32.16 | 2.40 | 22.67 | 38.59 | 0.24 | 27.42 | 3.27 | 17.07 | 38.26 | 0.28 |
| LRL | 231.25 | 51.69 | 107.63 | 438.52 | 0.38 | 134.32 | 47.56 | 36.23 | 283.64 | 0.42 |
| TRL | 263.42 | 52.95 | 128.66 | 475.35 | 0.37 | 174.26 | 49.70 | 69.26 | 323.24 | 0.42 |
| TNR | 13.74 | 2.97 | 6.98 | 23.87 | 0.45 | 10.50 | 2.86 | 3.84 | 17.56 | 0.49 |
Fig. 1.Whole-genome linkage disequilibrium decay in the entire 679 maize lines from the Combined panel. Linkage disequilibrium within and over chromosomes is given in physical distances of 50 kb.
Fig. 2.Analysis of the population structure of 377 inbred lines from Ames panel (A), 302 DH lines from BGEM panel (B), and 679 maize lines from the Combined panel (C) using SNP markers. fastStructure clustering results obtained at K=4. Each inbred line is represented by a thin bar corresponding to the sum of assignment probabilities to the K cluster. For Ames and BGEM panels, orange and yellow refer to NSS and SS subpopulations, respectively; for the Combined panel the association between colors and subpopulations are as follows: orange, SS from BGEM; yellow, NSS from BGEM; blue, SS from Ames; green, NSS from Ames. Panel A (population structure results from Ames panel) was adapted from Pace .
Number of SNP markers significantly associated with some root trait in Ames, BGEM, and Combined panels
| Trait | Ames | BGEM | Combined |
|---|---|---|---|
| Primary root length (cm) | 0 | 0 | 10 |
| Lateral root length (cm) | 0 | 2 | 7 |
| Total root length (cm) | 0 | 1 | 10 |
| Total number of roots | 2 | 3 | 3 |
Fig. 3.Manhattan plot showing associations between SNP markers and root traits plotted from the association analysis. Lateral root length (A), primary root length (B), total number of roots (C), and total root length (D) in the Combined panel. The dashed horizontal line depicts the simpleM correction.
Significant SNP markers and candidate genes associated with lateral root length (LRL), primary root length (PRL), total root length (TRL), and total number of roots (TNR) on Ames, BGEM and Combined panel
| Panel | Trait | SNP | B73 Gene ID | Zm Gene ID | Function |
|---|---|---|---|---|---|
| Ames | TNR | S9_105426105 | GRMZM2G331015 | Zm00001d046883 | Diphthamide biosynthesis protein 3 |
| Ames | TNR | S9_105426105 | GRMZM2G031370 | Zm00001d046882 | Ypt/Rab-GAP domain of gyp1p superfamily protein |
| BGEM | LRL | S2_229011896 | GRMZM5G875516 | Zm00001d007603 | NB-ARC domain containing protein |
| BGEM | LRL | S8_3886189 | GRMZM2G153434 | Zm00001d008285 | PQ-loop repeat family protein/ transmembrane family protein |
| BGEM | TNR | S1_295347415 | GRMZM2G127309 | Zm00001d034729 | Carbohydrate-binding X8 domain superfamily protein |
| BGEM | TNR | S2_226393146 | GRMZM2G106655 | Zm00001d007504 | Sterol 14-demethylase |
| BGEM | TNR | S5_212654036 | GRMZM5G878379 | Zm00001d018326 | MAP kinase kinase4 |
| BGEM | TRL | S8_3886189 | GRMZM2G153434 | Zm00001d008285 | PQ-loop repeat family protein/ transmembrane family protein |
| Combined | LRL | S1_55575549 | GRMZM2G072892 | Zm00001d029053 | Lung seven transmembrane receptor family protein |
| Combined | LRL | S2_229287344 | GRMZM2G333448 | Zm00001d007624 | Uncharacterized |
| Combined | LRL | S3_164391691 | GRMZM2G357926 | Zm00001d042434 | Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family |
| Combined | LRL | S6_107885930 | GRMZM2G392700 | Zm00001d037109 | Smr domain containing protein |
| Combined | LRL | S6_160702812 | GRMZM2G474546 | Zm00001d038809 | Protein kinase superfamily protein |
| Combined | LRL | S6_83217874 | GRMZM2G132644 | Zm00001d036386 | Transducin/WD40 repeat-like superfamily protein |
| Combined | LRL | S8_19122996 | GRMZM2G125020 | Zm00001d008782 | Protein MARD1 |
| Combined | PRL | S1_29197616 | GRMZM2G476914 | Zm00001d028294 | Ubiquitin-protein ligase |
| Combined | PRL | S1_52765942 | GRMZM2G145008 | Zm00001d028967 | Probable ADP-ribosylation factor GTPase-activating protein AGD14 |
| Combined | PRL | S2_222724929 | GRMZM2G094978 | Zm00001d007366 | Haloacid dehalogenase-like hydrolase domain-containing protein |
| Combined | PRL | S4_188518876 | GRMZM2G011919 | Zm00001d052544 | Putative CBL-interacting protein kinase family protein |
| Combined | PRL | S4_191579817 | GRMZM2G004699 | Zm00001d052651 | Probable xyloglucan endotransglucosylase/hydrolase protein 5 |
| Combined | PRL | S6_86338227 | GRMZM2G082874 | Zm00001d036454 | Plant-specific domain TIGR01589 family protein |
| Combined | PRL | S10_5757204 | GRMZM2G113073 | Zm00001d023434 | Rac GTPase activating protein 1 |
| Combined | PRL | S10_73160806 | GRMZM2G343519 | Zm00001d024469 | Glutaredoxin family protein |
| Combined | TNR | S3_164391691 | GRMZM2G357926 | Zm00001d042434 | Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family |
| Combined | TNR | S8_152319365 | GRMZM2G487776 | Zm00001d011637 | Hydroxyproline-rich glycoprotein family protein |
| Combined | TRL | S1_12165882 | GRMZM2G076062 | Zm00001d027731 | Evolutionarily conserved C-terminal region 2 |
| Combined | TRL | S1_229753355 | GRMZM2G413853 | Zm00001d032655 | Dicer-like 102 |
| Combined | TRL | S3_164391691 | GRMZM2G357926 | Zm00001d042434 | Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family |
| Combined | TRL | S3_21565733 | GRMZM2G309897 | Zm00001d039967 | Probable leucine-rich repeat receptor-like protein kinase |
| Combined | TRL | S4_191764922 | GRMZM2G142344 | Zm00001d052655 | SNARE associated Golgi protein family |
| Combined | TRL | S4_191764922 | GRMZM2G142342 | Zm00001d052654 | Germin-like protein subfamily 3 member 4 |
| Combined | TRL | S4_76384998 | GRMZM2G066599 | Zm00001d050294 | VQ motif-containing protein |
| Combined | TRL | S6_107885930 | GRMZM2G392700 | Zm00001d037109 | Smr domain containing protein |
| Combined | TRL | S6_83217874 | GRMZM2G132644 | Zm00001d036386 | Transducin/WD40 repeat-like superfamily protein |
| Combined | TRL | S8_19122996 | GRMZM2G125020 | Zm00001d008782 | Protein MARD1 |
Based on B73 RefGen_v2.
Based on B73 RefGen_v4.
Obtained from MaizeGDB and Gramene.
Fig. 4.Accuracy assessment of genome wide studies for five root traits, four genetic models and 11 scenarios. LRL: lateral root length; PRL: primary root length; TRL: total root length; TNR: total number of roots.