Literature DB >> 36127632

Genome wide association analysis for yield related traits in maize.

Tingru Zeng1, Zhaodong Meng1, Runqing Yue1, Shouping Lu1, Wenlan Li1, Wencai Li1, Hong Meng1, Qi Sun2.   

Abstract

BACKGROUND: Understanding the genetic basis of yield related traits contributes to the improvement of grain yield in maize.
RESULTS: Using 291 excellent maize inbred lines as materials, six yield related traits of maize, including grain yield per plant (GYP), grain length (GL), grain width (GW), kernel number per row (KNR), 100 kernel weight (HKW) and tassel branch number (TBN) were investigated in Jinan, in 2017, 2018 and 2019. The average values of three environments were taken as the phenotypic data of yield related traits, and they were statistically analyzed. Based on 38,683 high-quality SNP markers in the whole genome of the association panel, the MLM with PCA model was used for genome-wide association analysis (GWAS) to obtain 59 significantly associated SNP sites. Moreover, 59 significantly associated SNPs (P < 0.0001) referring to GYP, GL, GW, KNR, HKW and TBN, of which 14 SNPs located in yield related QTLs/QTNs previously reported. A total of 66 candidate genes were identified based on the 59 significantly associated SNPs, of which 58 had functional annotation.
CONCLUSIONS: Using genome-wide association analysis strategy to identify genetic loci related to maize yield, a total of 59 significantly associated SNP were detected. Those results aid in our understanding of the genetic architecture of maize yield and provide useful SNPs for genetic improvement of maize.
© 2022. The Author(s).

Entities:  

Keywords:  Genome-wide association study; Maize; Marker-assisted selection; Quantitative trait nucleotide; Yield related trait

Mesh:

Year:  2022        PMID: 36127632      PMCID: PMC9490995          DOI: 10.1186/s12870-022-03812-5

Source DB:  PubMed          Journal:  BMC Plant Biol        ISSN: 1471-2229            Impact factor:   5.260


Background

As an important cereal and forage crop, maize plays an important role in sustaining global food security. Improvement of grain yield is a major and longstanding breeding goal for maize. Maize grain yield was determined by several yield-related traits, including grain yield per plant (GYP), ear length (EL), kernel row number (KRN), grain length (GL), grain width (GW), 100-kernel weight (HKW), and kernel number per row (KNR) [1]. Yield related traits possess higher heritability than grain yield and have great effects on improving grain yield [2]. They thus have attracted the attention of breeders in recent decades [3]. Nevertheless, our understanding of the molecular mechanisms underlying maize yield related traits is limited [4]. Identifying loci associated with yield related traits has become an essential topic in the molecular breeding practice of high yield maize which contributes to our understanding of the correlations between yield related traits at a molecular level. Up to now, some yield related traits genes have been cloned by studying mutants [5-7]. Unfortunately, most of the traits related to plant development and yield in mutants show negative effects, which limits the application of mutants in breeding [8]. Therefore, the alleles controlling yield related traits can be identified by linkage mapping and association mapping in natural variation populations. To date, a number of quantitative trait loci (QTL) for yield related traits in maize have been detected by linkage analysis. Liu et al. [9] detected four QTL controlling KRN in an F2 population and two QTL controlling KRN in a recombinant inbred line (RIL) population derived from the crossing of the maize inbred lines abe2 and B73. Using an intermated B73 × Mo17 Syn10 doubled haploid population, Zhang et al. [10] detected 100 QTLs for yield related traits and eight significant SNPs co-located within intervals of seven QTLs. Through linkage analysis, a PPR family gene ZmVPS29 regulating maize grain shape was successfully cloned according to genetic population which was constructed with maize inbred lines Huangzaosi and Lv28. Overexpression of ZmVPS29 could make the grain slender and significantly improve the yield per ear of maize [11]. However, QTL with small effects were difficult to identify since classical biparental populations generally lead to relatively low resolution [12]. Furthermore, some rare alleles are often neglected due to the lack of genetic diversity in biparental populations [13]. As a cost-effective tool for dissecting the genetic architecture of complex quantitative traits, genome-wide association studies (GWAS) provide a high-resolution approach for the identification of QTL and have been widely used for the examination of QTL for yield-related traits in crops [14]. Huang et al. [15] used high-density SNP data and GWAS method to analyze 950 rice varieties in the world, and identified 10 trait loci related to yield in rice. To better understand the molecular mechanism underlying yield, Li et al. [16] investigated four yield-related traits of 133 soybean landraces by GWAS method and the results revealed five candidate genes associated with yield-related traits. Maize had high genetic diversity and contains many rare alleles in genome, which is very suitable to study the genetic basis of yield-related traits by GWAS [17, 18]. Using the association panel composed of 240 maize inbred lines and recombinant inbred lines, Zhang et al. [2] identified 23 QTLs and 25 significant SNPs related to HKW, KRN and KNR, including a stable locus (PKS2) related to KRN, HKW and kernel shapes. Zhang et al. [10] Used a natural population and B73 × Mo17 syn10 doubled hybridized haploid population, detected 100 QTLs and 138 SNPs of yield related traits, and found that 8 important SNPs were located in the interval of 7 QTLs. Luo et al. [19] used the GWAS method to identify a QTL-YIGE1, which regulates ear length by affecting pistillate floret number. Overexpression of YIGE1 can promote the growth of female inflorescence meristem (IM), thereby increasing panicle length and grain number per row, thus increasing yield. The GWAS method has been used for detecting loci that control yield related traits in maize, such as grain yield per plant (GYP) [20], ear length (EL) [21], kernel row number (KRN) [22], kernel length (KL) [23], kernel width (GW) [23], 100-kernel weight (HKW) [24], and kernel test weight (KTW) [10]. Therefore, the yield related traits of quantitative trait nucleotides (QTNs) can be effectively identified by GWAS method, and will improve our understanding of the molecular mechanism underlying kernel yield formation in maize. Under the trend of increasing planting density and higher requirements for light energy utilization efficiency in modern breeding, the plant type of maize, such as tassel branch number (TBN), has a great correlation with the yield of maize [6]. At present, many genetic loci for the tassel branch number have been obtained by QTL mapping or GWAS analysis. Yi et al. [25] used F2:3 population with 266 families and RIL population with 301 families to locate QTLs for tassel length and tassel branch number, detected 15 and 16 QTLs respectively, of which 4 QTLs can be co-located by the two populations. Upadyayulia et al. [26] analyzed the tassel correlation traits of maize backcross population and detected 45 QTLs controlling the tassel correlation traits, of which the bnlg344-phi027 segment of bin9.02 can explain 14.6% of the phenotypic variation. The known ramosa1 (ra1) gene controlling the development of tassel is located in bin7.02 within the QTL interval. Using US-NAM population and CN-NAM population, 63 QTLs controlling the tassel branch number and 62 QTLs controlling the length of tassel were identified by linkage analysis, and 965 QTNs significantly associated with the tassel branch number were detected by association analysis [27]. In the present study, we used an association panel of 291 maize inbred lines to identify the significant SNPs related to yield related traits by GWAS in different environments. The objective of the study was to map SNPs that are significantly associated with yield related traits and identify the candidate genes involved in yield related traits. Our results will improve the understanding of molecular mechanisms underlying maize yield related traits and provide novel molecular markers that may be used by breeders to develop superior varieties.

Results

Yield related tarit phenotypic variations

Taking the average value of yield related traits in 3 years as phenotypic data, the six yield related traits were statistically analyzed. The six phenotypic traits GYP, GW, GL, KNR, HKW and TBN of 291 maize inbred lines showed an approximate normal distribution (Fig. 1), indicating that these traits were typical quantitative traits controlled by multiple genes. Among 291 maize inbred lines, the phenotypes of GYP, KNR and TBN quantitative traits showed great variation (CV was 42.37, 39.95 and 49.79% respectively), and the 6 yield related traits showed high broad-sense heritability, which were 0.62, 0.65, 0.71, 0.61, 0.76 and 0.83 respectively (Table 1).
Fig. 1

Phenotypic variation of yield related traits in 291 maize inbred lines

Table 1

Statistical analysis of yield related traits

TraitMeanMaxMinSDCV (%)H2
GYP49.92127.503.3921.1542.37%0.62
GW8.3430.604.970.9611.51%0.65
GL9.3212.476.101.0711.43%0.71
KNR26.4653.505.3310.5739.95%0.61
HKW25.1840.049.754.9319.59%0.76
TBN6.7021.001.003.3449.79%0.83

GYP grain yield per plant (g), GW grain width (cm), GL grain length (cm), KNR kernel number per row, HKW 100-kernel weight (g), TBN tassel branch number. The same as below

Phenotypic variation of yield related traits in 291 maize inbred lines Statistical analysis of yield related traits GYP grain yield per plant (g), GW grain width (cm), GL grain length (cm), KNR kernel number per row, HKW 100-kernel weight (g), TBN tassel branch number. The same as below

Group structure analysis of association penal

Based on the genotypes of 291 inbred lines, according to TASSEL5.0 software for cluster analysis, combined with the analysis results of Li et al. [28] on the population structure, 291 materials were clustered (Fig. 2). When 50% group attribute ratio was used as the basis for classification, 227 (78.0%) of 291 inbred lines were divided into 6 groups: Lüda red cob group (LRC), Tangsipingtou group (TSPT), Lancaster group (LAN), P group (P), Improved Reid group (IR) and Reid group (Reid); while the remaining 64 lines did not have clear group attribution characteristics and were classified as mixed groups (Mix). Among the seven groups, Lüda red cob group, Tangsipingtou group, Lancaster group, P group, Improved Reid group and Reid group, contain 10, 27, 39, 33, 26 and 92, materials respectively, accounting for 3.4, 9.3, 13.4, 11.3, 8.9 and 31.6% of the total materials respectively. Lüda red cob group, Tangsipingtou group, Lancaster group, P group, Improved Reid group and Reid group have been reported in previous studies, and they are all commonly used heterosis groups in maize breeding [29, 30]. The materials in the mixed population contained Chinese and foreign germplasm widely, so the association panel had a wide genetic basis and rich yield related variation loci.
Fig. 2

Cluster analysis of 291 maize inbred lines

Cluster analysis of 291 maize inbred lines

Genome wide association analysis of yield related traits

In total, 38,683 high-quality SNPs were used to perform GWAS for six yield related traits. MLM with PCA model was used to analyze the average values of yield related traits of 291 maize inbred lines in 6 environments. The GWAS results showed that a total of 59 significantly associated yield related SNPs were identified, and their p values were less than 0.0001 or could be detected in two yield related traits (Fig. 3 and Table 2). Among the significantly SNPs, 11 SNPs of GYP were detected, which were located on chromosomes 1, 2, 3, 7, 8, 9 and 10; 29 SNPs of GW were detected, which were located on all chromosomes; 4 SNPs of GL were detected, which were located on chromosomes 2, 7 and 10; 5 SNPs of KNR were detected, which were located on chromosomes 1, 6 and 7; 2 SNPs of HKW were detected, which were located on chromosomes 2 and 6; 11 SNPs of TBN were detected, which were located on chromosomes 1, 3, 4, 7, 8 and 10. At the same time, three of these SNPs can be detected in two different traits (bold SNPs, Table 2).
Fig. 3

Manhattan plot for genome-wide association study of maize yield related traits

Table 2

List of significant SNPs associated maize yield related traits and the candidate genes and their functional annotations

TraitSNPP valueR2%Candidate geneGene annotation
GYP9_1502572461.92E-057.77GRMZM2G330945Cold regulated gene 27 (COR27)
GYP7_1620016022.86E-058.50GRMZM2G151649ARM repeat superfamily protein
GYP3_1384196445.32E-057.01GRMZM5G803355MYB transcription factor
GYP2_293369015.90E-056.94GRMZM5G845736Inactive beta-glucosidase
GYP3_1384192036.10E-056.91

GRMZM5G803355

GRMZM2G585025

MYB transcription factor

Small RNA degrading nuclease 5

GYP3_74171577.15E-056.79GRMZM2G015267FAD/NAD(P)-binding oxidoreductase
GYP1_2090097447.45E-056.76GRMZM2G125557Auxin-repressed protein putative expressed
GYP2_668313367.76E-056.73GRMZM2G097406Unknown
GYP3_1387621789.58E-056.58GRMZM2G087590PsbP domain-containing protein 4
GYP8_290233379.72E-056.57GRMZM2G393347HOPZ-ACTIVATED RESISTANCE 1 (ZAR1)
GYP10_349386981.48E-046.26GRMZM2G003090Unknown
KW6_770816423.60E-1832.43

GRMZM2G328197

GRMZM2G376957

RING zinc finger domain superfamily protein

Histone H3-like 5

KW1_2991771962.17E-1732.56GRMZM2G110851Pentatricopeptide repeat-containing protein
KW3_1755692912.86E-1220.42GRMZM2G149662COV1-like protein
KW1_52,668,9691.42E-1018.58GRMZM2G174696Mitochondrial import receptor subunit TOM40–1
KW5_1772774111.02E-0815.00GRMZM2G492156MADS-box transcription factor 27
KW9_1546731012.50E-0814.28GRMZM2G092741ARATH AP-2 complex subunit alpha-2
KW6_674796692.53E-0813.00GRMZM2G430902C3HC4-type RING finger family protein
KW1_388583135.16E-0812.44AC204035.3Unknown
KW4_2380372476.29E-0812.28GRMZM2G166145Apoptosis-inducing factor homolog
KW10_1105334552.82E-0712.31GRMZM2G153215Membrane-anchored ubiquitin-fold protein 4
KW2_2235917284.83E-0710.70

GRMZM2G172101

GRMZM2G052507

Seryl-trna synthetase

Serine carboxypeptidase-like 45

KW2_369524546.26E-0711.67GRMZM2G102238PAP2 family domain containing protein
KW1_136823356.28E-0710.49GRMZM2G417455Beta-galactosidase 5
KW7_1309097426.31E-0710.49GRMZM2G464985Serine/threonine-protein kinase D6PKL1
KW4_1740676459.84E-0710.15GRMZM2G010933Cytochrome c oxidase copper chaperone 1
KW6_663639631.72E-069.72GRMZM2G110983Ubiquitin-conjugating enzyme E2
KW1_1548067521.76E-069.70GRMZM2G446047Trm32_arath protein trm32
KW5_1794405192.05E-069.58GRMZM2G00338460S ribosomal protein L6
KW1_763487522.30E-0610.62GRMZM2G174708Polygalacturonase inhibitor 1
KW6_193916832.56E-0610.54GRMZM2G319397Unknown
KW8_711704302.78E-0610.47GRMZM5G887975GATA transcription factor 19
KW2_541353172.80E-069.35GRMZM2G173218Unknown
KW2_2330336013.21E-069.24

GRMZM5G843555

GRMZM2G149935

Putative prolyl 4-hydroxylase 12

Hydroxyproline o-galactosyltransferase galt4

KW2_1537262473.51E-069.17GRMZM5G800842Ubiquitin-activating enzyme E12
KW7_1151630585.27E-068.86

GRMZM2G071059

GRMZM2G171408

CCR4-NOT transcription complex subunit 7

Spotted leaf protein 11

KW1_873043885.56E-068.82GRMZM2G046848U-box domain-containing protein kinase family
KW1_2336208486.16E-068.74GRMZM2G001850Nucleolin like 2
KW5_2048163476.48E-068.70GRMZM5G899787RNA-binding (RRM/RBD/RNP motifs) protein
KW8_658403066.66E-068.68GRMZM2G477340CDP-diacylglycerol--serine O-phosphatidyltransferase 2
KL2_2002485304.01E-057.18GRMZM2G435689Unknown
KL2_1923109527.72E-056.70GRMZM2G013892Zinc finger C3HC4 type domain containing protein
KL7_1372562609.02E-056.59GRMZM2G458164Glucan endo-1 3-beta-glucosidase precursor
KL10_349386981.51E-046.21GRMZM2G003090Unknown
KNR1_522534101.23E-059.29GRMZM2G128644VQ motif-containing protein
KNR1_2991771961.58E-059.09GRMZM2G110851PPR repeat domain containing protein
KNR6_770816423.10E-057.52

GRMZM2G328197

GRMZM2G376957

RING zinc finger domain superfamily protein

Histone H3-like 5

KNR6_1580993444.58E-058.25

GRMZM2G049091

GRMZM2G138067

Transcription initiation factor IIF beta subunit

Protein LURP-one-related 5

KNR7_135861758.49E-057.77GRMZM2G446921F-box domain containing protein
HKW2_2064277092.45E-058.72

GRMZM2G418343

GRMZM2G117900

Cell wall protein precursor putative

Translation initiation factor family protein

HKW6_676170184.05E-045.59GRMZM2G430902C3HC4-type RING finger family protein
TBN8_139,471,5882.00E-0914.76GRMZM2G101664Zinc finger protein
TBN8_894332926.73E-0710.25

GRMZM5G856067

GRMZM2G063676

Ribosomal protein L18ae family

heat shock 70 kDa protein 4

TBN1_290249221.07E-069.90

GRMZM2G153611

GRMZM2G180023

E3 ubiquitin-protein ligase ARI2

serine/threonine receptor-like kinase NFP

TBN10_42479001.98E-069.44GRMZM5G886547Non-specific lipid-transfer protein 3
TBN8_1481989542.07E-069.40GRMZM2G130586Unknown
TBN8_1391648944.92E-068.76GRMZM2G046037ubiquitin carboxyl-terminal hydrolase MINDY-2
TBN3_1603684307.87E-068.40GRMZM2G129114Nucleotide-diphospho-sugar transferase
TBN1_2700007811.41E-057.97GRMZM5G857351translational activator GCN1
TBN3_1800174391.20E-058.09GRMZM2G042295methyltransferase putative expressed
TBN4_2095978371.39E-057.98GRMZM2G094541Receptor-like serine/threonine-protein kinase SD1–6
TBN7_1754484191.32E-058.02GRMZM2G086628early nodulin-like protein 16

Bold represents the SNPs associated with two yield related traits

Manhattan plot for genome-wide association study of maize yield related traits List of significant SNPs associated maize yield related traits and the candidate genes and their functional annotations GRMZM5G803355 GRMZM2G585025 MYB transcription factor Small RNA degrading nuclease 5 GRMZM2G328197 GRMZM2G376957 RING zinc finger domain superfamily protein Histone H3-like 5 GRMZM2G172101 GRMZM2G052507 Seryl-trna synthetase Serine carboxypeptidase-like 45 GRMZM5G843555 GRMZM2G149935 Putative prolyl 4-hydroxylase 12 Hydroxyproline o-galactosyltransferase galt4 GRMZM2G071059 GRMZM2G171408 CCR4-NOT transcription complex subunit 7 Spotted leaf protein 11 GRMZM2G328197 GRMZM2G376957 RING zinc finger domain superfamily protein Histone H3-like 5 GRMZM2G049091 GRMZM2G138067 Transcription initiation factor IIF beta subunit Protein LURP-one-related 5 GRMZM2G418343 GRMZM2G117900 Cell wall protein precursor putative Translation initiation factor family protein GRMZM5G856067 GRMZM2G063676 Ribosomal protein L18ae family heat shock 70 kDa protein 4 GRMZM2G153611 GRMZM2G180023 E3 ubiquitin-protein ligase ARI2 serine/threonine receptor-like kinase NFP Bold represents the SNPs associated with two yield related traits

Candidate genes involved in yield related traits

The LD analysis results of this association panel showed that when r2 > 0.2, the LD decay with physical distance in our association panel was calculated to be 100 kb (Fig. S1). SNPs with significant correlation were screened out from GWAS. The yield related candidate genes within the LD range of significant association sites were found on the maizeGDB website (B73 RefGen_v4). A total of 66 candidate genes were identified in 59 SNPs controlling yield related traits, of which 58 had functional annotation (Table 2).

Discussion

Abundant phenotypic variations in the yield related traits

At present, GWAS method has been widely used to study the genetic basis of important traits of many species by calculating the association between genotypic and corresponding phenotypic variations [31]. In the study conducted by Zhang et al. [10], the population had a large phenotypic variation in ERN (ear row number), ranging from 9.00 to 20.10; in HKW, ranging from 14.84 to 41.75 g; in KNR, ranging from 14.50–35.05; in EGW (ear grain weight), ranging from 102.70–801.75 g. Meanwhile, in the study of Ma et al. [20], phenotypic variation of the association panel in the BLUE (best linear unbiased estimate) value of GYP was 42.2 g, CV 40%; the BLUE value of HKW is 26 g, CV 17%; the BLUE value of KNR was 15.87, CV 28%. Greater phenotypic variation would be beneficial for dissecting the genetic architecture of the yield related traits. Among the association panel composed of 291 inbred lines had a large phenotypic variation in GYP, GW, GL, KNR, HKW and TBN (Table 1), so the association panel was suitable for association analysis of yield related traits.

Genetic architecture of yield related traits

Crop yield is a complex quantitative trait. Understanding the genetic structure of maize yield is helpful to maize high-yield breeding. GWAS facilitates the identification of QTNs and candidate genes associated with the target traits. In this study, we performed GWAS using the association panels, including 291 inbred lines with 38,683 SNP markers, we obtained 59 significant SNPs (P < 0.0001) that were significantly associated with six yield related traits in maize. Among these SNPs, some overlapped with previously reported QTL/QTN intervals. The SNP 9_150257246 (Chr9: 150.25 Mb), 7_162001602 (Chr7: 162.00 Mb) and 1_209009744 (Chr1: 209.00 Mb) of GYP were mapped to the previously detected QTL Yqgypp9 (Chr9: 140.8–158.6 Mb), qgy-7.2 (Chr9: 161.51–165.72 Mb), the QTL detected in RIL population derived from lines DAN340 × K22 (Chr1: 208.36–209.3 Mb) [32-34]. the GYP-associated SNP 7_162001602 (Chr7: 162.00 Mb) was closely located with the SNP chr7.S_162987283 (Chr7:162.98 Mb) detected in the RIL population [34]. Four GW-related SNPs 2_36952454 (Chr2:36.95 Mb), 2_54135317 (Chr2:54.13 Mb), 3_175569291 (Chr3:175.56 Mb) and 5_177277411 (Chr5:177.27 Mb) were mapped to the previously reported intervals of the GW-related QTL on Chr2: 33.71–36.47 Mb, Chr2: 45.2–54.97 Mb, Chr3: 175.56–179.42 Mb and Chr5: 168.68–177.86 Mb [21, 34, 35]. 7_137256260 (Chr7: 137.25 Mb) that was associated with GL situated closely the interval of the GL SNP chr7.S_137701632 (Chr7: 137.70 Mb) identified in the RIL population [34]. The SNP 8_139,471,588, 8_139164894 and 8_148198954 of TBN were located closely and mapped to the previously detected QTL of TBN in qTBN8–1 (Chr8: 129.97–154.67 Mb) [22]. SNP 8_89433292 (chr8: 89.43 Mb) associated with TBN and located in the QTL interval of Q49CN-NAM (chr8: 73-101 Mb), which was positioned by Wu et al. [27] in TBN. The SNP 3_180017439 (chr3: 180.01 Mb) of TBN was closely linked to the SNP S3_179732428 (chr3: 179.73 Mb) and 179,982,665 (chr3: 179.98 Mb) of TBN [4, 27]. These yield related SNPs could be considered population-stable SNPs, which should be given close attention in MAS breeding for improving maize yield. In addition, several SNPs not found in previous studies might contribute to achieving high and stable yield in maize.

Pleiotropic loci affect yield related traits in maize

Pleiotropism is a common phenomenon that has been found in the QTL mapping and GWAS of multiple crops [36, 37]. According to combined linkage and association mapping, Zhang et al. [2] found 17 QTL/SNPs which had pleiotropism in yield related traits in maize. Liu et al. [23] investigated in an association panel and a biparental population, and also identified five pleiotropic QTLs for kernel traits, which implicating that a close genetic correlation existed among different kernel traits in maize. In our study, we identified 3 pleiotropic SNPs (pSNP) that have pleiotropic effect on different yield related traits (Table 2 bold SNP). Of these, pSNP 1_299177196 and 6_77081642 displayed a pleiotropic effect on GW and KNR. The SNP 1_299177196 associated candidate gene was GRMZM2G110851, which encoded a pentatricopeptide repeat-containing (PPR) protein. Chen et al. [11] cloned the PPR family gene Zmvps29 through linkage analysis, which can regulate the kernel width of maize and increase the kernel number per row. GRMZM2G110851 and Zmvps29 both belong to PPR family genes and have the same regulatory effect on maize kernel, suggesting that GRMZM2G110851 has a similar function with Zmvps29. The candidate gene GRMZM2G328197 of SNP 6_77081642 in GW and KNR encoded a RING zinc finger domain superfamily protein, which was previously reported to be significantly related to panicle length in rice and to have a positive role in seed germination in Arabidopsis [37, 38]. The pSNP 10_34938698 could be detected in both GYP and GL which associated with a candidate gene GRMZM2G003090, but its function was unknown. These pleiotropic SNPs detected in multiple yield related traits might be stable sites for regulating maize yield, which was helpful to understand the molecular mechanism of maize yield formation. Among these candidate genes in this study, some of them were previously reported to affect grain yield or kernel development, which were considered the top-priority candidate genes. The SNP 3_138419644 and 3_138419203 were both associated with GRMZM5G803355, which encoded an MYB transcription factor. Jia et al. [39] found that the expression of ZmMYBE1 in the two hybrids was higher than that in their parents, and considered that ZmMYBE1 was related to yield heterosis at the transcriptional level. The candidate gene of SNP 6_67617018 and 6_67479669 were GRMZM2G430902, which encoded a C3HC4 type ring finger family protein. The family genes were expressed in many tissues of Arabidopsis and maize during reproductive development, also played an important role in plant seed development [40]. The SNP 1_ 52,668,969 had a high P-value, which associated gene GRMZM2G174696 encoded a TOM40 protein. TOM40 was relatively conservative and had homologous genes in rice and maize. In Arabidopsis, AtTOM40 was essential for the normal structure of the mitochondrion, and participated in early embryo development and pattern formation through maintaining the biogenesis of mitochondria [41]. The candidate gene GRMZM2G304745 of SNP 2_23576028 encoded a leucine-rich repeat protein kinase family protein, overexpression of LRK (leucine-rich repeat receptor kinase) gene could increase the yield of rice [42]. GRMZM5G878070 encoded a ABC1-like kinase protein, overexpressing OsAGSW1 (ABC1-like kinase related to Grain size and Weight) exhibited a phenotype with a significant increase in grain size, grain weight, grain filling rate and 1000-grain weight compared with the wild-type and RNAi transgenic plants in rice [43]. GRMZM2G492156 encoded a MADS-box transcription factor 27 protein, overexpressing MADS-box showed new attributes such as the increase of vegetative growth and grain weight in maize [44]. GRMZM2G464985 annotated as a serine/threonine-protein kinase gene, was previously demonstrated to play vital roles in ear length, kernel number and enhance maize hybrids grain yield [45]. The SNP 8_ 139,471,588 had the most significant p-value in the TBN, which can explain 14.76% of the phenotypic variation of TBN. This locus was associated with GRMZM2G101664, which encoded a zinc finger protein. NSG1 encoded a member of the zinc finger protein family and was expressed mainly in the organ primordia of the spikelet in rice, which played a pivotal role in maintaining organ identities in the spikelet by repressing the expression of LHS1, DL, and MFO1 [46]. Maize ramosa1 (ra1) gene encoded a zinc finger transcription factor protein, which was involved in the regulation of tassel development in maize [47]. The zinc finger protein encoded by GRMZM2G101664 might also be involved in the development of maize tassel. The SNP 8_89433292 could explain 10.25% of the phenotypic variation and associated with GRMZM2G042295, which encoded a heat shock protein. HSP101 can participate in the regulation of tassel development at the post transcriptional level in maize [48]. The SNP 3_180017439 associated with GRMZM2G042295, which encoded a methyltransferase family protein. Wang et al. [49] found a methyltransferase family protein and played a key role in the regulation of secondary wall biosynthesis in interfascicular fibers during inflorescence stem development of Arabidopsis. These genes are considered to be reliable candidate genes for regulating yield related traits in maize, and further verification of their function will be helpful for further elucidating the underlying genetic and molecular mechanisms of yield related traits.

Conclusion

In this study, a genome-wide association study (GWAS) method was made on an association panel of 291 inbred lines. Using 38,683 high-quality SNPs, six yield related traits were analyzed by the MLM with PCA method. A total of 59 yield related SNP were detected, involving 66 candidate genes. In the future, it is expected to improve the accuracy of GWAS results by adding more representative inbred lines to expand the association population and identifying high-quality phenotypic data from multiple environmental trials. Our results will improve the understanding of the genetic and molecular mechanisms underlying maize grain yield as well as provide new molecular markers for breeders to develop superior maize varieties.

Method

Plant material and field experiments

An association panel of 291 wide range of genetic diversity maize inbred lines in China, was collected for GWASs. All the accessions were planted following a randomized block design of two replicates in 3 years (2017, 2018, 2019) Jinan in Shandong Province (E117°10′, N36°25′). Each material was planted in a row. The field experiments include in a single row 3 m in length, with 0.6 m between adjacent rows and 14 individual plants per row. The Maize Institute of Shandong Academy of Agricultural Sciences has established experimental field bases at Jinan. The field experiments were approved by the Maize Institute, and field management followed local maize management practices. The field studies did not involve endangered or protected species in this study. We declare that all plant materials comply with the ‘Convention on the Trade in Endangered Species of Wild Fauna and Flora’ in this study. The plant materials used in this study were conserved in our lab.

Phenotyping and data analysis

The phenotypic traits measured in this study included grain yield per plant (GYP), grain length (GL), grain width (GW), kernel number per row (KNR), 100-kernel weight (HKW) and tassel branch number (TBN). In GYP, the ears of each line were harvested after reaching maturity and 10 ears with consistent growth were selected for evaluation in each replication. In GL, GW and KNR, the phenotypes were represented by the mean values of 10 ears. In HKW, the average weight of three repeated measures of 100 randomly selected kernels from the bulked kernels of each line. TBN was the average number of tassel branch number of 10 random individual plants in each row. Excel 2016 and SPSS16 software were used to make statistical analysis on six traits, including GYP, GL, GW, HKW, KNR and TBN. The average values of each trait of 3 years were taken, and the standard deviation and coefficient of variation of each trait were calculated (Table S2). QTL IciMapping V4.1 was used to calculate broad-sense heritability (H) by ANOVA in software [50]. The coefficient of variation was calculated as CV(Coefficient of variation) = SD(Standard Deviation)/Mean [28].

DNA extraction and genotyping

Five maize plants were selected from each material, and their fresh leaves were used to extract genomic DNA. We extracted the genomic DNA followed the cetyltrimethylammonium bromide (CTAB) method [51]. All samples were quality checked and genotyped using the GenoBaits Maize 40 K chip [52]. Then, the successfully called SNPs with a missing rate of more than 20% and minor allele frequency (MAF) of < 0.05 were excluded from the genotyping dataset [53]. After that, 38,682 high-quality SNPs were used in further analysis (Table S1).

Genome-wide association studies

All the above phenotypic and genotypic data in the above associated population were used for GWAS. Based on high-quality SNPs, TASSEL 5.0 software was used to analyze the population structure of 291 inbred lines. Combined with the material pedigree, iTOL software was used to draw neighbor joining tree [54]. Using MLM with principal components analysis (PCA) model by TASSEL 5.0, we carried out GWAS for the six yield related traits investigated in this study [55]. The suggestive P value (0.05/N) was set as a significance threshold and N was calculated using the simpleM package in R to control false negatives [56].

Candidate genes identification

We examined the LD in the genomic region around each significant SNP to establish a supporting interval for the significant association. That supporting interval would comprise the surrounding region in LD (r2 > 0.2) [57]. The candidate genes in the LD region around significant SNPs were identified based on the B73 reference genome V3 from MaizeGDB (https://www.maizegdb.org/). Additional file 1. Additional file 2. Additional file 3. Additional file 4.
  53 in total

1.  Interactive Tree Of Life (iTOL): an online tool for phylogenetic tree display and annotation.

Authors:  Ivica Letunic; Peer Bork
Journal:  Bioinformatics       Date:  2006-10-18       Impact factor: 6.937

Review 2.  Genetic architecture of complex traits in plants.

Authors:  James B Holland
Journal:  Curr Opin Plant Biol       Date:  2007-02-08       Impact factor: 7.834

3.  Population structure and linkage disequilibrium of a mini core set of maize inbred lines in China.

Authors:  Ronghuan Wang; Yongtao Yu; Jiuran Zhao; Yunsu Shi; Yanchun Song; Tianyu Wang; Yu Li
Journal:  Theor Appl Genet       Date:  2008-08-12       Impact factor: 5.699

4.  Prediction and association mapping of agronomic traits in maize using multiple omic data.

Authors:  Y Xu; C Xu; S Xu
Journal:  Heredity (Edinb)       Date:  2017-06-07       Impact factor: 3.821

5.  Genome-wide dissection of the maize ear genetic architecture using multiple populations.

Authors:  Yingjie Xiao; Hao Tong; Xiaohong Yang; Shizhong Xu; Qingchun Pan; Feng Qiao; Mohammad Sharif Raihan; Yun Luo; Haijun Liu; Xuehai Zhang; Ning Yang; Xiaqing Wang; Min Deng; Minliang Jin; Lijun Zhao; Xin Luo; Yang Zhou; Xiang Li; Jie Liu; Wei Zhan; Nannan Liu; Hong Wang; Gengshen Chen; Ye Cai; Gen Xu; Weidong Wang; Debo Zheng; Jianbing Yan
Journal:  New Phytol       Date:  2015-12-30       Impact factor: 10.151

6.  Maize SBP-box transcription factors unbranched2 and unbranched3 affect yield traits by regulating the rate of lateral primordia initiation.

Authors:  George S Chuck; Patrick J Brown; Robert Meeley; Sarah Hake
Journal:  Proc Natl Acad Sci U S A       Date:  2014-12-15       Impact factor: 11.205

7.  Genome-wide association study of four yield-related traits at the R6 stage in soybean.

Authors:  Xiangnan Li; Xiaoli Zhang; Longming Zhu; Yuanpeng Bu; Xinfang Wang; Xing Zhang; Yang Zhou; Xiaoting Wang; Na Guo; Lijuan Qiu; Jinming Zhao; Han Xing
Journal:  BMC Genet       Date:  2019-03-29       Impact factor: 2.797

8.  Combined linkage and association mapping reveal candidate loci for kernel size and weight in maize.

Authors:  Derong Hao; Lin Xue; Zhenliang Zhang; Yujing Cheng; Guoqing Chen; Guangfei Zhou; Pengcheng Li; Zefeng Yang; Chenwu Xu
Journal:  Breed Sci       Date:  2019-06-27       Impact factor: 2.086

9.  Genetic characterization and linkage disequilibrium estimation of a global maize collection using SNP markers.

Authors:  Jianbing Yan; Trushar Shah; Marilyn L Warburton; Edward S Buckler; Michael D McMullen; Jonathan Crouch
Journal:  PLoS One       Date:  2009-12-24       Impact factor: 3.240

10.  Genome-wide association studies and whole-genome prediction reveal the genetic architecture of KRN in maize.

Authors:  Yixin An; Lin Chen; Yong-Xiang Li; Chunhui Li; Yunsu Shi; Dengfeng Zhang; Yu Li; Tianyu Wang
Journal:  BMC Plant Biol       Date:  2020-10-27       Impact factor: 4.215

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.