Literature DB >> 33963265

Genome-wide association studies of grain quality traits in maize.

Yunxiao Zheng1,2,3, Fan Yuan1,2,3, Yaqun Huang1,2,3, Yongfeng Zhao1,2,3, Xiaoyan Jia1,2,3, Liying Zhu1,2,3, Jinjie Guo4,5,6.   

Abstract

High quality is the main goal of today's maize breeding and the investigation of grain quality traits would help to breed high-quality varieties in maize. In this study, genome-wide association studies in a set of 248 diverse inbred lines were performed with 83,057 single nucleotide polymorphisms (SNPs), and five grain quality traits were investigated in diverse environments for two years. The results showed that maize inbred lines showed substantial natural variations of grain quality and these traits showed high broad-sense heritability. A total of 49 SNPs were found to be significantly associated with grain quality traits. Among these SNPs, four co-localized sites were commonly detected by multiple traits. The candidate genes which were searched for can be classified into 11 biological processes, 13 cellular components, and 6 molecular functions. Finally, we found 29 grain quality-related genes. These genes and the SNPs identified in the study would offer essential information for high-quality varieties breeding programs in maize.

Entities:  

Year:  2021        PMID: 33963265      PMCID: PMC8105333          DOI: 10.1038/s41598-021-89276-3

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


Introduction

Maize (Zea mays L.) has become one of the most important crops globally for food, feed, and fuel since it appeared and spread widely[1]. In the past few years, more and more people paid attention to the quality of maize grain due to the rapid development of animal husbandry and processing industry. However, the nutritional quality of maize grain remains poor, especially the deficiency of lysine in the maize grain, which can not meet the nutritional and health requirements of people[2]. Thus, the genetic enhancement of nutritional quality in maize grains is essential to increase the nutritional value and conduct high-quality maize breeding[2]. The results of genetic studies have indicated that variations in nutritional quality in maize grain characterize quantitative traits. Over the past two decades, genetic dissection of nutritional quality in maize kernels by classical QTL mapping has resulted in the identification of numerous nutritional quality QTLs. Mangolin et al.[3] detected 13 QTLs by QTL mapping of maize kernel oil content in F2:3 population. Liu et al.[4] detected seven QTLs associated with protein content, six QTLs associated with starch content, and five QTLs associated with oil content using F2:3 population and BC2F2 population. Wang et al.[5] detected 38 QTLs for maize grain quality traits using three RIL populations in three environments. To date, many genes related to maize proteins have been cloned, such as opaque1 (o1), floury4 (fl4) and Mucronate (Mc)[6-8]. Some genes such as linoleic acid1 (ln1), Oleic acid content1 (olc1), fatty acid desaturation 2 (fad2), and fad6 have been reported to influence oil content in maize[9-11]. A few starch content-related genes such as Shrunken1 (Sh1), Sh2 and Brittle2 (Bt2) have been identified[12,13]. Genome-wide association study (GWAS) is becoming a powerful tool to address interspecies genotype–phenotype association based on the development of next-generation sequencing technology. In maize, GWAS has made a significant progress in the past decade. For example, Li et al.[14] used GWAS to dissect the genetic architecture of oil biosynthesis in maize kernels. Luo et al.[15] used GWAS to detect 57 loci significantly associated with salt tolerance, and 49 candidate genes from these loci. It can be seen that GWAS is widely used in maize. However, there is still a huge problem about how to obtain phenotypic data accurately and quickly. Traditionally, phenotyping method for grain quality, such as chemical method, is not only laborious and time-consuming, but also damages the integrity of maize kernels. By contrast, Near Infrared Reflectance Spectroscopy (NIRS) is a fast, reliable, and non-destructive method. NIRS has been increasingly used in plant phenotyping measurements, such as maize kernel starch content[16] and wheat protein content[17]. Therefore, NIRS can fully measure maize grain nutritional quality. Although some potential grain quality genes and QTLs have been identified in maize, the genetic studies of grain quality are limited. In this study, we used NIRS to measure the main nutritional quality traits of 248 maize inbred lines and used 83,057 single nucleotide polymorphism (SNPs) markers to conduct GWAS. Our study was designed to accomplish the following objectives: (1) perform GWAS to identify SNPs responsible for moisture, protein, oil, starch and lysine contents in maize kernels, (2) compare our GWAS results with previous QTL mapping results, and (3) predict and identify candidate genes of these quality traits for future studies.

Results

Phenotypic variations of grain quality traits

The phenotypes of grain quality traits are shown in Tables 1 and 2 and Fig. 1. As displayed in Table 1, the results indicated that there were abundant phenotypic variations in the 248 inbred lines and all grain quality traits followed a normal distribution, which benefited the dissection of the genetic architecture of the grain.
Table 1

Statistical analysis of grain quality traits in different environments.

TraitEnvironmentRangeMeanSDSkewnessKurtosisCV(%)
Moisture content (%)2016BD4.24–12.788.031.600.17− 0.2519.88
2017BD3.78–11.346.971.310.130.0318.75
2016SJZ5.41–16.1710.521.830.03− 0.4017.38
2017SJZ3.16–11.776.591.790.41− 0.1027.21
Protein content (%)2016BD9.22–16.7111.961.280.380.1110.70
2017BD8.44–14.1211.231.270.07− 0.6211.31
2016SJZ8.85–15.7511.751.290.34− 0.0311.01
2017SJZ8.53–14.8011.361.390.34− 0.4112.23
Oil content (%)2016BD2.13–5.694.250.62− 0.410.4414.64
2017BD2.77–5.894.320.59− 0.04− 0.2413.75
2016SJZ2.39–6.244.720.620.380.6013.05
2017SJZ2.42–6.764.600.570.221.0912.32
Starch content (%)2016BD59.86–75.1269.162.37− 0.521.013.42
2017BD58.47–74.9469.522.46− 0.491.323.54
2016SJZ60.15–74.8468.832.65− 0.37− 0.033.86
2017SJZ61.24–74.8469.052.70− 0.17− 0.333.91
Lysine content (%)2016BD0.21–0.340.260.020.320.917.14
2017BD0.23–0.330.270.020.521.146.34
2016SJZ0.17–0.330.240.020.520.639.58
2017SJZ0.22–0.320.270.020.340.546.58

BD and SJZ stand for Baoding and Shijiazhuang.

Table 2

Analysis of variance (ANOVA) for grain quality traits.

TraitF-value
EnvironmentGenotypeEnvironment*Genotype\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${h}^{2}$$\end{document}h2(%)
Moisture content (%)594.97**4.72**1.22*74.19
Protein content (%)20.46**5.63**1.1379.94
Oil content (%)78.03**5.17**1.39**78.43
Starch content (%)3.39*5.49**1.0880.37
Lysine content (%)164.79**3.86**1.1570.16

*and ** are significant correlation at P < 0.05 and P < 0.01, respectively.

Figure 1

Correlation analysis of grain quality traits at BLUP. *, ** and *** are significant correlation at P < 0.05, P < 0.01, P < 0.001, respectively.

Statistical analysis of grain quality traits in different environments. BD and SJZ stand for Baoding and Shijiazhuang. Analysis of variance (ANOVA) for grain quality traits. *and ** are significant correlation at P < 0.05 and P < 0.01, respectively. Correlation analysis of grain quality traits at BLUP. *, ** and *** are significant correlation at P < 0.05, P < 0.01, P < 0.001, respectively. Significant correlations were detected among grain quality traits, except for the correlation between moisture content and protein content (p = − 0.13) and the correlation between oil content and lysine content (p = 0.079) (Fig. 1). In addition, starch content had significant negative correlation with protein content, oil content, and lysine content, whereas the correlation among the remaining traits was positive. Analysis of variance (ANOVA) indicated that highly significant variations for genotypes and environments were found (Table 2). However, the genotype-by-environment interaction was not significant except for the genotype-by-environment interaction of moisture content and oil content. The broad-sense heritability (, %) for grain quality traits across the four environments in the 248 inbred lines ranged from 70.16 (lysine content) to 80.37 (starch content), indicating the predominant role of genetic factors in determining these traits (Table 2). Overall, the grain quality exhibited significant genetic variations and it was suitable for association analysis.

Genome-wide association analysis

With the BLUP value of each grain quality trait, we conducted a GWAS with 83,057 genome-wide SNPs. In total, we detected 3, 7, 21, 8 and 10 SNPs to be significantly associated with moisture content, protein content, oil content, starch content and lysine content, respectively (Table 3, Fig. 2). For moisture content, three SNPs were located on chromosomes 1, 3, and 9, which individually explained 10.53%–23.16% of the phenotypic variation. For protein content, seven SNPs were located on chromosomes 1, 2, 3, and 4, which individually explained 5.45%–32.79% of the phenotypic variation. For oil content, 21 SNPs were located on chromosomes 1, 3, 4, 5, 6, 7, 8, 9, and 10, which individually explained the phenotypic variation of 9.04–31.24%. For starch content, eight SNPs were located on chromosomes 1, 3, 4, and 6, which individually explained the phenotypic variation of 3.77–23.61%. For lysine content, ten SNPs were located on chromosomes 2, 3, 4, 5, 8, and 9, which individually accounted for 5.71%–32.42% of the phenotypic variation.
Table 3

Analysis of correlated SNP with grain quality traits at BLUP.

TraitSNPChromosomePositionAlleleBinP-valuePVE (%)
Moisture content (%)1_615755716,157,557C/T1.017.72E-0510.53
3_1864867193186,486,719A/T3.065.62E-0519.09
9_97538609997,538,609A/G9.034.31E-0523.16
Protein content (%)1_60098312160,098,312A/G1.048.54E-0513.48
1_2153314091215,331,409C/G1.075.73E-059.19
2_43189825243,189,825C/T2.045.71E-0532.79
3_146211231,462,112A/T3.003.45E-055.45
3_146214731,462,147A/G3.003.45E-055.45
3_2137914863213,791,486C/G3.084.75E-0529.23
4_1705164604170,516,460C/G4.067.63E-0522.61
Oil content (%)1_19252635119,252,635A/C1.025.48E-059.08
1_60098266160,098,266C/T1.041.88E-059.60
1_69088187169,088,187A/C1.044.98E-0513.13
1_1907581421190,758,142C/T1.068.46E-0520.72
1_2037250361203,725,036C/T1.076.90E-0518.61
3_1331821283133,182,128C/T3.056.95E-0522.52
3_1917006513191,700,651A/G3.075.52E-0527.91
4_198356241,983,562C/T4.015.66E-0529.20
4_85966198485,966,198C/T4.057.09E-0524.09
5_1404171535140,417,153C/G5.041.35E-0514.23
5_1404172085140,417,208A/T5.041.35E-0514.23
5_1404172265140,417,226C/G5.042.14E-0513.49
5_1884660755188,466,075C/G5.057.54E-0514.36
6_398883863,988,838G/T6.002.52E-059.60
6_398885663,988,856G/T6.002.30E-059.56
7_25691920725,691,920A/G7.028.28E-0517.78
8_1604256958160,425,695A/G8.064.02E-0531.24
8_1604256988160,425,698A/G8.064.02E-0531.24
8_1604257078160,425,707C/G8.064.02E-0531.24
9_97538609997,538,609A/G9.031.44E-0517.51
10_12789530110127,895,301A/G10.043.47E-059.04
Starch content (%)1_60098266160,098,266C/T1.041.26E-0516.70
1_60098312160,098,312A/G1.041.02E-0516.64
3_146211231,462,112A/T3.007.94E-063.77
3_146214731,462,147A/G3.007.94E-063.77
3_1332060873133,206,087A/G3.054.79E-054.03
3_1332060963133,206,096C/G3.055.43E-054.16
4_1556100214155,610,021G/T4.063.70E-0523.61
6_1051566426105,156,642C/T6.049.22E-0518.69
Lysine content (%)2_1697336112169,733,611A/G2.069.07E-055.71
2_1878855822187,885,582C/T2.079.82E-0525.57
2_1878856022187,885,602A/C2.071.68E-0532.42
2_1974536212197,453,621A/T2.075.84E-0514.33
2_2184601942218,460,194C/G2.089.90E-057.67
3_2066053493206,605,349C/T3.085.39E-0515.78
4_20547939420,547,939C/T4.034.72E-0516.47
5_2150543195215,054,319A/T5.087.60E-0529.81
8_451241484,512,414C/T8.018.06E-0518.91
9_1535265989153,526,598A/G9.078.07E-0516.91
Figure 2

Manhattan plots of GWAS results for (a) moisture content, (b) protein content, (c) oil content, (d) starch content and (e) lysine content. Black solid lines correspond to the threshold (− log10 (p) = 4).

Analysis of correlated SNP with grain quality traits at BLUP. Manhattan plots of GWAS results for (a) moisture content, (b) protein content, (c) oil content, (d) starch content and (e) lysine content. Black solid lines correspond to the threshold (− log10 (p) = 4). Comparing the localization results, we found eight co-localized sites and the physical position between SNPs was not more than 200 Kb (Table 4). Notably, four co-localized sites between different traits were detected. 1_60098266-60,098,312 was associated with oil content, starch content and protein content, which explained 9.60%, 16.70%, 16.64% and 13.48% of the phenotypic variation, respectively. 3_1462112-1,462,147 was associated with starch content and protein content, which explained 3.77% and 5.45% of the phenotypic variation, respectively. 3_133182128-133,206,096 was associated with oil content and starch content, which accounted for 22.52%, 4.03% and 4.16% of the phenotypic variation, respectively. 9_97538609-97,538,609 was associated with moisture content and oil content, which explained up to 23.16% and 17.51% of the phenotypic variation, respectively.
Table 4

Co-localized SNPs of grain quality traits in natural population.

NumberIntervalTraitsSNPChrpositionAllelebinP-valuePVE (%)D-value (Kb)
11_60098266-60,098,312oil1_60098266160,098,266C/T1.041.88E-059.600.046
starch1_60098266160,098,266C/T1.041.26E-0516.70
starch1_60098312160,098,312A/G1.041.02E-0516.64
protein1_60098312160,098,312A/G1.048.54E-0513.48
22_187885582-187,885,602lys2_1878855822187,885,582C/T2.079.82E-0525.570.020
lys2_1878856022187,885,602A/C2.071.68E-0532.42
33_1462112-1,462,147protein3_146211231,462,112A/T3.003.45E-055.450.035
starch3_146211231,462,112A/T3.007.94E-063.77
starch3_146214731,462,147A/G3.007.94E-063.77
protein3_146214731,462,147A/G3.003.45E-055.45
43_133182128-133,206,096oil3_1331821283133,182,128C/T3.056.95E-0522.5223.968
starch3_1332060873133,206,087A/G3.054.79E-054.03
starch3_1332060963133,206,096C/G3.055.43E-054.16
55_140417153-140,417,226oil5_1404171535140,417,153C/G5.041.35E-0514.230.073
oil5_1404172085140,417,208A/T5.041.35E-0514.23
oil5_1404172265140,417,226C/G5.042.14E-0513.49
66_3988838-3,988,856oil6_398883863,988,838G/T6.002.52E-059.600.018
oil6_398885663,988,856G/T6.002.30E-059.56
78_160425695-160,425,707oil8_1604256958160,425,695A/G8.064.02E-0531.240.012
oil8_1604256988160,425,698A/G8.064.02E-0531.24
oil8_1604257078160,425,707C/G8.064.02E-0531.24
89_97538609-97,538,609moisture9_97538609997,538,609A/G9.034.31E-0523.160.000
oil9_97538609997,538,609A/G9.031.44E-0517.51
Co-localized SNPs of grain quality traits in natural population.

Candidate genes associated with significant SNPs

Previous study have shown that the correlation coefficient (r2) between SNP markers is less than 0.1, which is considered to be no correlation[18]. Therefore, we choose r2 = 0.1 as the LD decay distance. Candidate genes were predicted based on LD decay (r2 = 0.1) in the MaizeGDB genome browser. A total of 208 candidate genes were found and detailed descriptions were summarized in Table S1. The candidate genes can be classified into 11 biological processes, 13 cellular components, and six molecular functions. The number of candidate genes involved in the grain quality traits of moisture, protein, oil, starch and lysine contents was 77, 46, 103, 136 and 49, respectively. Among them, the candidate genes in biological processes were mainly concentrated in the cellular process and the metabolic process; the candidate genes in cellular component were mainly concentrated in organelle, cell and cell part; and the candidate genes in molecular function were mainly concentrated in catalytic activity and binding (Fig. 3). As for the KEGG analysis of the candidate genes, a total of 12 pathways were identified. These pathways included the biosynthesis of secondary metabolites, and that of amino acids, starch and sucrose metabolism, inositol phosphate metabolism, and phosphatidylinositol signaling system, which could be related to grain quality (Fig. 4). In addition, a protein classification analysis tool was used to classify candidate gene proteins, 46 of which matched the PANTHER database. A further analysis showed that these 46 proteins fell into nine categories (Fig. 5), which contained the largest number of proteins— metabolite interconversion enzyme (PC00262). Furthermore, we identified 29 candidate genes to be associated with grain quality (Table 5). Annotation information suggested that these candidate genes may control multiple traits during maize growth and development.
Figure 3

GO-second class of candidate genes.

Figure 4

Analysis of KEGG pathway based on candidate genes. (The figure was created by R version 3.6.1 based on KEGG pathway database www.kegg.jp/kegg/kegg1.html).

Figure 5

Protein classification of candidate genes.

Table 5

Putative candidate gene of grain quality traits.

TraitSNPCandidate geneGene IDRefGen_v2 Annotated Gene description
Moisture content (%)3_186486719GRMZM2G069024100,216,811Beta-glucosidase 11
1_6157557GRMZM2G032852100,383,301Putative calcium-dependent protein kinase family protein
GRMZM2G321041100,192,077Putative RING zinc finger Domain superfamily protein
Protein content (%)3_213791486GRMZM2G047129100,285,541Alpha-L-fucosidase 2
2_43189825GRMZM2G466833100,272,900Malate dehydrogenase3
GRMZM2G071714100,279,807Lipoyl synthase, Mitochondrial
Oil content (%)4_1983562GRMZM2G033544103,652,693Cyclopropane-fatty-acyl-pHospholipid synthase
8_160425695, 8_160425698, 8_160425707GRMZM2G433942100,191,906Palmitoyltransferase ZDHHC9
GRMZM2G134308100,384,778Putative Beta-14-xylosyltransferase IRX10L
Starch content (%)3_1462112, 3_1462147GRMZM2G175218100,192,000Beta amylase4
1_60098312, 1_60098266GRMZM2G082034100,284,904Beta-amylase
GRMZM2G347708103,644650Inactive beta-amylase 9
4_155610021GRMZM2G000520103,653910Ethylene-responsive Transcription factor ERF027
6_105156642GRMZM2G404453103,631308Ethylene-responsive Transcription factor ERF036
Lysine content (%)3_206605349GRMZM2G050570100,283397Threonine synthase
2_218460194GRMZM2G129209100,281245Omega-3 fatty acid Desaturase
GRMZM2G076307100,194354Glycosyltransferases
GO-second class of candidate genes. Analysis of KEGG pathway based on candidate genes. (The figure was created by R version 3.6.1 based on KEGG pathway database www.kegg.jp/kegg/kegg1.html). Protein classification of candidate genes. Putative candidate gene of grain quality traits.

Discussion

Genetic basis of grain quality traits

Maize grain quality traits are complex quantitative traits, controlled by main effect genes and lots of micro effect genes. In this study, there was a wide variety of grain quality traits in the natural population, which were normally distributed. To reduce the influence of the environment on the genotype, phenotypic BLUP values across four environments were used for association studies. Phenotypic correlations were observed among the five grain quality traits. For instance, oil content had significant positive correlation with protein content and significant negative correlation with starch content, which is consistent with previous results[4,19]. Meanwhile, starch content had significant positive correlation with protein content and lysine content, which is consistent with previous studies[20,21]. Moreover, all of the five grain quality traits had higher broad-sense heritability. Among them, the heritability for protein content, oil content, starch content and lysine content was higher than that in previous studies[21]. The above results indicated a stable genetic association among these grain quality traits of maize. It is well known that there are hard choices between yield and quality. Previous studies showed negative correlation between quality traits and yield[22]. Therefore, how to carry out quality breeding while continuing to improve the yield of maize will be a new subject for maize breeders in the twenty-first century. At present, maize quality breeding is mainly to increase protein content and improve the composition of base acids, especially to increase the content of essential amino acids such as lysine and tryptophan[23]. In this study, a total of 83,057 SNP markers were used to scan the whole genome, combined with moisture, protein, starch, oil, lysine content and other phenotypic traits and genotypes for association analysis. The purpose of this study was to find the main genes to control the quality traits of maize, and then to introduce the genes into the parents of maize high-yield hybrids from molecular level, and then to obtain high-yield and high-quality hybrids and to provide a theoretical basis for genetic improvement of the quality traits of maize.

Significant SNPs for grain quality traits

Nowadays many researchers at home and abroad apply linkage analysis to locating the QTL of regulating grain quality traits, but few GWAS studies are on grain quality traits. In addition, the QTL detected by linkage analysis and association analysis have consistency in position[24]. The GWAS analysis is performed with a Bonferroni correction, however this was found to be too strict for less significant trait associations. In order to better detect micro-effect polygenes and identify genetic sites, we reduced the significance threshold to -log10(P) = 4 for all traits[25]. In this study, a total of 49 SNPs significantly associated with grain quality were detected, and the phenotypic variation explained (PVE) value by a single SNP ranged from 3.77% (3_1462112 and 3_1462147 of starch content) to 32.79% (2_43189825 of protein content). In addition, four co-localized SNPs were detected by multiple traits and single phenotypic variation explained value over 3.77%, indicating that starch content, protein content, oil content, and moisture content are interrelated in the components of corn kernels. The SNPs detected in this study were compared with previous studies, and some SNPs were found to be located in the localized QTL confidence interval. Among them, five SNPs were located in the QTL interval with Zhang et al.[26], where 1_190758142 for oil content was located on chromosomes 1 (bnlg2086-umc1122 interval), 4_85966198 for oil content was located on chromosomes 4 (phi096-bnlg1755 interval), co-localized site 1_60098266-60,098,312 for protein content and starch content was located on chromosomes 1 (phi001-umc1988 interval). Two SNPs were located in the QTL interval with Wang et al.[27], where 1_19252635 for oil content was located on chromosome 1 (umc1685-umc1044 interval), and 1_190758142 for oil content was located on chromosome 1 (umc1395-umc2237 interval). In addition, 2_169733611 for lysine content was located on chromosome 2 (bnlg1138-umc1065 interval) of study by Zhang et al.[2]. Moreover, three SNPs were located in the QTL interval with Yang et al.[28], where 1_190758142 for oil content was located on chromosome 1 (umc1590-bnlg1556 interval), 3_213791486 for protein content were located on chromosome 3 (umc2275-umc1594 interval), 10_127895301 for oil content was located on chromosome 10 (umc1272-bnlg1839 interval). However, some SNPs were not found in previous studies. There are several reasons for these differences. First, the population in our investigation mightnot be different enough for grain quality. Second, different estimating methods also caused the variations. Third, these SNPs were newly discovered and needed testing further. All in all, the results of this study can serve as a reference for other studies.

Putative genes and pathways involved in grain quality

In this study, a total of 208 candidate genes were searched, of which 17 possible candidate genes for grain quality traits were predicted. For moisture content, three candidate genes were detected. GRMZM2G069024 encoded Beta-glucosidase 11, an important component of the cellulase system[29]. Previous studies reported that dehydration rapidly induced polymerization of AtBG1, a beta-glucosidase[30]. GRMZM2G032852 encoded putative calcium-dependent protein kinase family protein and GRMZM2G321041 encoded putative RING zinc finger domain superfamily protein. The two proteins emerged as key proteins in response to drought stresses in plants[31]. The three enzymes are closely related to moisture content, therefore, three candidate genes may affect moisture content by influencing enzymes activities and genes expression levels.For protein content, three candidate genes were detected. One of the candidate genes (GRMZM2G047129) encoded alpha-L-fucosidase 2. Alpha-L-fucosidase has been reported in only a few plants, such as Arabidopsis,[32] and pea[33]. It was reported that alpha-L-fucosidase can hydrolyze fucose residues from glycoproteins[34], thus this candidate gene may affect protein content. Another candidate gene (GRMZM2G466833) encoded malate dehydrogenase 3 (MDH), that is, one of the key enzymes to synthesize malic acid. MDH played a key role in many physiological metabolic pathways, such as C4 pathway, crassulacean acid metabolism, gluconeogenesis, tricarboxylic acid cycle and photosynthesis, linking the metabolism of sugars, proteins and lipids in the body[35]. Therefore, this candidate gene may affect protein content by influencing the metabolism of proteins. Finally, GRMZM2G071714 encoded lipoyl synthase (LS) that analyzes the final step of lipoyl cofactor biosynthesis[36]. Protein lipoylation was denovo lipoylation pathway in plastids, and two octanoyltransferases and one LS provided protein lipoylation autonomy to plastids of Arabidopsis[37]. Therefore, this candidate gene may affect protein content by influencing the protein lipoylation. For oil content, three candidate genes were identified. GRMZM2G433942 encoded palmitoyltransferase ZDHHC9. Serine palmitoyltransferase (SPT) is the key enzyme of sphingolipids biosynthesis, and sphingolipids are essential components of plant cells[38]. As can be seen, it has a certain effect on the oil content of maize. GRMZM2G134308 encoded putative beta-14-xylosyltransferase IRX10L. Xylose is a kind of glycosyl component widely found in plants. Plant glycosyltransferases are enzymes that are closely related to the metabolism of glycolipids, polysaccharides, glycoproteins, nucleic acids, plant secondary products, and so on[39]. Therefore, this candidate gene may affect oil content by influencing beta-14-xylosyltransferase activities and genes expression levels. GRMZM2G033544 encoded cyclopropane-fatty-acyl-phospholipid synthase that is synonymous with cyclopropane fatty acid (CFA) synthase. CFA is an important membrane fatty acid in the stress-resistant mechanism and the presence of CFA can enhance membrane rigidity[40]. CFA synthase is a key enzyme regulating synthetic CFA. Therefore, this candidate gene may affect oil content by influencing CFA synthase activities and genes expression levels.For starch content, five candidate genes were identified. Three of the candidate genes (GRMZM2G175218, GRMZM2G082034 and GRMZM2G347708) encoded beta-amylase, which was directly involved in the synthesis of starch and metabolic process of polysaccharides[41]. Two of the candidate genes (GRMZM2G000520 and GRMZM2G404453) encoded ethylene-responsive transcription factor (ERF), a member of a transcription factor family involved in plant growth and environmental stress responses[42]. In addition, ethylene has been shown to affect starch biosynthesis by influencing enzymes activities and genes expression levels involved in starch synthesis in maize[43]. For lysine content, three candidate genes were detected. GRMZM2G050570 encoded threonine synthase which catalyzed the terminal reaction in the biosynthetic pathway of threonine[44]. From a nutritional point of view, lysine and threonine were essential amino acids in maize, and GRMZM2G050570 may indirectly affect lysine content by influencing threonine synthase activities and genes expression levels.. GRMZM2G129209 encoded omega-3 fatty acid desaturase which was a key enzyme for α-linolenic acid (ALA) biosynthesis[45]. Moreover, previous studies demonstrated that ALA was a crucial component in storing lipids in plants[46]. Therefore, the gene may regulate grain lysine content indirectly by regulating lipid synthesis. Finally, GRMZM2G076307 encoded glycosyltransferases (GTs), which belong to a multi-member genes family. According to previous studies, GTs played a very important role in the growth and development of plants, such as regulating plant hormone levels, participating in the synthesis, modification and transportation of secondary metabolites in plants, and participating in plant defense reactions[47,48]. Therefore, the gene may regulate grain lysine content indirectly by regulating plant hormone and secondary metabolites synthesis. All in all, these candidate genes are closely related to grain quality and future work will include functional validation of these genes and illustrate the molecular mechanisms for controlling grain quality in maize plants.

Materials and methods

Association mapping panel and genotyping

The association panel consisted of 248 diverse lines, including some excellent backbone inbred lines in China and some high-quality inbred lines introduced from abroad. Details on 248 of these lines could be found in previous studies[20]. The DNA of all the maize inbred lines were extracted using CTAB method[49] and genotyped using Genotyping-By-sequencing (GBS) method[50]. The methods of SNP filtering and calculating linkage disequilibrium (LD) decay were described in previous studies[51]. A total of 83,057 SNPs were used and the LD decay was 120 kb (r2 = 0.1) in this study.

Field experiments and phenotyping investigation

All 248 maize inbred lines of the association panel were planted in four environments, that is, Baoding in Hebei Province in 2016 and 2017, and Shijiazhuang in Hebei Province in 2016 and 2017. In each environment, all the maize inbred lines were planted in a single row plot using a randomized block design with two replications. Each experimental plot consisted of a row length of 3 m and 0.6 m between adjacent rows. After maturity, all corns except the head and tail of each row were harvested. Perten DA7200 Near Infrared Grain Analyzer was applied to determinate the moisture, protein, oil, starch and lysine contents of maize. Each material was repeatedly measured two times.

Phenotype statistical analysis

The IBM SPSS 21.0 software was used to make the descriptive statistical analysis and the analysis of variance (ANOVA). The broad-sense heritability () for each trait was estimated as [52], where , and are genetic variance, genotype-by-environment interaction variance and error variance, respectively, is the number of environments, and is the number of replications. The “PerformanceAnalytics” package in the R software was used to perform correlations analysis. For each trait, BLUP value was evaluated by using the following mixed linear model in the “lme4” package of the R software[53]: Y = (1|rep%in%env) + (1|env) + (1|lines) + (1|env:lines), where Y stands for trait data, the parentheses indicate random effects, “1|” means groups, “:” means interactions, “lines” means all materials and “env” means the environment.

Genome-wide association study of grain quality traits

The BLUP value and 83,057 SNPs were used to conduct the GWAS by using FarmCPU model[54] implemented in the GAPIT package in the R software[55], with both K and Q matrix taken into account.

Prediction of candidate genes

Candidate genes were predicted based on the significant SNPs and their extension regions from 120 kb upstream to 120 kb downstream (LD decay) in the MaizeGDB (https://www.maizegdb.org/) genome browser B73 reference genome version v2. The MaizeGDB, NCBI (https://www.ncbi.nlm.nih.gov/) and Uniprot (https://www.uniprot.org/) were used to obtain annotation of candidate genes. Then these candidate genes were performed GO analysis on the GENE ONTOLOGY website (http://www.geneontology.org/). The KOBAS 3.0 website (http://kobas.cbi.pku.edu.cn/kobas3/?t=1) was used to performe Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis[56].

Statement

Experimental research and field studies complies with relevant institutional, national, and international guidelines and legislation.
  38 in total

1.  KEGG: kyoto encyclopedia of genes and genomes.

Authors:  M Kanehisa; S Goto
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  Best linear unbiased estimation and prediction under a selection model.

Authors:  C R Henderson
Journal:  Biometrics       Date:  1975-06       Impact factor: 2.571

3.  Quantitative trait loci for cell wall components in recombinant inbred lines of maize (Zea mays L.) II: leaf sheath tissue.

Authors:  M D Krakowsky; M Lee; J G Coors
Journal:  Theor Appl Genet       Date:  2005-12-15       Impact factor: 5.699

Review 4.  Lipid biosynthesis.

Authors:  J Ohlrogge; J Browse
Journal:  Plant Cell       Date:  1995-07       Impact factor: 11.277

5.  Genome-wide association study dissects the genetic bases of salt tolerance in maize seedlings.

Authors:  Xi Luo; Bingcai Wang; Shan Gao; Fei Zhang; William Terzaghi; Mingqiu Dai
Journal:  J Integr Plant Biol       Date:  2019-05-09       Impact factor: 7.061

6.  Sequence variation and genomic organization of fatty acid desaturase-2 (fad2) and fatty acid desaturase-6 (fad6) cDNAs in maize.

Authors:  V Mikkilineni; T R Rocheford
Journal:  Theor Appl Genet       Date:  2003-04-01       Impact factor: 5.699

7.  Opaque1 encodes a myosin XI motor protein that is required for endoplasmic reticulum motility and protein body formation in maize endosperm.

Authors:  Guifeng Wang; Fang Wang; Gang Wang; Fei Wang; Xiaowei Zhang; Mingyu Zhong; Jin Zhang; Dianbin Lin; Yuanping Tang; Zhengkai Xu; Rentao Song
Journal:  Plant Cell       Date:  2012-08-14       Impact factor: 11.277

8.  Multigene engineering of starch biosynthesis in maize endosperm increases the total starch content and the proportion of amylose.

Authors:  Lili Jiang; Xiaoming Yu; Xin Qi; Qian Yu; Sen Deng; Bing Bai; Ning Li; Ai Zhang; Changfu Zhu; Bao Liu; Jinsong Pang
Journal:  Transgenic Res       Date:  2013-06-06       Impact factor: 2.788

9.  Proteome phenotyping of acid stress-resistant mutants of Lactococcus lactis MG1363.

Authors:  Aurélie Budin-Verneuil; Vianney Pichereau; Yanick Auffray; Dusko Ehrlich; Emmanuelle Maguin
Journal:  Proteomics       Date:  2007-06       Impact factor: 3.984

10.  A novel glycosyltransferase catalyses the transfer of glucose to glucosylated anthocyanins in purple sweet potato.

Authors:  Hongxia Wang; Chengyuan Wang; Weijuan Fan; Jun Yang; Ingo Appelhagen; Yinliang Wu; Peng Zhang
Journal:  J Exp Bot       Date:  2018-11-26       Impact factor: 6.992

View more
  3 in total

Review 1.  The Importance of a Genome-Wide Association Analysis in the Study of Alternative Splicing Mutations in Plants with a Special Focus on Maize.

Authors:  Zi-Chang Jia; Xue Yang; Xuan-Xuan Hou; Yong-Xin Nie; Jian Wu
Journal:  Int J Mol Sci       Date:  2022-04-11       Impact factor: 6.208

2.  Genomic Regions Associated With Seed Meal Quality Traits in Brassica napus Germplasm.

Authors:  Gurleen Bhinder; Sanjula Sharma; Harjeevan Kaur; Javed Akhatar; Meenakshi Mittal; Surinder Sandhu
Journal:  Front Plant Sci       Date:  2022-07-14       Impact factor: 6.627

3.  Combining datasets for maize root seedling traits increases the power of GWAS and genomic prediction accuracies.

Authors:  Leandro Tonello Zuffo; Rodrigo Oliveira DeLima; Thomas Lübberstedt
Journal:  J Exp Bot       Date:  2022-09-12       Impact factor: 7.298

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.