Literature DB >> 30038634

Genetic Diversity and Genome-Wide Association Study of Major Ear Quantitative Traits Using High-Density SNPs in Maize.

Xiao-Mei Zhu1, Xiao-Yu Shao1, Yu-He Pei2,3, Xin-Mei Guo2,3, Jun Li2,3, Xi-Yun Song2,3, Mei-Ai Zhao1,3.   

Abstract

Kernel and ear traits are key components of grain yield in maize (Zea mays L.). Investigation of these traits would help to develop high-yield varieties in maize. Genome-wide association study (GWAS) uses the linkage disequilibrium (LD) in the whole genome to determine the genes affecting certain phenotype. In this study, five ear traits (kernel length and width, ear length and diameter, cob diameter) were investigated across multi-environments for 2 years. Combining with the genotype obtained from Maize SNP50 chip, genetic diversity and association mapping in a set of 292 inbred lines were performed. Results showed that maize lines were clustered into seven subgroups and a total of 20 SNPs were found to be associated with ear traits significantly (P < 3.95E-05). The candidate genes identified by GWAS mainly encoded ubiquitin-activation enzymes (GRMZM2G015287), carotenoid cleavage dioxygenase (GRMZM2G446858), MYB-CC type transfactor, and phosphate starvation response protein 3, and they were associated with kernel length (KL) and ear diameter (ED), respectively. Moreover, two novel genes corresponding to RNA processing and fructose metabolism were found. Further, the SNPs detected by GWAS were confirmed by meta-QTL analysis. These genes and SNPs identified in the study would offer essential information for yield-related genes clone and breeding program in maize.

Entities:  

Keywords:  GWAS; candidate gene; genetic diversity; maize ear traits; meta-QTL

Year:  2018        PMID: 30038634      PMCID: PMC6046616          DOI: 10.3389/fpls.2018.00966

Source DB:  PubMed          Journal:  Front Plant Sci        ISSN: 1664-462X            Impact factor:   5.753


Introduction

Maize (Zea mays L.) is one of the most significant cereal crops worldwide and plays a crucial role in sustaining food security. In addition, forage crop and industrial energy require maize as a raw material. The wide demands of maize make grain yield a major breeding target. In the past century, maize grain yield has increased eight-fold with the majority of the yield being attributed to selection and hybrid breeding (Duvick, 2005; Xiao et al., 2016). Grain yield is a quantitative trait and easily affected by environmental factors (Messmer et al., 2009; Liu et al., 2014). Kernel and ear traits, kernel length (KL), kernel width (KW), ear length (EL), ear diameter (ED), and cob diameter (CD), are all important yield components in maize (Liu et al., 2014; Zhang et al., 2014), and KL was the most effective one among them in principal component analysis (PCA). Li X. et al. (2016) also found that KL and KW were positive correlation with the single ear yield and grain yield per unit significantly. Thus, it is useful to find the genes of these traits for breeding program. Genome-wide association study (GWAS) have been verified to be a powerful approach for identifying genes, alleles or haplotypes related to a certain agronomic traits under complex environments (Yan J.B. et al., 2011; Li X. et al., 2016), which is based on the linkage disequilibrium (LD) resulting from the association of target trait and haplotype loci. GWAS provide the opportunity to methodically analyze the genetic architecture of complex quantitative traits in many crops including maize and benefit from the high diversity and rapid LD decay in this species (Li et al., 2012). Using the 24,355 SNPs distributed in the whole genome of wheat, 38 SNPs were found to have high relationship with wheat height by GWAS analysis (Chen et al., 2015), 11 loci of which steadily expressed at least two environments. In maize, GWAS was also successfully identify numerous candidate genes controlling complex traits (Cui et al., 2016), such as plant height (Li X. et al., 2016), drought tolerance (Wang X. et al., 2016), disease resistance (Mammadov et al., 2015), stalk cell wall components (Li K. et al., 2016), ear height (Li X. et al., 2016), etc. Additionally, many QTLs about ear and kernel traits were mapped with linkage populations. Using a F2 population, Yang (2008) detected three consistent QTLs of maize ear diameter, which located in chromosome 2, 5, and 7 and explained 0.8, 1.5, and 0.52% of the phenotype variation, respectively. Liu et al. (2010) mapped QTLs about the number of panicles and the number of row grains with 239 recombinant intersections of Mo17 × Huangzaosi by composite interval mapping (CIM). Li et al. (2009) detected QTLs of KL on chromosome 1, 3, and 6, respectively, with a F2:3 population created from Qi 319 and Huangzaosi. Six QTLs identified by Liu (2013) were individually accounted for 1.18–12.92% of the phenotypic variation. Wang (2015) detected one QTL-qKL9 for KL by using 263 single plants of BC2F2 population, which explained 14.38% of the phenotypic variation. Qin et al. (2015) detected a total of seven QTLs distributing on chromosome 1, 4, 7, and 10, respectively. And the main effect QTL for KL, named as qklen1, was mapped on the physical location of 210–212 Mb on chromosome 1 using the average value of multi-environments. Qin et al. (2015) identified 22 QTLs about KW distributed on chromosomes 1, 2, 3, 4, 5, 6, 9, and 10, respectively, and among them, one QTL on chromosome 10 was further mapped on the physical location of 147 Mb. Also, another four QTLs for KW was detected on each of chromosome 4, 6, 9, and 10 using average value of multi-environments, and named as qkwid4, qkwid6, qkwid9 and qkwid10, respectively (Qin et al., 2015). These identified QTLs and genes were helpful for studying the mechanism of yield-related traits. However, the above studies often use biparental mapping populations, which could not reveal the genetic variation of broader genetic back-ground. Recently, some kernel related genes had been cloned. With Zheng 58 as the plant material, Xia et al. (2016) obtained a gene ZmMADS-RIN using the homologous cloning method, which has sufficient homology with the gene OsMADS6 related to kernels development in rice. Based on the yield-related gene OsGW2 in rice, Kong et al. (2014) cloned a maize homologous gene ZmGW2-1, which encoded E3 ubiquitin ligase protein and probably regulated the development of ear. A CLAVATA receptor protein locus on chromosome 4 was cloned, and its mutation led to the increase of meristem and ear row number (Bommert et al., 2013). Using a F2 population constructed by a near isogenic line, a 3 kb intergenic region at downstream of Unbranched3 (UB3) gene on chromosome 4 was found, which was responsible for the quantitative variation in kernel row number (KRN) by regulating UB3 expression (Liu L. et al., 2015). According to a key kernel size-related gene (OsGS5) in rice, a 981 bp gene segment ZmGS5 was obtained using homologous cloning method, which encoded protein sequence that belonged to serine carboxypeptidase in maize (Li et al., 2014). And this sequence was as highly as 75% homologous to the protein sequence encoded by OsGS 5 in rice. The results of bioinformatics analysis carried out by Tian et al. (2014) showed that the gene GRMZM2G070323 on chromosome 1 and GRMZM2G148539 on chromosome 5 in maize were homologous with a kernel length-related gene (OsPPKL) in rice. At present, most of yield-related genes of maize come from the homology genes in rice, so the study of GWAS and QTL mapping for maize yield-related traits is imperative. In this study, the phenotype of five ear traits for an association mapping panel consisting of 292 maize inbred lines were collected in 2015 and 2016. Then, QTLs associated with the five traits were identified with GWAS method, and candidate genes were also predicted. This study would provide useful insights into the genetic basis of related traits, and supply molecular tools for improving kernel size and grain yield in maize.

Materials and Methods

Materials

A total of 292 maize inbred lines (Supplementary Table ) were analyzed in the study, which were derived from four subgroups of China, Reid, Lvdahonggu, P group, and Sipingtou, as well as some tropical lines and sweet-waxy maize. All the materials were collected or bred by the maize molecular breeding team, Qingdao Agriculture University, China and were commonly used in maize breeding program.

Experimental Design and Phenotyping

The 292 maize inbred lines were grown at three locations of China in 2 years, which were Qingzhou, Shandong Province in 2015 and 2016 (QZ15 and QZ16), Luoyang, Henan Province 2015 (LY15), and Jiaozhou, Shandong Province 2016 (JZ16). Experiment was arranged in a randomized complete block design with three replications, and each inbred line was grown in a single row with 15 plants, 3 m in length, 0.6 m between adjacent rows, and 0.2 m between adjacent plants. The measure method of KL and KW were as described in Jiang et al. (2015). The field management followed normal agricultural practices. When harvest, five well-developed ears in the middle of each row were selected so as to minimize the boundary effect. A digital caliper (Guilin Guanglu Measuring Instrument Co., Ltd., CHINA) was used to measure KL, KW, EL, EW, and CD, in which EW and CD were measured at the middle of each ear and cob, KL and KW were measured with ten mixed and randomly selected kernels from each inbred line. To ensure accuracy, the data of each trait were determined with the average value of three replications.

Statistical Analysis of Phenotypic Data

SPSS20 statistical software (Armonk, NY, United States: IBM Corp) was used to calculate the phenotypic data, including Normal analysis of each trait and Pearson correlation analysis between traits and environments. The broad-sense heritability (h2) was calculated using the following formula: Where, VP is phenotype variation, VG is genetic variation, VE represents environmental variance. Basing on the formula, the smaller the variance of the environment was, the higher ratio the genetic variance in the phenotypic variance accounted for, thus the genetic variation is mostly inherited, and vice versa.

DNA Extraction and SNP Genotyping

Genomic DNA was extracted from the tender leaves at the six-leaf growth stage with the modified cetyltrimethylammonium bromide (CTAB) method (Chen and Ronald, 1999). A total of 56,110 SNPs were selected from the whole maize genome and anchored on a maizeSNP50 DNA chip, and then each inbred line was genotyped with the chip from Pioneer Dupont (United States). After the SNPs with missing rate >20% and heterozygosity >20% were excluded, 35,355 SNPs was kept to be analyzed further. Then, some SNPs with minor allele frequency (MAF) <0.05 also were excluded through genetic diversity analysis of maize population, and only 25,331 SNPs were left for GWAS analysis. The number of alleles and allele frequency of each SNP locus was calculated with PowerMarker V3.25 software[1] (Liu and Muse, 2005) .

Molecular Diversity, Linkage Disequilibrium and Population Structure Analysis

Genetic distance of the 292 inbred lines and LD of each chromosome were calculated with Cladogram and LD functions of Tassel 5.2.31[2] (Bradbury et al., 2007), respectively, and the neighbor-joining (NJ) cluster map constructed. LD level and its decay rate between each pair of SNPs on each chromosome was analyzed with the squared of Pearson correlation coefficient (r2). The calculation result was imported into Excel to create the LD decay plot. The molecular markers used for subgroups division by the Structure V2.3.4 software[3] (Pritchard et al., 2000; Evanno et al., 2005) were selected bases on the distance between the markers of r2 = 0.1. Bayesian cluster analysis of 292 maize inbred lines was carried out using the selected 1361 independent SNPs. Then, the reasonable subgroups number (K) of the population was inferred according to lnP (D) value in Structure V2.3.4. With the optimum K-values ranging from 1 to 10, the strong Markov Chain Monte Carlo (MCMC) after the non-repeated iteration was set to 10000 times at the beginning, and then set to 50000 times with the number of iterations set at 7. The probability of each inbred line grouped into a subgroup to determine the genetic composition of materials.

GWAS Analysis

Fixed and random model Circulating Probability Unification (FarmCPU) (Liu, 2015) was adapted for analyzing large data and calculation speed was quick. With this model, an iterative usage of fixed and random effects for powerful and efficient GWAS was developed to solve the mixed problem of false positive and false negative SNPs in MLM. Then, a total of 25331 SNPs were used for GWAS, with the genome-wide threshold of P = 1/total number of SNPs = 3.95E-05 (Wang et al., 2015; Bai et al., 2016; Ma et al., 2016). Further, to ensure the GWAS results with FarmCPU model, we did GWAS analysis using the compressed mixed linear model (CMLM) in GAPIT package (Lipka et al., 2012). The CMLM is a compression and optimization model based on MLM in Tassel.

Candidate Genes Mining

Based on the SNP locus which were significantly association with target traits, the genome sequence of the maize line B73 was used as the reference genome for selecting candidate gene (Schnable et al., 2009; Liu et al., 2016). The genes corresponding to each SNP locus were checked using the molecular marker database in MaizeGDB[4] according to the physical positions of the SNPs. Then, the functional annotations of candidate genes were predicted in NCBI[5]. The significant association regions were scanned for putative genes using IGV downloaded[6], and LD analysis of the linkage SNPs in 310-kb window were conducted using Haploview v.4.2[7].

Integration of Meta-QTL

Now many QTLs have been reported, but most of them are different due to different mapping populations, different analysis methods, and different environmental conditions, thus resulting in the problems of oversizing or overlapping of QTL position. In this study, we used the meta-analysis function in software BioMercator V4.2, to integrate the many QTLs related to ear traits into the IBM2 2008 Neighbors Map. Thus, many information of QTLs on maize ear traits were collected from the main literatures published from 2007 to 2016 basing on China National Knowledge Infrastructure (CNKI[8] and NCBI, which included the size and type of the mapping population, the markers’ genetic distance, mapping function, and the position, LOD value, contribution rate, and confidence intervals of QTLs. Because the maximum likelihood value, confidence interval, and contribution rate are important information of QTL (Okuda et al., 2007), the 95% confidence interval of each QTL is inferred from the two equations below before it was introduced into the IBM2 2008 Neighbors Map. Where, CI is the confidence interval, N represents the mapping population size, R2 represents the contribution ratio. And Equation 1 is suitable for backcross and F2 mapping populations, and Equation 2 is suitable for the recombinant inbred lines (Qi et al., 2011). So, the high-density genetic linkage map IBM2 2008 Neighbors was obtained. The specific position on the map successively contained 1, 2, 3, 4, and N “real” QTL(s) in five models of QTL given by the simulation operation, and the optimal model was judged by the value of the minimum Akaike-type criteria value (AIC) (Goffinet and Gerber, 2000). And the initial QTLs used for mata-QTL was not less than three.

Results

Genetic Diversity, Linkage Disequilibrium and Population Structure Analysis

Based on the 25331 SNPs for GWAS, we made relationship diagram among 292 maize inbred lines (Figure ). And 96.75% of the relationship values between two lines ranged between 0.25 and 0.75, and only 0.06% of them was zero, 3.19% >1, and 38.48% <0.5, which mean that most of the materials are relatively relevant to each other, with a few irrelevant. Analysis of genetic diversity. (A) Distribution of pairwise relative kinship for 292 maize inbred lines calculated using 25331 filtered SNPs. (B) Whole-genome LD in the entire panel based on 292 maize inbred lines. (C) ΔK-value of 292 inbred lines based on 1362 SNPs. (D) The Bayes cluster plot of 292 maize inbred lines when K = 7. The LD level of the whole genome of inbred lines was estimated using 25331 SNPs. Results showed that LD decayed differently in ten chromosomes, with chromosome 2 had the most rapid decay rate and chromosome 4 had the slowest. At a cut-off value of r2 = 0.1, the averaged LD decay distance of 292 maize inbred lines was approximately 310 kb (Figure ). The genetic distance between 292 maize inbred materials was calculated by Tassel 5.2.31, and a neighbor-joining clustering graph was constructed (Figure ). The entire materials were divided into seven groups, namely, Tangsipingtou, PA, PB, PC, Lvdahonggu, Lancaster, and an integrated groups. Tangsipingtou group mainly consists of inbred lines such as Chang 7-2, Lx 9801, H 21 as well as hybrids selected from Chang 7-2 hybridizing with other materials in this study. The PA group tends to Reid, including Zheng 58, Ye 478 etc. PB group mainly includes Ex, Qi 319, P 138, Qi 318, X 178 and so on. PC group tends to BSSS, mainly including B73. Lvdahonggu group mainly contains E28, Dan 340 and other materials. Most of the inbred lines were clustered into their corresponding subgroup, and the tropical lines and sweet-waxy maize derived from Philippines and Mexico were clustered into the integrated group. However, a few inbred lines belonging to Lancaster subgroup, such as Qi 205 and Ji 846, were diffused in other subgroups. Neighbor-joining (NJ) tree for the 292 maize inbred lines based on 25331 SNPs analysis data. The genetic diversity results analyzed with the Structure software are the same as that of NJ clustering. When K = 7, the 292 inbred lines could be grouped into seven large subgroups (Figures ).

Phenotype Statistics

The descriptive statistics for ear and kernel traits under three natural environments in 2 years are presented in Table . Abundant and large variation of the five traits, KL, KW, EL, ED, and CD, was observed in each location. But the variation values of each trait were different at different environments. For example, the variation for the KL in LY15 ranged from 65.04 to 135.96 mm (mean ±SD = 98.54 ± 10.40 mm), but it ranged from 67.30 to 117.95 mm (92.71 ± 9.04 mm) in JZ16. The H2 of the five traits was relatively high, ranging from 68.86 for EL to 85.88% for ED (Table ), indicating that a large portion of phenotypic variance for ear and grain traits could be attributed to genotypic effects. All the phenotypic data of every trait follow Normal distribution, as the absolute values of kurtosis and skewness among these environments were less than 1, thus they were suitable for QTL mapping (Figure ). Descriptive statistics and broad-sense heritability (H2) for five traits including kernel length (KL), kernel width (KW), ear length (EL), ear diameter (ED), and cob diameter (CD) among 292 accessions across four environments. Phenotypic distribution for measured traits of Luoyang in 2015. (A) KL, kernel length; (B) KW, kernel width; (C) EL, ear length; (D) ED, ear diameter; (E) CD, cob diameter.

Correlation Analysis

To verify the accuracy and consistency of the results, ANOVA was conducted to reveal the significant correlation among the five ear traits in 2 years (Table ). Among all the positive correlations in 2015, the highest value was found between ED and CD, while the lowest value was between ED and KW. In 2016, significantly positive correlations were obtained between all traits except KL and CD in Jiaozhou (Supplementary Table ). Phenotypic correlation coefficient estimates for each trait in Luoyang, 2015 (above diagonal) and Qingzhou, 2015 (below diagonal). Using FarmCPU, the phenotype data of five ear traits and the genotype data of 25331 SNPs were analyzed for GWAS. Quantile–quantile (Q–Q) plots implied that the population structure and kinship relationship were well controlled in the GWAS for each trait. The horizontal and the verticle axises show the values of -lg (Transformed expected P-value) and -lg (Transformed observed P-value), respectively (Figure and Supplementary Figures –). (A–E) Manhattan plots of (A) kernel length, (B) kernel width, (C) ear length, (D) ear diameter, (E) cob diameter for the 10 chromosomes carrying the significant markers detected by FarmCPU in Luoyang, 2015; (F–J) Quantile–Quantile (Q–Q) plots of the five traits in the same order with Manhattan plots showing expected null distribution of p-value assuming no associations, represented as solid red line; distribution of p-value observed. Twenty SNPs, significantly associated with the five ear-related traits, have been detected and distributed on eight chromosomes of maize (Table ). Among them, four SNPs, associated with KL dispersed on chromosome 3, 6, and 7 (2), were detected. Especially, the SNP locus PZE_107042407 on bin7.02 was detected both in JZ16 and LY15. For KW, three SNPs were detected only in LY15 and distributed on chromosomes 6 and 10 (2), respectively. Additional, one SNP locus associated with EL, seven loci associated with ED, and five associated with CD were also identified. Among all the SNPs related to ear traits in this study, the one related to CD on chromosome 1 was the most significant among them. All the details of SNPs and candidate genes were shown in Supplementary Table . Most significant marker loci associated with five traits over 2 years in three locations. We also analyzed the five traits using CMLM model in GAPIT package (Figure ). In the Q-Q plots of association studies, the signal above the Bonferroni correction line by FarmCPU was better than that by CMLM model in GAPIT, which suggested that candidate genes were difficult to be distinguished from the background noise by CMLM. Namely, CMLM reduces the detection efficiency of the associated sites with known candidate genes when compared with FarmCPU (Figure ). In our study, and the significantly association loci decreased to 26 when the CMLM was employed. (A–E) Manhattan plots and Q-Q plots of (A) kernel length, (B) kernel width, (C) ear length, (D) ear diameter, (E) cob diameter resulted by GAPIT in Luoyang, 2015. Manhattan plots and Q-Q plots of kernel length in Luoyang, 2015 resulted by (A) FarmCPU and (B) GAPIT.

Candidate Genes Predicting

Candidate genes containing SNPs associated with the five ear-related traits were identified, and their function were predicted basing on MaizeGDB and NCBI (Table ). SNP locus, PZE_107042407, which was detected in two environments of JZ16 and LY15 simultaneously, was located within the interval of gene GRMZM2G173943, whose functional annotation is MYB-CC type transfactor. Reported genes, Arabidopsis AtPHR1 and algal PSR1, are members of MYB-CC gene family and regulated phosphorous hunger signal pathway (Moseley et al., 2006; Li, 2007; Bournier et al., 2013). Besides, a gene related to ED (LOC103626417), responded to protein phosphoric acid hunger was discovered in this study. Three SNPs associated with KW were located within the intervals of gene GRMZM2G124502, GRMZM2G092475, and GRMZM2G057441, respectively, and the three genes are related to encoding SWIB complex BAF60b domain-containing protein, probable sodium/metabolite co-transporter BASS4, chloroplastic and ubiquitin-activating enzyme E1 2, respectively. What’s more, four candidate genes related to ubiquitin, GRMZM2G057441, GRMZM2G015287, GRMZM2G018798, and LOC100381748 were screened. Among them, GRMZM2G015287 correlated with ED and CD was detected in LY15 and QZ16 simultaneously. The genes located at the association regions flanked by three significant SNPs were also identified within the estimated 310-kb window (Table ). Two linkage genes at chr: 72.21–72.53 Mb were identified in the same LD block with PZE_107042407 (Figure ). Candidate genes for each significant SNP associated with traits and theirs encoding products. Candidate genes scanned within a approximately 310 kb extended region for three significant SNPs. (A) 310-kb association region of gene GRMZM2G173943 on chromosome 7. (B) The linkage disequilibrium blocks of the major SNP (PZE_107042407) within 310-kb window.

Meta-QTL Analysis

Information of 293 QTLs related to the panicle traits (Table ) were collected and used to construct the consistency map (Figure ). In the meta-analysis, the model with the minimum AIC value was chosen as the optimal one, then 20 meta-QTLs were obtained according to the selection criteria of at least three initial QTLs existing within one location. These meta-QTLs are mainly distributed on chromosomes 1(4), 3 (3), 4 (3), and 9 (5), respectively, with confidence intervals ranging from 4.2 to 15.13 cM (Table ). Then, these meta-QTLs and the SNPs associated with ear traits in this study was mapped according to the physical distance of SNP on both sides of MQTL tag, Results showed that five of the detected SNPs was located within the intervals of these meta-QTLs, such as locus PZE_101048890 in MQTL4 interval, PZE_103171163 in MQTL8, SYNGENTA6857 in MQTL12, and PZE_110067406 and PZE_110061773 were both in MQTL20 interval. These results further i verified the accuracy of the SNP loci related to ear traits in this study. Summary of the QTLs of ear traits in maize (2007–2016). “Consensus” QTL map integrated for ear and kernel traits in maize. Information of position of Meta-QTL by meta-analysis for ear and kernel traits in maize.

Discussion

Natural germplasm with a broad genetic base could be a potential resource for improving yield (Wang H. et al., 2016). Genetic diversity analysis of the germplasm available provides key information on heterosis exploitation and breeding strategies, especially in maize hybrid breeding. In the present study, a panel consisting of 292 inbred lines representing temperate germplasm from Huang-Huai-Hai region were separated into seven subgroups, Tangsipingtou, Lvdahonggu, PA, PB, PC, Lancaster and an integrated group based on NJ cluster analysis, which were consistent with the results of UPGMA tree analyzed with genetic distance by Rogers (1972). The integrated group mainly included the tropical lines from Mexico, and the other six were consistent with the subgroups of maize germplasm in China (Liu C.L. et al., 2015). In addition, because LD patterns of population structure are crucial for association mapping and selection of candidate genes (Liu et al., 2014), LD of the 292 inbred lines was also analyzed, with the average LD decay distance of about 310 kb, which was similar to the 391 kb of 367 inbred lines in Wu et al. (2014), but lower than the 643 bp in 240 temperate inbred lines (Liu et al., 2014) and the 0.50–0.75 Mb in 362 Southwest lines of China (Zhang et al., 2016). The lower LD of the materials in this study was due to that most of the 292 inbred lines were temperate germplasm, while it was still lower than that of other temperate population suggested that the these materials have some excellent genes related to plant resistance (Lu et al., 2011). Usually, a lot of QTLs for a certain trait have been detected with different populations in different environments, and most of them are not consistence with each other. In previous studies, Yang et al. (2016) detected six QTLs for KL and KW, with a minimum physical distance of 0.72 Mb between chr. 2-11541 and chr. 2-12258 on chromosome 2. Liu et al. (2014) detected 83 QTLs for KL and KW with a minimum physical interval of 1.37 Mb and average of 17.30 Mb between umc1165 and umc1265 on chromosome 2. Li et al. (2013) detected three QTLs for KL at bin 1.07, 4.08, and 9.03, respectively, and seven QTLs for KW at bin 1.04, 1.11, 2.07, 3.07, 4.03, 4.05, and 10.07, respectively. Previous reports also showed that bin 4.05 and bin 10.03 were important genomic regions for controlling maize yield-related traits, such as KL or kernel number per row (KNPR), KW and 10-kernel thickness (KT) (Peng et al., 2011). In this study, we used an interactive usage model, FarmCPU instead of MLM or CMLM, in GWAS, which could exclude the false positive associations exactly. And the Q–Q plots also suggested that the false positive associations in this study were well controlled for the GWAS of the five traits across different environments. Then, we filtrated 20 unique loci (SNPs) at P < 3.95E-05 level among 97 SNPs that were associated with five ear-related traits. The twenty consistent SNPs about ear and kernel traits were found at the same chromosome intervals by integrating the previously reported QTLs information, among which four SNPs for KL were located at bin 3.07, 6.04, and 7.02 (2), three for KW at bin 6.07, 10.00, and 10.07, one for EL at bin 7.03, seven for ED at bin 1.10, 3.09, 4.05, 5.03, 8.03 (2), and 10.03, and five for CD at bin 1.05, 1.08, 1.10, 1.12, and 8.03, respectively. Especially, the SNP for KL at bin 7.02 was common in two environments. Besides, another SNP at bin 8.03 (PZE_108042082) was detected both for ED and CD, indicating that ED and CD may have same genetic basis, which was supported by the significant correlations of the two traits in our research materials. Therefore, our results would provide important information for further fine mapping yield-related genes, thus to reveal corresponding molecular mechanisms. In addition, four functional genes detected are related to ubiquitin (Ub), which is widely existing in eukaryotes and highly conservative. The most important gene, GRMZM2G015287, encoded ubiquitin-conjugating enzyme E2N, and the other three separately encoded ubiquitin-activating enzyme E1 2, ubiquitin carboxyl-terminal hydrolase 13, and E3 ubiquitin protein ligase DRIP2. Ubiquitination pathway is an important regulatory process in plant biological activities including growth and development, response to biological and abiotic stress signals, etc. The covalent attachment of Ub to target proteins involves a three kinds of enzymes (ubiquitin-activating enzyme E1, ubiquitin-conjugating enzyme E2 and ubiquitin ligase E3) and a series of reactions via the transfer of thioester linkage between these enzymes (Ramadan et al., 2015). Previous studies have found that E3 is an important specific identification factor in regulating seed size during the process of ubiquitination (Hershko and Ciechanover, 1998). The gene GW2 in rice could decrease cell division, associated with grain width, grain weight and growth period, codes a RING-type protein with E3 ubiquitin ligase activity that locating in the cytoplasm and degradating zymolytes by anchoring them into proteasomes and ultimately decreasing cell division. The candidate genes mined in this study suggested that development of grain was related to ubiquitination pathway, then grain and related traits affect the maize yield. Jiang et al. (2015) found genes of ZmGS3-CHR1-1, ZmGS3-CHR1-2, ZmGS3-CHR2, ZmGS3-CHR7, ZmGS3-CHR4, and ZmGS3-CHR5 that homologous with ring E3 ubiquitin-protein ligase gene GS3 and GS2 in rice by integrating meta-QTLs of ear row number and grain weight reported in major journals over 1994–2012. The gene GW2 in rice could decrease cell division, and loss of GW2 function would result in a larger and wider spikelet hull, which accelerated grain milk filling rate and enhanced grain width, weight and yield (Song et al., 2007; Yan S. et al., 2011). WY3, alleles of GW2, can significantly increase grain width and 1000-grain weight, resulting in the output raise of single plant. D3, a gene that encoding ubiquitin E3, was associated with ear number and plant height (Ishikawa et al., 2005) and encoded an F-box leucine-rich repeat (LRR) protein which can inhibit the activity of rice tiller buds, maintain their dormancy and participate in the dark-induced leaf senescence process and hydrogen-induced leaf cell death process (Yan H.F. et al., 2007; Falcon de Longevialle et al., 2008). So, four functional genes detected are related to ubiquitin, which is widely existing in eukaryotes and highly conservative. Except that, two novel genes PRFB1 (GRMZM2G398608) encoding peptide chain release factor PrfB1 in chloroplast (AtPrfB1) and GRMZM2G103843 encoding fructokinase-2 were found. PRFB1 responses to the peptide chain termination codon UGA, which is necessary for the proper translation and stability of UGA-containing polycistronic transcripts in chloroplasts. So AtPrfB1 participates in the biological processes of plastid organization (Meurer et al., 1996), RNA processing and translational termination (Meurer et al., 2002). Fructokinase-2 encoded by gene FRK1 in maize, may play an important role in fructose metabolic process, thus maintaining the flux of carbon toward starch formation during seed development (Zhang et al., 2003; Riggs et al., 2017).

Conclusion

Genetic diversity and GWAS of maize ear traits were performed in a panel of 292 inbred lines. And 20 significant SNPs associated with kernel size and grain yield were detected using FarmCPU software. Among them, a candidate genes on chromosome 1 was related to ubiquitin, and two novel candidate genes, (GRMZM2G398608 and GRMZM2G103843) related with ED and CD, were also explored in the study. Bioinformatics analysis showed that gene PRFB1 (GRMZM2G398608) encode peptide chain release factor PrfB1 in chloroplast (AtPrfB1) and GRMZM2G103843 encoded fructokinase-2. Besides, a MYB-CC type transfactor and a gene encoding phosphate starvation response protein 3 were found to be associated with KL and ED, respectively. These results would be helpful for understanding the relationship between yield and the ear-traits in maize.

Author Contributions

X-YSo and M-AZ conceived and designed the study. M-AZ, X-MZ, X-YSh, Y-HP, X-MG, and JL performed the experiments, phenotyped the populations and field work. X-MZ and X-YSh contributed to genotype and phenotype data analysis. X-YSh conducted the meta-analysis. X-MZ drafted the manuscript. All authors read and approved the final manuscript.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Table 1

Descriptive statistics and broad-sense heritability (H2) for five traits including kernel length (KL), kernel width (KW), ear length (EL), ear diameter (ED), and cob diameter (CD) among 292 accessions across four environments.

TraitEnvironmentMinimumMaximumMean ± SDCV (%)SkewnessKurtosisH2(%)
KL2015LY65.04135.9698.54 ± 10.4010.550.0490.51579.51
2015QZ65.62121.9892.20 ± 9.3610.150.1960.503
2016JZ67.30117.9592.71 ± 9.049.75−0.004−0.093
2016QZ69.61121.0195.57 ± 9.219.64−0.053−0.313
KW2015LY64.21105.2085.26 ± 6.597.73−0.0930.14688.74
2015QZ66.17109.8186.21 ± 7.678.900.1730.211
2016JZ60.24101.4581.58 ± 7.338.99−0.1570.569
2016QZ55.42103.3483.70 ± 6.748.05−0.2370.845
EL2015LY79.00205.00133.60 ± 22.5216.860.3570.56768.86
2015QZ79.80181.40111.37 ± 20.5018.41−1.3620.433
ED2015LY24.3251.8939.54 ± 4.1610.52−0.0290.97985.88
2015QZ25.1052.2240.16 ± 4.1310.28−0.330.594
2016JZ26.9349.7837.16 ± 3.719.980.0960.386
2016QZ26.9546.8638.16 ± 3.509.17−0.326−0.124
CD2015LY17.2036.4727.03 ± 2.8410.51−0.0550.50485.18
2015QZ15.6035.4825.71 ± 3.0111.710.0040.719
2016JZ14.8734.0225.81 ± 2.8411.000.0260.219
2016QZ17.7733.1126.27 ± 2.8010.66−0.130.011
Table 2

Phenotypic correlation coefficient estimates for each trait in Luoyang, 2015 (above diagonal) and Qingzhou, 2015 (below diagonal).

TraitKLKWELEDCD
KL0.090ns0.295∗∗0.765∗∗0.274∗∗
KW0.204∗∗0.133ns0.183∗∗0.223∗∗
EL−0.104ns0.018ns0.442∗∗0.341∗∗
ED0.529∗∗0.341∗∗0.103ns0.738∗∗
CD0.255∗∗0.169∗∗−0.1190.770∗∗
Table 3

Most significant marker loci associated with five traits over 2 years in three locations.

TraitSNP IDChrBinAllelesP-valueAdditive effectEnv
KLPZE_10314152433.07C/T2.31E-14−3.36QZ16
PZE_10609924866.04C/T5.52E-092.35LY15
PZE_10704240777.02A/G9.93E-062.79JZ16
PZE_10704240777.02A/G2.23E-052.48LY15
KWPZE_1101055981010.07C/T7.38E-10−2.02LY15
SYN430966.07C/G6.09E-09−1.91LY15
PZE_1100002281010.00C/G1.54E-08−3.95LY15
ELPZE_10708162877.03A/G6.61E-079.31LY15
EDPZE_1100446051010.03G/T1.29E-10−1.07LY15
PZE_10804208288.03A/G3.75E-09−1.71LY15
PZE_10406934444.05T/C1.61E-08−1.27LY15
PZE_10317116333.09A/G2.07E-071.00JZ16
PZE_10412400344.09A/G4.67E-060.67LY15
PZE_10804208288.03A/G6.45E-05−1.75QZ16
PZE_10505933055.03A/G9.97E-05−0.76QZ15
CDSYN1347611.08A/C4.11E-08−0.79QZ16
PZE_10125515911.12C/T4.31E-08−0.72LY15
PZE_10111617711.05A/C3.86E-06−0.72QZ15
PZE_10804208288.03A/G6.64E-06−0.85LY15
PZE_10122385011.10C/T6.16E-05−0.47QZ15
Table 4

Candidate genes for each significant SNP associated with traits and theirs encoding products.

TraitChrSNP physical position (Mb)Gene IDEncoding
KL3197085927GRMZM2G446858Carotenoid cleavage dioxygenase
6152940683GRMZM2G067198Hypothetical protein
772,337,865GRMZM2G173943MYB-CC type transfactor
772,337,865GRMZM2G173943MYB-CC type transfactor
KW10146673417GRMZM2G124502SWIB complex BAF60b domain-containing protein
6165488800GRMZM2G092475Probable sodium/metabolite cotransporter BASS4,chloroplastic
101153678GRMZM2G057441Ubiquitin-activating enzyme E1 2
EL7136607023GRMZM2G074386Proteasome subunit beta type-1
ED1085027699GRMZM2G126007UPF0481 protein At3g47200
867245292GRMZM2G015287Ubiquitin-conjugating enzyme E2 N
4206956947LOC100381748Ubiquitin carboxyl-terminal hydrolase 13
4137099594GRMZM2G179810Adenine phosphoribosyltransferase 2
3218272659GRMZM2G156126Alpha/beta-hydrolases superfamily protein
GRMZM2G156158P-loop containing nucleoside triphosphate hydrolase superfamily protein
1275012730GRMZM2G018798E3 ubiquitin protein ligase DRIP2
GRMZM2G398608Peptide chain release factor PrfB1, chloroplastic
558165601LOC103626417Protein PHOSPHATE STARVATION RESPONSE 3
CD1249380954GRMZM2G103843Fructokinase-like 2, chloroplastic
1298903090GRMZM2G084176Putative pentatricopeptide repeat-containing protein
1139605674GRMZM2G014872SEC1 family transport protein SLY1
867245292GRMZM2G015287Ubiquitin-conjugating enzyme E2 N
1275011999GRMZM2G018798E3 ubiquitin protein ligase DRIP2
GRMZM2G398608Peptide chain release factor PrfB1, chloroplastic
Table 5

Candidate genes scanned within a approximately 310 kb extended region for three significant SNPs.

SNPBinMbGene IDDescription
PZE_1070424077.0272.21–72.53GRMZM2G064197myb family transcription factor-related protein
GRMZM2G173943MYB-CC type transfactor
GRMZM2G008657Putative uncharacterized protein DDB_G0287975
PZE_1080420828.0367.10–67.40GRMZM2G119316HSP40/DnaJ peptide-binding protein
GRMZM2G015287Ubiquitin-conjugating enzyme E2 N
PZE_11010559810.07146.51–146.81GRMZM2G28492Zinc finger C-x8-C-x5-C-x3-H type family protein
GRMZM2G86949Auxin response factor 11
GRMZM2G124502SWIB complex BAF60b domain-containing protein
GRMZM2G124495GLK52 transfactor
GRMZM2G124476Zinc finger C-x8-C-x5-C-x3-H type family protein;
GRMZM2G124466putative plastid-lipid-associated protein 13 chloroplastic
GRMZM2G125149Trafficking protein particle complex subunit 3
GRMZM2G423861Kinesin-like protein KIN-14I
GRMZM2G125083N-acylphosphatidylethanolamine synthase
GRMZM2G405662Heavy metal transport/detoxification superfamily protein
GRMZM2G107620Regulatory protein RecX family protein
GRMZM2G107557Plant cysteine oxidase 3
Table 6

Summary of the QTLs of ear traits in maize (2007–2016).

ParentsPop sizeTypeMethodReference
Dan232, N04258F9CIMLi et al., 2011
Dan232, N04220BC2F2CIMLi et al., 2007
PB260, PB266149F2:3CIMMendesmoreira et al., 2015
Mc, V671270F2:3CIMLiu et al., 2014
SK, Zheng58204F6CIMRaihan et al., 2016
Si287, Si144187F3CIMSun et al., 2013
Nongxi928161ILsICIMZhu et al., 2012
L26, 095186F2CIMDai et al., 2009
092, Jiao51400F2IMLü et al., 2008
Nongdan579DHCIMZhang et al., 2009
Nongxi110, 5395BC2F2CIMHu et al., 2010
Ye478, Nan340397F2:3CIMYang, 2008
Zong387-1294F9:10CIMBai, 2014
Table 7

Information of position of Meta-QTL by meta-analysis for ear and kernel traits in maize.

Meta-QTLChrPositionC.I.(cm)Left markerRight markerTriat
MQTL1165.0863.12–67.05mmp172pco135045KW
MQTL2172.9569.68–76.22IDP637geb7KL
MQTL31167.36164.58–170.13cdo1081aAY108650ED
MQTL41195.34192.07–198.62npi423agrr92aED
MQTL52151.09146.21–155.97les10umc1003ED
MQTL63107.31103.77–110.84bnlg1957uaz37ED
MQTL73143.05135.49–150.62IDP4818IDP7565ED
MQTL83204.96199.26–210.66T1-39(8995) (3)IDP9094ED
MQTL9436.6534.65–38.73cdo344bbx6EL
MQTL10454.637.5–63.1umc2276umc1738EL
MQTL114260.33255.41–265.24pco136360umc49jED
MQTL125221.53216.54–226.52pza03049umc2527ED
MQTL137129.39123.89–134.88agrc261npi47bED
MQTL14958.3551.34–60.64glk1gpm289cKW
MQTL15987.4486.12–88.76umc1921asn4KL
MQTL16992.691.52–93.69mHbrMT307_Mo17umc1494KL
MQTL179114.31100.6–120.4TIDP2493npi97bKW
MQTL189137.7131.45–143.95PZE-109109569SYN8851KW
MQTL199148.75139.45–158.05PUT-163A-94472707-4860SYN5732KL
MQTL201077.4475.34–79.54cdo482bnl17.08ED
  46 in total

1.  Quantitative trait loci: a meta-analysis.

Authors:  B Goffinet; S Gerber
Journal:  Genetics       Date:  2000-05       Impact factor: 4.562

Review 2.  The ubiquitin system.

Authors:  A Hershko; A Ciechanover
Journal:  Annu Rev Biochem       Date:  1998       Impact factor: 23.643

3.  Genetic Architecture of Natural Variation in Rice Chlorophyll Content Revealed by a Genome-Wide Association Study.

Authors:  Quanxiu Wang; Weibo Xie; Hongkun Xing; Ju Yan; Xiangzhou Meng; Xinglei Li; Xiangkui Fu; Jiuyue Xu; Xingming Lian; Sibin Yu; Yongzhong Xing; Gongwei Wang
Journal:  Mol Plant       Date:  2015-03-05       Impact factor: 13.164

4.  Suppression of tiller bud activity in tillering dwarf mutants of rice.

Authors:  Shinji Ishikawa; Masahiko Maekawa; Tomotsugu Arite; Kazumitsu Onishi; Itsuro Takamure; Junko Kyozuka
Journal:  Plant Cell Physiol       Date:  2005-01-19       Impact factor: 4.927

5.  Genome-based approaches to understanding phosphorus deprivation responses and PSR1 control in Chlamydomonas reinhardtii.

Authors:  Jeffrey L Moseley; Chiung-Wen Chang; Arthur R Grossman
Journal:  Eukaryot Cell       Date:  2006-01

6.  Genetic variation in ZmVPP1 contributes to drought tolerance in maize seedlings.

Authors:  Xianglan Wang; Hongwei Wang; Shengxue Liu; Ali Ferjani; Jiansheng Li; Jianbing Yan; Xiaohong Yang; Feng Qin
Journal:  Nat Genet       Date:  2016-08-15       Impact factor: 38.330

7.  Conserved domain structure of pentatricopeptide repeat proteins involved in chloroplast RNA editing.

Authors:  Kenji Okuda; Fumiyoshi Myouga; Reiko Motohashi; Kazuo Shinozaki; Toshiharu Shikanai
Journal:  Proc Natl Acad Sci U S A       Date:  2007-05-02       Impact factor: 11.205

8.  Arabidopsis ferritin 1 (AtFer1) gene regulation by the phosphate starvation response 1 (AtPHR1) transcription factor reveals a direct molecular link between iron and phosphate homeostasis.

Authors:  Marc Bournier; Nicolas Tissot; Stéphane Mari; Jossia Boucherez; Eric Lacombe; Jean-François Briat; Frédéric Gaymard
Journal:  J Biol Chem       Date:  2013-06-20       Impact factor: 5.157

9.  Identification and biochemical characterization of the fructokinase gene family in Arabidopsis thaliana.

Authors:  John W Riggs; Philip C Cavales; Sonia M Chapiro; Judy Callis
Journal:  BMC Plant Biol       Date:  2017-04-26       Impact factor: 4.215

10.  Genome-Wide Association Study for Plant Height and Grain Yield in Rice under Contrasting Moisture Regimes.

Authors:  Xiaosong Ma; Fangjun Feng; Haibin Wei; Hanwei Mei; Kai Xu; Shoujun Chen; Tianfei Li; Xiaohua Liang; Hongyan Liu; Lijun Luo
Journal:  Front Plant Sci       Date:  2016-11-29       Impact factor: 5.753

View more
  11 in total

1.  Genetic Basis of Maize Resistance to Multiple Insect Pests: Integrated Genome-Wide Comparative Mapping and Candidate Gene Prioritization.

Authors:  A Badji; D B Kwemoi; L Machida; D Okii; N Mwila; S Agbahoungba; F Kumi; A Ibanda; A Bararyenya; M Solemanegy; T Odong; P Wasswa; M Otim; G Asea; M Ochwo-Ssemakula; H Talwana; S Kyamanywa; P Rubaihayo
Journal:  Genes (Basel)       Date:  2020-06-24       Impact factor: 4.096

2.  The Multisite PeachRefPop Collection: A True Cultural Heritage and International Scientific Tool for Fruit Trees.

Authors:  Marco Cirilli; Sabrina Micali; Maria José Aranzana; Pere Arús; Annarosa Babini; Teresa Barreneche; Marco Bink; Celia M Cantin; Angelo Ciacciulli; José Enrique Cos-Terrer; Pavlina Drogoudi; Iban Eduardo; Stefano Foschi; Daniela Giovannini; Walter Guerra; Alessandro Liverani; Igor Pacheco; Thierry Pascal; Benedicte Quilot-Turion; Ignazio Verde; Laura Rossini; Daniele Bassi
Journal:  Plant Physiol       Date:  2020-07-29       Impact factor: 8.340

3.  Identification of quantitative trait loci for kernel-related traits and the heterosis for these traits in maize (Zea mays L.).

Authors:  Yinghong Liu; Qiang Yi; Xianbin Hou; Yufeng Hu; Yangping Li; Guowu Yu; Hanmei Liu; Junjie Zhang; Yubi Huang
Journal:  Mol Genet Genomics       Date:  2019-09-11       Impact factor: 3.291

4.  A combination of linkage mapping and GWAS brings new elements on the genetic basis of yield-related traits in maize across multiple environments.

Authors:  Xiaoxiang Zhang; Zhongrong Guan; Zhaoling Li; Peng Liu; Langlang Ma; Yinchao Zhang; Lang Pan; Shijiang He; Yanling Zhang; Peng Li; Fei Ge; Chaoying Zou; Yongcong He; Shibin Gao; Guangtang Pan; Yaou Shen
Journal:  Theor Appl Genet       Date:  2020-06-27       Impact factor: 5.699

5.  Genetic diversity and selection signatures in maize landraces compared across 50 years of in situ and ex situ conservation.

Authors:  Francis Denisse McLean-Rodríguez; Denise Elston Costich; Tania Carolina Camacho-Villa; Mario Enrico Pè; Matteo Dell'Acqua
Journal:  Heredity (Edinb)       Date:  2021-03-30       Impact factor: 3.821

6.  Association Mapping and Transcriptome Analysis Reveal the Genetic Architecture of Maize Kernel Size.

Authors:  Juan Ma; Lifeng Wang; Yanyong Cao; Hao Wang; Huiyong Li
Journal:  Front Plant Sci       Date:  2021-03-18       Impact factor: 5.753

Review 7.  Rhizobial-Host Interactions and Symbiotic Nitrogen Fixation in Legume Crops Toward Agriculture Sustainability.

Authors:  Ravinder K Goyal; Autar K Mattoo; Maria Augusta Schmidt
Journal:  Front Microbiol       Date:  2021-06-11       Impact factor: 5.640

8.  Grain Yields and Nitrogen Use Efficiencies in Different Types of Stay-Green Maize in Response to Nitrogen Fertilizer.

Authors:  Wen Fu; Yang Wang; Youliang Ye; Shuai Zhen; Binghui Zhou; Yin Wang; Yujie Hu; Yanan Zhao; Yufang Huang
Journal:  Plants (Basel)       Date:  2020-04-09

9.  Genome-Wide Association Study and Selection Signatures Detect Genomic Regions Associated with Seed Yield and Oil Quality in Flax.

Authors:  Frank M You; Jin Xiao; Pingchuan Li; Zhen Yao; Gaofeng Jia; Liqiang He; Santosh Kumar; Braulio Soto-Cerda; Scott D Duguid; Helen M Booker; Khalid Y Rashid; Sylvie Cloutier
Journal:  Int J Mol Sci       Date:  2018-08-06       Impact factor: 5.923

10.  Identification of Gene Associated with Sweetness in Corn (Zea mays L.) by Genome-Wide Association Study (GWAS) and Development of a Functional SNP Marker for Predicting Sweet Corn.

Authors:  Vinitchan Ruanjaichon; Kanogporn Khammona; Burin Thunnom; Khundej Suriharn; Chalong Kerdsri; Wanchana Aesomnuk; Arweewut Yongsuwan; Naraporn Chaomueang; Paradee Thammapichai; Siwaret Arikit; Samart Wanchana; Theerayut Toojinda
Journal:  Plants (Basel)       Date:  2021-06-18
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.