Literature DB >> 28796926

The genetic architecture of amylose biosynthesis in maize kernel.

Changsheng Li1,2,3, Yongcai Huang3,4, Ruidong Huang2, Yongrui Wu3, Wenqin Wang1.   

Abstract

Starch is the most abundant storage carbohydrate in maize kernel. The content of amylose and amylopectin confers unique properties in food processing and industrial application. Thus, the resurgent interest has been switched to the study of individual amylose or amylopectin rather than total starch, whereas the enzymatic machinery for amylose synthesis remains elusive. We took advantage of the phenotype of amylose content and the genotype of 9,007,194 single nucleotide polymorphisms from 464 inbred maize lines. The genome-wide association study identified 27 associated loci involving 39 candidate genes that were linked to amylose content including transcription factors, glycosyltransferases, glycosidases, as well as hydrolases. Except the waxy gene that encodes the granule-bound starch synthase, the remaining candidate genes were located in the upstream pathway of amylose synthesis, while the downstream members were already known from prior studies. The linked candidate genes could be transferred to manipulate amylose content and thus add value to maize kernel in the breeding programme.
© 2017 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.

Entities:  

Keywords:  zzm321990SNPzzm321990; amylose; genome-wide association study; kernel; maize; starch

Mesh:

Substances:

Year:  2017        PMID: 28796926      PMCID: PMC5787843          DOI: 10.1111/pbi.12821

Source DB:  PubMed          Journal:  Plant Biotechnol J        ISSN: 1467-7644            Impact factor:   9.803


Introduction

Maize is one of the most important crops in the world. Given the fact of ~10% protein and ~70% starch in the endosperm, it serves as a primary food for humans and feed for livestock (Nelson and Pan, 1995). There are mainly two forms of starch: the linear amylose and branched amylopectin. Amylose is composed of glucose residues linked through α‐1,4 bonds without any branch. Amylopectin contains both linear and branched glucose chains, 5% of which are joined via α‐1,6 bonds that introduce chain branches. The clusters of side chains allow amylopectin to fold into a dense and large molecule of glucose residues, which in turn increases kernel weight (James et al., 2003). Starch in the normal maize endosperm is approximately 25% amylose and 75% amylopectin (Nelson and Pan, 1995). The ratio of amylose to amylopectin plays an important role in appearance, structure and quality of food product and processing. The amylose content determines the starch gelling and firmness, whereas the amylopectin is primarily responsible for the formation of crystalline granules and thickening of paste (Whitt et al., 2002). In general, high amylose improves the product texture of starch and turns it into a source of slowly digestible carbohydrate (Collins, 1909; Kim et al., 1998; Stinard et al., 1993), while low amylose corresponds to higher peak of paste viscosity and strong resistance to retrogradation (Van Hung et al., 2006). Over the past decades, enormous progress has been made in understanding of the genetics and biochemistry of starch synthesis. The key enzymes involved in starch synthesis have been elucidated (Figure 1). Basically, the starch is synthesized by a suite of enzymes, including sucrose synthase (SUS), ADPglucose pyrophosphorylase (AGPase, the small unit encoded by brittle2 (bt2) and the large by shrunken2 (sh2)), soluble starch synthases (SSs) and granule‐bound starch synthase (GBSS, encoded by waxy (wx)), starch‐branching enzyme (BE, encoded by amylose extender1 (ae1)) and starch‐debranching enzyme (DBE, encoded by sugary1 (su1)) (Jeon et al., 2010). The starch synthesis pathway in maize endosperm is thought to begin with the cleavage of sucrose into fructose and UDP‐glucose, catalysed by SUS, the products of which are then converted into ADPglucose (ADPG) by AGPase. Amylose and amylopectin both use ADPG as the activated glucosyl donor for synthesis, but they are synthesized by different enzymes afterwards. GBSS is responsible for synthesis of amylose. Amylopectin biosynthesis requires a well‐coordinated machinery complex of enzymes including SSs, BE and DBE (Figure 1) (Hannah, 1997; James et al., 2003; Jeon et al., 2010; Keeling and Myers, 2010; Whitt et al., 2002). Lack of the BE enzyme in ae1 leads to accumulation of up to 50% amylose due to the less amylopectin production. Mutations at the waxy locus eliminate amylose synthesis, resulting in 100% amylopectin in the endosperm. Both mutants have been used in the corn breeding to create either high‐ or low‐amylose maize to alter starch properties and utility. Reduction in amylopectin has been accomplished with ae1 and su1 mutants (James et al., 1995).
Figure 1

The elucidated pathway of starch biosynthesis in maize. The model summarizes the elucidated key enzymes involved in starch synthesis. Basically, ADP‐glucose as the glucosyl donor for starch biosynthesis is activated by ADP‐glucose pyrophosphorylase (AGPase), which is composed of two large subunits (AGP LSU) and two small subunits (AGP SSU). In amyloplast, the amylose is synthesized by granule‐bound starch synthase (GBSS), while amylopectin biosynthesis requires three more coordinated enzymes of soluble starch synthase (SS), starch‐branching enzyme (BE) and starch‐debranching enzyme (DBE).

The elucidated pathway of starch biosynthesis in maize. The model summarizes the elucidated key enzymes involved in starch synthesis. Basically, ADPglucose as the glucosyl donor for starch biosynthesis is activated by ADPglucose pyrophosphorylase (AGPase), which is composed of two large subunits (AGP LSU) and two small subunits (AGP SSU). In amyloplast, the amylose is synthesized by granule‐bound starch synthase (GBSS), while amylopectin biosynthesis requires three more coordinated enzymes of soluble starch synthase (SS), starch‐branching enzyme (BE) and starch‐debranching enzyme (DBE). There is a dramatic variation in amylose content in the maize natural population ranging from zero (the waxy maize) to 64% (the amylomaize) (Mercier, 1973). Elucidating the genetic variation in starch biosynthesis and regulation is laborious based on traditional QTL mapping. Genome‐wide association study (GWAS) has become a promising way to decipher the causal relationship between genetic polymorphisms and biological traits in plants. The maize inbred lines via self‐cross are ideal materials to conduct GWAS. A number of agronomic traits in maize such as mercury accumulation (Zhao et al., 2017), starch content (Liu et al., 2016b), drought tolerance (Wang et al., 2016) have been successfully analysed. The GWAS identified 37 and 74 loci significantly associated with mercury accumulation and kernel oil concentration in maize, respectively (Li et al., 2013; Zhao et al., 2017). The recent GWAS for total starch content in maize kernel identified four significant SNPs and 77 candidate genes including a key gene of glucose‐1‐phosphate adenylyltransferase (Liu et al., 2016b). Still, the complete network and regulation involved in specific amylose or amylopectin biosynthesis is limited. Here, we are aiming to detect SNPs that are significantly associated with amylose accumulation, which may aid in enhancing the real value of starch in maize kernel. The further functionally validated candidate genes can be readily transferred to maize and other species using genetic engineering biotechnology.

Results

Natural variation in the amylose content

The amylose/amylopectin composition in starch is a complex quantitative trait affecting the quality and yield of maize endosperm. In order to improve the seed quality by manipulating target genes, we selected a mapping population of 464 lines from a natural‐variation germplasm pool with a wide genetic diversity. There was no obvious division of subgroups in our population, which could eliminate false positive and improve the power to recover meaningful associations. We measured the amylose content for the 464 maize lines including 454 natural inbred lines and 10 waxy lines that contain low amylose (Table S1) using a modified iodine colorimetry. There was a great variation in the amylose content ranging from 7.38% to 32.82% in the association panel. The overall distribution exhibited left‐skewed due to the presence of waxy maize materials (Figure 2). The average kernel amylose content was 25.93%, close to 25% in the normal maize seed.
Figure 2

The frequency distribution of amylose content. x‐axis shows the amylose content, and y‐axis shows the population frequency.

The frequency distribution of amylose content. x‐axis shows the amylose content, and y‐axis shows the population frequency.

Linkage disequilibrium

Developed by Lai's group (Liu et al., 2016a), 9 007 194 single nucleotide polymorphisms (SNPs) with a minor allele frequency (MAF) more than 0.05 could cover the whole maize genome and are theoretically sufficient for efficient GWAS analysis in maize with the genome size of 2 300 MB. We took advantage of these qualified SNPs as input data to calculate the genome‐wide linkage disequilibrium (LD) in the association panel. We found a rapid decline in LD with the increasing physical distance on all chromosomes. The mean length of LD decay decreased rapidly to 15 kb at a cut‐off of r 2 = 0.2 (Figure 3). The decay rate varied among chromosomes with a shortest LD (150 kb, r 2 = 0.1) estimate observed on chromosome 6 and the longest LD (450 kb, r 2 = 0.1) observed on chromosome 7. The overall LD decay distance was 250 kb (r 2 = 0.1) across the entire genome (Figure 3).
Figure 3

Linkage disequilibrium decay on maize chromosomes and whole genome. The red dashed horizontal line shows the LD threshold for the association panel (r 2 = 0.1).

Linkage disequilibrium decay on maize chromosomes and whole genome. The red dashed horizontal line shows the LD threshold for the association panel (r 2 = 0.1).

Loci associated with the amylose content

GWAS was performed to associate the phenotype of amylose content with the genotype of 464 diverse inbred lines. Given the population structure and familial relatedness in the natural population, we applied the mixed linear model (MLM) to decrease the false‐positive error. Quantile–quantile plots (Figure 4a) showed that the observed distribution did not deviate from the expected line, indicating that the confounders did not have significant influence. However, the curved tail represented a small number of true associations with the amylose content among thousands of unassociated SNPs. Association analysis identified 42 SNPs significantly associated with the amylose content at the minimum of P ≤ 5 × 10−8 (Figure 4b and Table S2). Due to the overall LD decay distance being 250 kb (r 2 = 0.1) across the entire genome (Figure 3), a 250‐kb region flanking the left and right sides of each SNP was defined as a QTL. 352 genes identified within the defined QTLs are summarized in Table S2. Of them, 39 candidate genes at 27 QTLs were implicated in carbohydrate metabolism and regulation (Table 1). The most significant SNP (chr9.S_23283117) was linked with waxy1 (~22 kb away of GRMZM2G024993) (Figure 4b), a key gene encoding GBSS for amylose biosynthesis (Shure et al., 1983). Four SNPs exist in gene regions, namely two (chr9.S_145846544) in the locus of GRMZM2G425683 (Figure 5a), one (chr4.S_160268689) located in GRMZM2G110483 (Figure 5b) and one in AC204428.4_FGT004 with unknown function yet. GRMZM2G425683 encodes a major facilitator superfamily protein (MFS‐like) responsible for sugar transport. The two SNPs in GRMZM2G425683 create a stop codon and a nonsynonymous amino acid substitution (Figure 5c). The allele of C was greatly correlated with a high amylose content, while the allele of G was linked with a low level of amylose (Figure 5e). Surprisingly, GRMZM2G425683 was specifically expressed in roots, but not in the seeds (Stelpflug et al., 2016). GRMZM2G110483 encodes a pentatricopeptide repeat‐containing protein (PPR‐like), which is highly expressed in the endosperm. The SNP in the PPR‐like gene causes a C‐to‐T mutation leading to an Ala‐to‐Val amino acid substitution (Figure 5d). The allele for high amylose is C compared with the T allele for low amylose (Figure 5f). Still, further investigation is required to test whether GRMZM2G425683 and GRMZM2G110483 affect the amylose content.
Figure 4

Quantile–quantile and Manhattan plots for the association study of amylose content in maize kernels. (a) Quantile–quantile for amylose content. (b) Manhattan plot of amylose content. The dashed line indicates the significance threshold of P‐value 5 × 10–8. 27 unique SNPs are labelled with red dots, and the corresponding genes in the amylose/amylopectin of carbohydrate metabolic pathway are highlighted.

Table 1

SNPs and candidate genes significantly associated with amylose content

SNP IDAllele P valueAssociate genesAnnotationSymbol
chr3.S_139436833C/T2.06E−08GRMZM2G174769Putative galacturonosyltransferase‐like 9GAUT‐like9
chr3.S_139499410A/G2.22E−09GRMZM2G340756MYB‐related‐transcription factorMYBR14
chr3.S_140663351A/G4.61E−08GRMZM2G402368Alpha‐galactosidase 3GLA3
chr3.S_141284190C/T2.77E−08GRMZM2G073584Beta galactosidase 9lacZ9
chr3.S_141284190C/T2.77E−08GRMZM2G067171Putative GATA transcription factorGATA31
chr3.S_149274688A/T3.12E−08GRMZM2G127789Glutathione S‐transferase GST 29GST29
chr4.S_125184357A/G1.43E−09GRMZM2G106741Sucrose transporter 6SUT6
chr4.S_125294997G/T7.84E−12GRMZM2G312806Mitochondrial transcription termination factormTERF
chr4.S_159118307A/G1.57E−09GRMZM2G026024PhosphoribulokinasePRK
chr4.S_160268689C/T2.89E−11GRMZM2G110483Pentatricopeptide repeat‐containing proteinPPR‐like
chr4.S_160268689C/T2.89E−11GRMZM2G589696DOF‐transcription factor 43dof43
chr4.S_160268689C/T2.89E−11GRMZM2G143804PhosphoribulokinasePRK
chr4.S_163058675A/G2.19E−08AC186147.3_FG008Alpha‐6‐galactosyltransferasex34.3
chr4.S_165621095C/T4.31E−08GRMZM2G122846bZIP transcription factorbZIP
chr4.S_25408622C/G3.08E−08GRMZM5G809417Nudix hydrolase 15NUDT15
chr4.S_37345698A/G4.76E−09GRMZM2G008482Orphans transcription factorOrphans
chr4.S_37345698A/G4.76E−09GRMZM2G363540Glutathione S‐transferase GST 26GST26
chr4.S_46702131C/G1.09E−09GRMZM2G100583NAC domain transcription factorNAC75
chr4.S_46744218C/T2.49E−10GRMZM2G129090UDP‐glycosyltransferase 91D1UGT91D1
chr5.S_149999559C/T3.71E−08GRMZM2G173674PhosphoglucomutasePGM
chr5.S_202171324A/C4.45E−08GRMZM2G164912Galactan beta‐1,4‐galactosyltransferaseGALS1
chr5.S_24096526A/G4.71E−09GRMZM2G085747Malic enzymeME
chr5.S_67474748A/C3.28E−08GRMZM2G128929L‐lactate dehydrogenaseLDH
chr5.S_67474748A/C3.28E−08GRMZM2G089836Invertase 2INV2
chr5.S_83843824A/G1.48E−08GRMZM2G449843Alpha/beta‐HydrolasesABHD
chr8.S_110374862C/T4.87E−08GRMZM2G0390173‐beta‐glycosyltransferaseB3GALT
chr9.S_138336082C/T4.52E−08GRMZM2G118979Alpha/beta‐HydrolasesABHD
chr9.S_138363042C/G2.40E−08GRMZM2G095727Two‐component response regulator‐like PRR73ARR‐B‐like
chr9.S_139318861A/C1.20E−08GRMZM2G165357UDP‐glucuronic acid decarboxylase 1 isoformUXS1
chr9.S_139318861A/C1.20E−08GRMZM2G161293Beta‐1‐3‐galactosyl‐O‐glycosyl‐glycoproteinBGGP
chr9.S_145846544C/T6.52E−12GRMZM2G356579Calmodulin‐binding transcription activator 2‐likeCAMTA2
chr9.S_145846544C/T6.52E−12GRMZM2G026742HSF‐transcription factor 9hsftf9
chr9.S_145846544C/T6.52E−12GRMZM2G126936NAC domain‐containing protein 67‐likenactf45
chr9.S_145846544C/T6.52E−12GRMZM2G425683Major facilitator superfamily proteinMFS‐like
chr9.S_145846544C/T6.52E−12GRMZM2G126834ARR transcription factorarr1
chr9.S_23283117C/T3.87E−14GRMZM2G024993Granule‐bound starch synthaseGBSS
chr9.S_23283117C/T3.87E−14GRMZM2G171395NAC domain transcription factornactf86
chr9.S_24270258A/G1.60E−08GRMZM5G811192GlycosidasesGH81
chr9.S_24270258A/G1.60E−08GRMZM2G069008Nudix hydrolase 14NUDT

Association analysis identified 27 SNPs and 39 candidate genes implicated in carbohydrate metabolism and regulation.

Figure 5

Candidate causative genes and variants underlying amylose content in maize kernel. (a) and (b) Regional Manhattan plot of the mfs‐like genomic region on chromosome 9 and the ppr‐like genomic region on chromosome 4. The 250‐kb genomic region on either side of the most significant SNP is shown. The lead SNP is shown with a largest red diamond. The red broken circle indicates the SNPs in coding region for the corresponding genes. (c) and (d) Candidate causative variants in mfs‐like and ppr‐like. Red nabla shows the locations of SNPs. The green colour showed the predicted coding region. (e) and (f) Allele effects for corresponding SNPs in mfs‐like and ppr‐like genes.

Quantile–quantile and Manhattan plots for the association study of amylose content in maize kernels. (a) Quantile–quantile for amylose content. (b) Manhattan plot of amylose content. The dashed line indicates the significance threshold of P‐value 5 × 10–8. 27 unique SNPs are labelled with red dots, and the corresponding genes in the amylose/amylopectin of carbohydrate metabolic pathway are highlighted. SNPs and candidate genes significantly associated with amylose content Association analysis identified 27 SNPs and 39 candidate genes implicated in carbohydrate metabolism and regulation. Candidate causative genes and variants underlying amylose content in maize kernel. (a) and (b) Regional Manhattan plot of the mfs‐like genomic region on chromosome 9 and the ppr‐like genomic region on chromosome 4. The 250‐kb genomic region on either side of the most significant SNP is shown. The lead SNP is shown with a largest red diamond. The red broken circle indicates the SNPs in coding region for the corresponding genes. (c) and (d) Candidate causative variants in mfs‐like and ppr‐like. Red nabla shows the locations of SNPs. The green colour showed the predicted coding region. (e) and (f) Allele effects for corresponding SNPs in mfs‐like and ppr‐like genes.

Functional annotation for candidate genes

Besides the known waxy gene involved in amylose synthesis, we identified more genes such as GRMZM2G089836, GRMZM2G173674 and GRMZM2G069008 from the upstream of the starch synthesis pathway. For example, GRMZM2G089836 encodes an invertase. It was reported that sucrose entering the seed has to be cleaved by invertase into glucose and fructose, which are then resynthesized into sucrose (Walley et al., 2016; Wind et al., 2010). The mutant of the Arabidopsis cytosolic invertase had a high sucrose‐to‐glucose ratio and grew shorter primary roots (Qi et al., 2007). The maize miniature1 (mn1), a mutant with lack of cell wall invertase2, reduced the sucrose transport to the seeds, which resulted in 30% loss of seed weight (Kang et al., 2009). The gene of GRMZM2G173674 encodes a phosphoglucomutase (PGM). It was studied in potato that phosphoglucomutase catalysed the conversion of glucose‐6‐phosphate into glucose‐1‐phosphate that acted as a precursor for starch biosynthesis in plastids (Van Harsselaar et al., 2017). The gene was broadly expressed in all tissues, especially in immature cob and early stage of seed development (Walley et al., 2016). It was found that the plastidic phosphoglucomutase expression was up‐regulated when starch began to accumulate, while its mutant impaired starch synthesis in rice pollen grains and caused male sterility (Lee et al., 2016). GRMZM2G069008 encodes a Nudix hydrolase that is widely distributed in dicots and monocots. Its main function catalyses the cleavage of ADPglucose linked to starch biosynthesis (Kraszewska, 2008). It was reported that the pattern of ADPglucose hydrolytic activity was inversely correlated with starch accumulation (Rodriguez‐Lopez et al., 2000). One of Nudix hydrolases reduced the levels of both ADPglucose and the amylose content when it was overexpressed in Arabidopsis (Ge and Xia, 2008; Munoz et al., 2006). Its decreased expression at the middle stage of maize endosperm development was consistent with the amylose content (Chen et al., 2014). The 39 potential genes at 27 associated loci were annotated as nine transcription factors, four glycosyltransferases, three glycosidases and four hydrolases, as well as 19 other members (Table 1). The transcription factors included MYB, GATA, bZIP, DOF, HSF, NAC, ARR superfamily, indicating their multiple functions and regulations in amylose metabolism (Table 1). Glycosyltransferases and glycosidases form the major catalytic machinery for the synthesis and breakage of glycosidic bonds in disaccharides, oligosaccharides and polysaccharides (Hancock et al., 2006; Lairson et al., 2008; Liang et al., 2015). The enriched glycosyltransferases and glycosidases from the GWAS analysis indicated their important role in amylose metabolism. In conclusion, we propose to integrate the previous knowledge with our GWAS analysis into a model pathway of amylose biosynthesis in maize endosperm (Comparot‐Moss and Denyer, 2009; Keeling and Myers, 2010; Zeeman et al., 2010). Sucrose is produced from the carbon fixation of Calvin cycle in leaves through the activity of mitochondrial NAD‐dependent malic enzyme (ME encoded by GRMZM2G085747), phosphoribulokinase (PRK encoded by GRMZM2G026024 and GRMZM2G143804) and other enzymes. Then, sucrose is transported to the storage organ by sucrose transporter (SUT6 encoded by GRMZM2G106741), where it is imported into the cytosolic compartment of each cell. In the cytosol, the sucrose synthase (SUS) or invertase (INV encoded by GRMZM2G089836) cleaves sucrose into fructose and UDP‐glucose. After glucose 6‐phosphate is transported into plastids, phosphoglucomutase (PGM encoded by GRMZM2G173674) catalyses glucose 6‐phosphate into glucose 1‐phosphate that is activated into ADPglucose as the substrate for amylose synthesis. The content of ADPglucose is also negatively regulated by Nudix hydrolases (NUDT encoded by GRMZM5G809417 and GRMZM2G069008). GBSS encoded by GRMZM2G024993 cooperates with other glycosyltransferase (AC186147.3_FG008, GRMZM2G129090, GRMZM2G164912, GRMZM2G039017) and glycosidase (GRMZM2G402368, GRMZM2G073584, GRMZM5G811192) to catalyse and elongate the sugar chains (Figure 6).
Figure 6

A model of amylose/amylopectin biosynthesis pathway. The model is summarized based on the previous knowledge and our GWAS analysis. In leaves, sucrose is produced from the Calvin cycle by carbon fixation with the activity of mitochondrial NAD‐dependent malic enzyme (ME encoded by GRMZM2G085747), phosphoribulokinase (PRK encoded by GRMZM2G026024 and GRMZM2G143804) and other enzymes. Then, sucrose is transported through phloem to the storage organ of endosperm by sucrose transporter (SUT6 encoded by GRMZM2G106741), where it is imported into the cytosolic compartment of each cell. In the cytosol, the sucrose synthase (SUS) or invertase (INV encoded by GRMZM2G089836) cleaves sucrose into fructose and UDP‐glucose. After glucose 6‐phosphate is transported into amyloplasts, phosphoglucomutase (PGM encoded by GRMZM2G173674) catalyses glucose 6‐phosphate into glucose 1‐phosphate that is activated into ADP‐glucose as the substrate for starch synthesis. The content of ADP‐glucose is also negatively regulated by Nudix hydrolases (NUDT encoded by GRMZM5G809417 and GRMZM2G069008) that break down ADP‐glucose linked to starch biosynthesis. Granule‐bound starch synthase (GBSS encoded by GRMZM2G024993) cooperates with other enzymes to catalyse and elongate sugar chains. All enzymes encoded by candidate genes found from GWAS are highlighted in red colour.

A model of amylose/amylopectin biosynthesis pathway. The model is summarized based on the previous knowledge and our GWAS analysis. In leaves, sucrose is produced from the Calvin cycle by carbon fixation with the activity of mitochondrial NAD‐dependent malic enzyme (ME encoded by GRMZM2G085747), phosphoribulokinase (PRK encoded by GRMZM2G026024 and GRMZM2G143804) and other enzymes. Then, sucrose is transported through phloem to the storage organ of endosperm by sucrose transporter (SUT6 encoded by GRMZM2G106741), where it is imported into the cytosolic compartment of each cell. In the cytosol, the sucrose synthase (SUS) or invertase (INV encoded by GRMZM2G089836) cleaves sucrose into fructose and UDP‐glucose. After glucose 6‐phosphate is transported into amyloplasts, phosphoglucomutase (PGM encoded by GRMZM2G173674) catalyses glucose 6‐phosphate into glucose 1‐phosphate that is activated into ADPglucose as the substrate for starch synthesis. The content of ADPglucose is also negatively regulated by Nudix hydrolases (NUDT encoded by GRMZM5G809417 and GRMZM2G069008) that break down ADPglucose linked to starch biosynthesis. Granule‐bound starch synthase (GBSS encoded by GRMZM2G024993) cooperates with other enzymes to catalyse and elongate sugar chains. All enzymes encoded by candidate genes found from GWAS are highlighted in red colour.

Discussion

It is well known that after the key enzyme of AGPase catalyses 1‐P‐glucose into the activated form of ADPglucose for amylose and amylopectin biosynthesis, the downstream enzymes determine their destinies. Amylose is synthesized by GBSS, whereas amylopectin production is coordinately catalysed by SSs, BE and DBE (Figure 1). It is not surprising that a prominent association peak in the QTL region of the waxy locus (GRMZM2G024993) on chromosome 9 was highly linked with the amylose content. The other QTLs might contribute to both amylose and amylopectin biosynthesis (Figure 6). The waxy gene encoding GBSS plays a critical role in amylose synthesis (Comparot‐Moss and Denyer, 2009; Jeon et al., 2010; Stamp et al., 2016). The dynamic transcriptome analysis showed that waxy was actively expressed in the middle phase of maize endosperm development (Chen et al., 2014). Mutations in waxy eliminate amylose but increase the amylopectin content during the grain filling. These waxy mutants have been used in breeding to create high‐amylopectin maize (Stamp et al., 2016; Tsai, 1974). Recent studies were focused on the downstream genes involved in the final product of starch biosynthesis, such as bt2, sh2, su2, ae1, su1 (Figure 1). Due to the complex of starch biosynthesis and small effect from multiple QTLs, the genetic mechanism of amylose itself rather than total starch synthesis is challengeable to investigate. Taking advantage of GWAS analysis, we were able to detect almost all QTLs at a genome‐wide level with sufficient accuracy and sensitivity. Noticeably, the enzymes in the upstream of amylose pathway were deciphered, such as sucrose transporter, invertase, phosphoglucomutase, Nudix hydrolase, glycosyltransferases and glycosidases that are responsible for the precursors for amylose biosynthesis (Figure 6). The 2,300 Mb of maize genome includes ~32,000 genes with large intergenic regions and ~84% repeats (Jiao et al., 2017). On average, every 72 kb contains a gene. Thus, it is reasonable that we found waxy being 22 kb away from one of the most significant SNPs with a P‐value of 3.86e−14. The acceptable distance between SNPs and candidate genes relies on species, genotype structure and the linkage disequilibrium decay distance. For example, the GWAS for seed oil melting point in Arabidopsis identified two candidate genes FAD2 (AT3G12120) and FATB (AT1G08510) being 74 and 55 kb away from their corresponding SNPs. Molecular studies of FAD2 and FATB variants confirmed that they influence the quantities of the fatty acids (Branham et al., 2016). In the GWAS analysis of oil biosynthesis in maize, the distance from the lead SNP was applied as 50 kb to define potential candidate genes (Li et al., 2013). The 200‐kb genomic region on either side of the most significant SNP is determined as acceptable distance for the genome‐wide study of drought tolerance in maize (Wang et al., 2016). GWAS has begun to serve as a new foundation for understanding the genetic architecture of complex agronomic traits in crops, although it still has confounding limitations. GWAS is powerful to analyse traits underpinned by a small number of loci with large effect size. It should be noticed that the identified significant 42 SNPs are not always the true causative loci by the facts of linkage or error structure of data. We need to take into account of sample size, the imperfect genotyping data and other confounding factors. We maximized the genetic variance within 464 samples collected from different locations. We only used inbred lines to keep genetically identical, which are suited to GWAS analysis. We found 39 candidate genes from 27 significant loci associated with the amylose content. Surprisingly, we did not find any overlapped genes such as sh2 and bt2 in the known pathway of starch biosynthesis. The reason could be due to small effect of multiple QTLs contributing to the trait of amylose/amylopectin content. Additionally, rare variants suffer from many other noncausative rare variants with strong or complete association within the genome (Korte and Farlow, 2013).

Experimental procedures

Plant material

The GWAS mapping population included 464 temperate maize inbred lines collected and genotyped by Dr. Lai's laboratory at China Agricultural University (Liu et al., 2016a). All the inbred lines were planted in the field of Sanya (Hainan Province, 18.75N, 109.17E) in November 2013. The mature seeds of each inbred line were harvested in bulk and used for amylose content analysis.

Amylose content determination

Apparent amylose content (AAC) was measured based on the modified protocol from Gibbon et al. (Gibbon et al., 2003). Mature kernels were ground lightly in a grinding mill. Instead of total starch isolation, we filtered the kernel flour with 80‐mesh nylon screen and dried the flour at 37 °C in oven overnight. We weighed 25 mg kernel flour into 15‐mL centrifuge tube, added 4 mL 80% ethanol and mixed well, heated in a water bath at 80 °C for 20 min with a mixing every 5 min, then centrifuged at 1500 g for 10 min, and supernatant was poured. The precipitates were gelled in 300 μL of absolute ethyl alcohol and 4.5 mL of 1M NaOH by heating to 65 °C for 1 h; subsequently, 500 μL sample was transferred to 50‐mL centrifuge tube, diluted by adding 25 mL water and neutralized in 1 mL 1 m acetic acid, then added 1 mL I2/KI solution (2 mg/mL I2/20 mg/mL KI) and made constant volume to 50 mL, mixed well and incubated for 10 min. The OD was measured on a spectrophotometer (Beckman Coulter) using a 620‐nm filter. The amylose content was determined by the average value from three biological replicates.

Association mapping and SNP annotation

The genotypes of 9,007,194 SNP markers filtered to retain only MAF >0.05 were generated by Dr. Lai's laboratory at China Agricultural University (Liu et al., 2016a). The decay distance of linkage disequilibrium in the association panel was calculated by PopLDdecay (https://github.com/BGI-shenzhen/PopLDdecay) with all SNP markers and mixed linear model (MLM) GWAS was carried out using the EMMAX suite (Kang et al., 2010). EMMAX‐kin was used to calculate the Balding–Nichols kinship matrix to account for family relatedness among the samples, after which EMMAX associations with the phenotype were calculated. The significant threshold of P value is 5e−8, indicated by a horizontal line in the Manhattan plot at −log10 (P value) = 7.3. All SNPs significantly related to the phenotype were annotated by snpEff (Cingolani et al., 2012). The SNP with the most significance within the same LD block (r 2 < 0.1) was selected to represent the locus. The LD between SNPs was also calculated by PopLDdecay (https://github.com/BGI-shenzhen/PopLDdecay). To seek candidate genes in the identified loci for the amylose, we used public gene annotation data sets from both maizeGDB (http://maizegdb.org) and NCBI (https://www.ncbi.nlm.nih.gov). All the annotated genes within the decay distance of linkage disequilibrium in the association panel of SNPs were retrieved. Blastp, InterProScan (Jones et al., 2014) and Pfam (Finn et al., 2014) were also used for annotation candidate genes.

Conflict of interest

The authors have no conflict of interest to declare. Table S1 Amylose content for 464 maize lines. Click here for additional data file. Table S2 Significantly associated SNPs identified by GWAS in maize. Click here for additional data file.
  39 in total

Review 1.  Biochemistry and genetics of starch synthesis.

Authors:  Peter L Keeling; Alan M Myers
Journal:  Annu Rev Food Sci Technol       Date:  2010

2.  The function of the waxy locus in starch synthesis in maize endosperm.

Authors:  C Y Tsai
Journal:  Biochem Genet       Date:  1974-02       Impact factor: 1.890

3.  Cloning, expression and characterization of a Nudix hydrolase that catalyzes the hydrolytic breakdown of ADP-glucose linked to starch biosynthesis in Arabidopsis thaliana.

Authors:  Francisco José Muñoz; Edurne Baroja-Fernández; María Teresa Morán-Zorzano; Nora Alonso-Casajús; Javier Pozueta-Romero
Journal:  Plant Cell Physiol       Date:  2006-06-13       Impact factor: 4.927

Review 4.  Starch: its metabolism, evolution, and biotechnological modification in plants.

Authors:  Samuel C Zeeman; Jens Kossmann; Alison M Smith
Journal:  Annu Rev Plant Biol       Date:  2010       Impact factor: 26.379

5.  Sucrose: metabolite and signaling molecule.

Authors:  Julia Wind; Sjef Smeekens; Johannes Hanson
Journal:  Phytochemistry       Date:  2010-08-07       Impact factor: 4.072

Review 6.  Starch synthesis in the cereal endosperm.

Authors:  Martha G James; Kay Denyer; Alan M Myers
Journal:  Curr Opin Plant Biol       Date:  2003-06       Impact factor: 7.834

7.  Genetic Isolation, Cloning, and Analysis of a Mutator-Induced, Dominant Antimorph of the Maize amylose extender1 Locus.

Authors:  P. S. Stinard; D. S. Robertson; P. S. Schnable
Journal:  Plant Cell       Date:  1993-11       Impact factor: 11.277

8.  Plastidic phosphoglucomutase and ADP-glucose pyrophosphorylase mutants impair starch synthesis in rice pollen grains and cause male sterility.

Authors:  Sang-Kyu Lee; Joon-Seob Eom; Seon-Kap Hwang; Dongjin Shin; Gynheung An; Thomas W Okita; Jong-Seong Jeon
Journal:  J Exp Bot       Date:  2016-09-01       Impact factor: 6.992

9.  Genome-wide association analysis identifies loci governing mercury accumulation in maize.

Authors:  Zhan Zhao; Zhongjun Fu; Yanan Lin; Hao Chen; Kun Liu; Xiaolong Xing; Zonghua Liu; Weihua Li; Jihua Tang
Journal:  Sci Rep       Date:  2017-03-21       Impact factor: 4.379

Review 10.  The advantages and limitations of trait analysis with GWAS: a review.

Authors:  Arthur Korte; Ashley Farlow
Journal:  Plant Methods       Date:  2013-07-22       Impact factor: 4.993

View more
  15 in total

1.  Combination of multi-locus genome-wide association study and QTL mapping reveals genetic basis of tassel architecture in maize.

Authors:  Yanli Wang; Jie Chen; Zhongrong Guan; Xiaoxiang Zhang; Yinchao Zhang; Langlang Ma; Yiming Yao; Huanwei Peng; Qian Zhang; Biao Zhang; Peng Liu; Chaoying Zou; Yaou Shen; Fei Ge; Guangtang Pan
Journal:  Mol Genet Genomics       Date:  2019-07-09       Impact factor: 3.291

2.  Development and validation of rapid and cost-effective protocol for estimation of amylose and amylopectin in maize kernels.

Authors:  Shashidhar Bayappa Reddappa; Rashmi Chhabra; Zahirul Alam Talukder; Vignesh Muthusamy; Rajkumar Uttamrao Zunjare; Firoz Hossain
Journal:  3 Biotech       Date:  2022-02-07       Impact factor: 2.406

3.  Meta-analysis of QTLs associated with popping traits in maize (Zea mays L.).

Authors:  Sukhdeep Kaur; Sujay Rakshit; Mukesh Choudhary; Abhijit Kumar Das; Ranjeet Ranjan Kumar
Journal:  PLoS One       Date:  2021-08-19       Impact factor: 3.240

4.  QTL mapping for maize starch content and candidate gene prediction combined with co-expression network analysis.

Authors:  Feng Lin; Ling Zhou; Bing He; Xiaolin Zhang; Huixue Dai; Yiliang Qian; Long Ruan; Han Zhao
Journal:  Theor Appl Genet       Date:  2019-03-19       Impact factor: 5.699

5.  A combination of linkage mapping and GWAS brings new elements on the genetic basis of yield-related traits in maize across multiple environments.

Authors:  Xiaoxiang Zhang; Zhongrong Guan; Zhaoling Li; Peng Liu; Langlang Ma; Yinchao Zhang; Lang Pan; Shijiang He; Yanling Zhang; Peng Li; Fei Ge; Chaoying Zou; Yongcong He; Shibin Gao; Guangtang Pan; Yaou Shen
Journal:  Theor Appl Genet       Date:  2020-06-27       Impact factor: 5.699

6.  Genome-Wide Association Mapping of Starch Pasting Properties in Maize Using Single-Locus and Multi-Locus Models.

Authors:  Yang Xu; Tiantian Yang; Yao Zhou; Shuangyi Yin; Pengcheng Li; Jun Liu; Shuhui Xu; Zefeng Yang; Chenwu Xu
Journal:  Front Plant Sci       Date:  2018-09-05       Impact factor: 5.753

7.  DNA and Protein Analyses to Confirm the Absence of Cross-Contamination and Support the Clinical Reliability of Extensively Hydrolysed Diets for Adverse Food Reaction-Pets.

Authors:  Isabelle Lesponne; Jérôme Naar; Sébastien Planchon; Tommaso Serchi; Mauricio Montano
Journal:  Vet Sci       Date:  2018-06-26

8.  The Genome Landscape of Tibetan Sheep Reveals Adaptive Introgression from Argali and the History of Early Human Settlements on the Qinghai-Tibetan Plateau.

Authors:  Xiao-Ju Hu; Ji Yang; Xing-Long Xie; Feng-Hua Lv; Yin-Hong Cao; Wen-Rong Li; Ming-Jun Liu; Yu-Tao Wang; Jin-Quan Li; Yong-Gang Liu; Yan-Lin Ren; Zhi-Qiang Shen; Feng Wang; EEr Hehua; Jian-Lin Han; Meng-Hua Li
Journal:  Mol Biol Evol       Date:  2019-02-01       Impact factor: 16.240

9.  Supersweet and waxy: meeting the diverse demands for specialty maize by genome editing.

Authors:  Le Dong; Xiantao Qi; Jinjie Zhu; Changlin Liu; Xin Zhang; Beijiu Cheng; Long Mao; Chuanxiao Xie
Journal:  Plant Biotechnol J       Date:  2019-05-18       Impact factor: 9.803

10.  Comparing Different Statistical Models and Multiple Testing Corrections for Association Mapping in Soybean and Maize.

Authors:  Avjinder S Kaler; Jason D Gillman; Timothy Beissinger; Larry C Purcell
Journal:  Front Plant Sci       Date:  2020-02-25       Impact factor: 5.753

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.