Literature DB >> 34284723

Genome-wide association screening and verification of potential genes associated with root architectural traits in maize (Zea mays L.) at multiple seedling stages.

Abdourazak Alio Moussa1, Ajmal Mandozai2, Yukun Jin2, Jing Qu2, Qi Zhang2, He Zhao2, Gulaqa Anwari2, Mohamed Abdelsamiaa Sayed Khalifa2, Abraham Lamboro2, Muhammad Noman3, Yacoubou Bakasso4, Mo Zhang2, Shuyan Guan2, Piwu Wang5.   

Abstract

BACKGROUND: Breeding for new maize varieties with propitious root systems has tremendous potential in improving water and nutrients use efficiency and plant adaptation under suboptimal conditions. To date, most of the previously detected root-related trait genes in maize were new without functional verification. In this study, seven seedling root architectural traits were examined at three developmental stages in a recombinant inbred line population (RIL) of 179 RILs and a genome-wide association study (GWAS) panel of 80 elite inbred maize lines through quantitative trait loci (QTL) mapping and genome-wide association study.
RESULTS: Using inclusive composite interval mapping, 8 QTLs accounting for 6.44-8.83 % of the phenotypic variation in root traits, were detected on chromosomes 1 (qRDWv3-1-1 and qRDW/SDWv3-1-1), 2 (qRBNv1-2-1), 4 (qSUAv1-4-1, qSUAv2-4-1, and qROVv2-4-1), and 10 (qTRLv1-10-1, qRBNv1-10-1). GWAS analysis involved three models (EMMAX, FarmCPU, and MLM) for a set of 1,490,007 high-quality single nucleotide polymorphisms (SNPs) obtained via whole genome next-generation sequencing (NGS). Overall, 53 significant SNPs with a phenotypic contribution rate ranging from 5.10 to 30.2 % and spread all over the ten maize chromosomes exhibited associations with the seven root traits. 17 SNPs were repeatedly detected from at least two growth stages, with several SNPs associated with multiple traits stably identified at all evaluated stages. Within the average linkage disequilibrium (LD) distance of 5.2 kb for the significant SNPs, 46 candidate genes harboring substantial SNPs were identified. Five potential genes viz. Zm00001d038676, Zm00001d015379, Zm00001d018496, Zm00001d050783, and Zm00001d017751 were verified for expression levels using maize accessions with extreme root branching differences from the GWAS panel and the RIL population. The results showed significantly (P < 0.001) different expression levels between the outer materials in both panels and at all considered growth stages.
CONCLUSIONS: This study provides a key reference for uncovering the complex genetic mechanism of root development and genetic enhancement of maize root system architecture, thus supporting the breeding of high-yielding maize varieties with propitious root systems.
© 2021. The Author(s).

Entities:  

Keywords:  Candidate genes; GWAS; Maize; SNPs; qRT-PCR; root-related traits

Mesh:

Year:  2021        PMID: 34284723      PMCID: PMC8290564          DOI: 10.1186/s12864-021-07874-x

Source DB:  PubMed          Journal:  BMC Genomics        ISSN: 1471-2164            Impact factor:   3.969


Background

Maize (Zea mays L.) is one of the most widely produced grain crops in the world [1]. With the fast-growing world population, improving the yield of corn has become an important target for breeders. The root system plays a primordial role in plant species growth and development and even productivity [2-4]. Plants rely on the root system for anchorage and the acquisition and absorption of nutrients essential for sustaining productivity [2]. As the place of plant and soil interactions, roots play a fundamental role in plant responses to biotic and abiotic stresses [5], and influence significantly many agronomically important traits, including drought and flood tolerance [6-8], root-lodging resistance [9], and nutrient use efficiency particularly nitrogen (N), phosphorus(P), and calcium (Ca) under suboptimal growth conditions [10-13] and resource-challenging environments [2, 14, 15]. Important synchronizations were previously revealed between root growth especially shoot-borne roots with N uptake efficiency in maize [15, 16]. Besides, some pieces of evidence support that high yielding maize varieties are supposed to have propitious root systems, which may efficiently sustain water and nutrients, resulting in increased yield [17] especially under limited water or nutrient availability [18]. Furthermore, grain yield was reported to be closely correlated with root related traits in the early stages of maize development [19]. Nevertheless, maize roots have received much less attention than shoot structures since they are hidden, complex, dynamic and greatly influenced by the soil environment [15, 20–22]. Due to the challenge in achieving reliable root-related trait data from the field, characterizing crops such as maize with improved root system characteristics in the field remains still a major challenge to current plant biology [18, 23] and root trait phenotyping studies commonly use soil-less nutritive solutions [5]. However, it was previously indicated that plant growing systems that nearly mimic the soil media are more stable in mineral elements and environmental factors, and easier to operate for root morphological traits phenotyping in maize [24]. Therefore, to offer a better and robust tool for plant behavior prediction under field conditions, various experimental growing systems with soil-based substrates have been implemented [25-28]. Owing to the rapid progress in digitally automatic image analysis, root phenotypic data acquisition is becoming nowadays cheaper, quicker and more effective [29-34]. Thus, numerous software frameworks such as ARIA [31], EZ-Rhizo [35], Smart Root [36], WinRhizo [37], Optimas analysis software, Image J [38], Root Nav [32], IJ_Rhizo [39], Root System Analyzer [40], and Root Trace [41] have been broadly used for automated root traits measurements in a high throughput manner. To date, several Quantitative Trait Loci (QTL) studies have been conducted to locate root-trait QTLs under various conditions of growth, at diverse developmental stages and involving various genetic populations [21, 22, 24]. Yet, due to low-density markers and large confidence intervals, the localizations of the identified QTLs were inconsistent among the different findings. Thus, further root studies were necessary to detect more chromosomal regions and ultimately identify consistent loci to further screen and verify candidate genes crucial for marker-assisted selection. By enabling the identification of essential loci at high-resolution, association studies have several advantages over conventional genetic mapping approaches for understanding the genetic basis of complex traits [42] like root traits in maize [2]. In the 21st century, genome-wide association studies (GWAS) have been auspiciously used as a high-throughput technique to analyzing the genetic basis for a variety of major crops [42], such as rice, sorghum, soybean, wheat, and maize essential for modern genetic studies [43]. Recently, Sanchez et al. [44] used 300 doubled haploid exotic introgression lines and found 39 SNPs for root architecture traits along with, multiple SNPs within candidate genes that displayed expression in maize seedling roots. In a GWAS analysis implying 14 days old maize seedlings generated from 384 inbred lines genotyped by sequencing (GBS), 268 SNPs associated with root morphological traits, along with 9 SNPs within one candidate gene region were reported [2]. Zaidi et al. [8] used a CIMMYT Asia panel involving 396 tropical maize lines. They revealed 67 SNPs associated with root structural traits under drought stress with many SNPs found within various candidate gene regions. However, to our knowledge no previous study has investigated the genetic basis of maize root architectural traits at multiple developmental stages with successful functional verification of associated candidate genes. Therefore, the objectives of this study were to (i) screen the existing phenotypic variability of root architectural traits within a maize elite germplasm at multiple seedling stages, (2) detect novel significant genomic regions throughout the whole genome and across stages associated with root architectural traits, and (3) identify and verify the expression of possible potential candidate genes.

Results

Phenotypic analysis of root architectural traits

From the mapping population, the evaluated root traits displayed large variations both in parental lines and their offspring at all the three stages (Table 1 and Additional file 1: Table S1). Analysis of variance related to the root trait performances of the two parental lines revealed significant to highly significant differences (P < 0.05; P < 0.01; P < 0.001) for all the measured seedling traits and at all the three stages except RDW/SDW at V1 stage (Table 1). Comparing the two parents, P014 displayed significantly higher root trait performances than E1312 across the three stages (Table 1). This result shows the instantaneous nature of the development of the two parental root systems over time which confirms the pertinence of the three selected experimental time-points for root traits assessment. RBN and TRL exhibited the largest variations of 264.94 and 121.00 cm, respectively (Table 1). Similar heritability and correlation patterns were also observed across stages. Heritability values ranged between 50.22 ( for RDW/SDW) and 99.96 % (for TRL). SUA and TRL exhibited the strongest positive significant correlations (r = 0.924; P < 0.01) while RDW/SDW showed very weak correlations with all other traits (r = 0.149 ~ 0.464; P < 0.05; P < 0.01; Table 2).
Table 1

Descriptive statistics of the seven root-related traits within the mapping population at V1, V2 and V3 stages

TraitsStageP014E1312Sig.aRILs
MeanMeanMean±SDRangeSkewnessKurtosisCV (%)H2(%)
RDW(g)V10.040.01*0.030.030.171.853.6090.0570.91
V20.060.03**0.050.030.201.372.5859.0169.34
V30.070.04**0.100.050.321.061.3252.5485.73
RDW/SDWV10.870.38ns0.781.109.914.2822.43141.3365.94
V20.480.38**0.330.192.283.6626.6758.9050.22
V30.250.19*0.410.171.491.253.5641.9183.27
TRL(cm)V173.7936.86***55.3125.19147.890.680.6845.5598.81
V2155.29100.07***123.5757.78418.321.464.4146.7699.24
V3216.63107.19***249.96121.00521.610.44-0.6948.4199.96
SUA (cm2)V128.1916.42***19.329.7566.031.172.9350.4899.31
V240.6935.71***40.1722.12134.381.442.8855.0794.73
V376.3043.73**91.1651.44235.910.68-0.2856.4397.89
ARD (mm)V11.131.00**0.940.201.461.224.0721.4697.74
V22.311.16***1.120.252.322.4311.8922.5288.49
V33.492.45***1.380.454.574.2228.7632.6995.46
ROV (cm3)V11.150.63***0.560.362.602.318.8465.5298.15
V22.880.84***1.130.734.381.291.9164.3296.66
V33.081.79***2.822.2622.943.6325.1780.2597.35
RBNV151.3328.33***45.6626.57126.000.670.0358.1999.28
V294.3352.67***105.8893.841051.006.3359.1988.6499.79
V3331.00145.33***264.94362.104518.009.39107.55136.6799.89

RDW root dry weight, RDW/SDW root per shoot dry weight, TRL total root length, SUA surface area, ARD average root diameter, ROV root volume, RBN root branching number, SD Std. dev, CV coefficient of variation, H Broad-sense heritability

alevel of significance via student-test with, ns no significant difference

*significantly different at P < 0.05

**significantly different at P < 0.01

***significantly different at P < 0.001

Table 2

Pearson correlation coefficients between the seven root-related traits within the mapping population at V1, V2 and V3 stages

TraitsRDWRDW/SDWTRLSUAARDROV
V1
 RDW/SDW0.719**
 TRL0.330**0.022
 SUA0.389**0.0810.886**
 ARD0.194**0.0500.0550.261**
 ROV0.425**0.1030.669**0.854**0.514**
 RBN0.299**0.0500.835**0.781**0.0870.597**
V2
 RDW/SDW0.648**
 TRL0.659**0.270**
 SUA0.535**0.184**0.826**
 ARD0.0040.0290.0110.309**
 ROV0.459**0.156*0.614**0.884**0.502**
 RBN0.473**0.304**0.783**0.758**0.1140.577**
V3
 RDW/SDW0.675**
 TRL0.771**0.412**
 SUA0.818**0.464**0.924**
 ARD0.0000.054-0.0360.051
 ROV0.639**0.353**0.680**0.843**0.247**
 RBN0.310**0.149*0.442**0.523**0.169*0.800**

RDW root dry weight, RDW/SDW root per shoot dry weight, TRL total root length, SUA surface area, ARD average root diameter, ROV root volume, RBN root branching number

the symbol * and ** indicate respectively, significance at P < 0.05 and at P < 0.01

Descriptive statistics of the seven root-related traits within the mapping population at V1, V2 and V3 stages RDW root dry weight, RDW/SDW root per shoot dry weight, TRL total root length, n class="Chemical">SUA surface area, ARD average root diameter, ROV root volume, RBN root branpan>ching number, SD Std. dev, CV coefficienpan>t of variationpan>, H Broad-senpan>se heritability alevel of significance via student-test with, ns no significant difference *significantly different at P < 0.05 **significantly different at P < 0.01 ***significantly different at P < 0.001 Pearson correlation coefficients between the seven root-related traits within the mapping population at V1, V2 and V3 stages RDW root dry weight, RDW/SDW root per shoot dry weight, TRL total root length, n class="Chemical">SUA surface area, ARD average root diameter, ROV root volume, RBN root branpan>ching number the symbol * and ** indicate respectively, significance at P < 0.05 and at P < 0.01 Similarly, substantial variation at all growth stages was observed within the GWAS population for all root traits evaluated (Table 3 and Additional file 2: Table S2). At stage V3, RBN and ROV showed the highest coefficients of variation of 77.16 and 76.47 %, respectively. Most root-related traits investigated nearly followed a normal distribution, somewhat skewed from left to right (Fig. 1). Moderate to high broad-sense heritability estimates were observed for all seedling traits and at all stages (Table 3). The highest value was observed from RBN (99.84 %) while the lowest one from ARD (43.20 %) (Table 3). Pearson correlation analysis was also performed to examine the phenotypic relationships among root related traits at each specified stage. Similar significant correlation patterns, but with a greater extent at later stages were detected (Table 4). In regards to all evaluated traits at all stages, ROV and n class="Chemical">SUA exhibited the strongest positive significant correlations (r > 90 %, P < 0.01) while RDW/SDW is weakly correlated to all other traits (r = -0.199 ~ 0.477; P < 0.05; P < 0.01; Table 4).
Table 3

Descriptive statistics of seedling root related traits for the GWAS population at three stages

TraitsStageMean±SDRangeSkewnessKurtosisCV (%)H2 (%)
RDW (g)V10.060.020.150.392.2440.8294.07
V20.100.060.310.970.4962.4684.68
V30.120.070.340.820.3355.7371.89
RDW/SDWV10.601.387.882.567.3986.3290.56
V20.630.413.272.7511.7565.3280.87
V30.450.181.090.801.1339.8461.71
TRL(cm)V194.2948.98261.420.930.6451.9597.87
V2195.30134.37699.971.412.4568.8099.52
V3305.10184.01780.260.71-0.3360.3199.51
SUA (cm2)V132.9016.6779.180.950.5450.6896.17
V270.2852.53251.411.341.8974.7498.67
V3107.9166.56294.090.66-0.4261.6899.30
ARD (mm)V11.080.261.460.610.2022.1158.08
V21.120.302.351.887.5627.1151.31
V31.280.221.420.641.2920.2643.29
ROV (cm3)V10.960.522.570.930.6354.2475.11
V22.041.758.831.441.9986.0694.27
V33.022.3113.481.352.7976.4789.26
RBNV169.4745.45201.001.231.1565.4398.51
V2169.39132.69642.001.472.3078.3499.28
V3275.04212.22864.001.180.7577.1699.84

RDW root dry weight, RDW/SDW root per shoot dry weight, TRL total root length, SUA surface area, ARD average root diameter, ROV root volume, RBN root branching number, SD Std. dev, CV coefficient of variation, H Broad-sense heritability

Fig. 1

Distribution frequencies of the seven seedling root traits in the GWAS population across three stages (A, B, and C represent the results from V1 (in turquoise color), V2 (in red color), and V3 (in blue color) stages, respectively). RDW = root dry weight; RDW/SDW = root per shoot dry weight; TRL = total root length; SUA = surface area; ARD = average root diameter; ROV = root volume; RBN = root branching number

Table 4

Pearson correlations at three stages among all root-related traits for GWAS population

TraitsRDWRDW/SDWTRLSUAARDROV
V1
 RDW/SDW0.509**
 TRL0.115-0.251*
 SUA0.183-0.224*0.902**
 ARD0.0370.025-0.303**-0.031
 ROV0.172-0.2030.731**0.907**0.0262*
 RBN0.104-0.2120.752**0.790**-0.1140.749**
V2
 RDW/SDW0.101
 TRL0.788**-0.183
 SA0.822**-0.214*0.941**
 ARD0.381**-0.1270.247*0.418**
 ROV0.816**-0.199*0.857**0.970**0.542**
 RBN0.634**-0.209*0.837**0.852**0.378**0.814**
V3
 RDW/SDW0.615**
 TRL0.773**0.423**
 SUA0.843**0.477**0.870**
 ARD0.394**0.215*0.1040.343**
 ROV0.813**0.438**0.704**0.933**0.532**
 RBN0.719**0.451**0.900**0.806**0.1000.649**

RDW root dry weight, RDW/SDW root per shoot dry weight, TRL total root length, SUA surface area, ARD average diameter, ROV root volume, RBN root branching number

the symbol * and ** indicate respectively, significance at P < 0.05 and at P < 0.01

Descriptive statistics of seedling root related traits for the GWAS population at three stages RDW root dry weight, RDW/SDW root per shoot dry weight, TRL total root length, n class="Chemical">SUA surface area, ARD average root diameter, ROV root volume, RBN root branpan>ching number, SD Std. dev, CV coefficienpan>t of variationpan>, H Broad-senpan>se heritability Distribution frequencies of the seven seedling root traits in the GWAS population across three stages (A, B, and C represent the results from V1 (in turquoise color), V2 (in red color), and V3 (in blue color) stages, respectively). RDW = root dry weight; RDW/SDW = root per shoot dry weight; TRL = total root length; n class="Chemical">SUA = surface area; ARD = average root diameter; ROV = root volume; RBN = root branpan>ching number Pearson correlations at three stages among all root-related traits for GWAS population RDW root dry weight, RDW/SDW root per shoot dry weight, TRL total root length, n class="Chemical">SUA surface area, ARD average diameter, ROV root volume, RBN root branpan>ching number the symbol * and ** indicate respectively, significance at P < 0.05 and at P < 0.01

QTL mapping

The linkage map contained 4235 high-quality SNP markers covering a total length of 1514.57 cM distributed for 10 linkage groups [45]. Using inclusive composite interval mapping method with LOD ≥ 2.5 as a threshold, a total of eight substantial QTLs with a phenotypic variance explained ranging from 6.44 to 8.83 % were detected across the three stages (Table 5). The mapped QTLs were allocated to chromosomes 1, 2, 4, and 10. Chromosome 4 contained the highest number of QTLs, with 3 QTLs detected while chromosomes 1, 2, and 10 contained between 1 and 2 QTLs (Table 5). Four QTLs were detected at V1 while two QTLs where identified at both V2 and V3 stages. When examining the number of QTL inheriting parental favorable alleles, the alleles involved in increasing root characteristics at four loci belonged to the parent P014. Meanwhile, the paternal line E1312 contributed to the other four loci, underlying, therefore, the imperative implication of the two parents in root features discrimination. QTL clusters were identified on chromosome 1 and 10 at V3 and V1, respectively. The cluster on chromosome 1 (qRDWv3-1-1 and qRDW/SDWv3-1-1) located within the marker interval Snp3292_Snp3298 was associated with RDW and RDW/SDW at the genetic region 92.5- 95.5 cM. The Cluster on chromosome 10 (qRBNv1-10-1 and qTRLv1-10-1) detected within the marker interval Snp62466_Snp62578 was significantly associated to RBN and TRL and spanned 50.5–51.5 cM genetic region. This region harbored two candidate genes GRMZM2G116542 and GRMZM2G016477 predicted to encode a putative Spc97 / Spc98 family of spindle pole body (SBP) component and a putative leucine-rich repeat receptor-like protein kinase, respectively. The three QTLs detected on chromosome 4 (qSUAv1-4-1, qSUAv2-4-1, and qROVv2-4-1) were significantly associated with SUA and ROV and spanned the genetic region 89.5–102.5 cM. QTL qROVv2-4-1 (LOD = 3.43, PVE = 8.83 %) associated with ROV was the most significant QTL detected in this study (Table 5). Interestingly, the gene model GRMZM2G068506 predicted to encode a Glucose-1-phosphate adenylyltransferase was found within this chromosomal region. The physical positions of all the detected QTLs are presented in Additional file 3: Table S3.
Table 5

Summary of root traits QTLs detected in P014 × E1312 population

QTLaChrBinPeak(cM)Marker intervalGenetic interval(cM)LODPVEb (%)Add.c
qRDWv3-1-111.0595Snp3292_Snp329892.5–95.52.516.74-0.01
qRDW/SDWv3-1-111.0595Snp3292_Snp329892.5–95.52.516.74-0.01
qRBNv1-2-122.1015Snp16808_Snp1667514.5–15.52.516.446.38
qSUAv1-4-144.0591Snp25452_Snp2543489.5–91.52.676.722.53
qSUAv2-4-144.05102Snp26234_Snp26219100.5- 102.53.037.755.97
qROVv2-4-144.0596Snp25161_Snp2508595.5–96.53.438.830.21
qTRLv1-10-11010.05-0651Snp62466_Snp6257850.5–51.52.666.77-6.65
qRBNv1-10-11010.05-0651Snp62466_Snp6257850.5–54.52.737.16-6.81

aThe identified QTLs: the name contains trait initials, seedling growing stage, and the number of correspondent chromosome

bThe percentages of phenotypic variation explained

cThe QTL additive effect: positive values indicate that P014 provides increased alleles and negative ones indicate that E1312 alleles increased the trait

Summary of root traits QTLs detected in n class="Chemical">P014 × pan> class="Chemical">E1312 population aThe identified QTLs: the name contains trait initials, seedling growing stage, and the number of correspondent chromosome bThe percentages of phenotypic variation explained cThe QTL additive effect: positive values indicate that n class="Chemical">P014 provides increased alleles anpan>d negative onpan>es indicate that pan> class="Chemical">E1312 alleles increased the trait

NGS analysis results

High-quality genomic data consisted of 3230.75 Gb with an average of 40.38 Gb per sample were obtained from the whole-genome sequencing of the 80 inbred maize lines. All related sequences were made available under the accession number PRJNA495031 in the Sequence Read Archive (https:/www.ncbi.nlm.gov/sra). The averages sequencing depth and coverage were 17.62 and 88.39 %, respectively. With reference to the B73 genome (RefGen_v3), the average similarity rate was 98.82 %.

Population structure and linkage disequilibrium

Based on phylogenetic and PCA analysis, the 80 inbred maize lines were subdivided into three subgroups (Fig. 2A, B). Subgroup 1 mainly included the Reid germplasm represented by PH09B and PH6WC maize inbred lines. Subgroup 2 included mainly the Chinese Lvda Red Cob and Tang Si Ping Tou germplasm as well as some tropical maize lines. Subgroup 3 comprised European and Lancaster germplasm, including Non-Reid maize inbred lines such as PHB1 M and Mo17. As shown in Fig. 3, the average decay distance of the LD across all chromosomes was about 5.2 kb at r2 = 0.1. Chromosome 1 with a distance of 10.7 kb showed the highest LD decay while the shortest decay distance (3.7 kb) was observed on chromosome 2.
Fig. 2

Population structure of the 80 maize accessions: A Phylogenetic generated using TreeBeST, B Principal component analysis based on genome-wide complex trait analysis software tool (GCTA)

Fig. 3

Linkage disequilibrium decay across all 10 maize chromosomes within the 80 maize panel

Population structure of the 80 n class="Species">maize accessionpan>s: A Phylogenpan>etic genpan>erated using TreeBeST, B Principal componpan>enpan>t anpan>alysis based onpan> genpan>ome-wide complex trait anpan>alysis software tool (GCTA) Linkage disequilibrium decay across all 10 maize chromosomes within the 80 pan> class="Species">maize panel

GWAS for root architectural traits

In this current analysis, three GWAS approaches including EMMAX, FarmCPU, and MLM were used to scan significant SNPs associated with seven root traits namely RDW, RDW/SDW, TRL, SUA, ARD, ROV, and RBN across three vegetative stages (V1, V2, and V3). The detailed list of all significant SNPs detected in this study and their associated genes is presented in Additional file 4: Table S4. SNPs identified within candidate genes or across at least two different stages/methods simultaneously were considered as reliable in this study. Hence, according to these criteria, 53 unique SNPs, along with 46 SNPs within candidate genes that exhibited significant associations with root morphological traits at the critical threshold of -log10(P) ≥ 6.0 were obtained (Table 6; Fig. 4). These abovementioned SNPs were distributed all over the 10 maize chromosomes and individually explained between 5.10 and 30.2 % of phenotypic variation (Table 6). When analyzing significant SNPs that were detected throughout different stages, 17 SNPs were repeatedly detected from at least two stages along with 3 stable SNPs scanned across all the three growth stages (Table 6; Fig. 4). Our study regarded these SNPs as of great interest for further breeding purposes. Comparing the results across the different GWAS approaches, 34, 19, and 1 SNPs were identified by EMMAX, FarmCPU, and MLM, respectively (Table 6; Fig. 4). The SNP with the lowest p-value was located on chromosome 7, position 58,218,452 (-log10(P) = 14.95, R2 = 30.2 %), and was associated with RBN and SUA. This SNP was detected by FarmCPU stably across V1 and V3 stages. The SNP on chromosome 2 (S2_1707072, -log10(P) = 8.36, R2 = 25.1 %) was simultaneously detected by two different methods (EMMAX, MLM) at V2 stage. In regards of significant SNPs controlling multiple traits, two SNPs located on chromosomes 1 and 5 (S1_227871089, S5_82882718) were substantially linked to three root traits including ROV (-log10(P) = 6.06, 14.10, R2 = 14 %, 22.8 %), RDW(-log10(P ) = 6.10, 6.87, R2 = 14 %, 6.3 %), and SUA (-log10(P) = 7.02, 14.10 R2 = 12.1 %, 22.8 %), respectively. Another SNP on chromosome 2 (S2_43293834, -log10(P) = 6.89, R2 = 11 %) was also significantly linked with three different root traits, including RBN, ROV and SUA. The Q-Q (quantile-quantile) plots of all traits at all stages are shown in Additional file 5: Figure S1.
Table 6

Potential significant SNPs associated with root related traits

TraitsPositionMethodaStageChrP-value-log10(P)R2Genotype
ARD209,661,1442V2Chr38.51E-087.070.100G/A
ARD159,805,3682V2Chr68.32E-076.080.111T/G
RBN72,599,7411V1,V3Chr13.31E-076.480.101T/G
RBN127,352,0561V2Chr12.95E-076.530.115T/C
RBN1,707,0721,3V2Chr24.37E-098.360.251G/A
RBN192,996,7241V2Chr27.59E-087.120.151 C/A
RBN195,707,0911V2Chr26.17E-087.210.152 C/T
RBN2,564,2961V2Chr21.86E-076.730.108 C/T
RBN32,824,9651V2Chr26.76E-087.170.107G/C
RBN171,057,1722V2Chr34.68E-076.330.101G/A
RBN122,501,3441V1,V3Chr48.91E-076.050.121 C/T
RBN87,176,0061V1,V3Chr57.08E-076.150.123 C/T
RBN221,805,1442V2Chr59.77E-076.010.134G/A
RBN178,455,9442V2Chr52.29E-076.640.140 C/T
RBN205,892,8472V2Chr52.82E-076.550.122T/C
RBN128,905,2601V2Chr68.32E-076.080.114G/A
RBN162,388,4752V2Chr67.08E-076.150.121G/A
RBN88,347,9631V1,V3Chr87.41E-076.130.138T/C
RBN145,938,1502V2Chr95.25E-076.280.150G/A
RBN, SUA58,218,4522V1,V3Chr71.12E-1514.950.302 A/G
RBN,ROV, SUA43,293,8341V1,V2Chr21.29E-076.890.110G/A
RDW227,871,0891V2Chr17.94E-076.100.140 C/T
RDW82,567,2491V2Chr17.76E-076.110.140 C/G
RDW2,246,0812V2Chr101.29E-1110.890.223G/A
RDW191,539,2972V1,V3Chr57.94E-076.100.160G/C
RDW82,882,7182V1,V3Chr51.35E-076.870.063 C/A
RDW118,512,7032V2Chr79.55E-076.020.113 A/G
RDW,SUA230,477,4461V1,V3Chr18.91E-076.050.051G/A
RDW/SDW19,943,3841V2Chr17.41E-098.130.160 C/T
ROV227,871,0891V1,V2,V3Chr18.71E-076.060.140 C/T
ROV173,181,8441V2Chr16.46E-076.190.120G/A
ROV150,754,7261V2Chr28.32E-076.080.150 A/T
ROV166,210,2991V2Chr25.01E-076.300.122 C/T
ROV21,486,1131V2Chr21.70E-076.770.110 C/T
ROV187,822,5822V2Chr32.82E-1211.550.140 C/T
ROV12,060,8382V2Chr31.00E-066.000.100 A/G
ROV241,936,5761V1,V2,V3Chr49.77E-076.010.052 C/G
ROV118,806,0681V2Chr59.55E-076.020.153T/C
ROV28,955,5061V2Chr68.32E-076.080.054G/A
ROV,SUA82,882,7182V1,V2,V3Chr57.94E-1514.100.228 C/A
SUA227,871,0891V2Chr19.55E-087.020.121 C/T
SUA1,707,0721V2Chr23.55E-076.450.200G/A
SUA2,610,0941V2Chr22.57E-087.590.121 C/G
SUA94,781,4311V2,V3Chr37.94E-076.100.153 C/T
SUA7,141,3742V1,V3Chr56.46E-087.190.152G/A
SUA217,144,0202V1,V3Chr51.55E-1211.810.160 A/T
SUA119,718,5901V2Chr55.50E-076.260.150 C/T
SUA179,029,1121V2Chr72.40E-076.620.145 C/T
TRL72,599,7691V1,V3Chr14.68E-087.330.100T/C
TRL111,734,3172V2Chr27.08E-076.150.128 A/G
TRL59,237,0401V2Chr35.37E-087.270.134G/A
TRL241,936,5761V1,V3Chr42.95E-076.530.061 C/G
TRL128,905,2541V2Chr65.13E-076.290.106G/A

MAF minor allele frequency, R2 phenotypic contribution, Chr chromosome, RDW root dry weight, RDW/SDW root per shoot dry weight, TRL total root length, SUA surface area, ARD average root diameter, ROV root volume, RBN root branching number

amethod 1-3 refers to EMMAX, FarmCPU, and MLM, respectively

Fig. 4

Manhattan plot of all potential significant SNPs associated with each root-related trait at each specific stage (V1, V2, and V3). 1, 2, 3 refer to EMMAX, FarmCPU, and MLM, respectively. RDW = root dry weight; RDW/SDW = root per shoot dry weight; TRL = total root length; SUA = surface area; ARD = average root diameter; ROV = root volume; RBN = root branching number

Potential significant SNPs associated with root related traits MAF minor allele frequency, R2 phenotypic contribution, Chr chromosome, RDW root dry weight, RDW/SDW root per shoot dry weight, TRL total root length, n class="Chemical">SUA surface area, ARD average root diameter, ROV root volume, RBN root branpan>ching number amethod 1-3 refers to EMMAX, FarmCPU, and MLM, respectively Manhattan plot of all potential significant SNPs associated with each root-related trait at each specific stage (V1, V2, and V3). 1, 2, 3 refer to EMMAX, FarmCPU, and MLM, respectively. RDW = root dry weight; RDW/SDW = root per shoot dry weight; TRL = total root length; n class="Chemical">SUA = surface area; ARD = average root diameter; ROV = root volume; RBN = root branpan>ching number

Candidate genes and functional annotations

A total of 46 genes, along with 41 genes with SNPs inside, were found showing associations with the seven root architectural traits (Table 7). The candidate gene Zm00001d019766 was found only 5.54 kb away from the most significant SNP detected in this study located on chromosome 7 (S7_58218452) and associated with RBN and SUA. This gene was predicted to encode a RING/U-box superfamily protein. The gene model Zm00001d032473 on chromosome 1 (S1_227871089 for ROV, RDW, and SUA located within exon of the candidate gene) was predicted to confer a nonsynonymous mutation associated with CDPK-related kinase 3. The candidate genes Zm00001d029482 (S1_72599741 and S1_72599769 within the candidate gene) and Zm00001d037546 (S6_128905260 and S6_128905254 within the candidate gene) located respectively on chromosomes 1 and 6 contained two significant markers found for two traits, TRL and RBN. The gene model Zm00001d029482 was predicted to encode a NAD (P)-binding Rossmann-fold superfamily protein. Gene model Zm00001d005925 (SNP S2_192996724 for RBN located within the candidate gene) encodes a phosphoglucose isomerase protein with various pathways including GDP-mannose biosynthesis, gluconeogenesis I, glycolysis I (from glucose 6-phosphate), starch biosynthesis, and sucrose biosynthesis I. Gene model Zm00001d017279 on chromosome 5 (SNP S5_191539297 for RDW located within the candidate gene) encodes a phenylalanine ammonia-lyase protein associated with trans-cinnamoyl-CoA biosynthesis pathways. Gene Zm00001d038676 located on chromosome 6 (SNP S6_162388475 for RBN located within the candidate gene) encodes xyloglucan 6-xylosyltransferase and xyloglucan glycosyltransferase associated with xyloglucan and biosynthesis. The details of all candidate genes associated with potential SNPs and the functional annotations are presented in Table 7.
Table 7

Candidate genes associated with potential SNPs and functional annotations

Gene_IDTraitsChrSNP PositionDistance(bp)Gene typeFunctional annotation/biological pathway
Zm00001d043773ARD3209,661,1440exon,synonymousPutative clathrin assembly protein
Zm00001d038558ARD6159,805,3680UTR3cystatin3, CC3, Cysteine proteinase inhibitor 2,
Zm00001d029482RBN172,599,7410intronicNAD(P)-binding Rossmann-fold superfamily protein
Zm00001d030376RBN1127,352,0560intronicATP-dependent DNA helicase
Zm00001d001900RBN22,564,2960intronicProbable cysteine protease RD21B
Zm00001d003119RBN232,824,9650intronicTrafficking protein particle complex II-specific subunit 120 homolog
Zm00001d005925RBN2192,996,7240UTR5Glucose-6-phosphate isomerase 1 chloroplastic,
Zm00001d006030RBN2195,707,0910intergenicENT domain-containing protein
Zm00001d042535RBN3171,057,1720intronicselenoprotein family protein
Zm00001d050783RBN4122,501,3440exon,nonsynonymousMolybdopterin synthase sulfur carrier subunit
Zm00001d015379RBN587,176,0060exon,nonsynonymousSplicing factor arginine/serine-rich 12
Zm00001d016858RBN5178,455,9440UTR3Ankyrin repeat protein SKIP35
Zm00001d017751RBN5205,892,8470exon,nonsynonymousPentatricopeptide repeat-containing protein (chloroplastic)
Zm00001d018496RBN5221,805,1440exon,synonymousPumilio homolog 4
Zm00001d037546RBN6128,905,2600intronicunknown
Zm00001d038676RBN6162,388,4750exon,synonymousProbable xyloglucan glycosyltransferase 12,
Zm00001d009896RBN888,347,9630exon,nonsynonymousunknown
Zm00001d047946RBN9145,938,1500intronicCell cycle checkpoint protein RAD17
Zm00001d003405RBN, ROV, SUA243,293,8345823intergenicBifunctional inhibitor/lipid-transfer protein/seed storage 2 S albumin superfamily protein
Zm00001d001841RBN, SUA21,707,0720intronicunknown
Zm00001d019766RBN, SUA758,218,452-5544intergenicRING/U-box superfamily protein
Zm00001d029683RDW182,567,2490exon,nonsynonymousirregular pollen exine1
Zm00001d017279RDW5191,539,2970exon,nonsynonymousPhenylalanine ammonia lyase7
Zm00001d020485RDW7118,512,7030UTR5Golgi SNAP receptor complex member 1
Zm00001d023292RDW102,246,0810exon,synonymousTrigger factor-like protein TIG Chloroplastic
Zm00001d015290RDW, ROV, SUA582,882,7182998intergenicAdagio protein 1
Zm00001d032558RDW, SUA1230,477,44620,619intergenicunknown
Zm00001d028001RDW/SDW119,943,3840UTR3Unknown
Zm00001d031009ROV1173,181,8440intronicProtein tesmin/TSO1-like CXC 2
Zm00001d002751ROV221,486,1130intronicProbable isoaspartyl peptidase/L-asparaginase 3
Zm00001d004960ROV2150,754,7260intronic2-isopropylmalate synthase 1 chloroplastic
Zm00001d005264ROV2166,210,2990exon,synonymousTetratricopeptide repeat (TPR)-like superfamily protein
Zm00001d039693ROV312,060,8380intronicProtein RST1
Zm00001d043059ROV3187,822,5820UTR5Protein FATTY ACID EXPORT 3 chloroplastic,
Zm00001d015779ROV5118,806,0680intronic14-3-3-like protein GF14 omega
Zm00001d035487ROV628,955,5060intronicE3 SUMO-protein ligase SIZ1
Zm00001d032473ROV, RDW, SUA1227,871,0890exon,nonsynonymousCDPK-related kinase 3
Zm00001d053827ROV, TRL4241,936,576-2034intergenicBEACH domain-containing protein B
Zm00001d001901SUA22,610,0940UTR3Reticulon-like protein B11
Zm00001d041070SUA394,781,4310intronic5-methylthioadenosine/S-adenosylhomocysteine deaminase,
Zm00001d013252SUA57,141,3740UTR360 S ribosomal protein L13a-1
Zm00001d015788SUA5119,718,5900UTR3proteasome component4, 26 S protease regulatory subunit S10B
Zm00001d018235SUA5217,144,0200intronicunknown
Zm00001d022502SUA7179,029,1120intronicExocyst complex component Sec. 8
Zm00001d029482TRL172,599,7690intronicNAD(P)-binding Rossmann-fold superfamily protein
Zm00001d004438TRL2111,734,3170intronicPullulanase-type starch debranching enzyme1
Zm00001d040704TRL359,237,0400exon,nonsynonymousATP-dependent DNA helicase
Zm00001d037546TRL6128,905,2540intronicunknown

RDW root dry weight, RDW/SDW root per shoot dry weight, TRL total root length, SUA surface area, ARD average root diameter, ROV root volume, RBN root branching number, Chr chromosome

Candidate genes associated with potential SNPs and functional annotations RDW root dry weight, RDW/SDW root per shoot dry weight, TRL total root length, n class="Chemical">SUA surface area, ARD average root diameter, ROV root volume, RBN root branpan>ching number, Chr chromosome

Expression levels analysis

Five candidate genes including Zm00001d015379, Zm00001d050783, Zm00001d018496, Zm00001d038676, and Zm00001d017751 harboring significant SNPs (with phenotypic contribution rates greater than 12 %) within exonic regions were tested for expression levels using n class="Species">maize accessions with extreme root branching number differences from both GWAS and mapping panels at two growth stages (V1, V3). The relative expression level results obtained through qRT-PCR revealed that three candidate genes viz. Zm00001d038676, Zm00001d015379, and Zm00001d018496 acted as positive regulators for root branching number while two genes viz. Zm00001d050783 and Zm00001d017751 acted as negative regulators for root branching in both GWAS and mapping accessions (Figs. 5 and 6) at all the considered stages (V1, V3). The expression values expressed in fold change of genes Zm00001d038676, Zm00001d015379, and Zm00001d018496 in both GWAS and mapping accessions with high root branching number were significantly higher (P < 0.05, P < 0.01, P < 0.001) than those of accessions with low root branching number at all V1 and V3 stages (Figs. 5 and 6). In contrast, genes Zm00001d050783 and Zm00001d017751 displayed significantly lower expression levels in accessions with high root branching number as compared to low root branching number accessions (Figs. 5 and 6).
Fig. 5

Relative expression levels (mean from three replicates) of five putative candidate genes (1 to 5) at V1 (in red bars) and V3 (in blue bars) growth stages in phenotypically extreme maize accessions for root branching number trait from the GWAS panel. Values of fold difference are shown in mean ± standard deviation (error bar). Relative expression levels were calculated using the 2^(-∆∆ct) method. 1, 2, and 3 are positive regulating genes while 4 and 5 are negative regulating genes. a and b stand for high and low root branching number accessions, respectively. ***, **, and * indicate the significance level for P < 0.001, P < 0.01 and P < 0.05, respectively

Fig. 6

Relative expression levels (mean from three replicates) of five putative candidate genes (1 to 5) at V1 (in red bars) and V3 (in blue bars) growth stages in phenotypically extreme maize accessions for root branching number trait from the biparental population. Values of fold difference are shown in mean ± standard deviation (error bar). Relative expression levels were calculated using the 2^(-∆∆ct) method. 1, 2, and 3 are positive regulating genes while 4 and 5 are negative regulating genes. a and b stand for high and low root branching number accessions, respectively. ***, **, and * indicate the significance level for P < 0.001; P < 0.01 and P < 0.05, respectively

Relative expression levels (mean from three replicates) of five putative candidate genes (1 to 5) at V1 (in red bars) and V3 (in blue bars) growth stages in phenotypically extreme n class="Species">maize accessions for root branching number trait from the GWAS panel. Values of fold difference are shown in mean ± standard deviation (error bar). Relative expression levels were calculated using the 2^(-∆∆ct) method. 1, 2, and 3 are positive regulating genes while 4 and 5 are negative regulating genes. a and b stand for high and low root branching number accessions, respectively. ***, **, and * indicate the significance level for P < 0.001, P < 0.01 and P < 0.05, respectively Relative expression levels (mean from three replicates) of five putative candidate genes (1 to 5) at V1 (in red bars) and V3 (in blue bars) growth stages in phenotypically extreme n class="Species">maize accessions for root branching number trait from the biparental population. Values of fold difference are shown in mean ± standard deviation (error bar). Relative expression levels were calculated using the 2^(-∆∆ct) method. 1, 2, and 3 are positive regulating genes while 4 and 5 are negative regulating genes. a and b stand for high and low root branching number accessions, respectively. ***, **, and * indicate the significance level for P < 0.001; P < 0.01 and P < 0.05, respectively

Discussion

The use of natural variation approaches not only promotes the genetic basis of complex traits and the discovery of new regulators but also, can facilitate the identification of interesting alleles that can be further used to dissect the molecular mechanisms of the genes underlying the trait variation, which may be directly used for breeding purposes [4]. In this study, wide ranges of variations in terms of seedling root related traits were observed at both stages investigated. In regards to all traits, root length and branching number showed the largest phenotypic variations. Thus far, considerable natural phenotypic variations for root system architecture traits in various maize panels have been reported [2, 24, 31, 44, 46, 47]. Broad sense heritability estimates were relatively high with similar tendencies across stages. Recently, inline heritability ranges have been recorded in similar studies regarding maize seedling root traits at various growth stages/time-points both under field and controlled conditions [8, 15]. The traits like root volume, total root length, surface area, and root branching number were found tightly correlated in this current investigation. Positive significant associations between seedling and adult root traits were earlier reported by Abdel-Ghani et al. [17]. These findings were consistent with those observed by several investigators [2, 24, 44]. Root dry weight, surface area and total root length were stated to affect plant’s nutrients and water assimilation and absorption [48]. Most of the earlier root-related studies have been carried out using low-density genetic linkage maps involved commonly simple sequence repeat (SSR) or restriction fragment length polymorphism (RFLP), resulting in scanty inter-marker intervals not efficient enough for accurate QTL and gene identification [49]. In crops, association studies based on LD were previously stated as an effective approach for detecting and identifying SNPs or genes correlated with complex traits, such as roots [50]. GWAS use millions of single nucleotide polymorphisms (SNPs) markers to determine alleles associated with multiple traits in crops and identify genes controlling their expression [51]. In this analysis, a total of 1,490,007 consistent SNP markers were distributed among the ten maize chromosomes to directly mine SNPs and genes putatively associated with seven root architectural traits viz. RDW, RDW/SDW, TRL, SUA, ARD, ROV, and RBN across three vegetative stages (V1, V2, and V3) and using three GWAS models (MLM, EMMAX, and FarmCPU). Abundant polymorphisms and fast LD decay make maize an excellent crop for association studies [52]. The average LD decay distance was 5.2 kb, which specially enhances SNPs and genes mapping accuracy as compared to the reported densities for GBS-SNP markers ranging from 6.2 kb to 100 Mbp [8, 44] in recent maize root-trait QTL studies (last summary) [22]. These results are consistent with the reported average LD decay for inbred maize lines occurring within 1–10 kb [2, 53]. Based on significant SNPs (for multiple testing critical threshold of -log10(P) ≥ 6.0) identified within candidate genes or across at least two stages/methods simultaneously, 53 potential SNPs were detected. Comparatively, EMMAX and FarmCPU were the most efficient and accurate models for detecting 34 SNPs and 19 SNPs, respectively, while MLM model was the least efficient by detecting only one significant SNP. Due to its high stringency, MLM was earlier noted to create type II errors and cause false negatives [2, 44]. Recently, in a similar study, Sanchez et al. monitored fourteen SNPs through FarmCPU and four SNPs by MLM using a panel of 62,077 SNP markers [44]. It was previously indicated that various algorithmic approaches ought to be used to perform GWAS analysis in a real application for complex trait studies due to the limitation in a single model to detect polygenic variation associations [54-56]. A large number of root-related trait SNPs detected in this study are consistent with known genes recorded in previous findings. Using the results of fifteen root QTL studies from nine different bi-parental mapping populations, a meta-analysis study of QTLs recapitulated many putative MQTLs related to maize root development [57]. Interestingly, our detected SNPs were found to be in LD with four noteworthy loci involved in root development throughout developmental stages, namely S1_227871089 with Ax-2 (at bin 1.07), S2_32824965 and S2_43293834 with Rt-6 (at bin 2.04), S3_171057172, and S3_187822582 with Rt-7 (at bin 3.06), and S6_128905254 and S6_128905260 with Rt-13 (at bin 6.05) [57]. SNP S1_227871089 on chromosome 1 significantly associated with ROV, RDW and SUA housed within the candidate gene Zm00001d032473 predicted to encode a CDPK-related kinase 3 enzyme highly expressed in primary root growth [58-60]. This SNP was also in LD with the detected cluster QTLs qRDWv3-1-1 and qRDW/SDWv3-1-1 in addition to qTRSA21-1, qTRL21-1 [19], and RTN1-1 [24]. SNP S4-122501344 associated with RBN was located within gene Zm00001d050783 which codes molybdopterin synthase sulfur carrier subunit. This SNP was also in LD with the detected QTLS qSUAv1-4-1, qSUAv2-4-1, and qROVv2-4-1 in addition to the gene model GRMZM2G32186 [2] associated with several SNPs for root length, which showed high expression in the maize primary root at emergence and V1 stage [61]. Furthermore, SNP S4-122501344 was found to be within another reported gene (GRMZM2G153722) with high expression in seedling root and shoot [61] which contains nine other SNPs associated with root diameter and surface area [2]. SNPs S5_87176006 and S5_205892847 associated with RBN were located within gene models Zm00001d015379 and Zm00001d017751 which encode for an arginine/serine-rich protein 12 and a pentatricopeptide repeat-containing protein At2g15820 like, respectively. These SNPs were in LD with a QTL for crown root length in bin 5.04 detected by Liu et al. [15]. SNPs S6_128905254 and S6_128905260 significantly associated with TRL and RBN were found to be within the gene model Zm00001d037546 with currently unknown function. Its associated synonymous gene (GRMZM2G030235) highly expressed in maize seedling primary root at both VE and V1 stages [58, 61]. These SNPs were furthermore in LD with qARL26-1, qARN26-1 [19] and a QTL for seminal root number in bin 6.05 reported by Liu et al. [15]. SNP S7_118512703 associated with RDW on chromosome 7 was found within Zm00001d020485, a gene encoding a Golgi SNAP receptor complex member 1. This SNP was also in LD with qTRL17-1 [19] and LRL7-1 [24]. Remarkably, the cluster qTRLv1-10-1 and qRBNv1-10-1 associated with total root length and root branching number at V1 stage collocated consistently with the QTL qTRL-10-1 regulating total root length detected recently by Moussa et al. [45] within the same population using 5 days old seedlings. This chromosomal region can, therefore, be considered as a special noteworthy locus that can directly be used in marker-assisted selection programs. When looking for SNPs within identified mutants or cloned genes for maize root developmenpan>t, rth6 [62] was putatively in LD with SNPs S1_127352056 anpan>d S1_173181844 associated with RBN anpan>d ROV; rth5 [63] was in LD with SNPs S3_171057172 anpan>d S3_187822582 associated with RBN anpan>d ROV; rum1 [64, 65] was in LD with SNP S3_209661144 associated with ARD; anpan>d rtpan> class="Chemical">h2 [66] was in LD with SNPs S5_87176006, S5_118806068, S5_119718590, S5_205892847, and S5_82882718 associated with RBN, ROV, SUA, and RDW. Gene expression profile can show whether the gene possesses a biological function [67]. However, several genes have been broadly determined through GWAS, but most of them are new genes without functional verification. Today, qRT-PCR is widely used to validate GWAS-detected gene expression with high accuracy and sensitivity [68, 69]. In this study, five potential genes were checked for expression levels using maize accessions with extreme root branching number differences. At all considered growth stages (V1, V3), three genes viz. Zm00001d038676, Zm00001d015379, and Zm00001d018496 acted as positive regulators for root branching number with significantly higher expression levels in high root branching number maize accessions. Conversely, two other genes viz. Zm00001d050783 and Zm00001d017751 with significantly lower expression in high root branching number accessions were found to act as negative regulating genes. The identified SNPs and evaluated genes by enriching our insights into the molecular mechanisms of maize root architecture, could thus be of great value for future marker-assisted breeding programs. Upcoming research will, however thoroughly elucidate the function of these genes in maize root growth and development.

Conclusions

This study provides a comprehensive analysis of the genetic architecture of root traits in elite maize lines. A recombinant inbred line population and a genome-wide association study panel were used for mapping loci associated with maize root architectural traits at three developmental stages under standard conditions. QTL mapping using inclusive composite interval mapping identified eight QTLs for root traits on chromosomes 1(qRDWv3-1-1 and qRDW/SDWv3-1-1), 2 (qRBNv1-2-1), 4 (qSUAv1-4-1, qSUAv2-4-1, and qROVv2-4-1), and 10 (qTRLv1-10-1, qRBNv1-10-1), and each QTL explained 6.44–8.83 % of phenotypic variation. Genome-wide association study analysis identified 53 substantial SNPs through three GWAS models (EMMAX, FarmCPU, and MLM). The detected SNPs showed individual phenotypic contribution rates ranging from 5.10 to 30.2 % and were significantly associated with RBN (17), ROV (10), SUA (8), RDW (6), TRL (5), ARD (2), and RDW/SDW (1). Within the LD region of 5.2 kb for the significant SNPs, 46 candidate genes were identified. Five promising genes viz. Zm00001d038676, Zm00001d015379, Zm00001d018496, Zm00001d050783, and Zm00001d017751 were identified and successfully verified for expression level. The evaluated genes were shown to serve as positive/negative regulators for root branching in the spring maize lines. Thus, SNPs and underlying genes discovered in the present study could be critical for future marker-assisted breeding programs of high-efficient root systems in maize, as well as supporting the breeding of high-yielding maize varieties.

Methods

Plant materials, growth chamber experiment, and traits measurement

In this study, two genetic populations were used. The GWAS panel comprised 80 natural maize inbred lines covering more than 80 % planting area in Jilin Province (China). All the accessions are the same as defined in our earlier study [52]. The mapping population consisted of 179 RILs developed by a cross between the female parent P014 and the male parent E1312 and continuous inbreeding for 9 generations. P014 line was known to possess a larger and thicker root system with significantly higher root dry weight, total root length, surface area, projected area, average root diameter, and root tips as compared to E1312 line [45]. Genetically pure seeds from both populations were produced at the Jilin Agricultural University’s experimental field in summer 2019. A controlled growth chamber experiment was conducted using a completely randomized design (CRD) with three replications. The growth chamber parameters were: 28/25℃ temperature, 14/10 h (light/darkness) photoperiod, and 70/80 % relative humidity day/night, respectively. The light intensity was set at 200 µmol photons m− 2 s− 1. The seeds from both GWAS and mapping populations were directly planted in polyvinyl chloride (PVC) pipes (8 cm bottom diameter, 3.2 mm thickness, and 25 cm height) containing a mixture of sandy soil and vermiculite (2:1 ratio). Root architectural traits were investigated at three vegetative stages of maize growth namely V1 (one fully expanded leaf), V2 (two fully developed leaves), and V3 (three fully developed leaves) which approximately corresponded to 5, 15 and 25 days after emergence, respectively. For data collection, each PVC pipe consisted of one seedling was considered an experimental unit and the growth chamber independent trials were completed in February, April, and July 2020. All the experiments were repeated three times to increase the reliability of root quantitative traits measurements. The seedlings were cautiously removed from soil at each specific stage and washed with running water to eliminate the soil residues. For each seedling, the root system was separated from the shoot, and roots were scanned using a root scanner-based image (Perfection V800 Epson, resolution of 12,800 dots per inch (dpi: 5039.37 dots per cm)) then analyzed via DJ-GXG02 software [70]. If the collection of data could not be carried out within one single day, seedlings were kept in 30 % ethanol by swamping the roots and then placed in a cold chamber (4℃) with reference to Sanchez et al. [44]. A total of seven root related traits namely root dry weight, root to shoot dry weight, total root length, surface area, root volume, average root diameter, and root branching number were collected (Table 8). For dry weight biomass, root and shoot were recorded individually using an electronic weighing balance once drying in an oven set at 75℃ for at least 48 h to achieve the constant weight.
Table 8

Collected root related traits initials and illustrations

Trait NameAbbreviationsTrait description
Root dry weightRDWTotal root dry weight of the seedling in gram
Root to shoot dry weightRDW/SDWRoot to shoot dry weight ratio in gram
Total root lengthTRLCumulative length of the root system in cm
Surface areaSUAWhole root system surface area in cm2
Root volumeROVCumulative volume of all the roots in cm3
Average root diameterARDAverage diameter of the entire root system in mm
Root branching numberRBNTotal number of all the root tips
Collected root related traits initials and illustrations

Phenotypic analysis of root related traits

The descriptive analyses were carried out using Minitab17 program (Minitab Inc., State College, PA, USA). For every root related trait at each stage and in both populations, descriptive statistics including mean, standard deviation, maximum, minimum, skewness, kurtosis, and coefficient of variation were calculated. The broad-sense heritability (n class="Chemical">H2) was determined using variance components obtained via ANOVA as described by Pace et al. [2]. Normal distribution and Pearson coefficients of correlation among traits were also generated. Figures were produced using GraphPad Prism 8.0.1 (GraphPad Software, Inc., San Diego, CA).

Genotyping, linkage map construction, and QTL mapping

The 179 RILs were genotyped by sequencing (GBS), and MSTmap was used for linkage analysis of marker data [71]. Linkage map was constructed based on automatic parameter settings with reference to Meng et al. [72]. Briefly, the markers were grouped at a LOD of ≥ 3.0, ordered, rippled, and then outputted to generate the linkage map. QTL IciMapping 4.1 software was used to perform QTL analysis using inclusive composite interval mapping for additive QTL (ICIM-ADD) [73]. The walking speed was 1.0 cM and the size of the windows was 5.0 cM. A LOD threshold peak score value of ≥ 2.5, which is commonly used in maize QTL mapping [49, 74], was set to declare significant QTLs. QTL additive effects and phenotypic variance explained (PVE) were also analysed.

Next-generation sequencing of GWAS population genome

Genomic DNA was extracted from leaves for the 80 maize inbred lines using a cetyltrimethylammonium bromide (CTAB) protocol for plant tissues [75]. The different accessions were genotyped via next-generation sequencing at Novogene Biological Company (https://novogene.com/, Beijing, China). Briefly, for genomic libraries construction, the DNA samples were digested by sonication to a size of 350 bp, the genomics fragments were then subjected to end polishing and A-tailing afterward ligated to the full-length adapter for Illumina HiSeq PE150 platform (Illumina Inc., San Diego, CA, USA). Subsequently, the libraries were analyzed using Agilent 2100 Bioanalyzer and high-consistent sequencing data that show polymorphisms were then mapped to the B73 maize reference genome (RefGen_v3) using BWA software [76]. Duplicates were expurgated using SAMtoots [77]. Thus, 34,872,961 SNPs were obtained from next-generation sequencing analysis. Finally, using the Bayesian model mpileup of SAMtools with the criteria of SNP missing rate of < 10 % and minor allele frequency (MAF) of ≥ 5 %, a final total of 1,490,007 high-consistent SNP markers were obtained for genetic evolution and GWAS analyses.

Population structure and linkage disequilibrium analysis

Based on a total of 1,490,007 high-consistent SNP markers, the principal component analysis (PCA) of individuals was performed using genome-wide complex trait analysis software tool (GCTA) [78]. The distances between the materials were inferred based on the distance matrix for phylogenetic tree contruction using TreeBeST program (http://treesoft.sourceforge.net/treebest.shtml/). Using PLINK [79], the decay of linkage disequilibrium measured in base pairs was calculated on each chromosome using an r2 value of 0.1 as a cut-off.

GWAS analysis

In this study, to control false positive or spurious associations, three GWAS models were implemented, viz.: (1) Mixed Linear Model (MLM) [80], where PCA (Q) from population structure and kinship (K) were used as covariates; (2) Efficient Mixed-Model Association eXpedited (EMMAX), which outperforms both PCA and genomic control in correcting for sample structure with high statistical power [81] and; (3) Fixed and random model Circulating Probability Unification (FarmCPU), where PCA (as a fixed effect) and kinship (as a random effect) were used as covariates [82]. GWAS using the MLM model and EMMAX model was performed using the software TASSEL 5.0 [83], and FarmCPU method was performed using R FarmCPU package [82]. The threshold was defined based on the number of effective SNP markers. Multiple testing using simpleM program integrated in R, which calculates the number of informative SNP markers (Meff_G) was used to set the threshold. Briefly, a correlation matrix for all 1,490,007 SNPs was generated and the corresponding eigenvalues were calculated; a composite LD correlation was then calculated directly using GAPIT package and once this SNP matrix was obtained, Meff_G was calculated and this value was used to compute for the multiple testing threshold in the same way as the Bonferroni correction method, where the significance threshold (α = 0.05) was divided by the Meff_G (α /Meff_G) [2, 44]. Finally, after adjusting, P ≤ 0.000001(-log10 (P) ≥ 6) was set as the critical threshold to detect SNPs significantly associated with the different root morphological traits.

Candidate gene identification

Candidate genes with SNP in coding regions and which could promote mutation were considered as high priority candidate genes. MaizeGDB (http://www.maizegdb.org/) and Gramene (https://www.gramene.org/) databases were used for predicting functional annotations of the candidate genes with reference to B73_RefGen_v4 [84, 85].

Expression analysis

High priority candidate genes were chosen based on the existence of substantial SNPs within the exon of candidate genes and phenotypic contribution rates greater than 12 %. For expression analysis, ten maize accessions with extreme root branching number differences from both GWAS and biparental populations were chosen. There were M08, M40, M25, M06, M35, M44, M63, M52, M56, and M54 from the GWAS population. From the mapping population, there were P014 (female parent), E1312 (male parent), C055, C039, C0143, C096, C004, C070, C152, and C010. The roots were sampled at two different stages (V1 and V3). Total RNA was extracted using the Trizol method, cDNA was reverse transcribed the All-in-One First-Strand cDNA Synthesis kit (GeneCopoeia Inc., USA) following the standard protocol. Oligonucleotide primers were designed with PrimerPremier5.0 (http://www.premierbiosoft.com) (Additional file 6: Table S5). The qRT-PCR protocol was performed in a total volume of 20 µL: 2µL cDNA, 2 µL of each forward and reverse primer, 10 µL qPCR Master Mix, and 4µL ddH2O. The qRT-PCR Thermo cycling conditions were: initial denaturation at 95 °C for 30 s, 40 cycles of 95 °C for 5 s, annealing at 58 °C for 30 s, extension at 72 °C for 15 s, and an infinite hold at 10 °C. Leunig was used as the internal control gene and the relative expression levels were calculated using the 2^(-∆∆ct) method [86]. All experiments were conducted in triplicates at each specified growth stage. Additional file 1: Table S1. Phenotypic data of the seven root-related traits from the mapping population at V1, V2 and V3 stages. Additional file 2: Table S2. Phenotypic data of the seven root-related traits from the GWAS panel at V1, V2 and V3 stages. Additional file 3: Table S3. Physical positions of the identified QTLs. Additional file 4: Table S4. List of all significant SNPs detected in this study and their associated genes. Additional file 5: Figure S1. Q-Q (quantile-quantile) plots of all traits at V1, V2 and V3 stages. Additional file 6: Table S5. List of n class="Chemical">oligonucleotide primers used for the qRT-PCR assay for the evaluated canpan>didate genpan>es.
  64 in total

Review 1.  Steep, cheap and deep: an ideotype to optimize water and N acquisition by maize root systems.

Authors:  Jonathan P Lynch
Journal:  Ann Bot       Date:  2013-01-17       Impact factor: 4.357

2.  Genomic prediction of seedling root length in maize (Zea mays L.).

Authors:  Jordon Pace; Xiaoqing Yu; Thomas Lübberstedt
Journal:  Plant J       Date:  2015-09       Impact factor: 6.417

3.  Association analysis of single nucleotide polymorphisms in candidate genes with root traits in maize (Zea mays L.) seedlings.

Authors:  Bharath Kumar; Adel H Abdel-Ghani; Jordon Pace; Jenaro Reyes-Matamoros; Frank Hochholdinger; Thomas Lübberstedt
Journal:  Plant Sci       Date:  2014-04-03       Impact factor: 4.729

4.  Mapping QTLs for root system architecture of maize (Zea mays L.) in the field at different developmental stages.

Authors:  Hongguang Cai; Fanjun Chen; Guohua Mi; Fusuo Zhang; Hans Peter Maurer; Wenxin Liu; Jochen C Reif; Lixing Yuan
Journal:  Theor Appl Genet       Date:  2012-06-21       Impact factor: 5.699

5.  The Sequence Alignment/Map format and SAMtools.

Authors:  Heng Li; Bob Handsaker; Alec Wysoker; Tim Fennell; Jue Ruan; Nils Homer; Gabor Marth; Goncalo Abecasis; Richard Durbin
Journal:  Bioinformatics       Date:  2009-06-08       Impact factor: 6.937

6.  Association genetics of chilling injury susceptibility in peach (Prunus persica (L.) Batsch) across multiple years.

Authors:  Arun Prabhu Dhanapal; Carlos H Crisosto
Journal:  3 Biotech       Date:  2013-01-05       Impact factor: 2.406

7.  An updated gene atlas for maize reveals organ-specific and stress-induced genes.

Authors:  Genevieve M Hoopes; John P Hamilton; Joshua C Wood; Eddi Esteban; Asher Pasha; Brieanne Vaillancourt; Nicholas J Provart; C Robin Buell
Journal:  Plant J       Date:  2019-01-22       Impact factor: 6.417

8.  Fast and accurate short read alignment with Burrows-Wheeler transform.

Authors:  Heng Li; Richard Durbin
Journal:  Bioinformatics       Date:  2009-05-18       Impact factor: 6.937

9.  GLO-Roots: an imaging platform enabling multidimensional characterization of soil-grown root systems.

Authors:  Rubén Rellán-Álvarez; Guillaume Lobet; Heike Lindner; Pierre-Luc Pradier; Jose Sebastian; Muh-Ching Yee; Yu Geng; Charlotte Trontin; Therese LaRue; Amanda Schrager-Lavelle; Cara H Haney; Rita Nieu; Julin Maloof; John P Vogel; José R Dinneny
Journal:  Elife       Date:  2015-08-19       Impact factor: 8.140

10.  Genome-wide association and high-resolution phenotyping link Oryza sativa panicle traits to numerous trait-specific QTL clusters.

Authors:  Samuel Crowell; Pavel Korniliev; Alexandre Falcão; Abdelbagi Ismail; Glenn Gregorio; Jason Mezey; Susan McCouch
Journal:  Nat Commun       Date:  2016-02-04       Impact factor: 14.919

View more
  2 in total

1.  Deciphering the Genetic Basis of Root and Biomass Traits in Rapeseed (Brassica napus L.) through the Integration of GWAS and RNA-Seq under Nitrogen Stress.

Authors:  Nazir Ahmad; Bin Su; Sani Ibrahim; Lieqiong Kuang; Ze Tian; Xinfa Wang; Hanzhong Wang; Xiaoling Dun
Journal:  Int J Mol Sci       Date:  2022-07-19       Impact factor: 6.208

2.  Combining datasets for maize root seedling traits increases the power of GWAS and genomic prediction accuracies.

Authors:  Leandro Tonello Zuffo; Rodrigo Oliveira DeLima; Thomas Lübberstedt
Journal:  J Exp Bot       Date:  2022-09-12       Impact factor: 7.298

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.