Literature DB >> 31581635

Integrating GWAS and Gene Expression Analysis Identifies Candidate Genes for Root Morphology Traits in Maize at the Seedling Stage.

Houmiao Wang1,2, Jie Wei3, Pengcheng Li4,5, Yunyun Wang6, Zhenzhen Ge7, Jiayi Qian8, Yingying Fan9, Jinran Ni10, Yang Xu11,12, Zefeng Yang13,14,15, Chenwu Xu16,17,18.   

Abstract

Root system plays an essential role in water and nutrient acquisition in plants. Understanding the genetic basis of root development will be beneficial for breeding new cultivars with efficient root system to enhance resource use efficiency in maize. Here, the natural variation of 13 root and 3 shoot traits was evaluated in 297 maize inbred lines and genome-wide association mapping was conducted to identify SNPs associated with target traits. All measured traits exhibited 2.02- to 21.36-fold variations. A total of 34 quantitative trait loci (QTLs) were detected for 13 traits, and each individual QTL explained 5.7% to 15.9% of the phenotypic variance. Three pleiotropic QTLs involving five root traits were identified; SNP_2_104416607 was associated with lateral root length (LRL), root surface area (RA), root length between 0 and 0.5mm in diameter (RL005), and total root length (TRL); SNP_2_184016997 was associated with RV and RA, and SNP_4_168917747 was associated with LRL, RA and TRL. The expression levels of candidate genes in root QTLs were evaluated by RNA-seq among three long-root lines and three short-root lines. A total of five genes that showed differential expression between the long- and short-root lines were identified as promising candidate genes for the target traits. These QTLs and the potential candidate genes are important source data to understand root development and genetic improvement of root traits in maize.

Entities:  

Keywords:  GWAS; RNA-seq; candidate genes; maize; quantitative trait locus; root traits

Mesh:

Year:  2019        PMID: 31581635      PMCID: PMC6826382          DOI: 10.3390/genes10100773

Source DB:  PubMed          Journal:  Genes (Basel)        ISSN: 2073-4425            Impact factor:   4.096


1. Introduction

Over the next 30 years, the global human population is expected to grow by 25% and reach 10 billion, but the current pace of yield increase for major crops is insufficient to meet future demand [1]. Water shortages, nutrient deficiency, and lodging caused by extreme weather can severely reduce crop productivity and threaten global food security [2]. Root systems provide mechanical support, uptake water and nutrients, and interact with rhizosphere microbes and are vital for plant growth and adaptation [3]. After germination, root systems continue to expand in size to become structurally and functionally complex and sometimes may be bigger than above-ground parts [4]. Root systems have developmental plasticity and are capable of altering root branching, root angle, and root growth rates to adapt to adverse soil conditions [5]. The root system architecture is instrumental for superior productivity, but it remains a largely untapped resource in plant breeding programs [6]. The complex maize root system comprises five main types of roots: primary, seminal, crown, brace, and lateral roots. Embryonic roots, including primary and seminal roots, are important for maize early development. The relationship between early root development and crop productivity and nutrient use efficiency has been revealed by genetic and phenotypic studies [7,8,9]. Crown and brace roots have a major role in anchorage and soil resource acquisition in later growth [10]. Lateral roots are a major determinant of root architecture. Maize genotypes with reduced lateral root branching showed high drought tolerance, whereas long lateral roots were optimal for nitrate acquisition, and short lateral roots were optimal for phosphorus acquisition [11]. Root architecture involves complex, multicomponent, and interconnected traits, and optimal root system architecture has a major impact on soil resource usage and yield productivity [12]. Although root traits can play key roles in enhancing resource use efficiency and plant adaptation, root traits have seldom been considered as selection criteria in maize breeding programs, mainly because only a few root genes have been identified. The monogenes rtcs, rtcl, rum1, Bige1, rth1, rth3, rth5, and rth6 have been cloned in maize [6]. In rice, a large number of genes related to root development have been identified (http://www.ricedata.cn/ontology/ontology.aspx?ta=GO:0048364). For example, ARL1, CRL4, OsPIN1, OsIAA23, and OsCAND1, which are involved in auxin signaling, play essential roles in crown root development [13]. Several studies showed that genes involved in sugar metabolism (OsGNA1, OsMOGS, and OsDGL1) and cell wall biosynthesis (OsGLU3 and OsEXPA8) play important roles in root elongation [14]. Recently, two transcription factors OsMADS25 and OsSPL3 were found to control primary root length, lateral root density, and crown root development [13,15]. Most of these genes have not been targeted for breeding improvement and only two quantitative trait loci (QTLs), deeper rooting 1 (DRO1) and phosphorus starvation tolerance 1 (PSTOL1), have been used to improve drought and low phosphorus tolerance in rice [16,17]. In maize, although a large number of root QTLs have been mapped, no QTLs have been cloned. Because maize displays abundant genetic diversity and rapid linkage disequilibrium (LD) decay, it has become the model crop for association analysis, and several genes associated with complex traits have been identified [18,19,20]. This approach has been successfully used to pinpoint root genes. By integrating genome-wide association studies (GWAS) and gene expression analyses, Meijón et al. (2013) identified a new F-box gene KUK that was associated with root cell length in Arabidopsis [21]. A brassinosteroid signaling kinase BSK3 that modulated root elongation under mild nitrogen deficiency also was identified by GWAS [22]. In rice, GWAS was conducted for 21 root traits under normal and drought stress, and 143 significant associations were identified, including 11 reported root-related genes [23]. Additionally, new causal genes Nal1 and OsJAZ1 were validated by sequence variation, expression, and transgenic experiments [23]. Forty-four and 97 QTL candidate genes for root length and root thickness were identified by GWAS, and linkage mapping, genomic, transcriptomic, and haplotype data were obtained; five of the root length QTL candidates were validated by T-DNA insertional mutation [24]. In maize, GWAS was conducted for 22 root traits in 384 inbred lines, and 268 marker–trait associations were identified. GRMZM2G153722, which is located on chromosome 4, contained nine significant markers and was the most likely candidate gene; however, no further validation was conducted [25]. Therefore, many loci that affect root development remain unknown in maize. Another reason for the poor consideration of roots in breeding programs is that root traits are difficult to evaluate in the field. Roots are hidden in soil, and their exceptional degree of plasticity to changing soil conditions makes them inaccessible to direct genetic analyses [6]. Although many new approaches have been developed to evaluate root traits in both laboratory and field conditions [26], paper roll under hydroponic conditions was found to be the best system to access root phenotypes at the seedling stage. These indoor culture systems allow high-throughput and accurate evaluation of root traits for large-scale genetic screens [27]. In this study, a panel of 297 maize inbred lines [28,29] was used to conduct GWAS to investigate maize root architecture at the seedling stage. RNA-seq was conducted with three long-root lines and three short-root lines to obtain the expression levels of candidate genes. The objectives of this study were (i) to study phenotypic variation of 16 root and shoot traits within a maize association panel, (ii) to identify significant SNPs associated with root traits, and (iii) to employ GWAS and RNA-seq data to explore potential candidate genes for root development.

2. Materials and Methods

2.1. Plant Materials and Growth Conditions

A total of 297 maize inbred lines [28,29] were evaluated for root traits and used for GWAS. Root traits were measured in a paper roll system as described previously [27,29]. Briefly, seeds of similar size were selected for germination and surface sterilized with 10% H2O2 solution for 20 min. The sterilized seeds were washed twice with distilled water, soaked in saturated CaSO4 for 6 h, then germinated in the dark on moist filter paper at 28 °C and 80% relative humidity. After 2 days, eight germinated seeds from each of the 297 genotypes were vertically rolled in two-layer brown germination roll paper (Anchor Paper Company, St Paul, MN, USA). The germination paper rolls were placed in 39.5 × 29.5 × 22.5 cm black incubators containing 7.5 L nutrient solution, as shown in Table S1. The nutrient solution was renewed every 3 days. Plants were grown in a greenhouse located on the campus of Yangzhou University in September to October in 2018. The experiment was conducted in a completely randomized design with two replicates.

2.2. Plant Phenotyping and Data Analysis

Seedlings were harvested at 14 days after germination, and the roots were separated from the shoots and stored at 4 °C. Roots were also scanned to produce high resolution images that were analyzed using WinRHIZO software (Pro 2004b, Quebec, Canada). To determine root and shoot dry weights, roots and shoots were collected separately and dried at 105 °C for 30 min and then at 55 °C until a constant weight was achieved. In total, 16 traits including 13 root traits and 3 shoot traits were measured (Table 1). Statistical analysis was performed using the R software package (R Development Core Team 2013). Mean values of each line were used for subsequent phenotypic summarization, correlation analysis, principal component analysis, and trait–marker associations. Principal component analyses were performed using the “prcomp” function in R with further visualization performed using the “factoextra” package.
Table 1

Summary of the root and shoot traits and their measurements.

AbbreviationFull NameUnitMeasuring Method
PHPlant heightcmmeasured from the coleoptilar to yhe tip of the longest leaf
SPADLeaf chlorophyll concentrations-measured using a SPAD-502 PLUS chlorophyll (Minolta, Japan)
SDWShoot dry weight10 mgmeasured by electronic balance
ARDAverage root diametermmmeasured by image analysis software (WinRhizo Pro 2009)
PRLPrimary root lengthcmmeasured by a ruler
SRLSeminal roots lengthcmmeasured by a ruler
ASRLAverage seminal root lengthcmestimated by SRL and SRN
SRNSeminal roots numbersnumberCount of the seminal roots
LRLLateral root lengthcmestimated by TRL, PRL and SRL
TRLTotal root lengthcmmeasured by image analysis software
RL005L ≤ 0.5 (Root length between 0 and 0.5 mm in diameter)cmmeasured by image analysis software
RL05100.5 < L ≤ 1.0 (Root length between 0.5 mm and 1 mm in diameter)cmmeasured by image analysis software
RL10151.0 < L ≤ 1.5 (Root length greater than 1 mm in diameter)cmmeasured by image analysis software
RVRoot volumecm3measured by image analysis software
RARoot surface areacm2measured by image analysis software
RDWRoot dry weight10 mgmeasured by electronic balance

2.3. Genotypic Data and Genome-Wide Association Analysis

The panel of 297 inbred lines was genotyped using the genotyping-by-sequencing strategy [28,29]. The MseI restriction enzyme was used for library preparation, and GBS was performed on an Illumina platform by Novogene Bioinformatics Institute, Beijing, China. The clean reads from each sample were mapped to the B73 RefGen_V3 reference genome sequence (AGPv3, release 31) by BWA with the settings ‘mem -t 4 -k 32 -M’ [30]; variant calling was performed for all samples using the SAMtools software (V1.3.1) [31]. After quality control (missing rate ≥20%; minor allele frequency ≥0.05) 131,271 SNPs remained for GWAS. Principal component analysis was performed using TASSEL 5.0, and the top five principal components were used to create a population structure matrix to control population structure. Kinship matrices were calculated using the Centered-IBS method in TASSEL 5.0 to estimate genetic relatedness among individuals. We performed GWAS with a mixed linear model (MLM) with principal components and kinship and a general linear model (GLM) with principal components in TASSEL 5.0. Because several SNPs were in LD, the effective numbers of independent SNPs (35,831) was estimated using the GEC software tool (V0.2) [32] with the significant P-value threshold set as 2.79 × 10−5 (1/independent marker number). Considering the complexity of root traits, a suggestive P-value threshold (1 × 10−4) was used to avoid ignoring minor effect loci. The decay distance of LD across the whole genome was determined by software PopLDdecay [33]. Based on the average LD decay of 50 kb, significant SNPs within 50 kb were binned into a QTL, and the most significant SNP were selected as the lead SNP.

2.4. Candidate Gene Analysis and Transcriptome Sequencing

All the potential candidate genes within 100 kb (50-kb upstream and 50-kb downstream of the lead SNP) of the detected loci were identified. Gene annotation information from maize GDB database (http://www.maizegdb.org) was used to assign functions to the candidate genes. The physical locations of the genes and SNPs were based on the maize B73 RefGen_V3 genome (AGPv3, release 31). Three long-root lines and three short-root lines were used for the transcriptomic analysis. The six lines were planted in the paper roll system described above. Fourteen days after germination, roots of each line were sampled for total RNA extraction using an RNeasy R Plant Mini kit (Qiagen, Shanghai, China). High quality RNA was used to construct an RNA-seq library for each sample using a TruSeq RNA sample preparation kit (Illumina). Clean data were obtained by removing adapters, low-quality reads, and reads containing poly-N from the raw data. High-quality reads were aligned to the B73 RefGen_V3 reference genome sequence (AGPv3, release 31) using HISAT2 (V2.1.0). FeatureCounts in the Subread package (v1.6.5) was used to count the read numbers mapped to each gene [34]. The FPKM (fragments per kilobase of exon model per million mapped reads) of each gene was calculated based on the length of the gene and read count mapped to it [35]. The R package ‘edgeR’ was used to identify differentially expressed genes. Differentially expressed genes were identified by calculating the log2 fold change (FC) between the genes from the long- and short-root lines. DEGs were selected with |log2 (FC)| > 1 and with statistical significance P < 0.05. SNP calling from RNAseq data was performed with ‘the GATK Best Practices workflow for SNP and indel calling on RNAseq data’ [36]. The identified SNPs were annotated using snpEff [37]. The sequencing data were deposited in the NCBI Short Read Archive database with the accession number PRJNA558447. The basic information of the transcriptomic data is provided in Supplementary Dataset 3.

3. Results

3.1. Phenotypic Analysis of Root Traits at the Seedling Stage

To dissect the genetic basis of root traits at the maize seedling stage, 297 inbred maize lines were cultured in paper rolls for two weeks. A total of 18 traits, including shoot dry weight (SDW), total root length (TRL), lateral root length (LRL), primary root length (PRL), average root diameter (ARD), root surface area (RA), root volume (RV), and root dry weight (RDW), were measured (Table S1). All the captured traits showed a continuous distribution with a slight left skew (Figure 1). For most of the traits, considerable phenotypic variation was detected among the lines, with coefficients of variation that ranged from 12.01% for PRL to 46.22% for RL1015. PRL varied from 21.27 to 41.98 with an average of 30.12, and RL1015 varied from 1.24 to 15.32 with an average of 5.43. All traits exhibited >2-fold differences and ranged from 2.02 to 21.36 among the lines (Table 2).
Figure 1

Distribution of root and shoot traits in 297 maize inbred lines.

Table 2

Phenotypic variation for root and shoot traits of 297 maize inbred lines.

TraitMeanSDMinMaxFold ChangeCV aANOVA
MS (Genotype)MS (Error)F (Genotype)
PH32.475.7418.8048.692.5917.67%63.367.708.23 ** b
SPAD28.473.8617.0538.152.2413.55%28.829.233.12 **
SDW8.862.603.8517.124.4529.34%12.342.385.18 **
ARD0.440.060.330.662.0014.16%0.010.0011.79 **
PRL30.123.6221.2742.982.0212.01%25.684.116.24 **
SRL67.0822.716.60141.0021.3633.85%967.54313.633.09 **
ASRL23.954.4011.7134.832.9718.39%31.877.554.22 **
SRN2.810.761.005.335.3327.19%1.090.601.83 **
LRL265.88111.4634.35687.1220.0041.92%23267.841826.3412.74 **
TRL319.86114.7978.31756.819.6635.89%24982.711914.5913.05 **
RL005236.7896.3242.79599.8414.0240.68%17497.931603.3410.91 **
RL051075.6026.6016.03143.268.9435.18%1372.08173.747.90 **
RL10155.432.511.2415.3212.3546.22%12.232.395.12 **
RV0.450.130.180.894.9428.60%296.3930.669.67 **
RA41.9712.7214.3490.756.3330.31%0.030.008.23 **
RDW9.152.432.4015.756.5626.52%11.741.667.09 **

a coefficient of variation. b indicates a significant effect was found by F test (P < 0.01).

Phenotypic correlation analysis showed that most measured traits were closely positively correlated with each other, expect ARD (Figure S1). Strong positive correlations were found for TRL, LRL, RL005, RL0510, RL1015, RV, and RA (r = 0.45–0.99). Principal component analyses were conducted for 16 traits, and three major principal components that accounted for more than 70% of the phenotypic variance were detected (Figure S2). The first dimension (Dim1) represented 49.9% of the variability and accounted primarily for nine traits RA, TRL, LRL, RL005, RV, RL0510, SDW, PH, and SRL. Dim2 explained 12.2% of the variation and accounted mainly for ARD (Figure S2). The results are consistent with the phenotypic correlation analysis.

3.2. Genome-Wide Association Studies

A set of 131,271 filtered SNPs with minor allele frequencies >0.05 were used for GWAS with GLM and MLM. A total of 355 and 28 marker–trait associations were identified by GLM and MLM, respectively (Supplementary Dataset 1). As expected, MLM was too strict, whereas there were a large number of false positives for GLM because of the population structure. We found that 27 of the 28 significant associations from MLM also were detected by GLM. Some close-by SNPs that showed strong LD could not be considered as separate loci, so the 28 significant associations were clumped into 21 significant QTLs based on an average LD decay of 50 kb (Figure S3). To avoid missing SNPs because of the strict MLM, we defined suggestive QTLs as those with at least one SNP [−log10(P) > 4] in MLM and two SNPs [−log10(P) > 4.55] in GLM within 50-kb distances of each other. For example, qPRL1 was selected as a suggestive QTL because in a 50-kb genomic region, two SNPs were associated with PRL in MLM [−log10(P) > 4], and the same two SNPs were identified in GLM [−log10(P) > 4.55] (Table 3; Figure 2). A total of 21 significant QTLs and 13 suggestive QTLs were detected for 13 traits; except SRL, SDW, and PH (Table 3; Figure 2). These QTLs were distributed on all 10 maize chromosomes with the highest number on chromosomes 4 and 8, each of which contained eight QTLs (Figure 2). Individually, these QTLs explained between 5.7% and 15.9% of the phenotypic variance (Table 3). A total of 19 QTLs were detected for seven root length traits, namely, TRL (2), LRL (2), RL005 (3), RL0510 (3), RL1015 (2), PRL (3), and ASRL (4). QTLs for TRL were completely coincident with QTLs for LRL. Among the 19 QTLs, nine explained more than 10% of the phenotypic variation (Table 3). Three QTLs were detected for ARD, and the most significant association was for SNP_1_213315833, which explained 8.5% of the phenotypic variance. We found 3, 2, and 2 QTLs that were associated with RA, RV, and RDW, respectively. The most significant SNP for RA was SNP_2_104416607, which explained 14.65% of the phenotypic variation. For SRN and SPAD, 2 and 3 QTLs were found, respectively. Three pleiotropic QTLs involving five root traits were identified; SNP_2_104416607 was associated with LRL, RA, RL005, and TRL; SNP_2_184016997 was associated with RV and RA; and SNP_4_168917747 was associated with LRL, RA, and TRL (Figure 2).
Table 3

QTLs significantly associated with root traits detected by GWAS.

TraitQTLLead SNPChr.Pos. aAllele b−log10(P)R2
ARD qARD1 SNP_1_2133158331213315833G/T4.678.50
qARD3 SNP_3_1473970473147397047A/C4.637.96
qARD4 SNP_4_2190996744219099674C/T4.1612.61
ASRL qASRL1 SNP_1_1219962111121996211C/T4.359.49
qASRL3 SNP_3_1078997173107899717C/T4.176.33
qASRL4 SNP_4_1859429134185942913A/T4.897.01
qASRL10 SNP_10_342449431034244943A/G4.747.05
LRL qLRL2 SNP_2_1044166072104416607C/T4.5214.97
qLRL4 SNP_4_1689177474168917747C/T5.399.27
PRL qPRL1 SNP_1_1203087121120308712C/T4.306.34
qPRL10-1 SNP_10_673503341067350334G/T4.006.54
qPRL10-2 SNP_10_685872471068587247A/G5.389.29
RA qRA2-1 SNP_2_1044166072104416607C/T4.4514.65
qRA2-2 SNP_2_1840169972184016997C/T4.056.99
qRA4 SNP_4_1689177474168917747C/T4.577.58
RDW qRDW3 SNP_3_1568382403156838240C/T4.708.29
qRDW4 SNP_4_713581947135819A/C4.578.39
RL005 qRL005_2 SNP_2_1044166072104416607C/T4.8615.87
qRL005_4 SNP_4_33081784433081784C/T4.4812.29
qRL005_6 SNP_6_63998692663998692A/G4.9710.66
RL0510 qRL0510_1-1 SNP_1_22179639122179639A/G4.629.52
qRL0510_1-2 SNP_1_2175825721217582572G/T4.8110.10
qRL0510_2 SNP_2_50742849250742849A/G5.1212.96
RL1015 qRL0510_4 SNP_4_34568124434568124G/T4.0511.43
qRL0510_6 SNP_6_1536966126153696612A/G5.0113.90
RV qRV2 SNP_2_1840169972184016997C/T4.397.65
qRV6 SNP_6_54506432654506432A/G4.659.37
SPAD qSPAD5 SNP_5_25046214525046214C/T5.128.17
qSPAD8 SNP_8_1603714258160371425A/G4.386.36
qSPAD9 SNP_9_483281494832814A/T4.848.36
SRN qSRN2 SNP_2_52559869252559869C/T4.015.65
qSRN8 SNP_8_19589120819589120C/T5.429.39
TRL qTRL2 SNP_2_1044166072104416607C/T4.6415.22
qTRL4 SNP_4_1689177474168917747C/T5.138.60

a Position in base pairs for the lead SNP according to B73 RefGen_V3. b Underlined bases are the favorable alleles.

Figure 2

Manhattan plots of mapped single nucleotide polymorphisms (SNP) markers associating with each trait. (A) Mixed Linear Model (MLM), (B) General Linear Model (GLM). Abbreviations for traits are as follows: ARD, average root diameter; ASRL, average seminal root length; LRL, lateral root length; PRL, primary root length; RA, root surface area; RDW, root dry weight; RL005, root length between 0 and 0.5 mm in diameter; RL0510, root length between 0.5mm and 1mm in diameter; RL1015, root length greater than 1 mm in diameter; RV, root volume; SPAD, leaf chlorophyll concentrations measured by SPAD-502 PLUS chlorophyll (Minolta, Japan); SRN, seminal roots numbers; TRL, total root length.

3.3. Determination of Candidate Genes

A total of 96 candidate genes were located within the ± 50-kb (Figure S3) genomic regions of the lead SNPs for the identified QTLs (Supplementary Dataset 2). To reduce the number of candidate genes, three long-root lines and three short-root lines were selected from among the 297 inbred lines and performed RNA-seq to evaluate the whole-genome gene expression levels (Figure 3). A total of 4458 DEGs were identified between long-root lines and short-root lines (Supplementary Dataset 3). Within loci identified by GWAS, we detected 78 genes that were not expressed in all lines and 5 genes were differential expressed between the long- and short-root lines (Table 4). Further, the variants were identified from the RNA-seq data, and a total of 805,328 SNPs were detected in the whole genome, including 1083 SNPs located in 42 candidate genes for the identified root QTLs (Supplementary Dataset 4 and 5). A total of 84 SNPs located in UTR (untranslated regions) and CDS (coding DNA sequence) were identified for five DEGs. The pleiotropic marker SNP_2_104416607 on chromosome 2 was significantly associated with four root traits (LRL, RA, RL005, and TRL), and the increased allele (T) increased the phenotype by 15% relative to the alternative allele (Figure 4). Three genes (GRMZM2G112828, GRMZM2G112838, and GRMZM2G397965) were detected in the 104,366,607–104,466,607 bp region around SNP_2_104416607 on chromosome 2. The transcriptome analysis showed that GRMZM2G112828 and GRMZM2G112838 were not expressed, and GRMZM2G397965 was up-regulated with log2(FC) = 2.78 in the long-root lines compared with the short-root lines. Eight SNPs, including three non-synonymous mutations and two mutations in untranslated regions, may affect expression level of GRMZM2G397965 (Table 4). SNP_3_ 147397047 was found to be significantly associated with ARD, and four genes (AC184770.3_FG001, GRMZM2G138258, GRMZM2G138338 and GRMZM2G138342) were identified in the 147,347,047–147,447,047 bp region around SNP_3_ 147397047 on chromosome 3. GRMZM2G138338 was up-regulated with log2(FC) = 1.38 in the long-root lines (Table 4). Eight non-synonymous mutations were detected in this gene. Two DEGs, GRMZM2G174797 and GRMZM2G476902, located in qASRL4, GRMZM2G174797 was up-regulated with log2(FC) = 1.44, while GRMZM2G476902 was down-regulated [(log2(FC) = 1.44] in the long-root lines. The candidate gene GRMZM2G031528 for qRDW3 is predicted to code for a heavy metal transport/detoxification superfamily protein. It contained one non-synonymous mutation and eight UTR variants. By integrating the results of the GWAS and transcriptomic analyses, we detected five potential genes for the root traits.
Figure 3

Scanned root images of 3 short-root lines and 3 long-root lines.

Table 4

Potential candidate genes identified by GWAS and RNA-Seq.

QTLCandidate Geneslog2(FC) a P SNPAmino Acid PolymorphismAnnotation
qARD3 GRMZM2G138338−1.380.032398Leucine-rich receptor-like protein kinase family protein
qASRL4 GRMZM2G174797−1.440.00017216ELMO/CED-12 family protein
GRMZM2G4769023.740.0007941Armadillo/beta-catenin repeat family protein
qLRL2/ qRA2-1/ qRL005_2/ qTRL2 GRMZM2G397965−2.780.008283Vignain precursor/ (SAG12) senescence-associated gene 12
qRDW3 GRMZM2G031528−2.480.00017121Heavy metal transport/detoxification superfamily protein

a Log2 fold change between short-root lines and long-root lines.

Figure 4

Association analysis of pleiotropic SNP_2_104416607 with LRL, RA, RL005, and TRL. (A) Manhattan plots of association analysis, (B) Candidate genes around SNP_2_104416607, (C) The box shows phenotypic differences between lines carrying different alleles of SNP_2_104416607, the difference of mean (Δm), and the P-value are given.

4. Discussion

Roots anchor plants in the soil and absorb nutrients and water from soil environments. However, phenotyping root traits is difficult, especially for a large plant, like maize. Several methods for root phenotyping in the laboratory have been developed; hydroponics and paper roll culture are the most commonly used methods for root research because they allow rapid, accurate, and high-throughput screening of root characteristics in a large number of genotypes at the seedling stage. Almost half of all root genetics research in maize, barley, and wheat used this approach [6,38]. Here, we grew 297 inbred maize lines under standardized conditions in paper rolls to get accurate phenotypes at the seedling stage, including root length traits (SRL, TRL, and LRL) that are important for nutrient uptake efficiency and grain yield [7]. Understanding the natural variation of root traits and identifying SNPs associated with the variation is a prerequisite for the improvement of maize roots by molecular breeding technology. Abundant phenotypic diversity is essential for the success of genetic studies. In this study, root traits showed 2.00–21.36-fold differences in 297 inbred lines (Table 2), which is similar to that of other natural populations [25,39]. In previous studies, more extensive phenotypic variation was observed. In a panel of 30 maize inbred lines, the seminal root number ranged from 0 to 10 [40]. Root traits in maize landraces and teosinte were highly variable in the number of nodal and seminal roots, making them a valuable resource for identifying natural variations to improve root traits in maize [41]. Currently, the only cloned root QTLs in crop species were found in rice, namely genes DRO1 and PSTOL1. DRO1 enhances root gravitropism to increase the root growth angle, which maintained high yield performance under drought conditions; the functional variation of DRO1 was found in a drought-tolerant cultivar ‘Kinandang Patong’ from the Philippines [16]. PSTOL1 acts as an enhancer of early root growth and promotes root growth under both high and low phosphorus conditions; the functional variation was found in the low phosphorus-tolerant aus-type ‘Kasalath’ [17]. These two genes were identified in landrace germplasm rather than elite breeding lines [42]. During domestication, maize and soybean genes associated with root system architecture were both under indirect selection [43,44], suggesting that both the inbred lines and the wild ancestor retained a lot of variation for root traits. In this study, maize inbred lines with extensive phenotypic variation allowed the discovery of significant SNPs underlying root traits. GWAS was a powerful tool to detect natural variations for complex traits, but complex population structure and ancestral relatedness in natural population reduced the mapping power. Even though several statistical methods have also been developed to increase statistical power, there are still many restrictions [45]. For example, there were a large number of false positives for GLM, but MLM can lead to false negatives by overcompensating for population structure and kinship. Only four SNPs were detected for 22 root traits by MLM using 135,311 SNP markers in 384 inbred lines [25]. Many studies have used both approaches as well as suggestive and significant association level to balance the false positives and false negatives in root studies [23,46]. Here, 21 significant and 13 suggestive QTLs were detected by integrating the results of GLM and MLM. GWAS has been used successfully to identify thousands of root-associated loci in Arabidopsis, rice, wheat, barley, and maize [21,22,23,24,25], and some important genes and variations have been detected. However, these significant loci explained only a few percent of the phenotypic diversity. For example in barely, the detected root QTLs cumulatively explained 12.1%–48.1% of the phenotypic variance [46]. In this study, we detected from two to four QTLs for each trait that jointly explained less than 40% of the total phenotypic variation (Table 3), so a large proportion of the heritability was missed. We propose two possible reasons for this. One, root traits are controlled by numerous small-effect QTLs, which is why the phenotypic variation explained by each QTL ranged from 5.7% to 15.9% in our study. Even in bi-parental populations most root QTLs explained less than 10% of the phenotypic variation [7,47]. The other reason may be that the presence of rare variants reduced the statistical power of GWAS. Multi-parent populations such as NAM and MAGIC may make powerful tools for unraveling the molecular basis of maize root development. Here, a total of 34 QTLs were detected for 12 root traits and SPAD value, and 96 candidate genes within 100 kb of these loci (50-kb upstream and downstream of the lead SNP) were identified. The number of candidate genes in each locus range from 1 to 10, with an average of 3.65 (Supplementary Dataset 2). However, gene annotation in the maize genome is largely incomplete, and only about 1% of the genes have functional annotations based on mutant analyses; therefore, it is difficult to identify causal genes for a phenotype [48]. Whole genome gene expression is an easily measurable source of functional information. Several previous studies have used GWAS and RNA-seq data to detect causal genes for complex traits, such as root traits in rice [24], senescence, elemental concentrations, and oil biosynthesis in maize [20,48,49], and disease resistance in soybean [50]. We selected three long-root lines and three short-root lines for RNA-seq. Within GWAS-identified loci five genes showed differential expression between the long- and short-root lines. The functions of these genes need to be confirmed by mutant analyses or genetic transformation.

5. Conclusions

In conclusion, we identified 34 QTLs associated with seedling root traits in a panel of 297 maize inbred lines by genome-wide association mapping. The expression levels of the genes located in GWAS-identified loci were assessed among long- and short-root lines by RNA-seq analysis. A total of five DEGs with potential causing variants were identified as strong candidate genes underlying seedling root development in maize. These genes can be further studied to help understand the genetic basis of root development and improve the root architecture.
  45 in total

Review 1.  Molecular and Environmental Regulation of Root Development.

Authors:  Hans Motte; Steffen Vanneste; Tom Beeckman
Journal:  Annu Rev Plant Biol       Date:  2019-03-01       Impact factor: 26.379

Review 2.  Genetic Control of Root System Development in Maize.

Authors:  Frank Hochholdinger; Peng Yu; Caroline Marcon
Journal:  Trends Plant Sci       Date:  2017-11-20       Impact factor: 18.313

3.  Genome-wide association study using cellular traits identifies a new regulator of root development in Arabidopsis.

Authors:  Mónica Meijón; Santosh B Satbhai; Takashi Tsuchimatsu; Wolfgang Busch
Journal:  Nat Genet       Date:  2013-11-10       Impact factor: 38.330

4.  Genome-wide association study dissects the genetic architecture of oil biosynthesis in maize kernels.

Authors:  Hui Li; Zhiyu Peng; Xiaohong Yang; Weidong Wang; Junjie Fu; Jianhua Wang; Yingjia Han; Yuchao Chai; Tingting Guo; Ning Yang; Jie Liu; Marilyn L Warburton; Yanbing Cheng; Xiaomin Hao; Pan Zhang; Jinyang Zhao; Yunjun Liu; Guoying Wang; Jiansheng Li; Jianbing Yan
Journal:  Nat Genet       Date:  2012-12-16       Impact factor: 38.330

5.  Genetic control of the root system in rice under normal and drought stress conditions by genome-wide association study.

Authors:  Xiaokai Li; Zilong Guo; Yan Lv; Xiang Cen; Xipeng Ding; Hua Wu; Xianghua Li; Jianping Huang; Lizhong Xiong
Journal:  PLoS Genet       Date:  2017-07-07       Impact factor: 5.917

6.  Genome-Wide Association Mapping of Starch Pasting Properties in Maize Using Single-Locus and Multi-Locus Models.

Authors:  Yang Xu; Tiantian Yang; Yao Zhou; Shuangyi Yin; Pengcheng Li; Jun Liu; Shuhui Xu; Zefeng Yang; Chenwu Xu
Journal:  Front Plant Sci       Date:  2018-09-05       Impact factor: 5.753

Review 7.  Molecular Mechanisms of Root Development in Rice.

Authors:  Funing Meng; Dan Xiang; Jianshu Zhu; Yong Li; Chuanzao Mao
Journal:  Rice (N Y)       Date:  2019-01-10       Impact factor: 4.783

8.  Fast and accurate short read alignment with Burrows-Wheeler transform.

Authors:  Heng Li; Richard Durbin
Journal:  Bioinformatics       Date:  2009-05-18       Impact factor: 6.937

9.  Two major quantitative trait loci controlling the number of seminal roots in maize co-map with the root developmental genes rtcs and rum1.

Authors:  Silvio Salvi; Silvia Giuliani; Claudia Ricciolini; Nicola Carraro; Marco Maccaferri; Thomas Presterl; Milena Ouzunova; Roberto Tuberosa
Journal:  J Exp Bot       Date:  2016-02       Impact factor: 6.992

Review 10.  Uncovering the hidden half of plants using new advances in root phenotyping.

Authors:  Jonathan A Atkinson; Michael P Pound; Malcolm J Bennett; Darren M Wells
Journal:  Curr Opin Biotechnol       Date:  2018-07-19       Impact factor: 9.740

View more
  6 in total

1.  Genome-Wide Association Studies of Maize Seedling Root Traits under Different Nitrogen Levels.

Authors:  Yafang Fu; Jianchao Liu; Zhenqing Xia; Qi Wang; Shibo Zhang; Guixin Zhang; Haidong Lu
Journal:  Plants (Basel)       Date:  2022-05-26

2.  A novel motif in the 5'-UTR of an orphan gene 'Big Root Biomass' modulates root biomass in sesame.

Authors:  Komivi Dossa; Rong Zhou; Donghua Li; Aili Liu; Lu Qin; Marie A Mmadi; Ruqi Su; Yujuan Zhang; Jianqiang Wang; Yuan Gao; Xiurong Zhang; Jun You
Journal:  Plant Biotechnol J       Date:  2021-02-01       Impact factor: 9.803

3.  GWAS and RNA-seq analysis uncover candidate genes associated with alkaline stress tolerance in maize (Zea mays L.) seedlings.

Authors:  Chunxiang Li; Yue Jia; Runyu Zhou; Liwei Liu; Mengna Cao; Yu Zhou; Zhenhua Wang; Hong Di
Journal:  Front Plant Sci       Date:  2022-07-18       Impact factor: 6.627

4.  GWAS and Transcriptome Analysis Reveal Key Genes Affecting Root Growth under Low Nitrogen Supply in Maize.

Authors:  Yunyun Wang; Tianze Zhu; Jiyuan Yang; Houmiao Wang; Weidong Ji; Yang Xu; Zefeng Yang; Chenwu Xu; Pengcheng Li
Journal:  Genes (Basel)       Date:  2022-09-11       Impact factor: 4.141

5.  Association mapping uncovers maize ZmbZIP107 regulating root system architecture and lead absorption under lead stress.

Authors:  Fengxia Hou; Kai Liu; Na Zhang; Chaoying Zou; Guangsheng Yuan; Shibin Gao; Minyan Zhang; Guangtang Pan; Langlang Ma; Yaou Shen
Journal:  Front Plant Sci       Date:  2022-09-26       Impact factor: 6.627

6.  Phenotypic Diversity and Association Mapping of Ascorbic Acid Content in Spinach.

Authors:  Dario Rueda; Henry O Awika; Renesh Bedre; Devi R Kandel; Kranthi K Mandadi; Kevin Crosby; Carlos A Avila
Journal:  Front Genet       Date:  2022-01-03       Impact factor: 4.599

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.