Literature DB >> 32569428

Dissecting the phenotypic components and genetic architecture of maize stem vascular bundles using high-throughput phenotypic analysis.

Ying Zhang1, Jinglu Wang1, Jianjun Du1, Yanxin Zhao2, Xianju Lu1, Weiliang Wen1, Shenghao Gu1, Jiangchuan Fan1, Chuanyu Wang1, Sheng Wu1, Yongjian Wang1, Shengjin Liao1, Chunjiang Zhao1, Xinyu Guo1.   

Abstract

High-throughput phenotyping is increasingly becoming an important tool for rapid advancement of genetic gain in breeding programmes. Manual phenotyping of vascular bundles is tedious and time-consuming, which lags behind the rapid development of functional genomics in maize. More robust and automated techniques of phenotyping vascular bundles traits at high-throughput are urgently needed for large crop populations. In this study, we developed a standard process for stem micro-CT data acquisition and an automatic CT image process pipeline to obtain vascular bundle traits of stems including geometry-related, morphology-related and distribution-related traits. Next, we analysed the phenotypic variation of stem vascular bundles between natural population subgroup (480 inbred lines) based on 48 comprehensively phenotypic information. Also, the first database for stem micro-phenotypes, MaizeSPD, was established, storing 554 pieces of basic information of maize inbred lines, 523 pieces of experimental information, 1008 pieces of CT scanning images and processed images, and 24 192 pieces of phenotypic data. Combined with genome-wide association studies (GWASs), a total of 1562 significant single nucleotide polymorphism (SNPs) were identified for 30 stem micro-phenotypic traits, and 84 unique genes of 20 traits such as VBNum, VBAvArea and PZVBDensity were detected. Candidate genes identified by GWAS mainly encode enzymes involved in cell wall metabolism, transcription factors, protein kinase and protein related to plant signal transduction and stress response. The results presented here will advance our knowledge about phenotypic trait components of stem vascular bundles and provide useful information for understanding the genetic controls of vascular bundle formation and development.
© 2020 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.

Entities:  

Keywords:  MaizeSPD; genome-wide association studies; maize stem; micro-CT; micro-phenotype; vascular bundle

Mesh:

Year:  2020        PMID: 32569428      PMCID: PMC7769239          DOI: 10.1111/pbi.13437

Source DB:  PubMed          Journal:  Plant Biotechnol J        ISSN: 1467-7644            Impact factor:   9.803


Introduction

Maize (Zea mays ssp. mays) was domesticated from its wild ancestor, teosinte (Zea mays ssp. parviglumis) nearly 6000 to 10 000 years ago in southwestern Mexico (Doebley, 1990, 2004). The widely cultivation and strong adaptation allow maize to be the largest productive crop in the worldwide, which is important in the satisfying global food demand and safeguarding world food security (Liu et al., 2015; USDA FAS, 2013). The yield of maize closely relies on the source–sink relationship, and the ideal state of that is plenty in source, rich in sink and efficient in flow. In recent year, with the genetic improvement of source–sink traits and great progress in cultivation measures, more and more attentions have been paid in the role of ‘flow’ in yield formation (Wang et al., 2011). Vascular bundle, the important ‘flow’ which links source and sink, is responsible for both the delivery of water, essential mineral nutrients, sugars and amino acids among organs and decides the transportation efficiency of photosynthetic products, water and essential mineral nutrients (Housley and Peterson, 1982). As an effective long‐distance transport system of source–translocation–sink, correlations between vascular bundle performance and grain yield have been reported in crops (Chen et al., 2004; Cui et al., 2003; Nátrová, 1991; Peterson et al., 1982). Studies have shown the number, size and role of vascular bundle directly affected the transporting efficiency of assimilates from the source to kernels and eventually as important limiting causes effecting crop yield (Housley and Peterson, 1982; Huang et al., 2016; Nátrová, 1985; Zhai et al., 2018). Vascular bundle traits are so important that it is critical to make unremitting efforts to explore genetic mechanism. In Arabidopsis, many genes took part in vascular bundle patterning have been identified, such as PHB, PHV, AtHB15 and REV (Du and Wang, 2015; McConnell et al., 2001; Zhong and Ye, 2004). In rice, many quantitative trait loci (QTLs) for vascular bundle have been identified (Bai et al., 2012; Fei et al., 2019; Zhai et al., 2018), and candidate genes such as APO1, ABV, DEP1 and NAL1 have been reported (Fei et al., 2019; Fujita et al., 2013; Qi et al., 2008; Terao et al., 2010). In maize, Huang et al. (2016) identified quantitative trait loci (QTL) for the number of vascular bundles in the uppermost internode of maize stem using a large maize‐teosinte BC2S3 RIL population. That study provided important insights for the genetic architecture of vascular bundle number in maize stem. However, the genetic researches of vascular bundle in maize stem received much less attention compared with research progress in rice and Arabidopsis. In recent years, with the rapid development of high‐density single nucleotide polymorphism (SNP) genotyping and the next‐generation sequencing (NGS) technologies, genome‐wide association study (GWAS) has become a powerful tool to dissect the genetic basis for the quantitative variation of complex traits in crops (Chen et al., 2019; Xiao et al., 2017). For maize, since the release of the B73 reference genome (Schnable et al., 2009), many agronomic important traits, such as plant height (Dell'Acqua et al., 2015; Farfan et al., 2015; Li et al., 2016a,b; Peiffer et al., 2014; Riedelsheimer et al., 2012; Wang et al., 2019; Weng et al., 2011; Yang et al., 2014a,b), flowering time (Buckler et al., 2009; Farfan et al., 2015; Hung et al., 2012; Li et al., 2016a,b; Van Inghelandt et al., 2012; Yang et al., 2013, 2014a,b), ear height (Dell'Acqua et al., 2015; Farfan et al., 2015; Li et al., 2016a,b; Peiffer et al., 2014; Yang et al., 2014a,b) and grain size (Dell'Acqua et al., 2015; Yang et al., 2014a,b), have been dissected through GWASs. GWAS application in agricultural traits of maize provides useful reference for revealing the phenotypic traits diversity and genetic architecture of vascular bundles in maize stem. However, because of the need for accurate identification of microscopic phenotypes for large amounts of maize and a lack of high‐throughput and effective micro‐phenotyping detection methods, few GWASs for ‘flow’ traits have been performed in large population of maize inbred lines. With the rapid development of functional genomics and molecular breeding, the ability to quickly screen thousands of lines for targeted phenotypic traits is becoming more and more important (Fiorani and Schurr, 2013). Manually counting vascular bundle number was a strenuous and tedious work, and errors in the measurement were unavoidable. What is more, many anatomical traits of vascular bundles could not be detected by manual test, which seriously affected develops in maize genomics for these important characters of vascular bundles. To bridge this gap, progress in the development of high‐throughput phenotyping technology is required (Yang et al., 2013a,b). In recent years, several tools for automated detecting phenotypes of stem have become available (Du et al., 2016; Heckwolf et al., 2015; Legland et al., 2014; Zhang et al., 2013). However, much more robust and accurate identification methods are urgently needed that are suitable for large populations of maize. In this study, given the rich genetic variation in maize natural population (Yang et al., 2011), we developed a standard process for stem micro‐CT data acquisition and automatic CT image process pipeline to extract micro‐phenotypic traits. The stems of maize natural population panel containing 480 inbred lines were phenotyped at the silking stage, and 48 traits were automatically extracted by CT image processing pipeline at one time. Based on representative phenotypic data, the phenotypic properties of stem vascular bundles of maize diverse natural population were analysed. And we established the first database for stem micro‐phenotypes, MaizeSPD, which stored 554 pieces of basic information for maize inbred lines, 523 pieces of experimental information, 1008 pieces of CT scanning images and processed images, and 24 192 pieces of micro‐phenotypic data. Finally, GWASs were conducted to reveal the natural genetic variation and to dissect the genetic architecture of vascular bundles; a total of 1562 significant SNPs were identified for 30 stem micro‐phenotypic traits, and 84 unique genes of 20 traits such as VBNum, VBAvArea, IZVBNum and PZVBDensity were detected. The results presented here will advance our knowledge about phenotypic trait components of stem vascular bundles and will provide useful information for understanding the genetic controls of vascular bundle formation and development.

Results

Extraction of vascular bundle phenotypes

How to quantify the traits of maize stem and vascular bundles is a challenge, also an opportunity of standardize the protocol of data gaining and image processing from micro‐phenotyping of maize stem. Based on the morphological characteristics of stem vascular bundles and previous research (Du et al., 2016), we developed automated image analysis pipeline that suitable for large‐population CT images to extract the micro‐phenotypic traits of maize stems automatically. All image processing and analysis steps were conducted in Visual studio C++ and OpenCV. The function modules of VesselParser 4.0 software (NERCITA, Beijing, China) included data management module, method parameter module, phenotyping calculation module and statistic analysis module (Figure 1 and Figure S1), and the flow chart outlines of the image analysis pipeline are summarized in Figure 1.
Figure 1

Function modules of VesselParser 4.0 software, including data management module, method parameter module, phenotyping computation module and statistic analysis module. The phenotyping computation module is further demonstrated a flowchart outlines of the image analysis pipeline.

Function modules of VesselParser 4.0 software, including data management module, method parameter module, phenotyping computation module and statistic analysis module. The phenotyping computation module is further demonstrated a flowchart outlines of the image analysis pipeline. During the maize growth and development, the substance accumulation of maize stem shows as the gradually increasing average HU values (i.e. intensity values) and changing distribution (i.e. connectivity relationship). According to the observable intensity and distribution differences between stem substances, the entire stem slice could be reasonably divided into three functional zones, that is epidermis zone, periphery zone and inner zone, corresponding to the anatomy of stem namely epidermis, periderm and pith. In our knowledge, the zonation of stem tissue is difficult to be accurately measured by manual work owing to the boundary ambiguity, and there are no more automated segmentation methods. Once the functional zones were segmented, vascular bundles in each zone were extracted and phenotypic traits of that were calculated. Using VesselParser 4.0 software, 48‐item phenotypic traits were automatically extracted and calculated at one time, including 18‐nondimensional morphological and 30‐dimensional geometrical features, and most of which were difficult to measure and mark through manual work. The list and abbreviations of these 48 traits are shown in Table S1. The average calculation time of one image is about 30 s, and large quantities of images can be conducted batch processing.

Phenotypic variations of vascular bundles between natural population subgroup

Based on the phenotypic data gained by VesselParser 4.0 pipeline, we further analysed the variation in vascular bundle traits of the third internode of maize stem between natural population subgroup. Wide phenotypic variations in vascular bundle size, morphology, number, distribution density and other characteristics in cross section and functional zones (epidermal/periphery/inner zones) were observed in NP population (Figure 2), and 48 phenotypic parameters in NP population are listed in Table S2. The average area of circularity of inner zone (IZCir) ranging from 0.01863 ‐to 0.1814 with an average of 0.05945 had the highest maximum change of 9.74‐fold, followed by average area of inner zone vascular bundles (IZVBAvArea, 7.30‐fold), number of inner zone vascular bundles (IZVBNum, 6.33‐fold), area of periphery zone (PZArea, 6.25‐fold) and area of inner zone (IZArea, 6.20‐fold). The rectangularity of stem cross section (CSRect), rectangularity of epidermis zone (EZRect) and rectangularity of periphery zone (PZRect) had the lowest change of 1.03‐fold. Moreover, the frequency distribution of 48‐item phenotypic traits in NP population showed a continuous variation (Figure S2), which suggested that vascular bundle phenotypes were the typical quantitative traits controlled by polygenes.
Figure 2

CT scanning images of stem cross section from different inbred lines and corresponding processing images by VesselParser 4.0 pipeline. The first row: source image. The second row: the segmentation results of functional zones, the boundaries of the epidermis (blue), periphery (green) and inner zones (red) were labelled different colours. The third row: the segmentation results of vascular bundles, according to the vascular bundle area, k‐means clustering was performed and each class (total five classes) was marked with different colours.

CT scanning images of stem cross section from different inbred lines and corresponding processing images by VesselParser 4.0 pipeline. The first row: source image. The second row: the segmentation results of functional zones, the boundaries of the epidermis (blue), periphery (green) and inner zones (red) were labelled different colours. The third row: the segmentation results of vascular bundles, according to the vascular bundle area, k‐means clustering was performed and each class (total five classes) was marked with different colours. Based on phenotypic data, clustering analysis of 48 phenotypic traits was conducted with hierarchical clustering using the Pearson correlation as a distance metric. The 48 accessions were clustered into four major groups, and the resulting dendrogram is shown in Figures S3 and S4. Geometric and morphological characters of stem cross section and functional zones, and quantitative character of vascular bundles were gathered into group I, including 25 phenotypic parameters. Group II was consisted of seven phenotypic parameters, which were related to vascular bundle area characteristics. Distribution properties of vascular bundle were classified into group III, containing three phenotypic parameters. Group IV was composed of the remaining 12 phenotypic parameters, representing geometric and morphological characters of stem cross section and functional zones. Cluster analysis results above provided the basis for selecting the most representative and biologically intuitive traits from one cluster for GWAS and follow‐up analysis. The heritability (H 2) of a trait is one of the key parameters used for making decisions concerning the design and selection of plant breeding schemes (Chen and Lübberstedt, 2010; Holland et al., 2003). Next, heritability was calculated for each trait of four categories. Forty‐eight phenotypic parameters showed different heritability patterns, ranging from 0.128 to 0.836 (Figure 3). About 77% (37 items parameters) had heritability >0.5 and more than 60% (30 items) of the parameters had heritability >0.7, indicating that variability in stem micro‐traits is governed in a large part by genetic factors. The phenotypic traits from groups 1, 2 and 3 showed high heritability, whereas phenotypic traits from group 4 had low heritability, which might be due to the lower genetic variation of these traits. So far, according to the clustering analysis and heritability values, we selected 30 phenotypic indicators with heritability higher than 0.7, allowing for the natural variation analysis of phenotypic traits in subpopulation and identification of SNPs controlling their expression in revealing plant genetic regulation mechanisms.
Figure 3

The broad‐sense heritability (H 2) of the investigated 48 phenotypic traits.

The broad‐sense heritability (H 2) of the investigated 48 phenotypic traits. Based on the 30 phenotypic indicators with heritability higher than 0.7, an ANOVA was used to discover whether differences exist between the different subpopulations of NP population (TST, NSS, SS and mixed). We found that in addition to the traditional index of vascular bundle number, the phenotypic indexes with the most significant differences (P ≤ 0.001) between subpopulations also included VBAvArea, IZVBNum, IZVBAvArea, PZVBAvArea and PZVBDensity (as shown as Figure 4). The number of stem vascular bundles (VBNum) was much higher for TST than for NSS, SS and mixed, but average area of stem vascular bundles (VBAvArea) was much lower for TST than for NSS, SS and mixed. The phenotypic differences of vascular bundles in inner zone were as the same as that in the stem cross section. For the periphery zone, there was no significant difference in vascular bundle number, but the differences in the average area of vascular bundles and vascular bundle density in that zone were extremely significant. For example, vascular bundle density in periphery zone (PZVBDensity) was much higher for TST than for NSS, SS and mixed. These results reflected the morphological structure and distribution characteristics of vascular bundles among different genotypes; for example, the number of stem vascular bundles tends to be much more and higher vascular bundle density, but the area of vascular bundles is smaller in tropical and subtropical regions, presenting a more intensive distribution pattern in vascular bundles. The trait variation between subpopulations for the remaining 24 indicators is shown in Figure S5.
Figure 4

The trait variation between subpopulations (TST, NSS, SS and mixed) for the VBNum, VBAvArea, IZVBNum, IZVBAvArea, PZVBAvArea and PZVBDensity. The 1–3 figures in the first row represent VBNum, IZVBNum and PZVBDensity, respectively. The 1–3 figures in the second row represent VBAvArea, PZVBAvArea and IZVBAvArea, respectively. ** denotes significant differences between subpopulations at P ≤ 0.01 probability level, and * denotes significant differences between subpopulations at P ≤ 0.05.

The trait variation between subpopulations (TST, NSS, SS and mixed) for the VBNum, VBAvArea, IZVBNum, IZVBAvArea, PZVBAvArea and PZVBDensity. The 1–3 figures in the first row represent VBNum, IZVBNum and PZVBDensity, respectively. The 1–3 figures in the second row represent VBAvArea, PZVBAvArea and IZVBAvArea, respectively. ** denotes significant differences between subpopulations at P ≤ 0.01 probability level, and * denotes significant differences between subpopulations at P ≤ 0.05.

MaizeSPD—phenotype database of stem vascular bundles for NP Population

Based on the micro‐phenotypic data of stem from NP population, the first database for stem micro‐phenotypes, MaizeSPD, was established. All data from MaizeSPD were stored and managed in a MySQL relational database, and a user‐friendly web interface was developed to help users search and use the data. MaizeSPD consists of seven data tables: namely information table of maize inbred lines, experimental information table, index table of experimental results, link table of stem CT scanning images, link table of stem images processed by VesselParser, micro‐phenotypic indicators list of stem vascular bundles and micro‐phenotypic data table of NP population inbred lines. Currently, MaizeSPD has stored 554 pieces of basic information for maize inbred lines, 523 pieces of experimental information, 1008 pieces of CT scanning images and processed images, and 24 192 pieces of stem micro‐phenotypic data classified as 48 categories. MaizeSPD is the first database for stem micro‐phenotypic information, which lays a foundation for the storage, sharing and improvement of microscopic phenotype data (Video S1).

Significant SNPs obtained by genome‐wide association study

In this study, multi‐locus random‐SNP‐effect mixed linear models in R package ‘mrMLM’ (version 4.0) was used to carry out genome‐wide association analysis on 30 stem and vascular bundle phenotypic traits. Finally, a total of 1562 significant associated SNPs (P‐value < 6.4e‐7) were identified for target traits. Because these results were a collection of six GWAS methods, the top one most significant SNPs obtained by each method and the SNPs validated by two or more methods were considered as highly significant results. And the reliability of these results could be higher than that of the others. Consequently, 292 highly significant associated SNPs were filtered for all 30 key traits. The detailed statistical results of highly significant SNPs for each trait are shown in Table 1.
Table 1

Summary of significant loci from genome‐wide association study

TraitNo. of unique SNPsNo. of unique annotated genesNo. of significant SNPs listed top 1 and validated by multiple methodsNo. of unique annotated genes listed top 1 and validated by multiple methodsNo. of genes only related to specific traitNo. of genes only related to specific trait listed top 1 and validated by multiple methods
CSArea721311020226
CSCirR5091132500
CSInsCirR72123122300
CSLen81147132200
CSWid71134173200
EZArea691219165714
EZCirR5091132500
EZInsCirR72123122300
EZLALen81147132200
EZSALen71134173200
IZArea821531325406
IZCirR731341018298
IZInsCirR641181733456
IZLALen671221018317
IZSALen6712414262210
IZVBAvArea386658428
IZVBDensity37699163113
IZVBNum8213610198919
IZVBVoAvArea34607142714
PZCirR55991019132
PZInsCirR731291324204
PZLALen691231117235
PZSALen891541121202
PZVBAvArea38662774140399134
PZVBDensity55936116211
PZVBNum7113010198918
VBAvArea35056962111335100
VBDensity377012234216
VBNum651247137711
VBVoAvArea4876813392
Summary156223482925221554416
Summary of significant loci from genome‐wide association study

Candidate genes co‐localized with associated SNPs

All candidate genes were annotated according to the latest maize B73 reference genome (B73 RefGen_v4) available in EnsemblPlants and NCBI Gene database. In total, 2348 unique candidate genes were annotated by 1562 significant associated SNPs. The number of single‐trait‐related candidate genes annotated by SNPs listed in multiple methods validated results was 416. For the 416 single‐trait‐related candidate genes, the NCBI Gene database was used for further annotation, and 294 genes with more detailed functional annotation were obtained (Table S3). Additionally, 84 genes listed both in top one of each method's results and multiple methods validated results were identified (marked as ‘Y’ in Table 2). Since these genes were not only annotated by the top one SNPs of each method, but also validated by multiple methods, the reliability of these genes was considered to be higher than that of other genes. Among them, the numbers of unique loci associated with each trait were 5 (VBNum), 2 (CSArea), 7 (EZArea), 2 (IZArea), 4 (IZCirR), 2(IZInsCirR), 3 (IZLALen), 5 (IZSALen), 3 (IZVBAvArea), 4 (IZVBDensity), 7 (IZVBNum), 4 (IZVBVoAvArea), 2 (PZCirR), 1 (PZInsCirR), 3 (PZLALen), 4 (PZVBAvArea), 8 (PZVBDensity), 9 (PZVBNum), 5 (VBAvArea) and 4 (VBDensity). Remarkably, we found a set of genes distributing on 2, 3, 4, 9, 10 chromosomes associated with vascular bundle numbers traits, which involved in the plant signal transduction and stress response; candidate gene distributing on 4 chromosomes associated with vascular bundle area trait, which involved in the gibberellin biosynthesis; several candidate genes distributing on 4, 8 chromosomes associated with vascular bundle distribution density traits, which involved reproductive processes and embryogenesis; and a set of candidate genes distributing on 3, 8 chromosomes associated with epidermis area traits, which encoded enzymes involved in cell wall metabolism.
Table 2

Gene annotation of genome‐wide association study significant loci

TraitsGeneDescriptionChromosomePositionAlleles * SNP MAFMultimethods Genes listed top 1
CSAreaGRMZM2G002002Nucleotidyltransferase120699924Gchr1.S_206999240.07871Y
CSAreaGRMZM2G134367Nodulin‐related protein 1120699924Gchr1.S_206999240.07871Y
EZAreaZEAMMB73_Zm00001d008913Uncharacterized LOC100382589825128942Gchr8.S_251289420.45411Y
EZAreaGRMZM5G812425Phospholipase A1 EG1, chloroplastic/mitochondrial8168955849Cchr8.S_1689558490.41082Y
EZAreaGRMZM2G381473Uncharacterized LOC100279339357096327Cchr3.S_570963270.43962Y
EZAreaGRMZM2G033644Dihydrolipoyllysine‐residue acetyltransferase component of pyruvate dehydrogenase complex357096327Cchr3.S_570963270.43962Y
EZAreaGRMZM2G142334Uncharacterized LOC1003831253227046350Achr3.S_2270463500.07121Y
EZAreaGRMZM2G034943Uncharacterized LOC1002763423227046350Achr3.S_2270463500.07121Y
EZAreaGRMZM2G070343Putative thioredoxin superfamily protein4150570610Tchr4.S_1505706100.1681Y
IZAreaGRMZM2G133475Uncharacterized LOC5416742223589318Cchr2.S_2235893180.08051Y
IZAreaGRMZM2G450717Peroxidase 522223589318Cchr2.S_2235893180.08051Y
IZCirRGRMZM2G400965Uncharacterized LOC10364584625425011Achr2.S_54250110.26511Y
IZCirRGRMZM2G474039Uncharacterized LOC10027843625425011Achr2.S_54250110.26511Y
IZCirRGRMZM2G119186Uncharacterized LOC10027538152582887Gchr5.S_25828870.18871Y
IZCirRGRMZM5G882758Pentatricopeptide repeat‐containing protein At5g08510925254192Cchr9.S_252541920.06461Y
IZInsCirRGRMZM2G021973LOC103626187‐like pseudogene531333610Tchr5.S_313336100.04221Y
IZInsCirRGRMZM2G390804Uncharacterized LOC103626185531333610Tchr5.S_313336100.04221Y
IZLALenGRMZM2G145935, GRMZM2G415791FIP1[V]‐like protein510079109Tchr5.S_100791090.09761Y
IZLALenGRMZM2G084891, GRMZM5G895064DNA‐directed RNA polymerase II subunit RPB2517275214Gchr5.S_172752140.28891Y
IZLALenGRMZM2G126447Ubiquitin carboxyl‐terminal hydrolase 155147983323Tchr5.S_1479833230.15621Y
IZSALenGRMZM2G098577Uncharacterized LOC10038203910129270447Tchr10.S_1292704470.17851Y
IZSALenZEAMMB73_Zm00001d041038Uncharacterized LOC103650398392384170Cchr3.S_923841700.07391Y
IZSALenZEAMMB73_Zm00001d046782DNA polymerase eta9105824379Gchr9.S_1058243790.43181Y
IZSALenGRMZM2G114680Protein MONOCULM 19105824379Gchr9.S_1058243790.43181Y
IZSALenGRMZM5G865319Protein Z417762350Gchr4.S_177623500.10241Y
IZVBAvAreaGRMZM2G167283Uncharacterized LOC1002731243201232009Gchr3.S_2012320090.20082Y
IZVBAvAreaGRMZM5G854666Uncharacterized LOC103652335976536683Achr9.S_765366830.22971Y
IZVBAvAreaGRMZM2G065694EREBP‐4 like protein976536683Achr9.S_765366830.22971Y
IZVBDensityGRMZM2G107101Uncharacterized LOC100147733821822571Achr8.S_218225710.17321Y
IZVBDensityGRMZM2G087146Uncharacterized LOC100192922821822571Achr8.S_218225710.17321Y
IZVBDensityGRMZM2G350793Probable LRR receptor‐like serine/threonine‐protein kinase At3g475701171535280Gchr1.S_1715352800.09843Y
IZVBDensityGRMZM2G307823, GRMZM2G409627NF‐X1‐type zinc finger protein NFXL19479733Cchr9.S_4797330.21781Y
IZVBNumGRMZM2G339562Uncharacterized LOC100274353523764039Cchr5.S_237640390.06432Y
IZVBNumZEAMMB73_Zm00001d013884Uncharacterized LOC103626113523764039Cchr5.S_237640390.06432Y
IZVBNumGRMZM2G048962Anther‐specific proline‐rich protein APG6122034792Cchr6.S_1220347920.10761Y
IZVBNumGRMZM2G155546Ribonucleoside‐diphosphate reductase small chain6122034792Cchr6.S_1220347920.10761Y
IZVBNumGRMZM2G168214Uncharacterized LOC100382169927177987Tchr9.S_271779870.07261Y
IZVBNumGRMZM2G08473960S ribosomal protein L9927177987Tchr9.S_271779870.07261Y
IZVBNumGRMZM2G322593RHOMBOID‐like protein 9 chloroplastic4230734500Gchr4.S_2307345000.20582Y
IZVBVoAvAreaZEAMMB73_Zm00001d009545Uncharacterized LOC103635332869589710Gchr8.S_695897100.10891Y
IZVBVoAvAreaGRMZM2G086766Tetraspanin‐68135351047Achr8.S_1353510470.12991Y
IZVBVoAvAreaZEAMMB73_Zm00001d013156NADH‐ubiquinone oxidoreductase 23 kDa subunit55548545Gchr5.S_55485450.08532Y
IZVBVoAvAreaGRMZM2G107651Uncharacterized LOC1036404479155480084Achr9.S_1554800840.29791Y
PZCirRGRMZM2G157589DNA methyl transferase 48146889829Cchr8.S_1468898290.37661Y
PZCirRGRMZM2G134738Cytochrome c oxidase subunit 5b‐2 mitochondrial8146889829Cchr8.S_1468898290.37661Y
PZInsCirRGRMZM2G089525Heat stress transcription factor C‐1a3220979555Achr3.S_2209795550.06461Y
PZLALenGRMZM2G180283Rhamnosyl transferase2208377204Tchr2.S_2083772040.11871Y
PZLALenZEAMMB73_Zm00001d032540Uncharacterized LOC1005012221229995904Cchr1.S_2299959040.30181Y
PZSALenGRMZM2G016622Uncharacterized LOC100283085739220405Gchr7.S_392204050.07521Y
PZVBAvAreaGRMZM2G460581Uncharacterized LOC1035047208164858024Tchr8.S_1648580240.37141Y
PZVBAvAreaGRMZM2G100707appr‐1‐p processing enzyme family protein111480465Tchr1.S_114804650.19693Y
PZVBAvAreaZEAMMB73_Zm00001d031730Uncharacterized LOC1003826261200008691Tchr1.S_2000086910.1052Y
PZVBAvAreaGRMZM2G380227LOC109940389‐like pseudogene1200008691Tchr1.S_2000086910.1052Y
PZVBDensityGRMZM2G064870Uncharacterized LOC10027315222620313Achr2.S_26203130.19161Y
PZVBDensityGRMZM2G064949Probable carboxylesterase Os04g066960022620313Achr2.S_26203130.19161Y
PZVBDensityGRMZM2G083016Metacaspase type II8123912995Tchr8.S_1239129950.10422Y
PZVBDensityGRMZM2G066041Uncharacterized LOC1002723848123912995Tchr8.S_1239129950.10422Y
PZVBDensityGRMZM2G030284Uncharacterized LOC1001914508156039997Cchr8.S_1560399970.17851Y
PZVBDensityGRMZM2G339736Aspartic proteinase PCS18174156791Achr8.S_1741567910.09761Y
PZVBDensityGRMZM2G353268Alpha zein46406610Tchr4.S_64066100.31232Y
PZVBDensityGRMZM2G169160Seryl‐tRNA synthetase46406610Tchr4.S_64066100.31232Y
PZVBNumGRMZM2G161913Protein CTR9 homolog216948568Achr2.S_169485680.2522Y
PZVBNumGRMZM2G042756Dehydration‐responsive element‐binding protein 1D216948568Achr2.S_169485680.2522Y
PZVBNumGRMZM2G126194Uncharacterized LOC10027507110110298248Cchr10.S_1102982480.27432Y
PZVBNumGRMZM2G130351Putative DNA primase large subunit10110298248Cchr10.S_1102982480.27432Y
PZVBNumGRMZM2G073041Pre‐mRNA‐splicing factor 1810132560491Gchr10.S_1325604910.15171Y
PZVBNumGRMZM2G330035Uncharacterized LOC10364284010132560491Gchr10.S_1325604910.15171Y
PZVBNumGRMZM2G348238Uncharacterized LOC10752210739819503Gchr3.S_98195030.10241Y
PZVBNumGRMZM2G170120Protein kinase‐like350441322Achr3.S_504413220.34782Y
PZVBNumGRMZM2G083262Eukaryotic translation initiation factor 2 subunit alpha9139836925Cchr9.S_1398369250.31662Y
VBAvAreaGRMZM2G045005Transglutaminase238982881Tchr2.S_389828810.45011Y
VBAvAreaGRMZM2G062488Acetylglucosaminyltransferase/ transferase, transferring glycosyl groups238982881Tchr2.S_389828810.45011Y
VBAvAreaGRMZM2G057416Uncharacterized LOC1002168128158967480Gchr8.S_1589674800.10823Y
VBAvAreaGRMZM2G122503Benzyl alcohol O‐benzoyltransferase46743055Tchr4.S_67430550.09062Y
VBAvAreaGRMZM2G445854Ent‐copalyl diphosphate synthase 246743055Tchr4.S_67430550.09062Y
VBDensityZEAMMB73_Zm00001d034336LOC100274439‐like pseudogene1290113928Cchr1.S_2901139280.18771Y
VBDensityGRMZM2G061695Gibberellin responsive 13223096267Gchr3.S_2230962670.35561Y
VBDensityGRMZM2G126603LIN1 protein3223096267Gchr3.S_2230962670.35561Y
VBDensityZEAMMB73_Zm00001d052115Probable polyribonucleotide nucleotidyltransferase 1, chloroplastic4180199431Cchr4.S_1801994310.10292Y
VBNumGRMZM5G837841Uncharacterized LOC100279337629286497Achr6.S_292864970.49482Y
VBNumGRMZM2G022686myb‐related protein Myb49138754628Cchr9.S_1387546280.34381Y
VBNumGRMZM2G038108Activator of 90 kDa heat shock protein ATPase467144165Tchr4.S_671441650.14832Y
VBNumZEAMMB73_Zm00001d051364Peptidyl‐prolyl cis‐trans isomerase CYP20‐3, chloroplastic4156023882Gchr4.S_1560238820.20581Y
VBNumGRMZM2G047128Uncharacterized LOC1002774384156023882Gchr4.S_1560238820.20581Y

MAF, minor allele frequency.

The allele represents the favourable allele.

Leading SNP of each significant locus associated with each trait.

Validated by how many methods.

Gene annotation of genome‐wide association study significant loci MAF, minor allele frequency. The allele represents the favourable allele. Leading SNP of each significant locus associated with each trait. Validated by how many methods.

Pathways enriched by functional enrichment analysis

Functional enrichment analysis was completed to further explore the function of the genes associated with seven dry matter traits. After uploading the candidate gene IDs of all 30 phenotypic traits to PlantRegMap and KOBAS 3.0, a total of 172 GO terms and 8 KEGG pathways (P‐value < 0.05) were enriched by these two methods (Table S4 and Figure S6). As mentioned above, 30 traits were clustered into three groups by hierarchical clustering using the Pearson correlation as a distance metric, which as groups I, II and III. And the functional enrichment analysis of candidate genes as each group was conducted, 69 GO BP terms and 4 KEGG pathways (P‐value < 0.05) were enriched for 21 phenotypic traits from group I, 27 GO BP terms and 4 KEGG pathways (P‐value < 0.05) were enriched for five phenotypic traits from group II, and 87 GO BP terms were enriched for three phenotypic traits from group III. For the group I, geometric and morphological characters of stem cross section and functional zones, and quantitative character of vascular bundles, two GO BP terms ‘negative regulation of reproductive process’ (GO:2000242, P‐value = 0.00015), ‘negative regulation of post‐embryonic development’ (GO:0048581, P‐value = 0.00062) that both containing genes GRMZM2G161913 and GRMZM2G348238 were obtained with the highest significance. In addition, the four significant KEGG pathways ‘Metabolic pathways’ (zma01100, P‐value = 0.0170), ‘RNA transport’ (zma03013, P‐value = 0.0442), ‘Fatty acid degradation’ (zma00071, P‐value = 0.0172) and ‘Tyrosine metabolism’ (zma00350, P‐value = 0.0096) were enriched by KOBAS 3.O (Figure S6A,D). For the group II, related to vascular bundle area characteristics, two GO BP terms ‘nucleobase‐containing compound biosynthetic process’ (GO:0034654, P‐value = 0.0159) and ‘aromatic compound biosynthetic process’ (GO:0019438, P‐value = 0.0186) were obtained with the highest significance. In addition, the four significant KEGG pathways ‘Fructose and mannose metabolism’ (zma00051, P‐value = 0.0069), ‘N‐Glycan biosynthesis’ (zma00510, P‐value = 0.0237), ‘Metabolic pathways’ (zma01100, P‐value = 0.0254), ‘Flavone and flavonol biosynthesis’ (zma00944, P‐value = 0.0353) and ‘Protein processing in endoplasmic reticulum’ (zma04141, P‐value = 0.0443) were enriched by KOBAS 3.O (Figure S6B,E). For the group III, distribution properties of vascular bundle, two GO BP terms ‘regulation of biological process’ (GO:0050789, P‐value = 0.0005) and ‘biological regulation’ (GO:0065007, P‐value = 0.00078) that both containing genes GRMZM2G066041, GRMZM2G107101 and GRMZM2G307823 were obtained with a high significance (Figure S6C).

Trait‐gene network visualization

A complex network consisted of 30 phenotypic indicators, and their candidate genes were constructed by Cytoscape v3.7.2 (Figure 5). The trait‐gene network contained 30 large nodes (phenotypic traits) and 522 small round nodes (candidate genes), with 828 edges (the interactions between traits and genes). Group I (21 traits), group II (5 traits) and group III (3 traits) traits were marked as green, blue and purple, respectively. And candidate genes with different colours represented the diversity of interactions. The light grey nodes stood for genes only have correlation with one specific trait, and the red one indicated the multi‐trait shared genes. It was obviously that there were many genes shared between traits within and between groups, especially within Group I traits. There were 104 shared genes shared between traits within and between groups as shown in Figure 5.
Figure 5

The gene‐phenotypic trait network constructed by 30 phenotypic traits and their related genes. Traits and genes are shown in different shapes and sizes. Of the 30 large octagon nodes, the 21 green nodes represent geometric and morphological characters of stem cross section and functional zones, and quantitative character of vascular bundles (CSInsCirR, EZInsCirR, PZInsCirR, IZInsCirR, CSWid, EZSALen, PZSALen, IZSALen, CSArea, EZArea, IZArea, CSLen, EZLALen, PZLALen, IZLALen, CSCirR, EZCirR, PZCirR, IZCirR, VBNum, PZVBNum), the five blue ones represent vascular bundle area characteristics (VBVoAvArea, IZVBVoAvArea, VBAvArea, PZVBAvArea, IZVBAvArea), and the remaining three purple ones represent distribution properties of vascular bundle (VBDensity, IZVBDensity, PZVBDensity). Genes are represented by the small round nodes, and different colours indicate different attributes. The red round nodes represent the overlapped genes of multiple traits; the light grey round nodes stand for genes only have correlation with specific traits.

The gene‐phenotypic trait network constructed by 30 phenotypic traits and their related genes. Traits and genes are shown in different shapes and sizes. Of the 30 large octagon nodes, the 21 green nodes represent geometric and morphological characters of stem cross section and functional zones, and quantitative character of vascular bundles (CSInsCirR, EZInsCirR, PZInsCirR, IZInsCirR, CSWid, EZSALen, PZSALen, IZSALen, CSArea, EZArea, IZArea, CSLen, EZLALen, PZLALen, IZLALen, CSCirR, EZCirR, PZCirR, IZCirR, VBNum, PZVBNum), the five blue ones represent vascular bundle area characteristics (VBVoAvArea, IZVBVoAvArea, VBAvArea, PZVBAvArea, IZVBAvArea), and the remaining three purple ones represent distribution properties of vascular bundle (VBDensity, IZVBDensity, PZVBDensity). Genes are represented by the small round nodes, and different colours indicate different attributes. The red round nodes represent the overlapped genes of multiple traits; the light grey round nodes stand for genes only have correlation with specific traits.

Discussion

The scope of plant phenotyping has expanded from plant population, single plant, to tissue and cell scales. Apart from the visual traits, internal anatomical traits are equally important; however, few advances have been made in high‐performance micro‐phenotyping. X‐ray micro‐CT can perform non‐destructive, non‐invasive, and three‐dimensional visualization and quantification of the internal structure of biological material, with a minimum resolution of 1 µm (Cnudde and Boone, 2013; Landis and Keane, 2010; Zhao et al., 2019). Based on CT images, the image analysis software VesselParser 1.0 developed by du et al. (2016) realized the high‐throughput detection of vascular bundle phenotypic traits of stem for the first time. However, it was difficult to segment and analyse the vascular bundle phenotypic traits of mature stem and basal stem, leaving much room for improvement. By previous research, we developed a standard process for stem micro‐CT data acquisition and automatic CT image process pipeline to extract vascular bundle traits, which provided the possibility for the microscopic phenotype analysis of large‐scale maize accessions. Through VesselParser 4.0 pipeline, contour representations of the slice, functional zones, layers, and vascular bundles provided uniform analysis to output lots of traits, such as geometry‐related, morphology‐related and distribution‐related traits. Compared to other vascular bundles phenotyping methods, VesselParser 4.0 has the following advantages: (i) ‘zone’ defining provided a basis for the segmentation strategy of stem vascular bundles. There were significant differences in vascular bundles between periphery and inner region, including vascular bundle area, cavity size, CT value and distribution density. ‘Zone’ provides the most suitable classification criteria for the vascular bundles of stem. In ‘inner zone’, vascular bundles are independent of each other and can be separated by simple threshold segmentation. In the periphery zone, vascular bundles are connected by surrounding parenchyma, so more adaptive segmentation strategies should be adopted. (ii) Zone' reflected the material contents in the stem, structural characteristics of vascular bundle and distribution changes. Based on ‘Zone’ image segmentation, quantitative analysis of phenotypic characteristics of vascular bundles in different regions of stem could be achieved, which were more novel and accurate than by manually measuring. The new phenotypic indexes such as zone area, vascular bundle density, might have a better correlation with crop production and will provide new insights into the genetic architecture of vascular system. According to the information of population structure in previous study (Yang et al., 2011), phenotypic variation of 30 phenotypic indicators between subgroups was compared and obvious differences were identified (Figures 4 and S5). In this study, 1.03‐ to 9.74‐fold variations of vascular bundle traits were detected in a larger sample size of association mapping panel which consisted of 480 inbred lines across whole world. The phenotypic indicators with the most significant differences (P ≤ 0.001) between subpopulations were VBNum, VBAvArea, IZVBNum, IZVBAvArea, PZVBAvArea and PZVBDensity. In addition to the number of vascular bundles, the characteristics of vascular bundle area and vascular bundle density are important references for distinguishing different genotypes. Using VesselParser 4.0, more comprehensively phenotypic information of vascular bundles was captured, thus further expanding our understanding of variations of vascular bundle traits between subgroups. Due to the traditional manual testing method was time‐consuming and laborious, only the research of vascular bundle number in the uppermost internode of maize stem was reported (Huang et al., 2016). With greater understanding of the true multivariate and multiscale nature of genotypes will come increased insight into the mechanistic and developmental underpinnings of vascular bundle form and function. A huge amount of complex, integration of a wide range of image, spectral, environmental data can be generated through by the high‐throughput phenotyping technologies. Thus, the efficient storage, management and retrieval of phenotypic data are becoming the important issues to be considered (Zhao et al., 2019). In recent years, the database and management system of plant phenomics have been reported. 2011, PHENOPSIS DB for Arabidopsis thaliana phenotypic data were established (Fabre et al., 2011); 2014, ClearedLeaves DB, an on open online database was built to store, manage and access cleared leaf images and phenotypic data (Das et al., 2014); 2016, the Leibniz Institute of Plant Genetics and Crop Plant Research and the German Plant Phenotyping Network jointly launched the PGP repository as infrastructure to publish plant phenotypic and genotypic data comprehensively (Arend et al., 2016). However, because of the requirement for accurate identification of microscopic phenotypes for large amount population and a lack of high‐throughput and effective micro‐phenotyping detection methods, data sets or data management systems for microscopic phenotype information are rarely retrieved. Here, based on the micro‐phenotypic data of stem from NP population, the first database for stem micro‐phenotypes, MaizeSPD, was established. Currently, MaizeSPD has stored 554 pieces of basic information for maize inbred lines, 523 pieces of experimental information, 1008 pieces of CT scanning images and processed images, and 24 192 pieces of stem micro‐phenotypic data classified as 48 categories. MaizeSPD is a successful example of crop microscopic phenotype data storage, management, retrieve and sharing, which lays a foundation in accumulating and improving microscopic phenotype data. In the future, we will continue to expand the information in the database, from stem microscopic data to kernel, leaf and root micro‐phenotypic data. Genomic data play a major role in crop genetic improvements and breeding programmes. However, considerable gains can only be achieved by tightly coupling genomic discoveries to plant phenomics (Cobb et al., 2013). High‐throughput and automatic phenotyping facilities in indoor and field environment have developed rapidly over the last 5 years, significantly improving the efficiency and accuracy of crop phenotyping. Large‐scale phenotyping has been become an important compliment to genome sequencing and identifying genetic regulatory mechanisms (Harfouche et al., 2019; Zhao et al., 2019). In recent years, multi‐omics techniques that combine genomic data with phenotypic data have been applied to crop plants, rapidly decoding the function of a large number of unknown genes and identifying the molecular basis of many agronomic traits (Busemeyer et al., 2013; Muraya et al., 2017; Salvi and Tuberosa, 2015; Wu et al., 2019; Yang et al., 2014a,b, 2015; Zhang et al., 2017). In this study, vascular bundle traits of maize natural population panel containing 480 inbred lines were analysed. Combined with genome‐wide association studies (GWASs), a total of 1562 significant single nucleotide polymorphisms (SNPs) were identified for 30 stem micro‐phenotypic traits. The number of single‐trait‐related candidate genes annotated by SNPs listed in multiple methods validated results was 416. For the 416 single‐trait‐related candidate genes, the NCBI Gene database was used for further annotation, and 294 genes with more detailed functional annotation were obtained. Additionally, 84 genes listed both in top one of each method's results and multiple methods validated results were identified. Candidate genes identified by GWAS mainly encode enzymes involved in cell wall metabolism, transcription factors, protein kinase and protein related to plant signal transduction and stress response. Remarkably, we found a set of genes involved in the plant signal transduction and stress response, which associated with vascular bundle numbers traits. The SNP on chromosome 9 significantly associated with MYB transcription factor family genes was found located within the gene model GRMZM2G022686, which encodes a MYB‐related protein Myb4. This protein plays important roles in plant dwarf phenotype and increased tolerance to cold and freezing in Arabidopsis and barley (Soltész et al., 2012). And SNP on chromosome 3 significantly associated with MYB transcription factor family genes was also found located within the gene model GRMZM2G348238, which encodes a MYB family transcription factor EFM. This transcription factor acts as a flowering repressor, directly repressing FT expression in a dosage‐dependent manner in the leaf vasculature (Yan et al., 2014). Another significant SNP located on chromosome 8 at position 16948568 is contained in the gene region of GRMZM2G042756 that encodes dehydration‐responsive element‐binding protein 1D, which is a transcription factor played significant roles in responses to biotic and abiotic stresses (Zhang et al., 2012). And the significant SNP located on chromosome 4 is contained in the gene region of Zm00001d051364, which encodes peptidyl‐prolyl cis‐trans isomerase CYP20‐3. Peptidyl‐prolyl cis‐trans isomerase CYP20‐3 is a jasmonate family binding protein, and the jasmonate family of phytohormones plays central roles in plant development and stress acclimation (Dominguez‐Solis et al., 2008). For the vascular bundle area trait, we found genes involved in the gibberellin biosynthesis. The significant SNP located on chromosome 4 at position 6743055 is contained in the gene region of GRMZM2G445854 that encodes ent‐copalyl diphosphate synthase 2, catalysing the conversion of geranylgeranyl diphosphate to the gibberellin precursor ent‐copalyl diphosphate (ent‐CPP) in responses to biotic and abiotic stresses (Harris et al., 2005; Mafu et al., 2018). A set of candidate genes encoded enzymes involved in cell wall metabolism were identified, which associated with epidermis area traits. The significant SNP located on chromosome 3 at position 57096327 is contained in the gene region of GRMZM2G381473 that encodes UDP‐glucuronic acid decarboxylase 4. UDP‐glucuronate decarboxylase is the key enzyme involved in the biosynthesis of UDP ‐α‐D‐xylose, which is a nucleotide sugar involved in the synthesis of diverse plant cell wall hemicelluloses (xyloglucan, xylan) and minor plant metabolites. What is also interesting is that we found genes involved reproductive processes and embryogenesis, which associated with PZVBDensity traits. The significant SNP located on chromosome 4 at position 6406610 is contained in the gene region of GRMZM2G353268 that encodes 19 kDa zein A30, which is specifically expressed during seed development (Song et al., 2011). Another significant SNP located on chromosome 8 at position 174156791 is contained in the gene region of GRMZM2G339736 that encodes aspartic proteinase PCS1. In Arabidopsis, PCS1, which encodes an aspartic protease, has an important role in determining the fate of cells in embryonic development and in reproduction processes (Ge et al., 2005). Because the anatomical phenotypes of stem vascular bundles are comparatively difficult to obtain and there have been few genetic studies on microscopic traits, the phenotype‐associated genes obtained in our study can provide a reference for related research and provide new ideas for exploring the genetic mechanisms of vascular bundle agronomic traits in future studies.

Experimental procedures

Materials, growth conditions and sample collection

Four hundred and eighty maize inbred lines used in this study belonged to the maize natural population described by Yang et al. (2011), which were classified into four subgroups based on population structure Q matrix: Stiff stalk (SS) with 30 lines, non‐stiff stalk (NSS) with 133 lines, tropical‐subtropical (TST) with 212 lines and an admixed group with 105 lines. The population had a high‐density genotype of 1.25 million single nucleotide polymorphism (SNPs) with minor allele frequency (MAF) of > 0.0534 (Liu et al., 2017). The plants were grown in Tongzhou Experimental Station of Beijing Academy of Agriculture and Foresting Sciences in Beijing, China (116.68°E, 39.69°N). Sowing took place on 28 April 2018. Each inbred line was planted in four‐row plot, with eight plants each row. Each row was 2.1 m long and 60 cm between rows. The third internodes of three plants for each inbred line were collected at the silking period (73 days after sowing) for later research.

Standard scanning protocol and CT image reconstruction

The standardized procedure of maize microdata acquisition was constructed to ensure the reliability and consistency of image acquisition, and to provide a solution for large‐population and high‐precision scanning imaging of crop. First, we developed a sample preparation protocol for maize stem. A motor electric cutting machine (Bosch stone cutting machine, GDM13‐34) was firstly used to cut the third internode of maize stem into a series of 0.5–1.0 cm segments, since it was strenuous and fail‐prone by manual cutting. The sample segments were soak in FAA solution (90:5:5 v/v/v, 70% ethanol:100% formaldehyde:100% acetic acid) immediately. After the FAA fixation, samples were performed the sequential ethanol gradient dehydration in batch (i.e. 70%, 95% and 100%) and set the processing time of each ethanol gradient as 30 min. Next, samples were transferred to tertiary butyl alcohol and soaked for 24 h, and then froze samples at −80 ° c for 24 h. Finally, frozen samples were placed in the freeze‐dryer (LGJ‐10E, China) and freeze‐dried for 3 h in batch. According to the ' micro‐CT scanning protocol' introduced by Zhang (Zhang et al., 2018), dried stem samples were scanned by Skyscan 1172, and the unified scanning parameter was set as: 40 kV/250 µA, the imaging pixel sizes as 13.55 µm, 2K scanning mode (2000 × 2000 pixels). Finally, we defined much stricter reconstruction parameters to guarantee the consistence and quantification of imaging quality. According to the linear absorbance coefficient for X‐ray of various materials, the Hounsfield (HU) values of air and water are, respectively, −1000 and 0. For different maize variety and growth stages, we found out that HU values of maize stem distributed in a wide range from 0 to 7000. To provide a standard for quantification and evaluation, we defined a wider value ranges covered whole HU ranges of plant materials, that is [−1000, 9240], to transform raw data (16 grey level) into an 8‐bit grey‐level image with a value range [0, 255] (Figure S7).

Image segmentation and analysis strategies

Here, we designed a fully automated image analysis pipeline based C++ and OpenCV, named as VesselParser 4.0. This image analysis pipeline was summarized as a flow chart in Figure 1. As an important innovation, the transverse structures of maize stem were divided into three zones with physiological significance based on the material distribution and relationship, that is epidermis zone, periphery zone and inner zone. From imaging viewpoint, these zones were demonstrated different pixel intensity (matching to CT Hounsfield value) and pixel connectivity. In our knowledge, the physiological zones of maize stem are difficult to be accurately measured by manual work and visual investigation owing to the boundary ambiguity. Here, we first defined and detected these zones in the pixel level of image. Moreover, these zones could be described using simpler pattern description, such as boundary contours. Based on an object‐oriented strategy, we built a three‐layer structure (i.e. maize stem, zones and vascular bundles, respectively) to represent the maize stem slice for all maize cultivars. For each layer, the image analysis scheme of vascular bundles was more specific and robust according to the zone properties. As a result, more valuable traits related with three‐layer structures could be, respectively, detected. For example, epidermis region can be almost regarded as the epidermis zone; thus once the epidermis zone can be precisely detected based CT image, the epidermis thickness is easy to be measured, and as everyone knows, it is difficult to be done by manual work.

Data detection and selection

The pre‐processing of phenotypic data involved outlier detection and trait reproducibility assessments. We first adopted Grubbs' test (Grubbs, 1950) to evaluate the repeatability of phenotypic traits based on the assumption of normally distributed phenotypic data points for repeated measurements on replicated plants with a single genotype for each trait. Grubbs test results with a P‐value < 0.01 were considered to be outliers. The frequency of outliers in the reproducibility trait should be less than the number of random occurrences. Then, the Pearson correlation coefficient was used to detect sample outliers. A sample with r < 0.8 among all the other samples was identified as an outlier. Next, multiple linear regression (MLR) was implemented to reduce the correlation among explanatory variables. As a general rule, we considered VIF > 5 as a cut‐off value for the high multi‐collinearity problem. We used the VIF function in the R software package fmsb to calculate VIF. The ‘lm’ function of the R package lmer was used to construct the MLR model. Finally, the variables that remained after the stepwise regression were used for following analysis.

Clustering analysis, ANOVA and heritability analysis

We conducted unsupervised hierarchical clustering analysis using Pearson's correlation as a distance metric to organize 48 trait data into meaningful structures. In clustering, it considers each trait as a random variable with n observations and measures the similarity between the two traits by calculating the linear relationship between the distributions of the two corresponding random variables (Jiang and Lu, 2007). And the analysis of variance (ANOVA) was carried out to found out whether differences exist between different subpopulations means. These statistical analyses were conducted using SPSS Statistics 22 (2016) software (IBM, Armonk, NY, USA). Heritability refers to the percentage of genetic variation (V A) that accounts for the total variation of the phenotype, generally denoted by H 2. It can be used to compare the relationship between genetic () and environmental () factors for a specific phenotypic variation (V P). Heritability (H 2) was calculated for each trait as follows: The above analysis was performed in ASReml‐R v.3.0 by using the ‘asreml’ function of R package asreml (David, 2009).

Genome‐wide association study

Genotype data were obtained from Professor Yan Jianbing's laboratory of Huazhong Agricultural University (download URL: www.maizego.org/Resources.html). After quality control, 779 855 SNPs with minimum allele frequency (MAF) >0.05 and call rate >0.01 were used in our study. Population structure was estimated using the STRUCTURE program version 2.3.4 (Hubisz et al., 2009) based on 480 maize inbred lines, and 779 855 SNPs were used to estimate the relative kinship by TASSEL 5 (Bradbury et al., 2007). A multi‐locus random‐SNP‐effect mixed linear model tool for GWAS (R package ‘mrMLM’ version 4.0) (Zhang et al., 2019) was used on the 30 phenotypic traits separately to test the statistical association between phenotypes and genotypes. Population structure and relative kinship were taken into account in the model, and six methods (mrMLM, FASTmrMLM, FASTmrEMMA, ISIS EM‐BLASSO, pLARmEB and pKWmEB) included in the function ‘mrMLM’ were used in our study. For the first step, the criterion of P‐value was set as 6.4e‐7 (P ≤ 0.5/N, where N is the total number of genome‐wide SNPs). And then the default P‐value 0.0002 was used as the filter threshold for the second step to declare significance of SNPs associated with a given trait. The common results obtained by all methods were regarded as significant SNPs associated with phenotypic traits, and the overlapped loci of multiple methods were considered to be more reliable results. All candidate genes were annotated according to the latest maize B73 reference genome (B73 RefGen_v4) available in EnsemblPlants (http://plants.ensembl.org/Zea_mays/Info/Index) and NCBI Gene database (https://www.ncbi.nlm.nih.gov/gene).

Functional and network analysis

Pathway enrichment analysis was performed by PlantRegMap (Jin et al., 2015, 2017) and KOBAS 3.0 (Wu et al., 2006; Xie et al., 2011). The input data were consisted of all significant genes annotated by SNPs in coding regions for each phenotypic trait. Gene Ontology (GO) (Ashburner et al., 2000) terms with P‐value < 0.05 were identified as significant results. Besides, the significant interactions between the genes and their related phenotypic traits were visualized using Cytoscape v3.7.2 (National Institute of General Medical Sciences, Bethesda, MD, USA).

Conflict of interest

The authors declare that they have no conflicts of interest.

Author contributions

Y Zhang drafted and revised the manuscript. X Guo and C Zhao proposed the conceptualization of this study and reviewed the manuscript. J Du and J Wang edited the manuscript. Y Zhang, Y Zhao, X Lu, W Wen, S Gu, J Fan, C Wang, Y Wang, S Wu and S Liao performed field experiments and collected image data. J Du developed the image processing pipeline. Y Zhang and J Wang implement the statistical analysis and GWAS work. J Wang implements MaizeSPD construction. Y Zhang, J Du and J Wang analysed and interpreted the results. All authors read and approved the final manuscript. Figure S1 The screenshots of VesselParser 4.0 software function modules, including data management module, method parameters module, phenotyping computation module, and statistic analysis module. Click here for additional data file. Figure S2 Phenotypic distribution of 48 items phenotypic traits in the third internode of maize stem in NP population. Click here for additional data file. Figure S3 The clustering analysis of 48 phenotypic traits conducted with hierarchical clustering using the Pearson correlation as a distance metric. Click here for additional data file. Figure S4 Pearson correlation of 48 micro‐phenotypic traits of the basal third internode of maize stem in NP population. Click here for additional data file. Figure S5 The traits variation between subpopulations (TST, NSS, SS, and Mixed) for the remaining 24 indicators. Click here for additional data file. Figure S6 Functional enrichment results of all candidate genes associated with phenotypic traits. Click here for additional data file. Figure S7 The flowchart of high‐throughput micro‐phenotype analysis of maize stem. Click here for additional data file. Table S1 The 48 traits of stem description and abbreviation. Table S2 The phenotypic variations for 48 traits of vascular bundle in the third internode of maize stem among natural population. Click here for additional data file. Table S3 294 candidate genes with detailed functional annotation by the NCBI Gene database. Click here for additional data file. Table S4 GO terms and KEGG pathway enriched by the genes associated with 30 phenotypic traits by PlantRegMap and KOBAS 3.0. Click here for additional data file. Video S1 Video introduction of MaizeSPD, the phenotype database of stem vascular bundles for NP population. Click here for additional data file.
  66 in total

Review 1.  Yield-related QTLs and their applications in rice genetic improvement.

Authors:  Xufeng Bai; Bi Wu; Yongzhong Xing
Journal:  J Integr Plant Biol       Date:  2012-05       Impact factor: 7.061

2.  A gene controlling the number of primary rachis branches also controls the vascular bundle formation and hence is responsible to increase the harvest index and grain yield in rice.

Authors:  Tomio Terao; Kenji Nagata; Kazuko Morino; Tatsuro Hirose
Journal:  Theor Appl Genet       Date:  2009-11-22       Impact factor: 5.699

3.  Sequence, regulation, and evolution of the maize 22-kD alpha zein gene family.

Authors:  R Song; V Llaca; E Linton; J Messing
Journal:  Genome Res       Date:  2001-11       Impact factor: 9.043

Review 4.  Genome-wide Association Studies in Maize: Praise and Stargaze.

Authors:  Yingjie Xiao; Haijun Liu; Liuji Wu; Marilyn Warburton; Jianbing Yan
Journal:  Mol Plant       Date:  2016-12-27       Impact factor: 13.164

5.  Genetic variation of growth dynamics in maize (Zea mays L.) revealed through automated non-invasive phenotyping.

Authors:  Moses M Muraya; Jianting Chu; Yusheng Zhao; Astrid Junker; Christian Klukas; Jochen C Reif; Thomas Altmann
Journal:  Plant J       Date:  2017-01-07       Impact factor: 6.417

6.  Discovery, Biosynthesis and Stress-Related Accumulation of Dolabradiene-Derived Defenses in Maize.

Authors:  Sibongile Mafu; Yezhang Ding; Katherine M Murphy; Omar Yaacoobi; J Bennett Addison; Qiang Wang; Zhouxin Shen; Steven P Briggs; Jörg Bohlmann; Gabriel Castro-Falcon; Chambers C Hughes; Mariam Betsiashvili; Alisa Huffaker; Eric A Schmelz; Philipp Zerbe
Journal:  Plant Physiol       Date:  2018-02-23       Impact factor: 8.340

7.  Genome-wide association mapping of flowering time and northern corn leaf blight (Setosphaeria turcica) resistance in a vast commercial maize germplasm set.

Authors:  Delphine Van Inghelandt; Albrecht E Melchinger; Jean-Pierre Martinant; Benjamin Stich
Journal:  BMC Plant Biol       Date:  2012-04-30       Impact factor: 4.215

8.  Genome wide association study for drought, aflatoxin resistance, and important agronomic traits of maize hybrids in the sub-tropics.

Authors:  Ivan D Barrero Farfan; Gerald N De La Fuente; Seth C Murray; Thomas Isakeit; Pei-Cheng Huang; Marilyn Warburton; Paul Williams; Gary L Windham; Mike Kolomiets
Journal:  PLoS One       Date:  2015-02-25       Impact factor: 3.240

9.  Genome-wide association study of rice (Oryza sativa L.) leaf traits with a high-throughput leaf scorer.

Authors:  Wanneng Yang; Zilong Guo; Chenglong Huang; Ke Wang; Ni Jiang; Hui Feng; Guoxing Chen; Qian Liu; Lizhong Xiong
Journal:  J Exp Bot       Date:  2015-03-20       Impact factor: 6.992

Review 10.  Next-generation phenotyping: requirements and strategies for enhancing our understanding of genotype-phenotype relationships and its relevance to crop improvement.

Authors:  Joshua N Cobb; Genevieve Declerck; Anthony Greenberg; Randy Clark; Susan McCouch
Journal:  Theor Appl Genet       Date:  2013-03-08       Impact factor: 5.699

View more
  10 in total

Review 1.  Advanced high-throughput plant phenotyping techniques for genome-wide association studies: A review.

Authors:  Qinlin Xiao; Xiulin Bai; Chu Zhang; Yong He
Journal:  J Adv Res       Date:  2021-05-12       Impact factor: 10.479

2.  High-Throughput Phenotyping Accelerates the Dissection of the Phenotypic Variation and Genetic Architecture of Shank Vascular Bundles in Maize (Zea mays L.).

Authors:  Shangjing Guo; Guoliang Zhou; Jinglu Wang; Xianju Lu; Huan Zhao; Minggang Zhang; Xinyu Guo; Ying Zhang
Journal:  Plants (Basel)       Date:  2022-05-18

3.  Dissecting the Genetic Structure of Maize Leaf Sheaths at Seedling Stage by Image-Based High-Throughput Phenotypic Acquisition and Characterization.

Authors:  Jinglu Wang; Chuanyu Wang; Xianju Lu; Ying Zhang; Yanxin Zhao; Weiliang Wen; Wei Song; Xinyu Guo
Journal:  Front Plant Sci       Date:  2022-06-28       Impact factor: 6.627

4.  Dissecting the phenotypic components and genetic architecture of maize stem vascular bundles using high-throughput phenotypic analysis.

Authors:  Ying Zhang; Jinglu Wang; Jianjun Du; Yanxin Zhao; Xianju Lu; Weiliang Wen; Shenghao Gu; Jiangchuan Fan; Chuanyu Wang; Sheng Wu; Yongjian Wang; Shengjin Liao; Chunjiang Zhao; Xinyu Guo
Journal:  Plant Biotechnol J       Date:  2020-07-19       Impact factor: 9.803

5.  Responses of Maize Internode to Water Deficit Are Different at the Biochemical and Histological Levels.

Authors:  Fadi El Hage; Laetitia Virlouvet; Paul-Louis Lopez-Marnet; Yves Griveau; Marie-Pierre Jacquemot; Sylvie Coursol; Valérie Méchin; Matthieu Reymond
Journal:  Front Plant Sci       Date:  2021-02-26       Impact factor: 5.753

6.  Darkfield and Fluorescence Macrovision of a Series of Large Images to Assess Anatomical and Chemical Tissue Variability in Whole Cross-Sections of Maize Stems.

Authors:  Marie Berger; Marie-Françoise Devaux; David Legland; Cécile Barron; Benoit Delord; Fabienne Guillon
Journal:  Front Plant Sci       Date:  2021-12-14       Impact factor: 5.753

7.  A Genome-Wide Association Study Dissects the Genetic Architecture of the Metaxylem Vessel Number in Maize Brace Roots.

Authors:  Meiling Liu; Meng Zhang; Shuai Yu; Xiaoyang Li; Ao Zhang; Zhenhai Cui; Xiaomei Dong; Jinjuan Fan; Lijun Zhang; Cong Li; Yanye Ruan
Journal:  Front Plant Sci       Date:  2022-03-10       Impact factor: 5.753

8.  Exploring the Developmental Progression of Endosperm Cavity Formation in Maize Grain and the Underlying Molecular Basis Using X-Ray Tomography and Genome Wide Association Study.

Authors:  Shengjin Liao; Ying Zhang; Jinglu Wang; Chunjiang Zhao; Yong-Ling Ruan; Xinyu Guo
Journal:  Front Plant Sci       Date:  2022-04-07       Impact factor: 5.753

9.  OsCOMT, encoding a caffeic acid O-methyltransferase in melatonin biosynthesis, increases rice grain yield through dual regulation of leaf senescence and vascular development.

Authors:  Liexiang Huangfu; Rujia Chen; Yue Lu; Enying Zhang; Jun Miao; Zhihao Zuo; Yu Zhao; Minyan Zhu; Zihui Zhang; Pengcheng Li; Yang Xu; Youli Yao; Guohua Liang; Chenwu Xu; Yong Zhou; Zefeng Yang
Journal:  Plant Biotechnol J       Date:  2022-03-01       Impact factor: 13.263

10.  End-to-End Fusion of Hyperspectral and Chlorophyll Fluorescence Imaging to Identify Rice Stresses.

Authors:  Chu Zhang; Lei Zhou; Qinlin Xiao; Xiulin Bai; Baohua Wu; Na Wu; Yiying Zhao; Junmin Wang; Lei Feng
Journal:  Plant Phenomics       Date:  2022-08-02
  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.