Literature DB >> 35126428

Detecting QTL and Candidate Genes for Plant Height in Soybean via Linkage Analysis and GWAS.

Jiajing Wang1, Bo Hu1, Yuliang Jing2, Xiping Hu3, Yue Guo1, Jiankun Chen1, Yuxi Liu1, Jianhui Hao1, Wen-Xia Li1, Hailong Ning1.   

Abstract

Soybean is an important global crop for edible protein and oil, and plant height is a main breeding goal which is closely related to its plant shape and yield. In this research, a high-density genetic linkage map was constructed by 1996 SNP-bin markers on the basis of a recombinant inbred line population derived from Dongnong L13 × Henong 60. A total of 33 QTL related to plant height were identified, of which five were repeatedly detected in multiple environments. In addition, a 455-germplasm population with 63,306 SNP markers was used for multi-locus association analysis. A total of 62 plant height QTN were detected, of which 26 were detected repeatedly under multiple methods. Two candidate genes, Glyma.02G133000 and Glyma.05G240600, involving in plant height were predicted by pathway analysis in the regions identified by multiple environments and backgrounds, and validated by qRT-PCR. These results enriched the soybean plant height regulatory network and contributed to molecular selection-assisted breeding.
Copyright © 2022 Wang, Hu, Jing, Hu, Guo, Chen, Liu, Hao, Li and Ning.

Entities:  

Keywords:  QTL; QTN; candidate genes; plant height; soybean

Year:  2022        PMID: 35126428      PMCID: PMC8813865          DOI: 10.3389/fpls.2021.803820

Source DB:  PubMed          Journal:  Front Plant Sci        ISSN: 1664-462X            Impact factor:   5.753


Introduction

Soybean [Glycine max (L.) Merr.] is one of the most important crops in the world as a major source of protein and oil (Feng et al., 2021). Plant height (PH) as the main trait of soybean plant shape is related to soybean yield (Assefa et al., 2019). Low plants result in lower yields, while too high plants are prone to yield reduction due to lodging. Plant height also affects yield through other traits such as number of pods per plant and number of nodes in the main stem (Chang et al., 2018; Li M. W. et al., 2020). PH wass a complex quantitative trait which was controlled by multiple genes and influenced by environmental conditions (Lee et al., 1996). With the objective to breeding efficiently, QTL mapping for PH were conducted by linkage and genome-wide association analysis (GWAS) analysis. Based on the bi-parent derived populations and linkage analysis (Xu et al., 2017), 238 QTLs associated with plant height had been listed on all 20 chromosomes.[1] In these researches, most of the linkage maps were constructed by low-throughput molecular markers such as restriction fragment length polymorphism (RFLP), amplified fragment length polymorphism (AFLP), and simple sequence repeat (SSR) markers, which led to low marker density, large genomic region intervals for QTL localization. It was difficult to predict candidate genes and design marker-assisted selection for PH. With the continuous development of molecular markers, high-throughput and high-density single nucleotide polymorphism markers (SNPs) were used as major markers for linkage analysis for mapping QTL (Adewusi et al., 2017; Gomez-Casati et al., 2017; Zhang et al., 2019; Karikari et al., 2020; Tian et al., 2020; Salari et al., 2021; Silva et al., 2021). In order to construct effective linkage intervals to identify QTL, SNP bin maker technology were gradually used in construction of linkage map. Cao et al. (2019) constructed two linkage maps by 3,958 and 2,600 SNP bin markers for two RIL populations, and identified 8 and 12 PH QTL on chromosomes 2,5,6,7,9,10,15,16,17, and 19 explaining 1.8–50.7% of the phenotypic variation, respectively. Wang et al. (2021) constructed a high-density map containing 2225 bin markers and detected 39 PH QTLs on chromosome 5, 6, 7, 9, 10, 12, 15, 16, 18, and 20, and the phenotypic variation explanation (PVE) ranged from 1.11 to 18.99 % based on a recombinant inbred line population of soybean. The second method for detecting QTL was GWAS, which has been extensively studied through recombinant inbred lines and germplasm populations of soybean (Lü et al., 2016; Qi et al., 2020; Song et al., 2020). With the objective of overcoming the shortage of false positives (Sonah et al., 2015), combinations of linkage and association analysis were gradually used in detecting genome regions controlling quantitative traits (Zhang et al., 2019; Song et al., 2020; Li X. et al., 2021). However, few studies combining both methods have been conducted on PH of soybean. Based on the results of linkage and GWAS, some genes controlling PH formation were gradually mined, such as GA20ox, GA2ox, GA3ox (Fernandez et al., 2009), and GmDW1 (Li et al., 2018), Glyma.01G023100, Glyma.03G207700, Glyma.12G182500, Glyma.16G137500, Glyma.20G122200, Glyma.20G122300 (Jing et al., 2019), and Glyma.11g145500,Glyma.13g139000, Glyma.13g339800 and Glyma.19g006100 (Han et al., 2021). With pleiotropism, some genes controlled simultaneously multiple traits, for example, such as genetic loci Dw3 and Ma1 (Higgins et al., 2014), PH24 (Zhang et al., 2015), and uqA07-5 (Shen et al., 2018), and genes GmTOE4a (Zhao et al., 2015), GmAP1 (Chen et al., 2020), and Dt1 (Yue et al., 2021), GmGIa and GmFPA (Han et al., 2021) control flowering time and PH in soybean. GmTFL1b determines PH and growth habit, which a candidate gene for Dt1 (Liu et al., 2010). In this research, QTL/QTN localization of soybean plant height was performed via linkage analysis of a recombinant inbred lines and GWAS of a 455-germplasm population. In the region of QTL/QTN, candidate genes related to PH formation were predicted and initially validated by qRT-PCR. The objective of this research was to lay foundation for probing genetic basis and molecular assistant selection of PH.

Materials and Methods

Plant Populations

Two soybean varieties with large differences in PH, Dongnong L13 obtained from a cross between Heinong 40 and Jiujiao 5640 and Henong 60 obtained from a cross between Beifeng 11 and Hobbit, were used as parents to mate cross in 2008 in Harbin, Heilongjiang Province (E 126.63°, N 45.75°). F1 was planted in Yacheng City, Hainan Province (E 109.00°, N 17.50°) in the winter of the same year. After five consecutive generations from 2010 to 2014 by planting in Harbin and Yacheng alternatively, 139 recombinant inbred lines were obtained and a population formed and were used to conduct linkage analysis. Furthermore, a 455-germplasm population, including 4 local soybean varieties, 387 domestic varieties and 44 foreign varieties, was used for GWAS. The variety name was described and published earlier by Li X. et al. (2020).

Field Trials and Phenotypic Measurement

RIL6013 were planted in eight environments at three locations: Harbin (E 126.63°, N 45.75°), Keshan (E 125.64°, N 48.25°) and Shuangcheng (E 126.92°, N 45.75°). About 455 germplasm resources were planted in Harbin and Shuangyashan (E 131.16°, N 46.64°) in 2018 and 2019, respectively. The detail information for each plant environment was summarized in Supplementary Table 1. The field experiments were conducted in a randomized block in replication design (RBRD). RBRD is a randomized incomplete block design with three replicates used in each environment. With the objective to eliminate the difference of blot among large amounts of lines in a replicate (block), the replicates were divided into multiple sub-blocks which contain about 15 lines. Three ridges were contained in on blot, and the ridges were 3 m in length and 0.67 m in width. The seeds were sowed in single row on the ridges with the plant space set 0.07. The whole experiments were managed as the same as local field production. Ten plants were randomly selected from each blot to determine PH at the maturity stage. The average value of the 10 plants was used as the observation value of the plot, and finally the average value of the three blots was used for QTL and QTN mapping.

Statistical Analysis of Phenotype Data

Frequency distribution histograms were drawn from the phenotypic data of PH in each environment and descriptive statistics were performed. Analysis of variance (ANOVA) and estimation of generalized heritability were also performed as Li X. et al. (2021). The statistical model for the multi-environment ANOVA for RBRD was showed as follows: where μ is the grand average, G is the ith genotype effect, E is the eth environment effect, R is the jth replication effect, B is the kth block in jth replicate, RB is the interaction effect between jth replication and kth block, B(G ) is ith genotype in kth block, ER is interaction between eth environment and jth replication, EB is interaction between eth environment and kth block, ERB is interaction effect among eth environment and jth replication and kth block, EB(G) is ith genotype under interaction of eth environment and kth block, and ε is the error effect following N(0, σ2). The broad-sense heritability in multiple environments was showed as following: where h2 is the broad sense generalized heritability of average in over multiple environments,is the variance of genotype under block,is the variance of genotype under environment × block interaction, σ2 is the error variance, e is the number of environments, and r is the number of repetitions in each environment. Significance of each factor was tested by the general linear model method and variance were estimated using mixed method implemented by SAS 9.2 (SAS Institute, Cary, NC, United States).

SNP Genotyping

DNA samples extracted by CTAB method from RIL6013 were genotyped for SNPs using a soybean SNP660K microarray at Beijing Boao Biotechnology Co., Ltd. A total of 54,836 SNPs were screened on 20 chromosomes. A total of 63,306 SNPs were screened on 20 chromosomes using a soybean SNP180K microarray for SNP genotyping of 455 DNA samples from germplasm resources at Beidahuang Kenfeng Seed Co., Ltd. The obtained SNP markers were screened according to the following criteria: minimum allele frequency for markers (MAF > 5%) and maximum deletion rate <10% for each SNP (Belamkar et al., 2016).

Bin Maker Map Construction and QTL Localization

Here the SNP data from RIL6013 was used to identify possible crossovers via python 2.7snpbinner, and the minimum distance between crossovers is 0.2% of the chromosome length. The aggregated breakpoints generated from the crossover points were then used to create representative bins for the entire population (minimum distance of 30 kb per bin). The obtained bin markers were used to construct a high-density genetic linkage map of SNPbins using the.map function (Linkage map construction) in the software QTL IcimappingV 4.1 (Wang, 2009). QTL IcimappingV 4.1 (Wang, 2009) software was used to locate additive QTL using two mapping methods: interval mapping (IM-ADD) and inclusive composite interval mapping (ICIM-ADD). The scan step was set to 1.00 cM and the LOD threshold was set to 2.50. The PIN value of the ICIM-ADD method was set to 0.001. The QTL were named using the method of McCouch (1997).

Genome-Wide Association Analysis

The population structure and LD of the germplasm resource population were described and published earlier by Li X. et al. (2020). The germplasm resource population consisted of two subpopulations containing 132 (29.01%) and 323 (70.99%) lineages, respectively (K = 2). And the physical distance of LD decay was estimated as the position where r2 dropped to half of its maximum value, the LD decay distance was estimated to be 86 kb. Genome-wide association analysis was performed using the mrMLM.GUI package (Zhang et al., 2020), and the six methods (mrMLM (Wang et al., 2016), FASTmrMLM (Tamba and Zhang, 2018), FASTmrEMMA (Wen et al., 2019), pLARmEB (Zhang et al., 2017), ISIS EM-BLASSO (Tamba et al., 2017), and pKWmEB (Ren et al., 2018) were used to detect significant QTN. In the first stage, the critical p-value parameter was set to 0.005 for all methods except FASTmrEMMA, and the critical LOD value for significant QTN was set to 3 in the final stage. The kinship matrix used in the analysis was also calculated by the software itself.

Candidate Gene Prediction

Genomic regions repeatedly identified in multiple environments or two populations were used to predict genes involving in PH formation. Specifically, the genome region of QTL interval localized in multiple environments with a genomic region less than 300 kb and the LD decay distance of 86 kb of the QTN localized within the QTL genomic region were selected, and the genes were searched for by the Phytozome website.[2] The genes expressed in the stems were then screened. Finally, candidate genes related with PH were identified by combining annotation information of genes, pathway analysis in the Kyoto Encyclopedia of Genes and Genomes (KEGG)[3] and previous studies.

Candidate Gene Validation

Two parents (Dongnong L13 and Henong 60), two varieties (HN400 and HN451) with lower PH and two varieties (HN369 and HN477) with higher PH, were selected in the RIL6013 population based on the PH phenotype data. The qRT-PCR was used to study the relative expression of candidate genes in these six varieties. These varieties were planted in Harbin in the same environment as E1. Stems were sampled at 10-day intervals starting from the R1 period when elongation is the fastest. The third node down from the top of the main stem was taken and replicated three times per plant. Total RNA was extracted using the OminiPlant RNA Kit (Dnase I) (CWBIO, Jiangsu, China). Two microgram of total RNA was extracted using the EasyScript® One-Step gDNA Removal and cDNA Synthesis SuperMix kit (TransGen Biotech, Beijing, China). The first strand cDNA was synthesized from 2μg of total RNA using the EasyScript® One-Step gDNA Removal and cDNA Synthesis SuperMix kit (TransGen Biotech, Beijing, China). Twenty microliter reaction volume was determined for qRT-PCR using the SYBR®Green doping method from Roche Light Cycle™ containing the following components: 10 μL SYBR®Green Realtime PCR Master Mix (TOYOBO, Japan), 0.8 μL of each primer (10 μM), 6.4 μL of distilled water and 2 μL of diluted cDNA. The whole reaction was run under the following conditions: pre-denaturation 95°C for 30 s; PCR40 cycles, 95°C for 5 s, 60°C for 20 s, 72°C for 15 s; solubility curve analysis 95°C 10 s, 65°C for 60 s, and 97°C for 1 s. All PCR reactions were repeated three times. Data were processed using the 2–ΔΔCt method using FBOX as the internal reference gene (Bansal et al., 2015), the primers used are shown in Supplementary Table 2.

Molecular Marker Identification

With the objective to verify the effect of gene and develop markers for molecular assistant selection, the markers with polymorphism in the 100k bp interval of the genes were evaluated for the association with plant height in the 455-germplasm population. The significant differences of averages between allelic genotypes were determined by analysis of variance, and the probability to determine the significance was set 0.05.

Results

Phenotypic Variation Analysis

Phenotypic data collected from 139 lines of RIL6013 in eight environments were analyzed. 455 germplasm resource populations in four environments were analyzed early by Wang et al. (2021). The results of descriptive statistics (Table 1) showed that the absolute values of kurtosis and skewness were less than 1 in all the eight environments of RIL6013 except E8, which was close to 1. It showed that PH distributed normally (Figure 1). The range of PH in RIL6013 contained those of parents, which indicated a transgressive segregation in the two populations. The coefficient of variation ranged from 13.10 to 22.00% for the RIL6013 population and from 18.03 to 20.31% for the 455-germplasm population, which suggested that a wide range of variation in plant height in two populations and a different genetic basis in different environments.
TABLE 1

Descriptive statistics for soybean plant height of RIL6013 population in eight environments and 455 germplasm resources in four environments.

Env.P1aP2Minimum (cm)Maximum (cm)Rangeb (cm)Mean ± STDc (cm)SkewKurtCVd (%)
E177.00144.0067.00117.52 ± 15.40–0.63–0.2213.10
E267.40151.4084.00120.49 ± 19.02–0.75–0.2615.79
E3134.0063.8072.00157.0085.00120.96 ± 19.31–0.48–0.5415.96
E4132.0063.0065.00155.0090.00120.42 ± 16.35–0.830.9613.58
E5139.6775.6765.33149.0083.67123.58 ± 17.97–0.80–0.1714.54
E6135.0072.0060.00163.33103.33133.23 ± 19.47–0.930.6814.61
E7144.6777.5062.00162.67100.67130.80 ± 19.01–0.800.6314.53
E8137.5077.0058.00152.0094.00122.04 ± 19.85–1.201.2316.27
E937.00131.0094.0086.82 ± 15.65–0.190.5418.03
E1038.33176.67138.3495.83 ± 17.440.482.4018.20
E1140.00168.67128.6797.35 ± 19.77–0.100.0820.31
E1247.00158.00111.0093.13 ± 16.500.120.2217.71

FIGURE 1

Frequency histograms of plant height in RIL 6013 in eight environments.

Descriptive statistics for soybean plant height of RIL6013 population in eight environments and 455 germplasm resources in four environments. Frequency histograms of plant height in RIL 6013 in eight environments. The results of ANOVA (Table 2) showed that there were highly significant differences in environment, genotype, and genotype × environment interaction effect, which indicated that PH was influenced not only by genotype and environment but also by genotype by environment interaction effect. Higher broad sense heritability (65 and 72%) was found in RIL6013 and 455 germplasm resource populations, respectively, which indicated that the variation of soybean plant height mainly come from genetic effect.
TABLE 2

Joint ANOVA of PH of RIL6013 population and 455 germplasm resources in multiple environment and heritablity.

PopulationSourceDFSSMSFPr > Fσ2
RIL6013Replication2208.033104.0120.360.6949
Environment793199.50013314.21246.6<0.0001
Block923934.8612659.4349.31<0.0001
Replication × Block188740.392485.5831.70.033
Block (Genotype)129351127.2342721.9239.53<0.000174.412
Replication × Environment143107.823221.9940.780.6954
Environment × Block6367999.4441079.3623.78<0.0001
Replication × Environment × Block12638642.481306.6911.070.2779
Environment × Block (Genotype)896849663.510948.2913.32<0.0001220.863
Error2050585738.432285.733285.733
Total33142019279.831
h265%
GermplasmReplication21353.231676.6122.460.0852
Environment3101153.85233717.952122.83<0.0001
Block30197653.3746588.45124<0.0001
Replication × Block6017256.132287.6301.050.3769
Block (Genotype)424914880.7612157.7447.86<0.0001139.413
Replication × Environment6381.32363.5530.230.9665
Environment × Block86112773.7721311.3214.78<0.0001
Replication × Environment × Block17245913.361266.9420.970.5865
Environment × Block (Genotype)1178753903.492639.9932.33<0.0001125.211
Error3204879537.283274.514274.512
Total51653071626.461
h272%
Joint ANOVA of PH of RIL6013 population and 455 germplasm resources in multiple environment and heritablity.

Bin Map and QTL Localization for RIL6013

A high-density SNP bin genetic linkage map covered all 20 chromosomes containing 1996 bin markers, and the total length of the map was 2874.72 cM. The number of SNP bin markers per chromosome ranged from 59 to 158, and the length of each linkage group ranged from 82.37 to 238.98 cM. The average number of markers per linkage group was 99.8, and the average distance between markers was 1.48 cM (Figure 2 and Table 3).
FIGURE 2

The high-density bin map of RIL6013.

TABLE 3

Description of characteristics of the 20 linkage groups in the high-density map.

Chr.No.of SNP in mapMap length (cM)Average interval (cM)
Chr174141.721.94
Chr2118238.982.04
Chr393152.621.66
Chr4131150.461.16
Chr56291.621.50
Chr6120145.881.23
Chr7135207.451.55
Chr897150.881.57
Chr9128129.441.02
Chr1075151.672.05
Chr1175114.371.55
Chr1259115.621.99
Chr13158232.851.48
Chr1480131.031.66
Chr15135184.951.38
Chr168394.021.15
Chr179691.750.97
Chr188982.370.94
Chr1994154.331.66
Chr2094112.711.21
Sum19962874.721.48
The high-density bin map of RIL6013. Description of characteristics of the 20 linkage groups in the high-density map. A total of 33 QTLs associated with plant height were localized in the RIL6013 population on 12 chromosomes of soybean using two methods IM and ICIM based on bin mapping (Figure 3 and Supplementary Table 3). The number of QTL localized on each chromosome ranged from one (Chr02, Chr04, Chr12, and Chr13) to six (Chr14), with phenotypic contributions ratio ranging from 0.55 to 13.64%. 2, 16, 6, 9, 1, 1, and 6 QTLs were localized in E2–E8, respectively. A total of three QTL (qPH-1-1, qPH-6-2, and qPH-18-4) showed phenotypic contributions ratio more than 10% and can be considered as the main effective QTL for plant height.
FIGURE 3

Frequency of QTL for plant height on 20 chromosomes in RIL 6013.

Frequency of QTL for plant height on 20 chromosomes in RIL 6013. A total of five QTLs were localized in multiple environments (Table 4), and the additive effects were all positive, indicating that the parent Dongnong L13 could increase plant height via these QTL. qPH-2-1 was localized on chr02 in E3, E4, and E8 environments with LOD values of 2.87–3.09 and phenotypic contributions ratio of 1.60–6.46%. The genomic region of qPH-2-1 was shorter than 320kb, which is suitable for searching candidate genes.
TABLE 4

Five QTL detected in multiple environments.

QTLEnv.Chr.Marker IntervalLODaPVE (%)bADDcPhysical Region (Mb)Method
qPH-1-1E3/E5/E8/E8160c01056-60c010523.34/4.28/3.62/3.621.67/3.19/2.68/11.7216.70/41.16/31.10/31.1048.83–50.55IM/IM/IM/ICIM
qPH-2-1E3/E4/E8260c02058-60c020592.87/3.09/2.961.60/6.46/3.5315.53/18.93/27.0913.66–13.98IM/IM/IM
qPH-3-5E5/E6/E8/E6360c03076-60c030802.63/2.73/3.51/2.734.72/3.92/2.19/3.9221.31/25.74/32.81/25.7442.10–42.72IM/IM/IM/ICIM
qPH-12-1E2/E3/E21260c12024-60c120322.68/2.90/2.685.60/1.47/5.6015.77/15.54/15.7711.79–14.97IM/IM/ICIM
qPH-18-3E5/E81860c18061-60c180584.37/3.483.25/0.8624.58/40.7749.04–52.21IM/IM

Five QTL detected in multiple environments.

Multi-Locus GWAS for Germplasm

A total of 62 QTN were detected on 18 chromosomes (except for chr11 and chr20) using six multilocus methods within the mrMLM package: mrMLM, FASTmrMLM, FASTmrEMMA, pLARmEB, ISIS EM-BLASSO, and pKWmEB, respectively. LOD values ranged from 3.02 to 10.45, and the ratio of phenotypic variation explained by QTN ranged from 1.12 to 13.12%. Six methods detected 20, 10, 3, 18, 25, and 29 QTN, respectively, while 15, 23, 13, and 11 QTN were detected within E1, E2, E3, and E4, respectively. Of all QTN, 26 detected by multiple methods were located on chromosomes 1, 2, 3, 4, 6, 7, 9, 10, 13, 14, 15, 16, and 18, respectively, with LOD values ranging from 3.04 to 10.45. The proportion of phenotypic variation explained by QTN ranged from 1.12 to 6.62%. The detected QTN effects (positive or negative) were consistent between methods (Wang et al., 2021).

Co-detected Regions by Linkage Analysis and Association Analysis

The regions detected by GWAS were compared with those of the linkage analysis. The results showed that two QTN loci fell within the genomic region where the two QTLs identified in the RIL6013 population (Figure 4). Among them, AX-90484715 was located within the interval of qPH-5-4 and AX-90349538 was located within the interval of qPH-14-2. Candidate genes were searched within 43 kb flanking these two QTN loci based on LD distance.
FIGURE 4

Distribution of QTLs and QTNs for plant height identified in RIL 6013 and the germplasm panel on genome map. Bolded black fonts represent multi-environment QTL, blue fonts represent QTLs in previous studies and red fonts represent QTLs and QTNs that are used to predict candidate genes.

Distribution of QTLs and QTNs for plant height identified in RIL 6013 and the germplasm panel on genome map. Bolded black fonts represent multi-environment QTL, blue fonts represent QTLs in previous studies and red fonts represent QTLs and QTNs that are used to predict candidate genes. Based on the above results, candidate genes were selected to search within 13.66–13.98 Mb on Chr02, 41.55-41.63Mb on Chr05 and 4.01-4.09Mb on chr14. A total of 50 candidate genes were searched, of which 46 genes were expressed in the stems. The pathway analysis on 46 genes showed that a total of 18 genes (39.1%) had annotations (Supplementary Table 4). Based on the annotation information of KEGG and metabolic function information, three potential candidate genes were predicted that may be directly or indirectly related to PH (Table 5).
TABLE 5

Detailed information of three candidate genes related to plant height.

QTL or QTN nameGene nameChromosomePositionKO numberAnnotation
qPH-2-1Glyma.02G132200Chr0213704118..13708015K09843CYP707A; (+)-abscisic acid 8’-hydroxylase [EC:1.14.14.137]
qPH-2-1Glyma.02G133000Chr0213760608..13761881K13448CML; calcium-binding protein CML
AX-90484715Glyma.05G240600Chr0541566791..41569485K22736VIT; vacuolar iron transporter family protein
Detailed information of three candidate genes related to plant height. The relative expression of the three candidate genes in the two parents, HN400, HN451, HN369, and HN477, were characterized by applying qRT-PCR. The plant height of the six varieties continued to grow from R1 to day 30, with highly significant differences in plant height from day 10 after R1 (Supplementary Table 5 and Figure 5A). Relative expression amount (REA) of Glyma. 02G132200 did not differ significantly among varieties at the whole stages, which indicated Glyma. 02G132200 was not directly related to PH. It could be related to the trait from the DNA level or some other pathway. For Glyma. 02G133000, REA of the six varieties increased continuously from R1 to day 20 and started to decrease from day 20 to day 30. REA from day 10 to day 30 of HN369, HN477 and Dongnong L13 was larger significant than that of HN400, HN451 and Henong 60 (Figure 5B). The expression of Glyma. 05G240600 in the six cultivars continued to increase from R1 to day 30. REA of HN369, HN477 and Dongnong L13 were significantly higher than that of HN400, HN451 and Henong 60 from R1 to day 30 (Figure 5C).
FIGURE 5

Plant height and relative expression patterns of candidate genes. Plant height of six varieties at different times express as (A), relative expression of Glyma.02G133000 in six varieties at different times express as (B) and relative expression of Glyma.02G133000 in six varieties at different times express as (C). *p < 0.05, **p < 0.01, ***p < 0.001, ****p < 0.0001.

Plant height and relative expression patterns of candidate genes. Plant height of six varieties at different times express as (A), relative expression of Glyma.02G133000 in six varieties at different times express as (B) and relative expression of Glyma.02G133000 in six varieties at different times express as (C). *p < 0.05, **p < 0.01, ***p < 0.001, ****p < 0.0001. From the 455-germplasm population, seven and one SNP markers associated with plant height were detected near Glyma.05G240600 and Glyma.02G133000 (Table 6), which indicated that the two genes controlling plant height. Among these markers, AX-90483488, AX-90490846, and AX-90515514 were detected in three environments, while the rest five markers were detected in only one environment. These eight markers could be used improve plant height commonly or specifically.
TABLE 6

SNPs markers associated with plant height near Glyma.02G133000 and Glyma.05G240600.1.

GenesSNP probePhysical PositionGenotype18Ha
19H
19S
FPr < FFPr < FFPr < F
Glyma.02G133000.1 AX-9039481013746369C/T9.110.0027
AX-9052467613750891A/C9.010.0028
AX-9050521813763345A/T4.110.0433
AX-9035136713765868A/G4.950.0267
AX-9046040613766378A/G4.120.043
AX-9048348813787299G/C14.50.000210.190.00158.120.0046
AX-9049084613790192G/T7.80.00557.720.005710.60.0012
Glyma.05G240600.1 AX-9051551441489472C/T6.040.01447.10.0083.930.0481

SNPs markers associated with plant height near Glyma.02G133000 and Glyma.05G240600.1.

Discussion

Improving the Accuracy of QTL Analysis and GWAS by Multi-Environment Experiments and Sufficient SNP Markers

The small amount of RFLP, AFLP, and SSR markers used in previous studies made it difficult to ensure the accuracy of linkage analysis (Singh et al., 2016; Bhat et al., 2020), and most of the previously localized QTL were analyzed in a single environment, which is prone to false positive results (Fang et al., 2020). QTL detected repeatedly in multiple environments are more authentic than those detected in a single environment (Fulton et al., 1997). Here, a high-density genetic map containing 1,966 SNP bin markers was constructed using RIL6013 with the average distance between markers of 1.48 cM, which improved the resolution of the map and facilitated the localization of more QTL and shortened the interval of localized QTL. Phenotypic variation was enriched using an eight-environment experiment at multiple locations over multiple years. And the candidate genes were searched within the stable QTL intervals that were repeatedly localized in multiple environments. Summarizing the above measures, the accuracy of the linkage analysis was improved. For association analysis, more molecular markers could produce a higher probability of detecting functional loci (Xu et al., 2021). Multi-locus GWAS methods are effective in reducing false-positive QTNs compared to single-locus GWAS methods (Qi et al., 2020). An ideal germplasm resource population should contain rich genotypic and phenotypic data (Kaler et al., 2020). Based on the above considerations, the genotype data of 63,306 SNP markers from a natural 455-germplasm population and phenotypic data from four environments were used to conduct multi-locus GWAS analysis, which improve the accuracy of association analysis and reduce the ratio of false positives.

Comparison With Previous Results of Localized QTL

Here, the five QTLs located by RIL6013 repeatedly in multiple environments and the two QTNs identified by a combination of linkage and association analysis were compared with 238 QTLs associated with PH located by previous researches in the soybase database (Figure 4). The interval of qPH-2-1, which is located on chr02, was contained by the interval of Plant height 42-1 (Hu et al., 2013). The interval of qPH-3-5 on chr03 located in the interval of Plant height 26-17 (Sun et al., 2006). The interval of qPH-12-1 on chr12 had a overlapping region with the interval of Plant height 38-7 (Lee et al., 2015). The interval of qPH-18-3 on chr18 crossed the intervals of Plant height 26-13 and Plant height 26-14 (Sun et al., 2006). The genomic region of AX-90484715 on ch05 had a overlapping region with the interval of Plant height 37-1 (Yao et al., 2015). The qPH-1-1 localized on chr01 in three environments, E3, E5 and E8, was a newly identified QTL, which was more than 20 Mb away from mqPlant height-005 (Pathan et al., 2013). The AX-90349538 on chr14 was a newly identified QTN, which was more than 1.9 Mb away from Plant height 34-6 (Kim et al., 2012). In addition, compared with genes related to plant height identified in previous studies, we found that the GA20ox controlling PH formation (Fernandez et al., 2009) was only 170 kb away from AX-90464100. These results support the accuracy of this study.

Further Analysis of Candidate Genes by qRT-PCR

Using annotation information and metabolic function information we initially predicted three candidate genes that might be associated with PH. Applying qRT-PCR technique to identify the relative expression of the three candidate genes in six varieties with significant differences in plant height, it was found that two candidate genes may be associated with plant height. Among them, Glyma.02G133000 is a calcium-binding protein gene involved in calmodulin synthesis, and calmodulin is involved in regulating leaf senescence and ABA response in Arabidopsis affecting plant growth and development (Dai et al., 2018). Glyma.05G240600 is involved in the synthesis of the vesicular iron transporter protein VIT, and iron stored in the vesicles plays a crucial role in the development of plant seedlings; if iron is deficient, it leads to stunted seedlings. Thus, it was speculated to regulate of plant height (Kim et al., 2006). The relative expression of Glyma.02G133000 and Glyma.05G240600 from R1 to day 30 was higher in the high PH varieties than in the low PH varieties, so it was speculated that Glyma.02G133000 and Glyma.05G240600 may have a positive regulatory function on plant height. However, plant height is a complex quantitative trait, and the specific regulation of plant height by these two candidate genes needs to be investigated in follow-up studies.

Summary

Here, five multi-environmental QTL and 26 multi-method QTN were detected by linkage analysis and association analysis, respectively, and two candidate genes associated with plant height were identified by pathway analysis and qRT-PCR validation. These results lay the foundation for marker-assisted selection.

Data Availability Statement

The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding authors.

Author Contributions

W-XL and HN conceived and designed the experiments. JW, BH, YJ, XH, YG, JC, YL, and JH performed the field experiments. JW and HN analyzed and interpreted the results. JW, BH, and HN drafted the manuscript. W-XL provided the laboratory conditions. All authors contributed to the manuscript revision.

Conflict of Interest

XH is employed by Key Laboratory of Crop Biotechnology Breeding of the Ministry of Agriculture, Beidahuang Kenfeng Seed Co., Ltd., Harbin, China. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
  40 in total

1.  FT5a interferes with the Dt1-AP1 feedback loop to control flowering time and shoot determinacy in soybean.

Authors:  Lin Yue; Xiaoming Li; Chao Fang; Liyu Chen; Hui Yang; Jie Yang; Zhonghui Chen; Haiyang Nan; Linnan Chen; Yuhang Zhang; Haiyang Li; Xingliang Hou; Zhicheng Dong; James L Weller; Jun Abe; Baohui Liu; Fanjiang Kong
Journal:  J Integr Plant Biol       Date:  2021-03-26       Impact factor: 7.061

2.  Soybean AP1 homologs control flowering time and plant height.

Authors:  Liyu Chen; Haiyang Nan; Lingping Kong; Lin Yue; Hui Yang; Qingsong Zhao; Chao Fang; Haiyang Li; Qun Cheng; Sijia Lu; Fanjiang Kong; Baohui Liu; Lidong Dong
Journal:  J Integr Plant Biol       Date:  2020-07-24       Impact factor: 7.061

3.  Improving power and accuracy of genome-wide association studies via a multi-locus mixed linear model methodology.

Authors:  Shi-Bo Wang; Jian-Ying Feng; Wen-Long Ren; Bo Huang; Ling Zhou; Yang-Jun Wen; Jin Zhang; Jim M Dunwell; Shizhong Xu; Yuan-Ming Zhang
Journal:  Sci Rep       Date:  2016-01-20       Impact factor: 4.379

4.  pKWmEB: integration of Kruskal-Wallis test with empirical Bayes under polygenic background control for multi-locus genome-wide association study.

Authors:  Wen-Long Ren; Yang-Jun Wen; Jim M Dunwell; Yuan-Ming Zhang
Journal:  Heredity (Edinb)       Date:  2017-12-13       Impact factor: 3.821

5.  Genome-Wide Association Studies for Dynamic Plant Height and Number of Nodes on the Main Stem in Summer Sowing Soybeans.

Authors:  Fangguo Chang; Chengyu Guo; Fengluan Sun; Jishun Zhang; Zili Wang; Jiejie Kong; Qingyuan He; Ripa A Sharmin; Tuanjie Zhao
Journal:  Front Plant Sci       Date:  2018-08-20       Impact factor: 5.753

6.  Recommended Reference Genes for Quantitative PCR Analysis in Soybean Have Variable Stabilities during Diverse Biotic Stresses.

Authors:  Raman Bansal; Priyanka Mittapelly; Bryan J Cassone; Praveen Mamidala; Margaret G Redinbaugh; Andy Michel
Journal:  PLoS One       Date:  2015-08-05       Impact factor: 3.240

7.  Linkage Analysis and Multi-Locus Genome-Wide Association Studies Identify QTNs Controlling Soybean Plant Height.

Authors:  Yanlong Fang; Shulin Liu; Quanzhong Dong; Kaixin Zhang; Zhixi Tian; Xiyu Li; Wenbin Li; Zhongying Qi; Yue Wang; Xiaocui Tian; Jie Song; Jiajing Wang; Chang Yang; Sitong Jiang; Wen-Xia Li; Hailong Ning
Journal:  Front Plant Sci       Date:  2020-02-14       Impact factor: 5.753

8.  An efficient multi-locus mixed model framework for the detection of small and linked QTLs in F2.

Authors:  Yang-Jun Wen; Ya-Wen Zhang; Jin Zhang; Jian-Ying Feng; Jim M Dunwell; Yuan-Ming Zhang
Journal:  Brief Bioinform       Date:  2019-09-27       Impact factor: 11.622

9.  Identification of quantitative trait nucleotides and candidate genes for soybean seed weight by multiple models of genome-wide association study.

Authors:  Benjamin Karikari; Zili Wang; Yilan Zhou; Wenliang Yan; Jianying Feng; Tuanjie Zhao
Journal:  BMC Plant Biol       Date:  2020-09-01       Impact factor: 4.215

View more
  1 in total

1.  An APETALA2/ethylene responsive factor transcription factor GmCRF4a regulates plant height and auxin biosynthesis in soybean.

Authors:  Zhiyong Xu; Ruikai Wang; Keke Kong; Naheeda Begum; Aisha Almakas; Jun Liu; Hongyu Li; Bin Liu; Tuanjie Zhao; Tao Zhao
Journal:  Front Plant Sci       Date:  2022-09-06       Impact factor: 6.627

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.