Literature DB >> 31552060

Utilization of Interspecific High-Density Genetic Map of RIL Population for the QTL Detection and Candidate Gene Mining for 100-Seed Weight in Soybean.

Benjamin Karikari1, Shixuan Chen1, Yuntao Xiao1, Fangguo Chang1, Yilan Zhou1, Jiejie Kong1, Javaid Akhter Bhat1, Tuanjie Zhao1.   

Abstract

Seed-weight is one of the most important traits determining soybean yield. Hence, it is prerequisite to have detailed understanding of the genetic basis regulating seed-weight for the development of improved cultivars. In this regard, the present study used high-density interspecific linkage map of NJIR4P recombinant inbred population evaluated in four different environments to detect stable Quantitative trait loci (QTLs) as well as mine candidate genes for 100-seed weight. In total, 19 QTLs distributed on 12 chromosomes were identified in all individual environments plus combined environment, out of which seven were novel and eight are stable identified in more than one environment. However, all the novel QTLs were minor (R 2 < 10%). The remaining 12 QTLs detected in this study were co-localized with the earlier reported QTLs with narrow genomic regions, and out of these only 2 QTLs were major (R 2 > 10%) viz., qSW-17-1 and qSW-17-4. Beneficial alleles of all identified QTLs were derived from cultivated soybean parent (Nannong493-1). Based on Protein ANalysis THrough Evolutionary Relationships, gene annotation information, and literature search, 29 genes within 5 stable QTLs were predicted to be possible candidate genes that might regulate seed-weight/size in soybean. However, it needs further validation to confirm their role in seed development. In conclusion, the present study provides better understanding of trait genetics and candidate gene information through the use high-density inter-specific bin map, and also revealed considerable scope for genetic improvement of 100-seed weight in soybean using marker-assisted breeding.

Entities:  

Keywords:  QTL; candidate gene; marker-assisted breeding; seed-weight; soybean

Year:  2019        PMID: 31552060      PMCID: PMC6737081          DOI: 10.3389/fpls.2019.01001

Source DB:  PubMed          Journal:  Front Plant Sci        ISSN: 1664-462X            Impact factor:   5.753


Introduction

Soybean (Glycine max L. Merr.) is one of the most economically important crop being rich source of both edible oil and protein as well as has significant role in health, biofuel, and soil fertility improvement (Kulkarni et al., 2016). In China, soybean production has continuously declined with considerable low yield increase in the past 50 years (Liu et al., 2018). Moreover, China imports >80% of soybean for their total domestic use; hence, it is prerequisite to increase the domestic production of soybean to make country self-sufficient (Liu et al., 2018). Different yield-related traits are targeted by plant breeders to increase soybean production. In this context, seed-weight is one of the most important yield-related trait for increasing seed yield in soybean; however, it is a complex quantitative trait governed by polygenes and are highly influenced by environment, which makes it selection difficult for plant breeders (Yao et al., 2015). Furthermore, seed weight/size determines the specific soy-based food product that can be made from soybean (Cui et al., 2004; Gandhi, 2009). For instance, small-seeded cultivars are suitable for fermented soybean (natto) and sprout production, whereas large-seeded cultivars are used for boiled soybean (nimame), green soybean (edamame), soymilk, and soybean curd (tofu) (Liang et al., 2016; Teng et al., 2017; Wu et al., 2018). In addition, seed weight/size influences germination ability and seedling vigor, which in turn determines the competitive ability of the seedling for light, nutrient resources, and stress tolerance (Coomes and Grubb, 2003; Gomez, 2004; Haig, 2013). Seed weight is one of the traits that was altered during domestication (Lee et al., 2011; Han et al., 2016). During domestication process from wild species to cultivated soybean, selecting desirable agronomic traits to keep achieving high yield allows many genes to be either directly selected or filtered out, resulting in a significant reduction of genetic diversity in soybean gene pool (Guo et al., 2010; Tang et al., 2010). Hyten et al. (2006) suggested that 50% of the genetic diversity and 81% of the rare alleles have been lost during domestication and that 60% of the genes show significant changes in allele frequency as a result of soybean domestication. It has been reported that wild soybean (Glycine soja) is an important source of genes for higher yield and related traits, quality, as well as biotic and abiotic stresses (Zhou Z. et al., 2015). Thus, it is necessary to broaden the gene pool in soybean breeding from diverse sources, especially from wild soybean (G. soja). The seed of cultivated soybean (G. max) is heavier and bigger compared to the wild accessions (Yu et al., 2017). Both wild and cultivated soybean belong to the same genus Glycine (Kim M.Y. et al., 2010), with the former having higher level of genetic diversity, as well as better adaptation to harsh environments (Stupar, 2010; Qiu et al., 2013; Zhang et al., 2017). Thus, G. soja holds great potential to improve its agriculturally important domesticated relative (G. max), beyond what is currently known (Kofsky et al., 2018). For example, comparative genomics, transcriptomics, and bioinformatics application have revealed the role of domestication in the seed weight of soybean (Lu et al., 2016; Zhao et al., 2016; Yu et al., 2017). Quantitative trait loci (QTL) mapping using domesticated and wild progenitors have been reported to be the useful means for identifying genomic regions involved in morphological and physiological changes that distinguish crops from their wild relatives (Paterson, 2010). The wild soybean has been recently reported to be an important source of QTLs contributing to the increase in seed size in soybean. For example, Lu et al. (2017) identified a phosphatase 2C protein (PP2C-1) allele from wild soybean underlying a QTL that enhances the 100-seed weight in soybean. Although many genetic studies have been carried out in the past decades to identify QTLs for seed weight/size using different types of DNA markers through QTL mapping analyses. Currently, there are a total of 325 QTLs identified for seed-weight/size available on SoyBase[1], and most of them are minor and not validated (Liu et al., 2018). In addition, knowledge for the molecular mechanism of soybean seed weight is very limited compared to other crops like rice (Wang et al., 2015; Liu et al., 2018). Till date, only two genes related to seed weight/size have been isolated from soybean viz., ln (Jeong et al., 2012) and PP2C-1 (Lu et al., 2017). Hence, it is prerequisite to identify stable QTLs for seed-weight as well as mine candidate genes underlying them to facilitate understanding of the molecular mechanisms regulating seed-weight in soybean (Kato et al., 2014). Furthermore, only few mapping populations derived from wild and domesticated soybean crosses have been used for QTLs detection of seed-weight in soybean[1]. Also, most of the previous studies have used low-throughput markers (such as SSR) for QTL identification of seed-weight in soybean (Panthee et al., 2005; Gai et al., 2007; Kato et al., 2014; Kulkarni et al., 2016; Wu et al., 2018). These marker systems have low resolution and larger confidence interval compared with high-density SNP markers (Hyten et al., 2010; Xu et al., 2013; Lu et al., 2017) that were revealed to be useful for high-throughput QTL mapping. Also, most of the published reports did not mine the candidate genes for seed-weight (Zhang et al., 2004; Kato et al., 2014; Kulkarni et al., 2016; Wu et al., 2018). Therefore, by keeping the above into view the present study used high-density inter-specific genetic map of the recombinant inbred line (RIL) population (NJIR4P) derived from a cross between Nannong493-1 (G. max) and PI 342618B (G. soja) that was evaluated in multiple environments to map stable QTLs as well as mine possible candidate genes underlying 100-seed weight in soybean. Using interspecific RIL population with wide range of variation in 100-seed weight has greatly assisted in the detection of more number of major and minor QTLs regulating 100-seed weight in soybean. The use of this RIL population could enhance our understanding of molecular mechanism, evolution, and genetic regulation of seed weight in soybean. The results of the present study will be helpful in marker-assisted breeding (MAB) for developing soybean varieties with improved seed-weight.

Materials and Methods

Plant Materials and Experimental Conditions

An interspecific RIL population consisting of 161 lines were derived through single seed descent (SSD) method by crossing a soybean cultivar Nannong493-1 (G. max) with wild soybean line PI 342618B (G. soja), and this RIL population were named as NJIR4P. The Nannong493-1 parent has a higher 100-seed weight with an average value of 18.02 ± 2.60 g, whereas PI 342618B is an annual wild soybean with low 100-seed weight (1.4 g) (Xie et al., 2014). The RILs (F6:9–F6:11) along with their parents were planted in four different environments viz., Fengyang Experimental Station, Chuzhou, Anhui Province (Latitude 32°87′ N; Longitude 117°56′ E), in 2012 (FY2012), and Jiangpu Experimental Station, Nanjing, Jiangsu Province (JP) (Latitude 33°03′ N; Longitude 118°63′ E) in 2012, 2013, and 2014 (JP2012, JP2013, and JP2014). Soybean lines were planted in a single line plot of 1 m in length and 0.5 m in width in a randomized complete block design (RCBD) with three replications. Standard cultural and agronomic practices were followed in each environment (Lihua, 1982; Liu et al., 2008).

Phenotypic Analysis of 100-Seed Weight

Each row of the RILs and their parents were harvested, threshed, and dried to a suitable moisture. Four-hundred healthy dried seeds from each row were selected randomly for measurement of 100-seed weight. The 100-seed weight, i.e., weight of 100 seeds at 13% moisture content was measured by electronic balance and were repeated four times. Seed-weight was calculated for all the three replication and mean value was used for analysis. Analysis of variance (ANOVA) in each environment and combined environments (CEs) were conducted using the general linear model (GLM) and mixed procedure, respectively, in SAS (SAS Institute, 2010. SAS/STAT software version 9.2; SAS Institute Inc., Cary, NC, United States). The broad-sense heritability (H2) was calculated for both individual environments plus CE following the procedure of Hanson et al. (1956). Also genotypic coefficients of variation (GCV) was calculated by using the following formula proposed by Singh (1985): , where is the genotypic standard deviation in each environment while μ is the mean value of 100-seed weight.

QTL Mapping Analysis

In the present study, an inter-specific high-density bin map earlier developed by Wang et al. (2016) by using RAD-sequencing approach for this population was used for QTL mapping. This bin map consisted of 4,354 bin markers that were derived from 80,995 single-nucleotide polymorphisms (SNPs) distributed on all 20 soybean linkage groups/chromosomes, and has a total length of 2,136.717 cM. The average number of markers per linkage group and length of linkage group was 218 and 106.84 cM, respectively, with mean distance between bins as 0.49 cM (Supplementary Table 1). Among the NJRI4P-RIL, 46.07% were inherited their genetic background from Nannong493-1, 50.06% were from PI 342618B, and the remaining 3.87% were heterozygous genotypes. The segregation ratios of each bin marker were calculated, and only few significant segregation distortion regions were identified. In NJRI4P, out of 4,354 bin markers only 1 bin showed extreme segregation distortion at P < 0.0001 on chromosome 2, and 2 bins exhibited segregation distortion at P < 0.0005 on chromosomes 7 and 19, whereas the remaining bin markers did not show significant segregation distortion (Wang et al., 2016). The QTL analysis was performed via WinQTLCart 2.5 software (Wang et al., 2007). For the WinQTLCart 2.5 software, the model of composite interval mapping (CIM) was used with a 10 cM window at a walking speed of 1 cM. The LOD threshold was calculated using 1,000 permutations for an experimental-wise error rate of P = 0.05 to determine whether the QTL was significantly associated with (Churchill and Doerge, 1994). The CIM model was also used to identify the main QTLs in the CE with the same parameters as used in the individual environment. Mapping for CE was done using the Best Linear Unbiased Prediction (BLUP) values for each independent environment and across all environments by using the lme4 package in R (Bates et al., 2014). QTLs detected in different environments at the same, adjacent, or overlapping marker intervals were considered the same QTL (Palomeque et al., 2009, 2010; Qi et al., 2017). QTL naming was done following the nomenclature of McCouch et al. (1997), thus starting with “q,” followed by an abbreviation of the trait name (SW, seed weight) and the name of the chromosome, followed by the number of QTL detected on the same chromosome. The QTL genetic and physical positions based on the flanking markers with known positions were used to retrieve a number of earlier reported QTLs available on SoyBase[2] (Williams 82.a1.v.1.1). QTLs that did not overlap with reported QTLs in both genetic and physical positions were considered as new in this study. The QTLs identified in the individual environments were presented in Venn diagram using an online tool[3] (Oliveros, 2007).

Candidate Gene Prediction Analysis

In this study, QTL was considered as stable when detected in at least two environments. Model genes within the genomic physical position of the stable QTLs on the soybean genome (Williams 82.a1.v.1.1) available at SoyBase[3] were downloaded. Gene ontology (GO) enrichment analysis was conducted for all the genes within each QTL region using online GO tool[4]. Gene classification was then carried out using Web Gene Ontology (WeGO) Annotation Plotting tool, Version 2.0[5] (Ye et al., 2018). The predicted candidate genes were further subjected to Protein ANalysis THrough Evolutionary Relationships (PANTHER) Classification System to classify proteins (and their genes) in order to facilitate high-throughput analysis according to family and subfamily, molecular function, biological process, and pathway[6]. The selected candidate genes structure analysis was carried out using http://gsds.cbi.pku.edu.cn/ (Hu et al., 2014).

Results

Phenotypic Variation of 100-Seed Weight

Mean, range, standard deviation, skewness, kurtosis, H2, and GCV among the RILs and their parents across the four different environments (FY2012, JP2012, JP2013, and JP2014), and CE are presented in Table 1. The average 100-seed weight of the Nannong493-1, PI 483460B, and RILs were 16.49–19.09, 1.23–1.40, and 1.37–11.84 g, respectively, across all the studied environments (Table 1). However, there was no clear transgressive segregation among the RIL (Figure 1). Furthermore, ANOVA were performed to evaluate the effects of genotypes/lines (G), environment (E), and their interactions (GE) on 100-seed weight. The RILs showed highly significant differences (P < 0.01) for 100-seed weight in the individual environments. ANOVA for CE showed that G, E, and GE contributed significant variation to seed weight among the RILs of NJIR4P population (Supplementary Table 2). Hence, significant influence of E and GE on 100-seed weight of soybean suggests that seed-weight is a complex quantitative trait governed by polygenes. Moreover, high H2 values in individual as well as CEs varying from 88.27 to 97.23% coupled with high GCV (>20%) suggest that considerable proportion of phenotypic variation of 100-seed weight is due to genotype.
TABLE 1

Descriptive statistics, broad-sense heritability (H2), and genotypic coefficient of variation (GCV) of 100-seed weight in NJIR4P RIL population and two parental lines viz., Nannong493-1 and PI 483460B.

Env.Parents ± SDa (g)RIL population (g)SkewnessKurtosisH2 (%)GCV (%)


Nannong493-1PI483460BMeanMinMaxSD
FY201219.09 ± 0.781.27 ± 0.065.642.1210.231.350.520.2497.2327.54
JP201216.49 ± 0.781.23 ± 0.054.841.3710.181.230.731.5788.2728.40
JP201316.88 ± 1.351.40 + 0.174.852.3111.781.271.123.2993.7930.85
JP201417.66 ± 0.981.41 ± 0.055.642.5311.841.440.540.9289.1327.37
CEb17.37 ± 2.431.33 ± 0.095.251.3711.841.390.711.1097.1665.24
FIGURE 1

Frequency distribution of 100-seed weight QTLs among the NJIR4P-RIL in the various environments (A, B, C, and D for FY2012, JP2012, JP2013, and JP2014, respectively).

Descriptive statistics, broad-sense heritability (H2), and genotypic coefficient of variation (GCV) of 100-seed weight in NJIR4P RIL population and two parental lines viz., Nannong493-1 and PI 483460B. Frequency distribution of 100-seed weight QTLs among the NJIR4P-RIL in the various environments (A, B, C, and D for FY2012, JP2012, JP2013, and JP2014, respectively).

QTL Mapping of 100-Seed Weight Using CIM

A total of 19 QTLs associated with seed-weight were identified in all the individual environments plus CE distributed on 12 of the 20 chromosomes of soybean, and explaining 4.22–13.20% of the phenotypic variation (R2) (Figure 2 and Table 2). Out of these 19 QTLs, 7 were identified for the first time viz., qSW-2-1, qSW-2-2, qSW-2-3, qSW-6-1, qSW-19-1, qSW-19-2, and qSW-19-3, and remaining 12 QTLs have been previously reported in reference to soybean genome GmComposite2003 (SoyBase) (Table 2). The highest number of four QTLs are present on Chr17 followed by three on each Chr2 and Chr19, and the rest 10 chromosomes contain one or two QTLs each. Of the 19 QTLs identified only two are major (R2 > 10%) viz., qSW-17-1 and qSW-17-4 both are located on Chr17, and the remaining 17 QTLs identified are minor (R2 < 10%). Notably, the most prominent QTL with the highest LOD score (7.28) was identified in a 23.01 cM region on Chr17, named as qSW-17-1, explaining 13.20% of phenotypic variation. Five QTLs viz., qSW-2-1, qSW-2-2, qSW-4-2, qSW-14-1, and qSW-17-4 were identified in more than one individual environments (Figure 3), and three more QTLs viz., qSW-4-1, qSW-17-1, and qSW-17-3 were detected in one individual environment plus CE. Interestingly both major QTLs located on Chr17 (qSW-17-1 and qSW-17-4) were detected in more than one environments, suggesting the stability and consistency of these QTLs (Table 2). The remaining 11 QTLs were environment-specific QTLs identified in only one specific environment (Table 2). Out of these eight stable QTLs, two were novel QTLs identified for the first time (qSW-2-1 and qSW-2-2). All the QTLs identified for 100-seed weight in the RILs population displayed positive additive effects with positive alleles from higher seed-weight parent (Nannong493-1). Moreover, all the novel QTLs identified were minor (R2 < 10%), thus, none of the novel QTLs detected in this study was major. However, most of the previously detected QTLs were identified in a narrowed physical genomic region (Table 2). The highest number of QTLs for 100-seed weight were identified on Chr17, Chr2, and Chr19 suggest the important role of these chromosomes in governing the inheritance of seed-weight in soybean.
FIGURE 2

100-seed weight QTLs identified in NJIR4P-RIL (complete map is not presented here, it represents only the portion where QTLs have been identified). Right side of chromosomes indicates the interval distance between markers using cM (centiMogan) as the unit; the left side of chromosomes indicates Bin-DNA markers.

TABLE 2

Main QTLs identified in an interspecific RIL population (NJIR4P) in four different environments (FY2012, JP2012, JP2013, and JP2014) and combined environment (CE).

QTL nameaChrbPos (cM)cLODdR2(%)eAfPhy Pos (bp)gFlanking markersCI (cM)hEnv.iReferencesj
qSW-1-1183.213.646.9221.7651,204,893–51,446,046bin184-bin18882.6–84.1JP2014Seed weight 15-2 (Hyten et al., 2004)
qSW-2-1250.113.897.7522.419,751,130–10,665,129bin319-bin32547–51.5JP2012New
50.113.857.7123.1647–52.4JP2013
qSW-2-257.214.158.0623.9711,419,866–13,045,477bin330-bin34354.1–59.7JP2012New
57.213.907.1424.0356.2–59.2FY2012
qSW-2-362.913.165.8625.0313,613,400–14,313,380bin345-bin35360.3–65FY2012New
qSW-4-1440.512.625.0025.747,285,209–7,837,062bin772-bin77940.3–40.9JP2012Li et al., 2010; Zhang et al., 2015
40.315.018.8925.3437.7–40.6CE
qSW-4-250.713.506.5926.428,778,683–11,107,607bin788-bin80450.2–51.8JP2012
50.712.564.2227.0150.2–52.5FY2012
qSW-6-1618.012.704.6027.023,772,133–4,753,088bin1153-bin115916–19.1JP2012New
qSW-9-1917.912.534.7228.663,096,080–4,376,263bin1910-bin192016.7–21.1JP2012Seed weight 35-6 (Han et al., 2012)
qSW-11-11181.414.728.5730.5135,912,819–36,898,349bin2498-bin250380.6–82.7FY2012Seed weight 37-9 (Teng et al., 2009; Sun et al., 2012)
qSW-13-113100.812.825.3530.5332,932,681–33,843,851bin2893-bin290098.3–102.4JP2013Seed weight 45-6 (Yan et al., 2014)
qSW-14-11447.614.147.8731.049,463,148–13,115,201bin3080-bin308746.6–48.8JP20120Seed weight 36-14 (Han et al., 2012)
47.913.717.0831.1146.3–48.7JP2013
qSW-15-1154.512.814.9125.741,499,442–2,546,339bin3190-bin31993.7–6.2JP2013
qSW-17-11723.215.6510.8732.335,762,020–6,841,677bin3613-bin362122.8–25.6JP2012
23.017.2813.2032.1221.8–24.8CE
qSW-17-229.513.757.5132.867,615,994–7,984,614bin3629-bin363129.2–30.3JP2014
qSW-17-333.513.056.0934.158,235,618–8,924,784bin3634-bin363932–34.5JP2012
33.014.238.0033.9232.2-33.1CE
qSW-17-439.913.707.0936.639,420,885–10,095,969bin3645-bin365137.7–42.3JP2013
40.715.9611.5837.0339.7–42.2JP2014
qSW-19-1191.912.864.8737.911–906,420bin3996-bin39990–4JP2012New
qSW-19-241.414.608.7838.1137,530,933–38,131,800bin4078-bin408341.1–43JP2014New
qSW-19-346.913.065.9843.0938,456,684–39,675,684bin4088-bin409546–50.3JP2014New
FIGURE 3

QTLs identified in different individual environments (FY2012, JP2012, JP2013, and JP2014).

100-seed weight QTLs identified in NJIR4P-RIL (complete map is not presented here, it represents only the portion where QTLs have been identified). Right side of chromosomes indicates the interval distance between markers using cM (centiMogan) as the unit; the left side of chromosomes indicates Bin-DNA markers. Main QTLs identified in an interspecific RIL population (NJIR4P) in four different environments (FY2012, JP2012, JP2013, and JP2014) and combined environment (CE). QTLs identified in different individual environments (FY2012, JP2012, JP2013, and JP2014).

Gene Ontology and Candidate Gene Prediction Within Stable QTLs

Based on the number of individual environments QTL were detected, we selected five stable QTLs identified in more than one individual environments viz., qSW-2-1, qSW-2-2, qSW-4-2, qSW-14-1, and qSW-17-4 for GO and candidate gene prediction analysis. Within the physical genomic interval of qSW-2-1, qSW-2-2, qSW-4-2, qSW-14-1, and qSW-17-4, the 91, 100, 92, 137, and 70 model genes were present, respectively, and these genes as well as their gene annotation were downloaded from Soybase[7]. After GO enrichment analysis, we employed WeGO web-based tool to visualize the biological process, molecular function, and cellular component main categories (Figure 4). In all the five stable QTLs viz., qSW-2-1, qSW-2-2, qSW-4-2, qSW-14-1, and qSW-17-4, higher percentage of genes were associated with the terms cell part, cell, organelle, catalytic activity, binding, metabolic process, and cellular process (Figure 4). This suggests an important role of these terms in the seed development of soybean.
FIGURE 4

WeGO analysis of the genes located within the stable QTL regions viz., qSW-2-1, qSW-2-2, qSW-4-2, qSW-14-1, and qSW-17-4.

WeGO analysis of the genes located within the stable QTL regions viz., qSW-2-1, qSW-2-2, qSW-4-2, qSW-14-1, and qSW-17-4. However, to identify the possible candidate genes underlying the above five stable QTLs responsible for 100-seed weight in soybean, we used PANTHER analysis, gene annotation information, and literature search. The PANTHER analysis is a comprehensive system that combines gene function, ontology, pathways, and statistical analysis tools, and enable biologists to analyze large-scale, genome-wide data from sequencing, proteomics, or gene expression experiments (Huaiyu et al., 2013). Based on the PANTHER analysis, gene annotation, as well as available literature, 29 genes out of total 490 model genes within the physical regions of the five stable QTLs were considered as possible candidate genes regulating seed-weight in soybean. Out of these 29 genes, 5 belong to ubiquitin-protein ligase (PC00234) class, 4 to carbohydrate transporter (PC00067), 3 are transporters (PC00227), 2 are involved in vesicle coat protein (PC00235), 1 in the SNARE protein (PC00034), and the remaining 18 belong to one or two other protein class (Table 3). Furthermore, Glyma02g13350, Glyma14g12220, and Glyma17g13000 genes had no protein class according to PANTHER analysis, and therefore were further analyzed using the gene expression data (RNA-seq) from phytozome database[8], and their expression data revealed that these genes were highly expressed in the seed, and thus were also included as potential candidate genes. For instance, Glyma14g12220 has the domain of PP2C which is homolog to the PP2C that was demonstrated to enhance 100-seed weight by Lu et al. (2017).
TABLE 3

Thirty-one possible candidate genes predicated within five stable QTL regions identified in this study based on PANTHER analysis, gene annotation, and available literature.

QTLMapped IDsPANTHER family/subfamilyPANTHER protein class
qSW-2-1Glyma02g11570E3 UBIQUITIN-PROTEIN LIGASE MARCH6 (PTHR13145:SF0)Ubiquitin-protein ligase (PC00234)
Glyma02g11850CULLIN-3A-RELATED (PTHR11932:SF95)Ubiquitin-protein ligase (PC00234)
Glyma02g11960RING/FYVE/PHD ZINC FINGER SUPERFAMILY PROTEIN (PTHR23012:SF165)Ubiquitin-protein ligase (PC00234)
Glyma02g12030BETA-1,3-GALACTOSYLTRANSFERASE GALT1 (PTHR11214:SF129)Glycosyltransferase (PC00111)
Glyma02g12351EMP24/GP25L/P24 FAMILY/GOLD FAMILY PROTEIN-RELATED (PTHR22811:SF47)Transfer/carrier protein (PC00219); vesicle coat protein (PC00235)
Glyma02g12390VESICLE TRANSPORT V-SNARE 12 (PTHR21230:SF57)SNARE protein (PC00034)
qSW-2-2Glyma02g13210SUBFAMILY NOT NAMED (PTHR24298:SF315)Oxygenase (PC00177)
Glyma02g13350PROTEIN CRABS CLAW (PTHR31675:SF1)Plant-specific transcription factor YABBY family protein
Glyma02g13401AGAMOUS-LIKE MADS-BOX PROTEIN AGL3 (PTHR11945:SF352)MADS box transcription factor (PC00250)
Glyma02g13420AGAMOUS-LIKE 79-RELATED (PTHR11945:SF244)MADS box transcription factor (PC00250)
Glyma02g13730SUBFAMILY NOT NAMED (PTHR23500:SF144)Carbohydrate transporter (PC00067)
qSW-4-2Glyma04g10590SUBFAMILY NOT NAMED (PTHR11206:SF194)Transporter (PC00227)
Glyma04g10600SUBFAMILY NOT NAMED (PTHR22951:SF24)Vesicle coat protein (PC00235)
Glyma04g10610E3 UBIQUITIN-PROTEIN LIGASE ATL15-RELATED (PTHR14155:SF296)Ubiquitin-protein ligase (PC00234)
Glyma04g11060PROTEIN DETOXIFICATION 10-RELATED (PTHR11206:SF87)Transporter (PC00227)
Glyma04g11080MITOCHONDRIAL ADENINE NUCLEOTIDE TRANSPORTER BTL2-RELATED (PTHR24089:SF292)Amino acid transporter (PC00046); calmodulin (PC00061); mitochondrial carrier protein (PC00158); transfer/carrier protein (PC00219)
Glyma04g11120SUGAR TRANSPORT PROTEIN 5 (PTHR23500:SF44)Carbohydrate transporter (PC00067)
Glyma04g11130SUGAR TRANSPORT PROTEIN 5 (PTHR23500:SF44)Carbohydrate transporter (PC00067)
Glyma04g11140SUGAR TRANSPORT PROTEIN 5 (PTHR23500:SF44)Carbohydrate transporter (PC00067)
Glyma04g12120MITOCHONDRIAL IMPORT INNER MEMBRANE TRANSLOCASE SUBUNIT TIM44 (PTHR10721:SF1)Mitochondrial carrier protein (PC00158); transfer/carrier protein (PC00219)
qSW-14-1Glyma14g11780TRANSMEMBRANE 9 SUPERFAMILY MEMBER 10-RELATED (PTHR10766:SF46)Transporter (PC00227)
Glyma14g12110SUBFAMILY NOT NAMED (PTHR23051:SF2)Transfer/carrier protein (PC00219)
Glyma14g12120MONOGALACTOSYLDIACYLGLYCEROL SYNTHASE 1, CHLOROPLASTIC (PTHR43025:SF3)Acetyltransferase (PC00038); glycosyltransferase (PC00111); transfer/carrier protein (PC00219)
Glyma14g12405SEC31 ORTHOLOG, ISOFORM D (PTHR13923:SF11)Vesicle coat protein (PC00235)
Glyma14g12220Phosphatase 2C (PP2C) protein
Glyma14g13011RING/FYVE/PHD ZINC FINGER SUPERFAMILY PROTEIN (PTHR23012:SF165)Ubiquitin-protein ligase (PC00234)
qSW-17-4Glyma17g12910ABC TRANSPORTER G FAMILY MEMBER 32 (PTHR19241:SF280)ATP-binding cassette (ABC) transporter (PC00003)
Glyma17g13000HISTONE DEACETYLASE 15 (PTHR45634:SF12)
Glyma17g13050DNA MISMATCH REPAIR PROTEIN MSH2 (PTHR11361:SF35)DNA-binding protein (PC00009)
Thirty-one possible candidate genes predicated within five stable QTL regions identified in this study based on PANTHER analysis, gene annotation, and available literature.

Discussion

Phenotypic Analysis of Seed-Weight

Seed-weight is an important economical trait controlling the yield in soybean. Therefore, developing soybean cultivars with improved seed-weight was the prime objective of soybean breeders. However, to develop the soybean cultivars with improved seed-weight, it is necessary to understand the genetic mechanisms as well as identify genetic elements associated with 100-seed weight. Seed-weight is a polygenic quantitative trait governed by multiple genes, and is highly environmentally sensitive trait. Although over the past decades many QTLs related to soybean seed-weight/size have been reported, and there are ∼325 QTLs documented for seed weight/size in the USDA Soybean Genome Database (SoyBase[9]). However, most of these QTLs were not stable as well as confirmed due to small sized mapping population and low-density genetic map, and hence have not been used for breeding improved seed-weight in soybean. Therefore, the aim of the present study was to utilize interspecific high-density linkage map of NJRI4P RIL population evaluated in four different environments to identify the stable QTLs as well as mine possible candidate genes for 100-seed weight in soybean. In the present study, ANOVA revealed that 100-seed weight was significantly affected by G, E, and G × E, similar as reported earlier by Fasoula et al. (2004). The RIL did not show clear transgressive segregation in any of environment, that might be due to unwanted linkages between beneficial and undesirable alleles contributed by exotic germplasm (Concibido et al., 2003; Wang et al., 2014). Furthermore, the cultivated and wild parents of RIL population showed clear and large difference in seed-weight/size confirming earlier reports that 100-seed is a domestication-related trait (Zhou L. et al., 2015; Zhou Z. et al., 2015). This wide difference between two parents of inter-specific RIL population for 100-seed weight has allowed detection of more number of QTLs including some novel QTLs. Maximum 100-seed weight of the RILs in each environment was more than three times higher than that of wild parent (PI 483460B), and also the RILs with minimum seed-weight were higher than PI 483460B indicating the usefulness of wild soybean in breeding program for specific seed size (Concibido et al., 2003; Hyten et al., 2006; Kim M.Y. et al., 2010; Lam et al., 2010; Yu et al., 2017; Kofsky et al., 2018). The higher H2-value observed for seed-weight in both the individual and CEs suggests that large proportion of trait variation is under genetic control, and these findings are similar as reported earlier by Kulkarni et al. (2016).

Genetic Control of Seed-Weight

As discussed above, many QTLs have been reported for seed-weight in soybean[10]. But majority of these previous studies used low-density genetic maps based on SSR or other low-throughput markers (Specht et al., 2001; Hyten et al., 2004; Panthee et al., 2005; Liu et al., 2011; Yao et al., 2015), which has low resolution with large confidence interval of QTLs not suitable for candidate gene detection (Hyten et al., 2010; Xu et al., 2013). The quality of genetic maps has great influence on the accuracy of QTL detection (Gutierrez-Gonzalez et al., 2011). In this context, high-density genetic map could identify more recombination events in a population, and will increase accuracy of QTL mapping (Xie et al., 2010). In the present study, we used high-density inter-specific bin map of NJIR4P RIL population consisting of 4,354 bin markers distributed to all 20 chromosomes of soybean with an average number of markers and distance per chromosome are 218 and 106.84 cM, respectively. The average distance between two markers was 0.49 cM (Wang et al., 2016). In addition, high-density genetic map assists in identifying tightly linked markers associated with QTLs, and provided a good foundation for analyzing quantitative traits. Moreover, the use of interspecific population would also enhance identification of genomic region(s) which was/were altered during domestication (Liu et al., 2018). The QTLs associated with seed-weight in soybean has been mapped on all soybean linkage groups/chromosomes. In the present study, we identified a total of 19 QTLs associated with 100-seed weight using inter-specific genetic map of NJIR4P population, and these QTLs contributed significantly to the seed-weight. By comparing our QTL results with SoyBase database[11], 12 QTLs have been previously reported in the same physical genomic region, and only 7 were novel identified for the first time (Table 2). The seven novel QTLs detected indicating the distinct genetic architecture of NJIR4P population, and suggest the need to use more germplasm for revealing the complex genetic basis of 100-seed weight in soybean. The physical interval of qSW-2-1, qSW-2-2, qSW-2-3, qSW-6-1, qSW-19-1, qSW-19-2, and qSW-19-3 did not overlap with any of the previously reported seed-weight QTLs, and hence were considered as novel QTLs. The qSW-1-1 was identified in the genetic interval (82.6–84.1 cM) that overlap with the seed-weight QTLs viz., Seed weight 15-2 and Seed weight 18.1-2 identified in the same genetic and physical position as reported earlier (Hyten et al., 2004; Panthee et al., 2005). Similarly, two QTLs identified on Chr4 viz., qSW-4-1 and qSW-4-2 overlapped with seed weight 47-1 corresponding to physical position of 96,434–51,252,852 bp (Li et al., 2010) and seed weight per plant 6-2 corresponding to physical position of 486,057–526,777 bp (Yao et al., 2015), respectively. The qSW-9-1 were detected in the same genomic physical interval as previously reported QTL, Seed weight 35-6 QTL (Han et al., 2012). Likewise, SW-11-1 was located in the genomic position of Seed weight 10-3 (Specht et al., 2001), Seed weight 32-1 (Li et al., 2008), and Seed weight 36-11 (Han et al., 2012). The qSW-14-1 could be the same QTL as Seed weight 36-14 (Han et al., 2012). Lu et al. (2017) identified QTL on Chr15 at the same physical interval (1,901,425–2,855,666 bp) as qSW-15-1 reported in the present study. The major and stable QTL viz., qSW-17-1 overlapped with earlier reported QTLs, Seed weight 21-1 (Gai et al., 2007), Seed weight 22-3 (Zhang et al., 2004), and Seed weight 47-2 (Li et al., 2010). Moreover, qSW-17-2 and qSW-17-3 overlapped with seed-weight QTLs previously reported by Li et al. (2010) and Wang et al. (2015), respectively. Another major and stable QTL identified on Chr17 viz., qSW-17-4 has been also reported by the number of earlier studies (Kim H.-K. et al., 2010; Kato et al., 2014; Wang et al., 2015; Zhou Z. et al., 2015; Liu et al., 2018). The seven novel QTLs identified for 100-seed weight together explained ∼46% of the phenotypic variation, which suggested potential importance of these loci for seed-weight. The QTLs identified in this study had narrow genetic and physical regions for the instance, qSW-17-4 which overlapped with Seed weight 47-2 (Li et al., 2010). In our study, qSW-17-4 was detected at genetic and physical positions of 37.7–42.3 cM and 9,420,885–10,095,969 bp, respectively, compared to Seed weight 47-2 (24.52–124.30 cM and 5,788,551–40,525,673 bp). In plant breeding, stability of QTL is essential for their use in MAB. Besides, two novel stable QTLs (qSW-2-1 and qSW-2-2) identified in the present study, the 12 QTLs for 100-seed weight have been previously co-localized in the same physical interval by earlier studies (see references in Table 2). Of the 12 QTLs previously reported, two are major QTLs with R2-value > 10% both located on Chr17 viz., qSW-17-1 and qSW-17-4 (see references in Table 2). Hence, these QTLs might also be considered as stable QTLs, and major stable QTLs can be used for further fine mapping and map-based cloning to unravel the mechanisms of seed-weight in soybean, as well as might be good for MAB. All the beneficial/positive alleles in the NJIR4P RIL population were derived from the cultivated soybean (Nannong 493-1), indicating that seed-weight was altered during domestication (Zhou L. et al., 2015; Zhou Z. et al., 2015; Lu et al., 2016). Similar to our findings, Lu et al. (2017) also reported that all the beneficial alleles for 100-seed weight were inherited from the cultivated soybean except one beneficial QTL allele viz., PP2C-1 that was derived from wild soybean parent. Although it has been revealed that wild soybean is a potential source for improving cultivated soybean in terms of yield-related traits, seed quality, and biotic and abiotic stress tolerance (Tuyen et al., 2010; Kim et al., 2011). In accordance with the earlier studies (Wang et al., 2016; Xin et al., 2016; Liu et al., 2018), our study also revealed that alleles derived from wild soybean contribute to a reduction in seed weight in all 19 seed-weight QTLs. It is not always the purpose of soybean breeders to increase seed weight/size, but also sometimes breeding program requires a suitable/optimized combination of yield-related parameters such as seed size, the number of seeds per pod, and the number of pods per plant. Hence, QTLs detected in our study would be valuable for controlling seed size via genomic breeding by design and positional cloning of the relevant genes. Furthermore, most of the QTLs detected in this study overlapped earlier reported QTLs indicating the accuracy of our mapping results. Moreover, those confirmed in this study with narrow regions could be integrated into breeding program via marker-assisted selection (MAS).

Candidate Gene Analysis for Seed-Weight

It is of great interest for both theoretical study and practical breeding program to identify the actual candidate gene underlying the QTL region. Most of the earlier QTL mapping on seed-weight did not mine for candidate genes (Zhang et al., 2004; Kato et al., 2014; Wu et al., 2018), and till date only two seed weight/size-related genes have been isolated from soybean viz., ln gene has a large effect on the number of seeds per pod and seed size (Jeong et al., 2012), and recently, the PP2C-1 (protein phosphatase type-2C) allele from wild soybean accession ZYD7 was found to contribute to the increase in seed size (Lu et al., 2017). Hence, based on the available information in current literature, gene annotation as well as bioinformatics tools, the present study identified the possible candidate genes regulating the 100-seed weight in soybean that underlie the stable QTLs. A total of 490 model genes were mined from the physical regions of the five stable QTLs viz., qSW-2-1, qSW-2-2, qSW-4-2, qSW-14-1, and qSW-17-4, and out of these 29 were considered as possible candidate genes based on the PANTHER analysis, gene function, and available literature (Huaiyu et al., 2013). Based on the WeGo analysis most of the genes underlying above five stable QTLs belong to the terms cell component, catalytic activity, binding, transporting, metabolic and cellular process, and these elements were reported to be vital in seed development (Fan et al., 2006; Mao et al., 2010; Li and Li, 2014). For example, Glyma02g13210 gene underlying QTL qSW-2-2 belongs to oxygenase (PC00177) protein class, that has been demonstrated to regulate seed size in soybean (Zhao et al., 2016). Similarly, protein family E3 ubiquitin-protein ligase (PC00234) are involved in the ubiquitin-proteasome pathway, and this protein family include members from various crop species such as DA1, DAR1, DA2, and EOD1/BB (Arabidopsis), GW2 (rice), TaGW2 (Wheat), ZmGW2 (maize), and UBP15/SOD2 (Arabidopsis), and all these genes have been reported to have significant effect on seed development (Li and Li, 2014, 2016; Ge et al., 2016). Thus, Glyma02g11570, Glyma02g11850, Glyma02g11960, Glyma04g10610, and Glyma14g13011 belonging to E3 ubiquitin-protein ligase (PC00234) were considered as possible candidate genes in the present study. Furthermore, Xian-Jun et al. (2007) reported a gene underlying QTL for rice grain width and weight (GW2) that encodes a previously unknown RING-type E3 ubiquitin ligase has been demonstrated by Xian-Jun et al. (2007). They demonstrated that loss of GW2 function increased cell numbers, resulting larger spikelet hull, and accelerated the grain milk filling rate, resulting in enhanced grain width, grain weight, and yield. It has been revealed that regulation of seed development is controlled by source (leaf) and sink (seed) relationship in plants (Schnyder, 1993), which is influenced by assimilate translocation/transportation. Therefore, genes viz., Glyma02g13730, Glyma04g10590, Glyma04g11060, Glyma04g11120, Glyma04g11130, Glyma04g11140, and Glyma14g11780 belonging to carbohydrate transporter (PC00067 or PC00227) gene family were might be possible candidate genes for seed-weight. Legume seed development is closely related to metabolism and nutrient (sucrose) transport (Borisjuk et al., 2003). Candidate genes Glyma02g12351, Glyma04g10590, Glyma04g10600, and Glyma14g12405 belong to vesicle coat protein (PC00235). This protein family have been reported to be involved in protein–protein interaction and transport (Harley and Beevers, 1989; Anantharaman and Aravind, 2002). Two candidate genes Glyma02g13401 and Glyma02g13420 were members of K and MAD box protein family, and this protein family has been reported to regulate flower development in plants (Bowman et al., 1991; Ditta et al., 2004). The flower as an organ acts as either source or sink and determine the seed number, which indirectly affect the seed size and shape (Stanton, 1984; Jia et al., 2016). The Glyma04g11080 belongs to several protein classes such as amino acid transporter (PC00046); calmodulin (PC00061); mitochondrial carrier protein (PC00158); transfer/carrier protein (PC00219) which could possibly be involved in seed weight regulation. For example, in rice, Asano et al. (2002) has shown that a calmodulin-like domain protein kinase is required for storage product accumulation during seed development. Moreover, Glyma02g12030, Glyma04g12120 and Glyma14g12120 belong to one or more protein classes: such as acyltransferase (PC00038), glycosyltransferase (PC00111), and transfer/carrier protein (PC00219), and these protein classes were demonstrated to play role in seed development (Rehman et al., 2016). The Glyma17g12910 gene underlying a major stable QTL, qSW-17-4, belongs to ATP-binding cassette (ABC) transporter (PC00003) which could possibly be involved in seed development (David et al., 2010). In addition, Glyma17g13000 belongs to histone deacetylase 15 (PTHR45634:SF12) that might be involved in regulating seed weight (Shahbazian and Grunstein, 2007; Peserico and Simone, 2011). As Yang et al. (2016) demonstrated maize histone deacetylase HDA101 function and regulatory mechanism during seed development. Also, Glyma17g13050 and Glyma17g13210 which code for DNA-binding protein (PC00009) and leucine-rich repeat-containing protein, respectively, play significant role in seed development (Li and Li, 2016; Li et al., 2019). Among the predicted candidate genes, the minimum number of exons and introns was two with the maximum gene sequence of 13,670 bps for Glyma17g13050 (Supplementary Figure 1). However, few of the 29 possible candidate genes predicted in this study for 100-seed weight have been included in our on-going projects for their functional validation. Lastly, the major and stable QTLs identified in the present study will be the main focus of soybean breeders for fine mapping and MAB of soybean cultivars with improved 100-seed weight.

Conclusion

In conclusion, the present study used high-density bin map of an interspecific RIL population (NJIR4P) evaluated in multiple environments to detect QTLs as well as mine possible candidate genes controlling 100-seed weight. A total of 19 QTLs were found associated with 100-seed weight, and out of which 7 were novel (reported for the first time). In addition, out of 19 QTLs, 8 were considered as stable QTLs identified in either more one individual environments or one individual environment plus CE, and two of them were major viz., qSW-17-1 and qSW-17-4 (R2 > 10%). Moreover, most of the previously reported QTLs validated in the present study had narrow physical genomic interval. All the beneficial/positive alleles of 19 QTLs were derived from the cultivated soybean (Nannong493-1). Twenty-nine possible candidate genes were mined within the five stable QTLs and most of them are belonging to ubiquitin-protein ligase (PC00234) that have been earlier reported to play significant role in seed/organ size development and regulation. However, it needs further validation to determine their actual role in seed weight and development, although few of them have been included in our on-going projects for functional validation. Hence, after proper functional validation of these candidate genes, these candidate genes can be used for improving 100-seed weight of soybean through transgenic or MAB. Lastly, our study provides detailed information for accurate QTL localization and candidate gene discovery, and these findings will be of great use for MAS of soybean varieties with improved seed-weight.

Data Availability

All datasets generated for this study are included in the manuscript and/or the Supplementary Files.

Author Contributions

TZ conceived and designed the experiments. BK, SC, YX, FC, YZ, and JK performed the experiments. BK and JB analyzed the data. BK and JB drafted the manuscript. TZ and JB revised the manuscript.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
  3 in total

1.  Detection of a major QTL and development of KASP markers for seed weight by combining QTL-seq, QTL-mapping and RNA-seq in peanut.

Authors:  Zhihui Wang; Liying Yan; Yuning Chen; Xin Wang; Dongxin Huai; Yanping Kang; Huifang Jiang; Kede Liu; Yong Lei; Boshou Liao
Journal:  Theor Appl Genet       Date:  2022-03-09       Impact factor: 5.699

2.  Identification of Additive-Epistatic QTLs Conferring Seed Traits in Soybean Using Recombinant Inbred Lines.

Authors:  Meng Li; Lingling Chen; Jian Zeng; Muhammad Khuram Razzaq; Xianchao Xu; Yufei Xu; Wubin Wang; Jianbo He; Guangnan Xing; Junyi Gai
Journal:  Front Plant Sci       Date:  2020-12-10       Impact factor: 5.753

3.  Genome-Wide Association Study Reveals Genetic Architecture and Candidate Genes for Yield and Related Traits under Terminal Drought, Combined Heat and Drought in Tropical Maize Germplasm.

Authors:  Alimatu Sadia Osuman; Baffour Badu-Apraku; Benjamin Karikari; Beatrice Elohor Ifie; Pangirayi Tongoona; Eric Yirenkyi Danquah
Journal:  Genes (Basel)       Date:  2022-02-15       Impact factor: 4.096

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.