Literature DB >> 25124843

Expression quantitative trait loci infer the regulation of isoflavone accumulation in soybean (Glycine max L. Merr.) seed.

Yan Wang, Yingpeng Han, Weili Teng, Xue Zhao, Yongguang Li, Lin Wu, Dongmei Li, Wenbin Li1.   

Abstract

BACKGROUND: Mapping expression quantitative trait loci (eQTL) of targeted genes represents a powerful and widely adopted approach to identify putative regulatory variants. Linking regulation differences to specific genes might assist in the identification of networks and interactions. The objective of this study is to identify eQTL underlying expression of four gene families encoding isoflavone synthetic enzymes involved in the phenylpropanoid pathway, which are phenylalanine ammonia-lyase (PAL; EC 4.3.1.5), chalcone synthase (CHS; EC 2.3.1.74), 2-hydroxyisoflavanone synthase (IFS; EC1.14.13.136) and flavanone 3-hydroxylase (F3H; EC 1.14.11.9). A population of 130 recombinant inbred lines (F5:11), derived from a cross between soybean cultivar 'Zhongdou 27' (high isoflavone) and 'Jiunong 20' (low isoflavone), and a total of 194 simple sequence repeat (SSR) markers were used in this study. Overlapped loci of eQTLs and phenotypic QTLs (pQTLs) were analyzed to identify the potential candidate genes underlying the accumulation of isoflavone in soybean seed.
RESULTS: Thirty three eQTLs (thirteen cis-eQTLs and twenty trans-eQTLs) underlying the transcript abundance of the four gene families were identified on fifteen chromosomes. The eQTLs between Satt278-Sat_134, Sat_134-Sct_010 and Satt149-Sat_234 underlie the expression of both IFS and CHS genes. Five eQTL intervals were overlapped with pQTLs. A total of eleven candidate genes within the overlapped eQTL and pQTL were identified.
CONCLUSIONS: These results will be useful for the development of marker-assisted selection to breed soybean cultivars with high or low isoflavone contents and for map-based cloning of new isoflavone related genes.

Entities:  

Mesh:

Substances:

Year:  2014        PMID: 25124843      PMCID: PMC4138391          DOI: 10.1186/1471-2164-15-680

Source DB:  PubMed          Journal:  BMC Genomics        ISSN: 1471-2164            Impact factor:   3.969


Background

Soy food has been taken as a functional food because it contains many health beneficial molecules such as isoflavones [1]. Studies on human nutrition have shown that soybean isoflavones play an important role in preventing a number of chronic diseases [2, 3]. Equally, isoflavones are critical factors in defending soybean crops against pests [4, 5], in promoting nodulation by rhizobia [6], and in changing or adjusting the microorganisms around plant roots [7]. The major bioactive components of soybean isoflavones in human nutrition are daidzein (DZ), genistein (GT) and glycitein (GC). Isoflavone contents in soybean seed are inherited as complex quantitative traits [8-11]. Since soy seed isoflavones are regulated by multiple genetic factors, their concentrations in seed are highly variable [1, 12–14]. Over fifty QTLs underlying individual and/or total soybean isoflavone content have been reported [8, 15–23]. However, only 12 of these QTLs were in genomic regions encoding isoflavone synthesis enzymes. A group of enzymes in the phenylpropanoid pathway lead to the biosynthesis of DZ, GT and GC [11]. Phenylalanine ammonia lyase (PAL; EC 4.3.1.5), chalcone synthase (CHS; EC 2.3.1.74) and flavanone 3-hydroxylase (F3H; EC 1.14.11.9) [24] are the first three enzymes that convert the amino acid phenylalanine into p-Coumaroyl-CoA in this pathway [11]. In the isoflavonoid biosynthetic pathway [25], the co-catalytic action of CHS [26, 27] with chalcone reductase (CHR; EC 2.3.1.170) [28] produces isoliquiritigenin and naringenin chalcone, which are isomers of the central isoflavanone intermediates naringenin and liquiritigenin, respectively. Isoliquiritigenin and naringenin chalcone are respectively converted into liquiritigenin and naringenin by chalcone isomerase (CHI; EC 5.5.1.6) [29]. These two products are the precursors of DZ and GT, which are formed after the catalysis of the precursors by the key enzyme 2-hydroxyisoflavanone synthase (IFS; EC 1.14.13.136) [30, 31]. The enzyme F3H, that competes with IFS in utilizing naringenin, catalyzes the conversion of flavanones to dihydroflavonols, which are intermediates in the biosynthesis of flavonols, anthocyanidins, catechins and proanthocyanidins [32, 33]. For the synthesis of GC, isoliquiritigenin is likely a precursor to form GC after several biochemical steps, which are not entirely known yet [34]. However, seed isoflavone concentrations in soybean can be regulated by metabolic engineering of the complex phenylpropanoid biosynthetic pathways [35]. Regulating transcript abundance is an effective approach to improve phenotypes [36]. The integrated analysis of genotype and transcript abundance data for association with complex traits can be used to identify novel genetic pathways involved in complex traits. ‘Expression QTL’ (eQTL), first defined by Jansen and Nap [37], could identify the genetic determinants of transcript abundances and is widely used for investigating gene regulation pathways. This approach treats transcript abundance of individual genes as quantitative traits in a segregating population. The eQTL map information enables genetic regulatory networks to be modeled that can provide a better understanding of the underlying phenotypic variation. It has been successfully applied in humans [38-40], plants [41-44], yeasts [45, 46], worms [47], flies [48], mice [49, 50], pigs [51] and rats [52] populations. These studies showed that transcript abundance was highly heritable and could be linked to either a local locus (cis-eQTL) or a distant locus (trans-eQTL). Cis-eQTL is mapped to the same genomic location like an expressed gene (within 5 Mb), and trans-eQTL is mapped to a different genomic location from an expressed gene (>5 Mb or on different chromosomes) [40, 53]. In general, cis-eQTL tends to produce stronger statistical associations than does by trans-eQTL [54]. This phenomenon is regarded as evidence of greater biological plausibility for the existence of true functional cis-eQTL [55]. Trans-eQTL could occur individually at a single genomic locus or could occur collectively as part of eQTL trans-bands [55]. This genomics approach has been employed to identify eQTL related genes in soybean [36, 56–58]. To date, no information concerning eQTLs underlying soybean isoflavone synthetic enzyme genes is available. It has been proved that many enzymes in the phenylpropanoid pathway underlie QTLs that determine the accumulation of isoflavone contents in soybean seeds [11]. Meanwhile, the modification of enzyme encoded genes that are involved in phenylpropanoid pathway could promote the biosynthesis of isoflavone [31, 35]. In this study, PAL, CHS, IFS and F3H in the phenylpropanoid pathway were selected as the target genes (TGs) to analyze isoflavone-relative eQTL. Potential candidate genes underlying the accumulation of isoflavone contents in soybean seed were also evaluated. In addition, overlapped loci both for eQTL and phenotypic QTL (pQTL) were identified.

Results

Total and individual isoflavone contents, target gene transcript abundance and correlation analysis

Transcript abundances of target genes (TGs) between parents from R3 to R8 developmental stages were compared. Total and individual isoflavone contents and transcript abundances of TGs at R6 stage of soybean development were measured in the F5:11 population. The results showed that significant differences among the transcript abundances of TGs between the two parents existed at the R6 stage. The phenotypic variation of individual and total isoflavones showed a continuous distribution (Table  1).
Table 1

Total and individual isoflavone content of the RIL populations and parents

Traitsa Meanb SDb Minb Maxb Zhongdou 27c Jiunong 20c SkewnessKurtosis
DZ9.613.044.3615.888.92 ± 2.974.79 ± 1.12−0.040−0.765
GC0.410.320.292.640.36 ± 0.160.42 ± 0.230.1380.825
GT4.382.550.779.514.22 ± 2.752.81 ± 1.010.480−0.860
TI14.405.215.7025.1113.50 ± 5.216.81 ± 2.270.187−1.061
PAL expression(∆∆CT)3.9267.3880.00937.5700.25200.5900.846
CHS expression(∆∆CT)0.0130.0130.00020.0510.32801.2030.616
IFS expression(∆∆CT)0.8961.3340.0025.1990.70700.9541.700
F3H expression(∆∆CT)4.7983.4810.01310.55010.550−16.0470.156−1.340

aDZ, Daidzein; GC, Glycitein; GT, Genistein; TI, Total isoflavone content.

bμg/100 g(DZ, GC, GT, TI).

cMean ± SD.

Total and individual isoflavone content of the RIL populations and parents aDZ, Daidzein; GC, Glycitein; GT, Genistein; TI, Total isoflavone content. bμg/100 g(DZ, GC, GT, TI). cMean ± SD. GT showed a high positive correlation coefficient with DZ (r = 0.762, P < 0.01; Table  2). The transcript abundance of PAL was positively correlated with both GT and TI, but exhibited no significant correlation with DZ and GC. The transcript abundance of CHS was positive correlated with DZ, GT and TI, but negatively associated with GC amount. The transcript abundance of IFS displayed a positive correlation with DZ, but showed no correlation with other isoflavone components. The transcript abundance of F3H showed significantly negative correlation with individual and total isoflavone contents.
Table 2

Correlations among individual and total isoflavone contents, as well as the transcript abundances of the four TGs in the RIL populations

TraitsDZGCGTTI PALexpression CHSexpression IFSexpression
GC0.249*
GT0.762**0.294*
TI0.943**0.363*0.928**
PAL expression−0.0940.0920.269*0.304*
CHS expression0.223*−0.191*0.201*0.230*0.063
IFS expression0.327*−0.0320.1690.140−0.0220.022
F3H expression−0.248*−0.248*−0.276*−0.273*0.1050.108−0.001

P values were as follows: *P < 0.05, **P < 0.01.

Correlations among individual and total isoflavone contents, as well as the transcript abundances of the four TGs in the RIL populations P values were as follows: *P < 0.05, **P < 0.01.

Identification of genomic region for target genes

Through BLAST searches (http://www.phytozome.net/soybean), the PAL has six homologous regions (E ≤ 0), which are located on Gm10 (LG O, PAL1/ PAL2), Gm13 (LG F, PAL1), Gm03 (LG N, PAL1), Gm19 (LG L, PAL1), Gm20 (LG I) and Gm02 (LG D1b, PAL1). Homologous regions encoding CHS (E-value ≤ 1.0E-05) are located on Gm11 (LG B1, CHS8), Gm01 (LG D1a, CHS6/CHS7), Gm08 (LG A2, CHS1/CHS2/CHS3/CHS4/CHS5/CHS9), Gm05 (LG A1, CHS2), Gm02 (LG D1b), Gm09 (LG K, CHS6), Gm19 (LG L) and Gm13 (LG F). Genes that encode F3H are located on Gm02 (LG D1b, F3H1/F3H2), Gm16 (LG J), Gm01 (LG D1a), Gm11 (LG B1), Gm18 (LG G) and Gm19 (LG L). Genes encoding IFS are located on Gm07 (LG M IFS1), Gm13 (LG F, IFS2), Gm10 (LG O), Gm03 (LG N), Gm12 (LG H), Gm19 (LG L), Gm17 (LG D2) and Gm11 (LG B1). Genes encoding IFS have the function of P450 cytochromes [27] and might have additional functional homologs.

eQTL analysis for four TGs

The linkage map that included 194 SSR markers (accepted by Molecular Biology Reports) and covered 2,312 cM with mean distance of about 12 cM between markers was used to identify eQTLs associated with the expression of the four TGs. Thirty-three eQTLs that appeared to underlie transcript abundance of the four TGs are detected and located on fifteen LGs (Table  3, Figure  1). Regarding to the locational relationships between the eQTL and the genes, thirteen of the eQTLs were cis-acting (within 5 Mb upstream or downstream of the genes) and twenty of the eQTLs were trans-acting (more than 5 Mb away or on different chromosomes) [40, 53].
Table 3

The eQTLs for target genes of , and

TraitseQTLa Gm(LG)MarkerMarker intervalPositionb EnvironmentLOD scoreR2(%)c
PAL dqPALB2_114(B2)Satt560Satt560 ~ Satt5560.012011Harbin3.398.11
dqPALD2_117(D2)Sat_209Sat_209 ~ Sat_02215.902011Harbin4.246.67
CHS qCHSA1_105(A1)Satt 236Satt 236-D26A0.012011Harbin5.484.21
qCHSDla_101(Dla)Satt436Satt436-Sat_3450.012011Harbin8.642.71
qCHSDlb_102(Dlb)Satt546Satt546-Satt459214.802011Harbin2.552.13
qCHSDlb_202(Dlb)Satt546Satt546-Satt459211.222011Harbin2.183.90
dqCHSD2_117(D2)Satt528Satt528-Satt25610.742011Harbin2.732.07
qCHSF_113(F)Sat_234Sat_234-Satt14946.172011Harbin2.723.57
qCHSL_119(L)Satt278Satt278-Sat_13414.002011Harbin2.4015.65
qCHSL_219(L)Sat_134Sat_134-Sct_01024.512011Harbin2.099.98
IFS dqIFSA2_108(A2)Sat_129Sat_129-Sat_18155.452011Harbin7.4617.67
dqIFSC1_104(C1)Sat_042Sat_042-Satt5246.672011Harbin5.6322.8
dqIFSD2_117(D2)Satt186Satt186-Satt22654.882011Harbin8.8716.67
qIFSF_113(F)Satt569Satt569-Satt4236.972011Harbin3.0915.84
qIFSF_213(F)Sat_234Sat_234-Satt14956.012011Harbin10.9217.89
dqIFSH_112(H)Satt302Satt302-Satt2790.012011Harbin3.237.27
dqIFSL_119(L)Sat_134Sat_134-Satt27820.992011Harbin7.2616.12
dqIFSL_219(L)Sct_010Sct_010-Sat_13443.952011Harbin9.7517.97
qIFSN_103(N)Satt152Satt152-Satt5306.672011Harbin2.5027.42
qIFSN_203(N)Satt530Satt530-Satt15229.532011Harbin2.5012.80
dqIFSO_110(O)Satt345Satt345-Satt5926.002011Harbin9.4319.43
dqIFSO_210(O)Sat_341Sat_341-Satt58588.392011Harbin9.7815.69
F3H dqF3HC2_106(C2)Satt322Satt322-Satt65857.642011Harbin2.272.37
qF3HDlb_102(Dlb)Satt157Satt157-Satt27125.712011Harbin3.6210.01
dqF3HDlb_202(Dlb)Sat_135Sat_135-Sat_09630.282011Harbin7.5314.32
qF3HDlb_302(Dlb)Sat_069Sat_069-Sat_279168.622011Harbin2.418.49
dqF3HDlb_402(Dlb)Satt459Satt459-Sat_069185.582011Harbin2.185.54
dqF3HD2_117(D2)Satt031Satt031-Sat_3260.012011Harbin2.671.05
dqF3HE_115(E)Sat_112Sat_112-Sat_38022.092011Harbin2.104.85
dqF3HF_113(F)Sat_262Sat_262-Sat_103101.222011Harbin2.632.24
dqF3HK_109(K)Satt349Satt349-Satt518141.562011Harbin2.081.57
dqF3HN_103(N)Sat_084Sat_084-Sat_30441.452011Harbin4.706.10
dqF3HO_110(O)Satt592Satt592-Satt63327.542011Harbin2.622.32

aeQTL: The nomenclature of the eQTL included four parts: QTL, trait, linkage group name and QTL order in the linkage group, respectively.

bPosition from the left marker of the interval on each linkage group.

cProportion of phenotypic variance (R2) explained by a eQTL.

dTrans-eQTL, others are cis-eQTL.

Figure 1

Summary of eQTL and QTL locations detected in the soybean genome. eQTL/ QTL represented by bars were shown on the left of the linkage groups, close to their corresponding markers. The lengths of the bars were proportional to the confidence intervals of the corresponding eQTL/QTL in which the inner line indicates the position of maximum LOD score.

The eQTLs for target genes of , and aeQTL: The nomenclature of the eQTL included four parts: QTL, trait, linkage group name and QTL order in the linkage group, respectively. bPosition from the left marker of the interval on each linkage group. cProportion of phenotypic variance (R2) explained by a eQTL. dTrans-eQTL, others are cis-eQTL. Summary of eQTL and QTL locations detected in the soybean genome. eQTL/ QTL represented by bars were shown on the left of the linkage groups, close to their corresponding markers. The lengths of the bars were proportional to the confidence intervals of the corresponding eQTL/QTL in which the inner line indicates the position of maximum LOD score. Among the identified eQTLs (Table  3), qPALB2_1 and qPALD2_1 were associated with PAL transcript abundance, and could explain 8.11% and 6.67% of the phenotypic variation, respectively. Eight eQTLs, underlying CHS transcript abundance, were located on six LGs, and could explain 2.07-15.65% of the phenotypic variation. qCHSDla_1 (Satt436-Sat_345, Gm01) was detected with a higher LOD score (8.64) in the regions where cis-elements and CHS family genes were located. Two eQTLs (qCHSDlb_1, qCHSDlb_2), located in the interval of Satt459 and Satt546, could explain 2.13% and 3.90% of phenotypic variance and overlap with qGCD1b_1. qCHSF_1 (Satt149-Sat_234), associated with CHS and IFS transcript abundance, were overlapped with the marker interval of qGTF_2, and could explain 3.57% of phenotypic variance. qCHSL_1 (Satt278-Sat_134) and qCHSL_2 (Sat_134-Sct_010) were associated with the same SSR marker (Sat_134), and contributed 16.12% and 17.97% of the variation of IFS transcript abundance. Twelve eQTLs were associated with IFS expression. Of them, qIFSD2_1 (Satt186-Satt226) explained 16.67% of the phenotypic variation. qIFSF_1 (Satt423-Satt569, R2 = 15.84%) shared the same SSR marker Satt569 with other three QTLs (qDZF_2, qGTF_1, qTIF_2). qIFSN shared the same SSR marker (Satt530) with qGCN_1 (Table  3, Figure  1). Eleven eQTLs were associated with F3H expression (Table  3, Figure  1). Of them, four eQTLs were located on Gm02 (LG Dlb), and explained 5.54-14.32% of the phenotypic variation. qF3HDlb_2 (Sat_135-Sat_096) had higher LOD score and explained 14.32% of the phenotypic variation. qF3HE_1 (R2 = 4.85%) had the same interval (Sat_112- Sat_380) with qGCE_1, qGTE_1 and qTIE_1, meanwhile, qF3HF_1 and qDZF_1 shared the same marker interval (Sat_262- Sat_103) (Figure  1).

Identification of candidate genes underlying the overlapped loci of pQTL and eQTL

Thirty four pQTLs for both individual and total seed isoflavone contents of soybean were compared with eQTLs to identify the overlapped loci. Five eQTL intervals were overlapped with pQTLs, and a total of eleven candidate genes within the overlapped eQTL and pQTL were identified (Table  4). Two genes, C4H (Glyma02g40290.1) and PAL1 (Glyma02g47940.1), were identified on Gm02 (LG D1b) between Satt546-Satt459. CHI (Glyma17g34430.1) and DFR (dihydroflavonol reductase; EC 1.1.1.219) were identified on Gm17 (LG D2) between Satt186-Satt226. Genes encoding 4-coumarate-CoA ligase (EC 6.2.1.12; Glyma13g01080.1/2), FLS (Glyma13g02740.1) and CHS (Glyma13g09640.1) were identified on Gm13 (LG F) between Satt423-Satt569. Additionally, CHS (Glyma13g24200.1) and IFS (Glyma13g09640.1) was found within another eQTL/pQTL interval (Satt149-Sat_234).
Table 4

Identification of candidate genes underlying overlapped locus of eQTL and QTL

  Marker interval  Gm(LG)  Physical location of markersCandidate genesPhysical location of candidate genesFunction of candidate genes
Satt546-Satt459Gm02(LGD1b)43,775,407-48,390,089 Glyma02g40290.1 45,490,798-45,495,043 C4H
Glyma02g47940.1 51,366,326-51,368,943 PAL1
Satt186-Satt226Gm17(LG D2)26,768,866-39,047,375 Glyma17g34430.1 38,398,978-38,401,025 CHI
Glyma17g37060.1 40,920,379-40,923,898 DFR
Satt423-Satt569Gm13(LG F)5,231,035-9,567,285 Glyma13g01080.1/2 798,836-805,844 4CL
Glyma13g02740.1 2,707,784-2,712,790 FLS
Glyma13g09640.1 11,153,569-11,158,812 CHS
Satt149-Sat_234Gm13(LG F)4,976,740-26,460,745 Glyma13g24200.1 27,567,360-27,569,061 IFS
Glyma13g20800.1 24,273,025-24,278,037 PAL1
Glyma13g27380.1 30,577,113-30,579,230 DFR
Glyma13g09640.1 11,153,569-11,158,812 CHS
Glyma13g02740.1 2,707,784-2,712,790 FLS
Sat_262-Sat_103Gm13(LG F)7,233,012-25,478,474 Glyma13g20800.1 24,273,025-24,278,037 PAL1
Glyma13g24200.1 27,567,360-27,569,061 IFS
Glyma13g09640.1 11,153,569-11,158,812 CHS
Glyma13g02740.1 2,707,784-2,712,790 FLS
Identification of candidate genes underlying overlapped locus of eQTL and QTL

Discussion

Soybean isoflavones have been broadly used in food, medicine, cosmetics and animal husbandry [59]. Increasing and decreasing seed isoflavone content will be an important target of soybean breeding. MAS based on genotype selection rather than solely on phenotype selection provides additional power for the selections during soybean breeding [60]. Cultivar ‘Zhongdou 27’ proved to have high-isoflavone content (3,791 μg/g isoflavone in seed) as reported previously [16]. Meng et al. [19] identified two QTL underlying resistance to soybean aphid through leaf isoflavone-mediated antibiosis in soybean cultivar ‘Zhongdou 27’. A number of pQTLs associated with seed isoflavone were identified in multiple environments from cultivar ‘Zhongdou 27’ using 194 SSR markers (accepted by Molecular Biology Reports). Therefore, ‘Zhongdou 27’ should be given more attention as an elite germplasm to improve soybean seed isoflavone concentration, disease and pest resistances. In our previous studies, some identified QTLs associated with individual/total isoflavone contents showed higher contribution to phenotypic variation. Some specific copies of genes (PAL, CHS, IFS, F3H) in the phenylpropanoid pathway were near or falling into these quantitative trait loci by browsing the reference genome sequence of Williams 82 (http://www.phytozome.net/soybean). To investigate the regulation mechanism of isoflavone synthetic enzyme genes, the transcript abundances of PAL, CHS, IFS and F3H in the mapping population were examined, and the genomic regions affecting the expression of the TGs were identified using the eQTL methodology [61]. A global microarray eQTL analysis of a limited number of samples can be used for exploring functional and regulatory gene networks and for scanning cis-eQTL, whereas the subsequent analysis of a subset of likely cis-regulated genes by real-time RT-PCR in a larger number of samples may identify QTL region by targeting these positional candidate genes [62]. In this study, real-time PCR reactions were used to analyze the transcript abundance variations of the four TGs in the F5:11 RI lines. When combined with classical QTL phenotypes, correlation analysis can directly provide an overview of potential genes underlying isoflavone traits [63, 64]. Through the comparison of the transcript abundances of the four TGs (PAL, CHS, IFS and F3H), the parents (‘Zhongdou 27’ and ‘Jiunong 20’) showed different patterns at the R6 stage. This observation was consistent with the previous report by Sarah et al. [65]. Significant correlations between the transcript abundances of TGs and isoflavone contents were found in developing seeds at the R6 stage, indicating that these genes could affect total and individual isoflavone accumulations (Table  2). Previously, two major QTLs that affect isoflavone content across multiple environments were mapped on Gm05 (LG A1) and Gm08 (LG A2) by Gutierrez et al. [17] and Yang et al. [20], respectively. In the present work, one eQTL qIFSA2_1 (Sat_129-Sat_181) was mapped close to qGCA2_1 on Gm08 (LG A2) (Figure  1, Table  5). This result suggested that qIFSA2_1 might be a cis-enzyme related locus. Some of these identified eQTLs associated with seed isoflavone content did not coincide with the TGs, suggesting that the differences in TGs transcript abundances might be caused by several trans-acting factors [66].
Table 5

Partial QTLs for individual and total isoflavone contents

Traitsa QTLb Gm (LG)MarkerMarker intervalPositionc Environmentd LOD scoreR2(%)e
DZ fqDZF_113(F)Sat_103Sat_103-Sat_262188.34E22.0010.57
GCqGCA2_108(A2)Sat_040Sat_040-Satt23338.46E32.656.01
fqGCD1b_102(Dlb)Satt546Satt546-Sat_459215.67E22.383.12
E52.214.17
GT fqGTD2_117(D2)Satt186Satt186-Satt22650.81E12.003.41
E22.365.23
E35.7610.98
E53.098.23
fqGTF_213(F)Satt149Satt149-Sat_23441.23E12.001.56
E32.494.17
E74.035.47
TI fqTIF_113(F)Satt423Satt423-Satt5696.01E64.593.21
E72.154.2

aDZ: Daidzein; GC:Glycitein; GT: Genistein; TI: Total isoflavone.

bThe nomenclature of the QTL included four parts : QTL, trait, linkage group name and QTL order in the linkage group, respectively.

cPosition from the left marker of the interval on each linkage group.

dE1: at Harbin in 2005, E2: at Harbin in 2006, E3: at Hulan in 2006, E4:at Suihua in 2006, E5: at Harbin in 2007, E6: at Hulan in 2007, E7: at Suihua in 2007.

eProportion of phenotypic variance (R2) explained by a QTL.

fOverlapped loci of pQTL and eQTL.

Partial QTLs for individual and total isoflavone contents aDZ: Daidzein; GC:Glycitein; GT: Genistein; TI: Total isoflavone. bThe nomenclature of the QTL included four parts : QTL, trait, linkage group name and QTL order in the linkage group, respectively. cPosition from the left marker of the interval on each linkage group. dE1: at Harbin in 2005, E2: at Harbin in 2006, E3: at Hulan in 2006, E4:at Suihua in 2006, E5: at Harbin in 2007, E6: at Hulan in 2007, E7: at Suihua in 2007. eProportion of phenotypic variance (R2) explained by a QTL. fOverlapped loci of pQTL and eQTL. In this study, since the 194 markers were not uniformly distributed, large gaps appeared with low marker density on chromosomes Gm02, 04, 13, 16 and 18, implying that more markers should be developed among these gaps and the authenticity of pQTL or eQTL should be further clarified. Among these gaps, special attention should be paid to eQTL qF3HDlb_2 on chromosome Gm02 and qIFSC1_1 on chromosome Gm04 because of their higher LOD score and contribution to phenotypic variation (Table  3). Overlapped loci of qF3HF_1 and qDZF_1, and genes that fall into this region should also be further clarified with more markers. Consequently, fine mapping on these intervals with more SSR or SNP markers and to determine the authenticity of these loci as well as the underlying genes were extremely essential in the future work. The analysis of eQTL overlapped with pQTL suggested that the candidate genes or elements among the marker intervals could affect phenotypic traits [49, 67, 68]. Therefore, overlapped loci of eQTLs and pQTLs were analyzed to find the potential candidate genes affecting the accumulation of isoflavone contents in soybean seed. Five eQTL intervals were overlapped with pQTLs according to the comparison of genomic regions between pQTLs and eQTLs (Table  5). These results indicated that some candidate genes or elements in these intervals could regulate the biosynthesis of isoflavone components, and affect their accumulation. Additionally, some eQTLs overlapped with other eQTLs or shared the same markers with pQTLs, suggesting that some candidate genes or elements were located near these loci. Several genes involved in isoflavone accumulation in soybean seed had been identified [22, 27, 31]. 11 candidate genes falling into the overlapped intervals of pQTL and eQTL were found (Table  4). Bolon et al. [58] identified eQTL for genes with seed-specific expression and discovered striking eQTL hotspots at distinct genomic intervals on chromosome Gm13. A chalcone isomerase (CHI3) and IFS2 gene were located in the same region identified by qGEN13 on Gm13 [11]. Another QTL for GC that encoded PAL and 4CL paralog was also reported on Gm13 [10, 11]. In the present work, seven candidate genes on Gm13 (LG F) were identified, implying that there could be a hotspot of gene cluster that regulated seed isoflavone content on Gm13. Among them, CHS (Glyma13g09640.1) and FLS (Glyma13g02740.1) were identified on three overlapped loci, implying that they could interact or trans-regulate other genes in the phenylpropanoid pathway. Furthermore, PAL1 (Glyma13g20800.1) and IFS (Glyma13g24200.1) paralogs were identified within two overlapped loci. In the marker interval (Satt149-Sat_234) associated with qCHSF_1, qIFSF_2 and qGTF_2, both Glyma13g24200.1 and Glyma13g09640.1 were found to encode CHS and IFS, indicating that they could be the potential candidate genes. It was supposed that Glyma13g09640.1 could interact or trans-regulate the expression of IFS. However, the function of these potential candidate genes should be tested in future works. Although open questions about the biology and applications of eQTL mapping still exist [69], there are considerable advances in the eQTL studies. Detailed analysis of eQTL combined with cluster analysis of transcript abundance and eventually gene expression patterns could assist map-based cloning of genes underlying these traits. Markers based on underlying genes are also desirable for MAS in soybean breeding programs. The mechanism underlying seed isoflavone synthesis and its accumulation may contribute to the development of marker-assisted selection for soybean cultivars with high or low isoflavone contents.

Conclusions

A total of thirty three eQTLs (thirteen cis-eQTLs and twenty trans-eQTLs) were identified on fifteen chromosomes. Five eQTL intervals were overlapped with pQTLs and a total of eleven candidate genes within the overlapped eQTL and pQTL were identified. These results might be beneficial for the development of marker-assisted selection to breed soybean cultivars with high isoflavone contents.

Methods

Plant materials and growing conditions

The mapping population of 130 F5:11 recombinant inbred (RI) lines were derived through single-seed-descent from the cross between ‘Zhongdou 27’ (developed by the Chinese Academy of Agricultural Sciences, Beijing, China) and ‘Jiunong 20’ (developed by Jilin Academy of Agricultural Sciences, Jilin, China). ‘Zhongdou 27’ contains high individual and total isoflavone (TI) contents in seed (daidzein, DZ, 1,865 μg/g; genistein, GT, 1,614 μg/g; glycitein, GC, 311 μg/g and total isoflavone, TI, 3,791 μg/g), whereas ‘Jiunong 20’ has low individual and TI contents (DZ, 844 μg/g; GT, 1,046 μg/g; GC, 193 μg/g and TI, 2,061 μg/g). To detect eQTL, the parents and the 130 F5:11 RI lines were planted at Harbin, Heilongjiang Province, China, in 2011. Randomized complete block designs were used for all experiments with rows 3 m long, 0.65 m apart, and a space of 6 cm between plants. Mature and immature seeds in the reproductive stages (from soybean growth stage R3 to R8) [70] were harvested from a bulked sample collected from three plants in each plot. These samples were quantified for individual and total seed isoflavone contents and transcript abundances.

Isoflavone extraction and quantification

Approximately 150 g of soybean seed samples were ground to a fine power using a commercial coffee grinder. Isoflavones were extracted from flour and separated using HPLC as described previously [16]. Measurements were done as micrograms of isoflavone per gram of seeds plus and minus the standard deviations (μg/g ± SD).

Synthesis of cDNA, Real-Time PCR and data collection

To investigate the expressions of four TGs, total RNA was isolated from soybean seed samples from R3 to R8 stages using plant RNA purification reagent Kit (D9108A, TaKaRa, Japan). RNAs were transcribed to cDNA using the first strand DNA synthesis reagent Kit (D6110A, TaKaRa, Otsu, Shiga, Japan). Four TGs (PAL, GenBank accession: GQ220305; CHS, GenBank accession: EU526827; IFS, GenBank accession: FJ770473 and F3H, GenBank accession: AY595420) in the phenylpropanoid pathway, were selected to analyze the transcript abundance variations in the F5:11 RI line population. These four TGs were analyzed by real-time PCR (Kit DRR081A, TaKaRa, Japan). Gene-specific primers for expression analysis of the four TGs were listed in Table  6. Primer specificity was confirmed based on each primer pair sequence against soybean genome sequences by BLASTing (http://www.phytozome.net/soybean) using the BLASTN algorithm. Moreover, through the BLASTN of the sequences of the TGs, PAL2 (located on Gm10 (LG O)) of the PAL gene family, CHS8 (located on Gm11 (LG B1)) of the CHS gene family, IFS1 (located on Gm07 (LG M)) of the IFS gene family, and F3H1 and F3H2 (located on Gm02 (LG D1b)) of the F3H gene family were amplified [11].
Table 6

Real-time PCR primer pairs for the expression analyses of , , , and genes

GeneForward primer (5′-3′)Reverse primer (5′-3′)PCR product length (bp)
Actin4 GTGTCAGCCATACTGTCCCCATTTGTTTCAAGCTCTTGCTCGTAATCA214
PAL ATTATGGATTCAAGGGAGCTAATGAGGAAAGTGGAGGACA182
CHS AAAATGCCATCTCCTCAAACAGGATCTCAGCTACGCTCACC155
F3H GCTTGCGAGAATTGGGGTATCCTTGGAGATGGCTGGAGAC176
IFS GCCCTGGAGTCAATCTGGCAAGACTATGTGCCCTTGGA171
Real-time PCR primer pairs for the expression analyses of , , , and genes PCR amplification was performed as follows: 95°C for 60 s, followed by 40 cycles of 95°C for 11 s, 60°C for 12 s and 72°C for 18 s. The soybean actin4 (GenBank accession: AF049106) gene was used as a reference to quantify the expression levels of the target genes [71]. Three replicates for each reaction were performed. The relative transcript abundance of TGs in different samples was calculated using 2-ΔΔCt method [72], defined as: ΔCt = Ct (target) – Ct (actin). Pearson correlations between total/individual isoflavone contents and the expression of the four TGs in F5:11 RILs were evaluated using SAS 8.2 (Cary, NC, USA) [73].

Identification of genomic region of target genes

The whole genome sequence Glyma1 assembly for Williams 82 [74] provided a powerful tool for interrogating QTL data. Previously reported genes for isoflavone biosynthesis [75] were used in BLAST searches against the whole genome sequence to identify homologous regions in the genome with assigned or putative functions. All twenty soybean chromosomes have regions sharing a high percentage of homology with genes of known function in the phenylpropanoid pathway [11]. The coding regions of TGs were compared with genome of Williams 82 through BLAST (E-value ≤ 1.0E-05, http://www.phytozome.net/soybean) to identify homologous regions.

eQTL analysis

In previous work, fifteen QTL underlying seed isoflavone contents of soybean were identified based on RI line populations derived from a cross between ‘Zhongdou 27’ (high isoflavone) and ‘Jiunong 20’ (low isoflavone) through a genetic linkage map including 99 SSR markers [16]. Another 95 SSR markers were added to the map of Zeng et al. [16] to identify novel phenotypic QTLs (pQTLs) associated with seed isoflavone contents of soybean (accepted by Molecular Biology Reports). In this study, 194 polymorphic markers were assembled onto the 20 linkage groups (LGs) by Mapmaker 3.0b with the Kosambi mapping function [76]. WinQTLCart2.1 [77] was used to detect eQTL between marker intervals by 1,000 permutations at significance (P ≤ 0.05). The genetic linkage map was constructed using Mapchart 2.1 [78]. The nomenclature of the eQTLs/pQTLs included four parts following the recommendations of the Soybean Germplasm Coordination Committee. For example, qCHSF_1, q, CHS, F and 1 represent eQTL, trait (CHS), linkage group name and eQTL order in the linkage group, respectively.

Identification of candidate genes underlying overlapped loci of pQTL and eQTL

Coincident genetic locations of eQTL and pQTL may be available to identify important regulatory genes underlying traits, and lead to the identification of molecular mechanisms [49, 67, 68]. Previous studies have combined eQTL and pQTL mapping to gain insight into regulatory pathways involved in determining phenotypic traits [49, 68, 79–81]. eQTL located in the same marker intervals of pQTL might contribute to significant phenotypic variations [49, 67, 68]. In this study, thirty four phenotypic QTL (pQTL) identified with the 194 SSR markers were compared with eQTL to identify overlapped loci. Genetic map positions were estimated by identifying the nearest flanking SSR markers using the genome browser (http://www.soybase.org). The candidate genes underlying overlapped loci of pQTL and eQTL were identified by browsing after using BLAST search of flanking markers against the whole genome sequence of Williams 82 (available at: http://www.phytozome.net/soybean).
  62 in total

1.  Integrating genotypic and expression data in a segregating mouse population to identify 5-lipoxygenase as a susceptibility gene for obesity and bone traits.

Authors:  Margarete Mehrabian; Hooman Allayee; Jirina Stockton; Pek Yee Lum; Thomas A Drake; Lawrence W Castellani; Michael Suh; Christopher Armour; Stephen Edwards; John Lamb; Aldons J Lusis; Eric E Schadt
Journal:  Nat Genet       Date:  2005-10-02       Impact factor: 38.330

Review 2.  Quantitative genomics: analyzing intraspecific variation using global gene expression polymorphisms or eQTLs.

Authors:  Dan Kliebenstein
Journal:  Annu Rev Plant Biol       Date:  2009       Impact factor: 26.379

3.  Expression quantitative trait loci analysis of genes in porcine muscle by quantitative real-time RT-PCR compared to microarray data.

Authors:  S Ponsuksili; E Murani; C Phatsara; M Schwerin; K Schellander; K Wimmers
Journal:  Heredity (Edinb)       Date:  2010-02-10       Impact factor: 3.821

4.  Major locus and other novel additive and epistatic loci involved in modulation of isoflavone concentration in soybean seeds.

Authors:  Juan J Gutierrez-Gonzalez; Tri D Vuong; Rui Zhong; Oliver Yu; Jeong-Dong Lee; Grover Shannon; Mark Ellersieck; Henry T Nguyen; David A Sleper
Journal:  Theor Appl Genet       Date:  2011-08-18       Impact factor: 5.699

5.  Identifying regions of the wheat genome controlling seed development by mapping expression quantitative trait loci.

Authors:  Mark C Jordan; Daryl J Somers; Travis W Banks
Journal:  Plant Biotechnol J       Date:  2007-03-26       Impact factor: 9.803

6.  Molecular characterization of flavanone 3 beta-hydroxylases. Consensus sequence, comparison with related enzymes and the role of conserved histidine residues.

Authors:  L Britsch; J Dedio; H Saedler; G Forkmann
Journal:  Eur J Biochem       Date:  1993-10-15

7.  Identification of QTL underlying isoflavone contents in soybean seeds among multiple environments.

Authors:  Guoliang Zeng; Dongmei Li; Yingpeng Han; Weili Teng; Jian Wang; Liquan Qiu; Wenbin Li
Journal:  Theor Appl Genet       Date:  2009-03-06       Impact factor: 5.699

8.  Integrated transcriptional profiling and linkage analysis for identification of genes underlying disease.

Authors:  Norbert Hubner; Caroline A Wallace; Heike Zimdahl; Enrico Petretto; Herbert Schulz; Fiona Maciver; Michael Mueller; Oliver Hummel; Jan Monti; Vaclav Zidek; Alena Musilova; Vladimir Kren; Helen Causton; Laurence Game; Gabriele Born; Sabine Schmidt; Anita Müller; Stuart A Cook; Theodore W Kurtz; John Whittaker; Michal Pravenec; Timothy J Aitman
Journal:  Nat Genet       Date:  2005-02-13       Impact factor: 38.330

9.  Identification of single nucleotide polymorphisms regulating peripheral blood mRNA expression with genome-wide significance: an eQTL study in the Japanese population.

Authors:  Daimei Sasayama; Hiroaki Hori; Seiji Nakamura; Ryo Miyata; Toshiya Teraishi; Kotaro Hattori; Miho Ota; Noriko Yamamoto; Teruhiko Higuchi; Naoji Amano; Hiroshi Kunugi
Journal:  PLoS One       Date:  2013-01-24       Impact factor: 3.240

10.  A systems biology approach identifies a R2R3 MYB gene subfamily with distinct and overlapping functions in regulation of aliphatic glucosinolates.

Authors:  Ida Elken Sønderby; Bjarne Gram Hansen; Nanna Bjarnholt; Carla Ticconi; Barbara Ann Halkier; Daniel J Kliebenstein
Journal:  PLoS One       Date:  2007-12-19       Impact factor: 3.240

View more
  12 in total

Review 1.  Quantitative trait loci from identification to exploitation for crop improvement.

Authors:  Jitendra Kumar; Debjyoti Sen Gupta; Sunanda Gupta; Sonali Dubey; Priyanka Gupta; Shiv Kumar
Journal:  Plant Cell Rep       Date:  2017-03-28       Impact factor: 4.570

2.  Detecting the QTL-allele system of seed isoflavone content in Chinese soybean landrace population for optimal cross design and gene system exploration.

Authors:  Shan Meng; Jianbo He; Tuanjie Zhao; Guangnan Xing; Yan Li; Shouping Yang; Jiangjie Lu; Yufeng Wang; Junyi Gai
Journal:  Theor Appl Genet       Date:  2016-05-17       Impact factor: 5.699

3.  Mapping isoflavone QTL with main, epistatic and QTL × environment effects in recombinant inbred lines of soybean.

Authors:  Yan Wang; Yingpeng Han; Xue Zhao; Yongguang Li; Weili Teng; Dongmei Li; Yong Zhan; Wenbin Li
Journal:  PLoS One       Date:  2015-03-04       Impact factor: 3.240

Review 4.  QTLomics in Soybean: A Way Forward for Translational Genomics and Breeding.

Authors:  Giriraj Kumawat; Sanjay Gupta; Milind B Ratnaparkhe; Shivakumar Maranna; Gyanesh K Satpute
Journal:  Front Plant Sci       Date:  2016-12-21       Impact factor: 5.753

5.  eQTLs Regulating Transcript Variations Associated with Rapid Internode Elongation in Deepwater Rice.

Authors:  Takeshi Kuroha; Keisuke Nagai; Yusuke Kurokawa; Yoshiaki Nagamura; Miyako Kusano; Hideshi Yasui; Motoyuki Ashikari; Atsushi Fukushima
Journal:  Front Plant Sci       Date:  2017-10-13       Impact factor: 5.753

Review 6.  Exploiting Phenylpropanoid Derivatives to Enhance the Nutraceutical Values of Cereals and Legumes.

Authors:  Sangam L Dwivedi; Hari D Upadhyaya; Ill-Min Chung; Pasquale De Vita; Silverio García-Lara; Daniel Guajardo-Flores; Janet A Gutiérrez-Uribe; Sergio O Serna-Saldívar; Govindasamy Rajakumar; Kanwar L Sahrawat; Jagdish Kumar; Rodomiro Ortiz
Journal:  Front Plant Sci       Date:  2016-06-03       Impact factor: 5.753

7.  Overexpression of Soybean Isoflavone Reductase (GmIFR) Enhances Resistance to Phytophthora sojae in Soybean.

Authors:  Qun Cheng; Ninghui Li; Lidong Dong; Dayong Zhang; Sujie Fan; Liangyu Jiang; Xin Wang; Pengfei Xu; Shuzhen Zhang
Journal:  Front Plant Sci       Date:  2015-11-23       Impact factor: 5.753

Review 8.  Expanding Omics Resources for Improvement of Soybean Seed Composition Traits.

Authors:  Juhi Chaudhary; Gunvant B Patil; Humira Sonah; Rupesh K Deshmukh; Tri D Vuong; Babu Valliyodan; Henry T Nguyen
Journal:  Front Plant Sci       Date:  2015-11-24       Impact factor: 5.753

9.  Quantitative Trait Transcripts Mapping Coupled with Expression Quantitative Trait Loci Mapping Reveal the Molecular Network Regulating the Apetalous Characteristic in Brassica napus L.

Authors:  Kunjiang Yu; Xiaodong Wang; Feng Chen; Qi Peng; Song Chen; Hongge Li; Wei Zhang; Sanxiong Fu; Maolong Hu; Weihua Long; Pu Chu; Rongzhan Guan; Jiefu Zhang
Journal:  Front Plant Sci       Date:  2018-02-01       Impact factor: 5.753

10.  Fine-mapping of QTLs for individual and total isoflavone content in soybean (Glycine max L.) using a high-density genetic map.

Authors:  Zhandong Cai; Yanbo Cheng; Zhuwen Ma; Xinguo Liu; Qibin Ma; Qiuju Xia; Gengyun Zhang; Yinghui Mu; Hai Nian
Journal:  Theor Appl Genet       Date:  2017-11-20       Impact factor: 5.699

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.