Literature DB >> 30157757

Construction of a high-density genetic map and mapping of QTLs for soybean (Glycine max) agronomic and seed quality traits by specific length amplified fragment sequencing.

Yanwei Zhang1, Wei Li2, Yanhui Lin2, Lifeng Zhang2, Caijie Wang2, Ran Xu2.   

Abstract

BACKGROUND: Soybean is not only an important oil crop, but also an important source of edible protein and industrial raw material. Yield-traits and quality-traits are increasingly attracting the attention of breeders. Therefore, fine mapping the QTLs associated with yield-traits and quality-traits of soybean would be helpful for soybean breeders. In the present study, a high-density linkage map was constructed to identify the QTLs for the yield-traits and quality-traits, using specific length amplified fragment sequencing (SLAF-seq).
RESULTS: SLAF-seq was performed to screen SLAF markers with 149 F8:11 individuals from a cross between a semi wild soybean, 'Huapidou', and a cultivated soybean, 'Qihuang26', which generated 400.91 M paired-end reads. In total, 53,132 polymorphic SLAF markers were obtained. The genetic linkage map was constructed by 5111 SLAF markers with segregation type of aa×bb. The final map, containing 20 linkage groups (LGs), was 2909.46 cM in length with an average distance of 0.57 cM between adjacent markers. The average coverage for each SLAF marker on the map was 81.26-fold in the male parent, 45.79-fold in the female parent, and 19.84-fold average in each F8:11 individual. According to the high-density map, 35 QTLs for plant height (PH), 100-seeds weight (SW), oil content in seeds (Oil) and protein content in seeds (Protein) were found to be distributed on 17 chromosomes, and 14 novel QTLs were identified for the first time. The physical distance of 11 QTLs was shorter than 100 Kb, suggesting a direct opportunity to find candidate genes. Furthermore, three pairs of epistatic QTLs associated with Protein involving 6 loci on 5 chromosomes were identified. Moreover, 13, 14, 7 and 9 genes, which showed tissue-specific expression patterns, might be associated with PH, SW, Oil and Protein, respectively.
CONCLUSIONS: With SLAF-sequencing, some novel QTLs and important QTLs for both yield-related and quality traits were identified based on a new, high-density linkage map. Moreover, 43 genes with tissue-specific expression patterns were regarded as potential genes in further study. Our findings might be beneficial to molecular marker-assisted breeding, and could provide detailed information for accurate QTL localization.

Entities:  

Keywords:  Genetic map; Plant height; QTL; Quality traits; SLAF-seq; Seed weight; Soybean

Mesh:

Year:  2018        PMID: 30157757      PMCID: PMC6116504          DOI: 10.1186/s12864-018-5035-9

Source DB:  PubMed          Journal:  BMC Genomics        ISSN: 1471-2164            Impact factor:   3.969


Background

Soybean is not only an important oil crop, but also an important source of edible protein and industrial raw material [1]. Agronomic traits, such as yield, plant height (PH), lodging and seed weight (SW), have been the primary focus of breeders for many years. As the major factors of the market price of soybean, seed quality traits are increasingly attracting the attention of breeders. However, the negative correlation between yield and quality of crops makes it much difficult to select for these traits [2]. Therefore, simultaneous improvement of yield and quality has become a major problem for soybean breeders. Molecular marker-assisted selection (MAS) might be an alternative to fit the increasing global demand for soybean products [3]. A number of QTLs underlying important agronomic traits and seed quality traits have been reported over the past decades. So far, at least 196, 265, 297 and 221 QTLs controlling PH, SW, Oil and Protein have been identified respectively (www.soybase.org), based on the different genetic backgrounds, environments and statistical methods. Furthermore, large confidence intervals around QTLs make the causative gene identification difficult. With the development of next generation sequencing technology, several methods for single nucleotide polymorphisms (SNP) discovery, such as restriction-site associated DNA sequencing (RADseq) [4, 5], genotyping-by-sequencing (GBS) [6], specific length amplified fragment sequencing (SLAF-seq) [7] have been produced, which make it possible to obtain thousands of SNPs suitable for high-density genetic map throughout the genome. SLAF markers, which have the properties of being present in large amount, being evenly distributed and avoiding repeated sequences, has been used for genetic analysis in plants, such as sesame [8], walnut [9], rice [10], sorghum [11], wax gourd [12], grape [13] and soybean [14-17]. Since the first high-density map was constructed by SLAF-seq [7], there have been several maps reported so far. Qi et al. constructed a map, including 5308 markers with 2655.68 cM in length, using a RIL population derived from a cross between ‘Charleston’ and ‘Dongnong594’ [16]. Li et al. constructed a high-density map, using a F5:8 population of 110 RILs from a cross between ‘Luheidou2’ and ‘Nanhuizao’, which was used to identified QTLs associated the isoflavone content and fatty acid composition in soybean [15, 18]. Zhang et al. reported 20 QTLs associated with phosphorus efficiency-related traits based on a high-density map constructed by SLAF-seq [17]. Cao et al. mapped QTL associated with plant height and flowering time according the map constructed by SLAF-seq using a population of 236 RILs derived from a cross between two summer planting varieties, ‘ZXD’ and ‘NN1138–2’ [14]. Nevertheless, based on the high-density map, little QTLs related to seed weight and/or protein have been reported. Therefore, we reported a high-density genetic linkage map using the SLAF-seq approach, which was based on an F8:11 RIL population with 149 individuals. Moreover, the QTLs associated with plant height, seed weight, oil and protein content were located and analyzed. The results presented here will aid molecular marker-assisted breeding and provide detailed information for accurate QTL localization.

Results

Analysis of SLAF-seq and SLAF markers

DNA sequencing generated about 400.91 M pair-end reads. The Q30 (indicating a 0.1% chance of error) was 90.69% and guanine-cytosine (GC) content was 40.3%. The numbers of SLAFs in the female and male parents were 312,740 and 275,046, respectively. The numbers of SLAFs in each individual ranged from 167,933 to 237,666, with an average of 207,105. Among the 391,476 SLAF markers detected, 53,132 markers were polymorphic. All polymorphic SLAFs were then genotyped separately for all individuals. After discarding the SLAF markers lacking parent information, 30,415 markers were genotyped successfully and were classified into eight segregation types (Fig. 1). As the population was derived from a cross between two fully homozygous parents, only 27,472 markers with aa×bb type might be suitable for map construction. After filtering low-quality SLAF markers, segregation distortion markers and makers with the MLOD value ≤3, 5111 markers were used for the map construction (Additional files 1, 2, and 3). The average depth of the markers was 81.26-fold in the female parent, 45.79-fold in the male parent, and 19.84-fold in the offspring.
Fig. 1

Number of markers for eight segregation patterns

Number of markers for eight segregation patterns

The basic characteristics of the genetic map

The length of final map was 2909.46 cM, with an average distance of 0.57 cM between adjacent markers (Table 1; Fig. 2). There were 8597 SNP loci among the 5111 markers on the map. For each chromosome, the average distance ranged from 0.24 cM to 2.55 cM (Table 1). The largest linkage group was LG18 (chr18) with 480 markers, a length of 202.52 cM, and an average distance of only 0.42 cM between adjacent markers. The smallest linkage group was LG7 (chr7) with 63 markers, a length of 54.57 cM, and an average distance of 0.87 cM between adjacent markers.
Table 1

Description on basic characteristics of the 20 linkage groups

Linkage group IDSLAF numberSNP numberTotal length (cM)Average distance between markers (cM)
chr15787145.472.55
chr2408628174.90.43
chr3329523163.130.5
chr4233373180.320.77
chr5126174143.231.14
chr67510793.211.24
chr76311054.570.87
chr8143216200.621.4
chr9136219101.890.75
chr108091431190.370.24
chr11103149123.311.2
chr12638465.391.04
chr13350656156.620.45
chr14170261167.730.99
chr155821089168.070.29
chr16239397185.310.78
chr17568955162.120.29
chr18480848202.520.42
chr19487860.851.27
chr20129212169.831.32
total511185972909.460.57
Fig. 2

The positions of QTLs for four traits. QTLs for four traits are depicted in different shapes on the right side of each linkage group. 35 QTLs identified by ICIM are colored in black; 18 QTLs identified in 2013 by CIM are colored in red; 21 QTLs identified in 2014 by CIM are colored in green

Description on basic characteristics of the 20 linkage groups The positions of QTLs for four traits. QTLs for four traits are depicted in different shapes on the right side of each linkage group. 35 QTLs identified by ICIM are colored in black; 18 QTLs identified in 2013 by CIM are colored in red; 21 QTLs identified in 2014 by CIM are colored in green

Phenotypic evaluation

‘Huapidou’ and ‘Qihuang26’ showed a significant difference in PH, SW and Protein, but did not differ from each other in Oil significantly (Table 2; Fig. 3). However, the phenotypic values were all in a condition of continuous distribution approximately (Fig. 3). The coefficients of variation for four traits were about 20%. The heritabilities of four traits ranged from 49.58 to 82.73%. However, the heritability of Protein was only 49.58%, indicating that other factors affected Protein should be considered.
Table 2

Summary of soybean traits in the RIL population and parents

TraitYearParentsRILs
HuapidouQihuang26maxminmeanSDSkewnessKurtosisCV(%)h2 (%)
PH (cm)201373.6042.50126.6046.2079.1417.620.37−0.7521.9282.73
201470.5046.40173.542.4088.4326.910.910.5520.53
SW (g)201311.4021.4021.7211.0516.302.120.29− 0.1619.8666.37
201412.1023.8026.4012.3519.292.480.610.6617.65
Oil (%)201318.8319.6522.7317.0020.230.91−0.200.8915.8870.71
201419.4419.7223.2617.8220.051.080.861.1219.85
Protein (%)201342.3246.4649.3241.9845.171.400.25−0.0619.0749.58
201442.5946.6449.3140.9445.151.700.04−0.3920.31
Fig. 3

The frequency distribution for soybean traits of the RIL population and parents. F: female parent, ‘Huapidou’; M: male parent, ‘Qihuang26’; (a): the frequency distribution for PH in 2013; (b): the frequency distribution for PH in 2014; (c): the frequency distribution for SW in 2013; (d): the frequency distribution for SW in 2014; (e): the frequency distribution for Oil in 2013; (f): the frequency distribution for Oil in 2014; (g): the frequency distribution for Protein in 2013; (h): the frequency distribution for Protein in 2014

Summary of soybean traits in the RIL population and parents The frequency distribution for soybean traits of the RIL population and parents. F: female parent, ‘Huapidou’; M: male parent, ‘Qihuang26’; (a): the frequency distribution for PH in 2013; (b): the frequency distribution for PH in 2014; (c): the frequency distribution for SW in 2013; (d): the frequency distribution for SW in 2014; (e): the frequency distribution for Oil in 2013; (f): the frequency distribution for Oil in 2014; (g): the frequency distribution for Protein in 2013; (h): the frequency distribution for Protein in 2014

Analyses of additive QTLs

In total, 35 additive QTLs for PH, SW, Oil and Protein were identified on 17 chromosomes by ICIM method (Table 3; Figs. 2, 4). A single QTL explained 2.66% (qPH16–1) to 37.61% (qPH8–1) of phenotypic variance. Among the QTLs, 14 QTLs of them were observed for the first time (Table 3). A total of 21 QTLs were related to the region of the QTLs reported previously, and 19 of them were co-located in the regions with shorter intervals than previously reported, which might provide more detailed information for gene identification.
Table 3

QTLs identified by ICIM

QTLChraLeft markerRight markerGenetic position (cM)Physical position (bp)LODbPVE(%)cADDdQTLs reportede
StartEndDistanceStartEndDistance
qPH1–1 1Marker4368622Marker421740920.6221.090.472,481,1992,518,04336,8448.814.662.65qPH-1-1 [14]
qPH5–1 5Marker1701487Marker17391840.811.290.483,041,0623,116,23075,1686.659.305.05qPH05–1 [26]
qPH6–1 6Marker131534Marker17259976.6977.500.8117,471,67118,171,743700,07212.687.625.02plant height 17–6 [25]
qPH7–1 7Marker5882404Marker576223452.8354.571.7442,930,49942,952,37321,8747.864.09−3.21
qPH8–1 8Marker2999607Marker298102363.8765.771.9013,832,45814,205,949373,49119.6137.618.58
qPH8–2 8Marker2962002Marker2806136174.90175.240.3445,424,86145,433,16583047.093.943.65
qPH16–1 16Marker3112758Marker318237281.5684.522.966,932,3367,381,056448,7204.192.66−3.20
qPH17–1 17Marker403730Marker59978541.5744.663.0815,333,36015,937,896604,5368.2811.885.78qPH17–1 [26]
qPH19–1 19Marker2154660Marker199346233.7735.141.3638,782,15438,837,78755,63317.2911.566.43plant height 6–4 [19]
qSW5–1 5Marker1790368Marker171328321.6222.631.015,022,4245,076,53154,1077.637.440.53seed weight 25–1 [37]; seedweight 50–17 [67]
qSW5–2 5Marker1954345Marker1983847100.58101.741.1637,080,53337,099,61019,0773.563.48−0.15
qSW9–1 9Marker4666534Marker470672796.89101.895.0013,567,36913,932,346364,9778.789.530.53seed weight 15–6 [28]
qSW11–1 11Marker1480158Marker16288382.555.272.721,243,4131,813,344569,9316.205.170.38
qSW11–2 11Marker1573686Marker160389059.1365.876.7410,726,15425,136,00214,409,8486.415.43−0.37seed weight 32–1 [68]
qSW12–1 12Marker5558494Marker559279118.4723.124.653,998,3954,965,904967,50917.8322.77−0.77
qSW13–1 13Marker3613921Marker3643913103.37104.050.6829,371,75729,486,787115,03015.1218.55−0.70seed weight 6–6 [27]; seed weight 15–3 [28]; seed weight 49–14 [30]; seed weight 44–2 [29]
qSW15–1 15Marker2580154Marker271258922.4926.303.8113,761,33413,880,441119,10712.5112.21−0.46seed weight 2–3 [31]; seed weight 34–12 [32]
qSW20–1 20Marker809331Marker99844387.3089.412.1135,452,60835,720,703268,0955.975.260.39seed weight 36–5 [32]
qSW20–2 20Marker846233Marker994705165.31169.834.5245,972,28646,673,050700,7643.963.970.35seed weight 50–16 [67]
qOil1–1 1Marker4303320Marker415666061.5066.404.9035,010,73837,973,8042,963,06622.8930.250.42
qOil1–2 1Marker4417270Marker444963277.8580.372.5135,669,86736,135,498465,63112.2914.19−0.24
qOil6–1 6Marker172599Marker29311577.5078.320.8218,171,37218,533,013361,6417.985.59−0.22seed oil 23–1 [28]
qOil10–1 10Marker1115605Marker10535739.009.810.825,243,9535,694,905450,95211.418.12−0.23
qOil10–2 10Marker1281634Marker1235885180.85183.082.2341,520,25441,545,66425,41023.6120.170.37
qOil13–1 13Marker3440932Marker363536842.2244.892.6714,747,49416,600,1181,852,6246.624.31−0.09seed oil 24–4 [33]
qOil19–1 19Marker2188637Marker20546498.9210.221.3134,198,68434,698,502499,81812.698.92−0.24mqseed oil-022 [34]
qOil19–2 19Marker2061159Marker227056118.2019.090.8836,014,69536,059,93145,2364.582.910.11seed oil 2–7 [35]
qProtein1–1 1Marker4337527Marker430332058.3261.503.1814,274,64637,973,80423,699,15811.0517.68−0.37
qProtein3–1 3Marker6774045Marker6819354146.11151.725.6137,481,86038,933,7281,451,8682.613.57−0.14seed protein 21–9 [25]
qProtein4–1 4Marker6548332Marker673076031.6532.831.185,276,1375,604,533328,3963.869.850.23seed protein 19–1 [69]
qProtein5–1 5Marker1695301Marker184774674.4377.653.2332,108,88232,889,638780,7562.904.05−0.17
qProtein8–1 8Marker2934032Marker2871125186.80189.122.3243,600,62343,628,81628,1933.194.560.18seed protein 21–1 [25]
qProtein10–1 10Marker1235885Marker1249450183.08184.241.1641,545,29342,037,748492,4556.9416.83−0.36seed protein 27–5 [37]; qPro10a [38]
qProtein15–1 15Marker2482668Marker237948256.9857.120.1315,668,56715,751,33882,7715.5114.36−0.25seed protein 4–13 [36]
qProtein16–1 16Marker3134296Marker3275652122.68125.743.0631,042,21331,372,812330,5994.186.240.23

achr, chromosome; b LOD, logarithm of odds; c PVE, phenotypic variance explained; d ADD, additive effect; e QTL names based on soybase.org and previous reports

Fig. 4

QTL mapping for the soybean traits with ICIM-ADD method. (a): LOD curve on the genome for PH; (b): LOD curve on the genome for SW; (c): LOD curve on the genome for Oil; (d): LOD curve on the genome for Protein

QTLs identified by ICIM achr, chromosome; b LOD, logarithm of odds; c PVE, phenotypic variance explained; d ADD, additive effect; e QTL names based on soybase.org and previous reports QTL mapping for the soybean traits with ICIM-ADD method. (a): LOD curve on the genome for PH; (b): LOD curve on the genome for SW; (c): LOD curve on the genome for Oil; (d): LOD curve on the genome for Protein CIM method was also used to identify QTLs separately for 2 years. In total, 18 QTLs were observed on 11 chromosomes in 2013, and 21 QTLs were found on 15 chromosomes in 2014 (Additional file 4; Fig. 2). Of these QTLs, 31 QTLs were both identified by two methodsMoreover, q2013PH19–1 and q2014PH19–2 with shorter intervals than previously reported [19], which were located in the same confidence intervals, might be stable across both years. It was noteworthy that q2013Oil1–1 was placed in the same confidence intervals as q2013Protein1–1, and q2014Oil10–2 was placed in the same confidence intervals as q2013protein10–1, which might be useful in the coordinated improvement of seed quality for soybean breeding.

Analyses of epistatic effects

A total of 3 pairs of epistatic QTLs involving 6 loci on 5 chromosomes were identified for Protein (Table 4). The epistatic effect explained 5.49%, 4.49% and 4.06% of the PV, respectively. Pair one was composed of 2 QTLs, qProtein2–1 and qProtein3–2, with the PVE of 5.49%. qProtein5–2 was observed to have the epistatic effect with qProtein12–1. Meanwhile, qProtein3–3 showed epistatic interaction with qProtein17–1. However, no epistatic effect was observed for PH, SW and Oil.
Table 4

Epistatic QTLs detected by ICIM-EPI

TraitQTLChraMarker intervialQTLChraMarker intervialLODbPVE(%)cAddd
Protein qProtein2–1 2Marker6076626 – Marker6311989 qProtein3–2 3Marker6826333 – Marker68480608.435.49−0.31
qProtein5–2 5Marker1775833 – Marker1882558 qProtein12–1 12Marker5423318 – Marker54863087.334.49−0.29
qProtein3–3 3Marker6856695 – Marker7017453 qProtein17–1 17Marker379161 – Marker4881096.734.060.24

achr, chromosome; b LOD, logarithm of odds; c PVE, phenotypic variance explained; d ADD, additive effect

Epistatic QTLs detected by ICIM-EPI achr, chromosome; b LOD, logarithm of odds; c PVE, phenotypic variance explained; d ADD, additive effect

Prediction of candidate genes

After filtering QTLs by the PVE and physical distance, 18 QTLs were used to mine candidate genes. According to the physical map, a total of 89, 144, 16 and 64 genes were screened in the interval of the filtered QTLs associated with PH, SW, Oil and Protein, respectively. Based on the expression data of candidate genes from phytozome and soybase (Fig. 5), 43 genes were considered to be potential candidates (Table 5). All genes from specific QTLs intervals were evaluated based on their expression pattern in different organs. In the case of genes from the QTLs associated with PH, 13 genes showed higher expression in stem and shoot apical meristem, indicating it might be considered as candidate genes related to PH (Fig. 5a). A total of 14 genes in the interval of the QTLs associated with SW, expressed in seed development stages (10 to 42 days after flowering), might participate in the pathways affecting SW (Fig. 5b and 5c). As the accumulation of oil and protein in soybean seed was throughout the seed development stage [20, 21], gene expressed sustainably in seed development stage might affect the biological process associated with oil and protein. In the present study, there were 7 genes found in the region of QTLs related to Oil, which expressed stably throughout the seed development stage, suggesting it might be associated with Oil (Fig. 5d). Meanwhile, 9 genes in the interval of QTLs for Protein, showed expression in the seed development stage, might be associated with Protein (Fig. 5e).
Fig. 5

Tissue specific expression of candidate genes. RH, root hair; SAM, shoot apical meristem; DAF, day after flowering; (a), expression of candidate gene related to PH; (b), expression of candidate genes related to SW on chr12; (c), expression of candidate genes related to SW on chr5, chr13 and chr15; (d), expression of candidate genes related to Oil; (e), expression of candidate genes related to Protein

Table 5

Annotations of candidate genes

QTLGeneStartStopAnnotation
qPH5–1 Glyma.05 g035000 3,078,0123,084,940Xanthine/uracil permease family protein
qPH8–1 Glyma.08 g173700 13,838,45813,841,300Photosystem II subunit R
Glyma.08 g174000 13,887,78113,892,150cAMP-regulated phosphoprotein 19-related protein
Glyma.08 g175200 13,981,18713,982,765Glutathione S-transferase TAU 19
Glyma.08 g175800 14,037,03914,042,017Aldolase-type TIM barrel family protein
Glyma.08 g175900 14,046,31414,050,528TCP-1/cpn60 chaperonin family protein
Glyma.08 g177100 14,195,42814,196,376
Glyma.08 g177200 14,199,86214,201,145Arabinogalactan protein 1
qPH8–2 Glyma.08 g337400 45,423,99945,433,706Transducin/WD40 repeat-like superfamily protein
qPH17–1 Glyma.17 g167700 15,479,88015,481,925CYCLIN D3;2
Glyma.17 g169100 15,650,60715,655,2152OG-Fe(II) oxygenase superfamily protein
Glyma.17 g169200 15,679,75315,681,009Protein of unknown function (DUF579)
Glyma.17 g169700 15,823,86615,827,109Sumo conjugation enzyme 1
qSW5–1 Glyma.05 g055700 5,050,7285,055,325Beta vacuolar processing enzyme
qSW12–1 Glyma.12 g055200 3,998,9473,999,707Histone superfamily protein
Glyma.12 g056700 4,123,1374,123,683
Glyma.12 g058200 4,235,9684,237,273HSP20-like chaperones superfamily protein
Glyma.12 g058500 4,254,1664,256,321Adenine nucleotide alpha hydrolases-like superfamily protein
Glyma.12 g064000 4,707,1974,709,987Heat shock protein 70
Glyma.12 g064300 4,739,2164,740,496Pathogenesis-related thaumatin superfamily protein
Glyma.12 g064500 4,752,3664,753,273Dessication-induced 1VOC superfamily protein
Glyma.12 g065100 4,793,4224,794,276Histone superfamily protein
Glyma.12 g065200 4,801,0834,802,209Histone superfamily protein
Glyma.12 g065400 4,806,2424,807,054Histone superfamily protein
Glyma.12 g065600 4,818,4894,822,602Ras-related small GTP-binding family protein
Glyma.12 g066700 4,824,8104,828,762Ribosomal protein L23AB
qSW13–1 Glyma.13 g180400 29,380,77929,383,908Thioredoxin family protein
qOil1–2 Glyma.01 g105600 35,687,68635,690,416
Glyma.01 g106000 35,856,49735,858,671Glutathione S-transferase TAU 8
Glyma.01 g106100 35,881,00735,883,036Glutathione S-transferase TAU 8
Glyma.01 g106200 35,918,03435,919,778Adenine nucleotide alpha hydrolases-like superfamily protein
Glyma.01 g106500 36,103,01036,105,425
qOil10–2 Glyma.10 g181800 41,516,21141,528,077Calcium-binding EF-hand family protein
Glyma.10 g181900 41,531,67441,535,236Trigalactosyldiacylglycerol 1
qProtein8–1 Glyma.08 g316200 43,611,37443,617,841Endoribonuclease L-PSP family protein
qProtein10–1 Glyma.10 g182500 41,577,97241,579,422Ribosomal protein L14
Glyma.10 g183800 41,699,75141,702,224Differentiation and greening-like 1
Glyma.10 g183900 41,719,26541,728,269Peptide transporter 3
Glyma.10 g184900 41,781,11141,786,726Ureidoglycolate amidohydrolase
Glyma.10 g186000 41,910,76041,912,094Phosphoenolpyruvate carboxylase kinase 1
Glyma.10 g186300 41,926,18741,928,530Small nuclear ribonucleoprotein family protein
Glyma.10 g187200 42,035,25242,037,969P-loop containing nucleoside triphosphate hydrolases superfamily protein
qProtein15–1 Glyma.15 g171800 15,667,78815,671,786Eukaryotic translation initiation factor 4A1
Tissue specific expression of candidate genes. RH, root hair; SAM, shoot apical meristem; DAF, day after flowering; (a), expression of candidate gene related to PH; (b), expression of candidate genes related to SW on chr12; (c), expression of candidate genes related to SW on chr5, chr13 and chr15; (d), expression of candidate genes related to Oil; (e), expression of candidate genes related to Protein Annotations of candidate genes

Discussion

Construction of a high-density genetic map based on SLAF markers

QTL mapping has been used as an efficient approach to analyze quantitative traits in plants. Parental genetic diversity and marker density are the major factors affecting the efficiency and accuracy of QTL mapping. In this study, the female parent, ‘Huapidou’, was a semi-wild soybean gerplasm, which showed high resistance to whitefly [22]. ‘Qihuang26’, with more than 46% of protein conten in seeds, was a main variety in Huang-Huai-Hai region of China. In the present study, four traits of the RIL population derived from Huapidou and Qihuang26 showed to be continuous with normal or skew normal distributions. Increasing marker density could improve the resolution of genetic map for a given mapping population [23]. SLAF-seq is an effective sequencing-based method for large-scale marker discovery and genotyping, which has been used for genetic analysis in many species [8–13, 24]. In the present study, we used 5111 high-quality SLAF markers to construct a high-density map, and a total of 8597 SNP loci were integrated into 20 LGs ultimately. This high-density genetic map, making QTL mapping more accurate and reliable, would be beneficial to MAS breeding.

QTL mapping in soybean using a high-density map

Soybean is a primary source of plant oil and protein for humans due to its high nutritional value. PH and SW were main yield-related traits in soybean. So far, markers associated with the QTL underlying PH, SW, Oil and Protein have been mapped onto all linkage groups. In total, there were 35 QTLs for PH, SW, Oil and Protein observed using a high-density map based on an F8:11 RIL population with 149 individuals from the cross between ‘Huapidou’ and ‘Qihuang26’. Furthermore, there were 14 novel QTLs related to PH, SW, Oil and Protein, indicating the distinct genetic architecture in the population derived from cultivated soybean and semi-wild soybean. Among the novel QTLs, qPH8–1 had the highest PVE value and the highest LOD value might be the major QTL related to PH. It was notable that qSW13–1 explained the hightest PV in the QTLs identified for SW. More remarkably, four novel QTLs for Oil, inculding qOil1–1, qOil1–2, qOil10–1 and qOil10–2 explained up to 72.73% of the PV for Oil, which suggested it might be potential loci to Oil. qProtein1–1, which explained 17.68% of the PV, might be an major QTL for further fine mapping. So many novel QTLs observed in the present study indicated that more germplasms need to be used for revealing the complex genetic basis of soybean. The stability of QTL is essential for the use in a breeding programme. In the study, 31 QTLs were identified by both ICIM and CIM methods. Furthermore, one QTL for PH was identified by CIM in both experiments from 2013 and 2014. In addition, more than half of the QTLs had been reported. Five QTLs for PH detected in this study, were colocalized as reported [14, 19, 25, 26]. Two major QTLs associated with SW, qSW13–1 and qSW15–1, both with the physical distance of approximate 11 Kb, explained 18.55% and 12.21 of the PV for SW, respectively. qSW13–1 had been reported as being associated with L050–14 [27], Satt144 [28, 29] and Sat_103 [30]. qSW15–1 had been detected in two soybean populations, derived from ‘Young’ and ‘PI416937’ (Pop1), ‘PI97100’ and ‘Coker 237’ (Pop2) [31]. Han et al. also identified the similar QTL on chr15 in the population from a cross between ‘Hefeng25’ and ‘Conrad’ [32]. Therefore, qSW13–1 and qSW15–1 might be considered as major and stable QTLs for further fine mapping and map-based cloning to elucidate the mechanisms of SW. In the present study, four QTLs related to Oil had been reported [28, 33–35], inculding qOil6–1, qOil13–1, qOil19–1 and qOil19–2, but none of them explained more than 10% of the PV. Lee et al. [36] reported cr274_1 associated with Protein on chr15 using a population derived from ‘Young’ and ‘PI416937’. The QTL for Protein between Satt173 and Satt581 on chr10 had been identified [37], similar with the result of Liu et al. [38]. Our study detected two QTLs related to Protein, qProtein10–1 and qProtein15–1, with 16.83% and 14.36% of the PVE, respectively, mapped on the same area as previous studies [36-38], might be good for MAS breeding and accurate QTL localization. Several QTLs of various traits can map to the same locus [14, 39]. In this study, two pairs of QTLs, q2013Oil1–1 and q2013Protein1–1 as well as q2014Oil10–2 and q2013protein10–1, with inverse additive effect for Oil and Protein, were located in the same marker interval (Fig. 4; Additional file 4), which implies that q2013Oil1–1 and q2014Oil10–2 not only control oil content in seeds but also affect protein content in seeds. It is consistent with previous reports that an negative correlation is in agreement between protein and oil concentration in soybean seeds [40, 41]. Knowledge of epistasis effect, which is defined as interactions between alleles of two or more genetic loci, is essential to understand the genetic mechanism and the gene networks underlying complex traits. In this study, 3 pairs of epistatic QTLs for Protein were identified by ICIMapping-EPI. However, these epistatic QTLs did not display additive effect alone. It might be considered modifying genes that have no significant effects alone but might affect the expression of Protein related genes through epistatic interactions. Nevertheless, epistatic interaction could not be detected in some map populations [42]. It might be the reason that no epistatic effect observed for PH, SW and Oil in the present study.

Gene mining based on precise QTLs

As the average ratio of gene to physical distance is about 1 gene per 20 Kb in soybean genome [43], the accuracy of QTL mapping is of great benifit to gene localization and identification. The physical distance of 11 QTLs in the current study was shorter than 100 Kb, which might lead to a direct opportunity to find candidate genes by bioinformatics tools. For example, the minimum confidence interval of qPH8–2 was 8.3 Kb, which was much shorter than 0.09 Mb detected previously [26]. Furthermore, Glyma.08 g337400, encoding a transducin/WD40 repeat-like superfamily protein, was predicted in the interval of qPH8–2, which might be a promising target to engineer transgenic plants with higher biomass and improved growth development for plant-based bioenergy production [44]. In the interval of qPH17–1, Glyma.17 g169100, encoding a 2OG-Fe(II) oxygenase superfamily protein, was one of the important gibberellin oxidase genes [45], which might affect plant height directly. Glyma.17 g167700, encoding a growth regulator CYCLIN D3–2, expressed in growing shoot apices preferentially [46-48]. In the interval of qSW5–1, Glyma.05 g055700, encoding beta vacuolar processing enzyme, was involved in seed coat formation at the early stage of seed development [49]. In the interval of qSW12–1, ribosomal protein L23AB encoded by Glyma.12 g066700, was required for normal development [50]. Glyma.01 g106000 and Glyma.01 g106100 in the interval of qOil1–2, encoding Glutathione S-transferase TAU 8, might influence Oil by suppressing lipid peroxidation [51]. There were 3 genes found in the confidence interval of qOil10–2, which explained 20.17% of the PV for oil, inculding Glyma.10 g181800, Glyma.10 g181900 and Glyma.10 g182000. Moreover, Glyma.10 g181900, encoding a trigalactosyldiacylglycerol 1 protein (TGD1), affected the metabolic flux of chloroplast lipid synthesis and photosynthetic capacity, which resulted in the change of fatty acid in leaf and seed [52-55]. It was noteworthy that qOil10–2 was placed in the same confidence intervals as q2013protein10–1. The inverse relationship between oil and protein in soybean seed is well documented in the previous reports [40, 41]. However, little study of TGD1 had been reported on the function of protein accumulation in seeds. In the interval of qProtein10–1, Glyma.10 g183900, encoding peptide transporter 3, contributed to nitrogen allocation and grain Yield [56], Glyma.10 g184900, encoding a ureidoglycolate amidohydrolase, played a key role in nitrogen transport and storage [57-59]. In a word, on the basis of the physical position of these precise QTLs detected using a high-density map in the present study, it would be easy to find candidate gene.

Conclusions

In this study, we genotyped a RIL population (Huapidou × Qihuang26) by SLAF-seq. A high-density genetic map for soybean was constructed and used to identify QTLs associated with four traits, including plant height, seed weight, oil content seed and protein content in seed. A total of 35 QTLs related to four traits were identified. Of these QTLs, 21 QTLs were coincident with previous research. Furthermore, three pairs epistatic QTLs involving 6 loci on 5 chromosomes were identified for Protein. In addition, 43 genes with tissue-specific expression patterns were considered to be potential genes in further study. Our findings might be of great useful for MAS breeding, and could provide detailed information for accurate QTL localization.

Methods

Plant material and phenotyping

F8:9 and F8:10 populations of 149 RILs derived from a cross between ‘Huapidou (ZDD09982)’ and ‘Qihuang26 (ZDD23189)’ were planted in the experiment field of Shandong Academy of Agricultural Sciences in Jinan, Shandong Province, China, in 2013 and 2014, respectively. Each individual was planted in one row using single seed sowing; each row was 3 m, with 50 cm row spacing and 10 cm plant spacing, with three replicates. Five plants in each replicate were selected randomly to calculate the plant height (PH). The weight of 100 random filled seeds was measured as seed weight (SW). Oil and protein in soybean seed were detected by DA 7200 NIR food analyzer (Perten, Switzerland). SW, Oil and Protein were repeat 3 times in each replicate. Frequency distribution, descriptive statistics, the broad-sense heritability (h) and the analysis of variance for RIL population and parents were analyzed with the SPSS statistics 17.0 and Microsoft Excel 2010. The h was estimated as described by previous study [60].

DNA extraction and genotyping

Seedlings of the F8:11 population of 149 RILs and parents were planted in 2016. Young healthy leaves from the two parents and RIL individuals were collected and genomic DNA was extracted by the CTAB method [61]. DNA was quantified with NanoDrop and by electrophoresis in 1% agarose gels with a λ DNA standard. SLAF-seq was used to genotype a total of 151 samples (149 individuals and two parents) as described by previous study [7]. All polymorphic SLAFs were genotyped with consistency in the offspring and parental SNP loci. All SLAF markers should be filtered in quality assessment. A SLAF marker with parental homozygous, which had less than three SNPs, average depths of each sample above 3, was used as a high quality SLAF marker.

Linkage map construction

Before map construction, SLAF marker should be filtered by linkage analysis, markers with the MLOD value > 3 were used to construct genetic linkage map. SLAF markers with high quality were located into 20 LGs. HighMap Strategy was used to order SLAF markers and correct genotyping errors [24]. All LGs should be undergone these procedures: first, markers were arranged by their locations on choromosome; second, genotyping errors or deletions were corrected by SMOOTH [62], according to the relationship between ordered markers; then MSTmap was used to order the map [63]; after that, SMOOTH was used again to corrected the new ordered genotype. High-quality map would be obtained after 4 or more cycles. Map distance was estimated using the Kosambi mapping function.

QTL mapping

Based on the high-density genetic map, the QTLs underlying PH, SW, Oil and Protein were identified by QTL ICIMapping V3.3 software [64]. Inclusive Composite Interval Mapping (ICIM) and Composition interval mapping (CIM) methods were used to identify the QTLs. The threshold of logarithm of odds (LOD) score for evaluating the statistical significance of QTL effect was determined using 1000 permutations at the 5% level of significance. The location of a QTL was described according to its LOD peak location and thesurrounding region with 95% confidence interval [65]. As a result, intervals with a LOD value above 2.5 were detected as effective additive QTLs using ICIM-ADD method; the pairs of QTLs with a LOD value above 5.0 were considered as valid epistatic QTLs using ICIM-EPI method.

Gene mining

According to the marker sequence, QTLs were projected from the genentic map onto Williams 82 physical map (Glyma.Wm82.a2.v1). The QTL with the PVE above 10% and the physical distance less than 1 Mb was used to screen the candidate genes. The QTL with the physical distance less than 100 Kb was also chosen to screen the candidate genes. Gene calls and annotations were retrieved using Glyma.Wm82.a2.v1 gene model from Soybase. The expression data of candidate genes in the seed development stages based on RNA-seq in previous study were obtained from soybase [66]. The expression values of candidate genes in flower, leaf, nodule, pod, root, root hair, seed, shoot apical meristem and stem were downloaded from phytozome (www.phytozome.org). Multiple array viewer (version 4.9.0) was used to construct the heat map to analyze the tissue-specific expression of candidate genes. Genotype list of 149 RILs. (XLSX 2691 kb) SLAF markers on the 20 linkage groups. (XLSX 129 kb) Information of SLAF markers on the genetic map. (PARTIAL 4956 kb) Additive QTLs identified by CIM in both experiments from 2013 and 2014. (PDF 168 kb)
  46 in total

1.  The CTAB-DNA precipitation method: a common mini-scale preparation of template DNA from phagemids, phages or plasmids suitable for sequencing.

Authors:  G Del Sal; G Manfioletti; C Schneider
Journal:  Biotechniques       Date:  1989-05       Impact factor: 1.993

2.  QTL mapping of ten agronomic traits on the soybean ( Glycine max L. Merr.) genetic map and their association with EST markers.

Authors:  W-K Zhang; Y-J Wang; G-Z Luo; J-S Zhang; C-Y He; X-L Wu; J-Y Gai; S-Y Chen
Journal:  Theor Appl Genet       Date:  2004-01-22       Impact factor: 5.699

3.  Construction and analysis of high-density linkage map using high-throughput sequencing data.

Authors:  Dongyuan Liu; Chouxian Ma; Weiguo Hong; Long Huang; Min Liu; Hui Liu; Huaping Zeng; Dejing Deng; Huaigen Xin; Jun Song; Chunhua Xu; Xiaowen Sun; Xilin Hou; Xiaowu Wang; Hongkun Zheng
Journal:  PLoS One       Date:  2014-06-06       Impact factor: 3.240

4.  High-resolution mapping of QTL for fatty acid composition in soybean using specific-locus amplified fragment sequencing.

Authors:  Bin Li; Shengxü Fan; Fukuan Yu; Ying Chen; Shengrui Zhang; Fenxia Han; Shurong Yan; Lianzheng Wang; Junming Sun
Journal:  Theor Appl Genet       Date:  2017-04-07       Impact factor: 5.699

5.  Construction of high-density genetic map and QTL mapping of yield-related and two quality traits in soybean RILs population by RAD-sequencing.

Authors:  Nianxi Liu; Mu Li; Xiangbao Hu; Qibin Ma; Yinghui Mu; Zhiyuan Tan; Qiuju Xia; Gengyun Zhang; Hai Nian
Journal:  BMC Genomics       Date:  2017-06-19       Impact factor: 3.969

6.  SLAF-seq: an efficient method of large-scale de novo SNP discovery and genotyping using high-throughput sequencing.

Authors:  Xiaowen Sun; Dongyuan Liu; Xiaofeng Zhang; Wenbin Li; Hui Liu; Weiguo Hong; Chuanbei Jiang; Ning Guan; Chouxian Ma; Huaping Zeng; Chunhua Xu; Jun Song; Long Huang; Chunmei Wang; Junjie Shi; Rui Wang; Xianhu Zheng; Cuiyun Lu; Xiaowu Wang; Hongkun Zheng
Journal:  PLoS One       Date:  2013-03-19       Impact factor: 3.240

7.  GIGANTUS1 (GTS1), a member of Transducin/WD40 protein superfamily, controls seed germination, growth and biomass accumulation through ribosome-biogenesis protein interactions in Arabidopsis thaliana.

Authors:  Emma W Gachomo; Jose C Jimenez-Lopez; Lyla Jno Baptiste; Simeon O Kotchoni
Journal:  BMC Plant Biol       Date:  2014-01-27       Impact factor: 4.215

8.  High-density genetic map construction and gene mapping of pericarp color in wax gourd using specific-locus amplified fragment (SLAF) sequencing.

Authors:  Biao Jiang; Wenrui Liu; Dasen Xie; Qingwu Peng; Xiaoming He; Yu'e Lin; Zhaojun Liang
Journal:  BMC Genomics       Date:  2015-12-09       Impact factor: 3.969

9.  Low-Temperature-Induced Expression of Rice Ureidoglycolate Amidohydrolase is Mediated by a C-Repeat/Dehydration-Responsive Element that Specifically Interacts with Rice C-Repeat-Binding Factor 3.

Authors:  Juan Li; Rui-Ying Qin; Hao Li; Rong-Fang Xu; Ya-Chun Yang; Da-Hu Ni; Hui Ma; Li Li; Peng-Cheng Wei; Jian-Bo Yang
Journal:  Front Plant Sci       Date:  2015-11-13       Impact factor: 5.753

10.  High-Density Genetic Mapping Identifies New Major Loci for Tolerance to Low-Phosphorus Stress in Soybean.

Authors:  Dan Zhang; Hongyan Li; Jinshe Wang; Hengyou Zhang; Zhenbin Hu; Shanshan Chu; Haiyan Lv; Deyue Yu
Journal:  Front Plant Sci       Date:  2016-03-30       Impact factor: 5.753

View more
  4 in total

1.  High-Resolution Mapping in Two RIL Populations Refines Major "QTL Hotspot" Regions for Seed Size and Shape in Soybean (Glycine max L.).

Authors:  Aiman Hina; Yongce Cao; Shiyu Song; Shuguang Li; Ripa Akter Sharmin; Mahmoud A Elattar; Javaid Akhter Bhat; Tuanjie Zhao
Journal:  Int J Mol Sci       Date:  2020-02-04       Impact factor: 5.923

2.  Quantitative trait loci analysis of seed oil content and composition of wild and cultivated soybean.

Authors:  Yanjie Yao; Qingbo You; Guozhan Duan; Jianjun Ren; Shanshan Chu; Junqing Zhao; Xia Li; Xinan Zhou; Yongqing Jiao
Journal:  BMC Plant Biol       Date:  2020-01-31       Impact factor: 4.215

3.  Seed protein content and its relationships with agronomic traits in pigeonpea is controlled by both main and epistatic effects QTLs.

Authors:  Jimmy Obala; Rachit K Saxena; Vikas K Singh; Sandip M Kale; Vanika Garg; C V Sameer Kumar; K B Saxena; Pangirayi Tongoona; Julia Sibiya; Rajeev K Varshney
Journal:  Sci Rep       Date:  2020-01-14       Impact factor: 4.379

4.  Identification of quantitative trait loci underlying five major agronomic traits of soybean in three biparental populations by specific length amplified fragment sequencing (SLAF-seq).

Authors:  Bo Hu; Yuqiu Li; Hongyan Wu; Hong Zhai; Kun Xu; Yi Gao; Jinlong Zhu; Yuzhuo Li; Zhengjun Xia
Journal:  PeerJ       Date:  2021-12-14       Impact factor: 2.984

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.