Literature DB >> 28389769

High-resolution mapping of QTL for fatty acid composition in soybean using specific-locus amplified fragment sequencing.

Bin Li1, Shengxü Fan1, Fukuan Yu1, Ying Chen1, Shengrui Zhang1, Fenxia Han1, Shurong Yan1, Lianzheng Wang1, Junming Sun2.   

Abstract

KEY MESSAGE: We constructed a high-density linkage map comprising 3541 markers developed by specific-locus amplified fragment sequencing, and identified 26 stable QTL including nine novel loci, for fatty acid composition in soybean. Soybean oil quality and stability are mainly determined by the fatty acid composition of the seed. In the present study, we constructed a high-density genetic linkage map using 200 recombinant inbred lines derived from a cross between cultivated soybean varieties Luheidou2 and Nanhuizao, and SNP markers developed by specific-locus amplified fragment sequencing (SLAF-seq). This map comprises 3541 markers on 20 linkage groups and spans a genetic distance of 2534.42 cM, with an average distance of 0.72 cM between adjacent markers. Inclusive composite interval mapping revealed 26 stable QTL for five fatty acids, explaining 0.4-37.0% of the phenotypic variance for individual fatty acids across environments. Of these QTL, nine are novel loci (qLA1, qLNA2_1, qPA4_1, qLA4_1, qPA6_1, qSA12_1, qPA16_1, qOA18_1, and qFA19_1). These stable QTL harbor three fatty acid biosynthesis genes (GmFabG, GmACP, and GmFAD8), and 66 genes encoding lipid-related transcription factors. These stable QTL and tightly linked SNP markers can be used for marker-assisted selection in soybean breeding programs.

Entities:  

Mesh:

Substances:

Year:  2017        PMID: 28389769      PMCID: PMC5487593          DOI: 10.1007/s00122-017-2902-8

Source DB:  PubMed          Journal:  Theor Appl Genet        ISSN: 0040-5752            Impact factor:   5.699


Introduction

Soybean (Glycine max L. Merr.) is one of the most important oilseed crops worldwide. Providing most of the world’s supply of vegetable protein and oil, soybean accounted for approximately 61% of the world’s oilseed production in 2015 (http://soystats.com). The quality and stability of soybean oil are mainly determined by five predominant fatty acids, viz. palmitic, stearic, oleic, linoleic, and linolenic acids (Lee et al. 2007). Palmitic (16:0) and stearic (18:0) acids are saturated fatty acids, while oleic (18:1), linoleic (18:2), and linolenic (18:3) acids are unsaturated fatty acids. A high proportion of unsaturated fatty acids in the human diet benefits cardiovascular health (Connor 2000; Mensink and Katan 1992). However, polyunsaturated fatty acids, particularly linolenic acid, increase the oxidation of food oils, causing an off-flavor and reducing the shelf life of the oil (Hu et al. 1997; Mounts et al. 1988). Therefore, one important focus of soybean breeding is to improve the fatty acid composition in seed oil. Fatty acid content is a quantitative trait that depends on the combined effects of several major and minor genes (Bilyeu et al. 2005; Fan et al. 2015; Wang et al. 2014). Therefore, quantitative trait loci (QTL) mapping is an effective method to uncover the genetic basis of fatty acid formation. To date, numerous QTL for fatty acid contents have been detected (Alrefai et al. 1995; Bachlava et al. 2009; Brummer et al. 1997; Diers and Shoemaker 1992; Fan et al. 2015; Hyten et al. 2004; Panthee et al. 2006; Reinprecht et al. 2006; Wang et al. 2012, 2014; Xie et al. 2012). However, these QTL span fairly large genomic regions due to the relatively low density of genetic maps. The relatively low accuracy of QTL mapping using these maps limits not only the identification of fatty acid biosynthesis and regulatory networks, but also the application of these QTL in marker-assisted selection (MAS) breeding efforts in soybean. Recently, putative nucleotide polymorphisms responsible for fatty acid contents, which are usually denoted as quantitative trait nucleotides (QTN), were also identified in genome-wide association study (GWAS) based on population-wide linkage disequilibrium (LD) using soybean natural populations and genome-wide single nucleotide polymorphisms (SNP) (Li et al. 2015). The annotated candidate genes bearing these QTN demonstrated that fatty acid formation is governed by a complex genetic basis in soybean (Li et al. 2015). With the great development in next-generation sequencing (NGS), several procedures were developed for SNP discovery and genotyping in large population, including restriction-site associated DNA tag sequencing (RAD-seq), genotyping-by-sequencing (GBS), and specific-locus amplified fragment sequencing (SLAF-seq), etc. (Baird et al. 2008; Elshire et al. 2011; Sun et al. 2013). These procedures reduced the genome complexity by digesting genomic DNA with restriction enzymes, and the resultant reduced representation library (RRL) was sequenced to achieve SNP discovery and genotyping in large population. Specifically, a pre-design experiment was performed in SLAF-seq to evaluate restriction enzymes and sizes of restriction fragments using the soybean reference genome sequence, which improved the efficiency of SLAF-seq (Sun et al. 2013). Additionally, the fragments in GBS library were usually selected through PCR amplification (Elshire et al. 2011). In contrast, the fragments in SLAF library were gel-purified, and the fragments with specific size were selected in subsequent sequencing. That will improve the uniformity of fragments in RRL library (Sun et al. 2013). Previously, we developed 200 recombinant inbred lines (RILs) from a cross between two cultivated soybean with different fatty acid compositions. Using 100 of these 200 RILs, We constructed a linkage map consisting of 161 SSR markers, and the QTL for fatty acid composition were identified across 3 years (2009 through 2011) (Fan et al. 2015). In addition, we sequenced 110 of the 200 RILs and the two parents, and developed a high-density genetic map comprising 5785 markers based on the SLAF-seq method (Li et al. 2014). In the current study, we developed a great number of SNP-based markers using SLAF-seq with an increased mapping population size of 200 RILs to improve the efficiency and accuracy of QTL mapping. The results will benefit the improvement of the fatty acid composition of soybean in breeding project.

Materials and methods

Plant materials and field trials

Two hundred F5:7 RILs developed from the Luheidou2 (LHD2)/Nanhuizao (NHZ) cross together with the parental lines were planted with three replicates in randomized complete blocks at Shunyi Experimental Stations (N40°13′ and E116°34′) in Beijing from 2009 to 2011. Each plot comprised a 2-m row, with 0.5 m apart between rows and a space of 0.1 m between adjacent plants (Fan et al. 2015). Both of the parents are wild type cultivated soybean varieties with black seed coats, but their fatty acid composition differs significantly (Fan et al. 2015).

Fatty acid extraction and determination

The composition of five predominant fatty acids (palmitic, stearic, oleic, linoleic, and linolenic acids) was determined using gas chromatography (Fan et al. 2015). Briefly, 20 g of soybean seeds of each line were ground to a fine powder with a Sample Preparation Mill (Retsch ZM100, Φ = 1.0 mm, Rheinische, Germany). Three hundred milligrams of each powdered sample was transferred to a 2-ml centrifuge tube preloaded with 1.5 ml n-hexane. After vigorous mixing, the mixture was stored at 4 °C for 12 h. Then the samples were centrifuged at 5000×g (room temperature) for 10 min. The supernatant was collected and sodium methoxide solution was added. The mixture was shaken for 1 h on a twist mixer (TM-300, ASONE, Japan) for full methyl esterification of the fatty acids, and centrifuged again at 5000×g for 10 min. The supernatant was collected to determine the composition of the five fatty acids (Fan et al. 2015). Fatty acid composition was determined using an RTX-Wax Column (30 m × 0.25 mm × 0.25 mm) of gas chromatography (GC-2010, SHIMADZU, Japan). The injection volume was 1 μL. Nitrogen, hydrogen and air were used as carrier gases. The temperature was initially set at 180 °C for 1.5 min, increased to 210 °C at a rate of 10 °C min−1, and maintained at 210 °C for 2 min, increased to 220 °C at a rate of 5 °C min−1 and maintained at 220 °C for 5 min. The area normalization method was used to calculate the composition (percentage of total fatty acids by mass) of the five fatty acids using a GC2010 workstation (Fan et al. 2015).

Specific-locus amplified fragments (SLAF) library construction and sequencing

We previously constructed and sequenced a SLAF library for 110 of the 200 RILs (Li et al. 2014). In our current study, the SLAF library of the remaining 90 RILs and parents was constructed and sequenced, following the same method, with minor modifications: first, the genomic DNA of each sample was digested with a single restriction enzyme, MseI, rather than both EcoRI and MseI according to the pre-design experiment results based on the latest version of the soybean reference genome sequence (Wm82.a2.v1, https://phytozome.jgi.doe.gov) (Schmutz et al. 2010); second, amplified fragments that were 374–474 bp in length instead of 500–550 bp were gel-purified and diluted for pair-end sequencing using an Illumina high-throughput sequencing platform (Illumina, Inc; San Diego, CA, USA).

SLAF-seq data grouping and genotyping

The SLAF-seq data grouping and genotyping of the 90 RILs were performed following the previously reported method (Li et al. 2014; Sun et al. 2013). Briefly, low-quality reads (quality score <20e) were filtered out and then raw reads were sorted to each progeny according to duplex barcode sequences using SLAF_Poly.pl software (Biomarker, Beijing, China). After the barcodes and the terminal 5-bp positions were trimmed from each high-quality reads, clean reads from the same sample were mapped onto the soybean reference genome sequence (Wm82.a2.v1) using SOAP software (Li et al. 2008b; Schmutz et al. 2010). Sequences mapping to the same position with over 95% identity were defined as one SLAF locus. Since soybean is a diploid species and one locus can only contain at most four SLAF tags, groups containing more than four tags were filtered out as repetitive SLAFs, and the SLAFs with 2–4 tags were identified as polymorphic SLAFs (Sun et al. 2013). Genotype scoring was then performed using a Bayesian approach to further ensure the genotyping quality (Sun et al. 2013). First, a posteriori conditional probability was calculated using the coverage of each allele and the number of single nucleotide polymorphisms. Then, genotyping quality score translated from the probability was used to select qualified markers for subsequent analysis. Low-quality markers for each marker and each individual were counted and the worse markers or individuals were deleted during the dynamic process. When the average genotype quality scores of all SLAF markers reached the cutoff value, the process stopped. The resultant polymorphic SLAFs were integrated with those of other 110 RILs described in our previous study (Li et al. 2014), and the polymorphic SLAF markers for 200 RILs were obtained. Finally, high-quality SLAF markers for the genetic mapping were filtered by the following criteria: First, average sequence depths should >2-fold in each progeny and >10-fold in the parents. Second, markers with more than 25% missing data were filtered. Third, the Chi-square test was performed to examine the segregation distortion. Markers with significant segregation distortion (P < 0.05) were excluded. The final SNP-based polymorphic SLAF markers were used to construct a high-density linkage map.

Construction of a high-density genetic map

Based on the genotyping data of 200 RILs, a high-density genetic map comprising 20 linkage groups (LGs) was constructed using the Kosambi mapping function of the Joinmap v4.0 software with a LOD threshold of 5.0. The collinearity of 20 LGs with the soybean reference genome was analyzed by plotting the genetic positions of SLAF markers against their physical positions in the soybean reference genome (Wm82.a2.v1).

QTL mapping for fatty acid composition

The additive QTL for palmitic, stearic, oleic, linoleic, and linolenic acids were detected using inclusive composite interval mapping (ICIM) in the BIP (bi-parental populations) model of QTL IciMapping software v4.0 (Li et al. 2008a), with the P values for entering variables (PIN) = 0.05. The threshold of the logarithm of the odds (LOD) scores for evaluating the statistical significance of QTL effects was determined using 1000 permutations at the significance level of 0.05. As a result, a LOD score of 3.3 was used as the threshold to declare the presence of a QTL. As the fatty acid composition is affected by the environments, we focused mainly on the QTL for individual fatty acids identified across multiple environments. The epistatic effects of QTL were identified by the ICIM-EPI method based on the BIP model implemented in QTL IciMapping software v4.0, with PIN = 0.05. The LOD threshold of 5.0 was obtained through 1000 permutation to declare the epistatic QTL at the significance level of 0.05.

Annotation of genes within additive QTL intervals

The sequences within QTL intervals were identified according to the soybean reference genome sequence (Wm82.a2.v1, https://phytozome.jgi.doe.gov), and annotated against Nr (non-redundant), Swiss-Prot, and KOG/COG (clusters of orthologous groups) databases using Blastx program (https://blast.ncbi.nlm.nih.gov).

Results

Phenotypic analysis of fatty acid compositions in soybean RIL population

The fatty acid compositions of 200 RILs were determined from 2009 to 2011. As shown in Table 1, the five predominant fatty acids exhibit broad ranges in 200 RILs. Of them, linoleic acid shows the minimum of coefficient of variance (CV) ranging from 4.3 to 6.9%, while stearic acid presents the maximum of CV ranging from 11.3 to 17.8%. The broad sense heritability of five predominant fatty acids ranged from 0.74 to 0.88 over 3 years, suggesting the fatty acids are mainly controlled by genetic factor (Table 1).
Table 1

The characteristics of five fatty acids in 200 soybean RILs from 2009 to 2011

TraitEnvironmentMean ± SD (%)Minimum (%)Maximum (%)CVa Variance P (K–S)b H B2c H BC2d
Pamitic200910.24 ± 0.578.7813.250.0560.330.5630.810.75
201010.89 ± 0.549.2913.750.0490.290.5680.76
201110.75 ± 0.689.2312.960.0640.470.5000.57
Stearic20093.84 ± 0.482.575.110.1240.230.0990.870.74
20103.75 ± 0.432.624.950.1130.180.9680.77
20113.68 ± 0.662.227.060.1780.430.5150.63
Oleic200924.06 ± 2.6719.6135.070.1117.110.2160.880.81
201026.47 ± 4.3917.7139.130.16619.240.3690.87
201122.26 ± 3.0517.6736.280.1379.290.0010.86
Linoleic200953.27 ± 2.3145.9557.500.0435.320.7440.880.80
201051.40 ± 3.5341.6459.000.06912.440.1900.87
201155.05 ± 2.5143.6459.460.0466.280.0030.69
Linolenic20098.58 ± 0.696.7210.560.0810.480.8300.780.88
20107.48 ± 0.885.4710.570.1170.770.9100.84
20118.26 ± 0.805.7410.050.0970.650.8250.89

aCoefficient of variance

b P value in Kolmogorov–Smirnov test

cBroad-sense heritability in each environment

dBroad-sense heritability in combined environments (2009, 2010, and 2011)

The characteristics of five fatty acids in 200 soybean RILs from 2009 to 2011 aCoefficient of variance b P value in Kolmogorov–Smirnov test cBroad-sense heritability in each environment dBroad-sense heritability in combined environments (2009, 2010, and 2011) The frequency distributions of the five fatty acids were also analyzed. Almost all fatty acids exhibit continuous and normal distributions from 2009 to 2011, except oleic and linoleic acids in 2011 (Fig. 1; Table 1), suggesting fatty acids are inherited in a quantitative manner. Moreover, significant transgression segregations were also observed in progenies (Fig. 1), suggesting both parents contributed to fatty acid composition.
Fig. 1

Frequency distributions of five fatty acid contents in 200 RIL seeds from 2009 to 2011. The arrows indicate the fatty acid compositions in two parental lines (cv. LHD2 and NHZ)

Frequency distributions of five fatty acid contents in 200 RIL seeds from 2009 to 2011. The arrows indicate the fatty acid compositions in two parental lines (cv. LHD2 and NHZ)

SLAF-seq and genotyping of soybean RIL population

A total of 281.5 million pair-end reads from SLAF-seq were generated using the Illumina Genome Analyzer IIx in this study. Specifically, 28.4 and 28.9 million reads were generated for female parent LHD2 and male parent NHZ, respectively, while 224.3 million reads were obtained for 90 of the 200 RILs. After filtering, paired-end reads with clear information were mapped to the soybean reference genome (Version of Wm82.a2.v1, https://phytozome.jgi.doe.gov), and 453,524 effective SLAFs were developed. Polymorphisms of the integrated SLAFs were analyzed, and 16,199 polymorphic SLAFs were identified. These polymorphic SLAFs were integrated with the 9948 polymorphic SLAFs identified in other 110 RILs of the population in our previous study (Li et al. 2014), and integrated SLAFs were further screened to filter out markers that were unsuitable for genetic map construction. Finally, 3541 polymorphic SLAFs were obtained to construct a high-density linkage map. All of these markers are of the SNP-type.

Construction of a high-density genetic map in soybean

The genotyping data of the 3541 SLAF markers for 200 RILs were analyzed to determine the order of these SLAF markers in 20 LGs, and a new high-density genetic map was constructed, with a genetic distance of 2534.42 cM (Fig. 2; Supplementary Table S1). The average distance between adjacent markers was 0.72 cM. The largest LG was Gm18, with 339 SLAF markers and a length of 187.47 cM. The smallest LG was Gm04, with 260 SLAF markers and a length of 94.18 cM. The mean chromosome length was 126.72 cM (Table 2). A relatively high collinearity was observed between the 20 LGs and the reference genome (Supplementary Fig. S1), making the annotation of genes within QTL intervals feasible.
Fig. 2

High-density genetic map for soybean and 26 stable QTL for fatty acids. The SLAF marker distributions were depicted on the 20 linkage groups (Gm01–Gm20) based on their genetic positions in centiMorgans (cM). The 26 stable QTL for fatty acids, along with their interval distances (cM) are shown between the tightly linked SLAF markers on the right side of each linkage group. The three fatty acid biosynthesis genes are indicated within the QTL intervals in red color by comparing their physical positions with that of SLAF markers in the soybean reference genome (color figure online)

Table 2

Description of characteristics of the 20 linkage groups (LGs) in the soybean genetic map

LGa No. of SLAFsDistance (cM)Average distance between markers (cM)Largest gap (cM)
Gm01228130.350.575.27
Gm02154140.930.9213.74
Gm03231147.280.6415.40
Gm0426094.180.365.95
Gm058699.601.178.20
Gm06196143.700.747.10
Gm07142118.420.8414.89
Gm08152137.600.914.37
Gm09147151.621.0413.95
Gm10167110.810.675.75
Gm1169112.481.658.71
Gm1210096.850.988.86
Gm13178176.981.007.02
Gm14209136.250.6612.63
Gm1523799.960.426.97
Gm16172110.830.658.10
Gm17217120.510.564.82
Gm18339187.470.556.32
Gm19161120.240.757.18
Gm209698.361.047.38
Maximum339187.471.6515.40
Minimum6994.180.364.37
Total35412534.420.72

aLinkage group

High-density genetic map for soybean and 26 stable QTL for fatty acids. The SLAF marker distributions were depicted on the 20 linkage groups (Gm01–Gm20) based on their genetic positions in centiMorgans (cM). The 26 stable QTL for fatty acids, along with their interval distances (cM) are shown between the tightly linked SLAF markers on the right side of each linkage group. The three fatty acid biosynthesis genes are indicated within the QTL intervals in red color by comparing their physical positions with that of SLAF markers in the soybean reference genome (color figure online) Description of characteristics of the 20 linkage groups (LGs) in the soybean genetic map aLinkage group

QTL for fatty acid composition

We mapped 316 QTL to 20 soybean chromosomes for the five fatty acids. Of these QTL, 26 were identified for fatty acids across multiple environments, 64 for multiple fatty acids in single environments, and 226 for fatty acids in single environments (Supplementary Table S2). We focused mainly on the 26 stable QTL across multiple environments; these QTL were mapped to all chromosomes except Gm10 and Gm11 (Fig. 2), and the average phenotypic variance explained by individual QTL varied from 0.4 to 37.0% (Table 3). Eight of the 26 QTL explained the high phenotypic variance (>10%) for specific fatty acids. We detected three genes involved in fatty acid biosynthesis, and 66 genes encoding lipid-related transcription factors within the 26 stable QTL intervals (Table 3 and Supplementary Table S3).
Table 3

The stable QTL for specific fatty acids in soybean seed across multiple environments

QTLa Traitb Environmentc Chrd Marker intervalLODPVEe (%)NoveltyReported QTL or QTNGene annotationf
qPA4_1 PA2009/20114Gm04_47098–Gm04_447625.1–16.31.7–7.7Novel
qPA6_1 2009/20106Gm06_8091–Gm06_24578.0–14.44.1–6.1Novel
qPA8_1 2010/20118Gm08_9171–Gm08_3884943.8–48.528.0–37.0Bachlava et al. (2009)
qPA16_1 2009/201016Gm16_28849–Gm16_520985.9–13.52.2–7.5Novel
qSA8_1 SA2009/20118Gm08_53938–Gm08_325234.2–39.822.4–28.7Li et al. (2015)
qSA12_1 2009/201112Gm12_1782–Gm12_3090418.0–32.69.9–11.9Novel GmFabG
qSA14_1 2009/201014Gm14_21994–Gm14_554503.8–28.83.0–7.0Xie et al. (2012), Bachlava et al. (2009)
qSA18_1 2009/201118Gm18_84824–Gm18_775186.0–32.02.8–11.6Fan et al. (2015)
qOA7_1 OA2010/20117Gm07_58581–Gm07_2939013.0–54.01.3–33.6Fan et al. (2015)
qOA8_1 2009/20108Gm08_13258–Gm08_434226.0–6.11.1–6.5Bachlava et al. (2009)
qOA9_1 2009/20109Gm09_45996–Gm09_51526.9–23.11.3–14.9Fan et al. (2015), Li et al. (2015), Xie et al. (2012), Reinprecht et al. (2006), Wang et al. (2012) GmACP
qOA13_1 2009/201113Gm13_25899–Gm13_972211.9–28.92.3–3.5Fan et al. (2015), Wang et al. (2012)
qOA17_1 2009/201017Gm17_33278–Gm17_75738.9–15.01.5–1.7Xie et al. (2012)
qOA18_1 2009/201118Gm18_79781–Gm18_5396452.1–68.18.6–27.4Novel
qFA19_1 2009/201119Gm19_67930–Gm19_4289010.4–20.51.0–4.4Novel
qFA19_2 2009/201019Gm19_58422–Gm19_160803.6–20.91.8–4.4Fan et al. (2015)
qFA19_1 LA2009/201019Gm19_67930–Gm19_428907.9–26.00.7–12.0Novel
qFA19_2 2009/201019Gm19_58422–Gm19_1608028.3–40.313.6–19.3Fan et al. (2015)
qLA1_1 2009/20111Gm01_79467–Gm01_151476.7–25.93.0–4.4Novel
qLA4_1 2009/20104Gm04_67180–Gm04_615047.6–27.42.7–4.8Novel
qLA5_1 2010/20115Gm05_27463–Gm05_532363.9–4.10.4–1.3Bachlava et al. (2009)
qLA8_1 2009/2010/20118Gm08_47647–Gm08_88667.9–33.61.3–16.9Fan et al. (2015)
qLNA2_1 LNA2009/20112Gm02_57581–Gm02_699056.2–9.12.5–3.6Novel
qLNA3_1 2010/20113Gm03_40606–Gm03_1011813.1–42.13.7–21.7Fan et al. (2015) GmFAD8
qLNA15_1 2009/201015Gm15_4886–Gm15_161987.9–35.23.0–17.6Li et al. (2015)
qLNA16_1 2010/201116Gm16_11545–Gm16_477068.6–9.92.8–3.0Diers and Shoemaker (1992)
qLNA19_1 2010/201119Gm19_45448–Gm19_5703948.3–50.128.2–32.3Fan et al. (2015)
qLNA20_1 2009/2010/201120Gm20_19811–Gm20_4167314.9–44.43.6–26.0Fan et al. (2015)

aThe name of QTL, is a composite of the influenced trait: PA (palmitic acid), SA (stearic acid), OA (oleic acid), LA (linoleic acid), and LNA (linolenic acid), followed by the chromosome number. For QTL underlying multiple fatty acids across various environments, the name is designated as a composite of FA (fatty acid) followed by the chromosome number

bThe five predominant fatty acids are designated as follows: palmitic acid (PA), stearic acid (SA), oleic acid (OA), linoleic acid (LA), and linolenic acid (LNA)

cThe three environments are designated as follows: 2009, 2010, and 2011

dChromosome

ePVE indicates the phenotypic variance explained by individual QTL

fGene annotation shows the essential genes involved in fatty acid biosynthesis discovered within the additive QTL intervals

The stable QTL for specific fatty acids in soybean seed across multiple environments aThe name of QTL, is a composite of the influenced trait: PA (palmitic acid), SA (stearic acid), OA (oleic acid), LA (linoleic acid), and LNA (linolenic acid), followed by the chromosome number. For QTL underlying multiple fatty acids across various environments, the name is designated as a composite of FA (fatty acid) followed by the chromosome number bThe five predominant fatty acids are designated as follows: palmitic acid (PA), stearic acid (SA), oleic acid (OA), linoleic acid (LA), and linolenic acid (LNA) cThe three environments are designated as follows: 2009, 2010, and 2011 dChromosome ePVE indicates the phenotypic variance explained by individual QTL fGene annotation shows the essential genes involved in fatty acid biosynthesis discovered within the additive QTL intervals Particularly, four stable QTL for palmitic acid (qPA4_1, qPA6_1, qPA8_1, and qPA16_1) explained 1.7–37.0% of the phenotypic variance. qPA8_1 explained an averaged 32.5% of the phenotypic variance for palmitic acid. For stearic acid, four stable QTL (qSA8_1, qSA12_1, qSA14_1 and qSA18_1) accounted for 3.0–28.7% of the phenotypic variance. qSA8_1 and qSA12_1 explained averaged 25.6 and 10.9% of phenotypic variance, respectively, and a 3-oxoacyl-ACP reductase gene (GmFabG, Glyma.12G092900) was found within the genomic region of qSA12_1. For oleic acid, eight stable QTL were detected including qOA7_1, qOA8_1, qOA9_1, qOA13_1, qOA17_1, qOA18_1, qFA19_1, and qFA19_2, explaining 1.3–33.6% of the phenotypic variance. qOA18-1 explained 18.0% of the phenotypic variance on average. An acyl carrier protein gene (GmACP, Glyma.09G060900) was found within the genomic region of qOA9_1. For linoleic acid, six stable QTL were identified, including qFA19_1, FA19_2, qLA1_1, qLA4_1, qLA5_1, and qLA8_1, explaining 0.4–19.3% of the phenotypic variance. qFA19-2 explained an averaged 16.4% of the phenotypic variance. For linolenic acid, six QTL (qLNA2_1, qLNA3_1, qLNA15_1, qLNA16_1, qLNA19_1, and qLNA20_1) explained 2.5–32.3% of the phenotypic variance. qLNA3_1, qLNA15_1 and qLNA19_1 explained on average 12.7, 11.2, and 30.3% of the phenotypic variance, respectively. A ω-fatty acid desaturase gene (GmFAD8, Glyma.03G056700) was found within the genomic region of qLNA3_1 (Table 3). Moreover, qFA19_1 and qFA19_2 contributed to both oleic and linoleic acid composition across multiple environments, suggesting a pleiotropic effect for multiple fatty acids (Table 3).

Epistatic effect on fatty acid composition

Five epistatic QTL were detected for individual fatty acids, explaining 3.4–10.4% of the phenotypic variance (Table 4). Interestingly, a locus at 0 cM on Gm06 had epistatic effects with both loci on Gm07 and Gm10 for stearic acid across two environments (2009 and 2011), indicating a relatively stable epistatic effect (Table 4).
Table 4

Epistatic QTL for fatty acid contents in soybean seeds

Traita Chrb1Pc1 (cM)Marker interval 1Chr2P2 (cM)Marker interval 2LOD(E)d PVEe (%)A1f (%)A2 g (%)A1byA2 h (%)
PA_201110100Gm10_49818–Gm10_385211965Gm19_62762–Gm19_392585.310.40.0001−0.0882−0.2088
SA_200960Gm06_27520–Gm06_535047115Gm07_85356–Gm07_211865.25.6−0.03530.0005−0.1149
SA_2009875Gm08_54434–Gm08_140771215Gm12_67387–Gm12_127475.14.7−0.00010.0033−0.1036
SA_201160Gm06_27520–Gm06_535041090Gm10_74377–Gm10_254755.17.2−0.01410.0001−0.1768
LNA_2011230Gm02_14960–Gm02_4819113140Gm13_52660–Gm13_554245.13.4−0.00110.03730.1473

aThe five predominant fatty acids are designated as follows: PA (palmitic acid), SA (stearic acid), OA (oleic acid), LA (linoleic acid), and LNA (linolenic acid); the three environments are three continuous years from 2009 to 2011

bChromosome

cPosition in linkage groups in centiMorgans

dLOD score of epistatic QTL

ePhenotypic variance explained by epistatic QTL

fAdditive effect of the first QTL

gAdditive effect of the second QTL

hEpistatic effect between two QTL

Epistatic QTL for fatty acid contents in soybean seeds aThe five predominant fatty acids are designated as follows: PA (palmitic acid), SA (stearic acid), OA (oleic acid), LA (linoleic acid), and LNA (linolenic acid); the three environments are three continuous years from 2009 to 2011 bChromosome cPosition in linkage groups in centiMorgans dLOD score of epistatic QTL ePhenotypic variance explained by epistatic QTL fAdditive effect of the first QTL gAdditive effect of the second QTL hEpistatic effect between two QTL

Discussion

SLAF-seq is an effective sequencing-based method for large-scale marker discovery and genotyping. Furthermore, SLAF-seq is a highly efficient approach for marker development that is relatively inexpensive and can be based on large populations (Li et al. 2014; Sun et al. 2013). This method has been widely used to construct high-density genetic maps and identify QTL for agronomic traits and disease resistance in various crops (Han et al. 2015; Qin et al. 2015; Su et al. 2016). However, the application of this method in QTL mapping for soybean fatty acid composition has not been reported. Therefore, we used SLAF-seq to construct a high-density genetic map and identify QTL for fatty acid composition.

The effects of population size and marker density on QTL mapping

Efficiency and accuracy are important for QTL mapping (Li et al. 2010; Stange et al. 2013). We previously performed QTL mapping for fatty acid composition using a genetic linkage map based on 161 SSR marker sets and 100 RILs derived from the LHD2/NHZ cross (Fan et al. 2015). In the current study, all the 200 RILs from the same population were used to construct a new high-density genetic linkage map with 3541 SLAF markers. To analyze the efficiency and accuracy of QTL mapping with different population sizes and marker densities, we made a permutation for QTL detection of fatty acid composition using 100–200 RILs randomly selected from the population and compared the results each other and with our previous study. The number of detected QTL increased significantly with an increase in marker density and population size (Table 5). Therefore, the efficiency of QTL mapping improved significantly with an increase in population size and marker density. Many novel QTL could be identified with the improvement of QTL mapping.
Table 5

Number of QTL in soybean detected using different population sizes and marker densities

Population sizeMarker densityTotal QTL number
10016154
1003541149
1503541193
2003541316
Number of QTL in soybean detected using different population sizes and marker densities On the other hand, almost all of QTL intervals (45 of the 47 overlapping QTL intervals) were reduced significantly with the increase of marker densities. In fact, 91% of QTL intervals were smaller than 5.0 cM, and 38% of QTL intervals were smaller than 1.0 cM. For instance, qFA8_1 was mapped in an interval of 30.6 cM in our previous study (Fan et al. 2015), whereas the QTL interval was reduced to 1.1 cM in the current study (Supplementary Table S2). The smaller genomic region, in combination with the high collinearity of the genetic map with the reference genome sequence (Supplementary Fig. S1), will facilitate fine mapping of these QTL. That will help uncover the complex networks that govern fatty acid formation and regulation in soybean seeds. Another example, qLNA19_1, explaining an averaged 30.3% of the phenotypic variance for linolenic acid across two environments (2010 and 2011), was identified within a 0.5 cM interval, corresponding to a 55 kb genomic region in the soybean reference genome (Wm82.a2.v1). Therefore, the candidate genes within this region could be identified. Additionally, due to the low density of our previous genetic map, several closely linked QTL were assumed to be a single QTL. When the marker density and population size increased, these QTL could be detected accurately as several individual QTL. For instance, qOA7_2 was mapped in a 17.9 cM interval in our previous study (Fan et al. 2015), whereas two adjacent additive QTL (qFA7_7 and qFA7_8) were detected in the current study (Supplementary Table S2). Therefore, the accuracy of QTL mapping was improved significantly due to the increased population size and marker density.

Comparison analysis revealed the novel stable QTL for fatty acids

According to the SoyBase database (http://soybase.org), hundreds of QTL have been detected for individual fatty acids (Alrefai et al. 1995; Bachlava et al. 2009; Brummer et al. 1997; Diers and Shoemaker 1992; Fan et al. 2015; Hyten et al. 2004; Panthee et al. 2006; Reinprecht et al. 2006; Wang et al. 2012, 2014; Xie et al. 2012). GWAS has also revealed 33 QTN associated with individual fatty acids (Li et al. 2015). We compared the QTL detected in our current study with the QTL and QTN reported previously according to their physical position in the soybean reference genome, and found that 108 of the total 316 QTL are novel (Supplementary Table S2). For the 26 stable QTL detected across multiple environments, nine are novel loci for individual fatty acids. Specifically, for saturated fatty acid, three novel QTL (qPA4_1, qPA6_1, and qPA16_1) were stably identified for palmitic acid, while one novel QTL, qSA12_1, was detected for stearic acid. For unsaturated fatty acid, two novel QTL (qOA18_1 and qFA19_1) were detected for oleic acid; three novel QTL (qFA19_1, qLA1_1 and qLA4_1) were identified for linoleic acid, while one novel QTL, qLNA2_1, contributed to linolenic acid (Table 3). These novel stable QTL could help elucidate the genetic basis of fatty acid accumulation and regulation.

Major stable QTL and tightly linked markers could be applied for marker-assisted selection in soybean breeding programs

We also identified eight major stable QTL explaining the high phenotypic variance (>10%) for fatty acids. Particularly, qPA8_1, qSA8_1, qLNA19_1 explained approximately one-third of the phenotypic variance for palmitic, stearic, and linolenic acids, respectively, across multiple environments. By contrast, our results suggested that the epistatic QTL had less effect on fatty acid contents compared with the additive effects of major QTL (Table 4). Therefore, these major QTL and tightly linked SLAF markers could be applied in MAS in soybean breeding programs.

Gene annotation revealed fatty acid biosynthesis and lipid-related transcription factor genes

By annotating the genes within the 26 QTL intervals aginst Nr, Swiss-Prot, and KOG/COG databases, three essential genes involved in fatty acid biosynthesis (GmACP, GmFabG, and GmFAD8) were identified for oleic, stearic, and linolenic acids within the genomic region of qOA9_1, qSA12_1, and qLNA3_1, respectively. GmACP is an essential substrate that functions in upstream of de novo fatty acid biosynthesis. GmFabG is one of the core enzymes of fatty acid synthase (FAS). It catalyzes the reduction of acetoacetyl-ACP to β-hydroxybutyryl-ACP. In combination with other enzymes of FAS, palmitoyl-ACP (16:0-ACP) is produced (Somerville and Browse 1996). GmFAD8 is a ω-fatty acid desaturase that catalyzes the conversion of linoleic acid (18:2) to linolenic acid (18:3), and thereby is important for linolenic acid accumulation (Somerville and Browse 1996). Consistent with their functions, the QTL harboring these three genes contributed to oleic, stearic, and linolenic acids, respectively. The presence of these genes within the QTL suggests that they may contribute to the major effects of these loci. In addition to the structural genes involved in fatty acid biosynthesis, 66 genes encoding lipid-related transcription factors, such as MYB, WRKY, bZIP, and bHLH, were also detected within the 26 QTL intervals (Supplementary Table S3). Several transcription factors play essential regulatory roles in fatty acid formation (Baud et al. 2007, 2009; Mendes et al. 2013; Mu et al. 2008; Raffaele et al. 2008; To et al. 2012; Wang et al. 2007). Therefore, the functional validation of these 66 genes in fatty acid regulation will help uncover the complex network underlying fatty acid composition in soybean seed. In summary, we developed a high-density genetic map comprising 3541 SLAF markers using 200 RILs. With this high-resolution genetic map, we identified 26 stable QTL for fatty acid composition. Nine of these QTL are novel loci for individual fatty acids. Three genes involved in fatty acid biosynthesis (GmACP, GmFabG, and GmFAD8) were found within the genomic region of qOA9_1, qSA12_1, and qLNA3_1, respectively, suggesting they may contribute to the major effect for oleic, stearic, and linolenic acids, respectively. The stable and novel QTL detected in the present study will not only facilitate studies of the genetic basis of fatty acid formation and regulation, but they may also be useful in MAS for the improvement of soybean quality.

Author contribution statement

BL conducted the data analysis, QTL mapping, genomic comparative analysis, and wrote the manuscript. SXF, FKY, and YC extracted the DNA from the RIL populations, performed SNP calling, and developed the genetic linkage map for soybean. SRZ designed and edited the figures. FXH, SRY, and LZW provided advice on experimental design and edited the manuscript. JMS designed, supervised, and financed the work and edited the manuscript. All authors read and approved of the final manuscript. Below is the link to the electronic supplementary material. Supplementary material 1 (DOCX 142 kb) Supplementary material 2 (XLSX 149 kb) Supplementary material 3 (XLSX 58 kb) Supplementary material 4 (DOCX 30 kb)
  26 in total

1.  Statistical properties of QTL linkage mapping in biparental genetic populations.

Authors:  H Li; S Hearne; M Bänziger; Z Li; J Wang
Journal:  Heredity (Edinb)       Date:  2010-05-12       Impact factor: 3.821

2.  Inclusive composite interval mapping (ICIM) for digenic epistasis of quantitative traits in biparental populations.

Authors:  Huihui Li; Jean-Marcel Ribaut; Zhonglai Li; Jiankang Wang
Journal:  Theor Appl Genet       Date:  2007-11-06       Impact factor: 5.699

3.  The soybean Dof-type transcription factor genes, GmDof4 and GmDof11, enhance lipid content in the seeds of transgenic Arabidopsis plants.

Authors:  Hui-Wen Wang; Bo Zhang; Yu-Jun Hao; Jian Huang; Ai-Guo Tian; Yong Liao; Jin-Song Zhang; Shou-Yi Chen
Journal:  Plant J       Date:  2007-09-18       Impact factor: 6.417

4.  Dietary fat intake and the risk of coronary heart disease in women.

Authors:  F B Hu; M J Stampfer; J E Manson; E Rimm; G A Colditz; B A Rosner; C H Hennekens; W C Willett
Journal:  N Engl J Med       Date:  1997-11-20       Impact factor: 91.245

5.  WRINKLED1 specifies the regulatory action of LEAFY COTYLEDON2 towards fatty acid metabolism during seed maturation in Arabidopsis.

Authors:  Sébastien Baud; Monica Santos Mendoza; Alexandra To; Erwana Harscoët; Loïc Lepiniec; Bertrand Dubreucq
Journal:  Plant J       Date:  2007-04-05       Impact factor: 6.417

6.  A MYB transcription factor regulates very-long-chain fatty acid biosynthesis for activation of the hypersensitive cell death response in Arabidopsis.

Authors:  Sylvain Raffaele; Fabienne Vailleau; Amandine Léger; Jérôme Joubès; Otto Miersch; Carine Huard; Elisabeth Blée; Sébastien Mongrand; Frédéric Domergue; Dominique Roby
Journal:  Plant Cell       Date:  2008-03-07       Impact factor: 11.277

7.  SLAF-seq: an efficient method of large-scale de novo SNP discovery and genotyping using high-throughput sequencing.

Authors:  Xiaowen Sun; Dongyuan Liu; Xiaofeng Zhang; Wenbin Li; Hui Liu; Weiguo Hong; Chuanbei Jiang; Ning Guan; Chouxian Ma; Huaping Zeng; Chunhua Xu; Jun Song; Long Huang; Chunmei Wang; Junjie Shi; Rui Wang; Xianhu Zheng; Cuiyun Lu; Xiaowu Wang; Hongkun Zheng
Journal:  PLoS One       Date:  2013-03-19       Impact factor: 3.240

8.  Rapid SNP discovery and genetic mapping using sequenced RAD markers.

Authors:  Nathan A Baird; Paul D Etter; Tressa S Atwood; Mark C Currey; Anthony L Shiver; Zachary A Lewis; Eric U Selker; William A Cresko; Eric A Johnson
Journal:  PLoS One       Date:  2008-10-13       Impact factor: 3.240

9.  Genetic characteristics of soybean resistance to HG type 0 and HG type 1.2.3.5.7 of the cyst nematode analyzed by genome-wide association mapping.

Authors:  Yingpeng Han; Xue Zhao; Guanglu Cao; Yan Wang; Yinghui Li; Dongyuan Liu; Weili Teng; Zhiwu Zhang; Dongmei Li; Lijuan Qiu; Hongkun Zheng; Wenbin Li
Journal:  BMC Genomics       Date:  2015-08-13       Impact factor: 3.969

10.  Identification of favorable SNP alleles and candidate genes for traits related to early maturity via GWAS in upland cotton.

Authors:  Junji Su; Chaoyou Pang; Hengling Wei; Libei Li; Bing Liang; Caixiang Wang; Meizhen Song; Hantao Wang; Shuqi Zhao; Xiaoyun Jia; Guangzhi Mao; Long Huang; Dandan Geng; Chengshe Wang; Shuli Fan; Shuxun Yu
Journal:  BMC Genomics       Date:  2016-08-30       Impact factor: 3.969

View more
  14 in total

1.  Identification of Genomic Loci and Candidate Genes Related to Seed Tocopherol Content in Soybean.

Authors:  Suprio Ghosh; Shengrui Zhang; Muhammad Azam; Kwadwo Gyapong Agyenim-Boateng; Jie Qi; Yue Feng; Yecheng Li; Jing Li; Bin Li; Junming Sun
Journal:  Plants (Basel)       Date:  2022-06-27

2.  Construction of a high-density genetic map and mapping of QTLs for soybean (Glycine max) agronomic and seed quality traits by specific length amplified fragment sequencing.

Authors:  Yanwei Zhang; Wei Li; Yanhui Lin; Lifeng Zhang; Caijie Wang; Ran Xu
Journal:  BMC Genomics       Date:  2018-08-29       Impact factor: 3.969

3.  An integrated omics analysis reveals molecular mechanisms that are associated with differences in seed oil content between Glycine max and Brassica napus.

Authors:  Zhibin Zhang; Jim M Dunwell; Yuan-Ming Zhang
Journal:  BMC Plant Biol       Date:  2018-12-04       Impact factor: 4.215

4.  Genome-wide discovery of DNA polymorphisms among chickpea cultivars with contrasting seed size/weight and their functional relevance.

Authors:  Mohan Singh Rajkumar; Rohini Garg; Mukesh Jain
Journal:  Sci Rep       Date:  2018-11-14       Impact factor: 4.379

5.  Construction of a high-density genetic linkage map and QTL mapping of oleic acid content and three agronomic traits in sunflower (Helianthus annuus L.) using specific-locus amplified fragment sequencing (SLAF-seq).

Authors:  Fei Zhou; Yan Liu; Chunbo Liang; Wenjun Wang; Cen Li; Yongli Guo; Jun Ma; Ying Yu; Lijuan Fan; Yubo Yao; Dongsheng Zhao; Xuemei Liu; Xutang Huang
Journal:  Breed Sci       Date:  2018-11-21       Impact factor: 2.086

6.  Identification of candidate genes for key fibre-related QTLs and derivation of favourable alleles in Gossypium hirsutum recombinant inbred lines with G. barbadense introgressions.

Authors:  Furong Wang; Jingxia Zhang; Yu Chen; Chuanyun Zhang; Juwu Gong; Zhangqiang Song; Juan Zhou; Jingjing Wang; Chengjie Zhao; Mengjia Jiao; Aiying Liu; Zhaohai Du; Youlu Yuan; Shoujin Fan; Jun Zhang
Journal:  Plant Biotechnol J       Date:  2019-09-20       Impact factor: 9.803

7.  Deciphering the Genetic Architecture of Plant Height in Soybean Using Two RIL Populations Sharing a Common M8206 Parent.

Authors:  Yongce Cao; Shuguang Li; Guoliang Chen; Yanfeng Wang; Javaid Akhter Bhat; Benjamin Karikari; Jiejie Kong; Junyi Gai; Tuanjie Zhao
Journal:  Plants (Basel)       Date:  2019-09-26

8.  Quantitative trait loci analysis of seed oil content and composition of wild and cultivated soybean.

Authors:  Yanjie Yao; Qingbo You; Guozhan Duan; Jianjun Ren; Shanshan Chu; Junqing Zhao; Xia Li; Xinan Zhou; Yongqing Jiao
Journal:  BMC Plant Biol       Date:  2020-01-31       Impact factor: 4.215

9.  Effect of marker segregation distortion on high density linkage map construction and QTL mapping in Soybean (Glycine max L.).

Authors:  Jian-Fang Zuo; Yuan Niu; Peng Cheng; Jian-Ying Feng; Shi-Feng Han; Ying-Hao Zhang; Guoping Shu; Yibo Wang; Yuan-Ming Zhang
Journal:  Heredity (Edinb)       Date:  2019-05-31       Impact factor: 3.821

10.  Construction of a High-Density Genetic Map and Analysis of Seed-Related Traits Using Specific Length Amplified Fragment Sequencing for Cucurbita maxima.

Authors:  Yunli Wang; Chaojie Wang; Hongyu Han; Yusong Luo; Zhichao Wang; Chundong Yan; Wenlong Xu; Shuping Qu
Journal:  Front Plant Sci       Date:  2020-02-21       Impact factor: 5.753

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.