Literature DB >> 27866151

Genotyping-by-Sequencing-Based Investigation of the Genetic Architecture Responsible for a ∼Sevenfold Increase in Soybean Seed Stearic Acid.

Crystal B Heim1, Jason D Gillman2,3.   

Abstract

Soybean oil is highly unsaturated but oxidatively unstable, rendering it nonideal for food applications. Until recently, the majority of soybean oil underwent partial chemical hydrogenation, which produces trans fats as an unavoidable consequence. Dietary intake of trans fats and most saturated fats are conclusively linked to negative impacts on cholesterol levels and cardiovascular health. Two major soybean oil breeding targets are: (1) to reduce or eliminate the need for chemical hydrogenation, and (2) to replace the functional properties of partially hydrogenated soybean oil. One potential solution is the elevation of seed stearic acid, a saturated fat which has no negative impacts on cardiovascular health, from 3 to 4% in typical cultivars to > 20% of the seed oil. We performed QTL analysis of a population developed by crossing two mutant lines, one with a missense mutation affecting a stearoyl-acyl-carrier protein desaturase gene resulting in ∼11% seed stearic acid crossed to another mutant, A6, which has 24-28% seed stearic acid. Genotyping-by-sequencing (GBS)-based QTL mapping identified 21 minor and major effect QTL for six seed oil related traits and plant height. The inheritance of a large genomic deletion affecting chromosome 14 is the basis for largest effect QTL, resulting in ∼18% seed stearic acid. This deletion contains SACPD-C and another gene(s); loss of both genes boosts seed stearic acid levels to ≥ 18%. Unfortunately, this genomic deletion has been shown in previous studies to be inextricably correlated with reduced seed yield. Our results will help inform and guide ongoing breeding efforts to improve soybean oil oxidative stability.
Copyright © 2017 Heim and Gillman.

Entities:  

Keywords:  Glycine max; QTL mapping; fatty acid composition; genotyping-by-sequencing; soybean

Mesh:

Substances:

Year:  2017        PMID: 27866151      PMCID: PMC5217118          DOI: 10.1534/g3.116.035741

Source DB:  PubMed          Journal:  G3 (Bethesda)        ISSN: 2160-1836            Impact factor:   3.154


Soybean [Glycine max (L.) Merr.] is the source of the most widely consumed edible oil in the US, representing 55% of total oil consumption (http://soystats.com/2015-soystats/). Soybean oil and its derivatives are ubiquitous ingredients in packaged foods, especially in cookies and other snack foods (Van Camp ). Soybean oil from a “typical” cultivar is comprised of five main fatty acids: palmitic, stearic, oleic, linoleic, and linolenic (11, 3–4, 25, 52, and 8%, respectively) (Fehr 2007). Due to the high proportion of polyunsaturated fatty acids, which are oxidatively unstable (Frankel 1991), the majority of soybean oil is chemically hydrogenated in order to increase stability and shelf life. Unfortunately, the process of hydrogenation produces trans fats, which have been tied to increased risk of heart disease and elevated low-density lipoprotein (LDL) levels in blood serum (Hunter ). In 2003, the Food and Drug Administration (FDA) mandated labeling of packaged foods containing trans fats in amounts exceeding 0.5 g per serving (Van Camp ), culminating in the removal of the “Generally Regarded As Safe” status for partially hydrogenated foods, due to the presence of trans fats. (http://www.fda.gov/Food/IngredientsPackagingLabeling/FoodAdditivesIngredients/ucm449162.htm accessed 08-24-2016). For these reasons, a major soybean breeding target has been modification of soybean oil composition, so as to reduce or eliminate the need for chemical hydrogenation. Achievement of this goal would potentially translate to decreased trans fat intake for consumers, as well as fewer steps in processing for manufacturers. Biotechnological and traditional breeding have independently increased the oleic acid content of soybean oil and thereby increased oxidative stability (Bilyeu ; Buhr ; Pham ). Soybean oil high in oleic acid (> 70%) has been shown to be more chemically stable than typical soybean oil (Warner and Gupta 2005). However, both conventional and high oleic acid soybean oil have very low melting points, and for many applications fully hydrogenated oil is more suitable (Ribeiro ). For instance, edible oil high in saturated fats is desirable for solid fat baking applications (Renzyaeva 2013; Tarancón ). Interestingly, new research has shown that stearic acid (C18:0), a long-chain saturated fatty acid, does not have the same cholesterolemic effects in humans as shorter acyl-chain saturated fats (Hunter ; Yu ); it is effectively “heart-neutral.” Unfortunately, soybean oil generally contains only ∼3–4% stearic acid (Wilson 2004), and > 20% must be achieved to meet current market demands (K. Whiting, Smith-Bucklin/United Soybean Board, personal communication). Elevated stearic acid is an apparently rare trait; out of 21,849 USDA-GRIN soybean collection entries that have been evaluated for oil composition, only a single entry has stearic acid > 9% (https://npgsweb.ars-grin.gov/gringlobal/descriptors.aspx, accessed 08-19-2016). It is known that soybean seed stearic acid content is determined largely by alterations in the Stearoyl-Acyl Carrier Protein Desaturase-C (SACPD-C) gene (Glyma14g27990). SACPD-C is a soluble enzyme that selectively desaturates stearic acid precursors (C18:0) to oleic acid precursors (C18:1) (Zhang ). It has been previously reported that SACPD-C is a major determinant of seed stearic acid levels (Ruddle ,b, 2012; Gillman ). To date, there is only one known naturally occurring source of elevated stearic acid, FAM94-41, which contains a missense mutation in SACPD-C and ∼13% seed stearic acid (Zhang ). All other lines with elevated stearic acid are due to mutagenesis (Bolon ; Gillman ; Rahman , 1997). The majority of lines contain 8–12.5% seed stearic acid, but one notable exception is mutant line A6, which has been reported to have arisen by sodium azide-induced mutagenesis (Hammond and Fehr 1983), but which contains several large genomic deletions more consistent with radiation-induced mutagenesis. A6 seeds exhibit a dramatic increase in seed stearic acid levels (∼24–28%, as compared to the wild-type 4%). A6 bears several large genomic deletions, including a large portion of chromosome 14 (∼6,221,000 bp) corresponding to ∼1/8 of the chromosome (Gillman ). Regrettably, A6 is also extremely agronomically deficient, exhibiting poor germination, low seed yield, reduced seed quality, extreme early maturity (MG 0), and short plant stature when alleles for elevated stearic acid are present (Lundeen ; Hammond and Fehr 1983). Some of the defects are likely due to one (or more) of the significant genomic deletions (Gillman ). We identified multiple SACPD-C deletion and missense mutation lines from multiple genetic backgrounds and, except for line A6, loss of functional SACPD-C per se can only elevate seed stearic acid levels to ∼13%, regardless of mutation or deletion (Gillman ). The identification of multiple mutants with genetically controlled variation in elevated stearic acid content presented an opportunity to investigate the genetic basis for the extremely elevated stearic acid content (∼24%) in line A6. To that end, a recombinant inbred line (RIL) population was developed by crossing a line with ∼11% stearic acid (194D) to a line with ∼24–28% stearic acid (A6). We then employed a GBS method (Elshire ), which is routine in maize, but remains less utilized in soybean. GBS is a simple, repeatable, and robust method for identifying single nucleotide polymorphisms (SNPs) via sequencing of reduced representation libraries produced with methylation-sensitive restriction enzymes, which bias against highly repetitive and gene-poor genomic regions (Poland ). A dense genetic linkage map was constructed from 2977 high-quality GBS markers. Using phenotypic data from 173 RILs over three environments, 21 QTL were identified for seven traits in soybean, including four QTL for seed stearic acid content.

Materials and Methods

Plant material and population development

Two RIL populations were developed, derived from the cross between A6 and 194D (designated population 2), and the reciprocal cross between 194D and A6 (designated population 3). Population 2 consisted of 100 individuals in 2014 and 2015, and population 3 consisted of 95 individuals in 2014 and 94 individuals in 2015. Walter Fehr generously provided seeds from the high stearic acid line, A6. 194D is a midstearic line (∼13%) that has a V211E mutation affecting SACPD-C identified by Gillman from an EMS-induced TILLiNG population created by Kristin Bilyeu, described previously (Cooper ). Both populations and their parent lines were grown in three distinct locations: (1) South Farm in 2014 (Columbia, MO, Latitude 38.908189, Longitude −92.278693, Mexico silt loam soil); (2) Bradford Research Center in 2014 (Columbia, MO, Latitude 38.893990, Longitude 92.196405, Mexico silt loam soil); and (3) Hinkson Field in 2015 (Columbia, MO, Latitude 38.928015, Longitude −92.351425, Haymond silt loam soil). Both populations were grown in single-row plots in randomized complete block design with three replicates over all locations in both years, except at South Farm in 2014, where population 2 was not grown due to severe weather constraints and seed availability during planting season. A small number of lines were found to be unrelated to A6 or 194D via GBS markers and were excluded; the final population size was 173 RILs.

Quantification of seed traits

The F4:5 and the F5:6 populations were harvested at physiological maturity and stored in a ventilated grain bin until threshing. After threshing, seed were moved to a temperature and humidity controlled facility (Sears plant growth facility, University of Missouri, 4° and 39% relative humidity) for a minimum of four weeks prior to subsequent analyses, in order to minimize differences in seed moisture content between samples. Samples were allowed to warm in a ∼25° lab overnight and then total seed oil content (percent w/w) was estimated using NMR spectroscopy (La ). A benchtop Oxford MQC NMR machine (Oxford Instruments, Oxford, UK) was used to analyze 5 g seed samples. Total seed oil content is expressed as the percentage of total weight of the seeds. As part of an independent project, seed from 200 RILs produced in SF 2014 and RB 2015 were scanned on a NIRS monochromator model FOSS 6500 (FOSS North America, Eden Prairie, MN) using the transport quarter cup (dimension 97 × 55 mm). The NIRS reflectance (R) spectra were collected at 2 nm intervals in the NIRS region of 400–2500 nm at room temperature. Calibrations previously developed by FOSS were used to estimate moisture content. In 200 samples from multiple plots stored in this manner, seed moisture content was found to be minimally variable (7.7 ± 0.4% SD). Seed samples were also analyzed for seed fatty acid composition using the derivatized lipid gas chromatography method (Bilyeu ). In 2014, two samples of three seeds each per plot were analyzed, and in 2015, four or five individual seeds per plot were analyzed. Seeds were crushed then extracted in 1 ml chloroform-hexane-methanol (8:5:2, v/v/v) overnight. Derivatization of 150 μl extracted oils was done with 75 μl methylating reagent (0.25 M methanolic sodium methoxide-petroleum ether-ethyl ether, 1:5:2, v/v/v/). Seed samples were then diluted with hexane to 1 ml. An Agilent (Palo Alto, CA) series 6890 capillary gas chromatograph fitted with a flame ionization detector (275°) was used with an AT-Silar capillary column (Alltech Associates, Deerfield, IL). Standard fatty acid mixtures (Animal and Vegetable Oil Reference Mixture 6, AOACS) were used as reference standards.

Quantification of field traits

Plant height was measured in centimeters on three separate plants within each plot (three plots per genotype) when 95% of the pods showed mature pod color.

Statistical analyses

ANOVA was performed using the “anova” and “aov” functions in R (R Foundation for Statistical Computing, Vienna, Austria 2008) to determine if there were significant differences between any locations or blocks. Pearson’s correlation coefficients between phenotypes were determined using PROC CORR in SAS (SAS Institute Inc., 2002). Best linear unbiased predictors (BLUPs) were calculated using the lme4 package in R (R Foundation for Statistical Computing). All effects were considered random. Heritability was calculated in the broad-sense (h2) as follows:where h2 = heritability, σ2g = genotypic variance among RILs, σ2ge = genotype × environment (location), σ2 = error variance, r = number of reps, and n = number of environments.

DNA isolation and gGBS

DNA was isolated from ∼40 mg of lyophilized leaf tissue using the DNeasy Plant Mini kit (QIAGEN, Valencia, CA) according to the manufacturer’s instructions. DNA samples from 190 lines from populations 2 and 3 were submitted to the Institute for Genomic Diversity (IGD) at Cornell University. GBS libraries were prepared as previously described (Elshire ; Swarts ) using the ApeKI enzyme, DNA ligase, and appropriate Illumina adapters. The IGD staff performed the library construction, read mapping, and downstream SNP calls. A total of 455,924,779 reads were produced, 428,671,333 of which were used for downstream analysis. A total of 61.5% of the reads were successfully aligned to unique positions in the G. max Wm82.a2.v1 Williams 82 reference sequence (Schmutz , http://phytozome.jgi.doe.gov/pz/portal.html#!info?alias=Org_Gmax) using the BWA 0.7.8-r455 program (Li and Durbin 2009). SNPs were called using the TASSEL 5.0 pipeline (Bradbury ) resulting in 33,728,018 tags for all 190 samples. A total of 27,672 SNPs were identified, with an average SNP coverage of 10.98-fold. SNPs were filtered using TASSEL 5.0 to exclude those with > 20% missing data, and to restrict allele frequency to between 0.2 and 0.8, bringing the SNP count down to 5423. Lastly, a chi-square test was performed to remove any severely distorted markers (at F4:5), resulting in a final count of 2977 high-quality markers for mapping. Parental allele assignment and imputation were performed using the Window LD function of FsFHap (Swarts ) incorporated into TASSEL 5.0.

Linkage map construction

Linkage map construction was performed in R/qtl (R Foundation for Statistical Computing) using the 2977 polymorphic, high-quality SNP markers obtained from GBS (Supplemental Material, Figure S1, File S1, File S2, and Table 1). Genetic distances between markers were determined using the est.map function, with an estimated genotyping error rate of 0.01. Iterative evaluation of chromosomes with excessive map distances (> 200 cM) was performed using droponemarker and est.map functions. Finally, all chromosomes were evaluated for correct marker order using the ripple function, using five markers at a time. No better order was identified than that assigned by the position in the Wm82.a2.v1 assembly.
Table 1

Details on final GBS linkage map for A6 × 194D cross

ChromosomeMarker #Length (cM)Average Spacing (cM)Maximum Spacing (cM)
112367.50.615.5
2190131.80.713.1
3115125.71.173.7
4197136.00.752.5
5145117.80.819.2
6233141.10.616.4
76166.01.118.0
895162.71.751.0
919662.60.36.9
10219122.60.612.7
1138111.93.065.6
1249125.52.662.0
13180126.20.733.0
1415185.90.630.3
15203107.40.525.9
16102118.81.262.0
1716545.30.38.1
18190126.00.749.8
19191105.40.627.8
20134115.20.916.9
Overall29772201.40.773.7

Covariate analysis and QTL mapping

BLUP phenotypes were used for a one-dimensional QTL scan using the SIM method to determine the most significant marker associated with seed stearic acid for use in covariate analysis. Marker c14.loc58 (chromosome 14 @ 58 cM, Gm14:42206409) was identified as significantly associated with stearic acid and was subsequently designated as an interactive covariate using the pull.geno and cbind functions in R/qtl. Further analysis using this interactive covariate was performed using BLUPs values and the SIM method through the “scanone” function. QTL analysis was performed using the qtl package in R (Broman and Sen 2009). Because R/qtl assumes an F2, the cross file was first converted into an F4 using the BCSFT tool (Shannon ). Conditional genotypic probabilities were calculated using an error probability rate of 0.01, due to the inherent likelihood of genotyping errors of the GBS method (Elshire ). Single QTL analysis was run using the Standard Interval mapping method as described in Broman and Sen (2009). Significance thresholds were calculated using 1000 permutation tests for each phenotype, but the extremely large effects from the sacpd-c deletion resulted in substantially elevated LOD scores for oil related traits; in general, the only QTL above threshold was the sacpd-c deletion. As a result, we set a LOD significance threshold of LOD > 4. Interactions between QTL and additional QTL were examined using the “addint” and “addqtl” functions. Final model fit was determined by dropping one QTL at a time using the “fitqtl” function. QTL were refined using the “refineqtl” function and fit to obtain a final model. Confidence intervals for QTL locations are presented as 1.5-LOD intervals. QTL were evaluated for overlap with selected literature (listed in results) and historical QTL using genetic/physical map tools available online (http://soybase.org/search/qtllist_by_symbol.php, accessed 01-01-2016 through 05-30-2016). The effect of each QTL was estimated in R/QTL using averaged phenotypic data over the three study locations via the effectplot command, following sim.geno with 1000 draws and an error probability of 0.01.

Data availability

The authors state that all data necessary for confirming the conclusions presented in the article are fully represented either within the article or within Supplemental Materials.

Results

Phenotypic distributions

Analysis of variance showed that there were no significant differences between population 2 and population 3, so we conclude that there is no significant cytoplasmic effect on any of the traits evaluated in this study. Full results of the statistical analysis are presented in Table 2. There was significant genetic variation among RILs for all traits (Table 2), as well as a significant location effect for all traits (P < 0.05). The two parents, A6 and 194D, showed significant differences in all traits examined except oleic acid (P < 0.05). There were highly significant line and environment effects for all traits (P < 0.001) and a genotype × environment interaction was significant for all traits except palmitic acid and plant height (P < 0.01). Heritability estimates were moderately high for the five main fatty acid species, the highest value being h2 = 0.89 for palmitic acid.
Table 2

Statistical analysis of seven phenotypic traits in soybean

TraitSource of VariationMSFh2
Plant heightLocation16050.1584.85***0.54
Line387.272.05***
Location × line208.221.10
Rep874.844.62**
Error189.16
Total seed oilLocation146.5588.55***0.28
Line3.071.86***
Location × line2.571.55***
Rep1.971.19
Error1.66
PalmiticLocation12.8334.25***0.89
Line2.396.38***
Location × line0.330.89
Rep0.170.46
Error0.38
StearicLocation300.9448.50***0.88
Line58.789.47***
Location × line9.371.51***
Rep0.840.14
Error6.20
OleicLocation1278.44216.52***0.81
Line42.497.20***
Location × line9.951.69***
Rep15.352.60
Error5.90
LinoleicLocation925.85112.96***0.73
Line53.096.48***
Location × line16.932.07***
Rep10.571.29
Error8.20
LinolenicLocation337.56666.36***0.72
Line2.274.48***
Location × line0.791.55***
Rep1.312.58
Error0.51

P < 0.05; ** P < 0.01; *** P < 0.001. MS, mean sum of squares; F, value; h, broad sense heritability.

P < 0.05; ** P < 0.01; *** P < 0.001. MS, mean sum of squares; F, value; h, broad sense heritability. Phenotypic distributions and parent values are shown in Figure S2 and Table 3. All traits were approximately normally distributed. Transgressive segregation was noted for all traits of interest, particularly fatty acid traits, which suggested the contribution of multiple genes and/or multiple genes of small effect.
Table 3

Descriptive statistics of traits of the RIL populations and the parents

RILsParents
TraitAverageaRangeSDbA6194D
BR 2014Height (cm)49.3122.50–87.5011.1346.25*64.17
Total seed oil (%)18.6315.04–21.441.1315.8419.81
Palmitic (% seed oil)8.987.14–11.160.678.709.62
Stearic (% seed oil)15.127.83–26.113.4423.88*9.97
Oleic (% seed oil)2115.17–33.293.3818.5718.05
Linoleic (% seed oil)48.0633.09–55.403.8342.60**54.57
Linolenic (% seed oil)6.854.47–9.370.866.25**8.05
2014 SFHeight (cm)63.8231.67–86.6711.014068.75
Total seed oil (%)17.314.16–19.571.1511.8518.59
Palmitic (% seed oil)9.448.42–10.750.519.139.77
Stearic (% seed oil)14.349.66–20.992.8722.32*11.77
Oleic (% seed oil)1815.13–23.431.7516.9218.67
Linoleic (% seed oil)49.944.46–54.422.4343.51*48.92
Linolenic (% seed oil)8.326.82–10.300.698.138.12
RB 2015Height (cm)56.838.00–91.119.4260.7578.94
Total seed oil (%)17.7914.91–19.470.7318.5517.02
Palmitic (% seed oil)9.197.73–11.300.688.769.09
Stearic (% seed oil)15.916.03–24.763.3520.85**12.67
Oleic (% seed oil)21.6917.24–33.562.8418.76**22.19
Linoleic (% seed oil)46.7634.77–53.353.5144.88*49.13
Linolenic (% seed oil)6.454.634–7.870.586.756.91

Parent values are significantly different at P < 0.05; ** parent values are significantly different at P < 0.01. RIL, recombinant inbred line; BR, Bradford Research Center; SF, South Farm; RB, Hinkson Field.

Full details on individual RILs are in Supplemental Materials.

SD, standard deviation.

Parent values are significantly different at P < 0.05; ** parent values are significantly different at P < 0.01. RIL, recombinant inbred line; BR, Bradford Research Center; SF, South Farm; RB, Hinkson Field. Full details on individual RILs are in Supplemental Materials. SD, standard deviation.

Correlations between traits

Significant correlations (P < 0.01) were found between several traits with the strongest being between the five fatty acid species (P < 0.0001). Pearson’s correlation coefficients and associated P-values are presented in Table S1. Significant coefficients between traits ranged between 0.10 and −0.81. The majority of significant correlations were detected in multiple environments for each trait. Stearic acid was significantly associated with all fatty acid traits, including total seed oil, in at least one environment.

Results using BLUPs

In order to attempt to lower the residual phenotypic variance even further, we chose to use the most significant marker on chromosome 14 as a covariate in all subsequent analyses. Using the same BLUPs as described above, we ran the SIM method in order to pick out the most significant marker associated with stearic acid on chromosome 14, (Figure 1). Once this marker was determined, the SIM was rerun with marker c14.loc58 as an interactive covariate. Using this method, the LOD scores and R2 values were the highest, and an additional QTL was identified on chromosome 2 that no other method had identified previously (Figure 2). The 1.5-LOD intervals were also much narrower using this method than with SIM alone (the largest interval was only 5 cM wide for the QTL on chromosome 4) (Table 4).
Figure 1

QTL results for the seed stearic acid trait using BLUPs and the SIM method. BLUPs, best linear unbiased predictors; LOD, logarithm of the odds; QTL, quantitative trait loci; SIM, standard interval mapping.

Figure 2

QTL results using BLUPs and the most significant marker associated with stearic acid as a covariate in SIM. LOD threshold is shown in red at LOD = 4. BLUPs, best linear unbiased predictors; LOD, logarithm of the odds; QTL, quantitative trait loci; SIM, standard interval mapping.

Table 4

SIM and SIM + covariate analysis for stearic acid-related QTL using BLUPs

TraitMethodChromosomeSNP Marker NucleotidePositionLODR21.5 Interval
Stearic acidSIM2Gm2:1555287978.004.384.444–111
4Gm4:1831299376.004.084.172–83
14Gm14:4220640949.0034.2553.248–54
Stearic acidSIM + Cov2Gm2:594691239.0014.6313.7538–42
2Gm2:1555287979.799.057.978–80
4Gm4:1831299375.907.236.173–78
14c14.loc5858.0040.8456.857–59

LOD, logarithm of the odds; SIM, SIM, standard interval mapping; Cov, covariate analysis.

QTL results for the seed stearic acid trait using BLUPs and the SIM method. BLUPs, best linear unbiased predictors; LOD, logarithm of the odds; QTL, quantitative trait loci; SIM, standard interval mapping. QTL results using BLUPs and the most significant marker associated with stearic acid as a covariate in SIM. LOD threshold is shown in red at LOD = 4. BLUPs, best linear unbiased predictors; LOD, logarithm of the odds; QTL, quantitative trait loci; SIM, standard interval mapping. LOD, logarithm of the odds; SIM, SIM, standard interval mapping; Cov, covariate analysis.

QTL analysis

A total of 21 QTL were detected for seven traits on 9 of the 20 chromosomes. We observed colocation of QTL for several traits. The same significant region on chromosome 14 was detected for all oil traits examined (total seed oil, palmitic, stearic, oleic, linoleic, linolenic, and linolenic; see Discussion). The same QTL was detected on chromosome 2 for palmitic, stearic, and linolenic acids. A region on chromosome 4 was detected for stearic, oleic, linoleic, and linolenic acids. QTL were considered to be colocated when there was significant overlap between the 1.5-LOD intervals for each QTL. Phenotypic variation explained by each QTL ranged from 6.12% (q.4s) to 56.76% (q.14s) (Table 5). A6 alleles contributed to an increase in phenotype for 12 out of 21 QTL, while 194D alleles contributed to an increase in phenotype for 11 out of 21 QTL.
Table 5

Summary of QTL results for seven traits in soybean A6 × 194D cross

TraitQTLChrSNP Marker NucleotidePosition (cM)1.5 IntervalLODR2 (%)QTL Effect
Palmitic acidq.2p2Gm2:713845144.5838–1285.5510.49+0.36%
q.5p5Gm5:14206862.480–86.7913.06+0.44%
q.14p14Gm14:1250661544.9142–586.8913.26+0.44%
Stearic acidq.2.1s2Gm2:59469123938–4214.6313.75−1.16%
q.2.2s2Gm2:1555287979.7978–809.057.86−0.52%
q.4s4Gm4:1831299375.9973–787.236.12+1.24%
q.14s14Gm14:422064095857–5940.8456.76−5.00%
Oleic acidq.4o4Gm4:1949679676.3874–806.5712.17+1.37%
q.14o14Gm14:322904525149–5413.728.05+2.9%
Linoleic acidq.4e4Gm4:3235100778.8873–846.7414.75−2.31%
q.14e14Gm14:349185005445–634.489.49+2.02%
Linolenic acidq.2n2Gm2:1555287979.7978–817.0510.4+0.33%
q.4n4Gm4:2733218078.1276–847.0410.39−0.21%
q.8n8Gm8:44116750150148–1526.9310.2−0.15%
q.10n10Gm10:4531079889.6881–988.1312.16−0.42%
q.13n13Gm13:41141355118.19108–1256.489.48−0.17%
q.14n14Gm14:3237263551.2744–535.527.97−0.25%
Plant heightq.10h10Gm10:446393599088–949.7920.9−6.54 cm
q.19h19Gm19:49398020100.5191–1045.210.42+5.03 cm
Oilq.9oi9Gm9:68015131615–176.8514.43+0.28%
q.14oi14Gm14:422064096249–689.4320.58+0.71%

“QTL Effect” represents the impact of converting homozygous A6 alleles to homozygous 194D alleles. QTL, quantitative trait loci; Chr, chromosome; SNP, single nucleotide polymorphism; LOD, logarithm of the odds.

“QTL Effect” represents the impact of converting homozygous A6 alleles to homozygous 194D alleles. QTL, quantitative trait loci; Chr, chromosome; SNP, single nucleotide polymorphism; LOD, logarithm of the odds.

Total seed oil QTL

Two QTL were detected for total seed oil, one of minor effect on chromosome 9, accounting for 14.43% of the phenotypic variation, and one major effect QTL on chromosome 14, which accounted for 20.58% of the variation. For both loci, individuals homozygous for the 194D allele showed the highest total seed oil content. Mansur detected an association between markers on chromosome 9 and total seed oil whose QTL intervals overlapped with q.9oi. There are several QTL published for total seed oil on chromosome 14 within 10 cM of q.14oi on the soybean consensus map (Chen ; Csanadi ; Eskandari ; Liang ; Qi ).

Palmitic acid (C16:0) QTL

Three minor QTL were detected for palmitic acid. QTL on chromosome 2, 5, and 10 each accounted for 10–13% of the phenotypic variation. For all three loci, homozygosity for the 194D allele increased palmitic acid content. Q.2p mapped to the same genomic region as fatty acid desaturase gene FAD3-B (Glyma02g39230) (Bilyeu ; Gillman and Bilyeu 2012). Two QTL for palmitic acid have been published in this region (Panthee ; Reinprecht ). Two QTL for palmitic acid have also been published in the same region of chromosome 5 as q.5p (Li ; Wang ). Q.5p mapped to the same region as FATB1-A (Glyma05g26110), a keto-acyl ACP synthase gene (Cardinal ). The map location of q.14p corresponded to SACPD-C (Glyma14g27990), a stearoyl-acyl carrier protein desaturase gene. No positional confirmation exists for this particular region; however, two QTL have been reported that are within 10 cM (Diers and Shoemaker 1992).

Stearic acid (C18:0) QTL

Four QTL were detected for seed stearic acid content; three minor effect QTL, two on chromosome 2, and one on chromosome 4, as well as one major effect QTL on chromosome 14. For the QTL on chromosomes 2 and 14 (Figure 3), homozygosity for the A6 allele contributed to elevated stearic acid content. Q.14s had an overwhelming major effect, which contributed 56.76% of the total variation in seed stearic acid content. Q.14s also mapped to the same genomic region as SACPD-C, which is also the same result we saw when examining palmitic acid. The two QTL identified on chromosome 2 accounted for 13.75 and 7.86% of the variation in seed stearic acid content. QTL associated with stearic acid have been previously reported on chromosome 2 within 5 cM of q.2.1s (Li ; Xie ). On chromosome 4, the 194D allele contributed to elevated stearic acid. Q.2s also interacted with q.14s, which contributed another 9.97% of the phenotypic variation. Q.4s may be a novel QTL, as no marker associations for seed stearic acid content have been previously reported on chromosome 4.
Figure 3

Interaction plot between q.2.1s q.2.2s and q.14s. Genotype “AA” indicates homozygosity for A6 alleles, allele “BB” indicates homozygosity for alleles from 194D. Middle bar indicates mean of genotypes, top and bottom bars indicate ± 1 SEM. Parental stearic acid contents averaged over all locations were: A6 = 22.4 ± 3.8%, 194D = 11.4 ± 1.6%. Chr, chromosome.

Interaction plot between q.2.1s q.2.2s and q.14s. Genotype “AA” indicates homozygosity for A6 alleles, allele “BB” indicates homozygosity for alleles from 194D. Middle bar indicates mean of genotypes, top and bottom bars indicate ± 1 SEM. Parental stearic acid contents averaged over all locations were: A6 = 22.4 ± 3.8%, 194D = 11.4 ± 1.6%. Chr, chromosome.

Oleic and linoleic acid (C18:1, C18:2) QTL

QTL for oleic acid content were detected on chromosomes 4 and 14, which accounted for 12.17 and 28.05% of the phenotypic variation, respectively. For both QTL, homozygosity for the 194D allele contributed to higher oleic acid levels. Nearly identical QTL for linoleic acid content were detected on chromosomes 4 and 14, which accounted for 14.75 and 9.49% of the phenotypic variation, respectively. On chromosome 4, individuals homozygous for the A6 allele had the highest linoleic acid content, while on chromosome 14, individuals homozygous for the 194D had the highest linoleic acid content. As with palmitic and stearic acid, the chromosome 14 QTL (q.14o and q.14e) aligned with SACPD-C deletion from A6. QTL for seed oleic as well as linoleic acid content have been identified in previous studies within 10 cM of q.14o and q.14e chromosome 14 (Diers and Shoemaker 1992; Kim ). Q.4o and q.4e on chromosome 4 were effectively identical to q.4s for seed stearic acid and, like stearic acid, no QTL for oleic or linoleic acid have been previously published on this chromosome.

Linolenic acid (C18:3) QTL

Six QTL were detected for linolenic acid content on chromosomes 2, 4, 8, 10, 13, and 14, more than were detected for any other trait. All six QTL had minor effects, ranging from 7.97 to 12.16% of the phenotypic variation explained (60.6% total). Two of the QTL, q.8n and q.13n interacted epistatically (Table 6), and accounted for an additional 7.45% of the variation in linolenic acid content. Though all the QTL were of minor effect, we saw some positional overlap with published QTL for seed linolenic acid. Li identified an association between Satt537 and seed linolenic acid content on chromosome 2 at nearly the same map position as q.2n (within 5 cM). In the same study, a QTL was identified on chromosome 4, however it was not in the same region as q.4n. One QTL for linolenic acid has been published on chromosome 8 within 5 cM of q.8n (Bachlava ). A QTL on chromosome was reported (Shibata ) in an adjacent region to q.10n on chromosome 10, not likely associated with FAD2-1a, as our interval was > 20 cM away on the genetic map. Lastly, a single QTL was previously reported on chromosome 13, however it was on the opposite side of the chromosome as q.13n (Hyten ). Several markers have been reported to be associated with alterations in seed linolenic acid on chromosome 14, however they were all greater than 10 cM away from SACPD-C based on the consensus map (Bachlava ; Reinprecht ; Spencer ; Xie ).
Table 6

Epistatic interactions between QTL

TraitQTLChromosomeMarker 1Marker 2LODR2P-value
Stearic acidq.2.1s × q.14s2 × 14c2.loc39c14.loc5811.149.971.07E−09
Linolenic acidq.8n × q.13n8 × 13c8.loc150c13.loc118.25.187.450.000235

QTL, quantitative trait loci; LOD, logarithm of the odds.

QTL, quantitative trait loci; LOD, logarithm of the odds.

Plant height QTL

QTL of major and minor effect for plant height were detected on chromosomes 10 and 19, respectively. Q.10h accounted for 20.90% of the variation, while q.19h accounted for 10.42% of the phenotypic variation. Q.19h mapped to nearly the same genomic region as the known maturity gene E3 (Glyma19g224200) and linked gene Dt1 (Glyma19g194300) (Watanabe ; Tian ). There were no published QTL near q.10h; however, at least one report (Wang ) identified two chromosome 10 microsatellite markers associated with plant height within 10 cM of q.10h.

Discussion

QTL mapping method selection in the soybean literature

The ability to detect QTL is entirely dependent on population structure and the level of genetic diversity within that population, as well as the number and quality of markers used in the analysis. Considering the multitude of available mapping strategies and software, as well as usage of individualized linkage maps of differing genetic lengths, it is difficult to compare historical mapping results with our study. However, several of our QTL in this study were positionally confirmed. In the public genetic database for soybean, there are very few QTL mapping studies published that analyzed seed stearic acid, in relation to the number of QTL published for other oil composition traits (www.soybase.org, accessed 05-01-2016). Four of the studies utilized the CIM method and successfully detected associations between markers and the stearic acid trait (Hyten ; Panthee ; Reinprecht ; Wang ). All other stearic acid QTL published in the database were determined by simple marker regression (Diers and Shoemaker 1992; Xie ).

Difficulties in detecting QTL

A major confounding effect in our study was the extreme differences between the parent line A6, which exhibits a remarkably large deletion on chromosome 14 encompassing the entirety of the SACPD-C gene (as well as ∼142 other genes), and the missense mutation in 194D (Gillman ). The extreme phenotype values paired with the large deletion swelled the LOD score for the QTL on chromosome 14; LOD = 40.84 for stearic acid, which is extraordinarily high in the context of quantitative trait mapping. A QTL of such major effect makes it extremely difficult to detect other, relatively minor effect QTL. In addition, because of the genomic deletion on chromosome 14, any markers on chromosome 14 that showed a significant association with fatty acid traits flank the deletion. This experiment is the first to conduct QTL mapping using SACPD-C mutants, and as SACPD-C is a known factor in determining seed stearic acid levels, we can safely make the association between q.14p, q.14s, q.14o, q.14e, q.14n, and q.14oi and the genomic deletion that contains SACPD-C. The QTL on chromosome 14 was the same as discussed for palmitic, stearic, oleic, and linoleic acids, and is almost certainly due to the differences between the missense SACPD-C in 194D and the deletion in A6. Q.14p, q.14s, q.14o, q.14e, q.14n, and q.14oi correspond to SACPD-C (Glyma14g27990). Ultimately, we chose to continue with the final method described, SIM with covariate analysis using BLUPs for our analysis. By calculating BLUPs, we were able to take environmental variance into account, which strengthened the association between each marker and the stearic acid phenotype. We sought to further reduce the residual phenotypic variation by conducting covariate analysis. When a locus of strong effect exists in a population, it can mask the associations between any small effect loci and confound understanding of genetic architecture behind the phenotype (Xu 2003; Mackay 2001; Broman and Sen 2009). Using this method, we successfully identified an additional 20 QTL in our study, and strengthened the association of QTL with phenotypes. In this way, we were able to dampen the effects of the chromosome 14 QTL, thereby strengthening the associations between other markers and each trait. We noted a pronounced effect of the A6 chromosomal deletion on seed stearic acid: ∼18% for all lines with the chromosome 14 deletion, + ∼5% stearic acid relative to lines with the 194D missense mutation. This difference was greater than expected based on our previous studies with other SACPD-C genomic deletions of smaller size (Gillman ). Although unproven, our results strongly suggest that the A6 genomic deletion, which encompasses SACPD-C, also contains another gene (or genes) whose loss also elevates stearic acid content from ∼12.5 to ∼18%. Identification of such gene(s) is impossible using our mapping population and, as the A6 chromosomal deletion contains at least 142 additional genes, it is a daunting task for reverse genetic approaches.

Minor effect QTL for other fatty acid traits and plant height

We detected QTL for several annotated and molecularly characterized fatty acid biosynthetic genes as well as one major plant height/maturity gene. Although the significant marker may differ slightly for each trait due to maximum LOD optimization in model selection, for the sake of discussion, the QTL on chromosome 2 for palmitic, stearic, and linolenic acids can be considered to be identical because their support intervals overlap very significantly. Q.2p may be near the map position of FAD3-B (Glyma02g39230), which participates in the desaturation of C18:2 (linoleic acid) to C18:3 (linolenic acid) (Bilyeu ; Gillman and Bilyeu 2012); however, our interval for this particular QTL was quite wide so the association is tenuous. This same level of overlap was seen with the chromosome 14 QTL found for all five fatty acid species, as well as for total seed oil. We expected to see this high level of colocation because of the close genetic association among fatty acid traits due to genes that participate in the fatty acid biosynthetic pathway (|r| values ranging from 0.23 to 0.81). This level of overlap is also highly indicative of the strength and reliability of our QTL results. Q.5p is near the genomic region of FATB1-A (Glyma05g12300), which catalyzes the conversion of 16:0-ACP to C16:0 (palmitic acid) (Cardinal ). Q.19h is almost certainly E3 (Glyma19g224200), which affects flowering time in soybean in combination with the other “E” genes to determine flowering time/plant maturity (Watanabe ). In the context of dissection of genetic architecture of complex traits, these results are purely confirmatory, i.e., they serve to provide confidence in our ability to detect QTL.

Environmental influence and heritability of fatty acid composition

It is a strongly accepted fact that seed oil traits are highly dependent on environmental conditions during seed fill, particularly temperature (Kumar ). A6’s stearic acid phenotype was much lower in the three Missouri environments we studied (23.4 ± 4.4, 20.9 ± 2.9, and 22.3 ± 5.5%) as compared with the original report of 28.1% stearic acid (Hammond and Fehr 1983; Lundeen ), and this difference may be due to different environmental conditions in Northern Missouri compared to Ames, Iowa during seed fill/maturation. Fatty acid traits are relatively simple from a genetic perspective, and the target of ∼20% stearic acid is almost reached with three QTL from A6 (Figure 3), although we noted relatively large environmental variation (Table 2). However, heritability values (h2) for fatty acid traits in our study were quite strong (0.72 < h2 < 0.89), and twelve out of the 176 lines examined exhibited stearic acid levels > 20% consistently across environments, which suggests that the target may be achievable, at least in certain environments.

Impact of the chromosome 14 deletion on yield, seed traits, and plant morphology

Early studies have definitively demonstrated that A6 alleles for elevated stearic acid are also associated with decreased seed yield and unacceptable agronomic performance (Lundeen ), presumably due to the large genomic deletion responsible for elevated stearic acid. In our recent work, we demonstrated that loss of SACPD-C activity is inextricably correlated with nodule morphological abnormalities, early nodule senescence, and an impaired ability to fix nitrogen (Gillman ; Krishnan ). Although we did not measure seed yield in our present study, we noted a decrease in seed oil content in lines that inherited the A6 chromosome 14 deletion (Table 5). We saw no correlation between plant height and seed stearic acid content in this study, and our goal was a rapid QTL analysis for seed composition rather than a true seed yield study. Nevertheless, it is expected that the large chromosome 14 deletion will result in an unavoidable yield penalty as has been previously described (Lundeen ). In the continued effort to develop a high stearic acid soybean variety that meets the current breeding target (> 20% stearic acid), the genomic deletion present in A6 provided valuable genetic information, but deletions/mutations from A6 per se will almost certainly not be of direct use in development of germplasm or cultivars with > 20% seed stearic acid and acceptable seed yield.

Identification of minor effect QTL for stearic acid

We identified three additional minor effect seed stearic acid QTL (Table 5), which do not appear to overlap with deletions identified in our previous work (Gillman ), and it is tempting to report genes within the other QTL intervals whose annotations would suggest they may be associated with oil biosynthesis. However, the confidence intervals for our QTL are still quite large and the effects are relatively small. Further research would be needed to fine map the minor effect QTL. Moreover, the vast majority of gene annotation in soybean is almost completely inferred based on protein similarity to Arabidopsis genes (http://soybase.org/genomeannotation/, accessed 05-01-2016), and even in Arabidopsis only about 40% of enzyme and transporter genes have strong evidence for function (Niehaus )

Conclusions

We evaluated the genetic architecture for extremely elevated stearic acid (> 20%) through GBS/QTL analysis of RILs produced by reciprocal crosses between two mutant lines. Numerous minor effect QTL were identified for soybean fatty acids, total oil, and for plant height. In contrast to a priori expectation, a major effect QTL for stearic acid corresponded to a large genomic deletion from line A6, which explained ∼57% of the phenotypic variance for seed stearic acid. However, the loss of SACPD-C activity per se boosts seed stearic acid from ∼3% stearic acid to ∼12.5%. These results suggest that the large genomic deletion in A6 contains both SACPD-C and an unidentified gene; the loss of both in line A6 increases seed stearic acid content to ∼18% of seed oil. A combination of the major QTL and two minor effect QTL was able to boost seed stearic acid to above 20% of seed oil and meet the market target, but this elevation is expected to occur at a significant cost to seed yield. These genetic results will help guide and inform breeding efforts to increase soybean oil oxidative stability.

Supplementary Material

Supplemental material is available online at www.g3journal.org/lookup/suppl/doi:10.1534/g3.116.035741/-/DC1. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file.
  25 in total

Review 1.  The genetic architecture of quantitative traits.

Authors:  T F Mackay
Journal:  Annu Rev Genet       Date:  2001       Impact factor: 16.830

2.  Theoretical basis of the Beavis effect.

Authors:  Shizhong Xu
Journal:  Genetics       Date:  2003-12       Impact factor: 4.562

3.  TASSEL: software for association mapping of complex traits in diverse samples.

Authors:  Peter J Bradbury; Zhiwu Zhang; Dallas E Kroon; Terry M Casstevens; Yogesh Ramdoss; Edward S Buckler
Journal:  Bioinformatics       Date:  2007-06-22       Impact factor: 6.937

4.  Ribozyme termination of RNA transcripts down-regulate seed fatty acid genes in transgenic soybean.

Authors:  Tony Buhr; Shirley Sato; Farida Ebrahim; Aiqiu Xing; You Zhou; Michelle Mathiesen; Bruce Schweiger; Anthony Kinney; Paul Staswick
Journal:  Plant J       Date:  2002-04       Impact factor: 6.417

Review 5.  Cardiovascular disease risk of dietary stearic acid compared with trans, other saturated, and unsaturated fatty acids: a systematic review.

Authors:  J Edward Hunter; Jun Zhang; Penny M Kris-Etherton
Journal:  Am J Clin Nutr       Date:  2009-11-25       Impact factor: 7.045

6.  Effect of a novel mutation in a Δ9-stearoyl-ACP-desaturase on soybean seed oil composition.

Authors:  Paul Ruddle; Rebecca Whetten; Andrea Cardinal; Robert G Upchurch; Lilian Miranda
Journal:  Theor Appl Genet       Date:  2012-09-09       Impact factor: 5.699

7.  Identification of putative QTL that underlie yield in interspecific soybean backcross populations.

Authors:  D Wang; G L Graef; A M Procopiuk; B W Diers
Journal:  Theor Appl Genet       Date:  2003-09-19       Impact factor: 5.699

8.  Effect of Δ9-stearoyl-ACP-desaturase-C mutants in a high oleic background on soybean seed oil composition.

Authors:  Paul Ruddle; Rebecca Whetten; Andrea Cardinal; Robert G Upchurch; Lilian Miranda
Journal:  Theor Appl Genet       Date:  2013-11-06       Impact factor: 5.699

9.  Map-based cloning of the gene associated with the soybean maturity locus E3.

Authors:  Satoshi Watanabe; Rumiko Hideshima; Zhengjun Xia; Yasutaka Tsubokura; Shusei Sato; Yumi Nakamoto; Naoki Yamanaka; Ryoji Takahashi; Masao Ishimoto; Toyoaki Anai; Satoshi Tabata; Kyuya Harada
Journal:  Genetics       Date:  2009-05-27       Impact factor: 4.562

10.  Plasma cholesterol-predictive equations demonstrate that stearic acid is neutral and monounsaturated fatty acids are hypocholesterolemic.

Authors:  S Yu; J Derr; T D Etherton; P M Kris-Etherton
Journal:  Am J Clin Nutr       Date:  1995-05       Impact factor: 7.045

View more
  4 in total

1.  Diversity in Grain Amaranths and Relatives Distinguished by Genotyping by Sequencing (GBS).

Authors:  Xingbo Wu; Matthew W Blair
Journal:  Front Plant Sci       Date:  2017-11-17       Impact factor: 5.753

2.  Linkage analysis and residual heterozygotes derived near isogenic lines reveals a novel protein quantitative trait loci from a Glycine soja accession.

Authors:  Yia Yang; Thang C La; Jason D Gillman; Zhen Lyu; Trupti Joshi; Mariola Usovsky; Qijian Song; Andrew Scaboo
Journal:  Front Plant Sci       Date:  2022-07-29       Impact factor: 6.627

3.  GBS-derived SNP catalogue unveiled wide genetic variability and geographical relationships of Italian olive cultivars.

Authors:  Nunzio D'Agostino; Francesca Taranto; Salvatore Camposeo; Giacomo Mangini; Valentina Fanelli; Susanna Gadaleta; Monica Marilena Miazzi; Stefano Pavan; Valentina di Rienzo; Wilma Sabetta; Luca Lombardo; Samanta Zelasco; Enzo Perri; Concetta Lotti; Elena Ciani; Cinzia Montemurro
Journal:  Sci Rep       Date:  2018-10-26       Impact factor: 4.379

4.  Genotyping-by-Sequencing of Gossypium hirsutum Races and Cultivars Uncovers Novel Patterns of Genetic Relationships and Domestication Footprints.

Authors:  Shulin Zhang; Yaling Cai; Jinggong Guo; Kun Li; Renhai Peng; Fang Liu; Jeremy A Roberts; Yuchen Miao; Xuebin Zhang
Journal:  Evol Bioinform Online       Date:  2019-11-20       Impact factor: 1.625

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.