Literature DB >> 31442274

Identification of SNP loci and candidate genes related to four important fatty acid composition in Brassica napus using genome wide association study.

Qianglong Zhu1, Graham J King2, Xingyue Liu1, Nan Shan1, Priyakshee Borpatragohain2, Abdul Baten2, Putao Wang1, Sha Luo1, Qinghong Zhou1.   

Abstract

Rapeseed oil (canola, Brassica napus L.) is an important healthy vegetable oil throughout the world, the nutritional and economical value of which largely depends on its seed fatty acid composition. In this study, based on 201,187 SNP markers developed from the SLAF-seq (specific locus amplified fragment sequencing), a genome wide association study of four important fatty acid content traits (erucic acid, oleic acid, linoleic acid and linolenic acid) in a panel of 300 inbred lines of rapeseed in two environments (JXAU and JXRIS) was carried out. A total of 148 SNP loci significantly associated with these traits were detected by MLM model analysis respectively, and 30 SNP loci on A08 and C03 chromosomes were detected in three traits of erucic acid, oleic acid and linoleic acid contents simultaneously. Furthermore, 108 highly favorable alleles for increasing oleic acid and linoleic acid content, also for decreasing erucic acid content simultaneously were observed. By a basic local alignment search tool (BLAST) search with in a distance of 100 Kb around these significantly SNP-trait associations, we identified 20 orthologs of the functional candidate genes related to fatty acid biosynthesis, including the known vital fatty acid biosynthesis genes of BnaA.FAE1 and BnaC. FAE1 on the A08 and C03 chromosomes, and other potential candidate genes involving in the fatty acid biosynthesis pathway, such as the orthologs genes of FAD2, LACS09, KCS17, CER4, TT16 and ACBP5. This study lays a basis for uncovering the genetic variations and the improvement of fatty acid composition in B. napus.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 31442274      PMCID: PMC6707581          DOI: 10.1371/journal.pone.0221578

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Canola (rapeseed; Brassica napus, AACC genome, 2n = 38) is the world’s second largest oil producing crop after soybean (Glycine max L. Merrill), cultivated in temperate regions of many countries worldwide, and accounts for 14% of all edible vegetable oil production [1]. The nutritional and healthy oil qualities of canola seed are mostly determined by the fatty acid compositions [2]. Canola oil is rich in unsaturated fatty acids compared with other vegetable oils, comprised primarily of monounsaturated oleic acid and polyunsaturated linoleic and linolenic acid having a 2:1 optimal ratio [3, 4]. Of these the oleic acid (C18:1) and linoleic acid (C18:2) are considered to be healthy and nutritious. However, the three double-bonds of extracted linolenic acid (C18:3) are easily oxidized, which leads to a reduced frying thermal stability and storage time of the oil. With erucic acid originally thought to lead to health problems and difficult to digest in humans and livestock due to its long-chain, modern canola-type rapeseed was selected based on ‘double-low’ seed erucic acid and glucosinolate content [5], Reducing erucic acid and linolenic acid has continued to be an important target for canola/rapeseed production [6]. The genetic basis of seed fatty acid biosynthesis and modification pathways have been well characterised in Arabidopsis thaliana [7-9]. Barker et al (2007) established desaturation and elongation pathways well, along with substrate: product relationships assigned to specific enzymes [10]. To better understand the genetic control of seed fatty composition and biosynthesis in rapeseed, in the last few decades, many underlying QTLs of seed quality traits have been detected in bi-parental segregating populations. These QTL have included traits such as oil content [11-14], protein content [15, 16], glucosinolate content [17, 18], and fatty acid composition [19-21]. In a number of cases, some candidate genes have been identified that coincide with the position of QTL. For example, four loci for B. napus orthologues of FAD2 (BnaFAD2 loci) were mapped to A1, A5, C1 and C5 chromosomes [22], Schierholt et al. (2000) also mapped a locus linked to BnaFAD2 on A05 [23], and Hu et al. (2006) identify a major locus for high oleic acid (C18:1) on A5 chromosome [3], which was proven to be the Fatty acid desaturase-2 (FAD2) gene. In addition, two important FAE1 loci on chromosome A08 and C03 were described by Qiu et al. (2006) [11]. In recent years, genome wide association study (GWAS), also known as association mapping based on linkage disequilibrium, have aimed to identify genetic variants linked to traits. GWAS uncover QTLs or genes from natural populations, and have the advantage of higher resolution and greater cost-effectiveness relative due to screening a larger effective number of recombination events than are accessible in moderate size bi-parental segregating populations. GWAS has successfully been demonstrated to be a powerful tool for dissecting complex traits for crop improvement programs, with the availability of numerous SNPs it has been applied to Zea mays, Triticum aestivum and Oryza sativa [24]. In recent years, many studies have used the Illumina Infinium Brassica 60K SNP array and Dart-seq genotyping approaches to carry out GWAS to detect genetic variation for flowering time, as well as seed quality traits in rapeseed [25-33]. The number of robust and well-distributed SNP markers across genome is significant for the efficiency of GWAS in wider germplasm sets. Development of next generation sequencing (NGS) allows identification of a large number of genetic makers for associating with traits of interest based on linkage disequilibrium quickly and efficiently [34]. To identify novel loci and candidate genes that may contribute to variation in fatty acid composition, we carried out an extensive GWAS based on 201,817 SNPs previously developed by SLAF-seq (specific length amplified fragment sequencing) [35,36] and a collection of 300 inbred rapeseed lines. Interactions between the traits were investigated and significantly associated SNP loci were explored along with candidate genes. Favourable allelic variants contributing to an optimal fatty acid composition were identified. This study provides useful information for a more comprehensive understanding of the genetic variation and metabolism mechanism of important fatty acid composition.

Materials and methods

Plant materials, growth conditions and field trials

A world-wide collection of 300 diverse rapeseed inbred lines (S4 generation or greater) (S1 Table) was assembled. The provenance and meta-data and genetic relationships for all accessions have been described previously [35,36]. The association population was planted and harvested in the crop field of Jiangxi Agricultural University (JXAU, 115.84E, 28.77N) and Jiangxi Institute of Red Soil (JXIRS, 116.27E, 28.37N) with two biological replications per experimental site in 2014–2015. All seeds were sown on September 29th 2014 simultaneously in both places. Each variety was planted in a plot with three rows (40 cm line width and 20 cm plant distance), and each row had 12 plants (final seeding time was at the 5–7 leaf phase). Field experiments were arranged and laid out in a randomized complete block design at all sites. All rapeseed inbred lines grown in both environments was cultivated under uniform agronomic practices. Ripe seeds from six plants each accession were harvested and used for seed quality trait analysis after harvests.

Fatty acid compositions evaluation and statistical analysis

The harvested seeds (2g each accession) of 300 rapeseed lines were analysed for estimates of the four fatty acids (erucic acid, oleic acid, linoleic acid and linolenic acid) (%) using DA7200 near infrared spectroscopy (NIRS) (DA 7200, Perten Instruments, Huddinge, Sweden), and the data of four fatty acid content from NIRS were adjusted by the results of gas chromatograph. The value of each fatty acid was expressed as a percentage of the total amount of fatty acids identified. The four fatty acid composition traits of each accession were defined as the mean of the two replicates in the same location. The correlation coefficients between each pair of traits were determined using Student’s t-test, and the variance and statistical analysis of four components were obtained using DPS software [37].

SNP genotyping and population structure analysis

Total genomic DNA was extracted from young leave of each rapeseed accession using a modified cetyltrimethylammonium bromide (CTAB) method based on Murray & Thompson (1980) [38], the DNA concentrations and purities of all samples were calculated by a Nanodrop 2000 UV-Vis spectrophotometer (NanoDrop, Wilmington, DE, USA). Quantified DNA samples were used for SLAF sequencing by an Illumina HiseqTM 2500 [39]. The library construction, paired-end sequencing and SNP calling were conducted as previously described [35,36], a total of 201,817 SNPs with minor allele frequency (MAF) > 0.05 and integrity > 0.8 were selected and used for subsequent analysis, population structure (Q matrix) and relative kinship (K matrix) were analysed by using the Admixture software package [40] and SPAGeDi software [41], respectively, as previously described in our previous research [35,36].

Genome wide association analysis

Based on the 201,817 SNP markers developed for the 300 rapeseed accessions, genome-wide association analysis for the four fatty acid traits was carried out by mixed linear models (MLM) using the Tassel 5.0 software [42]. Fixed effects and random effects in the MLM model were assessed by a Q and K matrix, respectively. The Manhattan plot and Quantile-Quantile plot (Q-Q plot) was drawn by QQman [43] and GGplot2 software [44]. The ideal threshold value was set as 1/201,817 SNPs (-log10(p) = 5.3) for identifying the marker-trait associations. Finally, to ensure the accuracy of significant SNPs associated with traits, we removed the unique SNPs associated with a trait in the range of LD decay, others were considered as valid associated-trait SNPs.

Discovery of useful allelic variation for fatty acid composition

The epistatic effect of linked candidate SNPs was evaluated using the epistatic association mapping (EAM) method [45]. When the effect value of SNP is positive, it was taken as increasing effect allele for trait value, conversely, when the effect value is negative as a decreasing effect allele. SNPs with positive allelic effect values highly associated with oleic acid content and linoleic acid content were analysed, and the SNPs with negative allelic effect value related to erucic acid content and linolenic acid content were counted. Furthermore, the number of varieties with favourable alleles for fatty acid composition were counted.

Prediction of candidate genes for four fatty acid composition

Candidate genes located within the 100 Kb region upstream or downstream of significant associated-trait SNPs were identified based on GO terms (fatty acid biosynthetic process; very long-chain fatty acid metabolic process; fatty acid elongation; fatty acid metabolic; acetyl-CoA metabolic process; fatty-acyl-CoA reductase activity; phosphatidylinositol transporter activity etc.) for fatty acid synthesis, desaturation, elongation and metabolism. Then the identified candidate genes related to fatty acid composition were further confirmed by BLASTX searching against the Arabidopsis protein database.

Candidate gene expression analysis by qRT-PCR

Seven rape varieties from this studied population with significantly different fatty acid contents were selected to carry out the expression analysis of candidate genes (S2 Table), the total RNA from their frozen root, stem, leaf, flower, and seed were extracted using Eastep® Super total RNA extraction Kit (Promega, Beijing, China) according to the manufacturer’s instructions. RNA quality and quantity were checked using NanoDrop 2000 spectrophotometer (Thermo Fisher Scientific, Wilmington, DE) and RNA integrity was determined by agarose gel electrophoresis. Three candidate genes (ACP5, FAD2, KCS17) that are significant for fatty acid metabolism were selected to validate the GWAS results by qPCR. The primers for PCR amplification of the three candidate genes were designed with Primer3Plus [46] under strict standards and are provided in supplement file (S3 Table). A total of 1 μg of total RNA was reverse-transcribed to complementary DNA (cDNA) using the PrimeScriptTM RT reagent Kit with gDNA Eraser (Perfect Real Time) (TaKaRa, Japan). according to the manufacturer’s instructions. A 20 ul reaction was prepared with 10 ul of SYBR Green Master mix for real time quantitative PCR (Takara, Japan), 1 ul of each primer pair and 1 ul of cDNA templates, 7 ul ddH2O was added to the final reaction volume of 20 ul. The PCR amplification of the target genes was performed on a CFX96 Real-Time PCR system (Biorad, Hercules, CA) with the program as follows: 1 cycle of 95°C for 20s; and 40 cycles of 95°C for 15s, 60°C for 30s and 72°C for 30s; a final melt curve analysis in which the temperature was increased from 55°C to 95°C at a rate of 0.5°C/5s; and a maintenance at 4°C. In every sample, β-actin was taken as the house-keeping gene, at least 3 technical replicates were performed for each experiment. The relative quantification of gene expression was calculated using the 2-ΔΔCT method.

Results

Phenotypic variation and correlation analysis for four fatty acid compositions in 300 rapeseed accessions

Continuous and extensive phenotypic variations for each of the four fatty acid composition traits were observed in both environments (JXAU and JXIRS). Linolenic acid content was normally distributed, whilst the contents of erucic acid, oleic acid and linoleic acid had multimodal distributions across the 300 accessions (Fig 1). Descriptive statistical analysis was summarized in Table 1 for oleic acid [C18:1], linoleic acid [C18:2], linolenic acid [C18:3] and erucic acid [C22:1] of 300 accessions under two JXAU and JXIRS environments. The average erucic acid content was 22.23% and 22.72%, ranging from 0 to 56.38% with each CV of 78.92% and 81.90% in JXAU and JXRIS respectively, which was the largest variation of all results; the average content of oleic acid was 48.97% (JXAU) and 49.37% (JXIRS), ranging from10.01% from 81.29% severally; the average linoleic acid content was 15.51% (JXAU) and 14.66% (JXIRS), ranging from 4.35% to 22.64% with the CV of 19.17% and 20.47%, and finally the average content of linolenic acid was 6.89% and 6.67% in two places, ranging from 4.63% to 9.82% with low CV of 9.97% and 11.48%. These data indicated a broad phenotypic variability in four fatty acid compositions within the studied rapeseed population.
Fig 1

Frequency distribution of four fatty acid content traits of B. napus.

Table 1

Statistical analysis of four fatty acids of rapeseed.

TraitEnvironmentMean±SE (%)Range (%)Coefficient of variation CV (%)Shapiro wilk
Erucic acidJXAU22.23±1.010.03–54.7178.92W = 0.887946 P = 0.000000
JXIRS22.72±1.070–56.3881.90W = 0.876088 P = 0.000000
Oleic acidJXAU48.97±0.9511.39–81.2933.63W = 0.972434 P = 0.000016
JXIRS49.37±1.1010.01–79.9038.48W = 0.945312 P = 0.000000
Linoleic acidJXAU15.51±0.175.21–22.6419.17W = 0.91188 P = 0.000001
JXIRS14.66±0.174.35–22.3520.47W = 0.979837 P = 0.000311
Linolenic acidJXAU6.89±0.044.85–9.829.9 7W = 0.988571 P = 0.018354
JXIRS6.67±0.044.63–9.1411.48W = 0.996549 P = 0.764799
There was strong evidence based on ANOVA that traits varied significantly across the 300 genotypes, and had very significant difference under the interaction between genotype and environment (G×E) (P<0.01; Table 2). However, except for oleic acid content, there were no significant effects in other three fatty acids between the two environments. For the correlations between four traits, oleic acid content in both environments had a highly significant negative correlation with erucic acid and linolenic acid content (Table 3). Here, their phenotypic correlation coefficients of -0.8376** and -0.5862** in JXAU and -0.7942** and -0.2775** in JXIRS respectively (P<0.01) were observed, but which has a significant positive correlation with linoleic acid content with phenotypic correlation coefficients of 0.4215** in JXAU and 0.5824** in JXIRS. Moreover, linoleic acid content had a highly significant positive correlation with linolenic acid content, with phenotypic correlation coefficients of 0.6879** (JXAU) and 0.5748** (JXIRS).
Table 2

Variance analysis of seed fatty acid composition of rapeseed in two environments.

Source of variationDFErucic acid(%)Oleic acid(%)Linoleic acid(%)Linolenic acid(%)
Block139.669268.828970.1079**14.5636**
Environment (E)125.0014123.3815*1.65390.863
Genotype (G)299332.3849**281.0727**13.891**1.1998**
G×E299286.9632**274.9371**11.7758**0.9602**
Error59948.3631.92872.22980.3323

Note

*,** present significant at 5% and 1% probability levels respectively.

Table 3

The correlation analyses four fatty acid composition of B.napus in JXAU/JXIRS.

Correlation coefficientErucic acid (%)Oleic acid (%)Linoleic acid (%)Linolenic acid (%)
Erucic acid (%)1-0.8376**/-0.7942**-0.6653**/-0.6449**0.0218**/0.1321
Oleic acid (%)10.4215**/0.5824**-0.5862**/-0.2775**
Linoleic acid (%)10.6879**/0.5748**
Linolenic acid (%)1

Note

*,** present significant at 5% and 1% probability levels respectively.

Note *,** present significant at 5% and 1% probability levels respectively. Note *,** present significant at 5% and 1% probability levels respectively.

Genome-wide association analysis for the four fatty acid content traits in the 300 rapeseed accessions

To reveal the genetic variations of four fatty acid compositions in B. napus, GWAS for these traits based on MLM models was conducted, The predictive QQ plots show expected distribution agrees of p-values have a high consistency with the observations (Fig 2), and the significantly associated SNPs per traits were displayed on Manhattan plots (Fig 3). The total results of the significant SNP loci associated with the four fatty acids combined under two environments are given in Table 4. GWAS identified 148 SNPs significantly associated with four fatty acid content traits on 8 chromosomes. However, the majority of trait-linked SNPs were mainly distributed on A08 and C03 chromosomes, with 82 and 48 significant SNP on chromosome A08 and C03, respectively (S4–S7 Tables), and all trait-linked closely SNPs explained 6.09% ~16.55% of observed phenotypic variation.
Fig 2

Quantile-quantile plots of estimated-lg (P) from association analysis of four fatty acid content traits using MLM model in two environments (JXAU and JXIRS).

Fig 3

Manhattan of GWAS for four fatty acid content traits of B. napus by MLM in two environments (JXAU and JXIRS).

Table 4

Summary of SNP loci associated with four fatty acid composition in B. napus.

TraitEnvironmentsChromosomeP valueR (%)No. of SNPs
Erucic acidJXAUA084.09E-06–6.33E-077.06–10.5724
Erucic acidJXIRSA082.93E-06–1.79E-077.03–9.419
Erucic acidJXAUA093.60E-067.241
Erucic acidJXIRSA094.53E-067.231
Erucic acidJXAUC034.93E-06–5.92E-086.68–9.2029
Erucic acidJXIRSC034.41E-06–1.81E-076.98–8.8210
Oleic acidJXAUA084.88E-06–3.10E-077.26–11.3530
Oleic acidJXIRSA084.88E-06–2.26E-077.40–9.9018
Oleic acidJXIRSA102.21E-068.781
Oleic acidJXAUC034.90E-06–1.84E-077.04–8.9225
Oleic acidJXIRSC034.11E-06–1.06E-077.25–9.3415
Oleic acidJXAUC073.05E-07–2.55E-076.09–6.402
Oleic acidJXIRSC074.23E-06–4.11E-076.15–6.192
Oleic acidJXIRSC082.57E-0610.021
Linoleic acidJXAUA061.16E-06–9.42E-078.76–15.152
Linoleic acidJXAUA084.45E-06–3.883E-97.89–16.5541
Linoleic acidJXIRSA084.76E-06–7.93E-97.48–16.2532
Linoleic acidJXIRSA091.70E-0816.191
Linoleic acidJXAUA103.15E-068.361
Linoleic acidJXAUC034.97E-06–8.45E-087.59–16.3936
Linoleic acidJXIRSC033.62E-06–6.82E-077.43–9.644
Linoleic acidJXIRSC073.59E-067.541
Linolenic acidJXAUA101.49E-0712.751
Linolenic acidJXAUC021.75E-0612.701
A total of 78 SNPs of which associated with erucic acid content significantly were detected in GWAS, 12 (15%) were both detected in two environments (S4 Table). In addition, 57 in JXAU and 37 in JXIRS with 22 same SNP loci on 5 chromosomes for oleic acid were detected (S5 Table). Furthermore, we identified 118 SNPs for linoleic acid content trait (80 in JXAU and 38 in JXIRS with 21 uniform SNP loci (S6 Table), but only 2 significant SNP loci in JXAU were found (S7 Table). In addition, 51 peak SNPs (30 of erucic acid, 34 of oleic acid, 37 of linoleic acid and 12 of linolenic acid) were detected on 8 chromosomes (Table 5).
Table 5

Peak SNPs associated with four fatty acid contents in seed of B. napus.

ChromosomePositionP valueR2Erucic acidOleic acidLinoleic acidLinolenic acid
A06117117505.47E-080.13304
A0852332352.72E-070.04969
A0877742533.60E-060.04354
A0880257603.91E-070.06056
A0882684433.02E-070.06263
A0883478084.08E-070.05391
A0883901894.08E-080.09987
A0883952971.73E-080.07549
A0885716991.70E-070.05227
A0891712592.51E-060.04949
A0893123821.75E-070.06197
A0896396953.88E-070.09779
A08101467702.02E-090.09783
A08101800375.94E-080.09362
A08102330499.15E-070.07426
A08103375768.26E-080.07682
A08104067252.62E-080.11265
A08104337642.58E-100.09296
A08104420112.48E-080.10185
A08104612921.25E-070.09281
A08104718051.30E-110.14511
A08104720121.03E-110.13556
A08104815322.58E-100.12307
A08104959711.12E-100.10555
A08105152632.19E-110.10883
A08105828119.13E-120.11441
A08108890003.95E-080.09829
A08109583114.07E-090.093
A08111369864.91E-080.06772
A08111585519.94E-080.05835
A0917524792.05E-100.13748
A0925391853.50E-070.05646
A10145916503.37E-090.13573
C02300414011.19E-070.13582
C03543054742.39E-090.06921
C03543205316.05E-090.068
C03554421221.24E-070.07011
C03555229997.05E-080.073
C03555666452.19E-080.07326
C03556976025.89E-090.06611
C03557121025.81E-070.06493
C03557384834.18E-100.06887
C03557410153.81E-080.0739
C03558519091.48E-060.06278
C03559176567.45E-080.13888
C03559363147.56E-080.08354
C03566905896.61E-080.09359
C0762714463.74E-070.04589
C07184034164.73E-070.04454
C0848453502.88E-070.0493
C08144725953.58E-080.08004

Note

"√" indicates the corresponding trait that the significant associated SNP locus.

Note "√" indicates the corresponding trait that the significant associated SNP locus. For the SNP locus analysis, there were some SNP loci significantly associated with several traits simultaneously. For example, a total of 30 SNP loci on A08, C03 and C06 chromosomes were detected in three traits of erucic acid content, oleic acid content and linoleic acid content. Apart from what is outlined above, 16 SNP loci linked to erucic acid content and oleic acid content, 25 loci for oleic acid content and linoleic acid content, and 7 SNPs for erucic acid content and linoleic acid content concurrently. These consistent SNPs detected simultaneously in different traits indicated that they might control the different fatty acid composition traits synchronously. What’s more, we found two remarkable associated regions on chromosome A08 and C03 were consistent with the QTL results based on bi-parental mapping previously [19, 31, 47]. To identify highly favourable alleles with for fatty acid composition in B. napus, we investigated the allelic effects of all the significantly associated SNPs. We observed 108 highly favourable alleles for increasing oleic acid and linoleic acid content, also for decreasing erucic acid content simultaneously (S8 Table). Specially, we found 66 highly favourable alleles for reducing the content of erucic acid and linolenic acid (S9 Table), 82 and 105 highly favourable alleles contributed to increase oleic acid and linoleic acid content respectively (S10 Table, S11 Table). Discovery of SNP loci and those favourable alleles in current study will provide insight into the genetic basis of four fatty acid biosynthesis in rapeseed.

Candidate genes identification of four fatty acids in B. napus

To further reveal the molecular function of the SNPs significantly associated with the four traits, we extracted the genes within the 100 Kb upstream or downstream regions of the trait-associated SNPs in the reference genome of B. napus “Darmor v4.1”. We found 802 genes were identified located in the candidate regions around 52 SNPs (S12 Table), 29 of these candidate genes were enriched into fatty acid biosynthetic process, fatty acid metabolic process and Lipid transport and metabolism in the GO terms annotation analysis (S13 Table). 20 fatty acid composition candidate genes were homologous to A. thaliana genes involved in metabolic networks of fatty acids. These candidate genes were located in four chromosomes (A08, A09, A10 and C03), 16 of which were distributed on the A subgenome, the chromosome A08 have the most genes (12), and chromosome A09 and A10 have 2 candidate genes, respectively, chromosome C03 have other 4 fatty acid candidate genes (Table 6), which suggested the A subgenome pay a more important role than C subgenome in the metabolic networks of fatty acids of rapeseed.
Table 6

Candidate genes tagged by the associated SNPs with fatty acid biosynthesis and metabolism in B. napus and their orthologs in A. thaliana.

GeneChr.GenestartGeneendSNP locationDistance (Kb)Arabidopsis genesLD interval (bp) (R2>0.6)AliasDescription
BnaA08g08120DA0880069128010592802576018.848AT4G209306-phosphogluconate dehydrogenase family protein
BnaA08g08850DA0885888278591995857165317.174AT4G185507205628–8404857DSELDermatan sulfate epimerase-like
BnaA08g09510DA0890870209088618917125984.239At4g208308404821–8659578FAD-binding Berberine family protein
BnaA08g09990DA0893724429374898931239860.044AT4G209308404821–92379806-phosphogluconate dehydrogenase family protein
BnaA08g11130DA0810187701101892211014677040.931AT4G345209296073–9461455FAE1Fatty acid elongase 1
BnaA08g11140DA0810193789101952131014677047.019AT4G3451010146770–10233049KCS173-ketoacyl-CoA synthase 17
BnaA08g11440DA0810385625103918181043376441.946AT4G3379010146770–10233049CER4Eceriferum 4
BnaA08g11650DA0810512857105159561043376479.093AT4G3403010337911–10495999MCCB3-methylcrotonyl-CoA carboxylase
BnaA08g11810DA0810599612106006251050774391.869AT4G23335510337911–10495999Bifunctional inhibitor/lipid-transfer protein/seed storage 2S albumin superfamily protein
BnaA08g11640DA0810507282105087231058281175.529AT4G3405010507743–10625538CCOAOMT1Caffeoyl-CoA O-methyltransferase 1
BnaA08g12350DA0810994361109974551095831136.05AT4G3181010496004–10641083Enoyl-CoA hydratase/isomerase family protein
BnaA08g12370DA0811043381110462271095831185.07AT4G3175010958311–11136986WIN2HOPW1-1-interacting 2 (WIN2)
BnaA09g03610DA0918235131824394175247971.034AT5G2720010958311–11136986ACP5Acyl carrier protein 5
BnaA09g05410DA0926422922646291255401888.274AT5G232601733014–1984004TT16Transparent testa 16
BnaA10g20970DA1014540117145406801459165050.970AT1G775902539185–2943582LACS9Long chain acyl-CoA synthetase 9
BnaA10g21150DA1014626208146277911459165034.558AT5G6703014516759–14778935ZEPZeaxanthin epoxidase
BnaC03g03500DC0317100521711864166487545.177AT3G1212014516759–14778935FAD2Fatty desaturase 2
BnaC03g65730DC0355426564554280535544208915.525AT5G139301557947–1756319ATCHSChalcone synthase 
BnaC03g65980DC0355684352556857795561898565.367AT4G3452054559600–55522974FAE1Fatty acid elongase 1
BnaC03g66040DC0355810262558116865571210298.160AT4G3451055256311–55847797KCS173-ketoacyl-CoA synthase 17
Some genes tagged by associated SNPs in our study represent known fatty acid composition genes, and were enriched in biological processes of FAs biosynthesis. For erucic acid, two Fatty acid elongation 1 (FAE1) orthologous genes (BnaA08g11130D and BnaC03g65980D) were found in the distance of 40.93 Kb from SNP Bn-A08-10146770 and 65.37 Kb form Bn-C03-55618985 respectively, which were well-known Bna.FAE1 homologs controlling erucic acid content found in previous studies [25, 31, 48–50]. Moreover, another two candidate genes of BnaA08g11140D and BnaC03g66040D orthologous to 3-ketoacyl-CoA synthase 17 (KSC17) in the distance of 47.02 Kb from SNP Bn-A08-10146770 and 98.16 Kb form Bn-C03- 55712102 were detected respectively, which is also a member of the 3-ketoacyl-CoA synthase family involved in the biosynthesis of very long chain fatty acids [51]. For the synthesis of unsaturated fatty acids, which are affected by the fatty acid desaturase activity in plant, the fatty acid desaturase genes have been proved as the major genes for the control of oleic acid content as same as the ratio of polyunsaturated fatty acid, we found the important candidate genes of BnaC03g03500D orthologous to Fatty acid desaturase 2 (FAD2) in the distance of 45.18 from Bn-C03-1664875, which is the major enzyme responsible for the synthesis of 18:2 fatty acids in plant endoplasmic reticulum [3], and FAD2 gene had been mapped in B. napus on chromosomes A1, A5, C1, and C5 in previous studies [4, 52]. Furthermore, many other important candidate genes for fatty acid biosynthesis were detected, which were orthologous to A. thaliana, such as Dermatan sulfate epimerase-like (DSEL), Acyl carrier protein 5 (ACP5), Long chain acyl-CoA synthetase 9 (LACS9) and so on, these candidate genes except for BnaA.FAE1 (BnaA08g11130D) and BnaC. FAE1 (BnaC03g65980D) have not been verified in previous genetic analyses, which are likely play important roles in fatty acid synthesis and transport. Therefore, these candidate genes detected in this study should be certified by further analysis in the future.

The expression of three candidate genes in diverse rapeseed accessions and tissues

In order to validate the candidate genes significantly associated fat acids compositions, we selected three key genes involved in fat acid synthesis and measured their gene expression in five different tissues (root, stem, leaf, flower and seed) of seven diverse rapeseed accessions using qRT-PCR. We found the three candidate genes have high expression in seeds or flowers of these selected rapeseed (S1 Fig), and the expression of ACP5 have a strong positive correlation with erucic acid (0.68), have strong negative correlation with oleic acid (-0.67) and linoleic acid (-0.71), suggested the high expression of ACP5 could promote the erucic acid synthesis, but decrease the oleic acid and linoleic acid. FAD2 and KCS17 have positive correlation with oleic acid and linoleic acid, indicated they could increase the two fat acids in the studied population (Table 7).
Table 7

The correlation analyses between the expression of three candidate genes and four fatty acid composition of B.napus in JXAU.

Correlation coefficient (R)ACP5FAD2KCS17
Erucic acid0.68-0.15-0.46
Oleic acid-0.670.290.50
Linoleic acid-0.710.230.51
Linolenic acid-0.390.16-0.22

Discussion

Fatty acid compositions in seed of rapeseed play important roles in improving the edible oil nutritional and storage quality. In this study, we investigated the four important fatty acid content traits of erucic, oleic, linoleic and linolenic for 300 rapeseed accessions, the four traits exhibited large variation in two different environments, and there proved to be have the weak or no genetic relationship among 300 accessions in our previous study [35,36], validating the suitability of GWAS for these traits in this study population [53]. In addition, there are high correlations among these fatty acid traits, significant negative correlation between the content of erucic acid and oleic acid was identified, and linoleic acid content trait had high positive correlation with other three traits, they were co-localized in a small region on chromosome A08 and C03 chromosomes, the strong correlation between these traits was also observed in previous studies [19, 21, 45, 54]. Furthermore, in current study, although the materials were grown in two near environments with near locations, but there were very distinct differences in these fatty acid compositions under the interaction between genotype and environment, we speculate the acid soil with low pH value (4.67) in JXIRS was the main reason resulting in very distinct differences of four fatty acids content [55]. Fatty acid composition are typical quantitative traits, based on the genetic markers (SNP, SSR, RFLP and AFLP), bi-parent population and statistical methods, many QTLs for fatty acid composition distributed on most of chromosomes of B. napus were detected in the past few decades [3, 4, 19–21, 25, 30]. Recent efforts have also been made in detecting the genetic loci responsible for fatty acid composition in B. napus based on the Illumina Infinium Brassica 60K SNP array, some genome wide analysis were carried out to identified genomic region and candidate genes associated with fatty acid content in the past few years [26, 27, 31]. In this study, using 201,187 genome-wide SNP markers developed by SLAF-seq technology [35,36], we carried out the GWAS for four fatty acid traits in 300 rapeseed inbred lines, and hundreds of SNP loci highly associated with the four fatty acid compositions traits were identified on 8 chromosomes of B. napus, many of these SNPs were simultaneously detected in two environments, most of significantly associated SNPs (more than 80%) for the content of erucic acid, oleic acid and linoleic acid were identified on A08 and C03 chromosomes, which was in accordance with above QTL and GWAS studies for fatty acid composition, from this we can infer that the genomic region controlling the fatty acid biosynthesis were mainly distributed on A08 and C03 chromosomes. In addition, we also found a few trait-associated SNPs on other chromosomes, some identical chromosome regions were exist in Qu et al. (2017) [31], for example, the significant SNPs associated with olenic acid on A09 chromosomes and the candidate gene of BnaA09g05410D (orthologous to Transparent testa 16 TT16) was identified both in two studies, but other related gene groups found in A02 and C01 linkages of Qu et al. (2017) were not detected in current study[31], which probably attribute to the different population used in different studies. Under the reference-guided analysis, As the physical position and alleles of SNP markers used in this study are known, therefore, breeders and researchers could easily obtain valuable information for other related rapeseed research based on our results. In addition, by evaluating the allelic effects of trait-associated SNPs, we observed many favourable alleles for decreasing the erucic acid content, which had positive effects for increasing the oleic and linoleic acid content in the same time, these SNPs could be used for low-erucic and high-oleic acid rapeseed breeding in the future. The genetic basis of seed fatty acid biosynthesis and modification pathways have been well characterised in Arabidopsis thaliana [9]. Well established desaturation and elongation pathways, along with substrate: product relationships assigned to specific enzymes [10], which was a co-operation of many genes of seed development, energy metabolism, fatty acid and triacylglycerol (TAG) biosynthesis pathways [9, 56]. De novo synthesis of fatty acids occur in the seed plastid, where the palmtic acid (16:0), stearic acid (18:0) and oleic acid (18:1) are formed and then released to the endoplasmic reticulum (ER) by two kinds of acyl-ACP thioesterase enzymes (FATA and FATB with high affinity to 18:1-ACP and 16:0-ACP respectively) [57, 58]. These fatty acids were modified by desaturation enzymes (FAD2 and FAD3) to produce the polyunsaturated linoleic (18:2) and linolenic acid (18:3) [52, 59], the gene expression analysis in this study supported further that the FAD2 gene would promote the linolenic acid synthesis. In addition, the fatty acids could be elongated to erucic acid by FAE1 gene or esterified to glycerol to produce the TAG stored as a major seed oil form in plants [60]. In current study, a total of 14 candidate genes were detected on A08 chromosomes, including DSEL, FAE1, MCCB, CCOAOMT1 and WIN2 etc. In addition, there were 4 fatty acid biosynthesis genes were found on C03 chromosomes, including FAD2, ATCHS, FAE1 and KCS17 etc., Specially, two known homoeologous genes (namely BnaA.FAE1 and BnaC.FAE1) controlling erucic acid content were identified in our study [25, 31, 50, 61], and candidate genes BnaA08g11140D and BnaC03g66040D orthologous to KCS17 of A. thaliana involving in the biosynthesis of very long chain fatty acids were also found on chromosomes A8 and C3. Moreover, the gene of BnaA08g11440D near the SNP locus of Bn-A08-10317341 (68.284 Kb) is orthologous to A. thaliana CER4 encoding an Alcohol-Forming Fatty Acyl-Coenzyme A Reductase involved in the synthesis of very long chain fatty acids [62]. These results obviously indicate that there are genomic regions controlling the seed fatty acid biosynthesis on A08 and C03 chromosomes. By the way, some important fatty acid candidate genes were also scanned on other chromosomes. On A09 chromosome, the candidate gene of BnaA09g03610D orthologous to Acyl carrier protein 5 (ACP5) was located in the distance of 71.03 Kb from SNP Bn-A09-1752479 on chromosome A09, which is a small acidic proteins functioning as important cofactors in the de novo synthesis of fatty acids, overexpression of AtACP5 further led to an a decrease of oleic acid (C18:1) and an increase of palmitic acid (C16:0) [63], the gene expression of ACP5 in this study have strong negative relationship with oleic acid, suggested it seem to has the same gene function as AtACP5. In addition, On A10 chromosome, three candidate genes were ascribed to fatty acid, BnaA10g20970D orthologous to A. thaliana Long chain acyl-CoA synthetase 9 (LACS9) encoding major plastid long chain acyl-CoA synthetase with a slight substrate preference of oleic acid over any of the other fatty acids. Protein-tyrosine phosphatase-like (PTPLA) (BnaA10g21780D) with acyl-CoA dehydratase activity has a potential role in synthesis of VLCFAs (very long chain fatty acids) under the interaction with Eceriferum 10 (CER10), which is a component of the microsomal fatty acid elongase complex. On C06 chromosome, FatA acyl-ACP thioesterase (FATA) affects the oil content and fatty acid composition of the seeds in Arabidopsis when reducing expression of FatA thioesterases [64]. Our GWAS analysis led to the identification of promising candidate genes for fatty acid biosynthesis and metabolism efficiently.

Conclusion

In this study, based on GWAS with MLM model analysis, significant association signals for the content of erucic acid, oleic acid, linoleic acid and linolenic acid in seeds of B. napus were found on A06, A08, A09, A10, C02, C03, CO6, C07 and C08 chromosomes in two environments. The genomic regions controlling the fatty acid biosynthesis were inferred to distribute mainly on A08 and C03 chromosomes. 20 orthologs of the functional candidate genes related to fatty acid biosynthesis in a distance of 100 Kb around these significantly SNP-trait associations were identified by BLAST analysis and comparison of previous linkage mappings, including the known vital fatty acid biosynthesis genes of BnaA.FAE1 and BnaC. FAE1 on the A08 and C03 chromosomes, and other potential candidate genes involving in the fatty acid biosynthesis pathway, such as the orthologs genes of FAD2, LACS09, KCS17, CER4, TT16 and ACBP5. This study lays a foundation for uncovering the genetic variations and the improvement of fatty acid composition in B. napus. The gene expression of ACP5 (A), FAD2 (B), KCS17 (C) in diverse rapeseed accessions and tissues. (DOCX) Click here for additional data file.

Material information of 300 inbred lines of B. napus.

(XLS) Click here for additional data file.

The fatty acid content of selected seven inbred lines of B. napus.

(XLS) Click here for additional data file.

Excel file containing the primer sequences used for real-time PCR.

(XLSX) Click here for additional data file.

SNPs significantly associated with erucic acid content in B. napus by MLM.

(XLSX) Click here for additional data file.

SNPs significantly associated with oleic acid content in B. napus by MLM.

(XLSX) Click here for additional data file.

SNPs significantly associated with linoleic acid content in B. napus by MLM.

(XLSX) Click here for additional data file.

SNPs significantly associated with linolenic acid content in B. napus by MLM.

(XLSX) Click here for additional data file.

SNP loci with positive effects for oleic acid, linoleic acid content and negative effects for erucic acid content in B. napus.

(XLSX) Click here for additional data file.

SNP loci with negative effects for erucic acid content in B. napus.

(XLSX) Click here for additional data file.

SNP loci with positive effects for oleic acid content in B. napus.

(XLSX) Click here for additional data file.

SNP loci with positive effects for linoleic acid content in B. napus.

(XLSX) Click here for additional data file.

Genes closely linked with trait-associated SNPs in B. napus.

(XLSX) Click here for additional data file.

Candidate genes for fatty acid biosythesis and metablism closely linked with SNPs in B. napus based on GO annotation.

(XLSX) Click here for additional data file.
  47 in total

1.  A comparative linkage map of oilseed rape and its use for QTL analysis of seed oil and erucic acid content.

Authors:  D Qiu; C Morgan; J Shi; Y Long; J Liu; R Li; X Zhuang; Y Wang; X Tan; E Dietrich; T Weihmann; C Everett; S Vanstraelen; P Beckett; F Fraser; M Trick; S Barnes; J Wilmer; R Schmidt; J Li; D Li; J Meng; I Bancroft
Journal:  Theor Appl Genet       Date:  2006-10-11       Impact factor: 5.699

Review 2.  Metabolic engineering of fatty acid biosynthesis in plants.

Authors:  Jay J Thelen; John B Ohlrogge
Journal:  Metab Eng       Date:  2002-01       Impact factor: 9.783

3.  Disruption of the FATB gene in Arabidopsis demonstrates an essential role of saturated fatty acids in plant growth.

Authors:  Gustavo Bonaventure; Joaquin J Salas; Michael R Pollard; John B Ohlrogge
Journal:  Plant Cell       Date:  2003-04       Impact factor: 11.277

4.  Identification of quantitative trait loci (QTL) for oil and protein contents and their relationships with other seed quality traits in Brassica juncea.

Authors:  T Mahmood; M H Rahman; G R Stringam; F Yeh; A G Good
Journal:  Theor Appl Genet       Date:  2006-09-08       Impact factor: 5.699

5.  CER4 encodes an alcohol-forming fatty acyl-coenzyme A reductase involved in cuticular wax production in Arabidopsis.

Authors:  Owen Rowland; Huanquan Zheng; Shelley R Hepworth; Patricia Lam; Reinhard Jetter; Ljerka Kunst
Journal:  Plant Physiol       Date:  2006-09-15       Impact factor: 8.340

6.  Mapping of the loci controlling oleic and linolenic acid contents and development of fad2 and fad3 allele-specific markers in canola (Brassica napus L.).

Authors:  Xueyi Hu; Mandy Sullivan-Gilbert; Manju Gupta; Steven A Thompson
Journal:  Theor Appl Genet       Date:  2006-06-10       Impact factor: 5.699

7.  Introgression of genomic components from Chinese Brassica rapa contributes to widening the genetic diversity in rapeseed (B. napus L.), with emphasis on the evolution of Chinese rapeseed.

Authors:  W Qian; J Meng; M Li; M Frauen; O Sass; J Noack; C Jung
Journal:  Theor Appl Genet       Date:  2006-04-08       Impact factor: 5.699

8.  Conditional QTL mapping of oil content in rapeseed with respect to protein content and traits related to plant development and grain yield.

Authors:  Jianyi Zhao; Heiko C Becker; Dongqing Zhang; Yaofeng Zhang; Wolfgang Ecke
Journal:  Theor Appl Genet       Date:  2006-04-14       Impact factor: 5.699

9.  QTL analysis of an intervarietal set of substitution lines in Brassica napus: (i) Seed oil content and fatty acid composition.

Authors:  M J Burns; S R Barnes; J G Bowman; M H E Clarke; C P Werner; M J Kearsey
Journal:  Heredity (Edinb)       Date:  2003-01       Impact factor: 3.821

10.  Characterization of substrate specificity of plant FatA and FatB acyl-ACP thioesterases.

Authors:  Joaquín J Salas; John B Ohlrogge
Journal:  Arch Biochem Biophys       Date:  2002-07-01       Impact factor: 4.013

View more
  7 in total

1.  Genetic Characterization of Russian Rapeseed Collection and Association Mapping of Novel Loci Affecting Glucosinolate Content.

Authors:  Rim Gubaev; Lyudmila Gorlova; Stepan Boldyrev; Svetlana Goryunova; Denis Goryunov; Pavel Mazin; Alina Chernova; Elena Martynova; Yakov Demurin; Philipp Khaitovich
Journal:  Genes (Basel)       Date:  2020-08-12       Impact factor: 4.096

Review 2.  Breeding and biotechnological interventions for trait improvement: status and prospects.

Authors:  Roshan Kumar Singh; Ashish Prasad; Mehanathan Muthamilarasan; Swarup K Parida; Manoj Prasad
Journal:  Planta       Date:  2020-09-18       Impact factor: 4.116

3.  Genetic Dissection of Mature Root Characteristics by Genome-Wide Association Studies in Rapeseed (Brassica napus L.).

Authors:  Sani Ibrahim; Keqi Li; Nazir Ahmad; Lieqiong Kuang; Salisu Bello Sadau; Ze Tian; Lintao Huang; Xinfa Wang; Xiaoling Dun; Hanzhong Wang
Journal:  Plants (Basel)       Date:  2021-11-24

Review 4.  Advances and Challenges for QTL Analysis and GWAS in the Plant-Breeding of High-Yielding: A Focus on Rapeseed.

Authors:  Shahid Ullah Khan; Sumbul Saeed; Muhammad Hafeez Ullah Khan; Chuchuan Fan; Sunny Ahmar; Osvin Arriagada; Raheel Shahzad; Ferdinando Branca; Freddy Mora-Poblete
Journal:  Biomolecules       Date:  2021-10-15

5.  Construction of a Quantitative Genomic Map, Identification and Expression Analysis of Candidate Genes for Agronomic and Disease-Related Traits in Brassica napus.

Authors:  Nadia Raboanatahiry; Hongbo Chao; Jianjie He; Huaixin Li; Yongtai Yin; Maoteng Li
Journal:  Front Plant Sci       Date:  2022-03-11       Impact factor: 5.753

6.  Mapping-by-Sequencing Reveals Genomic Regions Associated with Seed Quality Parameters in Brassica napus.

Authors:  Hanna Marie Schilbert; Boas Pucker; David Ries; Prisca Viehöver; Zeljko Micic; Felix Dreyer; Katrin Beckmann; Benjamin Wittkop; Bernd Weisshaar; Daniela Holtgräwe
Journal:  Genes (Basel)       Date:  2022-06-23       Impact factor: 4.141

Review 7.  The Use of Genetic and Gene Technologies in Shaping Modern Rapeseed Cultivars (Brassica napus L.).

Authors:  Linh Bao Ton; Ting Xiang Neik; Jacqueline Batley
Journal:  Genes (Basel)       Date:  2020-09-30       Impact factor: 4.096

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.