| Literature DB >> 32859172 |
Xiaoyi Liu1,2,3, Di Qin4,5, Angela Piersanti2, Qi Zhang3, Cristina Miceli6, Piwu Wang7.
Abstract
BACKGROUND: Soybean oil is a complex mixture of five fatty acids (palmitic, stearic, oleic, linoleic, and linolenic). Soybean oil with a high oleic acid content is desirable because this monounsaturated fatty acid improves the oxidative stability of the oil. To investigate the genetic architecture of oleic acid in soybean seeds, 260 soybean germplasms from Northeast China were collected as natural populations. A genome-wide association study (GWAS) was conducted on a panel of 260 germplasm resources.Entities:
Keywords: Genome-wide association study; Oleic acid; Oleic acid-associated genes; SNPs; Soybean
Mesh:
Substances:
Year: 2020 PMID: 32859172 PMCID: PMC7456086 DOI: 10.1186/s12870-020-02607-w
Source DB: PubMed Journal: BMC Plant Biol ISSN: 1471-2229 Impact factor: 4.215
Fig. 1Frequency distribution of oleic acid content in soybean seeds in 2018 and 2019. The X-axis on the graph shows the range of content of oleic acid. The Y-axis is the number of soybean lines. The distributions of oleic acid content in 2018 (a) and in 2019 (b) resemble the bell-shaped curve for a normal distribution. (c) Statistical analysis of oleic acid content in soybean seeds from 2018 to 2019
Fig. 2The relationship between oleic acid and other fatty acids. a Association between percentages of oleic acid and linoleic acid, y = 1.06x + 76.79, is a linear equation, where y represents linoleic acid content and x represents oleic acid content. b Association between percentages of oleic acid and linolenic acid. c Association between percentages of oleic acid and palmitic acid, y = 0.3x + 1.71, is a linear equation, where y represents palmitic acid content and x represents oleic acid content. d Association between percentages of oleic acid and stearic acid
Correlations of the relative oleic acid content in soybean
| Oleic acid | Linoleic acid | Linolenic acid | Palmitic acid | Stearic acid | ||
|---|---|---|---|---|---|---|
| Oleic acid | Pearson correlation | 1 | −0.660b | 0.332a | 0.581b | −0.254b |
| Linoleic acid | Pearson correlation | −0.660b | 1 | 0.571b | 0.051b | −0.893b |
| Linolenic acid | Pearson correlation | 0.332a | 0.571b | 1 | 0.447b | −0.473b |
| Palmitic acid | Pearson correlation | 0.581b | 0.051b | 0.447b | 1 | −0.043b |
| Stearic acid | Pearson correlation | −0.254a | −0.893b | − 0.473b | −0.043b | 1 |
Note: a indicates significance at 0.05, b indicates significance at 0.01
Fig. 3Genotyping and SNP Annotation. The abscissa is the length of the chromosome. Each band represents one chromosome. The genome is divided according to the size of 1 Mb. a Distribution Map of SLAF on chromosomes. The SLAF label is concentrated in the red area of map. b Distribution Map of SNPs on different chromosome. The red color represents more than 5706 SNP markers in the region, the yellow color represents a number of SNP markers between 2852 and 3565. c Pie chart of SNPs annotation. Left panel: SNPs percentages generally associated to genes. Right panel: SNPs percentages in coding regions
Fig. 4Group Structure of 260 soybean lines a Phylogenetic tree of 260 soybean lines. b Clustering analysis when the number of subgroups is in the range 2–13, the colours represent separate groups. c Three-dimensional score plot (PC1, PC2, PC3) to discriminate between soybeans lines from three provinces of China. d Diagram showing the value of 260 samples based on clustering from 1 to 15; Cross validation error rate for each K value of 1–15, X-axis is K value 1–15, Y-axis is cross-validation errors. Aea1 represents the soybean lines from Jilin province, Area2 represents the soybean lines from Heilongjiang province, Area3 represents the soybean lines from Liaoning province
Fig. 5Genome-wide Manhattan plots of associations to oleic acid content in the 2018 analysis. In the left panel, the X-axis indicates the SNPs along each chromosome; the Y-axis is the −log 10 (P-value) for the association, the threshold value was set at –log(p) > 6.20 (red) and –log(p) > 7.20 (blue). a to e are GWAS results based on the cmlm model, EmMax model, fastlmm model, GLM, and MLM are in F to J, respectively. e GWAS result based on the MLM model. Q-Q plots for oleic acid using cmlm model (f), EmMax model (g), fastlmm model (h), GLM (I), MLM (j). The grey area represents the 95% concentration band. Each dot represents a SNP
Fig. 6Genome-wide Manhattan plots of associations for oleic acid content for 2019 analysis. In the left panel, the X-axis indicates the SNPs along each chromosome; the Y-axis is the −log 10 (P-value) for the association, the threshold value was set at –log(p) > 6.20 (red) and –log(p) > 7.20 (blue). a GWAS result based on the cmlm model, b GWAS result based on the EmMax model, c GWAS result based on the fastlmm model, d GWAS result based on the GLM model, e GWAS result based on the MLM model. Q-Q plots for oleic acid using cmlm model (f), EmMax model (g), fastlmm model (h), GLM (I), MLM (j). The grey area represents the 95% concentration band. Each dot represents a SNP
Correlation between relative gene expression and oleic acid content in soybean
| Year | Chromosome | Gene | Predicted function | Length | Contribution rate |
|---|---|---|---|---|---|
| 2018 | Chr03 | 3PREDICTED: | 687 | 0.10 | |
| 3PREDICTED: Glycine max pleiotropic drug resistance protein 1-like (LOC100791601), mRNA | 4662 | 0.07 | |||
| Chr04 | 3PREDICTED: Glycine max probable pectate lyase 18-like (LOC100814679), mRNA | 1657 | 0.32 | ||
| 2522 | 0.43 | ||||
| 3PREDICTED: Glycine max respiratory burst oxidase homolog protein C-like (LOC100800248), mRNA | 2440 | 0.08 | |||
| Chr05 | 3PREDICTED: Glycine max ATP carrier protein 2, chloroplastic-like (LOC100797684), mRNA | 1655 | 0.11 | ||
| Chr07 | 3PREDICTED: Glycine max ADP,ATP carrier protein 1, chloroplastic-like (LOC100793284), mRNA | 2317 | 0.12 | ||
| 3PREDICTED: Glycine max VIN3-like protein 1-like (LOC100780157), transcript variant X2, mRNA | 2756 | 0.10 | |||
| Chr08 | 3PREDICTED: Glycine max calcium-dependent protein kinase 3-like (LOC100777096), transcript variant 1, mRNA | 1877 | 0.16 | ||
| 3PREDICTED: Glycine max probable plastid-lipid-associated protein 4, chloroplastic-like (LOC100803367), transcript variant 1, mRNA | 979 | 0.15 | |||
| Chr11 | 3PREDICTED: Glycine max DNA replication complex BAG protein, transcript variant 2, mRNA, | 1257 | 0.47 | ||
| Chr13 | 3PREDICTED: Glycine max protein S-acyltransferase 24-like (LOC100777470), misc_RNA | 2490 | 0.32 | ||
| Chr14 | 3PREDICTED: Glycine max abscisic-aldehyde oxidase-like (LOC100812604), mRNA | 4517 | 0.22 | ||
| Chr15 | 3PREDICTED: Glycine max uncharacterized LOC102666654 (LOC102666654), mRNA | 693 | 0.17 | ||
| 3PREDICTED: Glycine max tRNA methyltransferase 10 homolog A-like (LOC100779099), mRNA | 1337 | 0.10 | |||
| 3PREDICTED: Glycine max uncharacterized LOC102665381 (LOC102665381), mRNA | 1227 | 0.08 | |||
| 3PREDICTED: Glycine max polygalacturonase-like (LOC100785701), mRNA | 1551 | 0.10 | |||
| 3PREDICTED: Glycine max uncharacterized LOC100814752 (LOC100814752), mRNA | 1945 | 0.11 | |||
| 3PREDICTED: Glycine max alpha,alpha-trehalose-phosphate synthase [UDP-forming] 1-like (LOC100797320), transcript variant X6, mRNA | 3542 | 0.12 | |||
| 3PREDICTED: Glycine max uncharacterized LOC100814749 (LOC100814749), mRNA | 1213 | 0.10 | |||
| 3PREDICTED: Glycine max uncharacterized LOC100801946 (LOC100801946), transcript variant X1, mRNA | 3888 | 0.09 | |||
| 2019 | Chr19 | 3PREDICTED: Glycine max uncharacterized LOC102659858 (LOC102659858), mRNA | 1709 | 0.10 | |
| Chr02 | 2PREDICTED: Glycine max ataxin-2-like (LOC100788042), mRNA | 1135 | 0.18 | ||
| Chr04 | 2522 | 0.10 | |||
| Chr08 | 2PREDICTED: Glycine max metacaspase-3-like (LOC100796113), transcript variant X2, mRNA | 1839 | 0.12 | ||
| Chr11 | 3PREDICTED: Glycine max DNA replication complex BAG protein, transcript variant 2, mRNA, | 1257 | 0.47 | ||
| Chr12 | 2PREDICTED: Glycine max uncharacterized LOC102660202 (LOC102660202), mRNA | 2799 | 0.17 | ||
| 2PREDICTED: Glycine max DNA ligase 1-like (LOC100818049), mRNA | 2728 | 0.17 | |||
| Chr20 | 2PREDICTED: Glycine max 26S proteasome non-ATPase regulatory subunit 7 homolog A-like (LOC100816479), mRNA | 896 | 0.08 |
Fig. 7Distribution Map of Candidate Genes in the different functional process
Fig. 8Box-plot representation of two candidate genes expression in different tissues. a Variance analysis of Glyma.11G229600.1 expression in different tissues of soybean. b Variance analysis of Glyma.04G102900.1 expression in different tissues of soybean. The different uppercase letters indicate significant differences at P < 0.01, as determined by Duncan’s multiple-range test
Names of 14 soybean lines and average levels of five fatty acids
| Group | Name of line | Oil | Protein | Oleic acid | Linoleic acid | Linolenic acid | Palmitic acid | Stearic acid |
|---|---|---|---|---|---|---|---|---|
| Low oleic acid content | q001 | 21.1 | 37.62 | 9.56 | 67.85 | 10.77 | 9.79 | 1.26 |
| q024 | 20.18 | 39.1 | 10.42 | 67.19 | 8.57 | 9.02 | 0.75 | |
| q020 | 20.94 | 39.12 | 11.55 | 65.56 | 8.6 | 8.59 | 1.01 | |
| q008 | 17.94 | 43.49 | 11.67 | 64.32 | 7.75 | 9.48 | 1.01 | |
| q035 | 19.13 | 39.17 | 11.99 | 62.45 | 9.43 | 8.73 | 1.38 | |
| T7287 | 20.96 | 37.76 | 12.23 | 60.62 | 7.74 | 8.22 | 2.36 | |
| q033 | 21.1 | 38.03 | 12.4 | 62.92 | 9.33 | 9.86 | 0.81 | |
| High oleic acid content | q318 | 16.88 | 40.29 | 25.34 | 57.71 | 8.17 | 2.32 | 3.27 |
| q20 | 22.19 | 40.42 | 26.41 | 54.1 | 6.04 | 2.65 | 1.8 | |
| q073 | 21.29 | 41.54 | 28.08 | 50.94 | 7.97 | 1.72 | 2.69 | |
| q176 | 12.36 | 38.91 | 29.73 | 55.13 | 8.85 | 0.76 | 5.06 | |
| q070 | 20.9 | 40.99 | 29.78 | 51.08 | 7.09 | 1.67 | 1.89 | |
| q68 | 16.91 | 39.44 | 29.94 | 53.64 | 6.01 | 0.56 | 4.43 | |
| q353 | 15.59 | 39.49 | 31.02 | 52.18 | 8.6 | 0.99 | 5.73 |