| Literature DB >> 34225652 |
Alina I Chernova1,2, Rim F Gubaev3,4, Anupam Singh5, Katrina Sherbina5, Svetlana V Goryunova3,6,7, Elena U Martynova3, Denis V Goryunov3,8, Stepan V Boldyrev3,4, Anna A Vanyushkina3, Nikolay A Anikanov3, Elena A Stekolshchikova3, Ekaterina A Yushina3,9, Yakov N Demurin10, Zhanna M Mukhina11, Vera A Gavrilova12, Irina N Anisimova12, Yulia I Karabitsina12, Natalia V Alpatieva12, Peter L Chang5, Philipp Khaitovich3, Pavel V Mazin3, Sergey V Nuzhdin5.
Abstract
BACKGROUND: Sunflower is an important oilseed crop domesticated in North America approximately 4000 years ago. During the last century, oil content in sunflower was under strong selection. Further improvement of oil properties achieved by modulating its fatty acid composition is one of the main directions in modern oilseed crop breeding.Entities:
Keywords: Fatty acids; GBS; GWAS; Genetic markers; Sunflower; Triglycerides; UPLC-MS
Mesh:
Substances:
Year: 2021 PMID: 34225652 PMCID: PMC8256595 DOI: 10.1186/s12864-021-07768-y
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Population structure of germplasm and linkage disequilibrium (LD) values. A Estimated cross-validation error value for possible cluster number from 1 to 10. B Subpopulations were assessed using Principal Component Analysis. Each dot corresponds to a sunflower accession used in the study. Color corresponds to sunflower lines from different collections. Agroplasma_SM indicates sterility maintainer lines from Agroplasma; Agroplasma_FR indicates fertility restorer lines. C, D Genome-wide (C) and per-chromosome 3 (D) LD-decay. Lines correspond to loess curves
Fig. 2The relationship between sunflower germplasm of different origins estimated based on 2345 SNPs shared between this and the Hübner (2019) studies. A The first and the second PCA components. B The first and the third PCA components. Each dot corresponds to a plant accession. Colors indicate the origin
Fig. 3FAs concentrations in replication experiments. A M ultidimensional scaling plot (two dimensions, 1 - Spearman correlation coefficient between FAs abundances was used as distance). One sample is shown by one point; accessions are shown by different colors; different years are shown by points of different shapes. B Minus log10 p-values for the differences between lines (ANOVA) are shown, FAs are ordered by p-value increase. Bonferroni adjusted 0.05 significance level is shown by red line; C-G) abundances of oleic acid (18:1) (C), linoleic acid (18:2) (D),) palmitic acid (16:0) (E), eicosenoic acid (20:1) (F), and linolenic acid (18:3) (G) are shown across lines and years. Each point represents 1 sample, point shapes, and colors as in (A), lines show per-year averages
Fig. 4Lipid annotation. A Mz/rt. plot. One point represents one peak; different lipid categories are shown in different colors. Only peaks with sample intensities at least two times higher than blank intensities are shown. B Relative intensities of all lipid categories. The intensity of a given category was calculated as the sum of intensities of all lipids in the category. GL- glycerolipids, GP- glycophospholipids, FA -fatty acids, ST- sterols, PR- prenols, PK- polyketides and SP-saccharolipids
Fig. 5Schematic representation of fatty acid properties (fatty acid chain length and degree of saturation) for detected lipids. A Cumulative chain length and double bonds number of the three fatty acid residues composing the detected TAG molecules. B Chain length and double bonds number of fatty acid (FAs) released after lipid hydrolysis. Each circle corresponds to a FA or a TAG. The circles’ size corresponds to the mean relative amount of this molecule in a sample (log-transformed MS peak intensity)
Fig. 6GWAS results for FAs in sunflower lines and candidate genes for docosanoic acid improvement. A Cumulative plot representing the number of significant associations for each of the traits. Traits represented by colors. Chromosome number and the number of SNPs are presented on the X and Y-axes, respectively. B LD block in Chr3 (Location 44,696,624–46,188,263). C LD block in Chr3 (Location 42,596,595–43,078,214). D LD block in Chr14 (Location 91,496,885–91,547,710). Candidate genes in blue associated with lipid metabolism, Candidate genes in green associated with lipid metabolism described by Badouin et.al (2017) [31]
LD blocks with significant associations
| Phenotype | Chromosome | LD block Location | Length (kb) | |
|---|---|---|---|---|
| Start position | End position | |||
| Oleic Acid (18:1) | 6 | 64,066,219 | 64,889,534 | 823 |
| 9 | 168,736,699 | 169,306,761 | 570 | |
| 13 | 116,940,760 | 117,370,881 | 430 | |
| 15 | 38,043,597 | 38,078,709 | 35 | |
| Linoleic Acid (18:2) | 3 | 66,733,584 | 68,666,170 | 932 |
| 5 | 37,199,838 | 37,569,381 | 396 | |
| 11 | 5,004,818 | 50,619,247 | 414 | |
| 11 | 95,051,157 | 92,468,132 | 416 | |
| Linolenic acid (18:3) | 11 | 43,846,946 | 44,328,722 | 481 |
| Oleic/Linoleic ratio | 3 | 66,733,584 | 68,666,170 | 932 |
| 12 | 121,534,492 | 121,906,701 | 372 | |
| Nonadecanoic acid (19:0) | 2 | 179,620,148 | 179,872,251 | 252 |
| 14 | 53,394,600 | 53,480,813 | 86 | |
| 14 | 59,829,070 | 60,503,626 | 664 | |
| Docosanoic acid (22:0) | 3 | 32,332,262 | 32,562,669 | 230 |
| 3 | 42,596,595 | 43,078,214 | 481 | |
| 3 | 44,696,624 | 46,188,263 | 1491 | |
| 3 | 48,304,030 | 49,705,352 | 1401 | |
| 3 | 53,949,047 | 54,230,339 | 281 | |
| 3 | 57,635,146 | 57,714,809 | 79 | |
| 14 | 91,496,885 | 91,547,710 | 50 | |
| 14 | 96,632,645 | 97,927,934 | 614 | |
| 16 | 176,846,705 | 176,869,659 | 22 | |
| Tetracosanoic acid (24:0) | 2 | 56,777,868 | 56,880,436 | 102 |
| 2 | 73,398,255 | 74,229,960 | 831 | |
| 3 | 102,040,303 | 102,070,280 | 29 | |
| Nervonic acid (24:1) | 3 | 44,696,624 | 46,188,263 | 1491 |
| 3 | 57,635,146 | 57,714,809 | 79 | |