| Literature DB >> 32117401 |
Amelia Salimonti1, Fabrizio Carbone1, Elvira Romano1, Massimiliano Pellegrino1, Cinzia Benincasa1, Sabrina Micali2, Alessandro Tondelli3, Francesca L Conforti4, Enzo Perri1, Annamaria Ienco5, Samanta Zelasco1.
Abstract
Cultivated olive (Olea europaea L. subsp. europaea var. europaea) is the most ancient and spread tree crop in the Mediterranean basin. An important quality trait for the extra virgin olive oil is the fatty acid composition. In particular, a high content of oleic acid and low of linoleic, linolenic, and palmitic acid is considered very relevant in the health properties of the olive oil. The oleate desaturase enzyme encoding-gene (FAD2-2) is the main responsible for the linoleic acid content in the olive fruit mesocarp and, therefore, in the olive oil revealing to be the most important candidate gene for the linoleic acid biosynthesis. In this study, an in silico and structural analysis of the 5'UTR intron of the FAD2-2 gene was conducted with the aim to explore the natural sequence variability and its role in the gene expression regulation. In order to identify functional allele variants, the 5'UTR intron was isolated and partially sequenced in 97 olive cultivars. The sequence analysis allowed to find a 117-bp insertion including two long duplications never found before in FAD2-2 genes in olive and the existence of many intron-mediated enhancement (IME) elements. The sequence polymorphism analysis led to detect 39 SNPs. The candidate gene association study conducted for oleic and linoleic acids content revealed seven SNPs and one indel significantly associated able to explain a phenotypic variation ranging from 7% to 16% among the years. Our study highlighted new structural variants within the FAD2-2 gene in olive, putatively involved in the regulation mechanisms of gene expression associated with the variation of the content of oleic and linoleic acid.Entities:
Keywords: 5′UTR intron; FAD2-2 gene; Olea europaea; SNP; association study
Year: 2020 PMID: 32117401 PMCID: PMC7031445 DOI: 10.3389/fpls.2020.00066
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Pearson correlation indexes (A, B) for climate parameters and Spearman correlation indexes (C, D) for fatty acid composition among years. The asterisks indicate the significance of statistical test.
| A | C | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Temperature | 2003 | 2004 | 2005 | 2006 | Oleic acid | 2003 | 2004 | 2005 | 2006 |
| 2004 | 0.95*** | 2004 | 0.83*** | ||||||
| 2005 | 0.95*** | 0.95*** | 2005 | 0.9*** | 0.88*** | ||||
| 2006 | 0.97*** | 0.98*** | 0.98*** | 2006 | 0.69*** | 0.65*** | 0.71*** | ||
| 2007 | 0.97*** | 0.98*** | 0.98*** | 0,99*** | 2007 | 0.83*** | 0.88*** | 0.9*** | 0.67*** |
| B | D | ||||||||
| Temperature | 2003 | 2004 | 2005 | 2006 | Oleic acid | 2003 | 2004 | 2005 | 2006 |
| 2004 | −0.04 | 2004 | 0.8*** | ||||||
| 2005 | −0.05 | 0.04 | 2005 | 0.8*** | 0.9*** | ||||
| 2006 | −0.05 | 0.33*** | −0.05 | 2006 | 0.6*** | 0.61*** | 0.6*** | ||
| 2007 | 0.15* | 0.07 | −0.04 | 0.07 | 2007 | 0.7*** | 0.6*** | 0.86*** | 0.61*** |
*significant (P = 0.05); ***high significant (P = 0.01).
Figure 1Frequency distribution of the 97 olive varieties for oleic acid (A) and linoleic acid (B) content.
Figure 2(A) Alignment of a fragment of the intron region of OeFAD2-2a and OeFAD2-2b from cultivar Nocellara messinese. Three long indels of 13 bp, 117 bp and 5 bp, respectively, are underlined. (B) Partial view of OeFAD2-2b intron region spanning the 117 nucleotide-long insertion (highlighted in grey). Two stretches of 49 nucleotides (underlined with a solid black line) and 53 nucleotides (dashed black line) are duplicated. Polymorphic bases within duplications are marked in bold italic. * Similar nucleotide.
Figure 3Partial nucleotide sequence of the OeFAD2-2b 5′UTR from olive cultivar Nocellara messinese. In italics the sequence regions analyzed in other 96 cultivars and in bold single-nucleotide polymorphisms (SNPs). In the boxes, the GT and AG dinucleotides at both ends of the intronic region and ATG as translational initiation are shown. The insertion of 117bp, not present in the sequence of the OeFAD2-2a allele, is shaded grey. In dark grey the pentamer CGATT belonging to IME signals. Moreover, several potential cis-regulatory elements are underlined and designated with the names of each of the motifs.
Results of the LD analysis where the distance between pair of SNPs and their significant pairwise associations were calculated using both the statistical D′ and R.
| SNP1 | SNP2 | Dist | D′* | R* |
|---|---|---|---|---|
| SNP5 | SNP9 | 324 | −1 | −0.175 |
| SNP5 | SNP13 | 383 | −1 | −0.183 |
| SNP4 | SNP9 | 437 | −0.833 | −0.186 |
| SNP13 | SNP167 | 1338 | −0.835 | −0.186 |
| SNP14 | SNP167 | 1313 | −1 | −0.193 |
| SNP15 | SNP167 | 1312 | −1 | −0.193 |
| SNP5 | SNP20 | 513 | −1 | −0.195 |
| SNP12 | SNP23 | 194 | −1 | −0.196 |
| SNP12 | SNP26 | 337 | −1 | −0.199 |
| SNP4 | SNP14 | 521 | −1 | −0.201 |
| SNP4 | SNP15 | 522 | −1 | −0.202 |
| SNP2 | SNP12 | 653 | 0.696 | 0.203 |
| SNP2 | SNP25 | 908 | 0.228 | 0.227 |
| SNP4 | SNP13 | 496 | −1 | −0.233 |
| SNP2 | SNP4 | 169 | −0.458 | −0.236 |
| SNP6 | SNP14 | 394 | 0.35 | 0.239 |
| SNP6 | SNP15 | 395 | 0.35 | 0.239 |
| SNP12 | SNP25 | 255 | −1 | −0.241 |
| SNP2 | SNP168 | 2023 | 0.636 | 0.245 |
| SNP14 | SNP23 | 157 | −1 | −0.263 |
| SNP15 | SNP23 | 156 | −1 | −0.264 |
| SNP14 | SNP26 | 300 | −1 | −0.266 |
| SNP15 | SNP26 | 299 | −1 | −0.267 |
| SNP5 | SNP23 | 565 | −0.816 | −0.27 |
| SNP4 | SNP5 | 113 | 0.349 | 0.273 |
| SNP13 | SNP23 | 182 | −0.894 | −0.273 |
| SNP5 | SNP26 | 708 | −0.819 | −0.274 |
| SNP11 | SNP12 | 32 | 1 | 0.277 |
| SNP6 | SNP20 | 499 | 0.333 | 0.281 |
| SNP2 | SNP11 | 621 | −0.329 | −0.284 |
| SNP9 | SNP23 | 241 | −1 | −0.291 |
| SNP20 | SNP23 | 52 | −0.905 | −0.294 |
| SNP9 | SNP26 | 384 | −1 | −0.295 |
| SNP11 | SNP168 | 1402 | −0.895 | −0.298 |
| SNP20 | SNP26 | 195 | −0.906 | −0.298 |
| SNP6 | SNP9 | 310 | 0.404 | 0.306 |
| SNP13 | SNP26 | 325 | −1 | −0.309 |
| SNP25 | SNP168 | 1115 | 0.817 | 0.313 |
| SNP6 | SNP13 | 369 | 0.401 | 0.318 |
| SNP4 | SNP26 | 821 | −0.754 | −0.323 |
| SNP14 | SNP25 | 218 | −1 | −0.323 |
| SNP15 | SNP25 | 217 | −1 | −0.324 |
| SNP25 | SNP167 | 1095 | 0.564 | 0.337 |
| SNP20 | SNP25 | 113 | −0.851 | −0.339 |
| SNP11 | SNP20 | 174 | 0.741 | 0.34 |
| SNP13 | SNP25 | 243 | −0.917 | −0.343 |
| SNP4 | SNP11 | 452 | 0.576 | 0.344 |
| SNP167 | SNP168 | 20 | 0.546 | 0.351 |
| SNP9 | SNP25 | 302 | −1 | −0.357 |
| SNP11 | SNP167 | 1382 | −0.7 | −0.362 |
| SNP5 | SNP11 | 339 | 0.781 | 0.366 |
| SNP11 | SNP13 | 44 | 0.856 | 0.369 |
| SNP4 | SNP23 | 678 | −0.875 | −0.37 |
| SNP11 | SNP15 | 70 | 1 | 0.371 |
| SNP11 | SNP14 | 69 | 1 | 0.372 |
| SNP2 | SNP6 | 296 | 0.673 | 0.385 |
| SNP2 | SNP26 | 990 | 0.466 | 0.387 |
| SNP5 | SNP25 | 626 | −0.964 | −0.392 |
| SNP23 | SNP168 | 1176 | 0.845 | 0.397 |
| SNP6 | SNP12 | 357 | 0.789 | 0.402 |
| SNP9 | SNP11 | 15 | 1 | 0.412 |
| SNP4 | SNP25 | 739 | −0.804 | −0.417 |
| SNP2 | SNP23 | 847 | 0.513 | 0.421 |
| SNP26 | SNP167 | 1013 | 0.59 | 0.426 |
| SNP26 | SNP168 | 1033 | 0.922 | 0.428 |
| SNP23 | SNP167 | 1156 | 0.667 | 0.488 |
| SNP12 | SNP20 | 142 | 1 | 0.604 |
| SNP12 | SNP13 | 12 | 1 | 0.644 |
| SNP11 | SNP26 | 369 | −0.905 | −0.648 |
| SNP9 | SNP12 | 47 | 1 | 0.674 |
| SNP23 | SNP25 | 61 | 0.832 | 0.679 |
| SNP11 | SNP23 | 226 | −0.968 | −0.685 |
| SNP12 | SNP14 | 37 | 1 | 0.746 |
| SNP12 | SNP15 | 38 | 1 | 0.746 |
| SNP25 | SNP26 | 82 | 0.917 | 0.757 |
| SNP11 | SNP25 | 287 | −0.874 | −0.759 |
| SNP9 | SNP20 | 189 | 0.903 | 0.81 |
| SNP14 | SNP20 | 105 | 1 | 0.81 |
| SNP15 | SNP20 | 104 | 1 | 0.81 |
| SNP13 | SNP20 | 130 | 0.91 | 0.854 |
| SNP13 | SNP14 | 25 | 1 | 0.863 |
| SNP13 | SNP15 | 26 | 1 | 0.863 |
| SNP23 | SNP26 | 143 | 0.88 | 0.87 |
| SNP9 | SNP14 | 84 | 1 | 0.903 |
| SNP9 | SNP15 | 85 | 1 | 0.903 |
| SNP9 | SNP13 | 59 | 0.952 | 0.91 |
| SNP14 | SNP15 | 1 | 1 | 1 |
*Significant pairwise associations using both the Fischer’s exact test and Chi-square test and Bonferroni correction.
Figure 4LD decay calculated on inferred haplotypes using r2 parameters.
Figure 5Genotypic effects of the significantly associated SNPs on oleic and linoleic acid content in different years. The X-axis indicates the genotype status of cultivars (letters) and the absolute frequency of genotypes (number). R2: is the statistical used for association analysis and p is the Benjamini-Hochberg Adjusted p value.