| Literature DB >> 32490345 |
Joel A Malek1,2, Sweety Mathew2, Lisa S Mathew2, Shameem Younuskunju2,3, Yasmin A Mohamoud2, Karsten Suhre1,4.
Abstract
The fruit of date palm trees are an important part of the diet for a large portion of the Middle East and North Africa. The fruit is consumed both fresh and dry and can be stored dry for extended periods of time. Date fruits vary significantly across hundreds of cultivars identified in the main regions of cultivation. Most dried date fruit are low in sucrose but high in glucose and fructose. However, high sucrose content is a distinctive feature of some date fruit and affects flavor as well as texture and water retention. To identify the genes controlling high sucrose content, we analyzed date fruit metabolomics for association with genotype data from 120 date fruits. We found significant association of dried date sucrose content and a genomic region that contains 3 tandem copies of the beta-fructofuranosidase (invertase) gene in the reference Khalas genome, a low-sucrose fruit. High-sucrose cultivars including the popular Deglet Noor had a homozygous deletion of two of the 3 copies of the invertase gene. We show the deletion allele is derived when compared to the ancestral allele that retains all copies of the gene in 3 other species of Phoenix. The fact that 2 of the 3 tandem invertase copies are associated with dry fruit sucrose content will assist in better understanding the distinct roles of multiple date palm invertases in plant physiology. Identification of the recessive alleles associated with end-point sucrose content in date fruit may be used in selective breeding in the future.Entities:
Keywords: SNP association; date palm; fruit quality; invertase; marker‐assisted breeding; sucrose
Year: 2020 PMID: 32490345 PMCID: PMC7251787 DOI: 10.1002/pld3.214
Source DB: PubMed Journal: Plant Direct ISSN: 2475-4455
SNPs with high association with the sucrose phenotype. Based on alignment to PDK30 genome. Ref/Alt determination is based on the Khalas reference sequence
| Date palm scaffold ID | Location (bp) | Reference allele | Alternative (Sucrose) allele |
|---|---|---|---|
| PDK30s742521_60107 | 39,241 | C | G |
| PDK30s742521_60107 | 39,533 | A | T |
| PDK30s742521_60107 | 39,691 | C | A |
| PDK30s742521_60107 | 40,462 | A | C |
| PDK30s742521_60107 | 40,498 | T | C,A |
| PDK30s6550997_92357 | 13,026 | C | A |
| PDK30s6550997_92357 | 7,860 | C | G |
| PDK30s6550997_92357 | 6,838 | C | T |
| PDK30s6550997_92357 | 9,130 | A | G |
| PDK30s6550997_92357 | 9,214 | G | A |
| PDK30s6550997_92357 | 9,215 | C | A,T |
| PDK30s6550997_92357 | 9,216 | A | C |
Figure 1Identification of an allele associated with sucrose content in date fruit. (a) Date palm sequence contigs containing SNPs associated with fruit content were aligned to the oil palm reference sequence and found to derive mainly from a 1 Mb region of contig NC_026001.1 of Elaeis guineensis chromosome 9. PDK30: Date Palm contigs are represented as blue rectangles and PDK30s742521_60107 that contains the invertase is outlined in red. A red line is drawn at the multiple testing corrected genome‐wide significance of [−log(10) of 10]. (b) Two randomly selected PDK30 contigs with SNPs plotted as in (a) showing SNP p‐values do not reach genome‐wide significance. (c) A heatmap of normalized coverage for selected samples from varying genotypes across the region. Samples with high sucrose are shown to be deleted in the region containing the invertase gene. dnPdF is predicted to have this value of sucrose given other Deglet Noor genomes. Sucrose (norm) column is Bradford normalized ion count values for each fruit type. (d) Coverage grouped by genotype in the region of the invertase gene. Lines are loess‐smoothed and diploid coverage is normalized to 1. The deleted samples who know coverage in the region while heterozygotes show reduced coverage. Phoenix species are likely homozygous for the allele containing both copies of the invertase but show slightly lower coverage than homozygous date fruit samples likely due to sequence divergence impacting sequence alignment of Phoenix sequences to the Khalas reference
Figure 2High sucrose content in date palm is associated with a deleted allele. A plot of normalized sucrose content in date fruit shows significant association (Wilcoxon rank‐sum test, one‐sided) for samples homozygous deleted for the invertase genes (Del) versus those heterozygous for the deletion (Het) or homozygous for the ancestral, non‐deleted allele (Hmz). Genotypes at the position 39533 of contig PDK30s742521_60107
Figure 3A dot plot of the sequence similarity between the sucrose‐associated region in high‐sucrose Deglet Noor and low‐sucrose Khalas (GenBank Accession numbers MT009343 and MT009344, respectively). Sequences are highly conserved between the two genomes except in the region from 0 to 50 kb where the Deglet Noor genome shows loss of the duplicated invertase genes
Figure 4SNPs are linked to the sucrose content associated deletion. Normalized genome coverage in the deletion region for all samples containing either of the three SNP genotypes at the scaffold PDK30s742521_60107 position 39533 (Table 1). These 3 SNPs may be used as tags for the deletion
Occurrence of the deleted allele by geographical region
| Location | Genotype | Count | % of samples |
|---|---|---|---|
| East | Del | 3 | 5.5 |
| Het | 1 | 1.9 | |
| Hmz | 50 | 92.6 | |
| Total | 54 | ||
| West | Del | 3 | 4.5 |
| Het | 15 | 22.4 | |
| Hmz | 49 | 73.1 | |
| Total | 67 |
Abbreviations: Del, homozygous deletion; Het, heterozygous for the reference allele and deletion; Hmz, homozygous for the reference allele.