| Literature DB >> 23917264 |
Ibrahim S Al-Mssallem1, Songnian Hu, Xiaowei Zhang, Qiang Lin, Wanfei Liu, Jun Tan, Xiaoguang Yu, Jiucheng Liu, Linlin Pan, Tongwu Zhang, Yuxin Yin, Chengqi Xin, Hao Wu, Guangyu Zhang, Mohammed M Ba Abdullah, Dawei Huang, Yongjun Fang, Yasser O Alnakhli, Shangang Jia, An Yin, Eman M Alhuzimi, Burair A Alsaihati, Saad A Al-Owayyed, Duojun Zhao, Sun Zhang, Noha A Al-Otaibi, Gaoyuan Sun, Majed A Majrashi, Fusen Li, Jixiang Wang, Quanzheng Yun, Nafla A Alnassar, Lei Wang, Meng Yang, Rasha F Al-Jelaify, Kan Liu, Shenghan Gao, Kaifu Chen, Samiyah R Alkhaldi, Guiming Liu, Meng Zhang, Haiyan Guo, Jun Yu.
Abstract
Date palm (Phoenix dactylifera L.) is a cultivated woody plant species with agricultural and economic importance. Here we report a genome assembly for an elite variety (Khalas), which is 605.4 Mb in size and covers >90% of the genome (~671 Mb) and >96% of its genes (~41,660 genes). Genomic sequence analysis demonstrates that P. dactylifera experienced a clear genome-wide duplication after either ancient whole genome duplications or massive segmental duplications. Genetic diversity analysis indicates that its stress resistance and sugar metabolism-related genes tend to be enriched in the chromosomal regions where the density of single-nucleotide polymorphisms is relatively low. Using transcriptomic data, we also illustrate the date palm's unique sugar metabolism that underlies fruit development and ripening. Our large-scale genomic and transcriptomic data pave the way for further genomic studies not only on P. dactylifera but also other Arecaceae plants.Entities:
Mesh:
Year: 2013 PMID: 23917264 PMCID: PMC3741641 DOI: 10.1038/ncomms3274
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
An overview of the Phoenix dactylifera genome assembly.
| Newbler contig | 194,980 | 5.8 | 92.6 | 507.9 |
| Contig | 82,863 | 195.4 | 1,898.0 | 553.3 |
| Scaffold | 82,354 | 329.9 | 4,533.7 | 558.0 |
Figure 1A Venn diagram showing the distribution of shared gene families among representative angiosperms.
Phoenix dactylifera, Arabidopsis thaliana, Oryza sativa, Sorghum bicolor and Vitis vinifera (OrthoMCL 2.0.2, E<1e-5). The number of gene families, the number of genes in the families and the total number of genes are indicated under species names, which are separated by slashes. A total of 12,624 sequences in 1,127 gene families are unique to P. dactylifera, and these unique gene families are mostly related to DNA/RNA metabolic process and ion binding (Supplementary Table S15).
Figure 2Ks distribution of GWD in selected monocotyledons and dicotyledons.
The density distribution of each monocotyledon (a) or dicotyledon (b) was calculated based on kernel density estimation with a bin size of 0.001. The evolutionary rate is relative slow in trees4748, such as those of P. trichocarpa and V. vinifera. We suppose that the P. dactylifera genome also has this character similar to oil palm7. The minor peak of O. sativa (Ks ~0.06) was recently suggested to be a GWD event happened ~70 Mya and the genome underwent an illegitimate recombination followed by fast evolution49. Z. mays had a recent GWD event (Ks~0.2, ~5–12 Mya) as compared with other Gramineae species50. In M. truncatula, a peak near 0 (Ks~0.05) is due to local duplication and faster evolution rate as compared with other vascular plants51.
Figure 3Macro-synteny among P. dactylifera scaffolds and other monocotyledon chromosomes.
Scaffolds pdS00001 (4.5 M), pdS00072 (1.1 M), pdS00080 (1 M) pdS00086 (1 M) and pdS00147 (0.7 M) have the same syntenic pattern with other monocotyledons. In addition, the latter four scaffolds have syntenic regions with pdS00001 and are inferred to be duplicated at monocotyledon ancestry (Supplementary Fig. S5). The unit of scale is million base pairs or Mb for other monocotyledon chromosomes. All P. dactylifera scaffolds are enlarged 30 × for illustration.
Figure 4Expression patterns of the LEA genes among 12 tissues of P. dactylifera.
The LEA subfamilies are colour-coded (on the right side). The expression levels are normalized based on the Z-score method (scale bar) over all libraries (on the top).
Figure 5Expression profiling of sugar metabolism-related genes at different fruiting stages.
(a) Gene expression profiling of 30 key enzymes related to sugar metabolism. The hierarchical clustering shows that these genes can be classified into two patterns: genes expressed at the early developmental stages (EE, blue, 0–75 DPP) and at the late developmental stages (LE, red, 105–135 DPP). (b) Summary of sugar metabolisms during fruit development and ripening. Fruits at 150 DPP are completely ripe and we were unable to extract enough RNA for RNA-seq.
Figure 6Phylogenetic analysis of 11 P. dactylifera varieties based on SNPs and their SNPs frequency distributions.
The phylogenetic tree was constructed by using all SNP sites of the varieties (NJ method with 1,000 bootstrap, MEGA 5.0). The series of plots show SNP density (SNP per kb, the horizontal axis) and frequencies (the vertical axis). SNP frequency was calculated based on a 10-kb sliding window in 1-kb steps. The modes of SNP distribution are compounding, where the minor mode is contributed by SNP deserts.
Genes and gene families enriched in SNP deserts.
| | |||||
| Total number of genes | 41,660 | 9,036 | NA | NA | NA |
| LEA gene family | 84 | 21 | 0.339 | 0.5606 | – |
| NBS gene family | 144 | 35 | 0.364 | 0.5463 | – |
| Energy- and sugar-related genes | 390 | 124 | 13.753 | 0.0002 | Enriched |
LEA, late embryogenesis abundant, NA, not applicable; SNP, single-nucleotide polymorphism.
*Detail is shown at Supplementary Note 4.
†Detail is shown at Supplementary Note 5.