| Literature DB >> 24576357 |
Linhai Wang, Sheng Yu, Chaobo Tong, Yingzhong Zhao, Yan Liu, Chi Song, Yanxin Zhang, Xudong Zhang, Ying Wang, Wei Hua, Donghua Li, Dan Li, Fang Li, Jingyin Yu, Chunyan Xu, Xuelian Han, Shunmou Huang, Shuaishuai Tai, Junyi Wang, Xun Xu, Yingrui Li, Shengyi Liu, Rajeev K Varshney, Jun Wang, Xiurong Zhang.
Abstract
BACKGROUND: Sesame, Sesamum indicum L., is considered the queen of oilseeds for its high oil content and quality, and is grown widely in tropical and subtropical areas as an important source of oil and protein. However, the molecular biology of sesame is largely unexplored.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24576357 PMCID: PMC4053841 DOI: 10.1186/gb-2014-15-2-r39
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Summary of sesame genome assembly and annotation
| Contigs | All | 26,239 | 52.17 kb/1,545 | 11.40 kb/5,534 | 270 Mb |
| Scaffolds | All | 16,444 | 2.10 Mb/42 | 268.23 kb/169 | 274 Mb |
| Anchored on chromosomes | 150 | - | - | 234 Mb | |
| Anchored on chromosomes and oriented | 117 | - | - | 207 Mb | |
| Annotation | | Number | Total length | Percentage of the assembly | |
| Protein coding genes | All | 27,148 | 86.08 Mb | 31.46 | |
| Transposable elements | All | - | 78.86 Mb | 28.46 | |
| LTR retroelements | - | 48.03 Mb | 17.56 | ||
| Non-LTR retrotransposonsa | - | 11.70 Mb | 4.28 | ||
| DNA transposons | - | 10.88 Mb | 3.98 | ||
| Unknown | - | 14.64 Mb | 5.35 | ||
| Non-coding RNAs | rRNA fragments | 386 | 89.66 kb | <0.04 | |
| tRNAs | 870 | 65.31 kb | <0.03 | ||
| miRNAs | 207 | 25.41 kb | <0.01 | ||
| snRNAs | 268 | 33.93 kb | <0.02 | ||
aLong interspersed nuclear elements (LINEs) and short interspersed nuclear elements (SINEs). LTR, long terminal repeats; miRNA, microRNA; snRNA, small nuclear RNA.
Figure 1Distribution of the basic genomic elements of sesame. (A) Pseudomolecules. (B) Gene density (mRNA); the frequency of sites within gene regions per 500 kb ranged from 0.04 to 0.61. (C) DNA transposon element (TE) density; the frequency of sites within DNA TE regions per 500 kb ranged from 0 to 0.22. (D) Retrotransposon element density; the frequency of sites per 500 kb within retrotransposon element regions ranged from 0 to 0.71. (E) GC content; the ratio of GC sites per 100 kb ranged from 0.32 to 0.40. Inner ribbons indicate self-collinearity of sesame, and the homologous regions of more than 1 Mb are highlighted. Circos [24] was used to construct the diagram.
Figure 2Genome evolution of sesame. (a) Left: shared and unique gene families in each lineage. Right: distribution of the asterids-specific gene families in sesame, U. gibba, tomato and potato. Gene numbers in related gene families are listed in brackets. (b) The relationships between grapevine and two subgenomes of sesame. These regions are listed in Table S15 in Additional file 1. For each segment in the grapevine chromosome, two obvious duplicated collinear segments from sesame were aligned. According to the retained gene ratio in duplicated segments, the high (red) and low-fractionated segments (blue) are represented. (c) Polyploidization events in the asterids I lineage. Age was estimated according to Ks distribution. γ, gamma triplication event in eudicot ancestor; T, triplication event in the tomato-potato lineage; D, recent duplication event in sesame; Dmr, the most recent duplication event in U. gibba; Sin, Sesamum indicum; Sly, Solanum lycopersicum; Stu, Solanum tuberosum; Vvi, Vitis vinifera; Ugi, Utricularia gibba. See Supplementary Note 4 in Additional file 1 for more details.
Figure 3Gene expansion of LTP1 in sesame. (a) Maximum-likelihood tree showing expansions of LTP1 in sesame. Red dots, sesame; green dots, Arabidopsis; yellow dots, soybean; blue dots, grapevine. (b) Expression patterns and tandem arrays of LTP1 in sesame. Twelve transcriptomes corresponding to the seeds of 10, 20, 25 and 30 days post anthesis (DPA) of three accessions (from left to right): ZZM4728 (high oil content, 59% of seed), ZZM3495 (low oil content, 51% of seed) and ZZM2161 (low oil content, 48% of seed).
Figure 4Expression profiles of the lipid-related genes in sesame seed. (a) Hierarchical clustering of the sesame seed samples based on the expression levels (RPKM) of 416 predicted lipid-related genes. (b) Differently expressed genes (DEGs) of 10 DPA seeds in the downstream part of the triacylglycerol (TAG) synthesis pathway. DEGs between the high and low lipid content accessions in this pathway are marked in red. Colored squares indicate the expression level (RPKM) of the 10 DEGs encoding phosphatidylcholine:diacylglycerol cholinephosphotransferase (PDCT), phospholipid: diacylglycerol acyltransferase (PDAT), oil-body proteins (steroleosin, caleosin and oleosin) and transcription factors (ABI3, ABI4 and WRI1) in the three accessions (from left to right): ZZM4728 (high oil content, 59% of seed), ZZM3495 (low oil content, 51% of seed), and ZZM2161 (low oil content, 48% of seed). The correlation (r) of expression patterns between transcription factors and genes was calculated using Pearson’s correlation coefficients (PCCs) based on all 12 transcriptomes. Dashed arrows indicate potential positive regulation.
Figure 5Pairwise nucleotide diversity () and total polymorphism () of sesame. (a) Distributions of π (red) and θw (blue) of the sesame genome and positions of lipid-related genes (Figure S23 in Additional file 1). The two lines of bars below the axis of π or θw indicate the positions of the lipid-related genes in sesame. Blue bars, lipid-related genes except for LTP1; red bars, LTP1 genes. (b) Scatterplot of π and θw values of LTP1 genes (red triangles), non-LTP1 lipid-related genes (blue triangles) and the other genes (green circles).