| Literature DB >> 26477832 |
Xin Wei1, Kunyan Liu2, Yanxin Zhang1, Qi Feng2, Linhai Wang1, Yan Zhao2, Donghua Li1, Qiang Zhao2, Xiaodong Zhu1, Xiaofeng Zhu1, Wenjun Li2, Danlin Fan2, Yuan Gao1, Yiqi Lu2, Xianmei Zhang3, Xiumei Tang4, Congcong Zhou2, Chuanrang Zhu2, Lifeng Liu3, Ruichun Zhong4, Qilin Tian2, Ziruo Wen2, Qijun Weng2, Bin Han2, Xuehui Huang2, Xiurong Zhang1.
Abstract
Oilseed crops are used to produce vegetable oil. Sesame (Sesamum indicum), an oilseed crop grown worldwide, has high oil content and a small diploid genome, but the genetic basis of oil production and quality is unclear. Here we sequence 705 diverse sesame varieties to construct a haplotype map of the sesame genome and de novo assemble two representative varieties to identify sequence variations. We investigate 56 agronomic traits in four environments and identify 549 associated loci. Examination of the major loci identifies 46 candidate causative genes, including genes related to oil content, fatty acid biosynthesis and yield. Several of the candidate genes for oil content encode enzymes involved in oil metabolism. Two major genes associated with lignification and black pigmentation in the seed coat are also associated with large variation in oil content. These findings may inform breeding and improvement strategies for a broad range of oilseed crops.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26477832 PMCID: PMC4634326 DOI: 10.1038/ncomms9609
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Figure 1Phylogenetic tree and geographic distributions of 705 sesame varieties.
(a) Neighbour-joining tree of all the varieties calculated from whole-genome SNPs. The two recognizable groups are coloured in red and blue, respectively. (b) Geographic distributions of all the varieties are indicated as spots in the world map, with the two groups colour coded as in a. (c) The level of genetic differentiation (FST) between the two groups is plotted against the whole genome.
Figure 2Large-scale genetic discoveries of agronomic traits in sesame.
For each trait, the associated loci (P<1 × 10−6) are indicated in the genome. The agronomic traits (labelled on the right) can be divided into six categories (labelled on the left). Among them, the traits of fatty acid composition were measured in Luohe and all the others were phenotyped in Sanya.
Figure 3Candidate causative genes and variants underlying oil content in sesame oilseeds.
(a) Negative log10 P-values for association of oil content in Luohe (Y axis) are plotted against SNP positions (X axis). The genome-wide significant P-value threshold (10−6) is indicated by a horizontal dash–dot line. The candidate genes are indicated near the association peaks. (b) Candidate causative variants in SiPPO. (c) qRT–PCR result of SiPPO in seeds. The mutated allele (with high oil content) shows quite low transcripts. The bar indicates s.d. (d) The candidate causative variants (a C-to-A missense SNP) in SiNST1. (e) The content of lignin in the seed coats from 14 sesame accessions. The varieties with ‘A' allele show higher level of lignin than those with ‘C' allele significantly (P<0.0001, Student's t-test). The bar indicates s.d.
Figure 4Network of genes controlling oil composition.
(a) Fatty acid composition in sesame seed. (b) Correlation of phenotypic variation for oil content and composition. (c) The simplified lipid metabolic pathway. The pathway is modified from Arabidopsis and maize1628. The candidate causative genes discovered for the variation of fatty acid synthesis in sesame germplasm are highlighted in red. The dotted lines represent multiple reaction steps. The plastid, endoplasmic reticulum (ER) and oil body are marked with light orange, light blue and yellow, respectively.
Figure 5A major gene for oilseed yield in sesame breeding.
(a) Photos of the phenotypic change from one capsule per axil in wild sesame to three capsules per axil in modern cultivars. (b) Negative log10 P-values for association of capsule number per axil in Sanya (Y axis) are plotted against SNP positions (X axis). The association peak is indicated. (c) The homologues of SiACS in plants and local alignment of protein sequences of the homologues around the candidate causative variant.
Figure 6Genetic analysis of flowering time and plant height for oilseed yield.
(a) Negative log10 P-values for association of flowering time in Sanya (Y axis) are plotted against SNP positions (X axis). Two loci (SiDOG1 and SiIAA14) significantly associated with oilseed yield are indicated. (b) Negative log10 P-values for association of plant height in Sanya (Y axis) are plotted against SNP positions (X axis). Two loci (SiDFL1 and SiILR1) significantly associated with oilseed yield are indicated. (c) Pyramiding of desired alleles (at SiDOG1, SiIAA14, SiDFL1 and SiILR1) in sesame varieties. The varieties accumulated the desired alleles generally shows better yield performance. The bar indicates s.e. (d) Allele frequency changes from landraces to modern cultivars for all lead SNPs underlying flowering time and plant height. The loci for plant height showed to be selected more intensively than those for flowering time.