| Literature DB >> 30689773 |
Hideki Hirakawa1, Katsuhiko Sumitomo2, Tamotsu Hisamatsu2, Soichiro Nagano1,3, Kenta Shirasawa1, Yohei Higuchi4, Makoto Kusaba5, Masaji Koshioka6, Yoshihiro Nakano2, Masafumi Yagi2, Hiroyasu Yamaguchi2, Kenji Taniguchi5, Michiharu Nakano5, Sachiko N Isobe1.
Abstract
Cultivated chrysanthemum (Chrysanthemum morifolium Ramat.) is one of the most economically important ornamental crops grown worldwide. It has a complex hexaploid genome (2n = 6x = 54) and large genome size. The diploid Chrysanthemum seticuspe is often used as a model of cultivated chrysanthemum, since the two species are closely related. To expand our knowledge of the cultivated chrysanthemum, we here performed de novo whole-genome assembly in C. seticuspe using the Illumina sequencing platform. XMRS10, a C. seticuspe accession developed by five generations of self-crossing from a self-compatible strain, AEV2, was used for genome sequencing. The 2.72 Gb of assembled sequences (CSE_r1.0), consisting of 354,212 scaffolds, covered 89.0% of the 3.06 Gb C. seticuspe genome estimated by k-mer analysis. The N50 length of scaffolds was 44,741 bp. For protein-encoding genes, 71,057 annotated genes were deduced (CSE_r1.1_cds). Next, based on the assembled genome sequences, we performed linkage map construction, gene discovery and comparative analyses for C. seticuspe and cultivated chrysanthemum. The generated C. seticuspe linkage map revealed skewed regions in segregation on the AEV2 genome. In gene discovery analysis, candidate flowering-related genes were newly found in CSE_r1.1_cds. Moreover, single nucleotide polymorphism identification and annotation on the C. × morifolium genome showed that the C. seticuspe genome was applicable to genetic analysis in cultivated chrysanthemums. The genome sequences assembled herein are expected to contribute to future chrysanthemum studies. In addition, our approach demonstrated the usefulness of short-read genome assembly and the importance of choosing an appropriate next genome sequencing technology based on the purpose of the post-genome analysis.Entities:
Keywords: zzm321990 Chrysanthemum seticuspezzm321990 ; flowering genes; genome sequence assembly; linkage map
Mesh:
Year: 2019 PMID: 30689773 PMCID: PMC6589549 DOI: 10.1093/dnares/dsy048
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
Statistics of the C. seticuspe genome assembly and CDS
| CSE_r1.0 | CSE_r1.1_maker_cds | CSE_r1.1_cds | |
|---|---|---|---|
| Genome | CDS | CDS (after filtering) | |
| Number of sequences | 354,212 | 158,375 | 71,057 |
| Total length (bp) | 2,721,839,164 | 188,275,935 | 89,043,396 |
| Average length (bp) | 7,684 | 1,189 | 1,253 |
| Maximum length (bp) | 447,582 | 16,482 | 16,482 |
| Minimum length (bp) | 500 | 300 | 300 |
| N50 length (bp) | 44,741 | 1,560 | 1,584 |
| GC% | 36.0 | 41.1 | 40.8 |
| Repeat % | 72.5 | — | — |
| Number of complete genes | — | 15,796 | 15,304 |
| Number of partial genes | — | 142,579 | 55,753 |
| TEs | — | 55,741 | — |
Figure 1Gene discovery analysis related to flowering. (A) Phylogenetic tree of PEBP family proteins from the plant species including C. seticuspe homologues. The tree was constructed using the Neighbor-Joining method in MEGA7 software. Bootstrap values for 1,000 re-samplings are shown on each branch. Black and red dots represent C. seticuspe homologues identified in previously and the present studies, respectively. (B) Amino acid alignment of FT/TFL1 family proteins at the conserved positions that determine inducer or repressor activities. (C) Effects of chilling on expression of the newly identified CEN-like gene (Cse_sc005398.1_g060.1) and a floral meristem identity gene, CsFL (a homologue of LEAFY) in the rosette-forming plants. Data are means ± SE (n = 5). The value of a non-chilled plant at the start of the experiment was set to one.
Figure 2AEV2 (left) and NIFS-0 (right) bin linkage maps. Blue, red and green bars show locus positions mapped by SNPs, SSR markers and both types of markers. Grey bars represent regions where mapped loci showed significant segregation distortion (P ≤ 0.05).
Figure 3Venn diagram showing numbers of gene clusters in C. seticuspe, sunflower (H. annuus), lettuce (L. sativa), tomato (S. lycopersicum) and A. thaliana.
Figure 4A phylogenetic tree based on the 2,280 common single-copy genes of the four Asterids species and A. thaliana.