| Literature DB >> 34849775 |
Natascha van Lieshout1, Martijn van Kaauwen1, Linda Kodde1, Paul Arens1, Marinus J M Smulders1, Richard G F Visser1, Richard Finkers1.
Abstract
Chrysanthemum is among the top 10 cut, potted, and perennial garden flowers in the world. Despite this, to date, only the genomes of two wild diploid chrysanthemums have been sequenced and assembled. Here, we present the most complete and contiguous chrysanthemum de novo assembly published so far, as well as a corresponding ab initio annotation. The cultivated hexaploid varieties are thought to originate from a hybrid of wild chrysanthemums, among which the diploid Chrysanthemum makinoi has been mentioned. Using a combination of Oxford Nanopore long reads, Pacific Biosciences long reads, Illumina short reads, Dovetail sequences, and a genetic map, we assembled 3.1 Gb of its sequence into nine pseudochromosomes, with an N50 of 330 Mb and a BUSCO complete score of 92.1%. Our ab initio annotation pipeline predicted 95,074 genes and marked 80.0% of the genome as repetitive. This genome assembly of C. makinoi provides an important step forward in understanding the chrysanthemum genome, evolution, and history.Entities:
Keywords: zzm321990 Chrysanthemum makinoizzm321990 ; annotation; chrysanthemum; genome assembly
Mesh:
Year: 2022 PMID: 34849775 PMCID: PMC8727959 DOI: 10.1093/g3journal/jkab358
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.542
Sequencing details of cDNA from different plant organs in C. makinoi
| Source | Mean read length (b) | Mean read quality | Number of reads | Read length N50 (b) | Total bases |
|---|---|---|---|---|---|
| Leaf (short d) | 977.0 | 9.0 | 7,587,930 | 1,189 | 7,413,732,394 |
| Leaf (long d) | 973.0 | 9.4 | 6,780,899 | 1,209 | 6,597,664,366 |
| Calyx | 1,040.2 | 10.0 | 4,833,397 | 1,247 | 5,027,548,849 |
| Flower disk florets | 1,012.1 | 9.5 | 7,072,131 | 1,331 | 7,157,901,095 |
| Flower buds | 993.6 | 9.1 | 7,917,800 | 1,256 | 7,867,489,497 |
| Flower ray florets | 1,002.8 | 9.6 | 7,000,372 | 1,250 | 7,020,311,566 |
| Meristem | 1,048.0 | 10.2 | 5,075,164 | 1,263 | 5,318,808,662 |
| Stem (short d) | 997.1 | 9.5 | 7,936,023 | 1,232 | 7,912,613,286 |
| Root | 1,060 | 8.4 | 5,272,384 | 1,389 | 5,591,241,404 |
Figure 1A k-mer (K = 31) distribution based on the illumina reads, modeled and visualized using genomescope.
C. makinoi de novo genome assembly metrics estimated using QUAST
| Assembly |
|
|---|---|
| # Ns per 100 kbp | 89.51 |
| # contigs/scaffolds | 10 |
| Total length | 3,113,668,257 |
| N50 | 330,012,911 |
| N75 | 317,988,395 |
| L50 | 5 |
| L75 | 7 |
| Largest contig/scaffold | 376,468,909 |
| GC content (%) | 36.01 |
Output from the Busco analysis pipeline to assess gene complement completeness
| BUSCO term | V1.0 |
|---|---|
| Complete (%) | 92.1 |
| Complete and single copy (%) | 83.8 |
| Complete and duplicated (%) | 8.3 |
| Fragmented (%) | 1.8 |
| Missing (%) | 6.1 |
| Total | 1375 |
Figure 2Circos plot showing the pseudomolecules (outer ring), gene density (red ring), repetitive element content (orange ring), and gc content (yellow ring) with a bin size of 500 kb.