| Literature DB >> 24344172 |
Masafumi Yagi1, Shunichi Kosugi2, Hideki Hirakawa2, Akemi Ohmiya1, Koji Tanase1, Taro Harada1, Kyutaro Kishimoto1, Masayoshi Nakayama1, Kazuo Ichimura1, Takashi Onozaki1, Hiroyasu Yamaguchi1, Nobuhiro Sasaki3, Taira Miyahara3, Yuzo Nishizaki3, Yoshihiro Ozeki3, Noriko Nakamura4, Takamasa Suzuki5, Yoshikazu Tanaka4, Shusei Sato2, Kenta Shirasawa2, Sachiko Isobe2, Yoshinori Miyamura2, Akiko Watanabe2, Shinobu Nakayama2, Yoshie Kishida2, Mitsuyo Kohara2, Satoshi Tabata6.
Abstract
The whole-genome sequence of carnation (Dianthus caryophyllus L.) cv. 'Francesco' was determined using a combination of different new-generation multiplex sequencing platforms. The total length of the non-redundant sequences was 568,887,315 bp, consisting of 45,088 scaffolds, which covered 91% of the 622 Mb carnation genome estimated by k-mer analysis. The N50 values of contigs and scaffolds were 16,644 bp and 60,737 bp, respectively, and the longest scaffold was 1,287,144 bp. The average GC content of the contig sequences was 36%. A total of 1050, 13, 92 and 143 genes for tRNAs, rRNAs, snoRNA and miRNA, respectively, were identified in the assembled genomic sequences. For protein-encoding genes, 43 266 complete and partial gene structures excluding those in transposable elements were deduced. Gene coverage was ∼ 98%, as deduced from the coverage of the core eukaryotic genes. Intensive characterization of the assigned carnation genes and comparison with those of other plant species revealed characteristic features of the carnation genome. The results of this study will serve as a valuable resource for fundamental and applied research of carnation, especially for breeding new carnation varieties. Further information on the genomic sequences is available at http://carnation.kazusa.or.jp.Entities:
Keywords: Dianthus caryophyllus L.; carnation; gene prediction; genome sequencing
Mesh:
Substances:
Year: 2013 PMID: 24344172 PMCID: PMC4060945 DOI: 10.1093/dnares/dst053
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
Statistics of D. caryophyllus cv. ‘Francesco’ genome assemblies
| Limited lengtha | Contigsb | Scaffolds | |
|---|---|---|---|
| Total number | ≥100 bp | 88 654 | 45 088 |
| Total length (kb) | 50 0218 | 568 887 | |
| N50 (bp) | 16 644 | 60 737 | |
| N90 (bp) | 2036 | 6700 | |
| Total number | ≥1 kb | 64 159 | 30 716 |
| Total length (Mb) | 483 703 | 558 518 | |
| N50 (bp) | 17 553 | 62 616 | |
| N90 (bp) | 2678 | 8036 | |
| Maximum length (kb) | 363 | 1287 | |
| GC content (%) | 36.3 |
aContigs or scaffolds shorter than the indicated length were excluded from the statistics.
bSubcontigs in scaffolds, which were split by gap regions with length >4 bp.
Repetitive sequences identified in the carnation genome
| Repeat class | Number | Total length (kb) | Genome content (%)a | Repeat composition (%) |
|---|---|---|---|---|
| Class 1 TE | ||||
| LTR | 86 111 | 36 558 | 7.3 | 22.1 |
| SINE | 441 | 35.7 | 0.007 | 0.02 |
| LINE | 7362 | 3447 | 0.69 | 2.1 |
| Other | 10 | 1.7 | 0.0003 | 0 |
| Class 2 TE | ||||
| DNA transposon | 16 729 | 3914 | 0.78 | 2.4 |
| Rolling circle | 1425 | 535 | 0.11 | 0.33 |
| Other type TE | 48 | 21.1 | 0.004 | 0.01 |
| Simple repeat | 295 621 | 16 847 | 3.4 | 10.3 |
| Low complexity | 58 600 | 3528 | 0.71 | 2.2 |
| Other repeat | 411 | 65 | 0.004 | 0.01 |
| Unclassified (novel) repeat | 331 831 | 100 097 | 20 | 60.6 |
| Total | 794 297 | 165 027 | 33 | 100 |
LTR, long terminal repeat; SINE, short interspersed nuclear element; LINE, long interspersed nuclear element.
aPercentage of total length of repeats in the continuous sequences in the assembled genome.
Figure 1.Gene assignment to KOG functional categories.