| Literature DB >> 34424280 |
Kenta Shirasawa1, Sogo Nishio2, Shingo Terakami2, Roberto Botta3, Daniela Torello Marinoni3, Sachiko Isobe1.
Abstract
Japanese chestnut (Castanea crenata Sieb. et Zucc.), unlike other Castanea species, is resistant to most diseases and wasps. However, genomic data of Japanese chestnut that could be used to determine its biotic stress resistance mechanisms have not been reported to date. In this study, we employed long-read sequencing and genetic mapping to generate genome sequences of Japanese chestnut at the chromosome level. Long reads (47.7 Gb; 71.6× genome coverage) were assembled into 781 contigs, with a total length of 721.2 Mb and a contig N50 length of 1.6 Mb. Genome sequences were anchored to the chestnut genetic map, comprising 14,973 single nucleotide polymorphisms (SNPs) and covering 1,807.8 cM map distance, to establish a chromosome-level genome assembly (683.8 Mb), with 69,980 potential protein-encoding genes and 425.5 Mb repetitive sequences. Furthermore, comparative genome structure analysis revealed that Japanese chestnut shares conserved chromosomal segments with woody plants, but not with herbaceous plants, of rosids. Overall, the genome sequence data of Japanese chestnut generated in this study is expected to enhance not only its genetics and genomics but also the evolutionary genomics of woody rosids.Entities:
Keywords: chestnut; genetic mapping; genome synteny; rosids; whole-genome sequence
Mesh:
Year: 2021 PMID: 34424280 PMCID: PMC8435554 DOI: 10.1093/dnares/dsab016
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
Figure 1Estimation of the genome size of Japanese chestnut (Castanea crenata), based on k-mer analysis (k = 17) with the given multiplicity values.
Statistics of the contig sequences of Japanese chestnut
| CCR_r1.0 | CCR_r1.0.haplotig | |
|---|---|---|
| Total contig size (bases) | 721,168,657 | 629,166,897 |
| Number of contigs | 781 | 3,355 |
| Contig N50 length (bases) | 1,595,543 | 275,768 |
| Longest contig size (bases) | 13,981,633 | 1,688,135 |
| Gap (bases) | 0 | 0 |
| Complete BUSCOs | 96.6 | n.a. |
| Single-copy BUSCOs | 91.9 | n.a. |
| Duplicated BUSCOs | 4.7 | n.a. |
| Fragmented BUSCOs | 0.9 | n.a. |
| Missing BUSCOs | 2.5 | n.a. |
| #Genes | 69,980 | n.a. |
n.a., not analyzed.
Statistics of Japanese chestnut pseudomolecule sequences (CCR_r1.0.pmol)
| Linkage group | No. of SNPs | Genetic distance (cM) | No. of bins | No. of contigs | % | Total length (bp) | % |
|---|---|---|---|---|---|---|---|
| CCR1.0A | 2,483 | 173.7 | 284 | 59 | 7.6 | 89,228,932 | 12.4 |
| CCR1.0B | 1,514 | 172.2 | 202 | 53 | 6.8 | 53,118,407 | 7.4 |
| CCR1.0C | 1,232 | 152.6 | 204 | 44 | 5.6 | 56,655,719 | 7.9 |
| CCR1.0D | 1,083 | 137.5 | 161 | 57 | 7.3 | 52,875,950 | 7.3 |
| CCR1.0E | 1,342 | 165.2 | 228 | 61 | 7.8 | 71,160,265 | 9.9 |
| CCR1.0F | 1,339 | 132.3 | 154 | 42 | 5.4 | 61,824,531 | 8.6 |
| CCR1.0G | 924 | 137.2 | 153 | 44 | 5.6 | 44,683,286 | 6.2 |
| CCR1.0H | 816 | 138.9 | 170 | 34 | 4.4 | 53,252,922 | 7.4 |
| CCR1.0I | 1,199 | 157.9 | 168 | 42 | 5.4 | 45,426,588 | 6.3 |
| CCR1.0J | 1,036 | 148.0 | 157 | 32 | 4.1 | 40,655,448 | 5.6 |
| CCR1.0K | 1,026 | 144.9 | 196 | 62 | 7.9 | 67,434,223 | 9.4 |
| CCR1.0L | 979 | 147.4 | 140 | 45 | 5.8 | 47,439,869 | 6.6 |
| Total | 14,973 | 1,808 | 2,217 | 575 | 73.6 | 683,756,140 | 94.8 |
Repetitive sequences in the Japanese chestnut genome
| Repeat type | No. of elements | Physical length (bp) | % |
|---|---|---|---|
| SINEs | 3,222 | 349,743 | 0.0 |
| LINEs | 58,824 | 28,244,400 | 3.9 |
| LTR elements | 173,629 | 172,741,932 | 24.0 |
| DNA transposons | 206,227 | 47,812,722 | 6.6 |
| Unclassified | 672,305 | 153,965,018 | 21.3 |
| Small RNA | 1,548 | 851,212 | 0.1 |
| Satellites | 1,888 | 1,522,847 | 0.2 |
| Simple repeats | 308,508 | 12,661,150 | 1.8 |
| Low complexity | 43,949 | 2,095,795 | 0.3 |
Figure 2Number of genes in the genomes of Japanese chestnut supported by members of the Castanea. Predicted genes in the genomes of Japanese chestnut (CCR_r1.0) and Chinese chestnut (GWHANWH00000000), and the transcriptome sequences of American chestnut (AC454_v3), European chestnut (csi_csn), and Japanese chestnut (cci_ccn).
Figure 3Comparative analysis of the genome sequence and structure of Japanese chestnut and other woody members of rosids. Dots and colors indicate genome structure and sequence similarity, respectively, among Castanea mollissima, Acer yangbiense, Citrus sinensis, Fragaria vesca, Malus × domestica, Manihot esculenta, Populus trichocarpa, Prunus persica, Pyrus betulifolia, Theobroma cacao, Vitis vinifera, and Ziziphus jujuba. Chromosome numbers are indicted on upper side of x-axis and right side of y-axis and genome sizes (Mb) are shown on lower side of x-axis and left side of y-axis.