| Literature DB >> 35958219 |
Guanglong Hu1, Lili Cheng1, Yunhe Cheng1, Weitao Mao1, Yanjie Qiao1, Yanping Lan1.
Abstract
Chinese chestnut (Castanea mollissima Blume) is one of the earliest domesticated plants of high nutritional and ecological value, yet mechanisms of C. mollissima underlying its growth and development are poorly understood. Although individual chestnut species differ greatly, the molecular basis of the formation of their characteristic traits remains unknown. Though the draft genomes of chestnut have been previously released, the pan-genome of different variety needs to be studied. We report the genome sequence of three cultivated varieties of chestnut herein, namely Hei-Shan-Zhai-7 (H7, drought-resistant variety), Yan-Hong (YH, easy-pruning variety), and Yan-Shan-Zao-Sheng (ZS, early-maturing variety), to expedite convenience and efficiency in its genetics-based breeding. We obtained three chromosome-level chestnut genome assemblies through a combination of Oxford Nanopore technology, Illumina HiSeq X, and Hi-C mapping. The final genome assemblies are 671.99 Mb (YH), 790.99 Mb (ZS), and 678.90 Mb (H7), across 12 chromosomes, with scaffold N50 sizes of 50.50 Mb (YH), 65.05 Mb (ZS), and 52.16 Mb (H7). Through the identification of homologous genes and the cluster analysis of gene families, we found that H7, YH and ZS had 159, 131, and 91 unique gene families, respectively, and there were 13,248 single-copy direct homologous genes in the three chestnut varieties. For the convenience of research, the chestnut genome database was constructed. Based on the results of gene family identification, the presence/absence variations (PAVs) information of the three sample genes was calculated, and a total of 2,364, 2,232, and 1,475 unique genes were identified in H7, YH and ZS, respectively. Our results suggest that the GBSS II-b gene family underwent expansion in chestnut (relative to nearest source species). Overall, we developed high-quality and well-annotated genome sequences of three C. mollissima varieties, which will facilitate clarifying the molecular mechanisms underlying important traits, and shortening the breeding process.Entities:
Keywords: Castanea mollissima; Nanopore sequencing; de novo assembly; genome database; pan-genome; waxy gene
Year: 2022 PMID: 35958219 PMCID: PMC9358723 DOI: 10.3389/fpls.2022.916550
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 6.627
Summary of three C. mollissima genomes assembly.
| Parameter | YH | H7 | ZS |
|---|---|---|---|
| No. of contigs | 1,514 | 1,460 | 827 |
| Contig length (bp) | 679,866,993 | 687,236,598 | 790,986,026 |
| N50 (bp) | 3,649,215 | 3,389,933 | 2,174,699 |
| N90 (bp) | 330,218 | 448,929 | 436,758 |
| Contig max (bp) | 24,666,180 | 21,536,999 | 14,385,919 |
Figure 1Hi-C interaction heat maps showing interactions among 12 chromosomes of each Chinese chestnut variety with the bin size of 10 kb resolution, using ggplot2 in the R package to evaluate the quality of the chromosomal-level genome assembly. (A): YH; (B): H7; and (C): ZS.
Assessment of the integrity of core genes in the three Chinese chestnut varieties.
| Variety | No. of 458 CEGs present in the assembly | Percentage of 458 CEGs present in the assembly | No. of 248 highly conserved CEGs present | Percentage of 248 highly conserved CEGs present |
|---|---|---|---|---|
| YH | 422 | 92.14% | 204 | 82.26% |
| ZS | 438 | 95.63% | 214 | 86.29% |
| H7 | 426 | 93.01% | 202 | 81.45% |
Assessment of BUSCO notations in the C. mollissima genomes.
| YH | ZS | H7 | |
|---|---|---|---|
| Complete BUSCOs (C) | 1,296 (90.00%) | 1,368 (95.00%) | 1,310 (90.97%) |
| Complete and single-copy BUSCOs (S) | 1,244 (86.39%) | 1,257 (87.29%) | 1,262 (87.64%) |
| Complete and duplicated BUSCOs (D) | 52 (3.61%) | 111 (7.71%) | 48 (3.33%) |
| Fragmented BUSCOs (F) | 27 (1.88%) | 25 (1.74%) | 28 (1.94%) |
| Missing BUSCOs (M) | 117 (8.12%) | 47 (3.26%) | 102 (7.08%) |
| Total Lineage BUSCOs | 1,440 | 1,440 | 1,440 |
Figure 2Analysis of collinearity between C. mollissima and Q. robur genomes using the MUMmer software. (A): YH; (B): H7; and (C): ZS.
Figure 3Analysis of evolutionary relationships among Chinese chestnut and nine other species. The phylogenetic tree was constructed by PHYML software using the single-copy protein sequences.
Figure 4Venn diagram of the number of homologous gene families in H7, YH, and ZS genomes (A) and the number of homologous genes in H7, YH, and ZS genomes (B), results from PAV analysis.
Figure 5Evolution tree and motifs analysis of Chestnut GBSS II family members and 32 orther species. Phylogenetic trees were constructed using IQtree software.
Figure 6The user interface of Chestnut Genome Database for browsing genomes, searching for homologous sequences and designing primers.