| Literature DB >> 35482027 |
Jine Fu1, Lingyun Wan1, Lisha Song1, Lili He1, Ni Jiang1, Hairong Long1, Juan Huo1, Xiaowen Ji1, Fengyun Hu1, Shugen Wei1, Limei Pan1.
Abstract
The hemiparasitic Taxillus chinensis (DC.) Danser is a root-parasitizing medicinal plant with photosynthetic ability, which is lost in other parasitic plants. However, the cultivation and medical application of the species are limited by the recalcitrant seeds of the species, and even though the molecular mechanisms underlying this recalcitrance have been investigated using transcriptomic and proteomic methods, genome resources for T. chinensis have yet to be reported. Accordingly, the aim of the present study was to use nanopore, short-read, and high-throughput chromosome conformation capture sequencing to construct a chromosome-level assembly of the T. chinensis genome. The final genome assembly was 521.90 Mb in length, and 496.43 Mb (95.12%) could be grouped into nine chromosomes with contig and scaffold N50 values of 3.80 and 56.90 Mb, respectively. In addition, a total of 33,894 protein-coding genes were predicted, and gene family clustering identified 11 photosystem-related gene families, thereby indicating photosynthetic ability, which is a characteristic of hemiparasitic plants. This chromosome-level genome assembly of T. chinensis provides a valuable genomic resource for elucidating the genetic basis underlying the recalcitrant characteristics of T. chinensis seeds and the evolution of photosynthesis loss in parasitic plants.Entities:
Keywords: zzm321990 Taxillus chinensiszzm321990 ; Hi-C proximity mapping; chromosomal assembly; nanopore sequencing
Mesh:
Year: 2022 PMID: 35482027 PMCID: PMC9113316 DOI: 10.1093/gbe/evac060
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 4.065
Fig. 1.Genome assembly of Taxillus chinensis. (A) Taxillus chinensis. (B) Workflow used to generate the chromosome-level genome assembly. (C) Genome-wide Hi-C heatmap of chromatin interaction counts in 100 kb bins. Only sequences anchored on chromosomes are shown. The abbreviations Chr01–09 represent the nine chromosomes, and the color bar represents the log2 value of interaction counts.
Genome Sequencing, Assembly, and Annotation Statistics
| Statistics | |
|---|---|
| Genome assembly and chromosomes construction | |
| Contig N50 size (bp) | 3,797,897 |
| Contig N90 size (bp) | 554,497 |
| Maximum contig size (bp) | 13,585,695 |
| Scaffold number | 434 |
| Scaffold N50 (bp) | 56,927,202 |
| Scaffold N90 (bp) | 47,601,956 |
| Maximum scaffold size (bp) | 59,987,258 |
| Genome size (bp) | 521,908,327 |
| Number of chromosomes | 9 |
| Total length of chromosomes (bp) | 496,429,085 |
| GC content (%) | 40.17 |
| Genome quality evaluation | |
| Proportion of complete BUSCO orthologs (%) | 95 |
| Proportion of complete and single-copy BUSCO orthologs (%) | 92.4 |
| Proportion of complete and duplicated BUSCO orthologs (%) | 2.6 |
| Proportion of fragmented BUSCO orthologs (%) | 1.5 |
| Proportion of missing BUSCO orthologs (%) | 3.5 |
| Gene annotation | |
| Number of GO annotation | 9,362 |
| Number of KEGG annotation | 19,863 |
| Number of KOG annotation | 20,225 |
| Number of TrEMBL annotation | 28,335 |
| Number of Interpro annotation | 26,400 |
| Number of SwissProt annotation | 21,376 |
| Number of NR annotation | 27,967 |
| Number of all annotated | 33,894 |