| Literature DB >> 34864990 |
Baoqing Ren1, Dafu Ru2, Luqin Chen1, Na Duan3, Yong Li3, Jianwei Shi3, Jianting Cao1, Bingbing Liu3.
Abstract
Elaeagnus mollis Diels (Elaeagnaceae) is a species of shrubs and/or dwarf trees that produces highly nutritious nuts with abundant oil and pharmaceutical properties. It is endemic to China but endangered. Therefore, to facilitate the protection of its genetic resources and the development of its commercially attractive traits we generated a high-quality genome of E. mollis. The contig version of the genome (630.96 Mb long) was assembled into 14 chromosomes using Hi-C data, with contig and scaffold N50 values of 18.40 and 38.86 Mb, respectively. Further analyses identified 397.49 Mb (63.0%) of repetitive sequences and 27,130 protein-coding genes, of which 26,725 (98.5%) were functionally annotated. Benchmarking Universal Single-Copy Ortholog assessment indicated that 98.0% of highly conserved plant genes are completely present in the genome. This is the first reference genome for any species of Elaeagnaceae and should greatly facilitate future efforts to conserve, utilize, and elucidate the evolution of this endangered endemic species.Entities:
Keywords: zzm321990 Elaeagnus molliszzm321990 ; endangered; oil tree
Mesh:
Year: 2021 PMID: 34864990 PMCID: PMC8691057 DOI: 10.1093/gbe/evab266
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Statistics of the Elaeagnus mollis Genome and Gene Model Predictions
| Parameter | Value | |
|---|---|---|
| Contig assembly | ||
| Total number of contigs | 131 | |
| Assembly size (bp) | 630,949,870 | |
| N50 (bp) | 18,396,748 | |
| N90 (bp) | 5,302,868 | |
| Largest contig (bp) | 45,531,911 | |
| Scaffold assembly | ||
| Total number of scaffolds | 50 | |
| Assembly size (bp) | 630,958,270 | |
| N50 (bp) | 38,861,146 | |
| N90 (bp) | 31,177,146 | |
| Largest scaffold (bp) | 115,470,569 | |
| Annotation | ||
| GC content | 31.88% | |
| Repeat density | 63% | |
| Number of protein-coding genes | 27130 | |
| Average length of protein-coding genes (bp) | 4381.18 | |
| Complete BUSCOs | 1581 (97.96%) | |
| Fragmented BUSCOs | 10 (0.62%) | |
| Missing BUSCOs | 23 (1.43%) | |
—The genome features of Elaeagnus mollis. (A) Circos plot showing features of the E. mollis genome. The concentric circles from the inner to outer show the GC density, gene density, repetitive sequence density, and collinearity; (B) Hi-C interaction matrices of the ordered scaffolds along the 14 pseudochromosomes.