| Literature DB >> 33045048 |
Bangxing Han1, Yi Jing2, Jun Dai1, Tao Zheng2,3, Fangli Gu1, Qun Zhao1, Fucheng Zhu1, Xiangwen Song1, Hui Deng1, Peipei Wei1, Cheng Song1, Dong Liu1, Xueping Jiang1, Fang Wang1, Yanjun Chen1, Chuanbo Sun1, Houjun Yao1, Li Zhang1, Naidong Chen1, Shaotong Chen1, Xiaoli Li1, Yuan Wei4, Zhen Ouyang4, Hui Yan5, Jiangjie Lu6, Huizhong Wang6, Lanping Guo7, Lingdong Kong8, Jing Zhao9, Shaoping Li9, Lifen Luo10, Karsten Kristiansen3, Zhan Feng2, Silong Sun2, Cunwu Chen1, Zhen Yue2, Naifu Chen1.
Abstract
Dendrobium huoshanense is used to treat various diseases in traditional Chinese medicine. Recent studies have identified active components. However, the lack of genomic data limits research on the biosynthesis and application of these therapeutic ingredients. To address this issue, we generated the first chromosome-level genome assembly and annotation of D. huoshanense. We integrated PacBio sequencing data, Illumina paired-end sequencing data, and Hi-C sequencing data to assemble a 1.285 Gb genome, with contig and scaffold N50 lengths of 598 kb and 71.79 Mb, respectively. We annotated 21,070 protein-coding genes and 0.96 Gb transposable elements, constituting 74.92% of the whole assembly. In addition, we identified 252 genes responsible for polysaccharide biosynthesis by Kyoto Encyclopedia of Genes and Genomes functional annotation. Our data provide a basis for further functional studies, particularly those focused on genes related to glycan biosynthesis and metabolism, and have implications for both conservation and medicine.Entities:
Keywords: Hi-C assembly; annotation; de novo assembly; genome; orchid
Year: 2020 PMID: 33045048 PMCID: PMC7846097 DOI: 10.1093/gbe/evaa215
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Summary of the genome assembly and annotation tables
| Genome assembly | Estimated genome size | 1.29 Gb | |
| Guanine and Cytosine content | 38% | ||
| N50 length (contig) | 598 kb | ||
| Longest contig | 6.11 Mb | ||
| Total length of contigs | 1.28 Gb | ||
| N50 length (scaffold) | 71.79 Mb | ||
| Longest scaffold | 100.20 Mb | ||
| Total length of scaffolds | 1.29 Gb | ||
| Transposable elements | Annotation | Percent (%) | Total length |
| DNA | 5.56 | 71.48 Mb | |
| LINE | 12.04 | 154.75 Mb | |
| SINE | 0.01 | 131.45 kb | |
| LTR | 65.53 | 842.36 Mb | |
| Other | 0.00 | 9.35 kb | |
| Unknown | 4.38 | 56.29 Mb | |
| Total | 74.92 | 0.96 Gb | |
| Protein-coding genes | Predicted genes | 21,070 | |
| Average transcript length (bp) | 9,877.52 | ||
| Average coding sequence length (bp) | 1,202.62 | ||
| Average exon length (bp) | 270.66 | ||
| Average intron length (bp) | 2206.22 | ||
| Functionally annotated | 20,904 |
. 1.(A) Functional classification of Dendrobium huoshanense genes using the KEGG database. (B) Copia and Gypsy element distributions along the chromosomes of D. huoshanense. Copia and Gypsy densities represent the proportions of Copia and Gypsy elements within 1 Mb intervals.