Haimeng Li, Minhui Shi, Qing Wang, Tian Xia, Sunil Kumar Sahu, Yu Zhang, Jiangang Wang, Tianfeng Li, Yue Ma, Tianlu Liu, Huan Liu, Tianming Lan, Suying Bai.
Abstract
The muskrat (Ondatra zibethicus) is a semi-aquatic rodent species with ecological, economic, and medicinal importance. Here, we present an improved genome assembly, the first high-quality chromosome-level genome of the muskrat, with high completeness and contiguity, assembled using single-tube long fragment read (stLFR), BGISEQ, and Hi-C sequencing technologies. The final assembly was 2.63 Gb in size, with 27 pseudochromosomes. The scaffold N50 reached 80.25 Mb, with a Benchmarking Universal Single-Copy Ortholog (BUSCO) score of 91.3%. We identified a 66.98 Mb X chromosome and a 1.14 Mb Y-linked genome region, and these sex-linked regions were validated by resequencing 32 extra male individuals. We predicted 19,396 protein-coding genes, among which 19,395 (99.99%) were functionally annotated. The expanded gene families in the muskrat genome were enriched in several organic synthesis- and metabolism-related Gene Ontology (GO) terms, suggesting a likely genomic basis for the production and secretion of musk. This chromosome-level genome represents a valuable resource for improving our understanding of muskrat ecology and musk secretion.
Keywords: chromosome-level genome; musk; muskrat; sex chromosome
Year: 2022 PMID: 36108314 PMCID: PMC9539402 DOI: 10.1093/gbe/evac138
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 4.065
Genome Assembly and Annotation Data Related to the Muskrat Genome Assembled in This Study
| Item | Category | Number |
|---|---|---|
| Sequencing data | stLFR (Gb) | 212.90 |
| | WGS (Gb) | 130.28 |
| | Hi-C (Gb) | 542.59 |
| | Resequencing (32 individuals) (Gb) | 1379.49 |
| | RNA-seq (Gb) | 105.29 |
| Assembly (stLFR) | Estimated genome size (Gb) | 2.69 |
| | Assembled genome size (Gb) | 2.71 |
| | Karyotype | 2 |
| | Contig N50 (Kb) | 56.15 |
| | Longest scaffold (Mb) | 34.52 |
| Assembly (Hi-C) | Assembled genome size (Gb) | 2.63 |
| | Scaffold N50 (Mb) | 80.25 |
| | Longest scaffold (Mb) | 196.46 |
| Annotation | GC content (%) | 37.8 |
| | Repeat sequences (%) | 34.32 |
| | Number of protein-coding genes | 19,396 |
| | Number of functionally annotated genes | 19,395 |
| | Average gene length (Kb) | 31.92 |
| | Average exon length (bp) | 181.71 |
| | Average intron length (Kb) | 3.90 |
| | Average exons per gene | 8.78 |
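
The contig and scaffold N50 values in the table summarize assembly contiguity: N50 is the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the total assembly length. The following is a minimal sketch of that calculation, not the authors' pipeline. The function name `n50` and the toy scaffold lengths are illustrative assumptions and are not taken from the muskrat assembly.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    Sort lengths from longest to shortest and accumulate them until the
    running sum reaches at least half of the total; the length at which
    that happens is the N50.
    """
    if not lengths:
        raise ValueError("lengths must be non-empty")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Hypothetical scaffold lengths in Mb (made up for illustration only).
toy_scaffolds = [80, 70, 50, 40, 30, 20, 10]
print(n50(toy_scaffolds))
```

Because N50 depends only on the length distribution, joining contigs into longer scaffolds (as Hi-C scaffolding does) raises it without adding any sequence.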
Fig. 1. Genome landscape of the muskrat genome, comparative genomics analysis, and enrichment analysis of expanded gene families. (A) Overview of the chromosome-scale genome of the muskrat. (1) The 27 chromosomes; (2) read depth mapped to the genome; (3) GC content; (4) repeat density; and (5) gene density. (B) Identification of Y-linked regions and the X pseudochromosome. The sequencing depth of the sex-linked genome regions is nearly half that of the autosomes. (C) Divergence time estimation and the inference of expanded/contracted gene families. Green and red numbers on each node represent the number of expanded and contracted gene families, respectively. (D) Significantly enriched GO terms in the muskrat genome.
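
Panel B relies on the observation that sex-linked regions show roughly half the autosomal read depth, as expected for single-copy X and Y sequence in XY males. The sketch below illustrates one simple way such a depth ratio could be used to flag candidate regions. It is an assumption-laden illustration, not the method used in the study: the function name `flag_half_depth_windows`, the 0.35 to 0.65 ratio window, and all depth values are hypothetical.

```python
from statistics import median


def flag_half_depth_windows(window_depths, autosomal_depths, low=0.35, high=0.65):
    """Return names of windows whose depth is about half the autosomal median.

    window_depths: dict mapping a window or scaffold name to its mean depth.
    autosomal_depths: depths from regions assumed to be autosomal.
    low, high: arbitrary ratio bounds chosen for illustration only.
    """
    baseline = median(autosomal_depths)
    flagged = []
    for name, depth in window_depths.items():
        ratio = depth / baseline
        if low <= ratio <= high:
            flagged.append(name)
    return flagged


# Made-up depths for a single male sample (not data from the paper).
autosomes = [40, 42, 38, 41]
candidates = {"scaf_A": 41, "scaf_B": 20, "scaf_C": 21.5, "scaf_D": 3}
print(flag_half_depth_windows(candidates, autosomes))
```

A lower bound on the ratio is included so that near-zero depth regions, which more likely reflect poor mappability than single-copy sequence, are not flagged.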