| Literature DB >> 31775624 |
Shifeng Cheng1,2, Yuan Fu1,3, Yaolei Zhang1,3, Wenfei Xian1,2, Hongli Wang1,3, Benedikt Grothe4, Xin Liu1,3, Xun Xu1,3, Achim Klug5, Elizabeth A McCullagh6,7.
Abstract
BACKGROUND: The Mongolian gerbil (Meriones unguiculatus) has historically been used as a model organism for the auditory and visual systems, stroke/ischemia, epilepsy and aging related research since 1935 when laboratory gerbils were separated from their wild counterparts. In this study we report genome sequencing, assembly, and annotation further supported by transcriptome sequencing and assembly from 27 different tissues samples.Entities:
Keywords: Gerbil genome; Meriones unguiculatus; Model organism; Transcriptome
Mesh:
Year: 2019 PMID: 31775624 PMCID: PMC6882081 DOI: 10.1186/s12864-019-6276-y
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Global statistics of the Mongolian gerbil genome
| Statistic | Value |
|---|---|
| Size (Gbp) | 2.54 |
| Scaffold number (> 2000 bp) | 31,769 |
| Scaffold N50 (Kbp) | 500.0 |
| Contig number (> 2000 bp) | 114,522 |
| Contig N50 (Kbp) | 31.4 |
General statistics of predicted protein-coding genes
| Gene set | Number | Average transcript length (bp) | Average CDS length (bp) | Average exon per gene | Average exon length (bp) | Average intron length (bp) | |
|---|---|---|---|---|---|---|---|
| De novo | SNAP | 76,858 | 42,227.63 | 742.83 | 5.52 | 134.62 | 9182.18 |
| AUGUSTUS | 24,675 | 19,838.68 | 1133.22 | 5.61 | 201.97 | 4056.79 | |
| GENESCAN | 49,390 | 24,183.55 | 1023.1 | 6.25 | 163.54 | 4406.54 | |
| Homolog | 38,750 | 31,095 | 1809 | NA | 262 | 3803 | |
| 22,728 | 26,977.32 | 1465.18 | 8.02 | 182.61 | 3632.46 | ||
| 23,686 | 23,564.96 | 1336.56 | 7.43 | 179.83 | 3455.8 | ||
| 17,131 | 31,217.18 | 1580.27 | 9.11 | 173.55 | 3656.27 | ||
| GLEAN | 19,893 | 18,835.39 | 1418.26 | 7.72 | 183.69 | 2691.49 | |
| Transcriptome | 36,019 | 33,752.29 | 1758.58 | 10.74 | 163.77 | 3285.43 | |
| Final set | 22,998 | 23,846.58 | 1535.48 | 7.76 | 197.9 | 3300.83 | |
NA Not available
Functional annotation of the final gene set
| Number | Percent (%) | |
|---|---|---|
| Total | 22,998 | 100 |
| InterPro | 18,570 | 80.7 |
| GO | 14,591 | 63.4 |
| KEGG | 17,572 | 76.4 |
| Swissprot | 20,113 | 87.5 |
| TrEMBL | 20,666 | 89.9 |
| Annotated | 20,760 | 90.3 |
| Unannotated | 2238 | 9.7 |
Summary of mobile element types
| Type | Length (Kb) | Percentage of the genome (%) |
|---|---|---|
| DNA | 20,498 | 0.81 |
| LINE | 697,185 | 27.5 |
| SINE | 94,229 | 3.7 |
| LTR | 164,504 | 6.5 |
| Other | 40,254 | 1.6 |
| Total | 1,016,671 | 40.0 |
Genome annotation comparisons with other model organisms
| Species | Common name | Protein coding genes | Assembly Size | Divergence time to gerbils, Myr | RefSeq/Genbank assembly accession | Annotation release ID | Reference |
|---|---|---|---|---|---|---|---|
| Mongolian gerbil | 22,998 | 2,537,533,819 | – | GCA_008131255.1 | – | This work | |
| Mongolian gerbil | 22,144 | 2,620,810,971 | – | GCF_002204375.1 | 100 | [ | |
| mouse | 22,077 | 2,730,855,475 | 22.5 | GCF_000001635.26 | 108 | [ | |
| rat | 23,347 | 2,870,184,193 | 22.5 | GCF_000001895.5 | 106 | [ | |
| Chinese hamster | 20,747 | 2,360,130,144 | 25 | GCF_000419365.1 | 102 | [ |
Fig. 1Gene Family Construction. The number of genes is similar between species compared (human, mouse, rat, and gerbil)
Completeness of gerbil genome and transcriptome assembly as assessed by BUSCO
| Genome | Transcriptome | |
|---|---|---|
| Complete BUSCOs | 2601 | 2508 |
| Duplicated BUSCOs | 55 | 46 |
| Fragmented BUSCOs | 170 | 293 |
| Missing BUSCOs | 252 | 222 |
| Total BUSCO groups searched | 3023 | 3023 |