| Literature DB >> 34581030 |
Mao-Sen Ye1,2, Jin-Yan Zhang1,2, Dan-Dan Yu1,3, Min Xu1,3, Ling Xu1,3, Long-Bao Lv3, Qi-Yun Zhu4, Yu Fan1,3, Yong-Gang Yao1,2,5.
Abstract
The Chinese tree shrew (Tupaia belangeri chinensis) is emerging as an important experimental animal in multiple fields of biomedical research. Comprehensive reference genome annotation for both mRNA and long non-coding RNA (lncRNA) is crucial for developing animal models using this species. In the current study, we collected a total of 234 high-quality RNA sequencing (RNA-seq) datasets and two long-read isoform sequencing (ISO-seq) datasets and improved the annotation of our previously assembled high-quality chromosome-level tree shrew genome. We obtained a total of 3 514 newly annotated coding genes and 50 576 lncRNA genes. We also characterized the tissue-specific expression patterns and alternative splicing patterns of mRNAs and lncRNAs and mapped the orthologous relationships among 11 mammalian species using the current annotated genome. We identified 144 tree shrew-specific gene families, including interleukin 6 (IL6) and STT3 oligosaccharyltransferase complex catalytic subunit B (STT3B), which underwent significant changes in size. Comparison of the overall expression patterns in tissues and pathways across four species (human, rhesus monkey, tree shrew, and mouse) indicated that tree shrews are more similar to primates than to mice at the tissue-transcriptome level. Notably, the newly annotated purine rich element binding protein A (PURA) gene and the STT3B gene family showed dysregulation upon viral infection. The updated version of the tree shrew genome annotation (KIZ version 3: TS_3.0) is available at http://www.treeshrewdb.org and provides an essential reference for basic and biomedical studies using tree shrew animal models.Entities:
Keywords: Gene family; Genome annotation; Transcriptome; Tree shrew; Virus infection
Mesh:
Substances:
Year: 2021 PMID: 34581030 PMCID: PMC8645884 DOI: 10.24272/j.issn.2095-8137.2021.272
Source DB: PubMed Journal: Zool Res ISSN: 2095-8137
Figure 1Reference-guided transcriptome assembly of tree shrew TS_3.0 genome annotation
Figure 2Characteristics of tree shrew TS_3.0 transcripts
Comparisons of five tree shrew genome annotations
| Parameters | TupChi_1.0 (NCBI) | TupaiaBase | TS_1.0 | TS_2.0 | TS_3.0 |
| Tree shrew genome annotations TS_1.0 ( | |||||
| Total number of coding genes | 23 527 | 19 230 | 22 121 | 23 568 | 27 082 |
| Transcript per coding gene | 1.59 | 1 | 1 | 1 | 2.17 |
| Annotated coding genes | 23 537 | 12 612 | 20 225 | 20 811 | 25 127 |
| Average mRNA length | 48 104 | 33 712 | 40 114 | 41 239 | |
| Average CDS length | 1 682 | 1 419 | 1 404 | 1 527 | 1 684 |
| Average exon number | 8.34 | 7.68 | 7.54 | 8.86 | 9.32 |
| Average exon length | 229 | 185 | 186 | 172 | 181 |
| Average intron length | 6 003 | 3 411 | 4937 | 4907 | 4 863 |
| Complete BUSCOs
| 216
| 195
| 221 (86.7%) | 235
| 250
|
| Complete BUSCOs
| 7 884
| 6 080
| 7 568
| 7 519
| 8 559
|
| Total number of lncRNA genes | 3 718 | – | – | – | 56 401 |
| Transcripts per lncRNA gene | 5 179 | – | – | – | 2.05 |
| Average lncRNA length | 914 | – | – | – | 823 |
| Average exon number | 3.54 | – | – | – | 3.06 |
| Average intron length | 17 614 | – | – | – | 4 658 |
Figure 3Tissue expression and alternative splicing profiles of tree shrew TS_3.0 transcripts
Figure 4Orthologous relationships and gene family size changes among different species
Figure 5Expression similarities among different species
Figure 6Changes in expression of genes in virus-infected tree shrew tissues and cells