| Literature DB >> 29330436 |
Xiaoning Sun1, Ruibo Cai1, Xuelin Jin2, Aaron B A Shafer3, Xiaolong Hu1, Shuang Yang1, Yimeng Li1, Lei Qi1, Shuqiang Liu4, Defu Hu5.
Abstract
Forest musk deer (Moschus berezovskii; FMD) are both economically valuable and highly endangered. A problem for FMD captive breeding programs has been the susceptibility of FMD to abscesses. To investigate the mechanisms of abscess development in FMD, the blood transcriptomes of three purulent and three healthy individuals were generated. A total of ~39.68 Gb bases were generated using Illumina HiSeq 4000 sequencing technology and 77,752 unigenes were identified after assembling. All the unigenes were annotated, with 63,531 (81.71%) mapping to at least one database. Based on these functional annotations, 45,798 coding sequences (CDS) were detected, along with 12,697 simple sequence repeats (SSRs) and 65,536 single nucleotide polymorphisms (SNPs). A total of 113 unigenes were found to be differentially expressed between healthy and purulent individuals. Functional annotation indicated that most of these differentially expressed genes were involved in the regulation of immune system processes, particularly those associated with parasitic and bacterial infection pathways.Entities:
Mesh:
Year: 2018 PMID: 29330436 PMCID: PMC5766596 DOI: 10.1038/s41598-017-18534-0
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Unigenes of the forest musk deer (Moschus berezovskii) transcriptome generated in this study.
| Sample | Total unigenes | Total length of unigenes (bp) | Mean length of unigenes (bp) | N50 (bp) | N70 (bp) | N90 (bp) | GC (%) |
|---|---|---|---|---|---|---|---|
| Purulent 1 | 44097 | 36962323 | 838 | 1549 | 792 | 306 | 51.45 |
| Purulent 2 | 46283 | 42021313 | 907 | 1736 | 906 | 328 | 51.37 |
| Purulent 3 | 53331 | 40364593 | 756 | 1357 | 676 | 282 | 51.93 |
| Healthy 1 | 43475 | 33762201 | 776 | 1384 | 698 | 290 | 52.37 |
| Healthy 2 | 50759 | 37704954 | 742 | 1312 | 652 | 279 | 52.59 |
| Healthy 3 | 64862 | 62759668 | 967 | 2052 | 1041 | 327 | 51.64 |
| All-Unigene | 77752 | 78169844 | 1005 | 2206 | 1173 | 328 | 51.91 |
N50/N70/N90: a weighted median statistic where 50%/70%/90% of the total sequence length is contained in unigenes greater than or equal to this value; GC (%): the percentage of G and C bases in all transcripts.
Percentage of genes that were successfully annotated in each functional database searched.
| Item | Number | Percentage |
|---|---|---|
| Total unigenes | 77,752 | 100% |
| NR | 40,344 | 51.89% |
| NT | 58,736 | 75.54% |
| SwissProt | 34,717 | 44.65% |
| KEGG | 29,087 | 37.41% |
| COG | 17,796 | 22.89% |
| InterPro | 29,941 | 38.51% |
| GO | 20,510 | 26.38% |
| Overall | 63,531 | 81.71% |
‘Overall’ indicates the number of unigenes that were annotated by at least one functional database.
Figure 1Venn diagram of unigene mapping results. Areas of overlap show the number of unigenes successfully mapped to all overlapping databases. Results are shown only for Non-Redundant protein database (NR), Cluster of Orthologous Groups of proteins(COG), Kyoto encyclopedia of genes and genomes (KEGG), Swiss-prot, and InterPro databases.
Figure 2Distribution of potential SSRs by motif type. The 16 primary SSR motifs are shown; the remaining 88 SSR motifs identified are grouped in the “Others” category. Green, light blue, pink and dark blue represent mononucleotide repeats, dinucleotide repeats, trinucleotide repeats and other types of repeats, respectively.
Figure 3Summary of genes differentially expressed in the healthy and purulent individuals. Each pair of individuals compared is plotted on the x-axis, and the number of differently expressed genes (DEGs) is plotted on the y-axis. The red bars represent the upregulated genes, while the blue bars represent the downregulated DEGs.
Figure 4Functional distribution of differently expressed genes (DEGs). (a) Functional distribution of DEGs according to the gene ontology (GO) database. The y-axis shows the GO functional categories, while the number of genes in each category is plotted on the x-axis. (b) Functional distribution of DEGs according to the KEGG pathway database. The y-axis shows the KEGG functional categories, while the number of genes in each category is plotted on the x-axis.
Genes related to immune system function that were differentially expressed in purulent individuals, as compared to healthy individuals.
| Gene | Functional annotation | Regulation in purulent samples |
|---|---|---|
| Unigene53584_All | immunoglobulin heavy chain variable region precursor, mRNA | Up |
| CL119.Contig12_All | immunoglobulin light chain variable region (IGLV) mRNA | Up |
| CL984.Contig6_All | Ig kappa chain, mRNA | Up |
| Unigene53558_All | immunoglobulin mu heavy chain variable region mRNA | Up |
| CL465.Contig8_All | mRNA for immunoglobulin heavy chain variable region (ighv gene) | Up |
| CL6305.Contig5_All | immunoglobulin lambda-like polypeptide 5-like (LOC102250240), mRNA | Up |
| Unigene4040_All | interleukin 2 receptor, beta (IL2RB), mRNA | Up |
| Unigene26557_All | immunoglobulin mu heavy chain variable region mRNA | Up |
| Unigene25688_All | immunoglobulin V lambda chain (V lambda 12.2) gene | Up |
| Unigene25744_All | immunoglobulin V lambda chain 5.3.6 gene | Up |
| CL6457.Contig5_All | immunoglobulin lambda-like polypeptide 1-like | Up |
| CL119.Contig13_All | immunoglobulin V lambda chain (V lambda 12.2) gene | Up |
| Unigene55659_All | immunoglobulin lambda-like polypeptide 1-like | Up |
| CL1983.Contig6_All | CD6 molecule (CD6), transcript variant X 4, mRNA | Down |
| CL4014.Contig3_All | interferon regulatory factor 2 (IRF2), mRNA | Down |
| CL4718.Contig2_All | B-cell translocation gene 1, anti-proliferative, mRNA | Down |