| Literature DB >> 28588571 |
Shuai Liu1, Xiaosu Zhou1, Lili Hao2, Xianyu Piao1, Nan Hou1, Qijun Chen1,3.
Abstract
Alternative splicing (AS), as one of the most important topics in the post-genomic era, has been extensively studied in numerous organisms. However, little is known about the prevalence and characteristics of AS in Echinococcus species, which can cause significant health problems to humans and domestic animals. Based on high-throughput RNA-sequencing data, we performed a genome-wide survey of AS in two major pathogens of echinococcosis-Echinococcus granulosus and Echinococcus multilocularis. Our study revealed that the prevalence and characteristics of AS in protoscoleces of the two parasites were generally consistent with each other. A total of 6,826 AS events from 3,774 E. granulosus genes and 6,644 AS events from 3,611 E. multilocularis genes were identified in protoscolex transcriptomes, indicating that 33-36% of genes were subject to AS in the two parasites. Strikingly, intron retention instead of exon skipping was the predominant type of AS in Echinococcus species. Moreover, analysis of the Kyoto Encyclopedia of Genes and Genomes pathway indicated that genes that underwent AS events were significantly enriched in multiple pathways mainly related to metabolism (e.g., purine, fatty acid, galactose, and glycerolipid metabolism), signal transduction (e.g., Jak-STAT, VEGF, Notch, and GnRH signaling pathways), and genetic information processing (e.g., RNA transport and mRNA surveillance pathways). The landscape of AS obtained in this study will not only facilitate future investigations on transcriptome complexity and AS regulation during the life cycle of Echinococcus species, but also provide an invaluable resource for future functional and evolutionary studies of AS in platyhelminth parasites.Entities:
Keywords: Echinococcus granulosus; Echinococcus multilocularis; alternative splicing; nextgeneration sequencing; transcriptome
Year: 2017 PMID: 28588571 PMCID: PMC5440512 DOI: 10.3389/fmicb.2017.00929
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
Summary data of the transcriptome analysis.
| Class | ||
|---|---|---|
| Number of single reads | 43,509,848 | 44,379,588 |
| Number of reads and percentages mapped to genome | 34,717,856 (79.79%)a | 38,882,179 (87.61%) |
| 31,156,736 (71.61%)b | ||
| Assembled genes (loci) | 11,330 | 10,101 |
| Known | 6,815 | 7,051 |
| Novel | 4,515 | 3,050 |
| Assembled transcripts | 24,550 | 23,771 |
| Known | 7,925 | 8,432 |
| Novel | 16,625 | 15,339 |
| Mean length of transcripts | 2,016 | 2,087 |
Pathway enrichment analysis of AS genes in E. granulosus and E. multilocularis.
| Pathway ID | Description | ||||||
|---|---|---|---|---|---|---|---|
| All genes | AS genes | All genes | AS genes | ||||
| ko04111 | Cell cycle – yeast | 45 | 22 | 2.16E - 02 | 50 | 31 | 1.67E - 04 |
| ko04113 | Meiosis – yeast | 40 | 21 | 9.36E - 03 | 46 | 25 | 8.56E - 03 |
| ko04070 | Phosphatidylinositol signaling system | 25 | 16 | 1.62E - 03 | 26 | 22 | 4.90E - 07 |
| ko04630 | Jak-STAT signaling pathway | 12 | 9 | 3.82E - 03 | 11 | 8 | 1.50E - 02 |
| ko04370 | VEGF signaling pathway | 22 | 13 | 1.15E - 02 | 23 | 15 | 4.24E - 03 |
| ko04330 | Notch signaling pathway | 15 | 9 | 3.06E - 02 | 13 | 10 | 3.26E - 03 |
| ko04090 | Cellular antigens | 29 | 15 | 3.08E - 02 | 28 | 18 | 2.22E - 03 |
| ko03022 | Basal transcription factors | 26 | 14 | 2.45E - 02 | 25 | 17 | 1.17E - 03 |
| ko03015 | mRNA surveillance pathway | 55 | 30 | 9.30E - 04 | 53 | 31 | 7.20E - 04 |
| ko03013 | RNA transport | 92 | 48 | 1.38E - 04 | 87 | 47 | 4.58E - 04 |
| ko00510 | 30 | 20 | 1.88E - 04 | 30 | 23 | 6.78E - 06 | |
| ko00562 | Inositol phosphate metabolism | 20 | 13 | 3.67E - 03 | 20 | 18 | 8.70E - 07 |
| ko00513 | Various types of | 26 | 18 | 1.92E - 04 | 23 | 17 | 2.47E - 04 |
| ko00071 | Fatty acid metabolism | 10 | 7 | 1.95E - 02 | 10 | 7 | 3.09E - 02 |
| ko00280 | Leucine and isoleucine degradation | 7 | 6 | 6.82E - 03 | 8 | 6 | 2.96E - 02 |
| ko00052 | Galactose metabolism | 19 | 11 | 2.38E - 02 | 19 | 11 | 4.33E - 02 |
| ko00640 | Propanoate metabolism | 12 | 9 | 3.82E - 03 | 13 | 9 | 1.56E - 02 |
| ko00563 | GPI-anchor biosynthesis | 17 | 11 | 7.92E - 03 | 16 | 10 | 2.85E - 02 |
| ko00061 | Fatty acid biosynthesis | 3 | 3 | 3.69E - 02 | 3 | 3 | 4.70E - 02 |
| ko00561 | Glycerolipid metabolism | 18 | 13 | 8.38E - 04 | 20 | 13 | 8.04E - 03 |
| ko00230 | Purine metabolism | 84 | 41 | 2.26E - 03 | 86 | 41 | 1.77E - 02 |
| ko04380 | Osteoclast differentiation | 17 | 11 | 7.92E - 03 | 16 | 10 | 2.85E - 02 |
| ko04664 | Fc epsilon RI signaling pathway | 17 | 10 | 2.71E - 02 | 16 | 11 | 8.04E - 03 |
| ko04912 | GnRH signaling pathway | 26 | 13 | 4.78E - 02 | 31 | 17 | 2.52E - 02 |
| ko04660 | T cell receptor signaling pathway | 31 | 17 | 1.09E - 02 | 31 | 18 | 1.03E - 02 |
| ko04620 | Toll-like receptor signaling pathway | 14 | 10 | 4.00E - 03 | 14 | 10 | 7.72E - 03 |
Representative homologous gene pairs with AS events from groups I, II, and III.
| AS | ||||||||||||
| Group | EGa gene | EM gene | Annotation | AAS | ADS | AFE | ALE | MXE | IR | ES | ||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| I | EG.664 | EM.6854 | Serine/threonine protein kinase | +/•b | ||||||||
| EG.5998 | EM.9833 | Similar to β-tubulin | + | +/• | +/• | +/• | • | |||||
| EG.7097 | EM.1247 | Dynamin | + | +/• | +/• | +/• | ||||||
| EG.3316 | EM.4881 | Vacuolar proton ATPases | +/• | + | + | +/• | • | |||||
| EG.1088 | EM.7236 | Pre-mRNA-splicing factor CLF1 | • | + | ||||||||
| EG.665 | EM.6855 | Phospholipase C | + | + | +/• | + | +/• | |||||
| EG.5106 | EM.4247 | Multiple ankyrin repeats single KH domain protein | +/• | • | +/• | |||||||
| EG.5383 | EM.2230 | + | • | +/• | ||||||||
| EG.3284 | EM.4849 | Developmentally regulated GTP-binding protein 2 | + | +/• | ||||||||
| EG.9830 | EM.6246 | Eukaryotic translation initiation factor 2c | + | +/• | ||||||||
| II | EG.9768 | EM.6188 | Peroxidasin | + | • | |||||||
| EG.8983 | EM.6048 | Filamin | +/• | |||||||||
| EG.7075 | EM.1228 | Adducin-related protein | • | +/• | ||||||||
| EG.4660 | EM.3827 | Serine/arginine rich splicing factor | +/• | |||||||||
| EG.5720 | EM.2557 | SNARE protein Sec22 | +/• | |||||||||
| EG.2713 | EM.8982 | Voltage-dependent calcium channel | +/• | +/• | • | +/• | +/• | |||||
| EG.215 | EM.6429 | Hypothetical protein | +/• | |||||||||
| EG.3215 | EM.9455 | Aspartyl aminopeptidase | • | + | +/• | +/• | ||||||
| EG.8105 | EM.327 | Venom allergen-like protein | + | +/• | ||||||||
| EG.4029 | EM.5528 | Hypothetical protein | +/• | +/• | • | |||||||
| III | EG.583 | EM.6781 | Heat shock protein 70 | +/• | ||||||||
| EG.10583 | EM.149 | Acyl-CoA dehydrogenase 9 | +/• | +/• | ||||||||
| EG.3709 | EM.5622 | Hypothetical protein | +/• | |||||||||
| EG.5622 | EM2474 | Hypothetical protein | +/• | +/• | ||||||||
| EG.1536 | EM7652 | Hypothetical protein | + | +/• | ||||||||
| EG.8356 | EM581 | Tegument antigen | • | +/• | ||||||||
| EG.7014 | EM.1953 | PDZ domain-containing protein GIPC3-like | • | + | +/• | |||||||
| EG.1931 | EM.8226 | Hypothetical protein | +/• | +/• | + | • | +/• | +/• | ||||
| EG.2541 | EM.8817 | Transmembrane protein 144 | • | +/• | + | |||||||
| EG.2553 | EM.8828 | Tetraspanin | + | • | ||||||||