| Literature DB >> 23049872 |
Xuhang Wu1, Yan Fu, Deying Yang, Runhui Zhang, Wanpeng Zheng, Huaming Nie, Yue Xie, Ning Yan, Guiying Hao, Xiaobin Gu, Shuxian Wang, Xuerong Peng, Guangyou Yang.
Abstract
BACKGROUND: The larval stage of Taenia multiceps, a global cestode, encysts in the central nervous system (CNS) of sheep and other livestock. This frequently leads to their death and huge socioeconomic losses, especially in developing countries. This parasite can also cause zoonotic infections in humans, but has been largely neglected due to a lack of diagnostic techniques and studies. Recent developments in next-generation sequencing provide an opportunity to explore the transcriptome of T. multiceps. METHODOLOGY/PRINCIPALEntities:
Mesh:
Substances:
Year: 2012 PMID: 23049872 PMCID: PMC3458062 DOI: 10.1371/journal.pone.0045830
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Transcriptome summary of the adult stage of T. multiceps and detailed bioinformatics annotations.
|
| |
| Raw reads | 28,320,027 |
| Clean reads | 27,447,770, each 90 bp in length |
| GC content | 49.04% |
| Contigs (≥300) (mean length; max; N50) | 53,568 (974 bp; 11,875 bp; 1,268 bp) |
| Unigenes (≥300) (mean length; max; N50) | 31,282 (920 bp; 11,875 bp; 1,206 bp) |
|
| |
| Gene annotation against animal proteins of Nr | 17,618 (56.3%) |
| Gene annotation against Drosophila protein of Nr | 5,925 (18.9%) |
| Gene annotation against UniProtKB/Swiss-Prot | 14,350 (45.9%) |
| Gene annotation against UniProtKB/TrEMBL | 16,286 (52.1%) |
| Gene annotation against COG | 6,653 (21.3%), 24 categories |
| Gene annotation against KEGG | 11,645 (37.3%), 213 pathway |
| All unigenes matching Nr, UniProtKB, COG, KEGG | 17,768 (56.8%) |
| Gene annotation against InterPro | 25,457 (81.38%), 4,562 domains/families |
| Gene annotation against Pfam | 12,909 (41.27%), 3,396 domains/families |
| Predicted coding sequence (CDS) | 20,896 (66.8%) |
| All annotated unigenes | 26,110 (83.47%) |
| Unigenes matching all seven databases | 5,509 (17.61%) |
| GO annotation for Nr protein hits | 4,706 (15.04%), 2,360 GO terms, 48 sub-categories |
| Biological process | 2,315 (1,578 GO terms), 27 sub-categories |
| Cellular component | 3,354 (270 GO terms), 10 sub-categories |
| Molecular function | 2,809 (512 GO terms), 11 sub-categories |
Tm, T. multiceps.
Figure 1Overview of the T. multiceps transcriptome by Trinity assembling.
Figure 2KEGG categories of T. multiceps unigenes.
Overall, 11,645 unigenes were annotated against KEGG database. The GIP category represents ‘genetic information processing’ and EIP denotes ‘environmental information processing’.
The 30 most abundant InterPro domains/families in T. multiceps unigenes.
| InterPro entry | InterPro domains/families | No. of |
| IPR002110 | Ankyrin repeat | 226 |
| IPR001680 | WD40 repeat | 142 |
| IPR019781 | WD40 repeat, subgroup | 110 |
| IPR001452 | Src homology-3 domain | 103 |
| IPR003961 | Fibronectin, type III | 91 |
| IPR001650 | Helicase, C-terminal | 86 |
| IPR000504 | RNA recognition motif domain | 81 |
| IPR007087 | Zinc finger, C2H2-type | 79 |
| IPR019734 | Tetratricopeptide repeat | 79 |
| IPR019782 | WD40 repeat 2 | 75 |
| IPR015880 | Zinc finger, C2H2-like | 74 |
| IPR020683 | Ankyrin repeat-containing domain | 72 |
| IPR000980 | SH2 motif | 60 |
| IPR001715 | Calponin homology domain | 60 |
| IPR013783 | Immunoglobulin-like fold | 57 |
| IPR011009 | Protein kinase-like domain | 55 |
| IPR000242 | Protein-tyrosine phosphatase, receptor/non-receptor type | 53 |
| IPR003591 | Leucine-rich repeat, typical subtype | 53 |
| IPR001478 | PDZ/DHR/GLGF | 51 |
| IPR013032 | EGF-like region, conserved site | 49 |
| IPR015943 | WD40/YVTN repeat-like-containing domain | 48 |
| IPR002453 | Beta tubulin | 48 |
| IPR001781 | Zinc finger, LIM-type | 47 |
| IPR000719 | Protein kinase, catalytic domain | 46 |
| IPR013083 | Zinc finger, RING/FYVE/PHD-type | 45 |
| IPR002126 | Cadherin | 44 |
| IPR017868 | Filamin/ABP280 repeat-like | 43 |
| IPR000217 | Tubulin | 42 |
| IPR011011 | Zinc finger, FYVE/PHD-type | 39 |
| IPR006210 | Epidermal growth factor-like | 39 |
Figure 3Transcriptomic Gene Ontology (GO) term comparison of T. multiceps and T. pisiformis.
Pie chart illustrating similarities and differences between GO terms (according to the categories ‘cellular component’ and ‘molecular function’ and ‘biological process’) assigned to peptides from T. multiceps and T. pisiformis inferred from transcriptomic data.
Figure 4GO terms similarity distribution among T. multiceps, E. granulosus and E. multilocularis.
Bar graph plotted using a web-based tool, WEGO.
Figure 5Venn diagram showing the overlap sequences among four intestinal parasites and five Taeniidae cestodes.
(A) 145 common genes shared by T. multiceps, T. spiralis and A. caninum. (B) 109 common genes shared by T. multiceps, T. spiralis, A. caninum and A. suum. (C) 5,100 common genes among T. multiceps, T. pisiformis and T. solium. (D) 261 conserved genes between T. multiceps, T. pisiformis, T. solium, E. granulosus and E. multilocularis.