| Literature DB >> 22911833 |
Yan Fu1, Jingchao Lan, Zhihe Zhang, Rong Hou, Xuhang Wu, Deying Yang, Runhui Zhang, Wanpeng Zheng, Huaming Nie, Yue Xie, Ning Yan, Zhi Yang, Chengdong Wang, Li Luo, Li Liu, Xiaobin Gu, Shuxian Wang, Xuerong Peng, Guangyou Yang.
Abstract
BACKGROUND: The heartworm Dirofilaria immitis is the causal agent of cardiopulmonary dirofilariosis in dogs and cats, and also infects a wide range of wild mammals as well as humans. One bottleneck for the design of fundamentally new intervention and management strategies against D. immitis may be the currently limited knowledge of fundamental molecular aspects of D. immitis. METHODOLOGY/PRINCIPALEntities:
Mesh:
Substances:
Year: 2012 PMID: 22911833 PMCID: PMC3402454 DOI: 10.1371/journal.pone.0041639
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Summary of transcriptome data for adult Dirofilaria immitis prior to and following assembly as well as detail bioinformatics annotation and analyses.
| Raw reads (paired-end) | 22,609,857 |
| clean reads (paired-end) | 22,250,630 |
| GC content percentage | 38.17% |
| Contigs (≥300 bp) (average length; N50 ) | 25,824 (1,388; 2,021) |
| Singletons (≥300 bp) (average length; N50 ) | 17,832 (1,145; 1,674) |
| Clusters (≥300 bp) (average length; N50 ) | 2,978 (2,021; 2,582) |
| Total unigenes (average length; N50; min-max length) | 20,810 (1,270; 1,852; 300–16,275) |
| mean RPKM value of unigenes (min-mix RPKM value) | 32,93 (0–6,617.22) |
| Protein coding sequence (CDS) | 15,698 |
| Gene annotation against Animals protein of Nr (%) | 15,602 (75.0) |
| Gene annotation against UniProtKB/Swiss-Prot (%) | 11,481 (55.2) |
| Gene annotation against UniProtKB/TrEMBL (%) | 15,659 (75.2) |
| Gene annotation against Nemabse4 (%) | 14,093 (67.7) |
| Gene annotation against KEGG (%) | 9,139 (43.9); 3,704 KO terms; 216 biological pathways |
| Gene annotation against InterPro (%) | 16,729 (80.3); 4,229 domains/families |
| Gene annotation against Pfam (%) | 10,839 (52.1); 3,247 domains/families |
| Gene annotation against COG (%) | 5,604 (26.9); 1,322 COG functional terms |
| All annotated unigenes (%) | 17,719 (85.1) |
| GO Ontology (%) | 2,930 (14.1); 1,637 GO terms |
| Biological process category | 2,334; 1,045 GO terms |
| Cellular component category | 1,754; 373 GO terms |
| Molecular function category | 1,940; 219 GO terms |
Figure 1Venn diagram illustrating distribution of high-score matches among eight public databases.
(a) By integrating sequence-similarity search results from the animal protein dataset of the Nr, UniProtKB/Swiss-Prot, UniProtKB/TrEMBL, KEGG, and NEMBASE4 databases, a total of 16,005 unigenes were returned with unique best BLASTx hits (e-value <0.00001). (b) By integrating sequence-similarity search results from InterPro, Pfam and COG databases, a total of 16,845 unigenes obtained unique domain-based annotations (e-value <0.00001). (c) Consolidating from both unique sequence-based annotations and unique domain-based annotations produces 17,719 unique annotated unigenes. The ellipses ‘a’ and ‘b’ imply the two subsets of D. immitis unigenes (16,005 counts in Figure 1a; 16,845 counts in Figure 1b).
The 30 most represented (InterPro) protein domains/families in adult Dirofilaria immitis unigenes.
| InterPro description | InterPro code | No. of unigenes |
| Protein kinase, catalytic domain | IPR000719 | 466 |
| Serine/threonine-protein kinase-like domain | IPR017442 | 303 |
| Protein kinase, ATP binding site | IPR017441 | 283 |
| Tyrosine-protein kinase, catalytic domain | IPR020635 | 235 |
| WD40/YVTN repeat-like-containing domain | IPR015943 | 224 |
| Nucleotide-binding, alpha-beta plait | IPR012677 | 210 |
| Zinc finger, RING/FYVE/PHD-type | IPR013083 | 196 |
| Zinc finger, C2H2-type | IPR007087 | 191 |
| WD40 repeat | IPR001680 | 190 |
| Immunoglobulin-like fold | IPR013783 | 185 |
| WD40 repeat, subgroup | IPR019781 | 175 |
| RNA recognition motif domain | IPR000504 | 172 |
| Zinc finger, C2H2-like | IPR015880 | 171 |
| WD40-repeat-containing domain | IPR017986 | 134 |
| Armadillo-like helical | IPR011989 | 128 |
| Zinc finger, C2H2-type/integrase, DNA-binding | IPR013087 | 127 |
| Serine/threonine-protein kinase domain | IPR002290 | 109 |
| ATPase, AAA+ type, core | IPR003593 | 105 |
| Tetratricopeptide-like helical | IPR011990 | 105 |
| Nuclear hormone receptor, ligand-binding | IPR008946 | 101 |
| Helicase, superfamily 1/2, ATP-binding domain | IPR014021 | 98 |
| Zinc finger, RING-type | IPR001841 | 97 |
| Pleckstrin homology-type | IPR011993 | 91 |
| GPCR, rhodopsin-like superfamily | IPR017452 | 91 |
| Nuclear hormone receptor, ligand-binding, core | IPR000536 | 90 |
| Ankyrin repeat-containing domain | IPR020683 | 90 |
| Ankyrin repeat | IPR002110 | 89 |
| NAD(P)-binding domain | IPR016040 | 89 |
| Zinc finger, nuclear hormone receptor-type | IPR001628 | 87 |
| EF-hand-like domain | IPR011992 | 87 |
Figure 2Histogram presenting clusters of orthologous groups (COG) classification.
Of 20,810 unigenes, 5,604 sequences were assigned to 25 COG classifications.
Figure 3Pie charts showing gene ontology (GO) classification.
The distribution of D. immitis unigenes in three main categories (‘biological process’, ‘cellular component’ and ‘molecular function’) are shown in the subgraphs a–b, respectively.
Figure 4Distribution of sequence homology identified between heartworm unigenes and intestinal-expressed genes from three nematode species.
A total of 1,101 heartworm unigenes (5.3% of the total yield) contained similarities (e-value <0.00001) identified by BLAST searches.
Figure 5Distribution of sequence homology identified between heartworm unigenes and several groups of EST-clusters from NEMBASE4.
(a) Group A includes 22 animal-parasitic nematode species without filarial species; Group B includes 29 other nematode species without animal-parasitic species (including eight free-living species, 18 plant-parasitic species and three entomopathogenic species); Group C&D includes 8 filarial species excluding D. immitis. (b) Group C includes six Wolbachia-containing filarial species excluding D. immitis; Group D includes two non-Wolbachia filarial species; Group A&B includes 51 non-filarial nematode species.