Cinzia Cantacessi, Makedonka Mitreva, Aaron R Jex, Neil D Young, Bronwyn E Campbell, Ross S Hall, Maria A Doyle, Stuart A Ralph, Elida M Rabelo, Shoba Ranganathan, Paul W Sternberg, Alex Loukas, Robin B Gasser.
Abstract
BACKGROUND: The blood-feeding hookworm Necator americanus infects hundreds of millions of people worldwide. In order to elucidate fundamental molecular biological aspects of this hookworm, the transcriptome of the adult stage of Necator americanus was explored using next-generation sequencing and bioinformatic analyses.
Year: 2010 PMID: 20485481 PMCID: PMC2867931 DOI: 10.1371/journal.pntd.0000684
Source DB: PubMed Journal: PLoS Negl Trop Dis ISSN: 1935-2727
Summary of the expressed sequence tag (EST) data for the adult stage of Necator americanus determined following 454 sequencing and detailed bioinformatics annotation and analyses.
| No. of EST clusters | 19,997 |
| Average length (± standard deviation) | 369 ± 215.31 bp |
| Containing an Open Reading Frame | 12,799 |
| Signal peptides | 274 |
| Returning InterProScan results | 7,214 (2,381 domains) |
| Gene Ontology | 2,950 (887 terms) |
| Biological process | 4,830 (314 terms) |
| Cellular component | 3,087 (117 terms) |
| Molecular function | 8,671 (456 terms) |
| Prediction of biological pathways (KOBAS) | 235 |
The thirty most abundant protein domains inferred using the InterProScan software from peptides inferred for Necator americanus and Ancylostoma caninum.
| InterProScan domain | No. (%) for N. americanus | No. (%) for A. caninum |
| WD40 | 315 (10.6) ▾ | 553 (14.5) ▴ |
| EF-HAND | 196 (6.7) | 187 (2.6) |
| Proteinase inhibitors | 230 (7.8) ▴ | 126 (3.3) ▾ |
| Proteases | 179 (6.1) | 177 (4.6) |
| Protein kinases | 131 (4.4) ▾ | 388 (10.1) ▴ |
| NAD(P)-binding domain | 114 (3.9) ▾ | 160 (4.2) ▴ |
| Transthyretin-like | 97 (3.3) ▴ | 19 (0.5) ▾ |
| Galectin, carbohydrate recognition domain | 95 (3.2) ▴ | 66 (1.7) ▾ |
| SCP-like extracellular | 94 (3.2) ▾ | 362 (9.5) ▴ |
| Peptidyl-prolyl cis-trans isomerase | 91 (3.1) ▴ | 25 (0.6) ▾ |
| RNA recognition motif, RNP-1 | 83 (2.8) ▾ | 198 (6.2) ▴ |
| Mitochondrial substrate/solute carrier | 88 (3) | 88 (2.3) |
| Thioredoxin fold | 81 (2.7) | 70 (1.8) |
| Allergen V5/Tpx-1 related | 64 (2.2) ▾ | 232 (6) ▴ |
| Zinc finger, C2H2-type | 64 (2.2) ▾ | 185 (4.8) ▴ |
| Aldo/keto reductase | 60 (2) ▴ | 9 (0.2) ▾ |
| Src homology-3 domain | 57 (2) | 98 (2.6) |
| Actin/actin like | 56 (2) | 49 (1.3) |
| Short-chain dehydrogenase/reductase SDR | 51 (1.7) | 51 (6.2) |
| Metridin-like ShK toxin | 47 (1.6) ▴ | 6 (0.1) ▾ |
| Histone-fold | 44 (1.5) ▴ | 19 (0.5) ▾ |
| Nucleotide binding, alpha beta plait | 43 (1.4) ▾ | 80 (2.1) ▴ |
| Heat shock protein Hsp20 | 41 (1.4) ▴ | 14 (0.4) ▾ |
| Chaperonin Cpn60/TCP-1 | 39 (1.3) | 50 (1.3) |
| Cytochrome P450 | 39 (1.3) ▴ | 4 (0.1) ▾ |
| Ankyrin | 38 (1.2) ▾ | 271 (7) ▴ |
| Annexin repeat | 37 (1.2) | 53 (1.4) |
| Ubiquitin-conjugating enzyme, E2 | 37 (1.2) ▴ | 15 (0.4) ▾ |
| Tetratricopeptide repeat | 36 (1.2) | 20 (0.5) |
| Protein-tyrosine phosphatase, receptor/non-receptor type | 35 (1.2) ▾ | 76 (2) ▴ |
The arrows indicate a statistically significant (p<0.05; chi-square test) higher (▴) or lower (▾) number of genes encoding proteins with a given InterPro domain, for domains common to N. americanus and A. caninum.
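The chi-square comparison behind the arrows can be illustrated with a minimal sketch. It assumes a 2×2 table per domain (peptides with and without the domain in each species) and a Pearson statistic without continuity correction; the record does not state which variant was used. The totals 2,950 and 3,820 are assumptions, back-calculated approximately from the counts and percentages in the table, and the function names are hypothetical.

```python
# Critical value of the chi-square distribution for p = 0.05 with 1 degree of freedom.
CRITICAL_P05_DF1 = 3.841


def chi_square_2x2(a, b, c, d):
    """Pearson chi-square statistic (no continuity correction) for [[a, b], [c, d]]."""
    n = a + b + c + d
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    if denom == 0:
        raise ValueError("every row and column of the table must have a non-zero total")
    return n * (a * d - b * c) ** 2 / denom


def compare_domain(count_na, total_na, count_ac, total_ac):
    """Return (statistic, direction) for one domain.

    direction is "higher" or "lower" for N. americanus relative to A. caninum
    when the statistic exceeds the p = 0.05 threshold, and "" otherwise.
    """
    stat = chi_square_2x2(
        count_na, total_na - count_na,
        count_ac, total_ac - count_ac,
    )
    if stat < CRITICAL_P05_DF1:
        return stat, ""
    if count_na / total_na > count_ac / total_ac:
        return stat, "higher"
    return stat, "lower"


# ASSUMED totals (not stated in the table): roughly count / (percentage / 100).
TOTAL_NA = 2950  # N. americanus
TOTAL_AC = 3820  # A. caninum

for name, n_na, n_ac in [
    ("WD40", 315, 553),
    ("Proteinase inhibitors", 230, 126),
    ("Mitochondrial substrate/solute carrier", 88, 88),
]:
    stat, direction = compare_domain(n_na, TOTAL_NA, n_ac, TOTAL_AC)
    print(f"{name}: chi-square = {stat:.2f}, direction = {direction or 'none'}")
```

Under these assumed totals, the directions returned for the three example rows can be checked against the arrows printed in the table; a different choice of totals or test variant may shift borderline cases.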
The twenty most abundant Gene Ontology (GO) terms (according to the categories ‘biological process’, ‘cellular component’ and ‘molecular function’) for peptides inferred for Necator americanus and Ancylostoma caninum.
| GO term | GO code | No. (%) for N. americanus | No. (%) for A. caninum |
| Biological process | | | |
| Translation | GO:0006412 | 599 (20.3) | 146 (3.8) |
| Metabolic process | GO:0008152 | 438 (14.9) | 284 (7.4) |
| Proteolysis | GO:0006508 | 329 (11.2) | 254 (6.6) |
| Oxidation reduction | GO:0055114 | 197 (6.7) | 85 (2.2) |
| Protein amino acid phosphorylation | GO:0006468 | 147 (5) | 159 (4.2) |
| Regulation of transcription, DNA-dependent | GO:0006355 | 137 (4.6) | 53 (1.4) |
| Transport | GO:0006810 | 134 (4.5) | 114 (3) |
| ATP synthesis coupled proton transport | | 111 (3.7) | 34 (0.9) |
| Protein folding | GO:0006457 | 104 (3.5) | 48 (1.3) |
| Carbohydrate metabolic process | GO:0005975 | 101 (3.4) | 100 (2.6) |
| Small GTPase mediated signal transduction | GO:0007264 | 62 (2.1) | 38 (1) |
| Ubiquitin-dependent protein catabolic process | GO:0006511 | 62 (2.1) | 30 (0.8) |
| Intracellular protein transport | GO:0006886 | 59 (2) | 52 (1.4) |
| Vesicle-mediated transport | GO:0016192 | 54 (1.8) | 39 (1) |
| Nucleosome assembly | GO:0006334 | 53 (1.8) | 21 (0.5) |
| Protein transport | GO:0015031 | 50 (1.7) | 40 (1) |
| Response to oxidative stress | GO:0006979 | 48 (1.6) | 9 (0.2) |
| Protein amino acid dephosphorylation | GO:0006470 | 47 (1.6) | 45 (1.2) |
| Protein polymerization | GO:0051258 | 46 (1.6) | 15 (0.4) |
| Cellular component | | | |
| Intracellular | GO:0005622 | 798 (25.1) | 297 (7.5) |
| Ribosome | GO:0005840 | 499 (17) | 88 (2.3) |
| Membrane | GO:0016020 | 296 (9.7) | 251 (6.6) |
| Nucleus | GO:0005634 | 280 (9.5) | 174 (4.6) |
| Integral to membrane | GO:0016021 | 185 (6.3) | 143 (3.7) |
| Cytoplasm | GO:0005737 | 141 (4.8) | 122 (3.2) |
| Extracellular region | GO:0005576 | 86 (2.9) | 156 (4) |
| Nucleosome | GO:0000786 | 51 (1.7) | 18 (0.5) |
| Protein complex | GO:0043234 | 46 (1.6) | 15 (0.4) |
| Endoplasmic reticulum | GO:0005783 | 38 (1.3) | 35 (0.9) |
| Mitochondrion | GO:0005739 | 36 (1.2) | 5 (0.1) |
| Cytoskeleton | GO:0005856 | 33 (1.1) | 14 (0.4) |
| Microtubule | GO:0005874 | 31 (1) | 15 (0.4) |
| Proton-transporting two-sector ATPase complex, catalytic domain | GO:0033178 | 29 (1) | 19 (0.5) |
| Proton-transporting two-sector ATPase complex, proton-transporting domain | GO:0033177 | 27 (0.9) | 8 (0.2) |
| Mitochondrial inner membrane | GO:0005743 | 22 (0.7) | 7 (0.2) |
| Proton-transporting ATP synthase complex, catalytic core F(1) | GO:0045261 | 18 (0.6) | 9 (0.2) |
| Clathrin adaptor complex | GO:0030131 | 15 (0.5) | 9 (0.2) |
| Proteasome core complex | GO:0005839 | 15 (0.5) | 15 (0.4) |
| Eukaryotic translation elongation factor 1 complex | GO:0005853 | 15 (0.5) | 8 (0.2) |
| Molecular function | | | |
| ATP binding | GO:0005524 | 558 (18.9) | 514 (13.4) |
| Structural constituent of ribosome | GO:0003735 | 527 (17.9) | 91 (2.4) |
| Catalytic activity | GO:0003824 | 429 (14.5) | 346 (9) |
| Oxidoreductase activity | GO:0016491 | 317 (10.8) | 185 (4.8) |
| Protein binding | GO:0005515 | 311 (10.5) | 212 (5.6) |
| Binding | GO:0005488 | 287 (9.3) | 237 (6.2) |
| Zinc ion binding | GO:0008270 | 287 (9.3) | 214 (5.6) |
| DNA binding | GO:0003677 | 242 (8.2) | 121 (3.2) |
| Serine-type endopeptidase inhibitor activity | GO:0004867 | 230 (7.8) | 25 (0.7) |
| Nucleic acid binding | GO:0003676 | 204 (6.9) | 192 (5) |
| GTP binding | GO:0005525 | 202 (6.8) | 95 (2.5) |
| Calcium ion binding | GO:0005509 | 169 (5.8) | 79 (2) |
| Electron carrier activity | GO:0009055 | 140 (4.7) | 59 (1.5) |
| Heme binding | GO:0020037 | 134 (4.5) | 17 (0.4) |
| RNA binding | GO:0003723 | 124 (4.2) | 90 (2.3) |
| Iron ion binding | GO:0005506 | 123 (4.2) | 18 (0.5) |
| Nucleotide binding | GO:0000166 | 113 (3.8) | 138 (3.6) |
| Aspartic-type endopeptidase activity | GO:0004190 | 101 (3.4) | 27 (0.7) |
| Sugar binding | GO:0005529 | 99 (3.4) | 18 (0.5) |
| Transcription factor activity | GO:0003700 | 98 (3.3) | 47 (1.2) |
Figure 1. SimiTri analysis.
Relationships of proteins predicted for Necator americanus with homologues from Ancylostoma caninum and Caenorhabditis elegans, displayed in a SimiTri plot [44]. Descriptions of the proteins with the most abundant InterPro domains in each similarity group are given in the boxes.
Description of Caenorhabditis elegans orthologues of Necator americanus contigs for which inferred peptides were associated with ‘druggable’ InterPro domains and/or Enzyme Commission (EC) numbers, and examples of candidate nematocidal compounds linked to these domains predicted using the BRENDA database (see Materials and Methods).
| C. elegans gene ID | C. elegans gene name | Phenotypes | Description | A. caninum orthologue | InterPro domain | Candidate compound |
| WBGene00003816 | | Lethal, larval lethal, larval arrest, sterile | Asparagine synthetase | √ | | 1-methyl-4-(1-methylethyl)-7-oxabicyclo[2.2.1]heptane |
| WBGene00012166 | W01A8.4 | Lethal, embryonic lethal, larval arrest, sterile | Mitochondrial NADH dehydrogenase (ubiquinone) complex (complex I) subunit | √ | | 1-Geranyl-2-methylbenzimidazole |
| WBGene00021952 | | Larval lethal, maternal sterile, sick | V0 subunit of V-ATPase | √ | | Efrapeptin |
| WBGene00007001 | | Embryonic lethal, slow growth, sick, sterile progeny | Elongation factor Tu | √ | | Leu-Gly-Asn repeat-enriched protein |
| WBGene00010562 | | Embryonic lethal, larval lethal, larval arrest, slow growth, sick | AAA ATPase | √ | | 1-beta-D-ribofuranosyl-1,2,4-triazole-3-carboxamide-5′-triphosphate |
| WBGene00018963 | | Embryonic lethal, larval arrest, maternal sterile | Mitochondrial processing protease enhancing protein | √ | IPR001431 (Insulinase family) | |
| WBGene00001566 | | Embryonic lethal, sterile progeny, slow growth | Acyl-CoA dehydrogenase | √ | | |
| WBGene00004020 | | Embryonic lethal, larval arrest, sterile | Intestinal acid phosphatase | √ | | 1,10-phenanthroline |
| WBGene00009161 | F26E4.6 | Embryonic lethal, larval arrest, slow growth, maternal sterile | Cytochrome c oxidase | √ | | amyloid beta |
| WBGene00022169 | Y71H2AM.4 | Embryonic lethal, larval arrest, slow growth | NADH:ubiquinone oxidoreductase, NDUFC2/B14.5B subunit | √ | | 1-Geranyl-2-methylbenzimidazole |
| WBGene00000151 | | Embryonic lethal | AP endonuclease (family 2) | √ | | |
| WBGene00019885 | R05D8.7 | Embryonic lethal | Reductases with broad range of substrate specificities | √ | IPR002198 (Short-chain dehydrogenase/reductase) | 4-Trifluoromethyl-2,3-dihydro-2,3-dihydroxybenzoate |
| WBGene00020089 | R119.3 | Embryonic lethal | Dehydrogenase | | IPR002198 (Short-chain dehydrogenase/reductase) | 4-Trifluoromethyl-2,3-dihydro-2,3-dihydroxybenzoate |
| WBGene00011803 | T16G12.1 | Embryonic lethal | Aminopeptidase | √ | | |
| WBGene00020149 | T01D1.4 | Embryonic lethal | Uncharacterized conserved protein, contains double-stranded beta-helix domain | √ | | |
| WBGene00006592 | | Embryonic lethal | Zinc metalloprotease | √ | | |
| WBGene00019001 | F57B10.3 | Embryonic lethal, larval lethal, slow growth | Phosphoglycerate mutase | √ | | 1,10-phenanthroline |
| WBGene00016356 | C33F10.8 | Embryonic lethal, slow growth | Protein tyrosine phosphatase | √ | IPR000387 (Tyrosine protein phosphatases) | 1,3-difluoro-2-((E)-2-nitrovinyl)benzene |
These genes are not present in H. sapiens. The presence of known Ancylostoma caninum orthologues is also indicated (‘√’).