| Literature DB >> 15854223 |
Makedonka Mitreva1, James P McCarter, Prema Arasu, John Hawdon, John Martin, Mike Dante, Todd Wylie, Jian Xu, Jason E Stajich, Wadim Kapulkin, Sandra W Clifton, Robert H Waterston, Richard K Wilson.
Abstract
BACKGROUND: Hookworms, infecting over one billion people, are the mostly closely related major human parasites to the model nematode Caenorhabditis elegans. Applying genomics techniques to these species, we analyzed 3,840 and 3,149 genes from Ancylostoma caninum and A. ceylanicum.Entities:
Mesh:
Substances:
Year: 2005 PMID: 15854223 PMCID: PMC1112591 DOI: 10.1186/1471-2164-6-58
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Ancylostoma libraries sequenced and their properties
| Nematode stage (vector or SL1 based) | ESTs Submitted | Nucleotides (million) | Mean read length (bp) | StDev | |
| Infective L3 (UniZap) | 5,679 | 2,632 | 358 | 109 | |
| Tissue arrested L3 (SL1) | 820 | 0,318 | 344 | 151 | |
| Serum stimulated L3 (pAMP) | 2,832 | 1,273 | 441 | 150 | |
| Overall | 9,331 | 4,223 | 381 | 137 | |
| Contigs – 5,484 (4,020 clusters) | 502 | 168 | |||
| Infective L3 (λZAP II) | 3,359 | 2,021 | 500 | 127 | |
| Infective L3 (SL1) | 3,306 | 1,550 | 400 | 143 | |
| Adult (M1 SL1) | 629 | 0,319 | 460 | 134 | |
| Adult (M2 SL1) | 480 | 0,255 | 467 | 131 | |
| Adult (λZAP II) | 2,817 | 1,646 | 500 | 139 | |
| Overall | 10,591 | 5,791 | 465 | 135 | |
| Contigs – 4,953 (3,369 clusters) | 572 | 179 | |||
Figure 1Ancylostoma NemaGene v2.0 clustering showing the distribution of ESTs by cluster size. For example, there are three A. caninum cluster of size 22 containing a sum of 66 ESTs and there were eight A. ceylanicum clusters of size 22 containing a sum of 176 ESTs. Cluster size (x-axis) is shown to scale for 1–75 members, with the size of larger clusters indicated.
Figure 2Venn diagram of A. caninum (A) and A. ceylanicum (B) clusters, based on stage of origin of each cluster's EST members. The majority of clusters are represented by only one stage in this investigation, though greater depth of sampling would likely increase representation by multiple stages.
Most abundantly represented protein domains in A. caninum and A. ceylanicum datasets
| Species | InterPro ID | Clusters # | Domain descriptor |
| IPR001230 | 141 | Prenyl group, CAAX box, attachment site | |
| IPR001687 | 66 | ATP/GTP-binding site motif A (P-loop) | |
| IPR000694 | 62 | Proline-rich region | |
| IPR001472 | 51 | Bipartite nuclear localization signal | |
| IPR000345 | 29 | Cytochrome c heme-binding site | |
| IPR001283 | 25 | Allergen V5/Tpx-1 related | |
| IPR002048 | 24 | Calcium-binding EF-hand | |
| IPR000504 | 21 | RNA-binding region RNP-1 (RNA recognition motif) | |
| IPR000719 | 19 | Protein kinase | |
| IPR007087 | 17 | Zn-finger, C2H2 type | |
| IPR000694 | 214 | Proline-rich region | |
| IPR001230 | 153 | Prenyl group, CAAX box, attachment site | |
| IPR001687 | 125 | ATP/GTP-binding site motif A (P-loop) | |
| IPR000345 | 50 | Cytochrome c heme-binding site | |
| IPR006209 | 48 | EGF-like domain | |
| IPR000504 | 37 | RNA-binding region RNP-1 (RNA recognition motif) | |
| IPR001283 | 34 | Allergen V5/Tpx-1 related | |
| IPR001472 | 32 | Bipartite nuclear localization signal | |
| IPR000169 | 32 | Eukaryotic thiol (cysteine) protease | |
| IPR001534 | 27 | Transthyretin-like |
GO mappings for A. caninum and A. ceylanicum clusters
| Categories and subcategories | Representation | % Representation of total | Representation | % Representation of total | |
| biological process | |||||
| cellular process | 192 | 4.80 | 238 | 4.36 | |
| cell communication | 62 | 1.55 | 66 | 1.21 | |
| cell motility | 1 | 0.03 | 0 | 0.00 | |
| cell death | 1 | 0.03 | 1 | 0.02 | |
| cell growth and/or maintaince | 140 | 3.50 | 180 | 3.30 | |
| transport | 119 | 2.98 | 153 | 2.80 | |
| cell organization and biogenesis | 21 | 0.53 | 32 | 0.59 | |
| cell proliferation | 4 | 0.10 | 6 | 0.11 | |
| cell homeostasis | 1 | 0.03 | 4 | 0.07 | |
| physiological process | 579 | 14.48 | 785 | 14.38 | |
| response to endogenous stimulus | 1 | 0.03 | 5 | 0.09 | |
| response to external stimulus | 12 | 0.30 | 16 | 0.29 | |
| response to stress | 8 | 0.20 | 14 | 0.26 | |
| death | 1 | 0.03 | 1 | 0.02 | |
| metabolism | 466 | 11.65 | 653 | 11.96 | |
| hemostasis | 1 | 0.03 | 0 | 0.00 | |
| homeostasis | 3 | 0.08 | 4 | 0.07 | |
| secretion | 0 | 0.00 | 1 | 0.02 | |
| development | 11 | 0.28 | 18 | 0.33 | |
| cellular component | |||||
| cell | 327 | 8.18 | 385 | 7.05 | |
| intracellular | 212 | 5.30 | 277 | 5.08 | |
| cytoplasm | 166 | 4.15 | 181 | 3.32 | |
| nucleus | 45 | 1.13 | 84 | 1.54 | |
| ribonucleoprotein complex | 102 | 2.55 | 102 | 1.87 | |
| respiratory chain complex | 4 | 0.10 | 5 | 0.09 | |
| chromosome | 8 | 0.20 | 13 | 0.24 | |
| thylakoid | 0 | 0.00 | 1 | 0.02 | |
| membrane | 146 | 3.65 | 146 | 2.67 | |
| extracellular | 33 | 0.83 | 50 | 0.92 | |
| Unlocalized | 1 | 0.03 | 7 | 0.13 | |
| molecular function | |||||
| binding | 323 | 8.08 | 521 | 9.55 | |
| carbohydrate binding | 6 | 0.15 | 20 | 0.37 | |
| lipid binding | 6 | 0.15 | 12 | 0.22 | |
| metal ion binding | 56 | 1.40 | 72 | 1.32 | |
| nucleic acid binding | 109 | 2.73 | 201 | 3.68 | |
| nucleotide binding | 133 | 3.33 | 214 | 3.92 | |
| protein binding | 11 | 0.28 | 22 | 0.40 | |
| apoptosis regulator activity | 1 | 0.03 | 1 | 0.02 | |
| chaperone activity | 5 | 0.13 | 10 | 0.18 | |
| cell adhesion molecule activity | 2 | 0.05 | 1 | 0.02 | |
| catalytic activity | 293 | 7.33 | 445 | 8.15 | |
| enzyme regulator activity | 24 | 0.60 | 35 | 0.64 | |
| molecular function unknown | 38 | 0.95 | 46 | 0.84 | |
| motor activity | 3 | 0.08 | 12 | 0.22 | |
| signal transducer activity | 55 | 1.38 | 61 | 1.12 | |
| structural molecule activity | 107 | 2.68 | 127 | 2.33 | |
| transcription regulator activity | 19 | 0.48 | 33 | 0.60 | |
| translation regulator activity | 14 | 0.35 | 17 | 0.31 | |
| transporter activity | 128 | 3.20 | 180 | 3.30 | |
Kegg Biochemical pathway mappings for A. caninum and A. ceylanicum clusters
| AC | Clusters per library | AE | Clusters per library | Total # of enzymes in KEGG | ||||||||
| KEGG CATEGORY REPRESENTEDa | Clb | iL3 | taL3 | ssL3 | Mixed | Enzc | Clb | iL3 | Ad | Mixed | Enzc | |
| 1. Carbohydrate metabolism | ||||||||||||
| 1.1 Glycolysis / Gluconeogenesis | 23 | 13 | 0 | 5 | 5 | 22 | 25 | 10 | 11 | 4 | 23 | 40 |
| 1.2 Citrate cycle (TCA cycle) | 17 | 8 | 0 | 4 | 5 | 16 | 15 | 4 | 8 | 3 | 15 | 23 |
| 1.3 Pentose phosphate pathway | 9 | 3 | 0 | 4 | 2 | 9 | 12 | 5 | 3 | 4 | 8 | 34 |
| 1.4 Pentose and glucuronate interconversions | 8 | 3 | 0 | 3 | 2 | 9 | 9 | 5 | 4 | 0 | 8 | 53 |
| 1.5 Fructose and mannose metabolism | 14 | 6 | 0 | 4 | 4 | 15 | 20 | 7 | 11 | 2 | 15 | 61 |
| 1.6 Galactose metabolism | 10 | 4 | 0 | 4 | 2 | 8 | 12 | 6 | 5 | 1 | 12 | 37 |
| 1.7 Ascorbate and aldarate metabolism | 7 | 4 | 0 | 2 | 1 | 4 | 5 | 5 | 0 | 0 | 4 | 29 |
| 1.8 Pyruvate metabolism | 25 | 11 | 0 | 11 | 3 | 23 | 27 | 8 | 17 | 2 | 26 | 67 |
| 1.9 Glyoxylate and dicarboxylate metabolism | 13 | 6 | 0 | 5 | 2 | 14 | 9 | 1 | 5 | 3 | 17 | 58 |
| 1.10 Propanoate metabolism | 22 | 7 | 1 | 10 | 4 | 20 | 25 | 11 | 8 | 6 | 22 | 46 |
| 1.11 Butanoate metabolism | 22 | 9 | 1 | 7 | 5 | 23 | 29 | 14 | 14 | 1 | 26 | 52 |
| 1.12 C5-Branched dibasic acid metabolism | 4 | 3 | 0 | 1 | 0 | 2 | 2 | 1 | 0 | 1 | 1 | 20 |
| 1.13 Inositol metabolism | 6 | 2 | 0 | 1 | 3 | 4 | 7 | 2 | 3 | 2 | 4 | 5 |
| 2. Energy metabolism | ||||||||||||
| 2.1 Oxidative phosphorylation | 24 | 7 | 0 | 6 | 11 | 11 | 33 | 10 | 14 | 9 | 13 | 14 |
| 2.2 ATP synthesis | 8 | 2 | 0 | 3 | 3 | 1 | 11 | 4 | 3 | 4 | 1 | 1 |
| 2.4 Carbon fixation | 11 | 3 | 0 | 3 | 5 | 11 | 11 | 3 | 5 | 3 | 13 | 23 |
| 2.5 Reductive carboxylate cycle (CO2 fixation) | 12 | 7 | 0 | 1 | 4 | 8 | 9 | 2 | 4 | 3 | 7 | 13 |
| 2.6 Methane metabolism | 6 | 4 | 0 | 0 | 2 | 5 | 6 | 0 | 4 | 2 | 6 | 26 |
| 2.7 Nitrogen metabolism | 11 | 2 | 0 | 5 | 4 | 14 | 12 | 5 | 5 | 2 | 15 | 64 |
| 2.8 Sulfur metabolism | 5 | 1 | 0 | 1 | 3 | 9 | 6 | 3 | 1 | 2 | 9 | 30 |
| 3. Lipid metabolism | ||||||||||||
| 3.1 Fatty acid biosynthesis (path 1) | 6 | 2 | 0 | 3 | 1 | 11 | 7 | 3 | 3 | 1 | 6 | 14 |
| 3.2 Fatty acid biosynthesis (path 2) | 8 | 2 | 0 | 5 | 1 | 6 | 6 | 3 | 2 | 1 | 5 | 8 |
| 3.3 Fatty acid metabolism | 14 | 6 | 1 | 6 | 1 | 17 | 21 | 13 | 7 | 1 | 16 | 28 |
| 3.4 Synthesis and degradation of ketone bodies | 2 | 0 | 0 | 1 | 1 | 2 | 8 | 4 | 3 | 1 | 3 | 6 |
| 3.5 Sterol biosynthesis | 4 | 1 | 1 | 2 | 0 | 4 | 4 | 2 | 2 | 0 | 9 | 35 |
| 3.6 Bile acid biosynthesis | 11 | 7 | 1 | 2 | 1 | 11 | 11 | 6 | 4 | 1 | 10 | 27 |
| 3.8 Androgen and estrogen metabolism | 7 | 5 | 0 | 2 | 0 | 9 | 7 | 4 | 3 | 0 | 8 | 26 |
| 4. Nucleotide metabolism | ||||||||||||
| 4.1 Purine metabolism | 27 | 11 | 0 | 11 | 5 | 28 | 32 | 14 | 11 | 7 | 32 | 99 |
| 4.2 Pyrimidine metabolism | 16 | 8 | 1 | 5 | 2 | 15 | 22 | 9 | 12 | 1 | 22 | 61 |
| 4.3 Nucleotide sugars metabolism | 6 | 4 | 0 | 0 | 2 | 3 | 4 | 2 | 2 | 0 | 4 | 30 |
| 5. Amino acid metabolism | ||||||||||||
| 5.1 Glutamate metabolism | 11 | 3 | 0 | 5 | 3 | 14 | 16 | 8 | 7 | 1 | 18 | 36 |
| 5.2 Alanine and aspartate metabolism | 14 | 1 | 0 | 8 | 5 | 15 | 14 | 5 | 6 | 3 | 15 | 38 |
| 5.3 Glycine, serine and threonine metabolism | 19 | 7 | 0 | 9 | 3 | 14 | 21 | 8 | 10 | 3 | 24 | 56 |
| 5.4 Methionine metabolism | 6 | 1 | 0 | 4 | 1 | 9 | 6 | 5 | 1 | 0 | 8 | 24 |
| 5.5 Cysteine metabolism | 8 | 2 | 0 | 2 | 4 | 11 | 7 | 5 | 2 | 0 | 9 | 23 |
| 5.6 Valine, leucine and isoleucine degradation | 16 | 3 | 1 | 8 | 4 | 16 | 22 | 11 | 6 | 5 | 18 | 32 |
| 5.7 Valine, leucine and isoleucine biosynthesis | 7 | 1 | 0 | 4 | 2 | 7 | 9 | 6 | 2 | 1 | 8 | 15 |
| 5.8 Lysine biosynthesis | 11 | 1 | 0 | 6 | 4 | 10 | 9 | 4 | 3 | 2 | 8 | 31 |
| 5.9 Lysine degradation | 19 | 8 | 0 | 8 | 3 | 14 | 19 | 12 | 6 | 1 | 17 | 47 |
| 5.10 Arginine and proline metabolism | 18 | 4 | 0 | 8 | 6 | 20 | 22 | 11 | 7 | 4 | 20 | 71 |
| 5.11 Histidine metabolism | 10 | 4 | 0 | 4 | 2 | 8 | 10 | 5 | 4 | 1 | 8 | 39 |
| 5.12 Tyrosine metabolism | 18 | 8 | 0 | 7 | 3 | 19 | 19 | 11 | 5 | 3 | 19 | 67 |
| 5.13 Phenylalanine metabolism | 13 | 5 | 0 | 3 | 5 | 11 | 14 | 7 | 6 | 1 | 12 | 40 |
| 5.14 Tryptophan metabolism | 17 | 8 | 0 | 8 | 1 | 15 | 22 | 16 | 4 | 2 | 18 | 61 |
| 5.15 Phenylalanine, tyrosine and tryptophan biosynthesis | 5 | 1 | 0 | 2 | 2 | 6 | 4 | 2 | 0 | 2 | 7 | 31 |
| 5.16 Urea cycle and metabolism of amino groups | 10 | 1 | 0 | 5 | 4 | 14 | 10 | 2 | 6 | 2 | 11 | 35 |
| 6. Metabolism of other amino acids | ||||||||||||
| 6.1 beta-Alanine metabolism | 14 | 4 | 1 | 7 | 2 | 13 | 11 | 8 | 1 | 2 | 10 | 32 |
| 6.2 Taurine and hypotaurine metabolism | 1 | 0 | 0 | 0 | 1 | 1 | 2 | 2 | 0 | 0 | 3 | 14 |
| 6.3 Aminophosphonate metabolism | 4 | 0 | 0 | 3 | 1 | 3 | 5 | 2 | 3 | 0 | 5 | 15 |
| 6.4 Selenoamino acid metabolism | 7 | 0 | 0 | 4 | 3 | 12 | 12 | 6 | 4 | 2 | 15 | 22 |
| 6.5 Cyanoamino acid metabolism | 2 | 1 | 0 | 1 | 0 | 1 | 7 | 5 | 1 | 1 | 6 | 19 |
| 6.6 D-Glutamine and D-glutamate metabolism | 2 | 0 | 0 | 1 | 1 | 2 | 2 | 1 | 0 | 1 | 2 | 12 |
| 6.7 D-Arginine and D-ornithine metabolism | 3 | 1 | 0 | 0 | 2 | 2 | 3 | 0 | 2 | 1 | 2 | 10 |
| 6.9 Glutathione metabolism | 5 | 1 | 0 | 0 | 4 | 4 | 9 | 5 | 3 | 1 | 6 | 27 |
| 7. Metabolism of complex carbohydrates | ||||||||||||
| 7.1 Starch and sucrose metabolism | 18 | 2 | 0 | 13 | 3 | 18 | 20 | 13 | 6 | 1 | 20 | 75 |
| 7.2 N-Glycans biosynthesis | 7 | 4 | 0 | 1 | 2 | 7 | 7 | 2 | 4 | 1 | 9 | 27 |
| 7.3 O-Glycans biosynthesis | 3 | 1 | 0 | 1 | 1 | 2 | 6 | 5 | 1 | 0 | 3 | 8 |
| 7.5 Aminosugars metabolism | 6 | 3 | 0 | 2 | 1 | 6 | 10 | 5 | 5 | 0 | 10 | 39 |
| 7.8 Glycosaminoglycan degradation | 1 | 1 | 0 | 0 | 0 | 1 | 1 | 1 | 0 | 0 | 1 | 13 |
| 7.9 Chondroitin / Heparan sulfate biosynthesis | 5 | 3 | 0 | 1 | 1 | 4 | 6 | 2 | 4 | 0 | 4 | 18 |
| 7.10 Keratan sulfate biosynthesis | 1 | 0 | 0 | 1 | 0 | 1 | 2 | 1 | 1 | 0 | 1 | 6 |
| 8. Metabolism og complex lipids | ||||||||||||
| 8.1 Glycerolipid metabolism | 24 | 10 | 0 | 9 | 5 | 22 | 25 | 10 | 13 | 2 | 22 | 80 |
| 8.3 Inositol phosphate metabolism | 8 | 4 | 0 | 4 | 0 | 4 | 8 | 5 | 2 | 1 | 3 | 25 |
| 8.4 Sphingophospholipid biosynthesis | 1 | 1 | 0 | 0 | 0 | 1 | 2 | 1 | 1 | 0 | 2 | 8 |
| 8.5 Phospholipid degradation | 3 | 2 | 0 | 0 | 1 | 3 | 1 | 1 | 0 | 0 | 1 | 11 |
| 8.6 Sphingoglycolipid metabolism | 11 | 1 | 1 | 9 | 0 | 7 | 10 | 8 | 2 | 0 | 4 | 20 |
| 8.9 Globoside metabolism | 2 | 1 | 0 | 1 | 0 | 2 | 2 | 1 | 1 | 0 | 1 | 12 |
| 8.11 Prostaglandin and leukotriene metabolism | 8 | 2 | 0 | 1 | 5 | 8 | 7 | 4 | 3 | 0 | 6 | 19 |
| 9. Metabolism of cofactors and vitamins | ||||||||||||
| 9.2 Riboflavin metabolism | 4 | 1 | 0 | 3 | 0 | 2 | 2 | 2 | 0 | 0 | 2 | 13 |
| 9.3 Vitamin B6 metabolism | 6 | 4 | 0 | 1 | 1 | 3 | 6 | 4 | 1 | 1 | 5 | 23 |
| 9.4 Nicotinate and nicotinamide metabolism | 11 | 2 | 0 | 7 | 2 | 7 | 15 | 8 | 7 | 0 | 7 | 32 |
| 9.5 Pantothenate and CoA biosynthesis | 8 | 2 | 0 | 4 | 2 | 9 | 8 | 6 | 2 | 0 | 7 | 27 |
| 9.7 Folate biosynthesis | 5 | 2 | 1 | 0 | 2 | 5 | 6 | 1 | 4 | 1 | 5 | 25 |
| 9.8 One carbon pool by folate | 5 | 3 | 0 | 2 | 0 | 10 | 7 | 3 | 3 | 1 | 8 | 24 |
| 9.10 Porphyrin and chlorophyll metabolism | 18 | 4 | 0 | 10 | 4 | 12 | 27 | 15 | 8 | 4 | 13 | 56 |
| 9.11 Ubiquinone biosynthesis | 19 | 10 | 0 | 5 | 4 | 13 | 28 | 11 | 14 | 3 | 14 | 22 |
| 10. Biosynthesis of secondary metabolites | ||||||||||||
| 10.1 Terpenoid biosynthesis | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 0 | 2 | 0 | 4 | 12 |
| 10.3 Flavonoids, stilbene and lignin biosynthesis | 6 | 3 | 0 | 1 | 2 | 7 | 8 | 5 | 3 | 0 | 7 | 39 |
| 10.4 Alkaloid biosynthesis I | 5 | 2 | 0 | 3 | 0 | 6 | 3 | 3 | 0 | 0 | 5 | 36 |
| 10.8 Streptomycin biosynthesis | 2 | 0 | 0 | 2 | 0 | 3 | 4 | 2 | 1 | 1 | 4 | 14 |
| 10.9 Erythromycin biosynthesis | 2 | 0 | 0 | 2 | 0 | 3 | 3 | 2 | 1 | 0 | 3 | 6 |
| 11. Biodegradation of xenobiotics | ||||||||||||
| 11.4 Nitrobenzene degradation | 4 | 1 | 0 | 3 | 0 | 5 | 4 | 2 | 1 | 1 | 3 | 17 |
| 11.9 Tetrachloroethene degradation | 6 | 4 | 0 | 1 | 1 | 3 | 2 | 2 | 0 | 0 | 3 | 5 |
| 11.10 Styrene degradation | 4 | 2 | 0 | 2 | 0 | 3 | 6 | 5 | 0 | 1 | 5 | 18 |
| 11.1 gamma-Hexachlorocyclohexane degradation | 6 | 3 | 0 | 3 | 0 | 5 | 5 | 3 | 2 | 0 | 4 | 12 |
| 11.1 Fluorene degradation | 3 | 1 | 0 | 2 | 0 | 4 | 2 | 2 | 0 | 0 | 2 | 13 |
| 11.2 Benzoate degradation via CoA ligation | 22 | 8 | 1 | 10 | 3 | 18 | 25 | 14 | 10 | 1 | 18 | 38 |
| 11.2 Benzoate degradation via hydroxylation | 7 | 2 | 0 | 5 | 0 | 7 | 5 | 4 | 1 | 0 | 5 | 45 |
aA. caninum – 839 multiple and 786 unique mappings; A. ceylanicum – 957 multiple and 840 unique mappings. b Cluster, c Enzymes
Figure 3Venn diagram showing distribution of A. caninum (A) and A. ceylanicum (B) cluster BLAST matches by database. Amino acid level homologies (≥ e-05) were identified to non-Ancylostoma sequences for 65.8% (2,646/4,020) of A. caninum and 83.1% (2,801/3,369) of A. ceylanicum clusters. Databases used are: for C. elegans, Wormpep v.97 and mitochondrial protein sequences; for other nematodes, all GenBank nucleotide data for nematodes except C. elegans and Ancylostoma; for non-nematodes, nrGenBank (3/20/2003) with all nematode sequences removed.
Figure 4Distribution of A. caninum and A. ceylanicum contigs with and without database amino acid level homology by size of the longest predicted open reading frame (ORF).
Figure 5Distribution of dN/dS ratios among Ancylostoma ortholog pairs. dN and dS are the rates of nonsynonymous and synonymous amino acid substitutions, respectively.
Figure 6Relative distance based upon protein maximum likelihood. A. ceylanicum to A. cananum distance is similar to C. elegans to C. briggsae distance. Ancylostoma to Caenorhabditis distance for any species is 4.3X the Ancylostoma to Ancylostoma distance and 4.1X the Caenorhabditis to Caenorhabditis distance. The length of each line segment is proportional to the calculated branch length between the species.
The most abundantly represented transcripts in the A. caninum cDNA libraries
| non-redundant GenBank | ||||||||
| ESTs per cluster | Accession | |||||||
| iL3 | taL3 | ssL3 | Best identity descriptor | SW / TRa | E-value | Wormpep97b | ||
| AC00932.cl | 203 | 186 | 0 | 17 | CAD10437 | 2e – 304 | C06G4.2b | |
| AC00471.cl | 120 | 2 | 0 | 118 | O02345 | 2e – 83 | ZK896.9b | |
| AC00048.cl | 114 | 100 | 11 | 3 | Q20165 | 6e – 40 | F38E11.2b | |
| AC01032.cl | 104 | 104 | 0 | 0 | AAL50814 | 1e – 09 | - | |
| AC01031.cl | 93 | 90 | 0 | 3 | CAD10435 | 2e – 143 | - | |
| AC00807.cl | 69 | 4 | 2 | 63 | AAD13340 | 3e – 28 | F33A8.2 | |
| AC00205.cl | 65 | 51 | 1 | 13 | NP_579953 | 4e – 125 | F26E4.12 | |
| AC00967.cl | 54 | 9 | 0 | 45 | AAO22152 | 8e – 174 | R12H7.2 | |
| AC00134.cl | 44 | 44 | 0 | 0 | NP_497272 | 9e – 53 | K02F3.9b | |
| AC01029.cl | 39 | 37 | 2 | 0 | NP_510604 | 9e – 37 | F59F4.2b | |
| AC00137.cl | 38 | 37 | 0 | 1 | CAB03222 | 4e – 35 | R06C1.4b | |
| AC00023.cl | 38 | 36 | 0 | 2 | Q9XVF7 | 3e – 148 | B0250.1b | |
| AC00060.cl | 36 | 36 | 0 | 0 | AAC35986 | 2e – 134 | F11C7.3b | |
| AC01400.cl | 32 | 1 | 0 | 31 | NP_508781 | 1e – 20 | C07A12.1b | |
| AC00976.cl | 32 | 29 | 0 | 3 | AAL87687 | 6e – 12 | K11G9.6 | |
| AC00193.cl | 31 | 7 | 0 | 24 | BAB33421 | 2e – 61 | F58H1.7 | |
| AC00931.cl | 29 | 6 | 0 | 23 | novel | - | - | - |
| AC00079.cl | 28 | 25 | 2 | 1 | AAC08432 | 8e – 06 | - | |
| AC00980.cl | 27 | 11 | 0 | 16 | AAA79749 | 8e – 70 | R11F4.1b | |
| AC00971.cl | 26 | 9 | 0 | 17 | NP_491061 | 5e – 116 | Y71F9AL.13ab | |
| AC01023.cl | 25 | 22 | 2 | 1 | CAD20464 | 9e – 87 | F02A9.2 | |
| AC00913.cl | 25 | 6 | 17 | 2 | O62388 | 2e – 50 | W01D2.1b | |
| AC02930.cl | 24 | 0 | 0 | 24 | NP_504712 | 4e – 119 | T25F10.6b | |
| AC00252.cl | 24 | 12 | 12 | 0 | CAB54440 | 4e – 49 | Y48B6A.2b | |
| AC01020.cl | 23 | 20 | 3 | 0 | NP_492384 | 1e – 88 | F36A2.6b | |
a SW/TR is Swiss-prot and TrEMBL Proteinknowledgebase .
b C. elegans homolog has higher probability match than the best GenBank descriptor.
The most abundantly represented transcripts in the A. ceylanicum cDNA libraries
| non-redundant GenBank | |||||||
| ESTs Per cluster | Accession | ||||||
| iL3 | Ad | Best identity descriptor | SW / TRa | E-value | Wormpep97b | ||
| AE00908.cl | 323 | 320 | 3 | NP_510604 | 9e – 37 | F59F4.2b | |
| AE00787.cl | 205 | 205 | 0 | Q20165 | 7e – 39 | F38E11.2b | |
| AE01104.cl | 155 | 155 | 0 | Q9XWW1 | 8e – 80 | Y37D8A.10b | |
| AE00463.cl | 119 | 118 | 1 | NP_498422 | 9e – 56 | T26A5.9b | |
| AE00121.cl | 110 | 0 | 110 | NP_508613 | 7e – 121 | F59D8.1b | |
| AE00890.cl | 84 | 84 | 0 | NP_509237 | 5e – 16 | T08A9.8b | |
| AE00065.cl | 84 | 84 | 0 | NP_508656 | 1e – 35 | F47B7.1b | |
| AE00360.cl | 74 | 74 | 0 | novel | - | - | - |
| AE00048.cl | 72 | 72 | 0 | NP_502671 | 6e – 25 | B0513.3b | |
| AE00003.cl | 70 | 3 | 67 | novel | - | - | - |
| AE01410.cl | 63 | 60 | 3 | CAD20464 | 3e – 86 | F02A9.2 | |
| AE00056.cl | 62 | 56 | 6 | AAK77617 | 5e – 39 | M01H9.3ab | |
| AE00464.cl | 61 | 61 | 0 | novel | - | - | - |
| AE00227.cl | 59 | 0 | 59 | S49915 | 2e – 31 | ZK84.1 | |
| AE00746.cl | 54 | 0 | 54 | AAA19083 | 3e – 54 | B0280.5b | |
| AE00750.cl | 50 | 36 | 14 | NP_493708 | 7e – 45 | K01A2.2ab | |
| AE00072.cl | 47 | 0 | 47 | S51939 | 6e – 12 | C34D4.11 | |
| AE00591.cl | 45 | 45 | 0 | AAF99918 | 7e – 26 | F29B9.11b | |
| AE01221.cl | 44 | 44 | 0 | CAB62280 | 1e – 29 | Y59A8B.19 | |
| AE01407.cl | 41 | 41 | 0 | CAA93510 | 5e – 18 | K02B9.4b | |
| AE00033.cl | 41 | 0 | 41 | CAA50655 | 8e – 56 | T27E4.3 | |
| AE01361.cl | 40 | 39 | 1 | AAA68776 | 1e – 75 | C56C10.8b | |
| AE00322.cl | 39 | 37 | 2 | CAB54416 | 7e – 28 | Y38E10A.24b | |
| AE00536.cl | 36 | 34 | 2 | BAB71316 | 4e – 134 | F25B5.4a | |
| AE00503.cl | 35 | 13 | 22 | NP_498520 | 3e – 283 | F31E3.5b | |
a SW/TR is Swiss-prot and TrEMBL Proteinknowledgebase .
b C. elegans homolog has higher probability match than the best GenBank descriptor.
C. elegans candidate orphans (1,358 out of 21,437) matching Ancylostoma clusters
| Descriptor | ESTs | E-value | C. elegans | Matched region length (%) | %ID | ||
| F31E8.1 | Hypothetical protein | AC05087.cl | 1 | 1e – 07 | 249 | 14.5 | 45 |
| F57B10.14 | Hypothetical protein | AE02023.cl | 2 | 3e – 20 | 56 | 75.0 | 69 |
| R10E9.3 | Contains Cytochrome C heme-binding site | AC02329.cl | 1 | 2e – 10 | 149 | 79.2 | 32 |
| AE00556.cl | 2 | 5e – 19 | 149 | 97.3 | 31 | ||
| T07A9.13 | Putative nuclear encoded protein | AE02236.cl | 1 | 4e – 31 | 111 | 91.9 | 49 |
| Y35H6.1 | Hypothetical protein | AE03902.cl | 1 | 6e – 23 | 161 | 47.2 | 48 |
| Y41C4A.3 | Hypothetical protein | AE00269.cl | 14 | 1e – 05 | 162 | 49.4 | 37 |
| Y54G2A.27 | Hypothetical protein | AC04390.cl | 1 | 2e – 05 | 229 | 14.4 | 38 |
| AE01938.cl | 9 | 8e – 07 | 229 | 11.4 | 48 | ||
| ZC487.3 | Hypothetical protein | AC04655.cl | 1 | 2e – 08 | 79 | 81.0 | 38 |
| ZK686.1 | Nuclear transition protein | AC00867.cl | 16 | 2e – 07 | 44 | 68.2 | 62 |
| AE01651.cl | 3 | 3e – 07 | 44 | 68.2 | 62 | ||
| ZK84.5 | Hypothetical protein | AC00410.cl | 3 | 6e – 11 | 84 | 70.2 | 46 |
a Of 21,437 proteins 1,358 were candidate orphans
b AC, Ancylostoma caninum; AE, A. ceylanicum