| Literature DB >> 15186490 |
Yvonne M Harcus1, John Parkinson, Cecilia Fernández, Jennifer Daub, Murray E Selkirk, Mark L Blaxter, Rick M Maizels.
Abstract
BACKGROUND: Parasitism is a highly successful mode of life and one that requires suites of gene adaptations to permit survival within a potentially hostile host. Among such adaptations is the secretion of proteins capable of modifying or manipulating the host environment. Nippostrongylus brasiliensis is a well-studied model nematode parasite of rodents, which secretes products known to modulate host immunity.Entities:
Mesh:
Substances:
Year: 2004 PMID: 15186490 PMCID: PMC463072 DOI: 10.1186/gb-2004-5-6-r39
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Analysis of transcripts represented in conventional and oligo-capped cDNA libraries
| Conventional cDNA libraries | Oligo-capped cDNA library | |
| Total sequences providing peptide predictions | 734 | 500 |
| In-frame ATG followed by ≥ 99-nucleotide open reading frame (ORF) | 567 (77.2%) | 430 (86.0%) |
| Predicted ORF length (average) | 114.6 | 101.5 |
| % Signal peptide or signal anchor | SP: 74 (10.1%) | SP: 102 (20.4%) |
| SA: 16 (2.2%) | SA: 5 (1.0%) | |
| % Spliced leader | 0 | 37 (7.4%) |
Figure 1Similarity of N. brasiliensis ESTs to sequences from other nematodes. SimiTri [54] was used to plot 736 N. brasiliensis EST clusters against related species database entries. For each consensus sequence associated with the 736 Nippo clusters, a BLAST was performed against a series of different databases. Each tile in the graphic represents a unique consensus sequence and its relative position is computed from the raw BLAST scores derived above (with a cutoff of ≥ 50). Hence each tile's position shows its degree of sequence similarity to each of the three selected databases. Sequences showing similarity to only one database are not shown. Sequences showing sequence similarity to only two databases appear on the lines joining the two databases. Tiles are colored by their highest TBLASTX score to each of the databases: red ≥ 300; yellow ≥ 200; green ≥ 150, blue ≥ 100 and purple < 100. (a) SimiTri plot showing sequence similarity relationships between N. brasiliensis consensus sequences and database entries of Ancylostoma caninum/duodenale ESTs (20,177 entries, 386 hits), Haemonchus contortus ESTs (22,337 entries, 384 hits) and Teladorsagia circumcincta ESTs (5,300 entries, 264 hits). Database comparisons were performed using TBLASTX. (b) SimiTri plot showing sequence similarity relationships between N. brasiliensis consensus sequences and database entries of Necator americanus ESTs (4,821 entries, 244 hits), Teladorsagia circumcincta ESTs (5,300 entries, 264 hits), and C. elegans wormpep (21,600 entries, 466 hits). Database comparisons were performed using TBLASTX for N. americanus and T. circumcincta, while C. elegans wormpep comparions used BLASTX.
ESTs from adult cDNAs with known homologs, classified by function
| Cluster number | Conventional cDNAs | Oligo-capped cDNAs | Accession | Description | |
| NBC00018 | 2 | 0 | 1e-33 | S66528 | 26S proteinase regulatory complex, non-ATPase chain ( |
| NBC00030 | 2 | 0 | 8e-56 | U41556 | |
| NBC00086 | 1 | 0 | 3e-29 | A48454 | Cathepsin B-like cysteine proteinase ( |
| 5e-28 | D48435 | Cysteine proteinase AC-3 ( | |||
| NBC00168 | 1 | 0 | 2e-42 | NM_065563 | Calpain thiol protease ( |
| NBC00198 | 1 | 0 | 7e-60 | NM_073736 | Cysteine protease ( |
| NBC00204 | 3 | 0 | 2e-32 | NM_072733 | Protease ( |
| NBC00231 | 2 | 0 | 5e-90 | NM_064106 | Serine carboxypeptidase ( |
| NBC00307 | 1 | 0 | 2e-32 | NM_015277 | Ubiquitin-protein ligase NEDD4-like; neural precursor ( |
| NBC00311 | 1 | 0 | 5e-31 | NM_073736 | Cysteine protease ( |
| NBC00352 | 2 | 0 | 6e-31 | NM_065253 | Ubiquitin ( |
| NBC00348 | 1 | 0 | 2e-83 | A48145 | Ubiquitin-conjugating enzyme, UBC-2 ( |
| NBC00362 | 1 | 0 | 1e-76 | S17521 | Multicatalytic endopeptidase complex ( |
| NBC00368 | 1 | 0 | 9e-13 | LCE_ORYLA | Low choriolytic enzyme precursor ( |
| NBC00377 | 1 | 0 | 3e-75 | PSA4_CAEEL | Proteasome subunit, alpha type 4, PAS-3 ( |
| NBC00459 | 2 | 1 | 2e-26 | NM_072733 | Protease ( |
| NBC00469 | 1 | 0 | 7e-17 | NM_060215 | Zinc metalloprotease ( |
| NBC00509 | 1 | 1 | 4e-71 | AL161503 | Polyubiquitin, UBQ10 ( |
| NBC00664 | 0 | 1 | 5e-09 | NM_074798 | Cathepsin-like ( |
| NBC00670 | 0 | 1 | 3e-18 | S17435 | Polyubiquitin 6 ( |
| NBC00772 | 0 | 1 | 4e-24 | NM_003352 | Sentrin, ubiquitin-like small protein ( |
| NBC00783 | 0 | 1 | 2e-89 | U41556 | |
| NBC00828 | 0 | 1 | 9e-63 | NC_003424 | Pad1 protein; 26S proteasome subunit ( |
| NBC00045 | 2 | 0 | 2e-92 | NM_065870 | Fructose-biphosphate aldolase ( |
| NBC00049 | 1 | 0 | 9e-50 | NM_070783 | Lipase ( |
| NBC00066 | 2 | 1 | 7e-76 | NM_074348 | Peptidyl-prolyl |
| NBC00079 | 1 | 0 | 2e-35 | NM_058712 | Helicase ( |
| NBC00102 | 1 | 0 | 7e-37 | NM_074031 | Peroxidase-like ( |
| NBC00139 | 1 | 0 | 8e-29 | NM_060074 | Hexokinase ( |
| NBC00143 | 1 | 0 | 4e-66 | ADHX_MYXGL | Alcohol dehydrogenase class III ( |
| NBC00147 | 1 | 0 | 6e-19 | XM_087230 | Similar to Uridine phosphorylase (UDRPase) ( |
| NBC00157 | 1 | 0 | 3e-13 | XM_058660 | Similar to Protein tyrosine phosphatase 1E ( |
| NBC00173 | 1 | 0 | 5e-72 | AJ440747 | Protein disulphide isomerase 1 ( |
| NBC00183 | 1 | 0 | 3e-56 | T46280 | Isocitrate dehydrogenase, NADP+, cytosolic ( |
| NBC00189 | 1 | 0 | 1e-21 | XM_129069 | Similar to Acetyltransferase (GNAT) family ( |
| NBC00212 | 1 | 0 | 6e-57 | NM_016100 | N-terminal acetyltransferase complex ard1 subunit ( |
| NBC00283 | 1 | 0 | 4e-27 | NM_012088 | 6-phosphogluconolactonase ( |
| NBC00285 | 1 | 0 | 2e-47 | LDHA_ANGRO | L-lactate dehydrogenase A chain ( |
| NBC00290 | 1 | 0 | 3e-17 | I55976 | Dihydrolipoamide S-acetyltransferase ( |
| NBC00292 | 1 | 0 | 1e-40 | NM_006223 | Peptidyl-prolyl |
| NBC00304 | 1 | 0 | 4e-12 | NM_073341 | Glucose-1-dehydrogenase ( |
| NBC00309 | 1 | 0 | 1e-18 | NM_066225 | Hydroxymethylglutaryl-coA reductase ( |
| NBC00326 | 1 | 0 | 1e-65 | NM_065761 | Protein phosphatase 2A ( |
| NBC00337 | 1 | 0 | 2e-60 | GMD1_CAEEL | Probable GDP-mannose 4,6 dehydratase 1 ( |
| NBC00353 | 1 | 0 | 2e-56 | NM_065537 | ATP synthase B chain ( |
| NBC00378 | 1 | 0 | 2e-43 | NM_073253 | Acetyltransferase (GNAT) family ( |
| NBC00382 | 1 | 0 | 4e-49 | NM_063827 | Phospholipase A2 ( |
| NBC00389 | 2 | 0 | 1e-48 | NM_058626 | Phosphotransferase ( |
| NBC00404 | 1 | 0 | 2e-76 | NM_064078 | Glucosamine-fructose-6-phosphate aminotransferase ( |
| NBC00413 | 1 | 0 | 6e-22 | NM_078324 | AMP-activated protein kinase ( |
| NBC00427 | 1 | 0 | 2e-20 | NC_003423 | 3-oxoacyl-(acyl-carrier-protein)-synthase ( |
| NBC00475 | 1 | 0 | 3e-42 | NM_065313 | Serine/threonine protein phosphatase ( |
| NBC00483 | 1 | 0 | 4e-25 | NM_059984 | Phospholipase, similar to ADRAB-b ( |
| NBC00504 | 1 | 0 | 7e-65 | AF292096 | Protein kinase AIRK2 ( |
| NBC00508 | 1 | 2 | 5e-64 | PPCK_HAECO | Phosphoenolpyruvate carboxykinase ( |
| NBC00528 | 1 | 0 | 5e-66 | PPCK_HAECO | Phosphoenolpyruvate carboxykinase ( |
| NBC00561 | 0 | 7 | 1e-54 | NDKB_RAT | Nucleoside diphosphate kinase B ( |
| NBC00713 | 0 | 1 | 1e-08 | XM_140038 | Similar to tau-tubulin kinase ( |
| NBC00729 | 0 | 2 | 4e-21 | NM_079041 | Flap endonuclease 1 ( |
| NBC00743 | 0 | 1 | 3e-64 | G3P_BRUMA | Glyceraldehyde 3-phosphate dehydrogenase ( |
| NBC00745 | 0 | 1 | 4e-13 | NM_068436 | Casein kinase ( |
| NBC00689 | 0 | 3 | 2e-17 | CLYC_CAEEL | Serine hydroxymethyltransferase MEL-32 ( |
| NBC00696 | 0 | 2 | 2e-15 | NM_000414 | Hydroxysteroid (17-beta) dehydrogenase 4 ( |
| NBC00770 | 0 | 1 | 3e-45 | NM_066907 | Serine/threonine kinase, casein kinase-like ( |
| NBC00777 | 0 | 1 | 8e-21 | OAZ_PRIPA | Ornithine decarboxylase antizyme ( |
| NBC00796 | 0 | 1 | 8e-52 | XM_125017) | Putative lysophosphatidic acid acyltransferase ( |
| NBC00802 | 0 | 1 | 4e-49 | NM_078623 | Enoyl Coenzyme A hydratase, short chain 1 ( |
| NBC00056 | 1 | 0 | 4e-58 | NM_071024 | Actin depolymerizing factor ( |
| NBC00062 | 1 | 0 | 1e-11 | NM_006400 | Dynactin 2; dynactin complex 50 kD subunit; dynamitin ( |
| NBC00078 | 2 | 0 | 0 | NM_059538 | Calponin ( |
| NBC00097 | 1 | 0 | 1e-42 | MLR1_CAEEL | Myosin regulatory light chain 1 ( |
| NBC00142 | 1 | 0 | 2e-76 | S53776 | Beta-tubulin isotype I ( |
| NBC00172 | 2 | 0 | 0 | NM_073416 | Actin ( |
| NBC00224 | 1 | 0 | 2e-40 | NM_063850 | Troponin C ( |
| NBC00239 | 4 | 1 | 2e-39 | NM_077559 | Collagen ( |
| NBC00241 | 2 | 0 | 2e-47 | NM_069715 | Collagen ( |
| 6e-47 | NM_077291 | Cuticular collagen ( | |||
| NBC00246 | 1 | 1 | 3e-19 | NM_077087 | Troponin I ( |
| NBC00287 | 2 | 0 | 2e-61 | MLR1_CAEEL | Myosin regulatory light chain 1 ( |
| NBC00360 | 1 | 1 | 3e-30 | NM_145671 | Actinfilin ( |
| NBC00396 | 1 | 0 | 2e-67 | MYSP_CAEEL | Paramyosin ( |
| NBC00403 | 1 | 0 | 3e-32 | NM_077291 | Cuticular collagen ( |
| NBC00418 | 1 | 0 | 6e-27 | NM058881 | Calponin ( |
| NBC00430 | 1 | 0 | 3e-11 | NM_011722 | Dynactin 6; p27 dynactin subunit ( |
| NBC00526 | 1 | 0 | 2e-44 | NM_060857 | Profilin ( |
| NBC00552 | 0 | 1 | 9e-47 | MYSP_CAEEL | Paramyosin ( |
| NBC00569 | 0 | 1 | 1e-23 | NM_060369 | Alpha crystallin B chain ( |
| NBC00749 | 0 | 1 | 3e-43 | NM_060857 | Profilin ( |
| NBC00068 | 3 | 0 | 1e-25 | VIT5_CAEEL | Vitellogenin 5 precursor ( |
| NBC00161 | 1 | 0 | 2e-15 | VIT5_CAEEL | Vitellogenin 5 precursor ( |
| NBC00397 | 1 | 9 | 7e-61 | MS10_CAEEL | Major Sperm Protein 10 ( |
| NBC00523 | 1 | 0 | 4e-69 | XM_038960 | Similar to preimplantation protein 3 ( |
| NBC00585 | 0 | 5 | 2e-30 | NM_076467 | Vitellogenin ( |
| NBC00611 | 0 | 1 | 1e-25 | NM_060189 | Placental protein 11 ( |
| NBC00027 | 2 | 0 | 9e-17 | NM_062882 | Lectin, |
| 5e-15 | NM_076712 | Asialoglycoprotein receptor ( | |||
| NBC00110 | 1 | 0 | 4e-17 | NC_001263 | Acyl-CoA-binding protein ( |
| NBC00118 | 1 | 0 | 4e-41 | T31073 | Multidrug resistance P-glycoprotein ( |
| NBC00128 | 3 | 0 | 1e-92 | NM_067381 | ADP/ATP carrier protein/translocase ( |
| NBC00167 | 1 | 0 | 2e-12 | NM_130415 | Lysosomal amino acid transporter 1 ( |
| NBC00175 | 1 | 0 | 7e-15 | A48925 | Mannose receptor ( |
| NBC00319 | 1 | 0 | 8e-15 | NXT2_HUMAN | NTF2-related export protein 2 (p15-2 protein) ( |
| NBC00324 | 2 | 0 | 7e-15 | AJ243873 | Galectin ( |
| NBC00340 | 1 | 0 | 2e-61 | NM_077246 | Galectin ( |
| NBC00355 | 1 | 0 | 8e-21 | NM_059527 | Fatty acid-binding protein LBP-6 ( |
| NBC00363 | 1 | 0 | 6e-48 | NM_016208 | Vacuolar protein sorting 28 homolog ( |
| NBC00583 | 0 | 5 | 4e-35 | NM_065836 | Low density lipoprotein receptor ( |
| NBC00593 | 0 | 2 | 2e-26 | NM_059525 | Fatty acid-binding protein LBP-6 ( |
| NBC00752 | 0 | 1 | 3e-08 | NM_059071 | Acetylcholine receptor UNV-38 ( |
| NBC00766 | 0 | 1 | 7e-44 | POR2_MELGA | Voltage-dependent anion-selective channel protein 2 (VDAC-2) ( |
| NBC00808 | 0 | 1 | 6e-53 | NM_072174 | Calreticulin precursor ( |
| NBC00838 | 0 | 1 | 1e-78 | NM_063349 | T-complex protein, delta subunit (cytosolic chaperonin CCT-4) ( |
| NBC00207 | 1 | 0 | 0 | RAB2_LYMST | RAS-Related protein RAB-2 ( |
| NBC00252 | 1 | 0 | 8e-97 | NM_070558 | RAS-like GTP-binding protein RhoA ( |
| NBC00297 | 1 | 0 | 4e-17 | NM_009106 | Rhotekin ( |
| NBC00312 | 1 | 0 | 4e-46 | A35350 | Protein kinase C inhibitor ( |
| NBC00269 | 1 | 0 | 1e-43 | NM_058274 | RAS-related protein RAB-11 ( |
| NBC00282 | 1 | 0 | 9e-25 | NP_741191 | A kinase anchor protein 1 ( |
| NBC00395 | 1 | 0 | 2e-29 | NM_07328 | RAS-like GTP-binding protein ( |
| NBC00436 | 1 | 0 | 2e-44 | NM_070985 | Calmodulin ( |
| NBC00462 | 1 | 0 | 2e-13 | SSRP_DROME | Single-strand recognition protein (SSRP) (Chorion-factor 5) ( |
| NBC00409 | 1 | 0 | 1e-16 | NM_019746 | Programmed cell death 5/TFAR19 protein ( |
| NBC00440 | 1 | 0 | 3e-72 | S43599 | SNF5 homolog R07E5.3 ( |
| NBC00510 | 1 | 0 | 2e-28 | XM_129572 | Calcyclin ( |
| NBC00629 | 0 | 1 | 1e-20 | NM_026297 | RAB (RAS oncogene family-like 3) ( |
| NBC00648 | 0 | 1 | 3e-20 | NM_002624 | Prefoldin 5 isoform alpha; myc modulator-1; c-myc binding protein ( |
| NBC00727 | 0 | 1 | 3e-17 | AB091687 | TGF-beta induced apotosis protein 3 ( |
| NBC00768 | 0 | 1 | 3e-18 | NM_078471 | TGF-beta-1 induced anti-apoptotic factor 1 isoform 1 ( |
| NBC00829 | 0 | 1 | 1e-42 | A49146 | Developmental regulator WNT-4 ( |
| NBC00841 | 0 | 1 | 1e-31 | NM_012453 | Transducin (beta)-like 2, isoform 1 ( |
| NBC00024 | 1 | 0 | 1e-37 | NM_003752 | Eukaryotic translation initiation factor 3, subunit 8 ( |
| NBC00048 | 1 | 0 | 1e-28 | NM_069150 | Glycine-rich RNA-binding protein ( |
| 5e-21 | NM_007007 | Cleavage and polyadenylation specific factor 6 ( | |||
| NBC00050 | 1 | 0 | 2e-12 | HEXP_LEIMA | DNA-binding protein HEXBP (Hexamer-binding protein) ( |
| NBC00055 | 1 | 1 | 2e-24 | NM_060622 | RNA recognition motif (RRM, RBD, or RNP domain) ( |
| NBC00090 | 2 | 1 | 0 | NM_066119 | Elongation factor 1-alpha ( |
| NBC00099 | 1 | 0 | 2e-30 | NM_067248 | Splicing factor ( |
| NBC00170 | 1 | 0 | 2e-56 | NM_011304 | RuvB DNA helicase -like protein 2 ( |
| NBC00181 | 1 | 0 | 4e-13 | NM_001698 | AU RNA-binding protein/enoyl-Coenzyme A hydratase ( |
| NBC00192 | 1 | 0 | 2e-26 | NM_060622 | RNA recognition motif (RRM, RBD, or RNP domain) ( |
| NBC00210 | 1 | 0 | 3e-15 | NM_018403 | Transcription factor (SMIF gene) ( |
| NBC00267 | 1 | 0 | 4e-20 | T2EB_XENLA | Transcription initiation factor IIE, beta subunit ( |
| NBC00321 | 1 | 0 | 1e-16 | NM_033224 | Purine-rich element binding protein B ( |
| NBC00280 | 1 | 0 | 3e-58 | NM_006578 | Guanine nucleotide-binding protein, beta-5 subunit ( |
| NBC00350 | 1 | 0 | 6e-40 | DPOD_DROME | DNA polymerase delta catalytic subunit ( |
| NBC00366 | 2 | 0 | 6e-79 | NM_066119 | Elongation factor 1-alpha ( |
| NBC00370 | 1 | 0 | 1e-17 | NM_031992 | Eukaryotic translation initiation factor 4H, isoform 2 ( |
| NBC00374 | 1 | 2 | 2e-53 | NM_070415 | Elongation factor 1-beta/delta chain ( |
| NBC00480 | 1 | 0 | 3e-21 | NM_061014 | Regulator of chromosome condensation, RCC1 ( |
| NBC00543 | 0 | 2 | 5e-23 | NM_065536 | Zinc finger, C3HC4 type (RING finger) ( |
| NBC00577 | 0 | 7 | 2e-31 | NP_872244 | Translation elongation factor EFT-4 ( |
| NBC00600 | 0 | 1 | 3e-74 | NM_063406 | Initiation factor 5A ( |
| NBC00630 | 0 | 1 | 9e-39 | SFR4_MOUSE | Splicing factor, arginine/serine-rich 4 ( |
| NBC00764 | 0 | 1 | 4e-16 | XM_132357 | Similar to Translation Initiation factor EIF-2B alpha ( |
| NBC00776 | 0 | 1 | 6e-27 | SN2L_CAEEL | Potential global transcription activator SNF2L ( |
| NBC00791 | 0 | 1 | 5e-38 | NM_001207 | Basic transcription factor 3 ( |
| NBC00816 | 0 | 1 | 2e-24 | S3B2_HUMAN | Splicing factor 3B subunit 2 (Spliceosome associated protein 145) ( |
| NBC00025 | 1 | 0 | 3e-16 | AF352714 | HC40 putative secretory protein precursor (ASP homolog) ( |
| NBC00065 | 1 | 0 | 6e-20 | AA063577 | Secreted protein 5 precursor (ASP homolog) ( |
| NBC00095 | 1 | 0 | 8e-59 | GLB2_NIPBR | Myoglobin (body wall isoform globin) ( |
| NBC00103 | 1 | 0 | 9e-12 | DIM1_CAEEL | Protein dim-1 (2D-page protein spot 8) ( |
| NBC00029 | 1 | 0 | 5e-17 | NM_001545 | Immature colon carcinoma transcript 1 ( |
| NBC00141 | 1 | 0 | 2e-35 | NM_018984 | Slingshot 1 ( |
| NBC00160 | 1 | 0 | 5e-12 | NM_053810 | Synaptosomal-associated protein, 29kD ( |
| NBC00199 | 1 | 0 | 9e-39 | AF278538 | Nucleosome assembly protein 1 ( |
| NBC00256 | 2 | 0 | 2e-09 | NM_075227 | Transthyretin-like family ( |
| NBC00293 | 1 | 0 | 7e-08 | NC_003424 | F-box protein ( |
| NBC00399 | 1 | 0 | 2e-22 | NM_076443 | Calumenin, calcium-binding protein ( |
| NBC00429 | 1 | 0 | 4e-14 | XM_122362 | Chromobox homolog 2 ( |
| NBC00491 | 1 | 0 | 3e-21 | NM_076885 | Thrombospondin ( |
| NBC00518 | 1 | 0 | 3e-73 | T37461 | Mago nashi-like protein ( |
| NBC00544 | 0 | 1 | 2e-45 | NM_061213 | Alpha-2-macroglobulin family ( |
| NBC00560 | 0 | 1 | 1e-35 | NM_021305 | SEC61, alpha subunit 2 ( |
| NBC00705 | 0 | 1 | 3e-31 | DVA1_DICVI | DVA-1 nematode polyprotein allergen precursor (NPA) ( |
| 2e-12 | ABA1_ASCSU | ABA-1 nematode polyprotein allergen precursor (Body fluid allergen-1) ( | |||
| NBC00753 | 0 | 1 | 4e-10 | AF089728 | Ancylostoma-secreted protein 2 precursor, ASP-2 ( |
| NBC00755 | 0 | 1 | 2e-40 | TCPB_CAEEL | T-complex protein 1, beta subunit (CCT-beta) ( |
| NBC00757 | 0 | 1 | 2e-68 | 1432_SCHMA | 14-3-3 Protein homolog 2 (14-3-3-2) ( |
| NBC00803 | 0 | 1 | 3e-09 | ASP_ANCCA | Ancylostoma secreted protein ( |
| 3e-09 | AF079521 | Ancylostoma-secreted protein 1 precursor ( | |||
| NBC00827 | 0 | 1 | 3e-14 | NM_070108 | Testis-specific protein TPX-1 like ( |
The table gives, for each numbered cluster, the highest homolog with a functional description where available; in a number of cases a C. elegans homolog exists with a higher similarity, but has no description. Similarities to entries described as 'hypothetical proteins' are excluded, as are heat-shock proteins, cytochromes, mitochondrial and ribosomal products. Where C. elegans protein description is ambiguous (for example, protease, lectin), further descriptors added manually are italicized. Different clusters may derive from a single gene if sequences are non-overlapping; for example, NBC00198 and NBC00311 align to different segments of the C. elegans protease gene NM_073736. This table does not include N. brasiliensis gene products discovered previously and/or reported by other laboratories. All entries for this species are aggregated on the NEMBASE website.
Figure 2Proportion of ESTs predicted to encode signal sequences. (a) EST sequences were classified as conserved (similarities to non-nematode database entries), nematode-specific (similarities only to C. elegans or other nematode sequences), or novel (no similarities to existing entries), using a cutoff score of 80 in BLASTX (P < e-10). The number of ESTs bearing potential signal sequences was then calculated and the results are shown here. (b) Effects of relaxing cutoff scores on distribution of signal peptide-containing predicted gene products among conserved, nematode-specific and novel categories. Numbers of clusters in each category are given for cutoffs of 80 (P
ESTs from adult cDNAs with predicted amino-terminal signal peptides and with homologs in C. elegans
| Cluster | Score | Conventional cDNAs | Oligo-capped cDNAs | Wormpep ID | SignalP criteria | SignalP scores | Signal in | Description of | ||||
| C-p | Amino acids | SP-p | SP? | |||||||||
| NBC00012 | 86 | 6e-18 | 4 | 0 | CE20223 | YYYYS | 0.533 | 16 | 1.000 | Y | Y | Unknown (similar to NBC00237) |
| NBC00031 | 80 | 3e-16 | 2 | 2 | CE17924 | YYYYS | 0.932 | 18 | 0.999 | Y | Y | Unknown |
| NBC00237 | 84 | 5e-17 | 1 | 2 | CE20223 | YYYYS | 0.671 | 19 | 1.000 | Y | Y | Unknown (similar to NBC00012) |
| NBC00258 | 145 | 1e-35 | 1 | 0 | CE00133 | YYYYS | 0.524 | 19 | 0.999 | Y | Y | FAR-1 fatty acid/retinol-binding protein |
| NBC00266 | 129 | 6e-31 | 1 | 0 | CE19630 | YYYYS | 0.662 | 20 | 1.000 | Y | Y | Unknown |
| NBC00314 | 147 | 3e-36 | 1 | 1 | CE03639 | YYYYS | 0.708 | 19 | 0.987 | Y | Y | Transthyretin-like family |
| NBC00327 | 94 | 2e-20 | 1 | 0 | CE00906 | YYYYS | 0.542 | 25 | 0.998 | Y | Y | Unknown |
| NBC00336 | 138 | 2e-33 | 1 | 0 | CE23545 | YYYYS | 0.903 | 17 | 1.000 | Y | Y | Unknown |
| NBC00354 | 91 | 4e-21 | 4 | 0 | CE16530 | YYYYS | 0.511 | 17 | 0.943 | Y | Y | Unknown |
| NBC00472 | 215 | 8e-57 | 1 | 0 | CE04886 | YYYYS | 0.319 | 15 | 0.999 | Y | Y | Signal sequence receptor |
| NBC00487 | 55 | 7e-09 | 1 | 0 | CE05972 | YYYYS | 0.979 | 21 | 0.988 | Y | Y | Unknown |
| NBC00495 | 51 | 3e-07 | 1 | 1 | CE13171 | YYYYS | 0.566 | 19 | 0.999 | Y | Y | Transthyretin-like family |
| NBC00502 | 176 | 3e-45 | 1 | 0 | CE32298 | YYYYS | 0.634 | 20 | 1.000 | Y | Y | Ectonucleotide pyrophosphatase/phosphodiesterase |
| NBC00592 | 80 | 1e-15 | 0 | 3 | CE17924 | YYYYS | 0.920 | 16 | 1.000 | Y | Y | Unknown |
| NBC00606 | 81 | 4e-16 | 0 | 2 | CE02454 | YYYYS | 0.399 | 20 | 1.000 | Y | Y | Similar to |
| NBC00615 | 207 | 3e-54 | 0 | 1 | CE04533 | YYYYS | 0.995 | 18 | 1.000 | Y | Y | LBP-1 fatty acid-binding protein |
| NBC00616 | 61 | 3e-10 | 0 | 1 | CE20257 | YYYYS | 0.754 | 19 | 0.993 | Y | Y | Unknown |
| NBC00633 | 153 | 4e-38 | 0 | 1 | CE03639 | YYYYS | 0.450 | 17 | 1.000 | Y | Y | Transthyretin-like family |
| NBC00641 | 145 | 1e -35 | 0 | 1 | CE33289 | YYYYS | 0.219 | 19 | 0.930 | Y | Y | Unknown |
| NBC00643 | 102 | 2e-22 | 0 | 2 | CE27850 | YYYYS | 0.961 | 17 | 0.999 | Y | Y | Unknown |
| NBC00706 | 50 | 9e-07 | 0 | 1 | CE06014 | YYYYS | 0.466 | 20 | 1.000 | Y | Y | Unknown |
| NBC00720 | 12 | 3e-30 | 0 | 1 | CE16958 | YYYYS | 0.967 | 19 | 0.998 | Y | Y | NLP-13 neuropeptide |
| NBC00742 | 60 | 3e-10 | 0 | 1 | CE16731 | YYYYS | 0.880 | 21 | 0.993 | Y | Y | Unknown |
| NBC00748 | 50 | 4e-07 | 0 | 1 | CE02932 | YYYYS | 0.804 | 17 | 0.998 | Y | Y | Transthyretin-like family |
| NBC00767 | 79 | 7e-16 | 0 | 1 | CE31662 | YYYYS | 0.559 | 17 | 1.000 | Y | Y | Unknown |
| NBC00028 | 104 | 1e-23 | 1 | 1 | CE00431 | YYYYS | 0.731 | 18 | 0.999 | Y | N | Globin |
| NBC00124 | 128 | 8e-31 | 1 | 1 | CE00431 | YYYYS | 0.731 | 18 | 0.999 | Y | N | Globin |
| NBC00144 | 195 | 7e-51 | 1 | 0 | CE29663 | YYNYS | 0.866 | 19 | 0.963 | Y | N | Transport-secretion protein |
| NBC00197 | 143 | 8e-35 | 3 | 6 | CE00431 | YYYYS | 0.557 | 16 | 1.000 | Y | N | Globin |
| NBC00272 | 144 | 2e-35 | 1 | 0 | CE32475 | YYNYS | 0.262 | 22 | 0.513 | Y | N | Unknown |
| NBC00328 | 147 | 4e-36 | 3 | 4 | CE00431 | YYYYS | 0.523 | 17 | 0.999 | Y | N | Globin |
| NBC00581 | 122 | 7e-29 | 0 | 1 | CE00431 | YYYYS | 0.404 | 21 | 0.998 | Y | N | Globin |
| NBC00601 | 93 | 5e-20 | 0 | 1 | CE30218 | YYYYS | 0.535 | 34 | 0.944 | Y | N | Unknown |
| NBC00607 | 159 | 4e-40 | 0 | 1 | CE29597 | YYNYS | 0.529 | 18 | 0.786 | Y | N | Unknown |
Entries in table do not match numbers in Figure 2, which includes predicted signal anchors. SignalP criteria are C-score (raw cleavage site score); S-score (signal peptide score); Y-score (combined cleavage site score); mean S score; and assignation as signal peptide (S as in all entries above; otherwise A for signal anchor or N for neither). SignalP scores are as follows: C-p: probability of predicted cleavage site being correct; amino acids: length of predicted signal peptide in amino acids; SP-p: probability of existence of signal peptide; SP?: overall prediction for signal peptide. Note that NBC00028 is almost identical to the cuticular globin of N. brasiliensis (P51536), and NBC00197 and NBC00328 are closely related, whereas NBC0124 and NBC00581 are more similar to, but not identical to, the body-wall form of globin (P51535).