| Literature DB >> 21200421 |
Jonas Lundström1, Fernando Salazar-Anton, Ellen Sherwood, Björn Andersson, Johan Lindh.
Abstract
BACKGROUND: Neurocysticercosis is a disease caused by the oral ingestion of eggs from the human parasitic worm Taenia solium. Although drugs are available they are controversial because of the side effects and poor efficiency. An expressed sequence tag (EST) library is a method used to describe the gene expression profile and sequence of mRNA from a specific organism and stage. Such information can be used in order to find new targets for the development of drugs and to get a better understanding of the parasite biology. METHODS ANDEntities:
Mesh:
Year: 2010 PMID: 21200421 PMCID: PMC3006133 DOI: 10.1371/journal.pntd.0000919
Source DB: PubMed Journal: PLoS Negl Trop Dis ISSN: 1935-2727
Summary of T. solium ESTs.
| Description | Number | Percentage |
| Total number of sequenced clones | 5760 | |
| Total number of successful sequences | 5551 | 96.4%a |
| Number of high quality sequences | 4674 | 84.2%b |
| Unique sequences | 1650 | 35.3%c |
| Number of contigs | 434 | |
| Number of clones included in contigs | 3462 | 74.1%c |
| Average clones per contig | 7,9 | |
| Number of singletons | 1212 | 25.9% |
Percentages are calculated as, part of total number of ESTs (a), part of successful sequences (b) and part of number of high quality sequences (c).
Summary of the 25 most abundant ESTs and their putative identity.
| Contig. | Putative identity (BLASTX) | Length (nt) | No. of ESTs | Percentage (%) of high quality sequences. | Identified within other EST libraries from | Poly-A tail identified |
|
| gi|189235991|ref|XP_972419.2|PREDICTED: similar to DNA-J, putative | 1012 | 17 | 0,36% | + | − |
|
| gi|188485737|gb|ACD50951.1|Nc-DigChim-324430 [synthetic construct] | 1120 | 20 | 0,43% | + | − |
|
| gi|59709858|gb|AAW88559.1|oncosphere protein Tso31d [ | 1011 | 21 | 0,45% | + | + |
|
| gi|149364041|gb|ABR24229.1|gyceraldehyde-3-phosphate dehydrogenase [ | 1182 | 21 | 0,45% | + | + |
|
| Unknown 6 | 851 | 21 | 0,45% | + | − |
|
| gi|37778984|gb|AAP20152.1|alpha-actin protein [ | 931 | 22 | 0,47% | + | + |
|
| gi|221113094|ref|XP_002155286.1|PREDICTED: similar to Annexin-B12 | 1167 | 24 | 0,51% | + | + |
|
| gi|116687782|gb|AAT74668.2|cysteine-rich secreted protein 2 precursor | 878 | 24 | 0,51% | + | + |
|
| Unknown 5 | 667 | 25 | 0,53% | + | + |
|
| Unknown 4 | 1086 | 28 | 0,60% | + | − |
|
| Unknown 3 | 767 | 29 | 0,62% | + | + |
|
| Unknown 2 | 442 | 30 | 0,64% | + | + |
|
| gi|13539680|gb|AAK29203.1|AF225905_1ribosomal protein S15a [ | 982 | 34 | 0,75% | + | + |
|
| dbj|AB086256.1| | 983 | 35 | 0,75% | + | + |
|
| Unknown 1 | 991 | 37 | 0,79% | + | + |
|
| gi|158934366|emb|CAO82075.1|HP6 protein [ | 1111 | 39 | 0,83% | + | − |
|
| gi|2114399|gb|AAC47532.1|45W antigen ToW5/7 [ | 1088 | 40 | 0,86% | + | + |
|
| gi|56753429|gb|AAW24918.1|SJCHGC05540 protein [ | 965 | 58 | 1,24% | + | − |
|
| gi|37786712|gb|AAP47268.1| t24[ | 817 | 69 | 1,48% | + | − |
|
| gi|256050212|ref|XP_002569521.1|hypothetical protein [Schistosoma mansoni] | 813 | 70 | 1,50% | + | + |
|
| dbj|AB086256.1| | 1235 | 74 | 1,58% | + | − |
|
| gb|AAH30393.1| ATPase, H+ transporting, lysosomal V0 subunit B | 803 | 99 | 2,12% | + | − |
|
| dbj|BAD88768.1| tubulin | 1865 | 143 | 3,12% | + | + |
|
| gi|207298859|gb|ACI23578.1|beta-actin | 1251 | 214 | 4,58% | + | + |
|
| gi|117956206|gb|ABK58679.1|PHGPx isoform 1 | 1359 | 414 | 8,86% | + | + |
BLASTX was used with an E-value cut off <10−5.
*Putative signal peptide.
Figure 1Alignment of Tsol15 and T. ovis 45 ToW 5/7 amino acid sequences.
Alignment of Tsol15, T. solium (genbank accession no. GU338867) and 45 ToW 5/7, T. ovis (genbank accession no. gb|AAC47532.1|). Identical aminoacids are marked in grey. Numbers indicate the following amino acid number.
Figure 2Taxonomical categories.
EST submitted to Genbank NR database and categorized according to taxonomical closeness to T. solium. Scores with a value higher than E−5 were set as unknowns.
Figure 3Pie charts of 2nd level gene ontology (GO) terms.
Together, 620 unique ESTs and contigs were given a GO category. The three GO categories is presented include, Cellular components (A), Molecular functions (B) and Biological processes (C).