| Literature DB >> 17151081 |
Junichi Watanabe1, Hiroyuki Wakaguri, Masahide Sasaki, Yutaka Suzuki, Sumio Sugano.
Abstract
Comparasite is a database for comparative studies of transcriptomes of parasites. In this database, each data is defined by the full-length cDNAs from various apicomplexan parasites. It integrates seven individual databases, Full-Parasites, consisting of numerous full-length cDNA clones that we have produced and sequenced: 12,484 cDNA sequences from Plasmodium falciparum, 11,262 from Plasmodium yoelii, 9633 from Plasmodium vivax, 1518 from Plasmodium berghei, 7400 from Toxoplasma gondii, 5921 from Cryptosporidium parvum and 10,966 from the tapeworm Echinococcus multilocularis. Putatively counterpart gene groups are clustered and comparative analysis of any combination of six apicomplexa species is implemented, such as interspecies comparisons regarding protein motifs (InterPro), predicted subcellular localization signals (PSORT), transmembrane regions (SOSUI) or upstream promoter elements. By specifying keywords and other search conditions, Comparasite retrieves putative counterpart gene groups containing a given feature in common or in a species-specific manner. By enabling multi-faceted comparative analyses of genes of apicomplexa protozoa, monophyletic organisms that have evolved to diversify to parasitize various hosts by adopting complex life cycles, Comparasite should help elucidate the mechanism behind parasitism. Our full-length cDNA databases and Comparasite are accessible from http://fullmal.ims.u-tokyo.ac.jp.Entities:
Mesh:
Substances:
Year: 2006 PMID: 17151081 PMCID: PMC1781114 DOI: 10.1093/nar/gkl1039
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Statistics of the data contents for each of the individual full-length cDNA databases
| Species | Host | Stage | Library method | Number of cDNA sequenced | Number of RefFull | DB (URL) |
|---|---|---|---|---|---|---|
| Human | Erythrocytic, gametocyte | Oligo-capping | 12 484 | 1465 | Fullmal ( | |
| Human | Erythrocytic, gametocyte | Oligo-capping | 11 262 | 1566 | Fullmal ( | |
| Mouse | Erythrocytic, gametocyte | Oligo-capping | 9633 | 1206 | Fullmal ( | |
| Mouse | Erythrocytic, gametocyte | Oligo-capping | 1518 | 416 | Fullmal ( | |
| Mammals | Tachyzoite | Oligo-capping | 7400 | 762 | FullToxo ( | |
| Human/cow | Sporozoite | Oligo-capping | 5921 | 682 | FullCrypto ( | |
| Dog/fox | Larva | V-capping | 10 966 | ND | FullEchino ( |
Statistics of the number of RefFulls corresponded to the putative counterpart genes in each species
| Species | 5 | 4 | 3 | 2 | 1 | 0 | Total |
|---|---|---|---|---|---|---|---|
| 1 | 18 | 44 | 140 | 444 | 818 | 1465 | |
| 1 | 20 | 49 | 150 | 363 | 983 | 1566 | |
| 1 | 19 | 49 | 155 | 329 | 653 | 1206 | |
| 1 | 18 | 39 | 89 | 74 | 195 | 416 | |
| 1 | 17 | 30 | 41 | 85 | 588 | 762 | |
| 1 | 18 | 33 | 49 | 101 | 480 | 682 |
Number of RefFulls corresponded to putative counterpart genes with indicated number of species.
Figure 1Screen shots of the individual databases of the full-length cDNAs (A) and Comparasite (B).
Figure 2Example of comparison of putative promoter elements. (A) Positions of conserved promoter elements. The predicted positions of the indicated promoter elements in the promoters of putative orhthlogous gene group, XPF2n2934 (Pf), XPVi011046 (Pv), XPYw0792 (Py), BP114799 (Pb) are shown by blue boxes. (B) Sequence alignment of the surrounding regions of TSSs in Py and Pb genes. Predicted TATA boxes and identified TSSs are shown by blue and red boxes, respectively.
Number of annotation terms identified from the RefFulls in each species
| Annotations for RefFull (category) | Pf | Pv | Py | Pb | Tg | Cp |
|---|---|---|---|---|---|---|
| ‘Antigen’ (keyword) | 107 | 41 | 0 | 11 | 3 | 12 |
| ‘Transcription’ (keyword) | 42 | 16 | 0 | 3 | 2 | 9 |
| ‘Kinase’ (Pfam) | 116 | 38 | 41 | 0 | 0 | 0 |
| ‘Mitochondria’ (PSORT) | 81 | 117 | 49 | 31 | 67 | 19 |
| ‘Transmembrane domain’ (SOSUI) | 508 | 250 | 166 | 62 | 79 | 43 |
| ‘Transporter’ (GO term) | 131 | 92 | 60 | 27 | 30 | 20 |
| ‘TATA box’ (promoter) | 1157 | 601 | 530 | 227 | 42 | 233 |
| ‘TATA box; Pf’ (promoter) | 1328 | 1007 | 741 | 273 | 158 | 299 |
Number of putatively counterpart genes containing corresponding annotation terms in indicated number of species in common
| Annotations for RefFull (category) | 6 | 5 | 4 | 3 | 2 |
|---|---|---|---|---|---|
| ‘Antigen’ (keyword) | 0 | 0 | 4 | 14 | 56 |
| ‘Transcription’ (keyword) | 0 | 0 | 1 | 6 | 31 |
| ‘Kinase’ (Pfam) | 0 | 1 | 3 | 4 | 29 |
| ‘Mitochondria’ (PSORT) | 0 | 0 | 3 | 5 | 25 |
| ‘Transmembrane domain’ (SOSUI) | 0 | 4 | 3 | 21 | 108 |
| ‘Transporter’ (GO term) | 0 | 5 | 1 | 18 | 5 |
| ‘TATA box’ (promoter) | 0 | 2 | 14 | 87 | 443 |
| ‘TATA box; Pf’ (promoter) | 0 | 17 | 46 | 192 | 646 |