| Literature DB >> 21051343 |
Josef Tuda1, Arthur E Mongan, Mohammed E M Tolba, Mihoko Imada, Junya Yamagishi, Xuenan Xuan, Hiroyuki Wakaguri, Sumio Sugano, Chihiro Sugimoto, Yutaka Suzuki.
Abstract
Full-Parasites (http://fullmal.hgc.jp/) is a transcriptome database of apicomplexa parasites, which include Plasmodium and Toxoplasma species. The latest version of Full-Parasites contains a total of 105,786 EST sequences from 12 parasites, of which 5925 full-length cDNAs have been completely sequenced. Full-Parasites also contain more than 30 million transcription start sites (TSS) for Plasmodium falciparum (Pf) and Toxoplasma gondii (Tg), which were identified using our novel oligo-capping-based protocol. Various types of cDNA data resources were interconnected with our original database functionalities. Specifically, in this update, we have included two unique RNA-Seq data sets consisting of 730 million mapped RNA-Seq tags. One is a dataset of 16 time-lapse experiments of cultured bradyzoite differentiation for Tg. The other dataset includes 31 clinical samples of Pf. Parasite RNA was extracted together with host human RNA, and the extracted mixed RNA was used for RNA sequencing, with the expectation that gene expression information from the host and parasite would be simultaneously represented. By providing the largest unique full-length cDNA and dynamic transcriptome data, Full-Parasites is useful for understanding host-parasite interactions and will help to eventually elucidate how monophyletic organisms have evolved to become parasites by adopting complex life cycles.Entities:
Mesh:
Substances:
Year: 2010 PMID: 21051343 PMCID: PMC3013703 DOI: 10.1093/nar/gkq1111
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Statistics for TSS Seq tags
| Species | Strain | Stage | Total tags | Mapped TSS tags | TSS positions |
|---|---|---|---|---|---|
| Tg | RH | Tachyzoite | 6 801 945 | 2 591 387 | 85 750 |
| Tg | ME49 | Tachyzoite | 12 101 228 | 2 484 257 | 242 889 |
| Tg | ME49 | Bradyzoite | 8 418 271 | 357 792 | 67 091 |
| Pf | 3D7 | Erythrocyte | 4 870 527 | 673 313 | 239 284 |
Statistics for 5′-ESTs and complete cDNA sequences
| Species | 5′-ESTs | Completely sequenced cDNAs |
|---|---|---|
| Pf | 9937 | 348 |
| Pv | 9633 | 2041 |
| Py | 11 581 | 311 |
| Pb | 2047 | 329 |
| Cp | 10 110 | 1066 |
| Tg | 7398 | 1830 |
| Bb | 12 286 | n.d. |
| Be | 7767 | n.d. |
| Bc | 10 769 | n.d. |
| Nc | 3456 | n.d. |
| Et | 7362 | n.d. |
| Tp | 13 440 | n.d. |
The genome sequences used for cDNA mapping are shown in the Database Glossary. Bb, Babesia bovis; Be, Babesia equi; Bc, Babesia caballi; Nc, Neospora caninum; Et, Eimeria tenella; Tp, Theileria parva.
n.d., not determined.
Figure 1.Updated Browser of Full-Parasites. Screen shots of the Genome Browser (left panel) implemented with the TSS Viewer (middle panel), the Annotation Viewer (middle panel) and the Phylogenetic Analysis viewer (right panel). To search the database, specify the species and gene name/cDNA ID in the boxes indicated by the green circle at the top of the page (http://fullmal.hgc.jp/). To search for genes having particular evolutionary conservation patterns, specify the shape of the phylogenetic trees or base substitution rate at the top page of the Phylogenetic Analysis Viewer (http://fullmal.hgc.jp/cgi-bin/evolution.cgi). To search for genes having particular expression patterns, follow the link to the Dynamic RNA-Seq Viewer (http://fullmal.hgc.jp/cgi-bin/dynamic.cgi). These pages are also linked from the Annotation Viewer. Details of the search conditions, legends for coloring and items are described in the Database Glossary (http://fullmal.hgc.jp/docs/glossary.html).
Statistics for dynamic RNA-Seq tags
| Species | Data sets | Total mapped tags | Average frequency of parasite tags, % | Mapped tags | Average represented gene (>0 ppm) | Average represented gene (>1 ppm) | Average cSNPs detected |
|---|---|---|---|---|---|---|---|
| Tg | 16 | 234 121 334 | 2.1 | 4 076 308 | 4207 | 3409 | n.d |
| Human | 230 045 026 | 19 426 | 16 151 | n.d. | |||
| Pf | 31 | 501 067 025 | 2.3 | 11 071 543 | 2952 | 2917 | 161 |
| Human | 489 995 482 | 18 797 | 14 464 | n.d. |
acSNPs were detected using Genome Studio (Illumina) with default settings.
n.d., not determined.
Figure 2.Search Page and Results Page of the Dynamic RNA-Seq Viewer. The search page (left panel) and search results page (right panel) of the Dynamic RNA-Seq Viewer. To search the database, specify the search conditions or expression patterns at the top of the page of the Dynamic RNA-Seq viewer (http://fullmal.hgc.jp/cgi-bin/dynamic.cgi); also specify whether the expression patterns should be considered to be values relative to the indicated time point/patient or to be absolute tag counts and whether the expression patterns should show typical patterns or specified patterns.