| Literature DB >> 18987005 |
Hiroyuki Wakaguri1, Yutaka Suzuki, Toshiaki Katayama, Shuichi Kawashima, Eri Kibukawa, Kazushi Hiranuka, Masahide Sasaki, Sumio Sugano, Junichi Watanabe.
Abstract
Full-Malaria/Parasites is a database for transcriptome studies of apicomplexa and other parasites, which is based on our original full-length cDNA sequences and physical cDNA clone resources. In this update, the database has been expanded to contain the shogun sequencing for the entire sequences of 14,818 non-redundant full-length cDNA clones from six apicomplexa parasites and 6.8 million of transcription start sites (TSS), both of which had been produced by novel protocols using the oligo-capping method and the Illumina GA sequencer. The former should be the ultimate data for exact annotation of the expressed genes, while the latter should be useful for ultra-deep expression analysis. Furthermore, we have launched Full-Arthropods, a full-length cDNA database for arthropods of medical importance. Full-Arthropods contains 50 343 one-pass sequences, 10 399 shotgun complete sequences and 22.4 million TSS tags in anopheles mosquitoes that transmit malaria, tsetse flies that transmit trypanosomiasis and dust mites that cause allergic dermatitis and bronchial asthma. By providing the largest integrated full-length cDNA data resources in the apicomplexa parasites as well as their vectors, Full-Malaria/Parasites and Full-Arthropods should help combat parasitic diseases. Full-Malaria/Parasites and Full-Arthropods are accessible from http://fullmal.hgc.jp/.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18987005 PMCID: PMC2686583 DOI: 10.1093/nar/gkn856
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Statistics of the 5′EST and Shotgun Sequences. (Panel A) Full-Parasites and (Panel B) Full-Arthropods
| Species | Stage | No. of ESTs | No. of shotgun clones Loci | No. of total tags | No. of putative assembles | No. of complete assembles | Coverage |
|---|---|---|---|---|---|---|---|
| Panel A | |||||||
| | Erythrocytic | 11 762 | 4229 | 25 866 778 | 1482 | 348 | X 39 |
| 2847 | 1239 | 330 | |||||
| | Erythrocytic | 13 501 | 3504 | 45 322 478 | 1871 | 1256 | X 247 |
| 2659 | 1439 | 1063 | |||||
| | Erythrocytic | 13 955 | 2892 | 31 444 398 | 1130 | 795 | X 417 |
| 2181 | 983 | 713 | |||||
| | Erythrocytic | 1275 | 678 | 30 791 291 | 211 | 138 | X 1541 |
| 573 | 178 | 126 | |||||
| | Tachyzoite | 9862 | 2213 | 16 742 565 | 1244 | 865 | X 239 |
| 1390 | 871 | 662 | |||||
| | Sporozoite | 11 873 | 1302 | 18 319 304 | 713 | 557 | X 249 |
| | 851 | 512 | 426 | ||||
| Total | 62 228 | 14 818 | 168 486 814 | 6651 | 3959 | ND | |
| 10 501 | 5222 | 3320 | |||||
| Panel B | |||||||
| | Larva | 12 590 | 4053 | 23 023 172 | ND | 2802 | X 115 |
| 2225 | 1054 | ||||||
| | Larva/pupa | 14 713 | 6346 | 29 343 254 | ND | 4973 | X 59 |
| | 2596 | 2062 | |||||
| | All stages | 23 040 | ND | ND | ND | ND | ND |
| Total | 50 343 | 10 399 | 52 366 426 | ND | 7775 | ND | |
| 4821 | 3116 |
Note that for Full-Anopheles, gap closure was not performed due to the lack of the reference genome information.
ND: not determined.
aNumber of assembles were counted for which an open reading frame of the amino acids of ≥100 aa was detected. Putative assembles contain gaps, while complete assemble do not (see the text).
bCoverage was calculated against the ‘sequence assembled gap closed’ population.
Statistics of the TSS tags
| Species | Stage | No. of TSS tags | No. of TSS positions | No. of represented genes |
|---|---|---|---|---|
| No. of total mapped TSS tags | ||||
| Tachyzoite | 6 801 945 | 104 926 | 5647 | |
| 2 739 596 | ||||
| Larva | 8 354 743 | 21 897 | 542 | |
| 97 395 | ||||
| Pupa | 5 734 822 | 129 706 | 1961 | |
| 1 519 515 | ||||
| Larva | 8 330 172 | 149 861 | ||
| 2 434 906 |
aNumber of nucleotides to which at least one TSS tags were mapped.
bNumber of annotated genes represented by TSS tags that were mapped to the genic region (−1 kb to the 3′-end) in T. gondii. Number of assembled cDNA sequences represented by TSS tags in A. stephensi and G. morsitans.
Figure 1.Screen shots of the Genome Browser (left panel) and annotation viewer (right panel). A purple square represents assembled complete cDNA sequences. Red and blue squares indicate TSS tags and shotgun tags, respectively. To search the database, specify the species and gene name/cDNA ID at the boxes in a green circle. Legends for coloring are described in Database Glossary (http://fullmal.hgc.jp/docs/glossary.html).