Literature DB >> 18987005

Full-Malaria/Parasites and Full-Arthropods: databases of full-length cDNAs of parasites and arthropods, update 2009.

Hiroyuki Wakaguri1, Yutaka Suzuki, Toshiaki Katayama, Shuichi Kawashima, Eri Kibukawa, Kazushi Hiranuka, Masahide Sasaki, Sumio Sugano, Junichi Watanabe.   

Abstract

Full-Malaria/Parasites is a database for transcriptome studies of apicomplexa and other parasites, which is based on our original full-length cDNA sequences and physical cDNA clone resources. In this update, the database has been expanded to contain the shogun sequencing for the entire sequences of 14,818 non-redundant full-length cDNA clones from six apicomplexa parasites and 6.8 million of transcription start sites (TSS), both of which had been produced by novel protocols using the oligo-capping method and the Illumina GA sequencer. The former should be the ultimate data for exact annotation of the expressed genes, while the latter should be useful for ultra-deep expression analysis. Furthermore, we have launched Full-Arthropods, a full-length cDNA database for arthropods of medical importance. Full-Arthropods contains 50 343 one-pass sequences, 10 399 shotgun complete sequences and 22.4 million TSS tags in anopheles mosquitoes that transmit malaria, tsetse flies that transmit trypanosomiasis and dust mites that cause allergic dermatitis and bronchial asthma. By providing the largest integrated full-length cDNA data resources in the apicomplexa parasites as well as their vectors, Full-Malaria/Parasites and Full-Arthropods should help combat parasitic diseases. Full-Malaria/Parasites and Full-Arthropods are accessible from http://fullmal.hgc.jp/.

Entities:  

Mesh:

Substances:

Year:  2008        PMID: 18987005      PMCID: PMC2686583          DOI: 10.1093/nar/gkn856

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Malaria and other parasites that belong to the phylum, apicomplexa, are causative agents of worldwide health problems that need immediate actions based on scientific investigation. Genome researches have been enthusiastically pursued during the last decade and the entire genome sequences of various malaria species, such as Plasmodium falciparum, P. vivax and P. yoelii, have been determined (1,2). Meanwhile we have constructed full-length cDNA libraries and collected cDNA sequences from P. falciparum, P. yoelii, P. vivax, P. berghei, Toxoplasma gondii, Cryptosporidium parvum and Echinococcus multilocularis. Each of the libraries was constructed using our original oligo-capping method. The obtained cDNA information together with physical cDNA clones have been made publicly available from our database Full-Malaria (http://fullmal.hgc.jp) (3). Also, we have published a database, Comparasite, in which the cDNA information was integrated and the comparative genomic studies between different species are enabled. Although the partial cDNA sequences (ESTs) are powerful to identify the presence of the transcripts at the corresponding genomic region or to determine the transcriptional start sites (TSS) (4,5), a serious drawback is that EST data is insufficient for determination of precise protein-coding regions. However, sequencing in entirety has been very costly so far and the available data are still scarce in parasites. Indeed, we have had to use RefFull sequences, which were virtual hybrid of the ESTs and the predicted gene models, for annotations. Consequently, the current dataset inevitably contains intrinsic errors in the annotated gene models, which were recently reported to be quite error prone, especially when they were not supported by any cDNA information. Newly developed massively parallel sequencing technologies, such as Roche GS20, Illumina GA and ABI SOLiD sequencer systems (6), have drastically reduced the sequencing cost per base (7). We have recently developed a new procedure to determine the entire sequences of full-length cDNAs by shotgun method using the Illumina GA platform with a cost less than a dollar per clone. We have produced 168 million shotgun sequence tags for 14 818 cDNAs sequences that represent expressed genes of apicomplexa parasites, corresponding to a significant population of the annotated gene models in each of the parasites. Based on these complete sequences, related annotations and inter-species comparisons have also been updated. This is the culmination of full-length cDNA analysis of parasites. On the other hand, we have recently developed a method to generate numerous TSS tags, which are short sequences immediately downstream of the TSSs, by combining our oligo-capping method and Illumina GA technology (4). In this update, we included the 6.8 million TSS tags collected from the tachyzoite stage parasite of T. gondii, which is a dramatic increase from the previous TSS data provided by Sanger sequencing of full-length clones. Combination of these two modalities can also be applied to host cells and revolutionize the study of parasitism. Furthermore, we have applied a similar approach to Anopheles stephensi that transmits malaria (8), Glossina morsitans that transmits trypanosomiasis (9) and Dermatophagoides farinae that causes various allergy including atopic dermatitis and bronchial asthma (10). The newly developed database, Full-Arthropods now contains 50 343 ESTs, 10 399 shotgun sequences and 22.4 million TSS tags. Unlike parasites, those arthropods have multi-cellular bodies with widely differentiated cell types and their genomes are far more complex with more genes. Many of the genes are expressed by alternative splicing, thus, cDNA analyses should be even more valuable. We expect that integrative knowledge from both causative parasites and vector insects should be indispensable to eventually develop an effective way to prevent infectious diseases mediated by them. Full-Parasites/Comparasite and Full-Arthropods are accessible from (http://fullmal.hgc.jp/).

DATA PRODUCTION

Shotgun sequencing of the cDNAs

We generated a non-redundant cDNA set for the shotgun sequencing from our EST resources; from 11 762 P. falciparum ESTs, 13 5012 P. vivax ESTs, 13 955 P. yoelii ESTs, 1275 P. berghei ESTs, 9682 T. gondii ESTs and 11 873 C. parvum ESTs. Respectively, 4229, 3504, 2892, 678, 2213, 1302 ESTs were selected for shotgun sequencing. Selected cDNAs were combined in a group of 800 per lane and sequenced using the Illumina Solexa platform. In total, 36-base 168 million short read sequence tags were generated for 14 818 cDNAs with the coverage of more than X 40. Those sequence tags were mapped to the respective genome sequences using ELAND (a part of the Illumina suite). Briefly, assembling of the obtained sequences were performed as follows: (i) sequences were mapped to the reference genome; (ii) scaffolds were made when the coverage per base was equal or greater than 10 and (iii) introns were identified so, when the sequentially separated (from 16 + 20 base to 20 + 16 base) short read sequences were mapped to two adjacent scaffolds. Further details of the protocol for the genome-assisted short read sequence assembly and its evaluation will be published elsewhere. For generating the ‘gap closed cDNA’ dataset, remaining gaps were tentatively closed either by the genome sequences or by skipping the gap regions (gap closure type 1 and type 2, respectively). Similarly, for Full-Arthropods, 4053 and 6346 cDNAs were selected from 12 590 of A. stephensi ESTs and 14 713 G. morsitans ESTs. For these ESTs, 52 million 48-bp short read sequence tags were generated. As the genome sequences of A. stephensi and G. morsitans are not available yet, we assembled the short reads sequences using Velvet (a de novo short tag assembler) (11). For these, gap closure was not performed due to the lack of the genome sequences and genome mapping information. We further mapped the assembled sequences to the genomes of Anopheles gambiae and Drosophila melanogaster to correlate them with the genome browser. Further details will be published elsewhere. Statistics of the shotgun assembling are shown in Table 1.
Table 1.

Statistics of the 5′EST and Shotgun Sequences. (Panel A) Full-Parasites and (Panel B) Full-Arthropods

SpeciesStageNo. of ESTsNo. of shotgun clones LociNo. of total tagsNo. of putative assemblesa LociNo. of complete assemblesa LociCoverageb
Panel A
    Plasmodium falciparumErythrocytic11 762422925 866 7781482348X 39
28471239330
    Plasmodium vivaxErythrocytic13 501350445 322 47818711256X 247
265914391063
    Plasmodium yoeliiErythrocytic13 955289231 444 3981130795X 417
2181983713
    Plasmodium bergheiErythrocytic127567830 791 291211138X 1541
573178126
    Toxoplasma gondiiTachyzoite9862221316 742 5651244865X 239
1390871662
    CryptosporidiumSporozoite11 873130218 319 304713557X 249
        parvum851512426
Total62 22814 818168 486 81466513959ND
10 50152223320
Panel B
    Anopheles stephensiLarva12 590405323 023 172ND2802X 115
22251054
    Glossina morsitansLarva/pupa14 713634629 343 254ND4973X 59
        farinae25962062
    DermatophagoidesAll stages23 040NDNDNDNDND
Total50 34310 39952 366 426ND7775ND
48213116

Note that for Full-Anopheles, gap closure was not performed due to the lack of the reference genome information.

ND: not determined.

aNumber of assembles were counted for which an open reading frame of the amino acids of ≥100 aa was detected. Putative assembles contain gaps, while complete assemble do not (see the text).

bCoverage was calculated against the ‘sequence assembled gap closed’ population.

Statistics of the 5′EST and Shotgun Sequences. (Panel A) Full-Parasites and (Panel B) Full-Arthropods Note that for Full-Anopheles, gap closure was not performed due to the lack of the reference genome information. ND: not determined. aNumber of assembles were counted for which an open reading frame of the amino acids of ≥100 aa was detected. Putative assembles contain gaps, while complete assemble do not (see the text). bCoverage was calculated against the ‘sequence assembled gap closed’ population.

Functional annotations of the full-length cDNAs

For both Full-Parasites and Full-Arthropods, computational genome annotations were carried out based on the assembled full-length cDNA sequences. Putative protein-coding regions (CDSs) were identified as the longest open reading frame in the cDNA sequences. Deducted amino acid sequences, i.e. ORFs (≥100 aa) were used for the annotations. Currently annotations include; protein motif search (InterPro: http://www.ebi.ac.uk/interpro/ and Pfam: http://www.sanger.ac.uk/Software/Pfam/), GO term assignment (http://www.geneontology.org/index.shtml), hydropathy plot [using the standard protocol; see reference (3)], predictions of subcellular localization signals (PSORT: http://psort.hgc.jp/) and transmembrane domains (SOSUI: http://sosui.proteome.bio.tuat.ac.jp/sosuiframe0.html.). For details in functional annotation procedures, cut-offs and other parameters/criteria, see our web site (http://fullmal.hgc.jp/comparas/Glossary.htm).

Generation of the TSS tags from Toxoplasma, malaria mosquito and Tsetse fly

TSS tags were generated from T. gondii (tachyzoite), A. stephensi (L1–L4 larvae) and G. morsitans (larva and pupa) using Illumina GA. Briefly, the 5′- and 3′-adaptor sequences necessary for the Illumina GA sequencing were introduced as the 5′-end oligo at the RNA ligation and as the random hexamer primer at the first strand cDNA synthesis, respectively. Details of the experimental procedure to generate TSS tags will be published elsewhere. The generated short tag reads were mapped to the respective genomes using ELAND in T. gondii and BLASTN in A. stephensi and G. morsitans. For the latter, TSS tags were also mapped to the assembled cDNA sequences. The position to which the 5′-end of the Illumina GA sequence tag was mapped was defined as a putative TSS. Statistics of the TSS tags are shown in Table 2.
Table 2.

Statistics of the TSS tags

SpeciesStageNo. of TSS tagsNo. of TSS positionsaNo. of represented genesb
No. of total mapped TSS tags
Toxoplasma gondiiTachyzoite6 801 945104 9265647
2 739 596
Anopheles stephensiLarva8 354 74321 897542
97 395
Pupa5 734 822129 7061961
1 519 515
Glossina morsitansLarva8 330 172149 861
2 434 906

aNumber of nucleotides to which at least one TSS tags were mapped.

bNumber of annotated genes represented by TSS tags that were mapped to the genic region (−1 kb to the 3′-end) in T. gondii. Number of assembled cDNA sequences represented by TSS tags in A. stephensi and G. morsitans.

Statistics of the TSS tags aNumber of nucleotides to which at least one TSS tags were mapped. bNumber of annotated genes represented by TSS tags that were mapped to the genic region (−1 kb to the 3′-end) in T. gondii. Number of assembled cDNA sequences represented by TSS tags in A. stephensi and G. morsitans.

DATABASE DESCRIPTIONS

Full-Parasites/Comparasite with the shotgun sequence viewer and the TSS viewer

Taking advantage of our unique full-length cDNAs sequence data, various kinds of transcripts-based annotations are enabled in Full-Parasites. The newly assembled complete sequences of the full-length cDNAs appear as a new track in addition to the original tracks in the genome viewer (left panel, Figure 1). As shown in Table 1, number of currently represented shotgun complete sequences is 14 818 (10 501 loci) in total. Of these, amino acids (≥100 aa) were successfully generated for 6651 cases (5222 loci). Considering that estimated total numbers of genes are approximately 5000–8000 in apicomplexa parasites, significant part of the genes is represented in our database as the entire sequence of full-length cDNAs.
Figure 1.

Screen shots of the Genome Browser (left panel) and annotation viewer (right panel). A purple square represents assembled complete cDNA sequences. Red and blue squares indicate TSS tags and shotgun tags, respectively. To search the database, specify the species and gene name/cDNA ID at the boxes in a green circle. Legends for coloring are described in Database Glossary (http://fullmal.hgc.jp/docs/glossary.html).

Screen shots of the Genome Browser (left panel) and annotation viewer (right panel). A purple square represents assembled complete cDNA sequences. Red and blue squares indicate TSS tags and shotgun tags, respectively. To search the database, specify the species and gene name/cDNA ID at the boxes in a green circle. Legends for coloring are described in Database Glossary (http://fullmal.hgc.jp/docs/glossary.html). Overall contents and functionality of the viewer remained unchanged from the previous version. Currently attached functional annotations include: (i) the protein motif and GO terms, identified by InterProScan and Pfam; (ii) the subcellular localization signals, predicted by PSORT; (iii) hydropathy plot and (iv) transmembrane domain predicted by SOSUI. Comparasite, which is a database for the comparative studies of the parasites, was updated accordingly. Similar to the previous versions, the user can search the respective databases by inputting keywords (cDNA/gene ID), genomic positions, presence or absence of the various kinds of annotation features attached to annotated gene models or newly assembled complete full-length cDNA sequences. A new feature of the database is the ‘assemble viewer’ (blue squares, Figure 1). The viewer is linked to each of the shotgun sequences. As an inevitable attribute to the data obtained by the shotgun approach, some of the assembled sequences contain gap or incompletely assembled part. The assemble viewer was constructed so that the user can empirically understand the quality of the assemblies which were used for the annotations. Every nucleotide was heat-map colored according to the coverage of the shotgun sequences. At gaps, users can select whether the open reading frame is generated by filling the gap with the genome sequence (type 1 gap closure; see the database) or by skipping the gap so that the downstream extension of the reading frame becomes the largest (type 2 gap closure; see the database). Another new feature of the database is the TSS viewer (red squares, Figure 1). The TSS viewer illustrates how the collected TSS tags are distributed along the genome. This viewer is embedded in the main part of the genome browser, supporting smooth zooming in/out. The user can easily browse and empirically understand where the TSSs are located from the genome-wide view to the single base resolution. This is the largest dataset of TSSs collected from apicomplexa parasites. Those TSS tags information should be also useful to define the exact gene boundaries or to identify hitherto overlooked transcripts. In addition, exact TSS could not be identified by standard gene predictions, which generally predict protein-coding regions. In addition to the main genome browser, details of the assembled tag information and TSS tag information are also presented in the annotation viewer.

Full-Arthropods

Full-Arthropods is the counterpart database of Full-Parasites in vector insects. Full-Arthropods is also based on our original full-length cDNAs resources, that is, 5′-EST, shotgun sequences and TSS tag information. The basic concept of the database is the same as that of Full-Parasites. Above mentioned functionalities of the database are also implemented in Full-Arthropods. As shown in Table 1, complete sequencing of 4053 cDNAs (2225 loci) and 6346 cDNAs (2596 loci) from malaria mosquito, A. stephensi and tsetse fly, G. morsitans are presented, respectively. Of these, amino acids (≥100 aa) were successfully deduced in 2802 (1054 loci) and 4973 cases (2062 loci), respectively. In addition, 8.4 million, 5.6 million and 8.3 million TSS tags generated from A. stephensi larvae and larva and pupa stages G. morsitans are presented, respectively. Similar to Full-Parasites, this information can be viewed by the assemble viewer and the TSS tag viewer. As the genomic sequences of those insects mostly remain in the stage of a very rough draft, we could not develop counterpart database of the Comparasite in this update. Also, in the current genome viewer, positional information of cDNAs is represented against the genomes of A. gambiae and D. melanogaster, thus, their exon–intron structures are not always correctly represented. However, on release of the genomic information, this part should be immediately incorporated to the current database in full.

Search example

For an example of the search using assemble viewer, follow the link as follows (Figure 1): Full-Parasites top; select the species, Toxoplasma and specify the ‘Annotated gene ID’ as ‘52m00006’ (in ‘Search Box’ shown in Figure 1). For a search example in Full-Arthropods, select the species, G. morsitans and specify the ‘Annotated gene ID as’ ‘NM_057259’. Particularly in the former cases, the represented Toxoplasma gene seems to have two alternative promoters, both of which are supported by ESTs and TSS tags. Since there is almost no report describing the presence of the alternative promoters in parasites, it is interesting to further investigate its biological relevance.

Glossary, data and clone repository

A detailed user manual and used technical terms, definitions and parameters for the annotations are described in ‘Glossary and Experimental Procedure’ sections in our web sites (http://fullmal.hgc.jp/docs/glossary.html; http://fullmal.hgc.jp/docs/procedure.html). The user can follow the links to further detailed information from each item displayed there. Statistics of the current database is also presented in Statistics section (http://fullmal.hgc.jp/docs/statistics.html). All of the short read sequences used for the database have been deposited to NCBI Short Read Archives (http://www.ncbi.nlm.nih.gov/Traces/sra/sra.cgi?) under the accession numbers of SRA002052–SRA002063. Also, cDNA clones registered in the database are freely available and should serve as indispensable resources to explore functions of genes to combat the relevant diseases.

CONCLUSIONS AND FUTURE PERSPECTIVES

Here, we introduce the update of our Full-Parasite/Comparasites databases with the extensive data of shotgun sequencing of full-length cDNAs and TSS tags. To visualize newly generated short read sequences, we implemented the assemble viewer and the TSS viewer. We also launched a brand new database, Full-Arthropods in which equivalent amount of full-length cDNA data is retrievable with similar database functions. New libraries from other species, including Eimeria, Theileria and Babesia, which are all parasitic species representing additional three genera in the phyla of apicomplexa, have been constructed and data production has been started. Further enrichment of the cDNA data from all the available stages of life cycles are also contemplated. Based on the firm evidence of the physical cDNA clones, our database should be unique from other parasite database, such as PlasmoDB (http://www.plasmodb.org/plasmo/home.jsp), CryptoDB (http://cryptodb.org/cryptodb/), ToxoDB (http://www.toxodb.org/toxo-release4-0/home.jsp) and Vector-Base (http://www.vectorbase.org/index.php). It should also be noted that these clones can be used for detailed experimental validation of gene functions. In this release, a significant population of the short-read sequence tags could not be used for the assembling and the TSS detection (Tables 1 and 2). Further refinement of the bioinformatic tools should help extensive curation. On the other hand, efforts for sequencing currently unavailable genomes are underway. Once they become available, both the fidelity and the coverage of the database contents will be improved. With further enhanced functions as well as improved reliability of individual data, our database will allow us to understand how the apicomplexa parasites interact with vector and/or host transcriptomes and realize such complex life cycles using a limited number of genes.

FUNDING

This database has been constructed and maintained by a Grant-in-Aid for Publication of Scientific Research Results from Japan Society for Promotion of Science. Funding for open access charge: KAKENHI (Grant-in-Aid for Scientific Research) on Priority Areas “comparative Genomics” from the Ministry of Education, Culture, Sports, Science and Technology of Japan. Conflict of interest statement. None declared.
  11 in total

1.  Construction of a full-length enriched and a 5'-end enriched cDNA library using the oligo-capping method.

Authors:  Yutaka Suzuki; Sumio Sugano
Journal:  Methods Mol Biol       Date:  2003

2.  African labs win major role in tsetse-fly genome project.

Authors:  Declan Butler
Journal:  Nature       Date:  2004-01-29       Impact factor: 49.962

Review 3.  Whole-genome re-sequencing.

Authors:  David R Bentley
Journal:  Curr Opin Genet Dev       Date:  2006-10-18       Impact factor: 5.578

4.  Velvet: algorithms for de novo short read assembly using de Bruijn graphs.

Authors:  Daniel R Zerbino; Ewan Birney
Journal:  Genome Res       Date:  2008-03-18       Impact factor: 9.043

5.  Genome sequencing. New tricks of tick-borne pathogen.

Authors:  A G Barbour; W R Zückert
Journal:  Nature       Date:  1997-12-11       Impact factor: 49.962

6.  The genome sequence of the malaria mosquito Anopheles gambiae.

Authors:  Robert A Holt; G Mani Subramanian; Aaron Halpern; Granger G Sutton; Rosane Charlab; Deborah R Nusskern; Patrick Wincker; Andrew G Clark; José M C Ribeiro; Ron Wides; Steven L Salzberg; Brendan Loftus; Mark Yandell; William H Majoros; Douglas B Rusch; Zhongwu Lai; Cheryl L Kraft; Josep F Abril; Veronique Anthouard; Peter Arensburger; Peter W Atkinson; Holly Baden; Veronique de Berardinis; Danita Baldwin; Vladimir Benes; Jim Biedler; Claudia Blass; Randall Bolanos; Didier Boscus; Mary Barnstead; Shuang Cai; Angela Center; Kabir Chaturverdi; George K Christophides; Mathew A Chrystal; Michele Clamp; Anibal Cravchik; Val Curwen; Ali Dana; Art Delcher; Ian Dew; Cheryl A Evans; Michael Flanigan; Anne Grundschober-Freimoser; Lisa Friedli; Zhiping Gu; Ping Guan; Roderic Guigo; Maureen E Hillenmeyer; Susanne L Hladun; James R Hogan; Young S Hong; Jeffrey Hoover; Olivier Jaillon; Zhaoxi Ke; Chinnappa Kodira; Elena Kokoza; Anastasios Koutsos; Ivica Letunic; Alex Levitsky; Yong Liang; Jhy-Jhu Lin; Neil F Lobo; John R Lopez; Joel A Malek; Tina C McIntosh; Stephan Meister; Jason Miller; Clark Mobarry; Emmanuel Mongin; Sean D Murphy; David A O'Brochta; Cynthia Pfannkoch; Rong Qi; Megan A Regier; Karin Remington; Hongguang Shao; Maria V Sharakhova; Cynthia D Sitter; Jyoti Shetty; Thomas J Smith; Renee Strong; Jingtao Sun; Dana Thomasova; Lucas Q Ton; Pantelis Topalis; Zhijian Tu; Maria F Unger; Brian Walenz; Aihui Wang; Jian Wang; Mei Wang; Xuelan Wang; Kerry J Woodford; Jennifer R Wortman; Martin Wu; Alison Yao; Evgeny M Zdobnov; Hongyu Zhang; Qi Zhao; Shaying Zhao; Shiaoping C Zhu; Igor Zhimulev; Mario Coluzzi; Alessandra della Torre; Charles W Roth; Christos Louis; Francis Kalush; Richard J Mural; Eugene W Myers; Mark D Adams; Hamilton O Smith; Samuel Broder; Malcolm J Gardner; Claire M Fraser; Ewan Birney; Peer Bork; Paul T Brey; J Craig Venter; Jean Weissenbach; Fotis C Kafatos; Frank H Collins; Stephen L Hoffman
Journal:  Science       Date:  2002-10-04       Impact factor: 47.728

7.  Genome sequence of the human malaria parasite Plasmodium falciparum.

Authors:  Malcolm J Gardner; Neil Hall; Eula Fung; Owen White; Matthew Berriman; Richard W Hyman; Jane M Carlton; Arnab Pain; Karen E Nelson; Sharen Bowman; Ian T Paulsen; Keith James; Jonathan A Eisen; Kim Rutherford; Steven L Salzberg; Alister Craig; Sue Kyes; Man-Suen Chan; Vishvanath Nene; Shamira J Shallom; Bernard Suh; Jeremy Peterson; Sam Angiuoli; Mihaela Pertea; Jonathan Allen; Jeremy Selengut; Daniel Haft; Michael W Mather; Akhil B Vaidya; David M A Martin; Alan H Fairlamb; Martin J Fraunholz; David S Roos; Stuart A Ralph; Geoffrey I McFadden; Leda M Cummings; G Mani Subramanian; Chris Mungall; J Craig Venter; Daniel J Carucci; Stephen L Hoffman; Chris Newbold; Ronald W Davis; Claire M Fraser; Bart Barrell
Journal:  Nature       Date:  2002-10-03       Impact factor: 49.962

8.  Genome sequence and comparative analysis of the model rodent malaria parasite Plasmodium yoelii yoelii.

Authors:  Jane M Carlton; Samuel V Angiuoli; Bernard B Suh; Taco W Kooij; Mihaela Pertea; Joana C Silva; Maria D Ermolaeva; Jonathan E Allen; Jeremy D Selengut; Hean L Koo; Jeremy D Peterson; Mihai Pop; Daniel S Kosack; Martin F Shumway; Shelby L Bidwell; Shamira J Shallom; Susan E van Aken; Steven B Riedmuller; Tamara V Feldblyum; Jennifer K Cho; John Quackenbush; Martha Sedegah; Azadeh Shoaibi; Leda M Cummings; Laurence Florens; John R Yates; J Dale Raine; Robert E Sinden; Michael A Harris; Deirdre A Cunningham; Peter R Preiser; Lawrence W Bergman; Akhil B Vaidya; Leo H van Lin; Chris J Janse; Andrew P Waters; Hamilton O Smith; Owen R White; Steven L Salzberg; J Craig Venter; Claire M Fraser; Stephen L Hoffman; Malcolm J Gardner; Daniel J Carucci
Journal:  Nature       Date:  2002-10-03       Impact factor: 49.962

9.  Comparasite: a database for comparative study of transcriptomes of parasites defined by full-length cDNAs.

Authors:  Junichi Watanabe; Hiroyuki Wakaguri; Masahide Sasaki; Yutaka Suzuki; Sumio Sugano
Journal:  Nucleic Acids Res       Date:  2006-12-06       Impact factor: 16.971

10.  DBTSS: database of transcription start sites, progress report 2008.

Authors:  Hiroyuki Wakaguri; Riu Yamashita; Yutaka Suzuki; Sumio Sugano; Kenta Nakai
Journal:  Nucleic Acids Res       Date:  2007-10-16       Impact factor: 16.971

View more
  12 in total

1.  Comparative fitness assessment of Anopheles stephensi transgenic lines receptive to site-specific integration.

Authors:  D A Amenya; M Bonizzoni; A T Isaacs; N Jasinskiene; H Chen; O Marinotti; G Yan; A A James
Journal:  Insect Mol Biol       Date:  2010-01-24       Impact factor: 3.585

2.  New insights into the blood-stage transcriptome of Plasmodium falciparum using RNA-Seq.

Authors:  Thomas D Otto; Daniel Wilinski; Sammy Assefa; Thomas M Keane; Louis R Sarry; Ulrike Böhme; Jacob Lemieux; Bart Barrell; Arnab Pain; Matthew Berriman; Chris Newbold; Manuel Llinás
Journal:  Mol Microbiol       Date:  2010-02-04       Impact factor: 3.501

3.  Hammondia hammondi harbors functional orthologs of the host-modulating effectors GRA15 and ROP16 but is distinguished from Toxoplasma gondii by a unique transcriptional profile.

Authors:  Katelyn A Walzer; Gregory M Wier; Rachel A Dam; Ananth R Srinivasan; Adair L Borges; Elizabeth D English; Daland C Herrmann; Gereon Schares; Jitender P Dubey; Jon P Boyle
Journal:  Eukaryot Cell       Date:  2014-10-03

4.  Oases: robust de novo RNA-seq assembly across the dynamic range of expression levels.

Authors:  Marcel H Schulz; Daniel R Zerbino; Martin Vingron; Ewan Birney
Journal:  Bioinformatics       Date:  2012-02-24       Impact factor: 6.937

5.  Recent Progress in Functional Genomic Research in Plasmodium falciparum.

Authors:  Jianbing Mu; Karl B Seydel; Adam Bates; Xin-Zhuan Su
Journal:  Curr Genomics       Date:  2010-06       Impact factor: 2.236

6.  Full-parasites: database of full-length cDNAs of apicomplexa parasites, 2010 update.

Authors:  Josef Tuda; Arthur E Mongan; Mohammed E M Tolba; Mihoko Imada; Junya Yamagishi; Xuenan Xuan; Hiroyuki Wakaguri; Sumio Sugano; Chihiro Sugimoto; Yutaka Suzuki
Journal:  Nucleic Acids Res       Date:  2010-11-04       Impact factor: 16.971

7.  DB-AT: a 2015 update to the Full-parasites database brings a multitude of new transcriptomic data for apicomplexan parasites.

Authors:  Marcin Jąkalski; Hiroyuki Wakaguri; Tabea G Kischka; Yoshifumi Nishikawa; Shin-ichiro Kawazu; Makoto Matsubayashi; Fumiya Kawahara; Naotoshi Tsuji; Shinuo Cao; Fujiko Sunaga; Xuenan Xuan; Kazuhiro Okubo; Ikuo Igarashi; Josef Tuda; Arthur E Mongan; Yuki Eshita; Ryuichiro Maeda; Wojciech Makałowski; Yutaka Suzuki; Junya Yamagishi
Journal:  Nucleic Acids Res       Date:  2014-11-20       Impact factor: 16.971

8.  Generation, annotation, and analysis of ESTs from midgut tissue of adult female Anopheles stephensi mosquitoes.

Authors:  Deepak P Patil; Santosh Atanur; Dhiraj P Dhotre; D Anantharam; Vineet S Mahajan; Sandeep A Walujkar; Rakesh K Chandode; Girish J Kulkarni; Pankaj S Ghate; Abhishek Srivastav; Kannayakanahalli M Dayananda; Neha Gupta; Bhakti Bhagwat; Rajendra R Joshi; Devendra T Mourya; Milind S Patole; Yogesh S Shouche
Journal:  BMC Genomics       Date:  2009-08-20       Impact factor: 3.969

9.  Inconsistencies of genome annotations in apicomplexan parasites revealed by 5'-end-one-pass and full-length sequences of oligo-capped cDNAs.

Authors:  Hiroyuki Wakaguri; Yutaka Suzuki; Masahide Sasaki; Sumio Sugano; Junichi Watanabe
Journal:  BMC Genomics       Date:  2009-07-15       Impact factor: 3.969

10.  Efficient CRISPR-Cas9-mediated genome editing in Plasmodium falciparum.

Authors:  Jeffrey C Wagner; Randall J Platt; Stephen J Goldfless; Feng Zhang; Jacquin C Niles
Journal:  Nat Methods       Date:  2014-08-10       Impact factor: 28.547

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.