| Literature DB >> 17545197 |
Shivashankar H Nagaraj1, Nandan Deshpande, Robin B Gasser, Shoba Ranganathan.
Abstract
The analysis of expressed sequence tag (EST) datasets offers a rapid and cost-effective approach to elucidate the transcriptome of an organism, but requiring several computational methods for assembly and annotation. ESTExplorer is a comprehensive workflow system for EST data management and analysis. The pipeline uses a 'distributed control approach' in which the most appropriate bioinformatics tools are implemented over different dedicated processors. Species-specific repeat masking and conceptual translation are in-built. ESTExplorer accepts a set of ESTs in FASTA format which can be analysed using programs selected by the user. After pre-processing and assembly, the dataset is annotated at the nucleotide and protein levels, following conceptual translation. Users may optionally provide ESTExplorer with assembled contigs for annotation purposes. Functionally annotated contigs/ESTs can be analysed individually. The overall outputs are gene ontologies, protein functional identifications in terms of mapping to protein domains and metabolic pathways. ESTExplorer has been applied successfully to annotate large EST datasets from parasitic nematodes and to identify novel genes as potential targets for parasite intervention. ESTExplorer runs on a Linux cluster and is freely available for the academic community at http://estexplorer.biolinfo.org.Entities:
Mesh:
Year: 2007 PMID: 17545197 PMCID: PMC1933243 DOI: 10.1093/nar/gkm378
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.ESTExplorer input and analysis output pages. (A) EST or contig submission with optional parameters for organism and program selection, (B) Status page to monitor the progress of different programs and (C) Result download page from where processed data can be downloaded and annotation links accessed. Screenshots of the results showing Gene Ontologies (accessed via a link for each EST/contig), InterProScan and KEGG pathway mapping obtained from processed EST data are shown.