| Literature DB >> 19491309 |
Fabian Schreiber1, Gert Wörheide, Burkhard Morgenstern.
Abstract
In the absence of whole genome sequences for many organisms, the use of expressed sequence tags (EST) offers an affordable approach for researchers conducting phylogenetic analyses to gain insight about the evolutionary history of organisms. Reliable alignments for phylogenomic analyses are based on orthologous gene sequences from different taxa. So far, researchers have not sufficiently tackled the problem of the completely automated construction of such datasets. Existing software tools are either semi-automated, covering only part of the necessary data processing, or implemented as a pipeline, requiring the installation and configuration of a cascade of external tools, which may be time-consuming and hard to manage. To simplify data set construction for phylogenomic studies, we set up a web server that uses our recently developed OrthoSelect approach. To the best of our knowledge, our web server is the first web-based EST analysis pipeline that allows the detection of orthologous gene sequences in EST libraries and outputs orthologous gene alignments. Additionally, OrthoSelect provides the user with an extensive results section that lists and visualizes all important results, such as annotations, data matrices for each gene/taxon and orthologous gene alignments. The web server is available at http://orthoselect.gobics.de.Entities:
Mesh:
Year: 2009 PMID: 19491309 PMCID: PMC2703962 DOI: 10.1093/nar/gkp434
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.The main workflow of OrthoSelect: Each EST (A) is assigned to a pre-defined orthologous group (OG) by the KOG database, and translated (B). After all ESTs have been assigned to OGs, a subset of the OGs can be selected which will be further processed to exclude all redundant sequences, compute a sequence alignment and refine it in the last step (C).
Figure 2.The output of the OrthoSelect web server. Besides a general overview page of the results (A), our web server reports information about functional annotations (B), a gene/taxa presence/absence matrix (C, D), annotations for each taxon (E), as well as an overview of the orthologous groups (F). Additionally, for each orthologous group the resulting alignments are visualized using the Jalview (17) applet (G).