| Literature DB >> 17626053 |
Tomás Norambuena1, Rodrigo Malig, Francisco Melo.
Abstract
We describe a web server for the accurate mapping of experimental tags in serial analysis of gene expression (SAGE). The core of the server relies on a database of genomic virtual tags built by a recently described method that attempts to reduce the amount of ambiguous assignments for those tags that are not unique in the genome. The method provides a complete annotation of potential virtual SAGE tags within a genome, along with an estimation of their confidence for experimental observation that ranks tags that present multiple matches in the genome. The output of the server consists of a table in HTML format that contains links to a graphic representation of the results and to some external servers and databases, facilitating the tasks of analysis of gene expression and gene discovery. Also, a table in tab delimited text format is produced, allowing the user to export the results into custom databases and software for further analysis. The current server version provides the most accurate and complete SAGE tag mapping source that is available for the yeast organism. In the near future, this server will also allow the accurate mapping of experimental SAGE-tags from other model organisms such as human, mouse, frog and fly. The server is freely available on the web at: http://dna.bio.puc.cl/SAGExplore.html.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17626053 PMCID: PMC1933165 DOI: 10.1093/nar/gkm429
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Layout and flowchart of the SAGExplore server. The core of the server is a MySQL database of virtual genomic SAGE tags with confidence assignments that was generated by the HGA methodology and it has been recently described in detail (2). The server has three modules: (i) Genome Explore, (ii) Genome Mapping and (iii) Library Mapping. The module I is linked to NCBI BLAST server and to the Saccharomyces Genome Database (SGD) and does not require any input from the user. The modules II and III require experimental tag sequences and their counts (optional) as input, are linked to the module I and, through this, to the external servers and databases. For more details about the functioning of these modules see the main text. This server has been programmed in MySQL and PHP languages and uses the JpGraph graphics library.
Input options and requirements of the different modules of the SAGExplore server
| Input | Description | Genome explore | Genome mapping | Library mapping |
|---|---|---|---|---|
| User-specified input parameters | Organism specification | Yes | Yes | Yes |
| Anchoring-Tagging enzyme pair | Yes | Yes | Yes | |
| Odds ratio for tag confidence assignment | Yes | Yes | No | |
| Genomic mapping context and tag categories | Yes | No | No | |
| Output display options | Yes | Yes | Yes | |
| User-provided input data | List of genomic regions | Yes | No | No |
| List of experimental SAGE tags | No | Yes | Yes |
Figure 2.(A) Input form of genome explore module. (B) Output table of genome mapping module. (C) Graphic display of the genomic context for a selected tag. (D) Graphical genome expression map. (E) An example of the sequence details that are displayed for a particular gene where a given tag maps. (F) An example of the genomic details available for each tag.
Output data given by the different modules of the SAGExplore server
| Column description | Genome explore | Genome mapping | Library mapping |
|---|---|---|---|
| Sequential tag number | Yes (1) | Yes (1) | Yes (1) |
| Tag confidence assignment | Yes (2) | Yes (2) | No |
| Tag sequence | Yes (3) | Yes (3) | Yes (2) |
| Tag frequency of occurrence in the genome | Yes (4) | Yes (4) | No |
| Tag odds ratio | Yes (5) | Yes (5) | No |
| Tag class | Yes (6) | Yes (6) | No |
| Tag genomic location description | Yes (7) | Yes (7) | No |
| Tag genomic location type | Yes (8) | Yes (8) | No |
| Tag position within a transcript | Yes (9) | Yes (9) | No |
| Chromosome number | Yes (10) | Yes (10) | No |
| Initial position of the tag in the chromosome | Yes (11) | Yes (11) | No |
| Chromosome strand | Yes (12) | Yes (12) | No |
| Standard gene name | Yes (13) | Yes (13) | No |
| Systematic gene name | Yes (14) | Yes (14) | No |
| Genomic context | Yes (15) | Yes (15) | No |
| Tag details | Yes (16) | Yes (16) | No |
| Display sequence | Yes (17) | Yes (17) | No |
| BLAST | Yes (18) | Yes (18) | No |
| Tag counts on each experimental library | No | No | Yes (3–10) |
| Tag counts | No | Yes (19) | Yes (11) |
| Tag user-defined information | No | Yes (20) | Yes (12) |
aThis field in the output table has additional information dynamically linked. Some of these links currently point to external servers and databases such as BLAST server and SGD database.
The description of the columns displayed for each SAGE tag by the output tables of the server as a result of a particular query issued to each of the independent modules is specified. The numbers between parenthesis represent the sequential column number of each output table displayed by the server on its three different modules as a result of an issued query.
Existing tools for the analysis and mapping of SAGE tags
| Name | Database | Type | Tag counts | Tag mapping | Graphical interface | Organism | Web address |
|---|---|---|---|---|---|---|---|
| TAGmapper ( | RefSeq, ESTs | Server | No | Yes | No | Several | |
| WebSAGE ( | RefSeq, ESTs | Server | No | Yes | No | Human | |
| SAGEmap ( | SAGE libraries | Server | Yes | Yes | Yes | Several | |
| SAGEnet (N.A.) | SAGE libraries | Database | No | No | No | Human, Mouse, Yeast | |
| SAGE genie ( | SAGE libraries | Database | No | No | Yes | Human, Mouse | |
| ACTG ( | RefSeq, ESTs | Server | Yes | Yes | No | Human, Mouse | |
| 5’SAGE ( | Genome, ESTs, SAGE libraries | Server | No | Yes | Yes | Human | |
| Mouse SAGE Site ( | RefSeq, Genome, ESTs, SAGE libraries | Server | No | Yes | No | Mouse | |
| Discovery Space ( | RefSeq, Genome, SAGE libraries | Standalone | Yes | Yes | Yes | Human | |
| Identitag ( | ESTs, cDNA, SAGE libraries (user provided) | Standalone | No | Yes | No | Several | |
| USAGE ( | RefSeq, Genome, SAGE libraries | Server | Yes | Yes | No | Several | |
| SAGExplore | Genome, SAGE libraries | Server | Yes | Yes | Yes | Yeast |
aThis work.
bOther organisms will be soon available for tag mapping on this server. N.A., ‘Not Available’.
The available publications describing the listed tools are cited between parenthesis next to their names. RefSeq and ESTs stand for databases of reference sequences and expressed sequence tags, respectively. Both vary on clustering parameters and definitions. Tag counts column reflects if the tool takes into consideration in some way the observed counts of experimental tags. Some tools perform statistical analysis based on those counts, others only allow to display that information along with the results of the query. Tag mapping column reflects if the tool is able to map experimental tags against a specific built-in database.