| Literature DB >> 18671881 |
Kei Chin Christine Cheng1, Martina V Strömvik.
Abstract
BACKGROUND: Experiments using whole transcriptome microarrays produce massive amounts of data. To gain a comprehensive understanding of this gene expression data it needs to be integrated with other available information such as gene function and metabolic pathways. Bioinformatics tools are essential to handle, organize and interpret the results. To date, no database provides whole transcriptome analysis capabilities integrated with terms describing biological functions for soybean (Glycine max (L) Merr.). To this end we have developed SoyXpress, a relational database with a suite of web interfaces to allow users to easily retrieve data and results of the microarray experiment with cross-referenced annotations of expressed sequence tags (EST) and hyperlinks to external public databases. This environment makes it possible to explore differences in gene expression, if any, between for instance transgenic and non-transgenic soybean cultivars and to interpret the results based on gene functional annotations to determine any changes that could potentially alter biological processes.Entities:
Mesh:
Year: 2008 PMID: 18671881 PMCID: PMC2536680 DOI: 10.1186/1471-2164-9-368
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1The main page of the soy database. Users can submit queries to the database to retrieve all available IDs and annotations for the soybean transcript of interest. Queries can be made using EST ID, GenBank accession number, GenBank GI number, Affymetrix probe Set ID, SwissProt protein ID, name or keyword, GO number or term. A clickable GO tree is available to assist searching for a GO term from the gene ontology hierarchy structure.
Figure 2Database structure. A) Gene transcript (sequence) information section. Tables for EST sequences (DNA_SEQUENCE, LIBRARY, REPEATS, SEQ_ACCESSION, TAIL, TRIM, VECTOR), a table for mRNA sequences downloaded from GenBank (GB_ACCESSION) and tables for DFCI/TIGR contigs data (TIGR_GB, TIGR_GO) link through the tables GB_ACCESSION and SEQ_ACCESSION to annotation and microarray data. B) Protein and functional annotations section. A table, BLAST, for BLASTX search results, tables for gene ontology terms information (GENE_ANNOTATION, GRAPH_PATH, TERM, TERM2TERM, TERM_DEFINITION) and tables for KEGG pathways with the enzyme commission numbers (EC_DEF, EC_SWISS, EC_MAP) organize the annotation section. The BLAST table links protein annotation data to transcript sequence information and to microarray experiments data. C) Microarray experiment data section. Tables for chip information (CDF_FILE, PROBE_SET), a table for raw data (CEL_DATA), tables for normalized data (LIWONG_RESULT, MAS_RESULT, RMA_RESULT), and tables for analyzed results (EXPERIMENT, FOLD_CHANGE, LIMA_RESULT, NORMALIZE) organize the microarray data section. The PROBE_SET table links the microarray data to transcript sequence information and protein annotations data.
Figure 3Database result page, showing information about an EST sequence sag44h11.y1. Data records are presented in three tables: a) EST table displays the BLASTX result with scores and the negative exponent for the e-value for the EST sequence. b) AFFY table displays the BLASTX result with scores and the negative exponent for the e-value for the Affymetrix probe sequence. c) TC table displays the information for the corresponding DFCI/TIGR contig. The SwissProt protein ID/description, GO number/terms and EC enzyme numbers are displayed with the corresponding EST ID, GenBank accession number or Affymetrix probe ID with hyperlinks to the original public database. If available, links to TOXLINE are also given.
Figure 4Flowchart of the microarray web interface to access the database for displaying differentially expressed genes or functional gene classes based on GO terms. A) The query is built in three steps. B) There are four different views for the query results: by probe or functional level analysis either displayed as a list or as more informative annotated list.