| Literature DB >> 16845059 |
Iain W Manfield1, Chih-Hung Jen, John W Pinney, Ioannis Michalopoulos, James R Bradford, Philip M Gilmartin, David R Westhead.
Abstract
The Arabidopsis Co-expression Tool, ACT, ranks the genes across a large microarray dataset according to how closely their expression follows the expression of a query gene. A database stores pre-calculated co-expression results for approximately 21,800 genes based on data from over 300 arrays. These results can be corroborated by calculation of co-expression results for user-defined sub-sets of arrays or experiments from the NASC/GARNet array dataset. Clique Finder (CF) identifies groups of genes which are consistently co-expressed with each other across a user-defined co-expression list. The parameters can be altered easily to adjust cluster size and the output examined for optimal inclusion of genes with known biological roles. Alternatively, a Scatter Plot tool displays the correlation coefficients for all genes against two user-selected queries on a scatter plot which can be useful for visual identification of clusters of genes with similar r-values. User-input groups of genes can be highlighted on the scatter plots. Inclusion of genes with known biology in sets of genes identified using CF and Scatter Plot tools allows inferences to be made about the roles of the other genes in the set and both tools can therefore be used to generate short lists of genes for further characterization. ACT is freely available at www.Arabidopsis.leeds.ac.uk/ACT.Entities:
Mesh:
Substances:
Year: 2006 PMID: 16845059 PMCID: PMC1538833 DOI: 10.1093/nar/gkl204
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1Screen shot of a typical ACT output showing co-expression of genes encoding sub-units of the proteasome. Gene identifiers, correlation r-value, measures of statistical significance and annotation are shown.
Figure 2The Clique Finder algorithm for identification of groups of consistently co-expressed genes.
Figure 3Co-correlation Scatter Plot. The r-values for all genes against two queries, cca1 (266719_at) and lhy (261569_at), are displayed on a scatter plot. The values for all of the myb genes represented on the array are highlighted in red. The labelled genes are discussed in the text.
Comparison of ACT output from the pre-calculated database with co-expression results based on a user-selected set of arrays for myb transcription factor gene, lhy
| Co-expression result from a pre-calculated database | Co-expression result from a user-selected set of arrays | ||||
|---|---|---|---|---|---|
| GeneID | Annotation | GeneID | Annotation | ||
| 0.88 | 0.97 | ||||
| 0.87 | 0.95 | ||||
| 0.86 | 0.94 | At3g47420 | glycerol-3-phosphate transporter, | ||
| 0.85 | 0.93 | ||||
| 0.85 | 0.92 | At2g47490 | mitochondrial substrate carrier | ||
| 0.84 | 0.90 | ||||
| 0.83 | 0.89 | ||||
| 0.83 | 0.89 | ||||
| 0.80 | 0.89 | ||||
| 0.80 | 0.89 | ||||
| 0.79 | 0.88 | ||||
| 0.78 | 0.88 | At5g18670 | beta-amylase | ||
| 0.77 | 0.87 | At1g14280 | phytochrome kinase, | ||
| 0.77 | 0.87 | ||||
The query gene is perfectly correlated with itself and therefore this r-value is given in brackets. Only the top-ranked 15 probes (one gene is represented by two probes) are presented from each list. Genes present on both lists are highlighted in boldface and genes ranked in the top 100 of the other list are indicated in italics.
CF output for a myb transcription factor showing only one of the three clusters produced
| Query probe: 266719_at AT2G46830 | |||
| Neighbour list size: 100; edge limit: 4.0%; number of clusters found: 3 | |||
| MORE edges (5.0%) => larger clusters | |||
| FEWER edges (3.0%) => smaller clusters | |||
| Mean | |||
| Mean | |||
| 261569_at | 0.870453 | AT1G01060 | |
| 261958_at | 0.796923 | AT1G64500 | |
| 265892_at | 0.787191 | AT2G15020 | expressed protein |
| 263796_at | 0.778055 | AT2G24540 | |
| 251869_at | 0.777933 | AT3G54500 | expressed protein |
| 258497_at | 0.728698 | AT3G02380 | |
| 265939_at | 0.704785 | AT2G19650 | DC1 domain-containing protein |
| 258724_at | 0.687695 | AT3G09600 | |
| 258723_at | 0.674703 | AT3G09600 | |
This output is edited from the format produced by the website. The expression of genes highlighted in boldface has been shown to be red-light responsive (see text for details).