| Literature DB >> 18808668 |
Carsten O Daub1, Erik Ll Sonnhammer.
Abstract
BACKGROUND: Observing co-expression between genes suggests that they are functionally coupled. Co-expression of orthologous gene pairs across species may improve function prediction beyond the level achieved in a single species.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18808668 PMCID: PMC2561017 DOI: 10.1186/1752-0509-2-81
Source DB: PubMed Journal: BMC Syst Biol ISSN: 1752-0509
Figure 1Schematic representation of orthologous co-regulation. For two genes A and B in one species, co-expressed genes are more likely to functionally interact than is expected by chance. If the co-expression is conserved, i.e. also found in an orthologous gene pair, A' and B' in another species, where A is orthologous to A' (A and A' arouse from a speciation event, red circle in the figure) and B is orthologous to B', the probability of a functional interaction is increased. This likelihood might be even further increased if the conservation of co-expression occurs across three species at the same time. One might speculate that such conserved co-regulations have an ancient origin and constitute basic functionalities.
Figure 2Sizes of the gene expression datasets used. We based our analyses on 'synchronized' gene expression datasets for the three species Saccharomyces cerevisiae (yeast), Drosophila melanogaster (fly), and Caenorhabditis elegans (worm) where each gene in one species had a corresponding orthologous gene in one of the other species (number on the edges) or among all three datasets (number inside the triangle).
Figure 3Evaluation of distance measures. Four of the most commonly used gene co-expression distance measures were evaluated in terms of their ability to detect biologically relevant gene associations according to biological process Gene Ontology annotations (A, B, C) and the KEGG PATHWAY map annotations (D, E, F). We incorporated gene pairs, starting with the highest co-expressed ones, continuing successively with weaker interactions, and predicted their accuracies. The impact of this incorporation on the prediction accuracies for the three species S. cerevisiae (yeast) (A, D), D. melanogaster (fly) (B, E), and C. elegans (worm) (C, F) and for four commonly used distance measures are shown. The background accuracy 'b' (grey horizontal lines) is the average over all gene association and represents the expected accuracy for a randomly chosen gene pairs. For the S. cerevisiae, D. melanogaster, and C. elegans datasets, the analysis is based on 1771, 2065, and 2065 genes, respectively (see also Figure 2). The top 8000 interactions obtained using the Spearman correlation, which was used in the conservation of co-expression study shown in Figure 4, contain 1131, 1308, and 1117 genes for S. cerevisiae, D. melanogaster, and C. elegans, respectively.
Figure 4Function prediction can be improved by combining co-expression of orthologous gene pairs. The prediction accuracy for co-expressed genes in a species was compared to the accuracies obtained from gene associations that are conserved across two or three species considered in this study. The gain in accuracy was obtained as the ratio of the accuracy of the combined co-expression to the accuracy obtained for one species alone. The analysis was based on 697 genes (see also Figure 2) for which orthologous genes were found across all three species.