| Literature DB >> 16968540 |
Panayiotis Tsaparas1, Leonardo Mariño-Ramírez, Olivier Bodenreider, Eugene V Koonin, I King Jordan.
Abstract
BACKGROUND: A genome-wide comparative analysis of human and mouse gene expression patterns was performed in order to evaluate the evolutionary divergence of mammalian gene expression. Tissue-specific expression profiles were analyzed for 9,105 human-mouse orthologous gene pairs across 28 tissues. Expression profiles were resolved into species-specific coexpression networks, and the topological properties of the networks were compared between species.Entities:
Mesh:
Year: 2006 PMID: 16968540 PMCID: PMC1601971 DOI: 10.1186/1471-2148-6-70
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Figure 1Gene coexpression networks. a) The expression profile of a gene i (G) can be represented as a row vector with dimensions (n) equal to the number of tissues (28); all profiles taken together yield an m × n gene expression matrix (X) where m = the number of genes (9,105), n = the number of tissues (28) and the expression value of Gi in tissue j is represented as x[45]. b) Gene expression vectors can be compared using a number of different similarity (distance) measures such as the Pearson correlation coefficient (r). Genes (nodes) are connected by an edge if their vectors are sufficiently similar (e.g. r ≥ 0.7). A relatively tightly linked cluster (subgraph) of coexpressed genes is shown. c) Visual representation of the expression patterns of the genes in this cluster underscores their similarity. Color scale based on log2 (G/median G...G).
Global characteristics of the coexpression networks
| < | < | < | |||
| Human | 7,208 | 158,418 | 43.96 | 0.3744 | 4.75 |
| Mouse | 7,730 | 178,166 | 46.10 | 0.4003 | 4.80 |
| Intersection | 2,257 | 13,060 | 11.57 | 0.4006 | 6.89 |
1Number of genes (nodes) in the network – i.e. nodes with one or more edges
2Number of coexpressed gene pairs (edges) in the network
3Average degree (k), number of edges shared with other nodes, per node
4Average clustering coefficient (C) per node
5Average shortest path length () between any two nodes in the network
Figure 2Node degree (. All distributions are plotted in log10-log10 scale. Frequency distributions showing f(k) × k for human (a) and mouse (b).
Figure 3Clustering coefficient against node degree . The degree (k) is shown on the x-axis and the average clustering coefficient
Figure 4Human-mouse conserved intersection network. a) Procedure for computing the intersection network whereby conserved edges that link the corresponding orthologous genes in both species are preserved. b) Node degree (k) and c) clustering coefficient against node degree C(k) distributions for the intersection network.
Local conservation of the human-mouse intersection network
| Nodes | 2,257 | 31.31 | 63.20 | 29.20 | 41.51 |
| Edges | 13,060 | 8.24 | 11.71 | 7.33 | 4.93 |
1Number of nodes and edges in the human-mouse intersection network
2Percentage of the nodes and edges conserved in the intersection network relative to the human and mouse networks
3Normalized percentage of the nodes and edges (see Additional file 1) conserved in the intersection network
Figure 5GO similarity versus gene profile correlation matrix. Genes are plotted along both axes of the matrices. Genes were clustered according to the pairwise similarity between their GO biological process annotation terms for the a) human-specific coexpression network, b) mouse-specific coexpression network and c) the human-mouse conserved intersection network. Pearson correlations (r) for all pairs of tissue-specific gene expression profiles are plotted according to the color bar. The inset of each plot shows a negative control where genes are randomly plotted, i.e. without regard to functional similarity, along the axes of the matrix.
Average GO similarity for mammalian gene coexpression networks versus average GO similarity for all gene pairs
| Human | 0.2637 ± 9.1e-4 | 0.1989 ± 4.9e-5 | 80.78 | 0 |
| Mouse | 0.2736 ± 8.9e-4 | 0.2150 ± 8.2e-5 | 75.23 | 0 |
1Average GO similarities for all pairs of genes connected by an edge in the coexpression network
2Average GO similarities all possible pairs among the 9,105 human-mouse orthologs
3Value of test statistic, t = (μ1-μ2)/(σd*sqrt(1/n1+1/n2)) where μ are the respective means and σd is the standard deviation of the difference
4Level of significance based on Students t-distribution with degrees of freedom = n1+n2-2
Correlation (r) between pairwise GO similarity and pairwise gene expression profile r-values
| Human | 0.1012 | 49303 | 22.59 | 2.2e-112 |
| Mouse | 0.0974 | 57685 | 23.50 | 1.4e-121 |
| Intersection | 0.1927 | 5370 | 14.39 | 4.4e-46 |
1Pearson correlation coefficient
2Number of gene pairs compared
3Value of test statistic, t = r*sqrt((n-2)/(1-r2))
4Level of significance based on Students t-distribution with degrees of freedom = n-2
Average GO similarity for species-specific mammalian gene coexpression networks versus average GO similarity for the conserved human-mouse intersection network
| Human-specific | 0.2556 ± 9.3e-4 | 25.65 | 3.3e-144 |
| Mouse-specific | 0.2678 ± 9.2e-4 | 20.31 | 2.2e-91 |
| Intersection | 0.3299 ± 3.4e-3 |
1Average GO similarities for all pairs of genes connected by an edge in the coexpression network
2Value of the test statistic for the comparison between the species-specific network and the intersection network, t = (μ1-μ2)/(σd*sqrt(1/n1+1/n2)) where μ are the respective means and σd is the standard deviation of the difference
3Level of significance, i.e., probability that the distributions of GO similarity values for the intersection and species-specific networks are identical, based on Students t-distribution with degrees of freedom = n1+n2-2
Figure 6GO biological process semantic similarity cumulative frequency distributions. Distributions [Pr(X≤x)] of GO term similarities are shown for all human and mouse gene pairs, for pairs of genes linked in the species-specific coexpression networks and for pairs of genes linked in the conserved human-mouse intersection network.
Correlation (r) between pairwise GO similarity and pairwise gene expression profile r-values
| Human-specific | 0.0581 | 43933 | 9.47 | 0 |
| Mouse-specific | 0.0730 | 52315 | 8.51 | 0 |
| Intersection | 0.1927 | 5370 |
1Pearson correlation coefficient
2Number of gene pairs compared
3Value of the test statistic for the comparison between the species-specific network and the intersection network, z = (zf1-zf2)/sqrt(1/(n1–3)+1/(n2–3)) where zf is the Fisher transform, zf = 1/2*ln((1+r)/(1-r))
4Level of significance, i.e., probability that the correlations in the intersection and species-specific networks are indistinguishable, based on normal distribution with infinite degrees of freedom
Figure 7Clusters of tightly coexpressed and functionally coherent genes. Examples of clusters involved in spermatogenesis (a) and host immune response (b) are shown along with their tissue-specific expression patterns.