| Literature DB >> 14718066 |
Yutaka Fukuoka1, Hidenori Inaoka, Isaac S Kohane.
Abstract
BACKGROUND: There is increasing evidence that gene order within the eukaryotic genome is not random. In yeast and worm, adjacent or neighboring genes tend to be co-expressed. Clustering of co-expressed genes has been found in humans, worm and fruit flies. However, in mice and rats, an effect of chromosomal distance (CD) on co-expression has not been investigated yet. Also, no cross-species comparison has been made so far. We analyzed the effect of CD as well as normalized distance (ND) using expression data in six eukaryotic species: yeast, fruit fly, worm, rat, mouse and human.Entities:
Mesh:
Substances:
Year: 2004 PMID: 14718066 PMCID: PMC331401 DOI: 10.1186/1471-2164-5-4
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Dataset information
| species | label | genes | pairs | microarrays | weight |
| human | Su (a) [ | 5510 | 770014 | 85 | 0.0482 |
| human | Armstrong [ | 5492 | 765803 | 72 | 0.0406 |
| human | Pomeroy [ | 3107 | 239951 | 266 | 0.0470 |
| human | Ramasawamy [ | 6971 | 1217536 | 280 | 0.2508 |
| human | Yeoh [ | 5503 | 768242 | 254 | 0.1436 |
| human | Houmard [ | 15681 | 6344163 | 24 | 0.1120 |
| human | Diette [ | 24810 | 15248313 | 29 | 0.3253 |
| human | Dyrskjot [ | 3159 | 245900 | 40 | 0.0072 |
| human | Alizadeh (a) [ | 2932 | 160672 | 31 | 0.0037 |
| human | Alizadeh (b) [ | 2701 | 187917 | 35 | 0.0048 |
| human | Alizadeh (c) [ | 3696 | 341437 | 67 | 0.0168 |
| mouse | Su (b) [ | 7651 | 1616309 | 90 | 0.2437 |
| mouse | Schinke (a) [ | 9124 | 2296857 | 30 | 0.1154 |
| mouse | Schinke (b) [ | 7649 | 1615129 | 54 | 0.1461 |
| mouse | Neptune [ | 13150 | 4744714 | 24 | 0.1908 |
| mouse | Scott [ | 9124 | 2296857 | 79 | 0.3040 |
| rat | Faden [ | 5519 | 884975 | 79 | 0.5741 |
| rat | Almon [ | 4227 | 529217 | 47 | 0.2043 |
| rat | Almon [ | 4227 | 529217 | 51 | 0.2216 |
| worm | Kim [ | 15368 | 19987830 | 553 | 1 |
| fruit fly | Montalta-He and Egger [ | 5534 | 3078761 | 31 | 0.8171 |
| fruit fly | Arbeitman [ | 1138 | 133501 | 160 | 0.1829 |
| yeast | Eisen [ | 2478 | 232362 | 79 | 0.4315 |
| yeast | Cho [ | 6123 | 1422380 | 17 | 0.5685 |
Label provides the reference information. The genes, pairs and microarrays columns show the total number of the genes, pairs and microarrays in each dataset, respectively. Weight denotes the dataset's weight in each species.
Figure 1The co-expression rates (CoERs) in the eleven human datasets are plotted as a function of chromosomal distance (CD): the results from the highly-correlated (HC) pairs (A), the zero-correlation (ZC) pairs (B) and the negatively-correlated (NC) pairs (C). The reference information is as follows: 1) Armstrong [21], 2) Ramasawamy [23], 3) Pomeroy [22], 4) Su (a) [17], 5) Yeoh [24], 6) Houmard [25], 7) Diette [26], 8) Dyrskjot [27], 9) Alizadeh (a) [28], 10) Alizadeh (b) [28], 11) Alizadeh (c) [28]. The weighted average and standard deviation of the CoER over the eleven datasets: the results from the HC pairs (D), the ZC pairs (E) and the NC pairs (F).
Figure 2A comparison of the six species using the weighted average of each species. (A) the highly correlated pairs, (B) the zero correlation pairs and (C) the negatively correlated pairs. Note that worm2 represents the CoERs of the worm dataset without the pairs in operons and duplicates.
Gene Ontology categories shared by the two genes in pairs with a CD below 20 kbp
| category | description | # pairs | ||
| h | molecular functions | GO:0005132 | interferon-alpha/beta receptor ligand | 2 |
| h | GO:0005194 | cell adhesion | 2 | |
| h | GO:0008089 | anterograde axon cargo transport | 4 | |
| h | biological process | GO:0006954 | inflammatory response | 6 |
| h | GO:0006955 | immune response | 3 | |
| h | GO:0007165 | signal transduction | 5 | |
| h | cellular component | GO:0005634 | nucleus | 5 |
| h | GO:0005886 | plasma membrane | 5 | |
| h | GO:0005887 | integral to plasma membrane | 2 | |
| y | molecular functions | GO:0003735 | structural cons tituent of ribosome | 12 (1) |
| y | biological process | GO:0006412 | protein biosynthesis | 13 (1) |
| y | cellular component | GO:0005737 | cytoplasm | 12 (2) |
| h | molecular functions | GO:0005200 | cytoskeletal structural protein | 3 |
| h | GO:0008009 | chemokine | 2 | |
| h | GO:0008417 | fucosyltransferase | 2 | |
| h | GO:0015464 | acetylcholine receptor | 2 | |
| h | biological process | GO:0006954 | inflammatory response | 3 |
| h | GO:0007048 | oncogenesis | 4 | |
| h | GO:0007165 | signal transduction | 8 | |
| h | cellular component | GO:0005882 | intermediate filament | 3 |
| h | GO:0005886 | plasma membrane | 4 | |
| h | GO:0005887 | integral to plasma membrane | 5 | |
| h | molecular functions | GO:0005624 | membrane fraction | 3 |
| h | biological process | GO:0006508 | proteolysis and peptidolysis | 2 |
| h | GO:0007165 | signal transduction | 2 | |
| h | cellular component | GO:0005624 | membrane fraction | 3 |
| h | GO:0005887 | integral to plasma membrane | 2 | |
A 'h' and 'y' in the left column denotes that the pairs were from the human (Diette [26]) and yeast (Cho et al. [9]) dataset, respectively. In yeast, none pair shared the same category in the ZC and NC groups. A number in parentheses in the right column indicates the number of duplicated pairs.
Distribution of expected values obtained in pair-wise protein BLAST
| Expected value | ||||||||
| species | label | # of analyzed pairs | 0 – 0.2 | 0.2 – 0.4 | 0.4 – 0.6 | 0.6 – 0.8 | 0.8 – 1 | > 1 |
| yeast | Eisen [ | 10000 | 800 | 515 | 402 | 316 | 369 | 7598 |
| yeast | Cho [ | 9980 | 768 | 422 | 377 | 325 | 322 | 7766 |
| worm | Kim [ | 8370 | 2658 | 267 | 206 | 182 | 210 | 4847 |
| mouse | Su (b) [ | 3631 | 437 | 151 | 111 | 116 | 101 | 2715 |
| mouse | Schinke (a) [ | 3554 | 340 | 149 | 131 | 107 | 99 | 2728 |
| mouse | Schinke (b) [ | 3599 | 396 | 139 | 110 | 109 | 129 | 2716 |
| mouse | Neptune [ | 3122 | 344 | 118 | 115 | 77 | 101 | 2367 |
| mouse | Scott [ | 3659 | 408 | 117 | 110 | 104 | 111 | 2809 |
The results of a multiple comparison with the Ryan procedure
| worm | rat | human | mouse | fly | yeast | |
| worm | * | * | * | * | ||
| rat | p = 0.703 | * | * | |||
| human | p < 0.001 | p = 0.094 | * | * | ||
| mouse | p < 0.001 | p = 0.017 | p = 0.250 | * | ||
| fly | p < 0.001 | p < 0.001 | p < 0.001 | p = 0.077 | * | |
| yeast | p < 0.001 | p < 0.001 | p < 0.001 | p < 0.001 | p = 0.001 |
A * denotes significant difference (p < 0.01).
Figure 3The co-expression rates shown in Figure 2(A) were re-plotted against the normalized distance. A value on the horizontal axis can be smaller than 1 because all possible pairs next to each other were involved in the calculation (i.e. we did not exclude a non-coding region, cetromere, etc from the calculation). Note that worm2 represents the CoERs of the worm dataset without the pairs in operons and duplicates.