| Literature DB >> 21637855 |
Jie Xiong1, Dongxia Yuan, Jeffrey S Fillingham, Jyoti Garg, Xingyi Lu, Yue Chang, Yifan Liu, Chengjie Fu, Ronald E Pearlman, Wei Miao.
Abstract
BACKGROUND: Genome-wide expression data of gene microarrays can be used to infer gene networks. At a cellular level, a gene network provides a picture of the modules in which genes are densely connected, and of the hub genes, which are highly connected with other genes. A gene network is useful to identify the genes involved in the same pathway, in a protein complex or that are co-regulated. In this study, we used different methods to find gene networks in the ciliate Tetrahymena thermophila, and describe some important properties of this network, such as modules and hubs. METHODOLOGY/PRINCIPALEntities:
Mesh:
Substances:
Year: 2011 PMID: 21637855 PMCID: PMC3102692 DOI: 10.1371/journal.pone.0020124
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1The node and edge number against cutoff values for three methods.
For the CLR method, the cutoff value represents the Z-score with a minimal value of 3.34 corresponding to the 60% confidence level of the FDR test; for the PCC and SCC methods, the cutoff value represents the correlation coefficient. The minimal correlation coefficient is 0.6.
Figure 2Overall performance of three methods for four protein complexes.
The F-score against the cutoff values (X-axis) of three methods for each protein complex is presented. Blue, CLR method; Pink, PCC method; Green, SCC method. For the CLR method, the cutoff value means the different confidence levels of the FDR test; for the PCC and SCC methods, the cutoff values represent the correlation coefficient.
Detailed information of 21 modules containing more than 100 genes in the TGN.
| Module | Gene NO | GO ANNO NO | GO ANNO PER | BP ANNO NO | Ortho No | Ortho PER |
| Module-1 | 3533 | 1287 | 36.43% | 795 | 741 | 20.97% |
| Module-2 | 1703 | 790 | 46.39% | 548 | 605 | 35.53% |
| Module-3 | 1369 | 510 | 37.25% | 327 | 266 | 19.43% |
| Module-4 | 988 | 271 | 27.43% | 156 | 278 | 28.14% |
| Module-5 | 929 | 314 | 33.80% | 196 | 179 | 19.27% |
| Module-6 | 836 | 287 | 34.33% | 154 | 114 | 13.64% |
| Module-7 | 827 | 365 | 44.14% | 233 | 336 | 40.63% |
| Module-8 | 824 | 156 | 18.93% | 132 | 29 | 3.52% |
| Module-9 | 583 | 264 | 45.28% | 152 | 307 | 52.66% |
| Module-10 | 565 | 202 | 35.75% | 124 | 139 | 24.60% |
| Module-11 | 338 | 134 | 39.64% | 86 | 71 | 21.01% |
| Module-12 | 332 | 93 | 28.01% | 52 | 53 | 15.96% |
| Module-13 | 314 | 65 | 20.70% | 39 | 27 | 8.60% |
| Module-14 | 313 | 87 | 27.80% | 61 | 8 | 2.56% |
| Module-15 | 260 | 106 | 40.77% | 64 | 45 | 17.31% |
| Module-16 | 181 | 82 | 45.30% | 49 | 58 | 32.04% |
| Module-17 | 162 | 93 | 57.41% | 56 | 89 | 54.94% |
| Module-18 | 127 | 35 | 27.56% | 19 | 22 | 17.32% |
| Module-19 | 117 | 77 | 65.81% | 64 | 77 | 65.81% |
| Module-20 | 114 | 18 | 15.79% | 9 | 11 | 9.65% |
| Module-21 | 109 | 38 | 34.86% | 25 | 22 | 20.18% |
The modules are named by the gene numbers in descending order. GO ANNO NO, indicates the number of GO annotated genes; GO ANNO PER, indicates the percentage of GO annotated genes; BP, biological process; Ortho, orthologs with other eukaryotes (see Materials and Methods).
Figure 3Possible components of the Tetrahymena proteasome complex.
A, KEGG annotated Tetrahymena proteasome complex (http://www.kegg.jp/kegg-bin/show_pathway?tet03050); B, Silver stained gel of a pull down experiment using Dss1 (Rpn15) as bait that identifies proteins of the Tetrahymena proteasome; C, The network of the possible Tetrahymena proteasome complex.
Figure 4The distribution of the gained partners number of genes in TGN.
The X-axis indicates the gained partners (each represents an edge) of genes, the value was Log2 transformed. The Y-axis indicates the frequency of gained partners number, also Log2 transformed.
Figure 5The heatmap of the 1200 hub genes in TGN.
The heatmap was clustered by Euclidean distance of expression. The levels of expression are illustrated by different grades of color as determined from microarray data indicated along the top (from left to right). The color scale is as follows: dark color, low expression; light color, high expression. Levels of expression were obtained for 20 points in time during three physiological/developmental stages of the life cycle of Tetrahymena: For growing cells, L-l, L-m and L-h correspond to ∼1×105 cells/ml, ∼3.5×105 cells/ml and ∼1×106 cells/ml, respectively. For measurements of expression during starvation, ∼2×105 cells/ml were collected at intervals of 0, 3, 6, 9, 12, 15 and 24 hours (referred to as S-0, S-3, S-6, S-9, S-12, S-15 and S-24, respectively). For measurements of expression during conjugation, equal volumes of B2086 and CU428 cells were mixed following 18 h of starvation, and samples were collected at intervals of 0, 2, 4, 6, 8, 10, 12, 14, 16 and 18 h after mixing (referred to as C-0, C-2, C-4, C-6, C-8, C-10, C-12, C-14, C-16 and C-18, respectively). The 1,200 genes were sorted into six groups according to clustering analysis.