| Literature DB >> 20565989 |
Wensheng Zhang1, Andrea Edwards, Wei Fan, Dongxiao Zhu, Kun Zhang.
Abstract
BACKGROUND: Comparative analysis of gene expression profiling of multiple biological categories, such as different species of organisms or different kinds of tissue, promises to enhance the fundamental understanding of the universality as well as the specialization of mechanisms and related biological themes. Grouping genes with a similar expression pattern or exhibiting co-expression together is a starting point in understanding and analyzing gene expression data. In recent literature, gene module level analysis is advocated in order to understand biological network design and system behaviors in disease and life processes; however, practical difficulties often lie in the implementation of existing methods.Entities:
Mesh:
Year: 2010 PMID: 20565989 PMCID: PMC2905369 DOI: 10.1186/1471-2105-11-338
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Flow diagram of svdPPCS.
Figure 2Illustration of the implementation of svdPPCS. A: The plots of the first four eigengenes of ACG data across ages. B: The plots of the first four eigengenes of MT data across ages. C: The distribution of genes among the five primary clusters (PCLs) based on the maximum projection rule for ACG data (red) and MT data (blue). D: The identification of co-expression modules by splitting a two-way chart. E: Determine the cutoffs , and based on SVD-p. F: The heatmap of a conserved gene module.
Figure 3Heatmap of eight co-expression gene modules generated from a pattern pair by svdPPCS. M1: A conserved module with the down-regulation tendency appearing in both ACG data and MT data. M2: A divergent module with the down-regulation tendency appeared in ACG data but not MT data. M3: A divergent module with the down-regulation tendency appeared in MT data but not ACG data. M4: A conserved module with the up-regulation pattern appearing in both data sets. M5: A divergent module with the up-regulation tendency appearing in ACG data but not MT data. M6: A divergent module with the up regulation tendency appearing in MT data but not ACG data. M7: A divergent module with the down-regulation tendency appearing in ACG data and up-regulation tendency appearing in MT data. M8: A divergent module with up-regulation tendency appearing in ACG data and down-regulation tendency appearing in MT data. corr1, corr2: the average of the Pearson correlation coefficients among the member genes of the modules in ACG data and MT data, respectively. In plots of M1-M6, twenty randomly selected genes from the modules are mapped.
Functional enrichment analysis of the gene modules identified by svdPPCS
| Category | Term | Count | pValue | FDR |
|---|---|---|---|---|
| M1 | ||||
| CC | GO:0044444~cytoplasmic part | 15 | 4.53E-05 | 6.67E-04 |
| CC | GO:0032991~macromolecular complex | 14 | 1.80E-04 | 2.64E-03 |
| CC | GO:0005811~lipid particle | 5 | 2.37E-03 | 3.44E-02 |
| CC | GO:0043234~protein complex | 11 | 2.83E-03 | 4.08E-02 |
| CC | GO:0044446~intracellular organelle part | 12 | 4.96E-03 | 7.07E-02 |
| M2 | ||||
| MF | GO:0003735~structural constituent of ribosome | 42 | 2.73E-12 | 4.54E-11 |
| CC | GO:0030529~ribonucleoprotein complex | 53 | 1.70E-09 | 2.50E-08 |
| BP | GO:0009059~macromolecule biosynthetic process | 60 | 5.76E-09 | 1.03E-07 |
| BP | GO:0006412~translation | 52 | 1.71E-08 | 3.05E-07 |
| CC | GO:0044432~endoplasmic reticulum part | 15 | 5.71E-07 | 8.40E-06 |
| CC | GO:0005783~endoplasmic reticulum | 30 | 1.71E-06 | 2.51E-05 |
| MF | GO:0005198~structural molecule activity | 50 | 2.64E-06 | 4.39E-05 |
| CC | GO:0044444~cytoplasmic part | 120 | 4.06E-06 | 5.98E-05 |
| BP | GO:0009058~biosynthetic process | 78 | 3.50E-06 | 6.25E-05 |
| M3 | ||||
| MF | GO:0016491~oxidoreductase activity | 26 | 3.75E-08 | 6.23E-07 |
| BP | GO:0006091~generation of precursor metabolites and energy | 21 | 4.74E-08 | 8.46E-07 |
| BP | GO:0006118~electron transport | 17 | 2.59E-07 | 4.63E-06 |
| CC | GO:0044429~mitochondrial part | 19 | 3.65E-06 | 5.37E-05 |
| BP | GO:0042775~organelle ATP synthesis coupled electron transport | 9 | 5.57E-06 | 9.96E-05 |
| BP | GO:0006119~oxidative phosphorylation | 11 | 3.22E-05 | 5.76E-04 |
| CC | GO:0005743~mitochondrial inner membrane | 11 | 1.19E-04 | 1.74E-03 |
| CC | GO:0005740~mitochondrial envelope | 12 | 2.15E-04 | 3.17E-03 |
| CC | GO:0005746~mitochondrial respiratory chain | 7 | 5.32E-04 | 7.81E-03 |
| M4 | ||||
| BP | GO:0006259~DNA metabolic process | 9 | 7.59E-04 | 1.35E-02 |
| BP | GO:0043285~biopolymer catabolic process | 6 | 3.14E-03 | 5.46E-02 |
| BP | GO:0006325~establishment and/or maintenance of chromatin architecture | 6 | 3.32E-03 | 5.76E-02 |
| BP | GO:0006323~DNA packaging | 6 | 3.32E-03 | 5.76E-02 |
| BP | GO:0006464~protein modification process | 12 | 3.44E-03 | 5.97E-02 |
| BP | GO:0043412~biopolymer modification | 12 | 4.70E-03 | 8.07E-02 |
| M5 | ||||
| BP | GO:0007010~cytoskeleton organization and biogenesis | 57 | 5.19E-11 | 9.28E-10 |
| BP | GO:0030029~actin filament-based process | 30 | 7.43E-11 | 1.33E-09 |
| BP | GO:0030036~actin cytoskeleton organization and biogenesis | 29 | 1.90E-10 | 3.39E-09 |
| CC | GO:0005856~cytoskeleton | 41 | 1.29E-08 | 1.90E-07 |
| CC | GO:0044430~cytoskeletal part | 34 | 5.62E-08 | 8.27E-07 |
| CC | GO:0015629~actin cytoskeleton | 20 | 1.49E-07 | 2.19E-06 |
| BP | GO:0006996~organelle organization and biogenesis | 83 | 1.40E-07 | 2.50E-06 |
| BP | GO:0048869~cellular developmental process | 85 | 1.43E-07 | 2.55E-06 |
| BP | GO:0007015~actin filament organization | 17 | 3.96E-07 | 7.08E-06 |
| M6 | ||||
| MF | GO:0032555~purine ribonucleotide binding | 32 | 3.22E-05 | 5.34E-04 |
| MF | GO:0017076~purine nucleotide binding | 32 | 5.81E-05 | 9.64E-04 |
| MF | GO:0005524~ATP binding | 27 | 7.40E-05 | 1.23E-03 |
| MF | GO:0032559~adenyl ribonucleotide binding | 27 | 7.40E-05 | 1.23E-03 |
| MF | GO:0030554~adenyl nucleotide binding | 27 | 1.39E-04 | 2.30E-03 |
| MF | GO:0016876~ligase activity, forming aminoacyl-tRNA and related component | 7 | 2.28E-04 | 3.77E-03 |
| MF | GO:0016875~ligase activity, forming carbon-oxygen bonds | 7 | 2.28E-04 | 3.77E-03 |
| MF | GO:0004812~aminoacyl-tRNA ligase activity | 7 | 2.28E-04 | 3.77E-03 |
| BP | GO:0043039~tRNA aminoacylation | 7 | 3.02E-04 | 5.39E-03 |
| M7 | ||||
| BP | GO:0019752~carboxylic acid metabolic process | 3 | 2.68E-03 | 4.68E-02 |
Comparison of the results from svdPPCS and regression analysisa
| Module | |||||
|---|---|---|---|---|---|
| G1/M1 | 25 | 26 | 12 | 0.08 | 0.28 |
| G2/M2 | 255 | 406 | 196 | 0.09 | 0.06 |
| G3/M3 | 146 | 114 | 73 | 0.24 | 0.27 |
| G4/M4 | 27 | 53 | 17 | 0.04 | 0.11 |
| G5/M5 | 352 | 516 | 306 | 0.19 | 0.21 |
| G6/M6 | 173 | 174 | 102 | 0.03 | 0.15 |
| G7/M7 | 8 | 5 | 2 | 0 | 0.40 |
| G8/M8 | 17 | 13 | 6 | 0.58 | 0 |
aNreg: the number of genes in the module identified by regression analysis. Nsvd: the number of genes in the modules identified by svdPPCS. Nreg-svd: the number of genes common in the regression module and the svdPPCS module. Ireg: the information index in the regression module. Isvd: the information index in svdPPCS module.
Figure 4Simulation of gene expression patterns and the recognition through the profiles of the right singular vectors (eigengenes) of the simulated human and mouse data sets. A: Four statistical curves used in the simulation. B: The plots of the four eigengenes of the human data set versus 18 time points. C: The plots of the four eigengenes of the mouse data set versus 18 time points.
Comparison between simulated groups and identified modules
| Group | Module | N1 | N2 | N3 |
|---|---|---|---|---|
| C1 | CL1 | 30 | 30 | 30 |
| C2 | CL2 | 30 | 34 | 24 |
| C3 | CL3 | 30 | 29 | 23 |
| C4 | CL4 | 30 | 30 | 27 |
| C5 | CL5 | 30 | 35 | 29 |
| C6 | CL6 | 30 | 34 | 30 |
aN1: The number of genes in the simulated group; N2: The number of genes in the identified module; N3: The numbers of genes in both the simulated group and the identified module.
Figure 5Co-expression gene patterns of the modules from the simulated human (H) and mouse (M) time series microarray data sets. The curves were polished using a smooth spline technique.