| Literature DB >> 25435002 |
Abstract
BACKGROUND: High-throughput expression data, such as gene expression and metabolomics data, exhibit modular structures. Groups of features in each module follow a latent factor model, while between modules, the latent factors are quasi-independent. Recovering the latent factors can shed light on the hidden regulation patterns of the expression. The difficulty in detecting such modules and recovering the latent factors lies in the high dimensionality of the data, and the lack of knowledge in module membership.Entities:
Mesh:
Year: 2014 PMID: 25435002 PMCID: PMC4248660 DOI: 10.1186/1471-2105-15-S13-S6
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1An illustration of the procedure using a toy example dataset. The features belong to two modules. One module is controlled by a single factor, and the other controlled by two factors.
Figure 2Simulation results from modular latent structure models. In every simulation, 10 modules, each consisting of 100 simulated genes, were generated. The number of latent factors per module was randomly selected between 1 and 3. The latent factors were either independent Gaussian (two right columns), or randomly chosen from a mixture of four types (two left columns). Gaussian random noise was added to achieve different signal to noise ratios (columns), and different levels of within-module sparsity (proportion of zero loadings) were tested (rows). An additional 1000 pure noise genes were generated from the standard Gaussian distribution. Each simulation setting was repeated 100 times. The success of latent factor recovery was evaluated by the R2 values obtained by the regression of each latent factor against the identified factors assigned to the module to which the latent factor belongs. The relative frequencies (10 equal-sized bins between 0 and 1, equivalent to the histogram) of the R2 values are plotted.
Figure 3The top four factors from the Spellman dataset form two modules. On the x-axis the four time series are displayed in sequential order.
Top 25 GO terms overrepresented by genes associated with factor 1.
| GOBPID | Pvalue | Term |
|---|---|---|
| GO:0042254 | 3.27E-75 | ribosome biogenesis |
| GO:0022613 | 2.04E-68 | ribonucleoprotein complex biogenesis |
| GO:0034470 | 6.20E-67 | ncRNA processing |
| GO:0034660 | 1.31E-65 | ncRNA metabolic process |
| GO:0006364 | 1.55E-60 | rRNA processing |
| GO:0016072 | 1.47E-59 | rRNA metabolic process |
| GO:0006396 | 3.00E-51 | RNA processing |
| GO:0071843 | 1.48E-50 | cellular component biogenesis at cellular level |
| GO:0042273 | 5.74E-33 | ribosomal large subunit biogenesis |
| GO:0000460 | 4.04E-30 | maturation of 5.8S rRNA |
| GO:0000466 | 2.05E-29 | maturation of 5.8S rRNA from tricistronic rRNA transcript (SSU-rRNA, 5.8S rRNA, LSU-rRNA) |
| GO:0016070 | 1.33E-26 | RNA metabolic process |
| GO:0042274 | 4.46E-26 | ribosomal small subunit biogenesis |
| GO:0044085 | 3.25E-25 | cellular component biogenesis |
| GO:0030490 | 1.57E-21 | maturation of SSU-rRNA |
| GO:0000462 | 3.11E-21 | maturation of SSU-rRNA from tricistronic rRNA transcript (SSU-rRNA, 5.8S rRNA, LSU-rRNA) |
| GO:0000469 | 3.59E-21 | cleavage involved in rRNA processing |
| GO:0010467 | 7.58E-21 | gene expression |
| GO:0090304 | 2.29E-16 | nucleic acid metabolic process |
| GO:0009451 | 2.58E-16 | RNA modification |
| GO:0000478 | 8.00E-16 | endonucleolytic cleavage involved in rRNA processing |
| GO:0000479 | 8.00E-16 | endonucleolytic cleavage of tricistronic rRNA transcript (SSU-rRNA, 5.8S rRNA, LSU-rRNA) |
| GO:0006139 | 1.09E-15 | nucleobase-containing compound metabolic process |
| GO:0000447 | 1.37E-15 | endonucleolytic cleavage in ITS1 to separate SSU-rRNA from 5.8S rRNA and LSU-rRNA from tricistronic rRNA transcript (SSU-rRNA, 5.8S rRNA, LSU-rRNA) |
| GO:0000472 | 1.42E-14 | endonucleolytic cleavage to generate mature 5'-end of SSU-rRNA from (SSU-rRNA, 5.8S rRNA, LSU-rRNA) |
Top 25 GO terms overrepresented by genes associated with factor 2.
| GOBPID | Pvalue | Term |
|---|---|---|
| GO:0006511 | 1.99E-09 | ubiquitin-dependent protein catabolic process |
| GO:0019941 | 2.44E-09 | modification-dependent protein catabolic process |
| GO:0044257 | 2.65E-09 | cellular protein catabolic process |
| GO:0051603 | 3.79E-09 | proteolysis involved in cellular protein catabolic process |
| GO:0043632 | 4.21E-09 | modification-dependent macromolecule catabolic process |
| GO:0030163 | 7.26E-09 | protein catabolic process |
| GO:0010499 | 2.24E-08 | proteasomal ubiquitin-independent protein catabolic process |
| GO:0044265 | 9.94E-08 | cellular macromolecule catabolic process |
| GO:0007005 | 1.38E-07 | mitochondrion organization |
| GO:0043623 | 2.59E-07 | cellular protein complex assembly |
| GO:0043248 | 3.56E-07 | proteasome assembly |
| GO:0009057 | 5.19E-07 | macromolecule catabolic process |
| GO:0006508 | 5.25E-07 | proteolysis |
| GO:0071842 | 3.02E-06 | cellular component organization at cellular level |
| GO:0015031 | 4.89E-06 | protein transport |
| GO:0008104 | 6.28E-06 | protein localization |
| GO:0043161 | 6.81E-06 | proteasomal ubiquitin-dependent protein catabolic process |
| GO:0045184 | 7.86E-06 | establishment of protein localization |
| GO:0010498 | 1.01E-05 | proteasomal protein catabolic process |
| GO:0006461 | 1.35E-05 | protein complex assembly |
| GO:0034613 | 1.41E-05 | cellular protein localization |
| GO:0070271 | 2.51E-05 | protein complex biogenesis |
| GO:0070727 | 2.56E-05 | cellular macromolecule localization |
| GO:0034621 | 3.00E-05 | cellular macromolecular complex subunit organization |
| GO:0009987 | 3.93E-05 | cellular process |
Figure 4Expression levels of genes involved in module 2. Genes are hierarchically clustered. The four time series are displayed in sequential order.
Top 25 GO terms overrepresented by genes associated with factor 3.
| GOBPID | Pvalue | Term |
|---|---|---|
| GO:0000278 | 4.02E-19 | mitotic cell cycle |
| GO:0022402 | 4.08E-17 | cell cycle process |
| GO:0007049 | 6.05E-17 | cell cycle |
| GO:0022403 | 5.72E-15 | cell cycle phase |
| GO:0007017 | 8.75E-15 | microtubule-based process |
| GO:0048285 | 2.31E-13 | organelle fission |
| GO:0000280 | 5.08E-13 | nuclear division |
| GO:0051301 | 8.45E-13 | cell division |
| GO:0000226 | 8.50E-13 | microtubule cytoskeleton organization |
| GO:0000087 | 9.22E-13 | M phase of mitotic cell cycle |
| GO:0007067 | 1.39E-12 | mitosis |
| GO:0000279 | 2.71E-09 | M phase |
| GO:0007010 | 1.75E-08 | cytoskeleton organization |
| GO:0007059 | 2.33E-08 | chromosome segregation |
| GO:0030472 | 3.11E-08 | mitotic spindle organization in nucleus |
| GO:0032886 | 3.46E-08 | regulation of microtubule-based process |
| GO:0070507 | 3.46E-08 | regulation of microtubule cytoskeleton organization |
| GO:0016043 | 5.99E-08 | cellular component organization |
| GO:0007051 | 7.48E-08 | spindle organization |
| GO:0051329 | 7.69E-08 | interphase of mitotic cell cycle |
| GO:0007052 | 1.09E-07 | mitotic spindle organization |
| GO:0051325 | 1.12E-07 | interphase |
| GO:0006928 | 1.15E-07 | cellular component movement |
| GO:0007018 | 2.33E-07 | microtubule-based movement |
| GO:0010564 | 6.27E-07 | regulation of cell cycle process |
Top 25 GO terms overrepresented by genes associated with factor 4.
| GOBPID | Pvalue | Term |
|---|---|---|
| GO:0007049 | 5.54E-08 | cell cycle |
| GO:0051301 | 8.21E-08 | cell division |
| GO:0000278 | 7.99E-07 | mitotic cell cycle |
| GO:0000087 | 1.12E-06 | M phase of mitotic cell cycle |
| GO:0022402 | 1.57E-06 | cell cycle process |
| GO:0022403 | 6.92E-06 | cell cycle phase |
| GO:0048285 | 2.16E-05 | organelle fission |
| GO:0000280 | 3.03E-05 | nuclear division |
| GO:0010458 | 3.55E-05 | exit from mitosis |
| GO:0000910 | 3.70E-05 | Cytokinesis |
| GO:0000279 | 4.43E-05 | M phase |
| GO:0007067 | 7.91E-05 | Mitosis |
| GO:0033205 | 0.000523865 | cell cycle cytokinesis |
| GO:0032506 | 0.000572574 | cytokinetic process |
| GO:0010970 | 0.001012686 | microtubule-based transport |
| GO:0030473 | 0.001012686 | nuclear migration along microtubule |
| GO:0030705 | 0.001012686 | cytoskeleton-dependent intracellular transport |
| GO:0072384 | 0.001012686 | organelle transport along microtubule |
| GO:0000114 | 0.00177365 | regulation of transcription involved in G1 phase of mitotic cell cycle |
| GO:0046459 | 0.00185316 | short-chain fatty acid metabolic process |
| GO:0007018 | 0.002062712 | microtubule-based movement |
| GO:0016575 | 0.002062712 | histone deacetylation |
| GO:0032392 | 0.002399826 | DNA geometric change |
| GO:0010696 | 0.002438214 | positive regulation of spindle pole body separation |
| GO:0007097 | 0.002544351 | nuclear migration |