| Literature DB >> 19105848 |
Abstract
BACKGROUND: Gene expression is controlled over a wide range at the transcript level through complex interplay between DNA and regulatory proteins, resulting in profiles of gene expression that can be represented as normal, graded, and bimodal (switch-like) distributions. We have previously performed genome-scale identification and annotation of genes with switch-like expression at the transcript level in mouse, using large microarray datasets for healthy tissue, in order to study the cellular pathways and regulatory mechanisms involving this class of genes. We showed that a large population of bimodal mouse genes encoding for cell membrane and extracellular matrix proteins is involved in communication pathways. This study expands on previous results by annotating human bimodal genes, investigating their correspondence to bimodality in mouse orthologs and exploring possible regulatory mechanisms that contribute to bimodality in gene expression in human and mouse.Entities:
Mesh:
Substances:
Year: 2008 PMID: 19105848 PMCID: PMC2631022 DOI: 10.1186/1471-2164-9-628
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Microarray datasets used in this study representing normal human tissue.
| Tissue | Samples | (Used in mouse study) |
| adipose | 10 | (6) |
| adrenal | 10 | (6) |
| brain | 89 | (89) |
| colon | 10 | (5) |
| epidermal | 25 | (25) |
| heart | 38 | (38) |
| kidney | 10 | (3) |
| liver | 10 | (8) |
| lung | 26 | (26) |
| mammary | 15 | (15) |
| muscle | 64 | (64) |
| ovary | 10 | (10) |
| pancreas | 6 | (5) |
| peripheral_blood | 12 | (12) |
| small_intestine | 7 | (3) |
| spleen | 12 | (12) |
| stomach | 10 | (1) |
| testis | 38 | (49) |
| thymus | 5 | (11) |
| total | 407 | (388) |
The number of tissue samples used in the previous work that identified bimodal genes in mouse are included in parentheses. Sample accession numbers from GEO and ArrayExpress are listed in Additional File 1.
KEGG pathway enrichment for human switch-like genes.
| Human bimodal genes | Mouse bimodal genes | |||||||
| KEGG Pathway | Genes Observed | Genes Expected | Ration of Enrichment | P-values | Genes Observed | Genes Expected | Ration of Enrichment | P-values |
| Cell adhesion molecules (CAMs) | 36 | 16.81 | 2.14 | 4.31E-06 | ||||
| Regulation of actin cytoskeleton | 51 | 27.54 | 1.85 | 6.10E-06 | ||||
| Calcium signaling pathway | 44 | 23.31 | 1.89 | 1.54E-05 | ||||
| Citrate cycle (TCA cycle) | 11 | 3.53 | 3.12 | 2.85E-04 | ||||
| Glycolysis/Gluconeogenesis | 18 | 7.91 | 2.28 | 4.64E-04 | ||||
| Long-term potentiation | 20 | 9.32 | 2.15 | 5.47E-04 | ||||
| Leukocyte transendothelial migration | 28 | 15.11 | 1.85 | 7.12E-04 | ||||
| Carbon fixation | 9 | 2.97 | 3.03 | 1.29E-03 | ||||
| Adherens junction | 20 | 10.17 | 1.97 | 1.81E-03 | ||||
| Gap junction | 24 | 13.14 | 1.83 | 2.03E-03 | ||||
| Melanogenesis | 23 | 12.57 | 1.83 | 2.44E-03 | ||||
| Type I diabetes mellitus | 13 | 5.65 | 2.30 | 2.49E-03 | ||||
| Alanine and aspartate metabolism | 10 | 3.96 | 2.53 | 3.57E-03 | ||||
| Neurodegenerative Diseases | 12 | 5.23 | 2.30 | 3.67E-03 | ||||
| MAPK signaling pathway | 50 | 34.61 | 1.44 | 4.14E-03 | ||||
| Reductive carboxylate cycle (CO2 fixation) | 5 | 1.27 | 3.93 | 4.29E-03 | ||||
| Endometrial cancer | 14 | 7.06 | 1.98 | 7.76E-03 | ||||
| Pyruvate metabolism | 11 | 5.09 | 2.16 | 8.75E-03 | ||||
| Long-term depression | 17 | 9.12 | 1.86 | 6.16E-03 | ||||
| Cell Communication | 34 | 15.43 | 2.20 | 2.47E-06 | ||||
| Insulin signaling pathway | 26 | 15.93 | 1.63 | 6.19E-03 | ||||
| Fructose and mannose metabolism | 12 | 5.31 | 2.26 | 3.63E-03 | ||||
| Cysteine metabolism | 4 | 1.00 | 4.02 | 8.53E-03 | ||||
KEGG pathways enrichment was computed using the set of human bimodal genes as well as previously identified bimodal genes in mouse. Pathways enriched with p ≤ 0.01 in both bimodal human genes and the set of bimodal mouse genes are shown in bold. Italicized values do not meet the p ≤ 0.01 significance threshold.
Gene ontology enrichment for human switch-like genes.
| Human bimodal genes | Mouse bimodal genes | ||||||||
| Gene Ontology | Genes Observed | Genes Expected | Ration of Enrichment | P-values | Genes Observed | Genes Expected | Ration of Enrichment | P-values | |
| CC | |||||||||
| striated muscle thick filament | 12 | 2.26 | 5.31 | 6.39E-08 | |||||
| muscle myosin complex | 12 | 2.54 | 4.72 | 4.94E-07 | |||||
| actin cytoskeleton | 36 | 15.68 | 2.30 | 6.87E-07 | |||||
| myosin complex | 21 | 6.92 | 3.03 | 9.19E-07 | |||||
| neuromuscular junction | 6 | 0.85 | 7.08 | 7.89E-06 | |||||
| Z disc | 9 | 1.98 | 4.55 | 2.24E-05 | |||||
| membrane fraction | 97 | 66.25 | 1.46 | 5.02E-05 | |||||
| growth cone | 10 | 2.68 | 3.73 | 8.41E-05 | |||||
| plasma membrane | 239 | 192.10 | 1.24 | 9.90E-05 | |||||
| cell projection | 23 | 11.02 | 2.09 | 3.38E-04 | |||||
| costamere | 4 | 0.57 | 7.08 | 3.97E-04 | |||||
| clathrin coat of trans-Golgi network vesicle | 6 | 1.27 | 4.72 | 4.51E-04 | |||||
| sarcoplasmic reticulum membrane | 6 | 1.27 | 4.72 | 4.51E-04 | |||||
| integral to plasma membrane | 184 | 148.88 | 1.24 | 9.15E-04 | |||||
| axon | 18 | 7.96 | 2.26 | 3.97E-04 | |||||
| sarcolemma | 13 | 3.82 | 3.41 | 1.51E-05 | |||||
| basement membrane | 17 | 5.81 | 2.93 | 1.12E-05 | |||||
| collagen | 14 | 4.81 | 2.91 | 7.34E-05 | |||||
| postsynaptic membrane | 21 | 10.29 | 2.04 | 6.69E-04 | |||||
| I band | 4 | 0.66 | 6.03 | 7.55E-04 | |||||
| synaptic vesicle membrane | 6 | 1.33 | 4.52 | 4.27E-04 | |||||
| basal lamina | 9 | 2.49 | 3.62 | 1.79E-04 | |||||
| BP | |||||||||
| nervous system development | 79 | 42.38 | 1.86 | 1.36E-08 | |||||
| sensory perception of sound | 33 | 15.26 | 2.16 | 8.44E-06 | |||||
| glycolysis | 19 | 7.06 | 2.69 | 2.44E-05 | |||||
| regulation of heart contraction | 13 | 3.96 | 3.29 | 4.01E-05 | |||||
| antigen processing and presentation of peptide antigen via MHC class I | 7 | 1.41 | 4.96 | 8.99E-05 | |||||
| dephosphorylation | 24 | 10.88 | 2.21 | 9.77E-05 | |||||
| regulation of striated muscle contraction | 6 | 1.13 | 5.31 | 1.71E-04 | |||||
| central nervous system development | 25 | 12.01 | 2.08 | 1.99E-04 | |||||
| cell differentiation | 72 | 47.88 | 1.50 | 1.99E-04 | |||||
| multicellular organismal development | 136 | 102.41 | 1.33 | 2.26E-04 | |||||
| regulation of the force of heart contraction | 5 | 0.85 | 5.90 | 2.96E-04 | |||||
| neuromuscular synaptic transmission | 5 | 0.85 | 5.90 | 2.96E-04 | |||||
| neuron migration | 12 | 4.10 | 2.93 | 3.05E-04 | |||||
| neuron differentiation | 13 | 4.66 | 2.79 | 3.08E-04 | |||||
| glycogen metabolic process | 12 | 4.24 | 2.83 | 4.44E-04 | |||||
| protein amino acid dephosphorylation | 29 | 15.54 | 1.87 | 5.06E-04 | |||||
| tricarboxylic acid cycle | 10 | 3.39 | 2.95 | 9.10E-04 | |||||
| very-long-chain fatty acid metabolic process | 5 | 0.99 | 5.06 | 9.16E-04 | |||||
| ion transport | 67 | 44.13 | 1.52 | 1.88E-04 | |||||
| transport | 188 | 153.46 | 1.23 | 9.54E-04 | |||||
| calcium ion transport | 21 | 8.96 | 2.34 | 7.22E-05 | |||||
| synaptic transmission | 21 | 7.96 | 2.64 | 8.49E-06 | |||||
| MF | actin binding | 78 | 34.32 | 2.27 | 0.00E+00 | ||||
| structural constituent of muscle | 28 | 5.65 | 4.96 | 0.00E+00 | |||||
| structural constituent of cytoskeleton | 32 | 12.15 | 2.63 | 8.12E-08 | |||||
| calmodulin binding | 37 | 16.95 | 2.18 | 1.93E-06 | |||||
| GTPase activity | 43 | 24.15 | 1.78 | 8.61E-05 | |||||
| phosphoprotein phosphatase activity | 31 | 15.96 | 1.94 | 1.52E-04 | |||||
| phosphoric monoester hydrolase activity | 25 | 11.87 | 2.11 | 1.61E-04 | |||||
| microfilament motor activity | 9 | 2.40 | 3.75 | 1.82E-04 | |||||
| NAD binding | 13 | 4.52 | 2.88 | 2.14E-04 | |||||
| transferase activity, transferring phosphorus-containing groups | 7 | 1.55 | 4.51 | 2.17E-04 | |||||
| protein tyrosine phosphatase activity | 25 | 12.43 | 2.01 | 3.59E-04 | |||||
| transporter activity | 61 | 40.68 | 1.50 | 6.35E-04 | |||||
| long-chain-fatty-acid-CoA ligase activity | 6 | 1.41 | 4.25 | 9.92E-04 | |||||
| creatine kinase activity | 4 | 0.66 | 6.03 | 7.55E-04 | |||||
| extracellular matrix structural constituent | 18 | 6.80 | 2.65 | 3.52E-05 | |||||
| ion channel activity | 46 | 25.38 | 1.81 | 2.17E-05 | |||||
GO category enrichment was computed using the set of human bimodal genes as well as the previously identified bimodal genes in mouse. GO terms enriched with p ≤ 0.001 in both bimodal human genes and the set of bimodal mouse genes are shown in bold. Italicized values do not meet the p ≤ 0.001 significance threshold
Figure 1ECM-receptor interaction pathway enriched by human switch-like genes. Nodes enriched for human bimodal genes are colored orange, while nodes also identified as bimodal in mouse orthologs are outlined in bold. In all, the overlap between bimodal human and bimodal mouse orthologs contains thirteen unique genes represented in seven unique nodes in the ECM-receptor pathway. Nodes colored in gray were not identified as bimodal, while white nodes are used for genes that are not represented on the HGU133A array.
Figure 2KEGG Focal adhesion pathway enriched by human switch-like genes. Nodes enriched for human bimodal genes are colored orange, while nodes also identified as bimodal in mouse orthologs are outlined in bold. In all, the overlap between human and mouse orthologs contains twenty-two unique genes represented in nine unique nodes in the focal adhesion pathway. Nodes colored in gray were not identified as bimodal, while white nodes are used for genes that are not represented on the HGU133A array.
KEGG pathways enriched for bimodal genes with "high" or "low" expression within brain, heart, and skeletal muscle tissues in human and mouse.
| Brain | Heart | Skeletal Muscle | ||||
| Human | Mouse | Human | Mouse | Human | Mouse | |
| Calcium signaling pathway | high | |||||
| Carbon fixation | high | High | high | |||
| Cell Communication | low | low | high | High | high | |
| Cell adhesion molecules (CAMs) | low | |||||
| Citrate cycle (TCA cycle) | high | high | ||||
| ECM-receptor interaction | low | low | high | high | high | high |
| Focal adhesion | low | low | high | high | high | high |
| Gap junction | high | high | low | low | low | |
| Glycolysis/Gluconeogenesis | high | high | high | high | ||
| Insulin signaling pathway | high | |||||
| Leukocyte transendothelial migration | high | |||||
| Long-term depression | high | low | low | low | low | |
| Long-term potentiation | high | low | ||||
| Neurodegenerative Diseases | high | |||||
| Pyruvate metabolism | high | high | high | |||
| Reductive carboxylate cycle (CO2 fixation) | high | high | ||||
| Regulation of actin cytoskeleton | high | |||||
| Type I diabetes mellitus | low | |||||
The mode of expression is indicated for tissue types where pathways are significantly enriched for the set of bimodal genes in the "high" mode or the set of bimodal genes in the "low" mode of expression. Significance was determined from a hypergeometric distribution with cutoff p = 0.05.
Gene Ontology terms enriched for bimodal genes with "high" or "low" expression within brain, heart, and skeletal muscle tissues in human and mouse.
| Brain | Heart | Skeletal Muscle | |||||
| Human | Mouse | Human | Mouse | Human | Mouse | ||
| CC | actin cytoskeleton | high | high | high | |||
| axon | high | high | low | low | low | low | |
| basal lamina | low | ||||||
| basement membrane | low | high | high | ||||
| cell junction | high | high | low | low | low | ||
| cell projection | high | high | |||||
| collagen | low | low | high | high | high | ||
| cytoplasm | high | high | high | ||||
| cytoskeleton | low | high | high | high | high | ||
| growth cone | high | high | low | ||||
| I band | high | ||||||
| integral to plasma membrane | low | ||||||
| membrane | high | low | |||||
| membrane fraction | low | low | |||||
| muscle myosin complex | low | high | |||||
| myosin complex | low | low | high | ||||
| plasma membrane | low | ||||||
| postsynaptic membrane | high | high | low | low | low | ||
| proteinaceous extracellular matrix | low | low | high | high | high | ||
| sarcolemma | low | high | high | ||||
| sarcoplasmic reticulum | low | low | high | high | high | ||
| striated muscle thick filament | low | high | |||||
| synapse | high | high | low | low | low | low | |
| synaptic vesicle membrane | low | ||||||
| troponin complex | low | high | high | ||||
| Z disc | low | low | high | high | high | high | |
| BP | antigen processing and presentation of peptide antigen via MHC class I | low | high | high | |||
| cell adhesion | low | high | high | high | |||
| cell differentiation | high | ||||||
| central nervous system development | high | low | |||||
| cytoskeleton organization and biogenesis | high | ||||||
| glycogen metabolic process | low | high | high | ||||
| glycolysis | high | high | high | ||||
| ion transport | high | high | low | low | |||
| muscle contraction | low | low | high | high | high | ||
| muscle development | low | high | high | ||||
| nervous system development | high | high | low | low | |||
| neuron differentiation | high | low | |||||
| neuron migration | high | ||||||
| regulation of heart contraction | high | high | |||||
| regulation of muscle contraction | low | high | |||||
| regulation of striated muscle contraction | high | ||||||
| regulation of the force of heart contraction | high | high | high | ||||
| sensory perception of sound | high | high | |||||
| striated muscle contraction | low | low | high | high | |||
| synaptic transmission | high | high | low | low | low | low | |
| transport | high | high | low | low | |||
| tricarboxylic acid cycle | high | high | high | high | |||
| MF | actin binding | high | high | high | |||
| calcium ion binding | high | high | high | ||||
| extracellular matrix structural constituent | low | low | high | high | high | ||
| GTPase activity | high | ||||||
| ion channel activity | high | low | low | ||||
| microfilament motor activity | high | ||||||
| NAD binding | high | ||||||
| protein binding | low | ||||||
| structural constituent of muscle | low | low | high | high | |||
| structural molecule activity | low | high | |||||
The mode of expression is indicated for tissue types where terms are significantly enriched for the set of bimodal genes in the "high" mode or the set of bimodal genes in the "low" mode of expression. Significance was determined from a hypergeometric distribution with cutoff p = 0.05.
Figure 3Summary of promoter usage between bimodal and non-bimodal subsets. The relative frequency of core promoter types cataloged in MPromDB [13] is shown for bimodal and non-bimodal gene subsets in A) human and B) mouse. The number of alternative promoters per gene is shown for bimodal genes and non-bimodal genes for C) human and D) mouse. For a subset of bimodal genes with multiple alternative promoters, tissue-dependent alternative promoters from DBTSS [14] corresponded to the mode of expression, as shown for glutamate receptors E) GRIA2 in human and F) Gria1 in mouse.
Figure 4Bimodal gene enrichment for promoter region methylation of lysine 4 of histone H3 (H3K4me3). A) Fraction of bimodal vs. non-bimodal genes enriched for histone methylation within their promoters as reported by Guenther et al. [12] for H9 hES cells, liver (hepatocytes), and B-cells. The fourth and fifth sets of bars represent the set of genes enriched at high confidence within one tissue but not the other liver versus hES cells and B-cell versus hES cells. B) H3K4me3 enrichment ratio from Guenther et al. [12] for liver vs. stem cells is shown for bimodal genes. Genes expressed with "high" mode in liver and "low" mode in H9 stem cells are shown with green "+" symbols, while bimodal genes expressed with "low" mode in liver and "high" mode in stem cells are shown with blue "x" symbols. Black points are used for the remaining bimodal genes expressed in common modes between these two tissue types. The standard deviation around the line y = x (solid red line) is shown as dashed red lines.
Figure 5Bimodal gene expression. The histogram and normal mixture probability density function (pdf) are shown for a typical bimodal gene. The threshold between the high and low mode of expression is labeled as XT and darker shading is used to represent the misclassification region.