| Literature DB >> 28448511 |
Rajat Anand1, Samrat Chatterjee1.
Abstract
Metabolic disorders such as obesity and diabetes are diseases which develop gradually over time through the perturbations of biological processes. These perturbed biological processes usually work in an interdependent way. Systematic experiments tracking disease progression at gene level are usually conducted through a temporal microarray data. There is a need for developing methods to analyze such highly complex data to capture disease progression at the molecular level. In the present study, we have considered temporal microarray data from an experiment conducted to study development of obesity and diabetes in mice. We first constructed a network between biological processes through common genes. We analyzed the data to obtain perturbed biological processes at each time point. Finally, we used the biological process network to find links between these perturbed biological processes. This enabled us to identify paths linking initial perturbed processes with final perturbed processes which capture disease progression. Using different datasets and statistical tests, we established that these paths are highly precise to the dataset from which these are obtained. We also established that the connecting genes present in these paths might contain some biological information and thus can be used for further mechanistic studies. The methods developed in our study are also applicable to a broad array of temporal data.Entities:
Mesh:
Year: 2017 PMID: 28448511 PMCID: PMC5407620 DOI: 10.1371/journal.pone.0176172
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Gene set enrichment analysis to find perturbed biological processes at each time point.
(A) Heatmap of the microarray data is shown with x axis as time points, y axis as genes (19303 in total) and color representing fold change (log2 transformed). The heatmap is shown after hierarchical clustering of the data using an algorithm by eisen et al. [5]. (B) An example of gene set enrichment analysis method is shown for a process receiving high nes value at Day 1. The running sum used to calculate es values is shown along with instances of genes of the biological process along the ordered gene list. The ordered gene list is obtained by sorting all the genes according to the absolute log fold change values of genes at Day 1. The absolute log fold change values are also shown. The process contains 10 genes and is highly perturbed at Day 1. (C) Ordering the genes (total 19303) according to fold change values at each time point gives different gene lists for each time point. For each such gene list and each gene set in our database of biological processes, we use the procedure given in (B) to obtain es, nes and pvalues. (D) The nes values obtained using procedure in (B) and (C) for gene lists for each time point and 816 gene sets can be represented in matrix form and is shown as a heatmap.
Enriched biological processes.
| Time | GO Biological Process (Number of genes, nes value, P-value) | Time | GO Biological Process (Number of genes, nes value, P-value) |
|---|---|---|---|
'preassembly of GPI anchor in ER membrane (GO:0016254)' (16, 2.12, 0) 'negative regulation of translational initiation (GO:0045947)' (18, 1.83, 0.003) 'purineribonucleoside bisphosphate metabolic process (GO:0034035)' (17, 1.82, 0.001) '3''-phosphoadenosine 5''-phosphosulfate metabolic process (GO:0050427)' (17, 1.82,0.001) 'hydrogen peroxide catabolic process (GO:0042744)' (20, 1.80, 0.002) | 'termination of RNA polymerase I transcription (GO:0006363)' (24, 2.20, 0.005) 'pseudouridine synthesis (GO:0001522)' (17, 2.12, 0.013) 'bile acid biosynthetic process (GO:0006699)' (21, 2.08, 0) 'SCF-dependent proteasomal ubiquitin-dependent protein catabolic process (GO:0031146)' (18, 1.96, 0.001) 'negative regulation of TOR signaling (GO:0032007)' (22, 1.85, 0.001) | ||
'cellular response to interleukin-4 (GO:0071353)' (27, 1.54, 0) 'cellular senescence (GO:0090398)' (28, 1.51, 0) 'T cell homeostasis (GO:0043029)' (25, 1.48, 0.001) 'ER to Golgi vesicle-mediated transport (GO:0006888)' (60, 1.45, 0) 'anterograde synaptic vesicle transport (GO:0048490)' (16, 1.43, 0.038) | 'oxygen transport (GO:0015671)' (16, 1.36, 0.023) 'protein export from nucleus (GO:0006611)' (27, 1.35, 0) 'positive regulation of osteoclast differentiation (GO:0045672)' (22, 1.34, 0.001) 'cytolysis (GO:0019835)' (22, 1.34, 0) 'retrograde vesicle-mediated transport, Golgi to ER (GO:0006890)' (25, 1.31, 0.002) | ||
'negative regulation of oxidative stress-induced intrinsic apoptotic signaling pathway(GO:1902176)' (18, 1.43, 0.017) 'nucleosome disassembly (GO:0006337)' (17, 1.39, 0.011) 'protein-DNA complex disassembly (GO:0032986)' (17, 1.39, 0.011)'formation of translation preinitiation complex (GO:0001731)' (19, 1.38, 0.046) 'positive regulation of release of cytochrome c from mitochondria (GO:0090200)' (26, 1.37, 0.015) | 'cellular response to ammonium ion (GO:0071242)' (17, 1.87, 0) 'neurotransmitter secretion (GO:0007269)' (57, 1.78, 0) 'phosphatidylinositol acyl-chain remodeling (GO:0036149)' (16, 1.78, 0.004) 'glutamate secretion (GO:0014047)' (18, 1.78, 0.003) 'regulation of sensory perception of pain (GO:0051930)' (23, 1.78, 0) | ||
'positive regulation of protein dephosphorylation (GO:0035307)' (16,1.63, 0.013) 'DNA integration (GO:0015074)' (20, 1.63, 0.121) 'positive regulation of acute inflammatory response (GO:0002675)' (26, 1.49, 0.005) 'nucleotide-binding oligomerization domain containing signaling pathway (GO:0070423)'(28, 1.44, 0.025) 'cellular glucuronidation (GO:0052695)' (17, 1.40, 0.148) | 'modulation of growth of symbiont involved in interaction with host (GO:0044144)' (17, 1.41, 0.019) 'negative regulation of growth of symbiont in host (GO:0044130)' (17, 1.41, 0.019) 'regulation of growth of symbiont in host (GO:0044126)' (17, 1.41, 0.019) 'negative regulation of growth of symbiont involved in interaction with host (GO:0044146)' (17, 1.41, 0.019) 'chromatin silencing (GO:0006342)' (21, 1.37, 0.008) | ||
'Arp2/3 complex-mediated actin nucleation (GO:0034314)' (17, 1.87, 0.045) 'positive regulation of synaptic transmission, glutamatergic (GO:0051968)' (18, 1.78, 0.005) 'double-strand break repair via nonhomologous end joining (GO:0006303)' (18, 1.72, 0.065) 'non-recombinational repair (GO:0000726)' (18, 1.73, 0.065) 'immunoglobulin mediated immune response (GO:0016064)' (16, 1.72, 0.023) | 'regulation of peptidyl-serine phosphorylation of STAT protein (GO:0033139)' (18, 1.27, 0.079) 'positiveregulation of peptidyl-serine phosphorylation of STAT protein (GO:0033141)' (18, 1.27, 0.079) 'mitochondrial respiratory chain complex I assembly (GO:0032981)' (16, 1.24, 0.06) 'mitochondrial respiratory chain complex I biogenesis (GO:0097031)' (16, 1.24, 0.06) 'NADH dehydrogenase complex assembly (GO:0010257)' (16, 1.24, 0.06) |
Fig 2Biological Process network and finding perturbed paths in the network.
(A) Network of connected biological processes is shown with 816 nodes and 51549 edges. Node size is proportional to number of genes in the corresponding process. In the middle inset, the distribution of number of processes (normalized to total number) with size within a bin is plotted against the starting of the bin s. In the rightmost inset, the distribution of number of processes (normalized to total number) with degree within a bin is plotted against the starting of the bin k. (B) A schematic is shown where overlaying the nes matrix on the network of biological processes gives 10 versions of the network. (C) Systematic procedure is shown where the circled process 1 is first selected having highest nes value among all processes at 1st time point. Then, the nes values of its connected processes at 2nd time point are checked and the one with highest nes value is selected as shown as circled process 2 in network at 2nd time point. This gives a path of 2 nodes going from process 1 to process 2. Repeating this process for further networks at 3rd, 4th etc. time point in (B) gives a path of 10 nodes. (D) The average nes of paths when generated randomly as well as when generated using systematic procedure shows that paths generated using systematic procedure has paths perturbed with high average nes values.
Two example paths which start from different processes and converge to same process at Day 35.
| Time point | Path a | Path b |
|---|---|---|
| Day 1 | 'preassembly of GPI anchor in ER membrane (GO:0016254)' | 'negative regulation of translational initiation (GO:0045947)' |
| Day 6 | 'mannosylation (GO:0097502)' | 'nuclear-transcribed mRNA catabolic process, nonsense-mediated decay (GO:0000184)' |
| Day 10 | 'dolichol-linked oligosaccharide biosynthetic process (GO:0006488)' | 'formation of translation preinitiation complex (GO:0001731)' |
| Day 14 | 'glutamine metabolic process (GO:0006541)' | 'translation (GO:0006412)' |
| Day 35 | 'positive regulation of synaptic transmission, glutamatergic (GO:0051968)' | 'positive regulation of synaptic transmission, glutamatergic (GO:0051968)' |
| Day 56 | 'glutamine metabolic process (GO:0006541)' | 'glutamine metabolic process (GO:0006541)' |
| Day 77 | 'protein ADP-ribosylation (GO:0006471)' | 'protein ADP-ribosylation (GO:0006471)' |
| Day 98 | 'substantia nigra development (GO:0021762)' | 'substantia nigra development (GO:0021762)' |
| Day 119 | 'chromatin silencing (GO:0006342)' | 'chromatin silencing (GO:0006342)' |
| Day 140 | 'cellular response to glucose starvation (GO:0042149)' | 'cellular response to glucose starvation (GO:0042149)' |
Fig 3Significant overlap between set of perturbed paths.
(A,B,C) In left panels, the visual display of set of 1024 paths are shown with edge thickness proportional to the overlap factor of corresponding edge and node size proportional to number of paths with same node. In middle panels, the distribution of mean nes value of 1024 paths is shown and in right panels, the distribution of overlap factor of 1024 paths is shown. For the set of 1024 perturbed paths in A, the overlap between paths can clearly be seen (left panel) with distribution of mean nes values of paths around 1.42 nes values. (Distribution of mean nes values is same as that shown in Fig 2C systematic procedure). Compare this with distribution of mean nes values of set of unperturbed paths in (B) middle panel showing higher nes values of perturbed paths as expected. The overlap factor of perturbed paths is around 380 (A, right panel) while that for unperturbed paths is around 340 (B, right panel) and this difference is statistically significant. This difference of overlap factors is also seen in visual display of paths in (A,B) left panels with thicker edges in plots of perturbed paths. Compare the overlap factors with that for random paths with no or very little overlap as seen in distribution (C, right panel) and also seen in visualization (C, left panel). (D) The mean of the distribution of mean nes values of 1024 randomly selected paths were calculated and this procedure repeated for 10000 sets of 1024 paths and this distribution is plotted in (D, upper panel). The mean nes values of set is low as also seen in distribution of mean nes values of a single set of 1024 randomly selected paths in (C, middle panel). Similar procedure was applied for overlap factors showing that for many sets of 1024 randomly selected paths, the mean of overlap factors of set remains around 1 as seen for a single set of 1024 randomly selected paths in (C, right panel).
Two paths are shown in path 1 column with same processes till Day 119 and two different processes at Day 140.
Same for Path2. Two paths in Path1 and in Path2 pair up to give 4 pairs of paths representing 80% of all paths.
| Time point | Path1 | Path2 | ||
|---|---|---|---|---|
| Process Name | Gene perturbed, abs fold change | Process Name | Gene perturbed, abs fold change | |
| Day 1 | preassembly of GPI anchor in ER membrane. | PIGC, ~32 fold | 'negative regulation of translational initiation (GO:0045947)' | 'EIF2AK1', 'RPL13A', >5 fold |
| Day 6 | GPI anchor biosynthetic process | 'PIGV', ~4 fold | 'nuclear-transcribed mRNA catabolic process, nonsense-mediated decay ' | 'DCP1A', 'PARN', 'RPL31', 'RPS15A', > 2fold |
| Day 10 | dolichol-linked oligosaccharide biosynthetic process | 'ALG5', 'MVD', 'PMM1', 'SRD5A3', >2fold | 'I-kappaB kinase/NF-kappaB signaling ' | 'NLRC3', 'UBA52', > 2 fold |
| Day 14 | glutamine metabolic process | 'ASNS', 'CAD', 'MECP2', 'PFAS', 'PHGDH', >2 fold | 'nucleotide-binding oligomerization domain containing signaling pathway ' | 'CARD9', 'CHUK', 'MAP2K6', 'TNFAIP3', 'UBA52', >2 fold |
| Day 35 | positive regulation of synaptic transmission, glutamatergic | 'ADCYAP1', 'GLUL', 'NLGN2', 'NLGN3', 'NRXN1', 'NTRK1', 'NTRK2', 'OXTR', 'PTK2B', 'SHANK3', 'TNR', >2fold | 'innate immune response in mucosa | ‘CAMP', 'DEFA1', 'DEFB1', 'HIST1H2BC', 'HIST1H2BK', 'LTF', 'RPL39', > 2 fold |
| Day 56 | positive regulation of blood vessel endothelial cell migration | 'AKT1', 'ANGPT1', 'FGF2', 'HDAC9', 'HSPB1', 'PDGFB', 'PRKD1', 'PTGS2', 'THBS1', >2 fold | 'modulation of growth of symbiont involved in interaction with host ' | 'CAMP', 'IFNG', 'LBP', 'LTA', 'MPO', > 2 fold |
| Day 77 | positive regulation of peptidyl-threonine phosphorylation | 'CHI3L1', >6 fold | 'positive regulation of osteoclast differentiation ' | 'FOS', >25 fold |
| Day 98 | positive regulation of protein dephosphorylation | 'ADORA1', 'CALM1', 'DUSP26', 'PPP2R5A', 'PRKCD', >2 fold | 'bicarbonate transport ' | 'AQP1', 'SLC4A10', 'SLC4A3', 'SLC4A4', 'SLC4A7', 'SLC4A8', 'SLC4A9', > 2 fold |
| Day 119 | lymph node development | 'CD248', 'CXCL13', 'CXCR5', 'FADD', 'IL15', 'LTB', 'NKX2-3', 'RORC', >2 fold | 'hydrogen peroxide catabolic process ' | 'APOA4', 'CAT', 'MPO', 'TPO', >2 fold |
| Day 140 | lung alveolus development/myeloid dendritic cell differentiation | 'SFTPD, >4 fold/ 'CAMK4', 'UBD', >2 fold | 'response to increased oxygen levels (GO:0036296)'/'response to hyperoxia ' | CDKN1A', 'FAS', 'NCF2', > 4 fold |
Fig 4Paths overlapping with 80% paths.
(A,B) Paths are shown as a network with two kinds of node: one representing processes and other genes. Nodes representing processes connects to the genes present in those processes. Numbers present near each process represent its position in the corresponding path and hence the time point. 10 and 10’ implies two different processes and implies that there are two paths with same processes at 1 to 9 time points and two different processes at 10th time point (A) Path1 representing two paths is shown along with the nes values of the processes at corresponding time points in the adjacent graph. The nes values for two processes at 10th time point are same and hence can’t be visually distinguished in graph. (B) Path2 representing two paths is shown along with the nes values of the processes at corresponding time points in the adjacent graph. The nes values for two processes at 10th time point are same and hence can’t be visually distinguished in graph.
Datasets used in the precision analysis.
| No. | Murine tissues under low-fat control diet (LFC), high-fat diet (HFC), and three different doses (5 μg, 20μg and 75 μg) of ethano-botanical formulation Kal-1 with HFC diet. (GSE63178) |
|---|---|
| 1 | brown adipose (GSE63168) |
| 2 | brown infiltrating macrophages (GSE63169) |
| 3 | epididymal adipose (GSE63170) |
| 4 | epididymal infiltrating macrophages (GSE63171) |
| 5 | subcutaneous adipose (GSE63172) |
| 6 | subcutaneous infiltrating macrophages (GSE63173) |
| 7 | Hippocampus (GSE63174) |
| 8 | Liver (GSE63175) |
| 9 | skeletal muscle (GSE63176) |
| 10 | Spleen (GSE63177) |
Fig 5Precision of the selected paths.
(A,B) Distribution of mean nes values of a path in Path1 column and a path in Path2 column from Table 3 showing that the obtained paths have high mean nes values when obtained from liver obesity dataset as compared to when obtained from other temporal datasets showing a 100% precision. (C) Distribution of nes values of topmost perturbed process at Day 6 (Table 1) from liver obesity dataset as compared to when obtained from other temporal datasets showing some datasets give nes values of the given process more than that as obtained from liver obesity dataset giving a precision of 9%.