| Literature DB >> 29784986 |
Mohamed Hamed1, Yvonne Gladbach1, Steffen Möller1, Sarah Fischer1, Mathias Ernst1, Stephan Struckmann1, Alexander Storch2, Georg Fuellen3.
Abstract
The volume of molecular observations on human diseases in public databases is continuously increasing at accelerating rates. A bottleneck is their computational integration into a coherent description, from which researchers may derive new well-founded hypotheses. Also, the need to integrate data from different technologies (genetics, coding and regulatory RNA, proteomics) emerged in order to identify biomarkers for early diagnosis and prognosis of complex diseases and therefore facilitating the development of novel treatment approaches. We propose here a workflow for the integrative transcriptomic description of the molecular pathology in Parkinsons's Disease (PD), including suggestions of compounds normalizing disease-induced transcriptional changes as a paradigmatic example. We integrated gene expression profiles, miRNA signatures, and publicly available regulatory databases to specify a partial model of the molecular pathophysiology of PD. Six genetic driver elements (2 genes and 4 miRNAs) and several functional network modules that are associated with PD were identified. Functional modules were assessed for their statistical significance, cellular functional homogeneity, literature evidence, and normalizing small molecules. In summary, our workflow for the joint regulatory analysis of coding and non-coding RNA, has the potential to yield clinically as well as biologically relevant information, as demonstrated here on PD data.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29784986 PMCID: PMC5962550 DOI: 10.1038/s41598-018-25754-5
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1A schematic diagram for the integrative transcriptomic workflow. The sketch describes data processing and integration of two different transcriptomic datasets to detect major determinants and functional modules controlling PD.
Figure 2Differential analysis of gene expression and sample clustering. (a) The heatmap of the expression patterns of the 116 identified dysregulated genes between the PD and the control cohorts. Blue spots represent down-regulation whereas red-yellow spots denote up-regulation patterns. The dendrograms on the upper and left sides show the hierarchical clustering tree of genes or samples. (b) The PCA clustering for the normalized gene expression samples. The two highlighted PD and control samples are incorrectly clustered to the corresponding cohort, however they had almost no impact on the analysis results when we excluded them. (c) The Log fold change (LFC) of the 24 dysregulated genes and the 3 dysregulated miRNAs, which are known to be highly associated with PD progression and pathways. Each colour refers to a different gene or miRNA. The heatmap and PCA clustering for miRNA samples are shown in Figure S1.
Enrichment of functional terms, diseases, and tissue specificity within the dysregulated miRNAs.
| Category | Term | Count | adj p.value | miRNAs |
|---|---|---|---|---|
| Function | Epithelial-mesenchymal transition | 4 | 0.0399 | hsa-let-7b,hsa-mir-370,hsa-mir-382,hsa-mir-205 |
| Function | Cell death | 6 | 0.0059 | hsa-let-7b,hsa-mir-130b,hsa-mir-129-5p,hsa-mir-205,hsa-mir-885-5p,hsa-mir-212 |
| Disease | Heart Failure | 9 | 0.0027 | hsa-let-7b,hsa-mir-520d-5p,hsa-mir-129-5p,hsa-mir-139-3p,hsa-mir-382,hsa-mir-205,hsa-mir-130b,hsa-mir-212,hsa-mir-29b |
| Disease | Leukemia Myeloid Acute | 5 | 0.0064 | hsa-let-7b,hsa-mir-323-3p,hsa-mir-382,hsa-mir-29b,hsa-mir-370 |
| Disease | Neurodegenerative Diseases | 2 | 0.0199 | hsa-let-7b,hsa-mir-139-3p |
| Disease | Parkinson’s Disease | 2 | 0.0481 | hsa-mir-433,hsa-mir-29b |
| Disease | Stomach Neoplasms | 5 | 0.0297 | hsa-mir-129-5p,hsa-mir-139-3p,hsa-mir-212,hsa-mir-433,hsa-mir-130b |
| TissueSpecific | Adrenal | 2 | 0.0145 | hsa-mir-370,hsa-mir-485-5p |
| TissueSpecific | Brain | 4 | 0.0007 | hsa-mir-323-3p,hsa-mir-383,hsa-mir-433,hsa-mir-129-5p |
Figure 3The PD gene regulatory network (PD-GRN) constructed from the dysregulated genes and miRNAs. Large nodes represent key driver genes and miRNAs. Square orange nodes denote the miRNAs, whereas the circular grey nodes represent genes. The network was visualized using the Cytoscape tool.
Figure 4Enrichment analysis of the PD-GRN genes and visualization of five network modules corresponding to PD-related GO terms. Five GO terms are often affected in PD cases. The total list of the enriched GO terms is found in Table S2. The central scatter plot shows the visualization of the top enriched generic GO terms of the PD-GRN in a two dimensional space based on the GO semantic similarities. GO term node colours indicate the p-values for the enrichment of the GO terms. These generic GO terms represent implicitly their subterms, which are not visualized in the plot. The scatter plot was generated using the web tool REVIGO[79]. All five network modules include both miRNAs and genes. The main TF CEBPB is highlighted by a cyan triangle while the miRNAs are represented by orange squares. The genes co-targeted by TFs and/or miRNAs are depicted in larger pink circles. The regulated genes, regulated by a TF or by a miRNA, are coloured in grey. The network modules were visualized using the Cytoscape tool.
Figure 5The reduced visualization of the detected motifs in the PD-GRN network. Motifs A and B and associated functional homogeneity plots depicting the cumulative distribution of GO functional semantic scores of gene pairs of co-regulated genes in the examined motif (red) versus randomly selected genes (black). The p-value was calculated using the Kolmogorov-Smirnov test. The network motifs were visualized using the Cytoscape tool.