| Literature DB >> 28065937 |
Jasmin Straube1,2, Bevan Emma Huang3, Kim-Anh Lê Cao2.
Abstract
Dynamic changes in biological systems can be captured by measuring molecular expression from different levels (e.g., genes and proteins) across time. Integration of such data aims to identify molecules that show similar expression changes over time; such molecules may be co-regulated and thus involved in similar biological processes. Combining data sources presents a systematic approach to study molecular behaviour. It can compensate for missing data in one source, and can reduce false positives when multiple sources highlight the same pathways. However, integrative approaches must accommodate the challenges inherent in 'omics' data, including high-dimensionality, noise, and timing differences in expression. As current methods for identification of co-expression cannot cope with this level of complexity, we developed a novel algorithm called DynOmics. DynOmics is based on the fast Fourier transform, from which the difference in expression initiation between trajectories can be estimated. This delay can then be used to realign the trajectories and identify those which show a high degree of correlation. Through extensive simulations, we demonstrate that DynOmics is efficient and accurate compared to existing approaches. We consider two case studies highlighting its application, identifying regulatory relationships across 'omics' data within an organism and for comparative gene expression analysis across organisms.Entities:
Year: 2017 PMID: 28065937 PMCID: PMC5220332 DOI: 10.1038/srep40131
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Relationship between angular differences, correlation and delay for a reference trajectory x (red dots) and a query trajectory y (green line).
The trajectories are (a) positively correlated with no delay (Δ = 0); (b) positively correlated with negative delay (0 < Δ ≤ 90); (c) negatively correlated with positive delay (90 < Δ < 180); (d) negatively correlated with no delay (Δ = 180); (e) negatively correlated with negative delay (180 < Δ < 270); (f) positively correlated with positive delay (270 ≤ Δ < 360).
Figure 2MiRNA and mRNA expression associations in Lung Organogenesis study.
Scaled LMMS modelled expression levels (y-axis) are depicted over time in 14 equally spaced time units from embryo day 12 to postnatal day 30 (x-axis) for the miRNAs mmu-miR-429, mmu-let-7g, and mmu-miR-134 (red lines). Solid lines depict actual scaled expression levels, while dashed lines depict inverted scaled expression levels to account for the negative correlation with mRNA. Modelled expression levels of the mRNAs identified as associated with each miRNA using DynOmics are displayed (DynOmics correlation <−0.9, delay < 0) (a) before and (b) after shifting the trajectories using the DynOmics estimated delay. The blue color gradient reflects the amount of delay.
Orthologous transcripts identified as associated by DynOmics.
| Delay | Mouse vs Human (%) | Bovine vs Human (%) |
|---|---|---|
| 0 | 6,582 (20) | 2,766 (10) |
| 0 | 18,065 (56) | 17,906 (67) |
| <0 | 7,682 (24) | 6,097 (23) |
| Total | 32,329 | 26,769 |
Number (percentage) of mouse and bovine transcripts identified as associated with orthologous human transcripts at an absolute correlation threshold of 0.9. The number of associations are divided according to different types of delay, indicating whether changes in expression levels of the mouse and bovine transcripts occurred prior to (delay >0), simultaneously to (delay = 0), or after (delay <0) expression changes of the orthologous human transcript.
IPA enrichment analysis of human orthologs for three types of delay relative to mouse/bovine transcripts.
| Delay compared to human | Organism | Pathway (# Transcripts identified/# Transcripts in pathway) | P value |
|---|---|---|---|
| >0 | Mouse | EIF2 Signaling (79/173) | 7.94 × 10−18 |
| mTOR Signaling (71/183) | 5.64 × 10−12 | ||
| Regulation of eIF4 and p70S6K Signaling (60/143) | 5.72 × 10−11 | ||
| Bovine | Protein Ubiquitination Pathway (54/254) | 6.28 × 10−09 | |
| Amyloid Processing (19/50) | 4.36 × 10−08 | ||
| Glucocorticoid Receptor Signaling (51/272) | 1.03 × 10−06 | ||
| 0 | Mouse | Role of Macrophages, Fibroblasts and Endothelial Cells in Rheumatoid Arthritis (196/286) | 1.84 × 10−31 |
| Role of Osteoblasts, Osteoclasts and Chondrocytes in Rheumatoid Arthritis (149/213) | 6.88 × 10−27 | ||
| Axonal Guidance Signaling (115/157) | 1.29 × 10−22 | ||
| Bovine | Protein Kinase A Signaling (199/370) | 7.11 × 10−16 | |
| Thrombin Signaling (116/187) | 1.44 × 10−15 | ||
| Acute Phase Response Signaling (106/168) | 4.51 × 10−15 | ||
| <0 | Mouse | Ephrin Receptor Signaling (76/172) | 8.39 × 10−13 |
| Molecular Mechanism of Cancer (128/359) | 5.26 × 10−12 | ||
| B Cell Receptor Signaling (71/171) | 1.54 × 10−10 | ||
| Bovine | EIF2 Signaling (91/173) | 1.75 × 10−25 | |
| Regulation of eIF4 (55/143) | 3.48 × 10−09 | ||
| Protein Ubiquitination Pathway (83/254) | 5.11 × 10−09 | ||
| >0, 0, <0 | Mouse, | EIF2 Signaling (32/173) | 1.59 × 10−17 |
| Bovine | Regulation of eIF4 (21/143) | 5.25 × 10−10 | |
| Acetyl-CoA Biosynthesis I (Pyruvate Dehydrogenase Complex) (4/6) | 8.71 × 10−06 |
The top three IPA enriched pathways are listed. Associated transcripts were analysed separately with respect to the delay: positive (negative) delay indicates that the mouse or bovine ortholog’s expression changes occurred prior to (after) the human expression changes. No delay indicates that all expression changes occurred simultaneously. P values were obtained from a right tailed Fisher’s Exact Test as implemented by IPA.
Figure 3EIF2 Signaling.
Modelled transcripts expression levels (scaled for each time point for visual purposes, y-axis) with respect to time (x-axis) involved in EIF2 Signaling (a) in human with (b) their orthologs in mouse (DynOmics correlation >0.9, delay >0). Hierarchical clustering was performed on the human transcripts to extract three main expression patterns in EIF2 Signaling (a); 1–3). The three main patterns of expression in humans (a) were visualised in separate plots (1–3). The mouse expression profiles in (b) were separated by the classification of their human orthologs (1–3) and were coloured according to the DynOmics estimates of delay.
Acetyl-CoA Biosynthesis I orthologous transcripts.
| Gene name | TranscriptID Human | TranscriptID Organism | Organism | DynOmics Delay | Pearson Correlation |
|---|---|---|---|---|---|
| DBT | 205369_x_at | BT.18489.1.A1_AT | Bovine | −2 | 0.99 |
| DBT | 205369_x_at | 1449118_AT | Mouse | −5 | 0.98 |
| DLAT | 211150_s_at | 1426264_AT | Mouse | 3 | 0.92 |
| DLAT | 211150_s_at | 1426265_X_AT | Mouse | 3 | 0.91 |
| DLD | 230426_at | BT.27889.1.S1_AT | Bovine | 4 | 0.99 |
| DLD | 230426_at | 1423159_AT | Mouse | 4 | 0.9 |
| PDHB | 208911_s_at | BT.2973.2.S1_A_AT | Bovine | −2 | 0.98 |
| PDHB | 208911_s_at | BT.2973.3.A1_AT | Bovine | 3 | 0.97 |
| PDHB | 208911_s_at | 1416090_AT | Mouse | 3 | 0.97 |
Orthologous transcripts identified as associated by DynOmics and involved in the Acetyl-CoA Biosynthesis I pathway. Gene names, transcript IDs in human, bovine and mouse are indicated, as well as the estimated DynOmics delay and the Pearson correlation between the reference trajectory and the query trajectory after shifting based on the DynOmics delay estimate.