| Literature DB >> 24432028 |
David L Gibbs1, Lisa Gralinski2, Ralph S Baric2, Shannon K McWeeney3.
Abstract
To better understand dynamic disease processes, integrated multi-omic methods are needed, yet comparing different types of omic data remains difficult. Integrative solutions benefit experimenters by eliminating potential biases that come with single omic analysis. We have developed the methods needed to explore whether a relationship exists between co-expression network models built from transcriptomic and proteomic data types, and whether this relationship can be used to improve the disease signature discovery process. A naïve, correlation based method is utilized for comparison. Using publicly available infectious disease time series data, we analyzed the related co-expression structure of the transcriptome and proteome in response to SARS-CoV infection in mice. Transcript and peptide expression data was filtered using quality scores and subset by taking the intersection on mapped Entrez IDs. Using this data set, independent co-expression networks were built. The networks were integrated by constructing a bipartite module graph based on module member overlap, module summary correlation, and correlation to phenotypes of interest. Compared to the module level results, the naïve approach is hindered by a lack of correlation across data types, less significant enrichment results, and little functional overlap across data types. Our module graph approach avoids these problems, resulting in an integrated omic signature of disease progression, which allows prioritization across data types for down-stream experiment planning. Integrated modules exhibited related functional enrichments and could suggest novel interactions in response to infection. These disease and platform-independent methods can be used to realize the full potential of multi-omic network signatures. The data (experiment SM001) are publically available through the NIAID Systems Virology (https://www.systemsvirology.org) and PNNL (http://omics.pnl.gov) web portals. Phenotype data is found in the supplementary information. The ProCoNA package is available as part of Bioconductor 2.13.Entities:
Keywords: SARS; biomarkers; data integration; networks; omics; proteomics; transcriptomics; virology
Year: 2014 PMID: 24432028 PMCID: PMC3882664 DOI: 10.3389/fgene.2013.00309
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Figure 1Prioritization of integrated data leading to a multi-omic integrated co-expression signature for SARS-CoV infection. Given mice infected with a mouse adapted SARS-CoV virus, transcript (gene microarray) and peptide (LC-MS) data are collected from lung tissue. The data is used to construct independent co-expression networks for each data type. Using three metrics, transcript and peptide modules are compared and joined, producing a bipartite module graph. In the module graph, two kinds of edges are shown. Solid edges indicate a significant correlation between module eigenvectors. Dashed edges indicate significant overlap in terms of module membership. Node sizes correspond to the size of the module and node outlines show the direction of correlation compared to the degree of lung pathology (the overall-pathology-score).
Figure 2Module graph prioritization by examination of the relationship of module-pairs to all phenotypic variables. Clear patterns show transcript module 1 and peptide module 4 with the bulk of maximum negative correlations and transcript module 3 and peptide module 2 with the bulk of maximum positive correlations with the phenotype. This matrix clearly provides ranking on sub-graphs.
Figure 3Overlapping eigenvector summaries show similar response patterns observed across transcript and peptide modules. The blue lines show collapsed transcript module eigenvectors plotted over days post infection. Red dotted lines show the collapsed peptide module eigenvectors. The top row shows the module sub-network of transcript module 3 and peptide module 2. The middle row shows transcript module 4 and peptide modules 12 and 13. The bottom row shows transcript module 1 with peptide modules 4 and 8. There is clear evidence of a shared response between transcript and peptide modules, demonstrating a multi-omic signature. See the supplementary tables for lists of the drivers and enriched functional categories.
Figure 4An increase in the number of entities mapped to GO terms using the union of module members within a module sub-graph. Enrichment was performed using only the module members with correlations to the module eigenvector greater than 0.8, these GO terms were all in the top 5 most significant after GO term enrichment within each module, and, after taking the union, they were still in the top 5 most significant. Adjusted p-values for all modules can be found in Supplementary Table 1.