| Literature DB >> 31823712 |
Abstract
BACKGROUND: Studies on multiple modalities of omics data such as transcriptomics, genomics and proteomics are growing in popularity, since they allow us to investigate complex mechanisms across molecular layers. It is widely recognized that integrative omics analysis holds the promise to unlock novel and actionable biological insights into health and disease. Integration of multi-omics data remains challenging, however, and requires combination of several software tools and extensive technical expertise to account for the properties of heterogeneous data.Entities:
Keywords: Data analysis; Data integration; Multi-omics; Transparency
Mesh:
Year: 2019 PMID: 31823712 PMCID: PMC6902525 DOI: 10.1186/s12859-019-3224-4
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Package design. The miodin package provides a software infrastructure for data analysis implemented as a set of S4 classes. The base API contains standard generics for object manipulation and the user API provides convenience functions to facilitate package usage
Fig. 2Data analysis in miodin. The user starts by defining a project, a study and a workflow. The study contains the design of the experiment and the workflow is defined by instantiating analysis modules, which generate datasets and analysis results upon execution. The user can then inspect and export the data and results
Study design helper functions
| Function | Description |
|---|---|
| studySamplingPoints | Set the sampling points (e.g. time points) |
| studyFactor | Define a factor (experimental variable) |
| studyGroup | Define a sample group based on existing factors |
| studyContrast | Define a contrast (sample group comparison) |
| studySampleTable | Add a table with sample annotation data |
| studyAssayTable | Add a table with assay annotation data |
Common study design functions
| Function | Description |
|---|---|
| studyDesignCaseControl | Single factor dividing samples into two groups |
| studyDesignMultipleGroups | Single factor dividing samples into multiple groups |
| studyDesignRepeatedMeasures | Single factor and multiple sampling points |
| studyDesignTwoFactors | Two factors and multiple sampling points |
Workflow modules
| Function | Description |
|---|---|
| downloadRepositoryData | Downloads data from an online repository |
| importMicroarrayData | Imports raw microarray data from Affymetrix and Illumina arrays |
| importProcessedData | Imports processed RNA, SNP, methylation and protein data |
| processMicroarrayData | Pre-processes microarray data |
| processSequencingData | Pre-processes sequencing data |
| processMassSpecData | Pre-processes mass spectrometry data |
| integrateAssays | Integrates several datasets into one |
| performFactorAnalysis | Performs factor analysis by matrix factorization |
| performHypothesisTest | Performs hypothesis testing |
| performLinearModeling | Performs generalized linear modeling with snpStats |
| performOmicsModeling | Performs modeling with limma, DMRcate or edgeR depending on the input object |
Supported experimental techniques and data types
| RNA | SNP | Methylation | Protein | |
|---|---|---|---|---|
| Microarray | Raw and processed | Raw and processed | Raw and processed | |
| Sequencing | Processed | |||
| Mass spectrometry | Processed |
Package dependencies
| Package | Description |
|---|---|
| AffyCompatible | Annotation of Affymetrix microarrays |
| ArrayExpress | Access to the ArrayExpress online repository |
| crlmm | Genotyping of microarray SNP data |
| DESeq2 | Processing of RNA-seq data |
| DMRcate | Statistical analysis of methylation data |
| edgeR | Statistical analysis of RNA-seq data |
| ff | Store large in-memory datasets on disk |
| limma | Statistical analysis of microarray RNA data |
| minfi | Import and normalization of microarray methylation data |
| mixOmics | Methods for integrative analysis of multi-omics data |
| MOFA | Integrative analysis by multi-omics factor analysis |
| MSnbase | Import of proteomics data |
| MultiDataSet | Data integration of multi-omics data |
| oligo | Import and normalization of microarray RNA data |
| RefFreeEWAS | Correction for cell type composition in methylation data |
| SNPRelate | Processing SNP data |
| snpStats | Statistical analysis of SNP data |
| SummarizedExperiment | Import of RNA-seq data |
| wateRmelon | Normalization of microarray methylation data |
Fig. 3Venn diagram of the number of genes identified as differentially expressed in the Wei2012, Hou2010 and Zhang 2012 datasets
Fig. 4Assessment of the fitted MOFA model. a shows the total amount of variance explained by the model in each omics modality (view) and variance explained per factor. b shows a sample ordination plot based on latent factor (LF) 1 and 2
Fig. 5Interpretation of MOFA factor 1. a through c show sample heatmaps with the top features in the factor for RNA-seq gene, miRNA and 450 k methylation data, respectively. d through f reveal the loadings of the top features corresponding to the heatmaps in a through c