| Literature DB >> 31003499 |
Farhana R Pinu1, David J Beale2, Amy M Paten3, Konstantinos Kouremenos4,5, Sanjay Swarup6, Horst J Schirra7, David Wishart8,9.
Abstract
The use of multiple omics techniques (i.e., genomics, transcriptomics, proteomics, and metabolomics) is becoming increasingly popular in all facets of life science. Omics techniques provide a more holistic molecular perspective of studied biological systems compared to traditional approaches. However, due to their inherent data differences, integrating multiple omics platforms remains an ongoing challenge for many researchers. As metabolites represent the downstream products of multiple interactions between genes, transcripts, and proteins, metabolomics, the tools and approaches routinely used in this field could assist with the integration of these complex multi-omics data sets. The question is, how? Here we provide some answers (in terms of methods, software tools and databases) along with a variety of recommendations and a list of continuing challenges as identified during a peer session on multi-omics integration that was held at the recent 'Australian and New Zealand Metabolomics Conference' (ANZMET 2018) in Auckland, New Zealand (Sept. 2018). We envisage that this document will serve as a guide to metabolomics researchers and other members of the community wishing to perform multi-omics studies. We also believe that these ideas may allow the full promise of integrated multi-omics research and, ultimately, of systems biology to be realized.Entities:
Keywords: data analysis; data integration; databases; experimental design; mathematical modeling; metabolic networks; pathway analysis; quantitative omics; translational metabolomics
Year: 2019 PMID: 31003499 PMCID: PMC6523452 DOI: 10.3390/metabo9040076
Source DB: PubMed Journal: Metabolites ISSN: 2218-1989
Figure 1A conceptual model for designing a systems biology experiment.
Potential limitations while designing multi-omics studies and possible strategies to overcome them.
| Potential Limitations | Strategies to Overcome Limitation |
|---|---|
| Limited biomass of sample small sample or limited accessibility to sample (e.g., single cell, skin or saliva swab) [ |
Pooling Specific methods for small samples (e.g., methods for single cell omics) |
| Heterogeneity of cell type/ composition (e.g., microbiome community, whole organism, tissue or single cell). Proportions of multiple cell types in a sample can change substantially and shift omics profile [ |
Replication Adequate sample size (n) Homogenization Reference samples/data |
| Differences in specific biomolecules in sample types (e.g., urine may have many metabolites but very few proteins, DNA and RNA in comparison to blood, stool or tissue samples). |
Choose appropriate sample for omics analysis or appropriate omics analyses for sample based on hypothesis. |
| Technical artifacts, including batch effects. |
Reference and quality control (QC) samples Internal standards Randomization Appropriate statistical models (e.g., mixed effects) to account for batch effects [ |
| Multiple testing loss of statistical power |
Replication Adequate sample size |
| Background contamination (e.g., in a microbiome study stool samples will have host DNA, RNA, protein, and metabolites) |
Specific methods depending on omics analysis and contamination (e.g., rRNA or host depletion for RNA extraction for meta-transcriptomics) Background control/reference samples |
| Differences in analytical platforms and integrating data sets of multi-omics that measure fundamentally different biomolecules |
Standard control samples used across all omics data sets may help to harmonize measurements and variations |
Figure 2Top-down and bottom-up data reduction integration approaches used in system biology.
Figure 3Principal differences between post-analysis data integration (left) and integrated data analysis (right) for handling multi-omics data sets.
Summary of multi-omics integration software tools and web applications (adapted from Fondi and Liò [90] and Beale et al. [112]).
| Software Tool | Omics Integrated | Domain | Functionality | Type of license | Reference |
|---|---|---|---|---|---|
|
|
Transcriptomics Proteomics Metabolomics | Medical (human) |
Correlation network analysis Co-expression analysis Phenotype generation KEGG/HumanCyc pathway enrichment GO enrichment Name to ID conversion | Open | [ |
|
|
Transcriptomics Proteomics Metabolomics | Biofilm |
Experiment library Data depository | Open | [ |
|
|
Genomics Proteomics Metabolomics | Unspecified |
Online encyclopedia of metabolism Predicted metabolic pathways in sequenced genomes Enzyme data set Metabolite database | Open | [ |
|
|
Genomic Transcriptomics Proteomics | Unspecified |
Draw biological pathway models and simulations Run biological cellular simulations and graphical display results | Licensed | [ |
|
Transcriptomics Proteomics Metabolomics | Unspecified |
Open source language for biological cellular models | Open | [ | |
|
|
Transcriptomics Proteomics Metabolomics Fluxomics | Unspecified |
Genome scale integrated modeling of cell metabolism and macro-molecular expression | Open | [ |
|
Transcriptomics Proteomics Metabolomics Fluxomics | Unspecified |
Multi-Omic Data Miner and OmicsAnalyzer were designed as an accessible and handy Cytoscape plugin that facilitates omics analysis Compile all biologically-relevant information regarding the model system through web link associationMap the network components with multi-omics data Model omics data | Open | [ | |
|
|
Transcriptomics Proteomics Metabolomics | Unspecified (Cells) |
Modeling, simulation, and analysis of complex, heterogeneous and multi-scale cellular systems | Open | [ |
|
|
Genomics Proteomics Metabolomics | Unspecified |
Web application for visualizing data on biological pathways. Rapidly design new pathway maps based on user data and genome-scale models Visualize data related to genes or proteins on the associated reactions and pathways Identify trends in common genomic data types | Open (MIT license) | [ |
|
| Variety of omics platform bioinformatics solutions | Unspecified |
Inoperability of the following tools: Bioinformatics resource manager Cytoscape DataMatrixViewer KEGG Genome Browser MeV PIPE BioTapestry N-Browse | Open | [ |
|
Transcriptomics Metabolomics | Unspecified |
Establishes metabolite use requirements with metabolomics data Model-paired transcriptomics data to find experimentally supported solutions Calculates the turnover (production/consumption) flux of metabolites | Open; Phython based and requires COBRApy 0.2.x. | [ | |
|
Transcriptomics Metabolomics | Medical and Clinical |
Meta and integrative analysis of data Pathway analysis | Open | [ | |
|
Transcriptomics Proteomics Metabolomics | Medical and clinical |
Enrichment analysis Pathway analysis | Academic only | [ | |
|
|
Metagenomics Transcriptomics Proteomics Metabolomics | Medical (human) and clinical. |
Metabolic pathway analysis Network visualization Data integration Upstream regulator analysis Mechanistic networks Causal network analysis Downstream effects analysis | Commercial | [ |
|
Proteomics Metabolomics | Unspecified |
Integrates proteomics and metabolomics data to predict flux distributions | Open | [ | |
|
|
Transcriptomics Metabolomics | Plants |
Integrates transcriptomics and metabolomics data to map pathways | Open | [ |
|
Metagenomics Transcriptomics Metabolomics | Plants, Medical and Clinical |
Integrates omics data Statistical analysis and pathway mapping | Open | [ | |
|
|
Metagenomics Transcriptomics Metabolomics | Plants (developed for use with |
Compare data across these two species KEGG classification Classification into KOG clusters Mapping expression responses | Open | [ |
|
Metagenomics Transcriptomics Metabolomics | Unspecified |
Toolbox for interactive ranking, filtering, combination, clustering, visualization, and functional analysis of data sets containing intensity-based profile vectors | Academic only | [ | |
|
|
Transcriptomics Metabolomics | Unspecified |
Integration of data Generation of colored pathway maps KEGG data analysis | Open | [ |
|
|
Genomics Transcriptomics Proteomics Metabolomics | Plants, Microbial, Microbiome, Medical and Clinical |
Data processing and statistical analysis Pathway analysis Multi-omics integration | Open | [ |
|
|
Metabolomics | Unspecified |
Database for storing metabolomics experiments and derived information Database for cross-species and cross omics technique | Open | [ |
|
|
Transcriptomics Metabolomics | Medical and Clinical |
Integrates data from KEGG and EHMN databases | Open | [ |
|
Metagenomics Transcriptomics Proteomics Metabolomics | Unspecified |
Integration of data Chemometric analysis (similarity/difference) | Open | [ | |
|
|
Metagenomics Transcriptomics Proteomics Metabolomics | Unspecified |
Integration of data Chemometric analysis (similarity/difference) R-package with an open-source implementation of two-way orthogonal PLS | Open | [ |
|
Transcriptomics Proteomics Metabolomics Fluxomics | Unspecified |
Integration and visualization of omics data | Open | [ | |
|
|
Transcriptomics Proteomics Metabolomics Fluxomics | Unspecified |
Integration and visualization of omics data | Annual license fee | [ |
|
|
Transcriptomics Metabolomics | 100 top species of different biological kingdoms |
Integration and visualization of transcriptomics and metabolomics data | Open | [ |
|
|
Transcriptomics Proteomics Metabolomics | Unspecified |
Integration of omics data Visualize omics data based on common data nodes and interactions in the pathway | Open (Apache) | [ |
|
|
Transcriptomics Metabolomics | Medical and Clinical |
Interactive visualizations of metabolite concentrations together with transcript measurements mapped on the pathways and GenomeMaps | Open | [ |
|
|
Genomics Transcriptomics Proteomics Metabolomics | Unspecified |
Multi-omics data visualization Metabolic map of known biological processes and pathways | Open | [ |
|
|
Genomics Proteomics Metabolomics | Human |
Computation resource and comprehensive human metabolic network model | Open | [ |
|
|
Genomics Transcriptomics Proteomics Metabolomics | Unspecified |
Graphical modeling tool | Open | [ |
|
|
Metagenomics Transcriptomics Proteomics Metabolomics | Unspecified |
Integration of data Chemometric analysis (similarity/difference) | Commercial | [ |
|
Metagenomics Transcriptomics Proteomics Metabolomics | Unspecified |
Comparison of multiple omics data sets Visualization of metabolic maps Correlation networks analysis | Open | [ | |
|
|
Metagenomics Transcriptomics Proteomics Metabolomics | Grapes |
Integration of data Visualization of connectivity | Open | [ |
|
|
Proteins Metabolomics | Unspecified |
Mathematica package designed to aid biological modeling | Open | [ |
List of databases that aid multi-omics data integration process.
| Database | Omics | Domain | Functionality | Type of license | Reference |
|---|---|---|---|---|---|
|
|
Metabolomics | Unspecified |
Metabolomics database and ontology | Open | [ |
|
|
Metabolomics | Microbial |
Annotated metabolomics and metabolite pathway database | Open | [ |
|
|
Genomics Transcriptomics |
|
Genes and RNA-seq data of different | License | [ |
|
|
Proteomics | Numerous (over 100,000 organisms) |
Proteomics database Open access, annotated collection of all publically available nucleotide sequences and their protein transitions. | Open | [ |
|
|
Metabolomics | Human |
Human metabolite and pathway database | Open | [ |
|
|
Genomes Transcriptomics Proteomics Metabolomics | PlantsAnimalsMicrobes |
Collection of databases dealing with genomes, biological pathways, diseases, drugs, and chemical substances. | Open and licensed. | [ |
|
|
Lipidomics | Unspecified |
Lipids database, ontology, and resource for standards and protocols relating to lipidomics | Open | [ |
|
|
Genomics Proteomics Metabolomics | Plants |
Plant-specific database containing pathways, enzymes, reactions, and compounds | Open | [ |
|
|
Proteomics | Unspecified |
Proteomics database | Open | [ |
|
|
Transcriptomics Metabolomics | Medical and Clinical |
Interactive visualizations of metabolite concentrations together with transcript measurements mapped on the pathways and GenomeMaps | Open | [ |
|
|
Genomics Transcriptomics Proteomics Metabolomics | Unspecified |
Database for molecular details of signal transduction, transport, DNA replication, metabolism, and other cellular processes | Open | [ |
|
|
Genomics | Microbe (yeast) |
Database for genome sequence of | Open | [ |
|
|
Proteomics | Unspecified |
Proteomics database | Open | [ |
|
|
Genomics | Helminths |
Database that contains information on biology, genetics, and genomics of | Open | [ |
|
|
Metabolomics | Microbe (yeast) |
Metabolite database | Open | [ |