| Literature DB >> 33325641 |
Kathryn J Burton-Pimentel1, Grégory Pimentel1, Maria Hughes2,3,4, Charlotte Cjr Michielsen5, Attia Fatima2,4, Nathalie Vionnet6, Lydia A Afman5, Helen M Roche2,3,4,7, Lorraine Brennan8, Mark Ibberson9,10, Guy Vergères1.
Abstract
SCOPE: Combining different "omics" data types in a single, integrated analysis may better characterize the effects of diet on human health. METHODS ANDEntities:
Keywords: Data Integration Analysis for Biomarker discovery using Latent variable approaches for “Omics”; Similarity Network Fusion tool; classification; data integration; nutritional intervention
Year: 2021 PMID: 33325641 PMCID: PMC8221028 DOI: 10.1002/mnfr.202000647
Source DB: PubMed Journal: Mol Nutr Food Res ISSN: 1613-4125 Impact factor: 5.914
Figure 1Overview of study designs used for data integration. A) Study 1 used a randomized, controlled, crossover study design to test the postprandial and short‐term effects of acidified milk and probiotic yogurt.[ , On dairy test days (D1 and D2) blood samples were collected fasting and postprandially for transcriptome and untargeted metabolome profiling (n = 7 subjects). Dietary restriction applied 3 days before the postprandial tests and diet was controlled throughout the study. B) Study 2 evaluated the response of obese (n = 5) and non‐obese (n = 8) participants to a standard metabolic challenge comprised of a lipid overload that was completed after an overnight fast. The postprandial response to the challenge was evaluated by blood sampling over 5 h for lipid profiling and transcriptome analysis.[ ] C) Study 3 used a parallel, controlled study design to evaluate the effect of different dietary patterns on the fasting metabolic profile and transcriptome. After a 2 week run‐in phase of SFA diet, participants were randomly assigned to test for 8 weeks either a MED diet (n = 17), SFA diet (n = 16), or MUFA diet (n = 14).[ ] D1/D2, dairy test; MED, Mediterranean; SFA, saturated fatty acid.
Figure 2Visualization of models constructed with SNF for study 1. Affinity matrices for net iAUC show metabolite, gene, and SNF integrated models (respectively, A–C); SNF network showing the top 30% connections between samples using Cytoscape with edge weighted spring embedded layout in panel D. Samples are labeled by subject (S) number and intervention type milk (M, n = 7) or yogurt (Y, n = 7), with colors to indicate test meal: M (blue), Y (red). Diagonal of heatmaps shows median similarity across all samples. Network connections are colored according to whether the connection was identified in the top 30% connections for networks created with the metabolome (turquoise), transcriptome (purple), in both metabolome and transcriptome separate networks (black) or only with the datasets combined (SNF model) (yellow). iAUC, net incremental area under curve; SNF, similarity network fusion.
Classification error rates (CER) for SNF models (separate and integrated models) and for PLS‐DA and DIABLO models presented for all studies.CER is validated by M‐fold cross‐validation tests (respectively, 7‐, 5‐, and 10‐fold for studies 1, 2, and 3)
| Study 1 CER ± SEM | Study 2 CER ± SEM | Study 3 CER ± SEM | ||||
|---|---|---|---|---|---|---|
| Milk versus yogurt | Non‐obese versus obese | All diets | SFA diet versus MED diet | SFA diet versus MUFA diet | MUFA diet versus MED diet | |
| SNF analysis | ||||||
| Metabolome/lipidome model | 0.21a ± 0.004 | 0.15a ± 0.003 | 0.41a ± 0.005 | 0.13b ± 0.003 | 0.29a ± 0.006 | 0.30a ± 0.006 |
| Transcriptome model | 0.14b ± 0 | 0.03c ± 0.006 | 0.38b ± 0.004 | 0.19a ± 0.005 | 0.21b ± 0.004 | 0.33b ± 0.004 |
| SNF integrated model | 0.02c ± 0 | 0.08b ± 0.002 | 0.30c ± 0.005 | 0.08c ± 0.002 | 0.23b ± 0.005 | 0.32b ± 0.004 |
| DIABLO analysis | ||||||
| Metabolome/lipidome (PLS‐DA) | 0.14a ± 0 | 0.13a ± 0.007 | 0.27a ± 0.003 | 0.11a ± 0.004 | 0.18a ± 0.002 | 0.19a ± 0.004 |
| Transcriptome (PLS‐DA) | 0.14a ± 0 | 0b ± 0 | 0.06c ± 0.002 | 0.07b ± 0.003 | 0.01b ± 0.003 | 0.07b ± 0.005 |
| DIABLO model | 0.03b ± 0.007 | 0b ± 0 | 0.08b ± 0.003 | 0.06b ± 0.004 | 0.02b ± 0.003 | 0.08b ± 0.005 |
Different letters (a–c) indicate significant differences between CERs for comparisons between models for each study (as assessed by linear mixed‐effect models with post hoc pairwise comparisons, p adj < 0.05). #indicates a difference comparing equivalent models created by the SNF tool and DIABLO (as assessed by paired t‐test, p < 0.05).CER, classification error rate; DIABLO, data integration analysis for biomarker discovery using latent variable approaches for “omics” studies; MED, Mediterranean; PLS‐DA, partial least squares discriminant analysis; SFA, saturated fatty acid; SNF, similarity network fusion.
Figure 3Network plots for the top 5% most important genes and metabolites for differentiating blood samples taken after milk intake or yogurt intake in study 1, selected by A) SNF and B) DIABLO. Connections between nodes (metabolites, green; genes, purple) are shown for the strongest associations (ρ < −0.90 or ρ > 0.90, Spearman's correlation) (SNF, n = 199 nodes; DIABLO, n = 209 nodes). Nodes present in both networks are highlighted by a black outline and a larger size (metabolites, n = 12; genes, n = 80). DIABLO, data integration analysis for biomarker discovery using latent variable approaches for “Omics” studies; PLS‐DA, partial least squares discriminant analysis; SNF, similarity network fusion.