| Literature DB >> 31924279 |
Ilan Shomorony1,2, Elizabeth T Cirulli1, Lei Huang1, Lori A Napier1, Robyn R Heister1, Michael Hicks1, Isaac V Cohen1, Hung-Chun Yu1, Christine Leon Swisher1, Natalie M Schenker-Ahmed1, Weizhong Li1,3, Karen E Nelson1,3, Pamila Brar1,3, Andrew M Kahn1,4, Timothy D Spector5, C Thomas Caskey1,6, J Craig Venter1,3, David S Karow1, Ewen F Kirkness1,3, Naisha Shah7,8.
Abstract
BACKGROUND: Modern medicine is rapidly moving towards a data-driven paradigm based on comprehensive multimodal health assessments. Integrated analysis of data from different modalities has the potential of uncovering novel biomarkers and disease signatures.Entities:
Keywords: Cardiometabolic syndrome; Metabolomics; Multimodal; Network analysis; Preventative medicine; Unsupervised machine learning
Mesh:
Substances:
Year: 2020 PMID: 31924279 PMCID: PMC6953286 DOI: 10.1186/s13073-019-0705-z
Source DB: PubMed Journal: Genome Med ISSN: 1756-994X Impact factor: 11.117
Fig. 1a In the study, we collected multimodal data (n = 1385 features) from 1253 individuals. b We analyzed the data by performing cross-modality associations between features after correcting for age, sex, and ancestry. c Using the associations, we performed community detection analysis and found modules of densely connected features. d To reduce the number of indirect associations and identify key biomarker features, we performed conditional independence network analysis (also referred to as a Markov network). e Using the identified key biomarkers, we clustered individuals into distinct groups with similar signatures that are consistent with different health statuses. We characterize the clusters and perform disease risk enrichment analysis
Fig. 2The number of significant cross-modality correlations for each pair of modalities is shown (a). The percentages shown are the proportion of correlations that were significant out of all possible pairwise associations between the modality pair. b Associations between p-cresol sulfate metabolite and (top) abundance of Intestinimonas genus, and (bottom) an abundance of unclassified genus in Erysipelotrichaceae family
Fig. 3The cardiometabolic module. a We built a Markov network to identify the key biomarker features that represent the cardiometabolic module. This network highlights the most important associations after removing edges corresponding to indirect associations. We observed that the microbiome genera Butyrivibrio and Pseudoflavonifractor are the most relevant microbiome genera in the context of this module that interface with features from other modalities. b We clustered individuals using the key biomarkers. The heatmap shows z-statistics from logistic regression for an association between each cluster and each feature. The plot on the left shows the 22 key cardiometabolic biomarkers. The plot on the right shows associations that emerged from an analysis against the full set of 1385 features with p < 1 × 10−10 as well as 3-hydroxybutyrate (BHBA) and Apolipoprotein B because of their particular enrichment in clusters 3 and 6, respectively. Some correlated features have been collapsed, with the mean z-statistics displayed; the full set of features can be found in Additional file 1: Figure S1. All of these significant associations showed consistent directions of effect in the TwinsUK cohort (Additional file 2: Table S3); however, the microbiome features and 5 of the glycerophosphocholines were not measured in the TwinsUK cohort and thus could not be assessed for replication. Met, metabolome
Fig. 4Disease enrichment and longitudinal outcomes of cardiometabolic clusters. a Bar plots showing the prevalence of disease at baseline (combined discovery and TwinsUK baseline cohorts; Additional file 1: Figure S2 shows them individually) and the incidence of disease (i.e., only the new cases of disease) after a median of 5.6 years of follow-up (TwinsUK cohort). For Fisher’s exact test comparison of the rate in each cluster vs. the other clusters, *p < 0.05, **p < 0.005. b The rates at which individuals from each cluster transition into other clusters after a median of 5.6 years of follow-up. The plot shows individuals per cluster (1 to 7) at baseline visit that transition to other clusters during the follow-up. TIA, transient ischemic attack
Fig. 5The microbiome richness module. a We built a Markov network to identify the key biomarker features that represent the microbiome richness module. Most of the associations between the microbiome and the metabolome were mediated by species richness. b We clustered individuals using the key biomarkers. The heatmap shows z-statistics from logistic regression for an association between each cluster and each feature. The plot on the left shows the 24 key biomarkers representing the module. Met, metabolome