Literature DB >> 35695757

MiMIR: R-shiny application to infer risk factors and endpoints from Nightingale Health's 1H-NMR Metabolomics data.

D Bizzarri1,2, M J T Reinders2,3, M Beekman1, P E Slagboom1,4, E B van den Akker1,2,3.   

Abstract

MOTIVATION: 1H-NMR metabolomics is rapidly becoming a standard resource in large epidemiological studies to acquire metabolic profiles in large numbers of samples in a relatively low-priced and standardized manner. Concomitantly, metabolomics-based models are increasingly developed that capture disease risk or clinical risk factors. These developments raise the need for user-friendly toolbox to inspect new 1H-NMR metabolomics data and project a wide array of previously established risk models.
RESULTS: We present MiMIR (Metabolomics-based Models for Imputing Risk), a graphical user interface that provides an intuitive framework for ad-hoc statistical analysis of Nightingale Health's 1H-NMR metabolomics data and allows for the projection and calibration of 24 pre-trained metabolomics-based models, without any pre-required programming knowledge. AVAILABILITY: The R-shiny package is available in CRAN or downloadable at https://github.com/DanieleBizzarri/MiMIR, together with an extensive user manual (also available as Supplementary Documents to the paper). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2022. Published by Oxford University Press.

Entities:  

Year:  2022        PMID: 35695757      PMCID: PMC9344846          DOI: 10.1093/bioinformatics/btac388

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.931


1 Introduction

Quantitative high-throughput metabolomics data is steadily gaining popularity in epidemiological research, as exemplified by consortia like COMETS (Yu ), BBMRI-nl (van den Akker Erik ) and the UK-Biobanks (Nightingale Health UK Biobank Initiative ), accumulating 1H-NMR metabolomics data in ∼46 000, ∼26 000 and ∼120 000 individuals respectively. Combinations of metabolic features on measured by the Nightingale Health platform have previously been shown to be indicative of various disease risks: COVID severity (Nightingale Health UK Biobank Initiative et al., 2021), Type2 Diabetes (Ahola-Olli ) and cardiovascular disease (Würtz ). Deelen ) even derived a mortality score using this platform, which has a higher accuracy in predicting all-cause mortality, as compared to models of conventional risk factors. Moreover, we recently demonstrated that 1H-NMR metabolomics data can be used to impute ‘at risk status’ for 19 conventional clinical variables, and that these models can act as metabolic surrogates for their original values (Bizzarri ).

2 Materials and methods

As there is a growing interest in integrating metabolomics-based scores in epidemiological studies, we developed MiMIR (Metabolomics-based Models for Imputing Risk), a toolbox enabling visual inspection, rapid data exploration and easy projection of the pre-trained metabolic models on Nightingale Health 1H-NMR metabolomics data. We currently have included models trained to disease risks (e.g. COVID severity, mortality), and conventional risk factors (e.g. age, prevalent diabetes), with the intention to update the software with scores released in the future. MiMIR is an interactive downloadable R-shiny web app that runs locally, and thus is compliant with current GDPR regulations. It guides users through simple steps to obtain 24 metabolomics-based scores suitable for downstream epidemiological analyses, using metabolite quantifications created by Nightingale Health. The analyses available in MiMIR are grouped in four main sections: Data exploration: A user can upload Nightingale Health 1H-NMR metabolomics data as well as associated phenotypic variables (optionally uploaded). With MiMIR, distributions, completeness and correlations of all uploaded variables can be explored and compared to their distributions in ∼26 000 samples from BBMRI.nl. Pre-processing: The pre-trained metabolic models in the app follow the pre-processing and transformation of the metabolomics data, as outlined in their respective publications. MiMIR runs these different procedures in parallel, allowing for a faithful derivation of each score. Generally, these steps consist of sample selection, normalization and imputation. Metabolic surrogates: MiMIR calculates metabolic surrogates that represent the probability of a sample to be at risk for 19 variables clinically used in epidemiological studies (Bizzarri ). The surrogate scores can be calibrated using Platt Calibration (Platt, 1999), which can be evaluated visually (reliability diagrams) or statistically summarized (ECE, MCE and decrease log-loss) using a holdout set of 20% of the uploaded data (Supplementary Materials S5.1.4). Area Under the ROC Curves (AUCs) are calculated for the scores if their corresponding phenotypic variable is available and compared to a Leave One Biobank Out Validation (LOBOV) performed within BBMRI-nl (Bizzarri ). Analysis: All 24 metabolomics-based scores, surrogates and risk models, can be subjected to exploratory association analyses. For instance, scores can be mutually correlated, associated with individual metabolites, or with phenotypic variables of interest, or used as potential confounders. Results can be comprehensively visualized, for instance, Kaplan Meier plots can be used to examine the ability of the metabolic scores to predict events over time.

3 Results

We illustrate the use of MiMIR with data assayed in a middle-aged subset of the Leiden Longevity Study (LLS_PAROFFs = 2307 individuals, median age = 59 y.o.) (Westendorp ). First, we evaluate the metabolomics features assayed by Nightingale Health through correlation plots and histograms of their distributions. When comparing these distributions with those from BBMRI-nl (Fig. 1A), a user can, for example, notice that serum total cholesterol (serum_c) and apoalipoprotein A1 (apoa1) show higher values in LLS_PAROFFs compared to BBMRI-nl (Fig. 1A).
Fig. 1.

Application in the Leiden Longevity Study: (A) distributions of (I) serum_c and (II) apoa1 in LLS_PAROFFs (blue) and BBMRI-nl (grey); (B) AUC of the 19 surrogate clinical variables in the uploaded dataset (blue) compared with the results of the LOBOV (red); (C) Calibration of the surrogate for high hscrp (blue = uncalibrated, red = calibrated); (D) Kaplan Meier for mortality comparing people in the higher (blue) and lower (red) tertiles of the mortality score (A color version of this figure appears in the online version of this article.)

Application in the Leiden Longevity Study: (A) distributions of (I) serum_c and (II) apoa1 in LLS_PAROFFs (blue) and BBMRI-nl (grey); (B) AUC of the 19 surrogate clinical variables in the uploaded dataset (blue) compared with the results of the LOBOV (red); (C) Calibration of the surrogate for high hscrp (blue = uncalibrated, red = calibrated); (D) Kaplan Meier for mortality comparing people in the higher (blue) and lower (red) tertiles of the mortality score (A color version of this figure appears in the online version of this article.) A valuable feature of the scores obtained with MiMIR is that they can be used to impute unmeasured clinical variables. In case a clinical variable is at least partially measured, we can evaluate the accuracy of the imputations as generated by the surrogate models. As both, serum_c and apoa1, play a major role within the ‘lipid medication’ surrogate model, one can notice that the ‘lipid medication’ surrogate in LLS_PAROFF is less accurate, as compared to what is generally observed in BBMRI-nl (Bizzarri ) (Fig. 1B). In comparison, other surrogates perform better: ‘diabetes’ (AUCLLS_PAROFF=0.85, mean ), or similarly: ‘high hscrp’ (AUCLLS_PAROFF=0.811, mean 1), in LLS_PAROFFs than in other cohorts of BBMRI-nl (Fig. 1B). Optionally, surrogate models can be calibrated in case some data on the respective clinical variables is at least partially available. A first glance to the reliability diagram of ‘high hscrp’ shows more appropriate probability distributions after applying Platt Calibration (Fig. 1C), with the calibrated values being closer to the diagonal in the reliability diagram and decreasing calibration errors (, , ) in LLS_PAROFFs. MiMIR allows to further explore the association of the metabolic features with the metabolomics-based scores or other phenotypic characteristics. For instance, 110 metabolic features are (individually) significantly associated with diabetes in LLS_PAROFFs, even after correcting for age, sex and the surrogate ‘obesity’, with glucose being the most significant (Supplementary Fig. S44). Longitudinal data can also be inspected by examining Kaplan–Meier curves. Although some metabolomics-based surrogates show association with mortality (e.g. ‘high hscrp’, Supplementary Fig. S42), one can appreciate that the mortality score is still the best score in predicting mortality (Fig. 1D).

4 Conclusion

MiMIR is a unique interactive R-shiny application that ties into the wealth of published 1H-NMR metabolomics-based models being released nowadays. Its potentiality comes from: (i) being able to inspect distributions of measurements in the context of other datasets, (ii) calculation and application of previously published metabolomics-based scores and (iii) imputation of clinically relevant variables in cohorts where this data is not readily present using metabolic surrogates. Due to its user-friendly interface, MiMIR empowers epidemiologists to inspect and analyze their own metabolic measurements and easily integrate these scores in their downstream analyses. Click here for additional data file.
  8 in total

1.  Nonagenarian siblings and their offspring display lower risk of mortality and morbidity than sporadic nonagenarians: The Leiden Longevity Study.

Authors:  Rudi G J Westendorp; Diana van Heemst; Maarten P Rozing; Marijke Frölich; Simon P Mooijaart; Gerard-Jan Blauw; Marian Beekman; Bastiaan T Heijmans; Anton J M de Craen; P Eline Slagboom
Journal:  J Am Geriatr Soc       Date:  2009-07-17       Impact factor: 5.562

2.  Metabolic Age Based on the BBMRI-NL 1H-NMR Metabolomics Repository as Biomarker of Age-related Disease.

Authors:  Erik B van den Akker; Stella Trompet; Jurriaan J H Barkey Wolf; Marian Beekman; H Eka D Suchiman; Joris Deelen; Folkert W Asselbergs; Eric Boersma; Davy Cats; Petra M Elders; J Marianne Geleijnse; M Arfan Ikram; Margreet Kloppenburg; Haillang Mei; Ingrid Meulenbelt; Simon P Mooijaart; Rob G H H Nelissen; Mihai G Netea; Brenda W J H Penninx; Mariska Slofstra; Coen D A Stehouwer; Morris A Swertz; Charlotte E Teunissen; Gisela M Terwindt; Leen M 't Hart; Arn M J M van den Maagdenberg; Pim van der Harst; Iwan C C van der Horst; Carla J H van der Kallen; Marleen M J van Greevenbroek; W Erwin van Spil; Cisca Wijmenga; Alexandra Zhernakova; Aeilko H Zwinderman; Naveed Sattar; J Wouter Jukema; Cornelia M van Duijn; Dorret I Boomsma; Marcel J T Reinders; P Eline Slagboom
Journal:  Circ Genom Precis Med       Date:  2020-08-14

3.  The Consortium of Metabolomics Studies (COMETS): Metabolomics in 47 Prospective Cohort Studies.

Authors:  Bing Yu; Krista A Zanetti; Marinella Temprosa; Demetrius Albanes; Nathan Appel; Clara Barrios Barrera; Yoav Ben-Shlomo; Eric Boerwinkle; Juan P Casas; Clary Clish; Caroline Dale; Abbas Dehghan; Andriy Derkach; A Heather Eliassen; Paul Elliott; Eoin Fahy; Christian Gieger; Marc J Gunter; Sei Harada; Tamara Harris; Deron R Herr; David Herrington; Joel N Hirschhorn; Elise Hoover; Ann W Hsing; Mattias Johansson; Rachel S Kelly; Chin Meng Khoo; Mika Kivimäki; Bruce S Kristal; Claudia Langenberg; Jessica Lasky-Su; Deborah A Lawlor; Luca A Lotta; Massimo Mangino; Loïc Le Marchand; Ewy Mathé; Charles E Matthews; Cristina Menni; Lorelei A Mucci; Rachel Murphy; Matej Oresic; Eric Orwoll; Jennifer Ose; Alexandre C Pereira; Mary C Playdon; Lucilla Poston; Jackie Price; Qibin Qi; Kathryn Rexrode; Adam Risch; Joshua Sampson; Wei Jie Seow; Howard D Sesso; Svati H Shah; Xiao-Ou Shu; Gordon C S Smith; Ulla Sovio; Victoria L Stevens; Rachael Stolzenberg-Solomon; Toru Takebayashi; Therese Tillin; Ruth Travis; Ioanna Tzoulaki; Cornelia M Ulrich; Ramachandran S Vasan; Mukesh Verma; Ying Wang; Nick J Wareham; Andrew Wong; Naji Younes; Hua Zhao; Wei Zheng; Steven C Moore
Journal:  Am J Epidemiol       Date:  2019-06-01       Impact factor: 4.897

4.  Metabolite profiling and cardiovascular event risk: a prospective study of 3 population-based cohorts.

Authors:  Peter Würtz; Aki S Havulinna; Pasi Soininen; Tuulia Tynkkynen; David Prieto-Merino; Therese Tillin; Anahita Ghorbani; Anna Artati; Qin Wang; Mika Tiainen; Antti J Kangas; Johannes Kettunen; Jari Kaikkonen; Vera Mikkilä; Antti Jula; Mika Kähönen; Terho Lehtimäki; Debbie A Lawlor; Tom R Gaunt; Alun D Hughes; Naveed Sattar; Thomas Illig; Jerzy Adamski; Thomas J Wang; Markus Perola; Samuli Ripatti; Ramachandran S Vasan; Olli T Raitakari; Robert E Gerszten; Juan-Pablo Casas; Nish Chaturvedi; Mika Ala-Korpela; Veikko Salomaa
Journal:  Circulation       Date:  2015-01-08       Impact factor: 29.690

5.  Metabolic biomarker profiling for identification of susceptibility to severe pneumonia and COVID-19 in the general population.

Authors:  Heli Julkunen; Anna Cichońska; P Eline Slagboom; Peter Würtz
Journal:  Elife       Date:  2021-05-04       Impact factor: 8.140

6.  A metabolic profile of all-cause mortality risk identified in an observational study of 44,168 individuals.

Authors:  Joris Deelen; Johannes Kettunen; Krista Fischer; Ashley van der Spek; Stella Trompet; Gabi Kastenmüller; Andy Boyd; Jonas Zierer; Erik B van den Akker; Mika Ala-Korpela; Najaf Amin; Ayse Demirkan; Mohsen Ghanbari; Diana van Heemst; M Arfan Ikram; Jan Bert van Klinken; Simon P Mooijaart; Annette Peters; Veikko Salomaa; Naveed Sattar; Tim D Spector; Henning Tiemeier; Aswin Verhoeven; Melanie Waldenberger; Peter Würtz; George Davey Smith; Andres Metspalu; Markus Perola; Cristina Menni; Johanna M Geleijnse; Fotios Drenos; Marian Beekman; J Wouter Jukema; Cornelia M van Duijn; P Eline Slagboom
Journal:  Nat Commun       Date:  2019-08-20       Impact factor: 14.919

7.  Circulating metabolites and the risk of type 2 diabetes: a prospective study of 11,896 young adults from four Finnish cohorts.

Authors:  Ari V Ahola-Olli; Linda Mustelin; Maria Kalimeri; Johannes Kettunen; Jari Jokelainen; Juha Auvinen; Katri Puukka; Aki S Havulinna; Terho Lehtimäki; Mika Kähönen; Markus Juonala; Sirkka Keinänen-Kiukaanniemi; Veikko Salomaa; Markus Perola; Marjo-Riitta Järvelin; Mika Ala-Korpela; Olli Raitakari; Peter Würtz
Journal:  Diabetologia       Date:  2019-10-04       Impact factor: 10.122

8.  1H-NMR metabolomics-based surrogates to impute common clinical risk factors and endpoints.

Authors:  D Bizzarri; M J T Reinders; M Beekman; P E Slagboom; E B van den Akker
Journal:  EBioMedicine       Date:  2021-12-20       Impact factor: 8.143

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.