Literature DB >> 33577591

From heterogeneous healthcare data to disease-specific biomarker networks: A hierarchical Bayesian network approach.

Ann-Kristin Becker1, Marcus Dörr2,3, Stephan B Felix2,3, Fabian Frost4, Hans J Grabe5, Markus M Lerch4, Matthias Nauck6, Uwe Völker7, Henry Völzke8, Lars Kaderali1.   

Abstract

In this work, we introduce an entirely data-driven and automated approach to reveal disease-associated biomarker and risk factor networks from heterogeneous and high-dimensional healthcare data. Our workflow is based on Bayesian networks, which are a popular tool for analyzing the interplay of biomarkers. Usually, data require extensive manual preprocessing and dimension reduction to allow for effective learning of Bayesian networks. For heterogeneous data, this preprocessing is hard to automatize and typically requires domain-specific prior knowledge. We here combine Bayesian network learning with hierarchical variable clustering in order to detect groups of similar features and learn interactions between them entirely automated. We present an optimization algorithm for the adaptive refinement of such group Bayesian networks to account for a specific target variable, like a disease. The combination of Bayesian networks, clustering, and refinement yields low-dimensional but disease-specific interaction networks. These networks provide easily interpretable, yet accurate models of biomarker interdependencies. We test our method extensively on simulated data, as well as on data from the Study of Health in Pomerania (SHIP-TREND), and demonstrate its effectiveness using non-alcoholic fatty liver disease and hypertension as examples. We show that the group network models outperform available biomarker scores, while at the same time, they provide an easily interpretable interaction network.

Entities:  

Mesh:

Substances:

Year:  2021        PMID: 33577591      PMCID: PMC7906470          DOI: 10.1371/journal.pcbi.1008735

Source DB:  PubMed          Journal:  PLoS Comput Biol        ISSN: 1553-734X            Impact factor:   4.475


  27 in total

1.  Several remarks on the metric space of genetic codes.

Authors:  David Weisman; Dan A Simovici
Journal:  Int J Data Min Bioinform       Date:  2012       Impact factor: 0.667

2.  A new, accurate predictive model for incident hypertension.

Authors:  Henry Völzke; Glenn Fung; Till Ittermann; Shipeng Yu; Sebastian E Baumeister; Marcus Dörr; Wolfgang Lieb; Uwe Völker; Allan Linneberg; Torben Jørgensen; Stephan B Felix; Rainer Rettig; Bharat Rao; Heyo K Kroemer
Journal:  J Hypertens       Date:  2013-11       Impact factor: 4.844

3.  Modelling access to renal transplantation waiting list in a French healthcare network using a Bayesian method.

Authors:  Sahar Bayat; Marc Cuggia; Michel Kessler; Serge Briançon; Pierre Le Beux; Luc Frimat
Journal:  Stud Health Technol Inform       Date:  2008

Review 4.  Nonobese Fatty Liver Disease.

Authors:  Donghee Kim; W Ray Kim
Journal:  Clin Gastroenterol Hepatol       Date:  2016-08-28       Impact factor: 11.382

Review 5.  Genetics, ancestry, and hypertension: implications for targeted antihypertensive therapies.

Authors:  Nora Franceschini; Daniel I Chasman; Rhonda M Cooper-DeHoff; Donna K Arnett
Journal:  Curr Hypertens Rep       Date:  2014-08       Impact factor: 5.369

6.  A hierarchical Bayesian network approach for linkage disequilibrium modeling and data-dimensionality reduction prior to genome-wide association studies.

Authors:  Raphaël Mourad; Christine Sinoquet; Philippe Leray
Journal:  BMC Bioinformatics       Date:  2011-01-12       Impact factor: 3.169

7.  Laboratory parameter-based machine learning model for excluding non-alcoholic fatty liver disease (NAFLD) in the general population.

Authors:  T C-F Yip; A J Ma; V W-S Wong; Y-K Tse; H L-Y Chan; P-C Yuen; G L-H Wong
Journal:  Aliment Pharmacol Ther       Date:  2017-06-06       Impact factor: 8.171

Review 8.  The multiple-hit pathogenesis of non-alcoholic fatty liver disease (NAFLD).

Authors:  Elena Buzzetti; Massimo Pinzani; Emmanuel A Tsochatzis
Journal:  Metabolism       Date:  2016-01-04       Impact factor: 8.694

9.  Validating module network learning algorithms using simulated data.

Authors:  Tom Michoel; Steven Maere; Eric Bonnet; Anagha Joshi; Yvan Saeys; Tim Van den Bulcke; Koenraad Van Leemput; Piet van Remortel; Martin Kuiper; Kathleen Marchal; Yves Van de Peer
Journal:  BMC Bioinformatics       Date:  2007-05-03       Impact factor: 3.169

10.  Area under precision-recall curves for weighted and unweighted data.

Authors:  Jens Keilwagen; Ivo Grosse; Jan Grau
Journal:  PLoS One       Date:  2014-03-20       Impact factor: 3.240

View more
  3 in total

1.  Synthetic data generation with probabilistic Bayesian Networks.

Authors:  Grigoriy Gogoshin; Sergio Branciamore; Andrei S Rodin
Journal:  Math Biosci Eng       Date:  2021-10-09       Impact factor: 2.080

2.  Analysis of epidemiological association patterns of serum thyrotropin by combining random forests and Bayesian networks.

Authors:  Ann-Kristin Becker; Till Ittermann; Markus Dörr; Stephan B Felix; Matthias Nauck; Alexander Teumer; Uwe Völker; Henry Völzke; Lars Kaderali; Neetika Nath
Journal:  PLoS One       Date:  2022-07-21       Impact factor: 3.752

3.  Diagnosis of temporomandibular disorders using artificial intelligence technologies: A systematic review and meta-analysis.

Authors:  Nayansi Jha; Kwang-Sig Lee; Yoon-Ji Kim
Journal:  PLoS One       Date:  2022-08-18       Impact factor: 3.752

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.