Literature DB >> 26093035

From genome-scale data to models of infectious disease: A Bayesian network-based strategy to drive model development.

Weiwei Yin1, Jessica C Kissinger2, Alberto Moreno3, Mary R Galinski4, Mark P Styczynski5.   

Abstract

High-throughput, genome-scale data present a unique opportunity to link host to pathogen on a molecular level. Forging such connections will help drive the development of mathematical models to better understand and predict both pathogen behavior and the epidemiology of infectious diseases, including malaria. However, the datasets that can aid in identifying these links and models are vast and not amenable to simple, reductionist, and univariate analyses. These datasets require data mining in order to identify the truly important measurements that best describe clinical and molecular observations. Moreover, these datasets typically have relatively few samples due to experimental limitations (particularly for human studies or in vivo animal experiments), making data mining extremely difficult. Here, after first providing a brief overview of common strategies for data reduction and identification of relationships between variables for inclusion in mathematical models, we present a new generalized strategy for performing these data reduction and relationship inference tasks. Our approach emphasizes the importance of robustness when using data to drive model development, particularly when using genome-scale, small-sample in vivo data. We identify the use of appropriate feature reduction combined with data permutations and subsampling strategies as being critical to enable increasingly robust results from network inference using high-dimensional, low-observation data.
Copyright © 2015. Published by Elsevier Inc.

Entities:  

Keywords:  Bayesian network inference; Infectious diseases; Large-scale data analysis; Malaria; Model development

Mesh:

Year:  2015        PMID: 26093035      PMCID: PMC4679518          DOI: 10.1016/j.mbs.2015.06.006

Source DB:  PubMed          Journal:  Math Biosci        ISSN: 0025-5564            Impact factor:   2.144


  35 in total

Review 1.  Exploring expression data: identification and analysis of coexpressed genes.

Authors:  L J Heyer; S Kruglyak; S Yooseph
Journal:  Genome Res       Date:  1999-11       Impact factor: 9.043

2.  Validating clustering for gene expression data.

Authors:  K Y Yeung; D R Haynor; W L Ruzzo
Journal:  Bioinformatics       Date:  2001-04       Impact factor: 6.937

3.  A support vector machine-recursive feature elimination feature selection method based on artificial contrast variables and mutual information.

Authors:  Xiaohui Lin; Fufang Yang; Lina Zhou; Peiyuan Yin; Hongwei Kong; Wenbin Xing; Xin Lu; Lewen Jia; Quancai Wang; Guowang Xu
Journal:  J Chromatogr B Analyt Technol Biomed Life Sci       Date:  2012-05-24       Impact factor: 3.205

4.  Inference of a genetic network by a combined approach of cluster analysis and graphical Gaussian modeling.

Authors:  Hiroyuki Toh; Katsuhisa Horimoto
Journal:  Bioinformatics       Date:  2002-02       Impact factor: 6.937

5.  Some strains of Plasmodium falciparum, a human malaria parasite, evade the complement-like system of Anopheles gambiae mosquitoes.

Authors:  Alvaro Molina-Cruz; Randall J DeJong; Corrie Ortega; Ashley Haile; Ekua Abban; Janneth Rodrigues; Giovanna Jaramillo-Gutierrez; Carolina Barillas-Mury
Journal:  Proc Natl Acad Sci U S A       Date:  2012-05-23       Impact factor: 11.205

6.  A molecular marker of artemisinin-resistant Plasmodium falciparum malaria.

Authors:  Frédéric Ariey; Benoit Witkowski; Chanaki Amaratunga; Johann Beghain; Anne-Claire Langlois; Nimol Khim; Saorin Kim; Valentine Duru; Christiane Bouchier; Laurence Ma; Pharath Lim; Rithea Leang; Socheat Duong; Sokunthea Sreng; Seila Suon; Char Meng Chuor; Denis Mey Bout; Sandie Ménard; William O Rogers; Blaise Genton; Thierry Fandeur; Olivo Miotto; Pascal Ringwald; Jacques Le Bras; Antoine Berry; Jean-Christophe Barale; Rick M Fairhurst; Françoise Benoit-Vical; Odile Mercereau-Puijalon; Didier Ménard
Journal:  Nature       Date:  2013-12-18       Impact factor: 49.962

7.  Plasmodium vivax malaria.

Authors:  Dhanpat K Kochar; Vishal Saxena; Narvachan Singh; Sanjay K Kochar; S Vijay Kumar; Ashis Das
Journal:  Emerg Infect Dis       Date:  2005-01       Impact factor: 6.883

8.  Systems analysis of multiple regulator perturbations allows discovery of virulence factors in Salmonella.

Authors:  Hyunjin Yoon; Charles Ansong; Jason E McDermott; Marina Gritsenko; Richard D Smith; Fred Heffron; Joshua N Adkins
Journal:  BMC Syst Biol       Date:  2011-06-28

9.  From correlation to causation networks: a simple approximate learning algorithm and its application to high-dimensional plant gene expression data.

Authors:  Rainer Opgen-Rhein; Korbinian Strimmer
Journal:  BMC Syst Biol       Date:  2007-08-06

10.  Multiple populations of artemisinin-resistant Plasmodium falciparum in Cambodia.

Authors:  Olivo Miotto; Jacob Almagro-Garcia; Magnus Manske; Bronwyn Macinnis; Susana Campino; Kirk A Rockett; Chanaki Amaratunga; Pharath Lim; Seila Suon; Sokunthea Sreng; Jennifer M Anderson; Socheat Duong; Chea Nguon; Char Meng Chuor; David Saunders; Youry Se; Chantap Lon; Mark M Fukuda; Lucas Amenga-Etego; Abraham V O Hodgson; Victor Asoala; Mallika Imwong; Shannon Takala-Harrison; François Nosten; Xin-Zhuan Su; Pascal Ringwald; Frédéric Ariey; Christiane Dolecek; Tran Tinh Hien; Maciej F Boni; Cao Quang Thai; Alfred Amambua-Ngwa; David J Conway; Abdoulaye A Djimdé; Ogobara K Doumbo; Issaka Zongo; Jean-Bosco Ouedraogo; Daniel Alcock; Eleanor Drury; Sarah Auburn; Oliver Koch; Mandy Sanders; Christina Hubbart; Gareth Maslen; Valentin Ruano-Rubio; Dushyanth Jyothi; Alistair Miles; John O'Brien; Chris Gamble; Samuel O Oyola; Julian C Rayner; Chris I Newbold; Matthew Berriman; Chris C A Spencer; Gilean McVean; Nicholas P Day; Nicholas J White; Delia Bethell; Arjen M Dondorp; Christopher V Plowe; Rick M Fairhurst; Dominic P Kwiatkowski
Journal:  Nat Genet       Date:  2013-04-28       Impact factor: 38.330

View more
  3 in total

Review 1.  From within host dynamics to the epidemiology of infectious disease: Scientific overview and challenges.

Authors:  Juan B Gutierrez; Mary R Galinski; Stephen Cantrell; Eberhard O Voit
Journal:  Math Biosci       Date:  2015-10-16       Impact factor: 2.144

2.  New Algorithm and Software (BNOmics) for Inferring and Visualizing Bayesian Networks from Heterogeneous Big Biological and Genetic Data.

Authors:  Grigoriy Gogoshin; Eric Boerwinkle; Andrei S Rodin
Journal:  J Comput Biol       Date:  2016-09-28       Impact factor: 1.479

3.  A tree-like Bayesian structure learning algorithm for small-sample datasets from complex biological model systems.

Authors:  Weiwei Yin; Swetha Garimalla; Alberto Moreno; Mary R Galinski; Mark P Styczynski
Journal:  BMC Syst Biol       Date:  2015-08-28
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.