Literature DB >> 21226914

A hierarchical Bayesian network approach for linkage disequilibrium modeling and data-dimensionality reduction prior to genome-wide association studies.

Raphaël Mourad1, Christine Sinoquet, Philippe Leray.   

Abstract

BACKGROUND: Discovering the genetic basis of common genetic diseases in the human genome represents a public health issue. However, the dimensionality of the genetic data (up to 1 million genetic markers) and its complexity make the statistical analysis a challenging task.
RESULTS: We present an accurate modeling of dependences between genetic markers, based on a forest of hierarchical latent class models which is a particular class of probabilistic graphical models. This model offers an adapted framework to deal with the fuzzy nature of linkage disequilibrium blocks. In addition, the data dimensionality can be reduced through the latent variables of the model which synthesize the information borne by genetic markers. In order to tackle the learning of both forest structure and probability distributions, a generic algorithm has been proposed. A first implementation of our algorithm has been shown to be tractable on benchmarks describing 105 variables for 2000 individuals.
CONCLUSIONS: The forest of hierarchical latent class models offers several advantages for genome-wide association studies: accurate modeling of linkage disequilibrium, flexible data dimensionality reduction and biological meaning borne by latent variables.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 21226914      PMCID: PMC3033325          DOI: 10.1186/1471-2105-12-16

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  21 in total

1.  Bayesian graphical models for genomewide association studies.

Authors:  Claudio J Verzilli; Nigel Stallard; John C Whittaker
Journal:  Am J Hum Genet       Date:  2006-05-30       Impact factor: 11.025

2.  BNTagger: improved tagging SNP selection using Bayesian networks.

Authors:  Phil Hyoun Lee; Hagit Shatkay
Journal:  Bioinformatics       Date:  2006-07-15       Impact factor: 6.937

3.  Linkage disequilibrium mapping for complex disease genes.

Authors:  Andrew DeWan; Robert J Klein; Josephine Hoh
Journal:  Methods Mol Biol       Date:  2007

4.  HaploBuild: an algorithm to construct non-contiguous associated haplotypes in family based genetic studies.

Authors:  Jason M Laramie; Jemma B Wilk; Anita L DeStefano; Richard H Myers
Journal:  Bioinformatics       Date:  2007-06-22       Impact factor: 6.937

5.  A genome-wide association study of global gene expression.

Authors:  Anna L Dixon; Liming Liang; Miriam F Moffatt; Wei Chen; Simon Heath; Kenny C C Wong; Jenny Taylor; Edward Burnett; Ivo Gut; Martin Farrall; G Mark Lathrop; Gonçalo R Abecasis; William O C Cookson
Journal:  Nat Genet       Date:  2007-09-16       Impact factor: 38.330

6.  Sparse inverse covariance estimation with the graphical lasso.

Authors:  Jerome Friedman; Trevor Hastie; Robert Tibshirani
Journal:  Biostatistics       Date:  2007-12-12       Impact factor: 5.899

7.  Efficient association study design via power-optimized tag SNP selection.

Authors:  B Han; H M Kang; M S Seo; N Zaitlen; E Eskin
Journal:  Ann Hum Genet       Date:  2008-08-13       Impact factor: 1.670

8.  A second generation human haplotype map of over 3.1 million SNPs.

Authors:  Kelly A Frazer; Dennis G Ballinger; David R Cox; David A Hinds; Laura L Stuve; Richard A Gibbs; John W Belmont; Andrew Boudreau; Paul Hardenbol; Suzanne M Leal; Shiran Pasternak; David A Wheeler; Thomas D Willis; Fuli Yu; Huanming Yang; Changqing Zeng; Yang Gao; Haoran Hu; Weitao Hu; Chaohua Li; Wei Lin; Siqi Liu; Hao Pan; Xiaoli Tang; Jian Wang; Wei Wang; Jun Yu; Bo Zhang; Qingrun Zhang; Hongbin Zhao; Hui Zhao; Jun Zhou; Stacey B Gabriel; Rachel Barry; Brendan Blumenstiel; Amy Camargo; Matthew Defelice; Maura Faggart; Mary Goyette; Supriya Gupta; Jamie Moore; Huy Nguyen; Robert C Onofrio; Melissa Parkin; Jessica Roy; Erich Stahl; Ellen Winchester; Liuda Ziaugra; David Altshuler; Yan Shen; Zhijian Yao; Wei Huang; Xun Chu; Yungang He; Li Jin; Yangfan Liu; Yayun Shen; Weiwei Sun; Haifeng Wang; Yi Wang; Ying Wang; Xiaoyan Xiong; Liang Xu; Mary M Y Waye; Stephen K W Tsui; Hong Xue; J Tze-Fei Wong; Luana M Galver; Jian-Bing Fan; Kevin Gunderson; Sarah S Murray; Arnold R Oliphant; Mark S Chee; Alexandre Montpetit; Fanny Chagnon; Vincent Ferretti; Martin Leboeuf; Jean-François Olivier; Michael S Phillips; Stéphanie Roumy; Clémentine Sallée; Andrei Verner; Thomas J Hudson; Pui-Yan Kwok; Dongmei Cai; Daniel C Koboldt; Raymond D Miller; Ludmila Pawlikowska; Patricia Taillon-Miller; Ming Xiao; Lap-Chee Tsui; William Mak; You Qiang Song; Paul K H Tam; Yusuke Nakamura; Takahisa Kawaguchi; Takuya Kitamoto; Takashi Morizono; Atsushi Nagashima; Yozo Ohnishi; Akihiro Sekine; Toshihiro Tanaka; Tatsuhiko Tsunoda; Panos Deloukas; Christine P Bird; Marcos Delgado; Emmanouil T Dermitzakis; Rhian Gwilliam; Sarah Hunt; Jonathan Morrison; Don Powell; Barbara E Stranger; Pamela Whittaker; David R Bentley; Mark J Daly; Paul I W de Bakker; Jeff Barrett; Yves R Chretien; Julian Maller; Steve McCarroll; Nick Patterson; Itsik Pe'er; Alkes Price; Shaun Purcell; Daniel J Richter; Pardis Sabeti; Richa Saxena; Stephen F Schaffner; Pak C Sham; Patrick Varilly; David Altshuler; Lincoln D Stein; Lalitha Krishnan; Albert Vernon Smith; Marcela K Tello-Ruiz; Gudmundur A Thorisson; Aravinda Chakravarti; Peter E Chen; David J Cutler; Carl S Kashuk; Shin Lin; Gonçalo R Abecasis; Weihua Guan; Yun Li; Heather M Munro; Zhaohui Steve Qin; Daryl J Thomas; Gilean McVean; Adam Auton; Leonardo Bottolo; Niall Cardin; Susana Eyheramendy; Colin Freeman; Jonathan Marchini; Simon Myers; Chris Spencer; Matthew Stephens; Peter Donnelly; Lon R Cardon; Geraldine Clarke; David M Evans; Andrew P Morris; Bruce S Weir; Tatsuhiko Tsunoda; James C Mullikin; Stephen T Sherry; Michael Feolo; Andrew Skol; Houcan Zhang; Changqing Zeng; Hui Zhao; Ichiro Matsuda; Yoshimitsu Fukushima; Darryl R Macer; Eiko Suda; Charles N Rotimi; Clement A Adebamowo; Ike Ajayi; Toyin Aniagwu; Patricia A Marshall; Chibuzor Nkwodimmah; Charmaine D M Royal; Mark F Leppert; Missy Dixon; Andy Peiffer; Renzong Qiu; Alastair Kent; Kazuto Kato; Norio Niikawa; Isaac F Adewole; Bartha M Knoppers; Morris W Foster; Ellen Wright Clayton; Jessica Watkin; Richard A Gibbs; John W Belmont; Donna Muzny; Lynne Nazareth; Erica Sodergren; George M Weinstock; David A Wheeler; Imtaz Yakub; Stacey B Gabriel; Robert C Onofrio; Daniel J Richter; Liuda Ziaugra; Bruce W Birren; Mark J Daly; David Altshuler; Richard K Wilson; Lucinda L Fulton; Jane Rogers; John Burton; Nigel P Carter; Christopher M Clee; Mark Griffiths; Matthew C Jones; Kirsten McLay; Robert W Plumb; Mark T Ross; Sarah K Sims; David L Willey; Zhu Chen; Hua Han; Le Kang; Martin Godbout; John C Wallenburg; Paul L'Archevêque; Guy Bellemare; Koji Saeki; Hongguang Wang; Daochang An; Hongbo Fu; Qing Li; Zhen Wang; Renwu Wang; Arthur L Holden; Lisa D Brooks; Jean E McEwen; Mark S Guyer; Vivian Ota Wang; Jane L Peterson; Michael Shi; Jack Spiegel; Lawrence M Sung; Lynn F Zacharia; Francis S Collins; Karen Kennedy; Ruth Jamieson; John Stewart
Journal:  Nature       Date:  2007-10-18       Impact factor: 49.962

9.  Simultaneous analysis of all SNPs in genome-wide and re-sequencing association studies.

Authors:  Clive J Hoggart; John C Whittaker; Maria De Iorio; David J Balding
Journal:  PLoS Genet       Date:  2008-07-25       Impact factor: 5.917

10.  Haplotype block partitioning as a tool for dimensionality reduction in SNP association studies.

Authors:  Cristian Pattaro; Ingo Ruczinski; Danièle M Fallin; Giovanni Parmigiani
Journal:  BMC Genomics       Date:  2008-08-29       Impact factor: 3.969

View more
  8 in total

1.  Multiple quantitative trait analysis using bayesian networks.

Authors:  Marco Scutari; Phil Howell; David J Balding; Ian Mackay
Journal:  Genetics       Date:  2014-09       Impact factor: 4.562

Review 2.  Computational approaches for inferring 3D conformations of chromatin from chromosome conformation capture data.

Authors:  Dario Meluzzi; Gaurav Arya
Journal:  Methods       Date:  2019-08-27       Impact factor: 3.608

3.  Visualization of pairwise and multilocus linkage disequilibrium structure using latent forests.

Authors:  Raphaël Mourad; Christine Sinoquet; Christian Dina; Philippe Leray
Journal:  PLoS One       Date:  2011-12-13       Impact factor: 3.240

Review 4.  Machine Learning and Radiogenomics: Lessons Learned and Future Directions.

Authors:  John Kang; Tiziana Rancati; Sangkyu Lee; Jung Hun Oh; Sarah L Kerns; Jacob G Scott; Russell Schwartz; Seyoung Kim; Barry S Rosenstein
Journal:  Front Oncol       Date:  2018-06-21       Impact factor: 6.244

5.  From heterogeneous healthcare data to disease-specific biomarker networks: A hierarchical Bayesian network approach.

Authors:  Ann-Kristin Becker; Marcus Dörr; Stephan B Felix; Fabian Frost; Hans J Grabe; Markus M Lerch; Matthias Nauck; Uwe Völker; Henry Völzke; Lars Kaderali
Journal:  PLoS Comput Biol       Date:  2021-02-12       Impact factor: 4.475

6.  High-sensitivity pattern discovery in large, paired multiomic datasets.

Authors:  Andrew R Ghazi; Kathleen Sucipto; Ali Rahnavard; Eric A Franzosa; Lauren J McIver; Jason Lloyd-Price; Emma Schwager; George Weingart; Yo Sup Moon; Xochitl C Morgan; Levi Waldron; Curtis Huttenhower
Journal:  Bioinformatics       Date:  2022-06-24       Impact factor: 6.931

7.  A prostate cancer model build by a novel SVM-ID3 hybrid feature selection method using both genotyping and phenotype data from dbGaP.

Authors:  Sait Can Yücebaş; Yeşim Aydın Son
Journal:  PLoS One       Date:  2014-03-20       Impact factor: 3.240

8.  A method combining a random forest-based technique with the modeling of linkage disequilibrium through latent variables, to run multilocus genome-wide association studies.

Authors:  Christine Sinoquet
Journal:  BMC Bioinformatics       Date:  2018-03-27       Impact factor: 3.169

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.