Literature DB >> 25350511

Identifying disease genes by integrating multiple data sources.

Bolin Chen, Jianxin Wang, Min Li, Fang-Xiang Wu.   

Abstract

BACKGROUND: Now multiple types of data are available for identifying disease genes. Those data include gene-disease associations, disease phenotype similarities, protein-protein interactions, pathways, gene expression profiles, etc.. It is believed that integrating different kinds of biological data is an effective method to identify disease genes.
RESULTS: In this paper, we propose a multiple data integration method based on the theory of Markov random field (MRF) and the method of Bayesian analysis for identifying human disease genes. The proposed method is not only flexible in easily incorporating different kinds of data, but also reliable in predicting candidate disease genes.
CONCLUSIONS: Numerical experiments are carried out by integrating known gene-disease associations, protein complexes, protein-protein interactions, pathways and gene expression profiles. Predictions are evaluated by the leave-one-out method. The proposed method achieves an AUC score of 0.743 when integrating all those biological data in our experiments.

Entities:  

Mesh:

Year:  2014        PMID: 25350511      PMCID: PMC4243092          DOI: 10.1186/1755-8794-7-S2-S2

Source DB:  PubMed          Journal:  BMC Med Genomics        ISSN: 1755-8794            Impact factor:   3.063


  35 in total

1.  KEGG: kyoto encyclopedia of genes and genomes.

Authors:  M Kanehisa; S Goto
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  Maneuvering in the complex path from genotype to phenotype.

Authors:  Richard Strohman
Journal:  Science       Date:  2002-04-26       Impact factor: 47.728

3.  Guilt by association.

Authors:  D Altshuler; M Daly; L Kruglyak
Journal:  Nat Genet       Date:  2000-10       Impact factor: 38.330

4.  An integrated probabilistic model for functional prediction of proteins.

Authors:  Minghua Deng; Ting Chen; Fengzhu Sun
Journal:  J Comput Biol       Date:  2004       Impact factor: 1.479

5.  Towards a proteome-scale map of the human protein-protein interaction network.

Authors:  Jean-François Rual; Kavitha Venkatesan; Tong Hao; Tomoko Hirozane-Kishikawa; Amélie Dricot; Ning Li; Gabriel F Berriz; Francis D Gibbons; Matija Dreze; Nono Ayivi-Guedehoussou; Niels Klitgord; Christophe Simon; Mike Boxem; Stuart Milstein; Jennifer Rosenberg; Debra S Goldberg; Lan V Zhang; Sharyl L Wong; Giovanni Franklin; Siming Li; Joanna S Albala; Janghoo Lim; Carlene Fraughton; Estelle Llamosas; Sebiha Cevik; Camille Bex; Philippe Lamesch; Robert S Sikorski; Jean Vandenhaute; Huda Y Zoghbi; Alex Smolyar; Stephanie Bosak; Reynaldo Sequerra; Lynn Doucette-Stamm; Michael E Cusick; David E Hill; Frederick P Roth; Marc Vidal
Journal:  Nature       Date:  2005-09-28       Impact factor: 49.962

6.  A human protein-protein interaction network: a resource for annotating the proteome.

Authors:  Ulrich Stelzl; Uwe Worm; Maciej Lalowski; Christian Haenig; Felix H Brembeck; Heike Goehler; Martin Stroedicke; Martina Zenkner; Anke Schoenherr; Susanne Koeppen; Jan Timm; Sascha Mintzlaff; Claudia Abraham; Nicole Bock; Silvia Kietzmann; Astrid Goedde; Engin Toksöz; Anja Droege; Sylvia Krobitsch; Bernhard Korn; Walter Birchmeier; Hans Lehrach; Erich E Wanker
Journal:  Cell       Date:  2005-09-23       Impact factor: 41.582

7.  Predicting disease genes using protein-protein interactions.

Authors:  M Oti; B Snel; M A Huynen; H G Brunner
Journal:  J Med Genet       Date:  2006-04-12       Impact factor: 6.318

8.  A gene atlas of the mouse and human protein-encoding transcriptomes.

Authors:  Andrew I Su; Tim Wiltshire; Serge Batalov; Hilmar Lapp; Keith A Ching; David Block; Jie Zhang; Richard Soden; Mimi Hayakawa; Gabriel Kreiman; Michael P Cooke; John R Walker; John B Hogenesch
Journal:  Proc Natl Acad Sci U S A       Date:  2004-04-09       Impact factor: 11.205

9.  BioGRID: a general repository for interaction datasets.

Authors:  Chris Stark; Bobby-Joe Breitkreutz; Teresa Reguly; Lorrie Boucher; Ashton Breitkreutz; Mike Tyers
Journal:  Nucleic Acids Res       Date:  2006-01-01       Impact factor: 16.971

10.  IntAct--open source resource for molecular interaction data.

Authors:  S Kerrien; Y Alam-Faruque; B Aranda; I Bancarz; A Bridge; C Derow; E Dimmer; M Feuermann; A Friedrichsen; R Huntley; C Kohler; J Khadake; C Leroy; A Liban; C Lieftink; L Montecchi-Palazzi; S Orchard; J Risse; K Robbe; B Roechert; D Thorneycroft; Y Zhang; R Apweiler; H Hermjakob
Journal:  Nucleic Acids Res       Date:  2006-12-01       Impact factor: 16.971

View more
  3 in total

1.  A fast and high performance multiple data integration algorithm for identifying human disease genes.

Authors:  Bolin Chen; Min Li; Jianxin Wang; Xuequn Shang; Fang-Xiang Wu
Journal:  BMC Med Genomics       Date:  2015-09-23       Impact factor: 3.063

2.  Ensemble disease gene prediction by clinical sample-based networks.

Authors:  Ping Luo; Li-Ping Tian; Bolin Chen; Qianghua Xiao; Fang-Xiang Wu
Journal:  BMC Bioinformatics       Date:  2020-03-11       Impact factor: 3.169

3.  pBRIT: gene prioritization by correlating functional and phenotypic annotations through integrative data fusion.

Authors:  Ajay Anand Kumar; Lut Van Laer; Maaike Alaerts; Amin Ardeshirdavani; Yves Moreau; Kris Laukens; Bart Loeys; Geert Vandeweyer
Journal:  Bioinformatics       Date:  2018-07-01       Impact factor: 6.937

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.