Literature DB >> 31990289

Heterogeneous networks integration for disease-gene prioritization with node kernels.

Van Dinh Tran1, Alessandro Sperduti2, Rolf Backofen1,3, Fabrizio Costa4.   

Abstract

MOTIVATION: The identification of disease-gene associations is a task of fundamental importance in human health research. A typical approach consists in first encoding large gene/protein relational datasets as networks due to the natural and intuitive property of graphs for representing objects' relationships and then utilizing graph-based techniques to prioritize genes for successive low-throughput validation assays. Since different types of interactions between genes yield distinct gene networks, there is the need to integrate different heterogeneous sources to improve the reliability of prioritization systems.
RESULTS: We propose an approach based on three phases: first, we merge all sources in a single network, then we partition the integrated network according to edge density introducing a notion of edge type to distinguish the parts and finally, we employ a novel node kernel suitable for graphs with typed edges. We show how the node kernel can generate a large number of discriminative features that can be efficiently processed by linear regularized machine learning classifiers. We report state-of-the-art results on 12 disease-gene associations and on a time-stamped benchmark containing 42 newly discovered associations.
AVAILABILITY AND IMPLEMENTATION: Source code: https://github.com/dinhinfotech/DiGI.git. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2020. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Year:  2020        PMID: 31990289     DOI: 10.1093/bioinformatics/btaa008

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  3 in total

1.  Hypergraph models of biological networks to identify genes critical to pathogenic viral response.

Authors:  Song Feng; Emily Heath; Brett Jefferson; Cliff Joslyn; Henry Kvinge; Hugh D Mitchell; Brenda Praggastis; Amie J Eisfeld; Amy C Sims; Larissa B Thackray; Shufang Fan; Kevin B Walters; Peter J Halfmann; Danielle Westhoff-Smith; Qing Tan; Vineet D Menachery; Timothy P Sheahan; Adam S Cockrell; Jacob F Kocher; Kelly G Stratton; Natalie C Heller; Lisa M Bramer; Michael S Diamond; Ralph S Baric; Katrina M Waters; Yoshihiro Kawaoka; Jason E McDermott; Emilie Purvine
Journal:  BMC Bioinformatics       Date:  2021-05-29       Impact factor: 3.169

2.  Graph Embedding Based Novel Gene Discovery Associated With Diabetes Mellitus.

Authors:  Jianzong Du; Dongdong Lin; Ruan Yuan; Xiaopei Chen; Xiaoli Liu; Jing Yan
Journal:  Front Genet       Date:  2021-11-25       Impact factor: 4.599

3.  RNMFMDA: A Microbe-Disease Association Identification Method Based on Reliable Negative Sample Selection and Logistic Matrix Factorization With Neighborhood Regularization.

Authors:  Lihong Peng; Ling Shen; Longjie Liao; Guangyi Liu; Liqian Zhou
Journal:  Front Microbiol       Date:  2020-10-27       Impact factor: 5.640

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.