Literature DB >> 33409574

A network-based machine-learning framework to identify both functional modules and disease genes.

Kuo Yang1,2, Kezhi Lu1,3, Yang Wu4, Jian Yu5, Baoyan Liu6, Yi Zhao4, Jianxin Chen7, Xuezhong Zhou8,9.   

Abstract

Disease gene identification is a critical step towards uncovering the molecular mechanisms of diseases and systematically investigating complex disease phenotypes. Despite considerable efforts to develop powerful computing methods, candidate gene identification remains a severe challenge owing to the connectivity of an incomplete interactome network, which hampers the discovery of true novel candidate genes. We developed a network-based machine-learning framework to identify both functional modules and disease candidate genes. In this framework, we designed a semi-supervised non-negative matrix factorization model to obtain the functional modules related to the diseases and genes. Of note, we proposed a disease gene-prioritizing method called MapGene that integrates the correlations from both functional modules and network closeness. Our framework identified a set of functional modules with highly functional homogeneity and close gene interactions. Experiments on a large-scale benchmark dataset showed that MapGene performs significantly better than the state-of-the-art algorithms. Further analysis demonstrates MapGene can effectively relieve the impact of the incompleteness of interactome networks and obtain highly reliable rankings of candidate genes. In addition, disease cases on Parkinson's disease and diabetes mellitus confirmed the generalization of MapGene for novel candidate gene identification. This work proposed, for the first time, an integrated computing framework to predict both functional modules and disease candidate genes. The methodology and results support that our framework has the potential to help discover underlying functional modules and reliable candidate genes in human disease.

Entities:  

Year:  2021        PMID: 33409574     DOI: 10.1007/s00439-020-02253-0

Source DB:  PubMed          Journal:  Hum Genet        ISSN: 0340-6717            Impact factor:   4.132


  37 in total

1.  Combining the interactome and deleterious SNP predictions to improve disease gene identification.

Authors:  M A Care; J R Bradford; C J Needham; A J Bulpitt; D R Westhead
Journal:  Hum Mutat       Date:  2009-03       Impact factor: 4.878

2.  Genomic analyses identify molecular subtypes of pancreatic cancer.

Authors:  Peter Bailey; David K Chang; Katia Nones; Amber L Johns; Ann-Marie Patch; Marie-Claude Gingras; David K Miller; Angelika N Christ; Tim J C Bruxner; Michael C Quinn; Craig Nourse; L Charles Murtaugh; Ivon Harliwong; Senel Idrisoglu; Suzanne Manning; Ehsan Nourbakhsh; Shivangi Wani; Lynn Fink; Oliver Holmes; Venessa Chin; Matthew J Anderson; Stephen Kazakoff; Conrad Leonard; Felicity Newell; Nick Waddell; Scott Wood; Qinying Xu; Peter J Wilson; Nicole Cloonan; Karin S Kassahn; Darrin Taylor; Kelly Quek; Alan Robertson; Lorena Pantano; Laura Mincarelli; Luis N Sanchez; Lisa Evers; Jianmin Wu; Mark Pinese; Mark J Cowley; Marc D Jones; Emily K Colvin; Adnan M Nagrial; Emily S Humphrey; Lorraine A Chantrill; Amanda Mawson; Jeremy Humphris; Angela Chou; Marina Pajic; Christopher J Scarlett; Andreia V Pinho; Marc Giry-Laterriere; Ilse Rooman; Jaswinder S Samra; James G Kench; Jessica A Lovell; Neil D Merrett; Christopher W Toon; Krishna Epari; Nam Q Nguyen; Andrew Barbour; Nikolajs Zeps; Kim Moran-Jones; Nigel B Jamieson; Janet S Graham; Fraser Duthie; Karin Oien; Jane Hair; Robert Grützmann; Anirban Maitra; Christine A Iacobuzio-Donahue; Christopher L Wolfgang; Richard A Morgan; Rita T Lawlor; Vincenzo Corbo; Claudio Bassi; Borislav Rusev; Paola Capelli; Roberto Salvia; Giampaolo Tortora; Debabrata Mukhopadhyay; Gloria M Petersen; Donna M Munzy; William E Fisher; Saadia A Karim; James R Eshleman; Ralph H Hruban; Christian Pilarsky; Jennifer P Morton; Owen J Sansom; Aldo Scarpa; Elizabeth A Musgrove; Ulla-Maja Hagbo Bailey; Oliver Hofmann; Robert L Sutherland; David A Wheeler; Anthony J Gill; Richard A Gibbs; John V Pearson; Nicola Waddell; Andrew V Biankin; Sean M Grimmond
Journal:  Nature       Date:  2016-02-24       Impact factor: 49.962

Review 3.  Network propagation: a universal amplifier of genetic associations.

Authors:  Lenore Cowen; Trey Ideker; Benjamin J Raphael; Roded Sharan
Journal:  Nat Rev Genet       Date:  2017-06-12       Impact factor: 53.242

4.  Reconstruction of a functional human gene network, with an application for prioritizing positional candidate genes.

Authors:  Lude Franke; Harm van Bakel; Like Fokkens; Edwin D de Jong; Michael Egmont-Petersen; Cisca Wijmenga
Journal:  Am J Hum Genet       Date:  2006-04-25       Impact factor: 11.025

5.  Variations in DNA elucidate molecular networks that cause disease.

Authors:  Yanqing Chen; Jun Zhu; Pek Yee Lum; Xia Yang; Shirly Pinto; Douglas J MacNeil; Chunsheng Zhang; John Lamb; Stephen Edwards; Solveig K Sieberts; Amy Leonardson; Lawrence W Castellini; Susanna Wang; Marie-France Champy; Bin Zhang; Valur Emilsson; Sudheer Doss; Anatole Ghazalpour; Steve Horvath; Thomas A Drake; Aldons J Lusis; Eric E Schadt
Journal:  Nature       Date:  2008-03-16       Impact factor: 49.962

6.  Graph Regularized Nonnegative Matrix Factorization for Data Representation.

Authors:  Deng Cai; Xiaofei He; Jiawei Han; Thomas S Huang
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2010-12-23       Impact factor: 6.226

7.  DADA: Degree-Aware Algorithms for Network-Based Disease Gene Prioritization.

Authors:  Sinan Erten; Gurkan Bebek; Rob M Ewing; Mehmet Koyutürk
Journal:  BioData Min       Date:  2011-06-24       Impact factor: 2.522

8.  Semantic Disease Gene Embeddings (SmuDGE): phenotype-based disease gene prioritization without phenotypes.

Authors:  Mona Alshahrani; Robert Hoehndorf
Journal:  Bioinformatics       Date:  2018-09-01       Impact factor: 6.937

9.  Comparative Analysis of Normalization Methods for Network Propagation.

Authors:  Hadas Biran; Martin Kupiec; Roded Sharan
Journal:  Front Genet       Date:  2019-01-22       Impact factor: 4.599

10.  MEME SUITE: tools for motif discovery and searching.

Authors:  Timothy L Bailey; Mikael Boden; Fabian A Buske; Martin Frith; Charles E Grant; Luca Clementi; Jingyuan Ren; Wilfred W Li; William S Noble
Journal:  Nucleic Acids Res       Date:  2009-05-20       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.