Cesim Erten1, Aissa Houdjedj2, Hilal Kazan3. 1. Department of Computer Engineering, Antalya Bilim University, Antalya, Turkey. 2. Electrical and Computer Engineering Graduate Program, Antalya Bilim University, Antalya, Turkey. 3. Department of Computer Engineering, Antalya Bilim University, Antalya, Turkey. hilal.kazan@antalya.edu.tr.
Abstract
BACKGROUND: Recent cancer genomic studies have generated detailed molecular data on a large number of cancer patients. A key remaining problem in cancer genomics is the identification of driver genes. RESULTS: We propose BetweenNet, a computational approach that integrates genomic data with a protein-protein interaction network to identify cancer driver genes. BetweenNet utilizes a measure based on betweenness centrality on patient specific networks to identify the so-called outlier genes that correspond to dysregulated genes for each patient. Setting up the relationship between the mutated genes and the outliers through a bipartite graph, it employs a random-walk process on the graph, which provides the final prioritization of the mutated genes. We compare BetweenNet against state-of-the art cancer gene prioritization methods on lung, breast, and pan-cancer datasets. CONCLUSIONS: Our evaluations show that BetweenNet is better at recovering known cancer genes based on multiple reference databases. Additionally, we show that the GO terms and the reference pathways enriched in BetweenNet ranked genes and those that are enriched in known cancer genes overlap significantly when compared to the overlaps achieved by the rankings of the alternative methods.
BACKGROUND: Recent cancer genomic studies have generated detailed molecular data on a large number of cancerpatients. A key remaining problem in cancer genomics is the identification of driver genes. RESULTS: We propose BetweenNet, a computational approach that integrates genomic data with a protein-protein interaction network to identify cancer driver genes. BetweenNet utilizes a measure based on betweenness centrality on patient specific networks to identify the so-called outlier genes that correspond to dysregulated genes for each patient. Setting up the relationship between the mutated genes and the outliers through a bipartite graph, it employs a random-walk process on the graph, which provides the final prioritization of the mutated genes. We compare BetweenNet against state-of-the art cancer gene prioritization methods on lung, breast, and pan-cancer datasets. CONCLUSIONS: Our evaluations show that BetweenNet is better at recovering known cancer genes based on multiple reference databases. Additionally, we show that the GO terms and the reference pathways enriched in BetweenNet ranked genes and those that are enriched in known cancer genes overlap significantly when compared to the overlaps achieved by the rankings of the alternative methods.
Authors: M Ashburner; C A Ball; J A Blake; D Botstein; H Butler; J M Cherry; A P Davis; K Dolinski; S S Dwight; J T Eppig; M A Harris; D P Hill; L Issel-Tarver; A Kasarskis; S Lewis; J C Matese; J E Richardson; M Ringwald; G M Rubin; G Sherlock Journal: Nat Genet Date: 2000-05 Impact factor: 38.330
Authors: Ilir Agalliu; Marta San Luciano; Anat Mirelman; Nir Giladi; Bjorg Waro; Jan Aasly; Rivka Inzelberg; Sharon Hassin-Baer; Eitan Friedman; Javier Ruiz-Martinez; Jose Felix Marti-Masso; Avi Orr-Urtreger; Susan Bressman; Rachel Saunders-Pullman Journal: JAMA Neurol Date: 2015-01 Impact factor: 18.302
Authors: Rachel Saunders-Pullman; Matthew J Barrett; Kaili M Stanley; Marta San Luciano; Vicki Shanker; Lawrence Severt; Ann Hunt; Deborah Raymond; Laurie J Ozelius; Susan B Bressman Journal: Mov Disord Date: 2010-11-15 Impact factor: 10.338
Authors: Mark D M Leiserson; Fabio Vandin; Hsin-Ta Wu; Jason R Dobson; Jonathan V Eldridge; Jacob L Thomas; Alexandra Papoutsaki; Younhun Kim; Beifang Niu; Michael McLellan; Michael S Lawrence; Abel Gonzalez-Perez; David Tamborero; Yuwei Cheng; Gregory A Ryslik; Nuria Lopez-Bigas; Gad Getz; Li Ding; Benjamin J Raphael Journal: Nat Genet Date: 2014-12-15 Impact factor: 38.330