Sach Mukherjee1, Steven M Hill. 1. Department of Statistics, University of Warwick, Coventry, UK. s.n.mukherjee@warwick.ac.uk
Abstract
MOTIVATION: Networks and pathways are important in describing the collective biological function of molecular players such as genes or proteins. In many areas of biology, for example in cancer studies, available data may harbour undiscovered subtypes which differ in terms of network phenotype. That is, samples may be heterogeneous with respect to underlying molecular networks. This motivates a need for unsupervised methods capable of discovering such subtypes and elucidating the corresponding network structures. RESULTS: We exploit recent results in sparse graphical model learning to put forward a 'network clustering' approach in which data are partitioned into subsets that show evidence of underlying, subset-level network structure. This allows us to simultaneously learn subset-specific networks and corresponding subset membership under challenging small-sample conditions. We illustrate this approach on synthetic and proteomic data. AVAILABILITY: go.warwick.ac.uk/sachmukherjee/networkclustering.
MOTIVATION: Networks and pathways are important in describing the collective biological function of molecular players such as genes or proteins. In many areas of biology, for example in cancer studies, available data may harbour undiscovered subtypes which differ in terms of network phenotype. That is, samples may be heterogeneous with respect to underlying molecular networks. This motivates a need for unsupervised methods capable of discovering such subtypes and elucidating the corresponding network structures. RESULTS: We exploit recent results in sparse graphical model learning to put forward a 'network clustering' approach in which data are partitioned into subsets that show evidence of underlying, subset-level network structure. This allows us to simultaneously learn subset-specific networks and corresponding subset membership under challenging small-sample conditions. We illustrate this approach on synthetic and proteomic data. AVAILABILITY: go.warwick.ac.uk/sachmukherjee/networkclustering.
Authors: Uma T Shankavaram; William C Reinhold; Satoshi Nishizuka; Sylvia Major; Daisaku Morita; Krishna K Chary; Mark A Reimers; Uwe Scherf; Ari Kahn; Douglas Dolginow; Jeffrey Cossman; Eric P Kaldjian; Dominic A Scudiero; Emanuel Petricoin; Lance Liotta; Jae K Lee; John N Weinstein Journal: Mol Cancer Ther Date: 2007-03-05 Impact factor: 6.261
Authors: C M Perou; T Sørlie; M B Eisen; M van de Rijn; S S Jeffrey; C A Rees; J R Pollack; D T Ross; H Johnsen; L A Akslen; O Fluge; A Pergamenschikov; C Williams; S X Zhu; P E Lønning; A L Børresen-Dale; P O Brown; D Botstein Journal: Nature Date: 2000-08-17 Impact factor: 49.962
Authors: Dragana M Pavlovic; Petra E Vértes; Edward T Bullmore; William R Schafer; Thomas E Nichols Journal: PLoS One Date: 2014-07-02 Impact factor: 3.240
Authors: Nicolas Städler; Frank Dondelinger; Steven M Hill; Rehan Akbani; Yiling Lu; Gordon B Mills; Sach Mukherjee Journal: Bioinformatics Date: 2017-09-15 Impact factor: 6.937
Authors: Ricardo Ramirez; Allen Michael Herrera; Joshua Ramirez; Chunjiang Qian; David W Melton; Paula K Shireman; Yu-Fang Jin Journal: BMC Bioinformatics Date: 2019-12-18 Impact factor: 3.169