Kahn Rhrissorrakrai1, Kristin C Gunsalus. 1. Center for Genomics and Systems Biology, Department of Biology, New York University, New York, NY 10003, USA.
Abstract
BACKGROUND: Graphical models of network associations are useful for both visualizing and integrating multiple types of association data. Identifying modules, or groups of functionally related gene products, is an important challenge in analyzing biological networks. However, existing tools to identify modules are insufficient when applied to dense networks of experimentally derived interaction data. To address this problem, we have developed an agglomerative clustering method that is able to identify highly modular sets of gene products within highly interconnected molecular interaction networks. RESULTS: MINE outperforms MCODE, CFinder, NEMO, SPICi, and MCL in identifying non-exclusive, high modularity clusters when applied to the C. elegans protein-protein interaction network. The algorithm generally achieves superior geometric accuracy and modularity for annotated functional categories. In comparison with the most closely related algorithm, MCODE, the top clusters identified by MINE are consistently of higher density and MINE is less likely to designate overlapping modules as a single unit. MINE offers a high level of granularity with a small number of adjustable parameters, enabling users to fine-tune cluster results for input networks with differing topological properties. CONCLUSIONS: MINE was created in response to the challenge of discovering high quality modules of gene products within highly interconnected biological networks. The algorithm allows a high degree of flexibility and user-customisation of results with few adjustable parameters. MINE outperforms several popular clustering algorithms in identifying modules with high modularity and obtains good overall recall and precision of functional annotations in protein-protein interaction networks from both S. cerevisiae and C. elegans.
BACKGROUND: Graphical models of network associations are useful for both visualizing and integrating multiple types of association data. Identifying modules, or groups of functionally related gene products, is an important challenge in analyzing biological networks. However, existing tools to identify modules are insufficient when applied to dense networks of experimentally derived interaction data. To address this problem, we have developed an agglomerative clustering method that is able to identify highly modular sets of gene products within highly interconnected molecular interaction networks. RESULTS: MINE outperforms MCODE, CFinder, NEMO, SPICi, and MCL in identifying non-exclusive, high modularity clusters when applied to the C. elegans protein-protein interaction network. The algorithm generally achieves superior geometric accuracy and modularity for annotated functional categories. In comparison with the most closely related algorithm, MCODE, the top clusters identified by MINE are consistently of higher density and MINE is less likely to designate overlapping modules as a single unit. MINE offers a high level of granularity with a small number of adjustable parameters, enabling users to fine-tune cluster results for input networks with differing topological properties. CONCLUSIONS: MINE was created in response to the challenge of discovering high quality modules of gene products within highly interconnected biological networks. The algorithm allows a high degree of flexibility and user-customisation of results with few adjustable parameters. MINE outperforms several popular clustering algorithms in identifying modules with high modularity and obtains good overall recall and precision of functional annotations in protein-protein interaction networks from both S. cerevisiae and C. elegans.
Authors: Jing-Dong J Han; Nicolas Bertin; Tong Hao; Debra S Goldberg; Gabriel F Berriz; Lan V Zhang; Denis Dupuy; Albertha J M Walhout; Michael E Cusick; Frederick P Roth; Marc Vidal Journal: Nature Date: 2004-06-09 Impact factor: 49.962
Authors: H W Mewes; D Frishman; C Gruber; B Geier; D Haase; A Kaps; K Lemcke; G Mannhaupt; F Pfeiffer; C Schüller; S Stocker; B Weil Journal: Nucleic Acids Res Date: 2000-01-01 Impact factor: 16.971
Authors: Nicolas Simonis; Jean-François Rual; Anne-Ruxandra Carvunis; Murat Tasan; Irma Lemmens; Tomoko Hirozane-Kishikawa; Tong Hao; Julie M Sahalie; Kavitha Venkatesan; Fana Gebreab; Sebiha Cevik; Niels Klitgord; Changyu Fan; Pascal Braun; Ning Li; Nono Ayivi-Guedehoussou; Elizabeth Dann; Nicolas Bertin; David Szeto; Amélie Dricot; Muhammed A Yildirim; Chenwei Lin; Anne-Sophie de Smet; Huey-Ling Kao; Christophe Simon; Alex Smolyar; Jin Sook Ahn; Muneesh Tewari; Mike Boxem; Stuart Milstein; Haiyuan Yu; Matija Dreze; Jean Vandenhaute; Kristin C Gunsalus; Michael E Cusick; David E Hill; Jan Tavernier; Frederick P Roth; Marc Vidal Journal: Nat Methods Date: 2009-01 Impact factor: 28.547
Authors: Desalegn W Etalo; Iris J E Stulemeijer; H Peter van Esse; Ric C H de Vos; Harro J Bouwmeester; Matthieu H A J Joosten Journal: Plant Physiol Date: 2013-05-29 Impact factor: 8.340