Vikram Saraph1, Tijana Milenković2. 1. Department of Computer Science and Engineering, University of Notre Dame, IN 46556, Department of Computer Science, Brown University, RI 02912, ECK Institute for Global Health, University of Notre Dame, IN 46556 and Interdisciplinary Center for Network Science and Applications (iCeNSA), University of Notre Dame, IN 46556, USA Department of Computer Science and Engineering, University of Notre Dame, IN 46556, Department of Computer Science, Brown University, RI 02912, ECK Institute for Global Health, University of Notre Dame, IN 46556 and Interdisciplinary Center for Network Science and Applications (iCeNSA), University of Notre Dame, IN 46556, USA. 2. Department of Computer Science and Engineering, University of Notre Dame, IN 46556, Department of Computer Science, Brown University, RI 02912, ECK Institute for Global Health, University of Notre Dame, IN 46556 and Interdisciplinary Center for Network Science and Applications (iCeNSA), University of Notre Dame, IN 46556, USA Department of Computer Science and Engineering, University of Notre Dame, IN 46556, Department of Computer Science, Brown University, RI 02912, ECK Institute for Global Health, University of Notre Dame, IN 46556 and Interdisciplinary Center for Network Science and Applications (iCeNSA), University of Notre Dame, IN 46556, USA Department of Computer Science and Engineering, University of Notre Dame, IN 46556, Department of Computer Science, Brown University, RI 02912, ECK Institute for Global Health, University of Notre Dame, IN 46556 and Interdisciplinary Center for Network Science and Applications (iCeNSA), University of Notre Dame, IN 46556, USA.
Abstract
MOTIVATION: Biological network alignment aims to identify similar regions between networks of different species. Existing methods compute node similarities to rapidly identify from possible alignments the high-scoring alignments with respect to the overall node similarity. But, the accuracy of the alignments is then evaluated with some other measure that is different than the node similarity used to construct the alignments. Typically, one measures the amount of conserved edges. Thus, the existing methods align similar nodes between networks hoping to conserve many edges (after the alignment is constructed!). RESULTS: Instead, we introduce MAGNA to directly 'optimize' edge conservation while the alignment is constructed, without decreasing the quality of node mapping. MAGNA uses a genetic algorithm and our novel function for 'crossover' of two 'parent' alignments into a superior 'child' alignment to simulate a 'population' of alignments that 'evolves' over time; the 'fittest' alignments survive and proceed to the next 'generation', until the alignment accuracy cannot be optimized further. While we optimize our new and superior measure of the amount of conserved edges, MAGNA can optimize any alignment accuracy measure, including a combined measure of both node and edge conservation. In systematic evaluations against state-of-the-art methods (IsoRank, MI-GRAAL and GHOST), on both synthetic networks and real-world biological data, MAGNA outperforms all of the existing methods, in terms of both node and edge conservation as well as both topological and biological alignment accuracy. AVAILABILITY: Software: http://nd.edu/∼cone/MAGNA CONTACT: : tmilenko@nd.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
MOTIVATION: Biological network alignment aims to identify similar regions between networks of different species. Existing methods compute node similarities to rapidly identify from possible alignments the high-scoring alignments with respect to the overall node similarity. But, the accuracy of the alignments is then evaluated with some other measure that is different than the node similarity used to construct the alignments. Typically, one measures the amount of conserved edges. Thus, the existing methods align similar nodes between networks hoping to conserve many edges (after the alignment is constructed!). RESULTS: Instead, we introduce MAGNA to directly 'optimize' edge conservation while the alignment is constructed, without decreasing the quality of node mapping. MAGNA uses a genetic algorithm and our novel function for 'crossover' of two 'parent' alignments into a superior 'child' alignment to simulate a 'population' of alignments that 'evolves' over time; the 'fittest' alignments survive and proceed to the next 'generation', until the alignment accuracy cannot be optimized further. While we optimize our new and superior measure of the amount of conserved edges, MAGNA can optimize any alignment accuracy measure, including a combined measure of both node and edge conservation. In systematic evaluations against state-of-the-art methods (IsoRank, MI-GRAAL and GHOST), on both synthetic networks and real-world biological data, MAGNA outperforms all of the existing methods, in terms of both node and edge conservation as well as both topological and biological alignment accuracy. AVAILABILITY: Software: http://nd.edu/∼cone/MAGNA CONTACT: : tmilenko@nd.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.