Literature DB >> 31004478

The EntOptLayout Cytoscape plug-in for the efficient visualization of major protein complexes in protein-protein interaction and signalling networks.

Bence Ágg1,2,3, Andrea Császár4, Máté Szalay-Bekő4,5, Dániel V Veres4,6, Réka Mizsei7, Péter Ferdinandy1,3, Péter Csermely4, István A Kovács8,9,10.   

Abstract

MOTIVATION: Network visualizations of complex biological datasets usually result in 'hairball' images, which do not discriminate network modules.
RESULTS: We present the EntOptLayout Cytoscape plug-in based on a recently developed network representation theory. The plug-in provides an efficient visualization of network modules, which represent major protein complexes in protein-protein interaction and signalling networks. Importantly, the tool gives a quality score of the network visualization by calculating the information loss between the input data and the visual representation showing a 3- to 25-fold improvement over conventional methods.
AVAILABILITY AND IMPLEMENTATION: The plug-in (running on Windows, Linux, or Mac OS) and its tutorial (both in written and video forms) can be downloaded freely under the terms of the MIT license from: http://apps.cytoscape.org/apps/entoptlayout. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2019. Published by Oxford University Press.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 31004478      PMCID: PMC6821346          DOI: 10.1093/bioinformatics/btz257

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 Introduction

Informative network layouts enable us an intuitive, direct, qualitative understanding of complex systems preceding more elaborate quantitative studies (Hu and Nöllenburg, 2016; Miryala ). Recent contributions to network representation (like Dehmamy ; McInnes ; Muscoloni ) may provide additional approaches in the future. The widely used Cytoscape program has several useful network visualization tools (Shannon ). Modular organization is especially informative in interactomes and signalling networks, where network modules represent major protein complexes offering an intuitive insight of their functions (Fessenden, 2017; King ; Szalay-Bekő ). However, existing network visualization methods lack an information theoretic foundation, and often result in ‘hairball’ images, which are unable to discriminate network modules. Here, we introduce the EntOptLayout Cytoscape plug-in, which uses the novel, relative entropy minimization-based network representation method we developed earlier (Kovács ). This method introduces network nodes as probability distributions and selects their best spatial representation, which is the hardest to distinguish from the input data. This is achieved by minimizing the relative entropy (also known as the Kullback-Leibler divergence) between the input data and their representation (Kovács ). The EntOptLayout plug-in is able to visualize network modules, highlighting major protein and signalling complexes.

2 Materials and methods

The EntOptLayout Cytoscape plug-in initializes the layout using user provided or random coordinates by assigning a Gaussian probability distribution to each node. The relationships between the nodes are then captured by pairwise overlaps of the node distributions. For a network of n nodes and e edges, the runtime complexity of the plug-in is ∼O(n (Kovács ). The layout is updated in a user-selected frequency to see partial results, while an adjustable time limit is also available. EntOptLayout has several optimization features and user-friendly options as detailed in the Supplementary Data and its Tutorial. As an important option, EntOptLayout may raise the adjacency matrix on the square, which captures the interaction profile similarity of the nodes, and improves the detection of functional network modules even further. EntOptLayout is compatible with Cytoscape 3.7.1 and will be upgraded to its later versions. The source code of the plug-in can be accessed and support tickets can be issued here: https://sourceforge.net/projects/entopt/. Between January 2017 and February 2019 the plug-in was downloaded more than 4700 times and received only maximal, 5-star evaluations.

3 Results and conclusion

Figure 1A and B show the Interactome3D human protein–protein interaction network (Mosca ), visualized by the prefuse force-directed layout option of Cytoscape (Fig. 1A) or by the subsequent use of the EntOptLayout plug-in (Fig. 1B). While the core of the standard Cytoscape visualization was a typical ‘hairball’ image, where protein complexes had a large overlap, EntOptLayout using the ‘square of adjacency matrix’ option displayed the major protein complexes as distinct visual subgroups of the interactome. The same, or even larger differences were observed comparing various other standard Cytoscape and EntOptLayout images, and examining the Cytoscape example genetic interaction, human disease and 75 top node STRING Alzheimer's disease-related interactome network, as well as weighted normal or heat shocked yeast BioGrid interactomes, the map of human cancer signalling, the Reactome human pathway network or network modules of benchmark graphs (see Fig. 1C and D; Supplementary Data; Supplementary Figs S1–S9 and S12S14). In case of the affinity purification and 500 top node STRING network (Supplementary Figs S10 and S11) all the four visualizations (original, prefuse force-directed, spring embedded and EntOptLayout) showed significant overlaps of the modules. On the contrary, modules were clearly distinct and well separated on the EntOptLayout image in case of all the nine other networks listed above, while they showed significant overlaps when original layouts, spring embedded layouts, prefuse force-directed layouts or other layout options were examined (Fig. 1; Supplementary Figs S1–S9 and S12S14). Importantly, the normalized information loss (relative entropy, Kullback-Leibler divergence) between the input data and their layout representation showed a 3- to 25-fold improvement when the EntOptLayout method was compared to conventional methods in all cases examined (Fig. 1 and Supplementary Figs S1–S14).
Fig. 1.

Visualization of major protein complexes by the EntOptLayout Cytoscape plug-in. Coloured segments of the image represent various major protein complexes, showing the same, maximum 200 core nodes of the respective network module/community identified by the ModuLand Cytoscape plug-in (Szalay-Bekő ) as detailed in the legend of Supplementary Figure S1. Panels A and B show the Interactome3D human protein–protein interaction network (Mosca ) visualized by the Cytoscape prefuse force-directed layout option alone or by the subsequent use of the EntOptLayout plug-in (switching on the square of the adjacency matrix, ignoring the square of the diagonal and performing consecutive optimizations for 10 000 s each for node position, node width, node position, node width and node position after a pre-ordering made by the prefuse force directed layout), respectively. ‘D’ denotes the normalized information loss (relative entropy) of the layouts stored in the Network Table of the plug-in (in case of the Cytoscape layout its node positions were imported to the EntOptLayout plug-in, and only the node probability distributions were optimized keeping the node positions intact as described in Chapter 5 of the plug-in Tutorial). We note that the 10 000 s alternating position and node width optimization steps should be continued until the ‘D’ value (the normalized information loss) of the layout is decreasing. ‘D’ values are usually becoming minimal after 4–5 subsequent optimization steps. We also note that the use of the prefuse force-directed layout as a pre-ordering layout option before the use of the EntOptLayout shortens the required optimization time and allows the correct positioning of a few (usually 1–4) nodes, which became mis-positioned if this pre-ordering is not used. We recommend the use of the prefuse force-directed algorithm as pre-ordering, since the combination of only this algorithm with the EntOptLayout (but not 3 other Cytoscape layout options) resulted the correct positioning of all nodes (see Supplementary Fig. S2). Panels C and D show the map of human cancer signalling (Cui ) visualized in the same way as shown in Panels A and B

Visualization of major protein complexes by the EntOptLayout Cytoscape plug-in. Coloured segments of the image represent various major protein complexes, showing the same, maximum 200 core nodes of the respective network module/community identified by the ModuLand Cytoscape plug-in (Szalay-Bekő ) as detailed in the legend of Supplementary Figure S1. Panels A and B show the Interactome3D human protein–protein interaction network (Mosca ) visualized by the Cytoscape prefuse force-directed layout option alone or by the subsequent use of the EntOptLayout plug-in (switching on the square of the adjacency matrix, ignoring the square of the diagonal and performing consecutive optimizations for 10 000 s each for node position, node width, node position, node width and node position after a pre-ordering made by the prefuse force directed layout), respectively. ‘D’ denotes the normalized information loss (relative entropy) of the layouts stored in the Network Table of the plug-in (in case of the Cytoscape layout its node positions were imported to the EntOptLayout plug-in, and only the node probability distributions were optimized keeping the node positions intact as described in Chapter 5 of the plug-in Tutorial). We note that the 10 000 s alternating position and node width optimization steps should be continued until the ‘D’ value (the normalized information loss) of the layout is decreasing. ‘D’ values are usually becoming minimal after 4–5 subsequent optimization steps. We also note that the use of the prefuse force-directed layout as a pre-ordering layout option before the use of the EntOptLayout shortens the required optimization time and allows the correct positioning of a few (usually 1–4) nodes, which became mis-positioned if this pre-ordering is not used. We recommend the use of the prefuse force-directed algorithm as pre-ordering, since the combination of only this algorithm with the EntOptLayout (but not 3 other Cytoscape layout options) resulted the correct positioning of all nodes (see Supplementary Fig. S2). Panels C and D show the map of human cancer signalling (Cui ) visualized in the same way as shown in Panels A and B Interestingly, the edge structure of the spring embedded layout also showed dense clusters, which may imply a modular structure. However, this modular structure became covered if the diameter of the nodes was increased to a usual size. On the contrary, the same modular structure remained clearly identifiable when we used the EntOptLayout, since this latter algorithm gave a distinct localization of the modules. Such distinct localization could not be observed when using the spring embedded layout (Supplementary Fig. S3). In summary, we highlight the ‘pros’ and ‘cons’ of using the EntOptLayout network visualization Cytoscape plugin. The major advantage of using EntOptLayout is that it is the only algorithm, which gives a clear visual discrimination of functional protein complexes in most networks. Better optical discrimination of protein complexes may help to discover the emergence of novel functions in changing interactomes or signalling networks during the propagation of a disease, cellular differentiation, wound healing, embryogenesis, etc. Importantly, the algorithm also minimizes the information loss during the visualization process, thus its image is not only functionally better but is also theoretically closer to an ‘optimal’ image. It is a disadvantage of the EntOptLayout algorithm that it sometimes—mostly when using the ‘square of adjacency matrix option’—gives aesthetically less pleasing images than other visualization algorithms, such as the widely used force-directed algorithm. This is due to the fact that the EntOptLayout does not optimize the image for the shortest length of edges or for crossing edges. An additional disadvantage of the EntOptLayout in case of large networks is the 10 000 s suggested running time of each optimization cycle as described in the legend of Figure 1. We are currently developing an upgrade of the algorithm which will allow shorter running times. In conclusion, the use of the EntOptLayout plug-in in 9 out of 11 cases outperformed alternative Cytoscape layout options in the visual discrimination of network modules. This is especially important in human interactomes and signalling networks, providing an intuitive insight into the functional organization under healthy and pathological conditions. Click here for additional data file.
  10 in total

1.  Cytoscape: a software environment for integrated models of biomolecular interaction networks.

Authors:  Paul Shannon; Andrew Markiel; Owen Ozier; Nitin S Baliga; Jonathan T Wang; Daniel Ramage; Nada Amin; Benno Schwikowski; Trey Ideker
Journal:  Genome Res       Date:  2003-11       Impact factor: 9.043

2.  Protein complex prediction via cost-based clustering.

Authors:  A D King; N Przulj; I Jurisica
Journal:  Bioinformatics       Date:  2004-06-04       Impact factor: 6.937

3.  ModuLand plug-in for Cytoscape: determination of hierarchical layers of overlapping network modules and community centrality.

Authors:  Máté Szalay-Beko; Robin Palotai; Balázs Szappanos; István A Kovács; Balázs Papp; Péter Csermely
Journal:  Bioinformatics       Date:  2012-06-19       Impact factor: 6.937

4.  Interactome3D: adding structural details to protein networks.

Authors:  Roberto Mosca; Arnaud Céol; Patrick Aloy
Journal:  Nat Methods       Date:  2012-12-16       Impact factor: 28.547

Review 5.  Discerning molecular interactions: A comprehensive review on biomolecular interaction databases and network analysis tools.

Authors:  Sravan Kumar Miryala; Anand Anbarasu; Sudha Ramaiah
Journal:  Gene       Date:  2017-11-10       Impact factor: 3.688

6.  Protein maps chart the causes of disease.

Authors:  Marissa Fessenden
Journal:  Nature       Date:  2017-09-13       Impact factor: 49.962

7.  A unified data representation theory for network visualization, ordering and coarse-graining.

Authors:  István A Kovács; Réka Mizsei; Péter Csermely
Journal:  Sci Rep       Date:  2015-09-08       Impact factor: 4.379

8.  A structural transition in physical networks.

Authors:  Nima Dehmamy; Soodabeh Milanlouei; Albert-László Barabási
Journal:  Nature       Date:  2018-11-28       Impact factor: 49.962

9.  A map of human cancer signaling.

Authors:  Qinghua Cui; Yun Ma; Maria Jaramillo; Hamza Bari; Arif Awan; Song Yang; Simo Zhang; Lixue Liu; Meng Lu; Maureen O'Connor-McCourt; Enrico O Purisima; Edwin Wang
Journal:  Mol Syst Biol       Date:  2007-12-18       Impact factor: 11.429

10.  Machine learning meets complex networks via coalescent embedding in the hyperbolic space.

Authors:  Alessandro Muscoloni; Josephine Maria Thomas; Sara Ciucci; Ginestra Bianconi; Carlo Vittorio Cannistraci
Journal:  Nat Commun       Date:  2017-11-20       Impact factor: 14.919

  10 in total
  10 in total

1.  Contrasting effects of genotype and root size on the fungal and bacterial communities associated with apple rootstocks.

Authors:  Jia Liu; Ahmed Abdelfattah; Birgit Wasserman; Michael Wisniewski; Samir Droby; Gennaro Fazio; Mark Mazzola; Xuehong Wu
Journal:  Hortic Res       Date:  2022-01-05       Impact factor: 6.793

2.  A Year in the Life of the EU-CardioRNA COST Action: CA17129 Catalysing Transcriptomics Research in Cardiovascular Disease.

Authors:  Emma Louise Robinson; Clarissa Pedrosa da Costa Gomes; Ines Potočnjak; Jan Hellemans; Fay Betsou; David de Gonzalo-Calvo; Monika Stoll; Mehmet Birhan Yilmaz; Bence Ágg; Dimitris Beis; Maria Carmo-Fonseca; Francisco J Enguita; Soner Dogan; Bilge G Tuna; Blanche Schroen; Wim Ammerlaan; Gabriela M Kuster; Irina Carpusca; Thierry Pedrazzini; Costanza Emanueli; Fabio Martelli; Yvan Devaux
Journal:  Noncoding RNA       Date:  2020-05-18

3.  Modular Reorganization of Signaling Networks during the Development of Colon Adenoma and Carcinoma.

Authors:  Klára Schulc; Zsolt T Nagy; Sebestyén Kamp; János Molnár; Daniel V Veres; Peter Csermely; Borbála M Kovács
Journal:  J Phys Chem B       Date:  2021-02-09       Impact factor: 2.991

4.  Efficient link prediction in the protein-protein interaction network using topological information in a generative adversarial network machine learning model.

Authors:  Olivér M Balogh; Bettina Benczik; András Horváth; Mátyás Pétervári; Péter Csermely; Péter Ferdinandy; Bence Ágg
Journal:  BMC Bioinformatics       Date:  2022-02-19       Impact factor: 3.169

5.  Computational Inference of Synaptic Polarities in Neuronal Networks.

Authors:  Michael R Harris; Thomas P Wytock; István A Kovács
Journal:  Adv Sci (Weinh)       Date:  2022-03-31       Impact factor: 17.521

6.  Disease- and headache-specific microRNA signatures and their predicted mRNA targets in peripheral blood mononuclear cells in migraineurs: role of inflammatory signalling and oxidative stress.

Authors:  Timea Aczél; Bettina Benczik; József Kun; Zsuzsanna Helyes; Bence Ágg; Tamás Körtési; Péter Urbán; Witold Bauer; Attila Gyenesei; Bernadett Tuka; János Tajti; Péter Ferdinandy; László Vécsei; Kata Bölcskei
Journal:  J Headache Pain       Date:  2022-09-02       Impact factor: 8.588

7.  Exploring the Novel Computational Drug Target and Associated Key Pathways of Oral Cancer.

Authors:  Fatema Akhter; Fawzia Haif Al Kahtani; Zainah Mohammed Sambawa; Deema Abdulrahman Alhassan; Reema Abdulaziz AlSaif; Tahsinul Haque; Mohammad Khursheed Alam; Md Tanvir Hasan; Md Rakibul Islam; Kawsar Ahmed; Rehana Basri
Journal:  Curr Issues Mol Biol       Date:  2022-08-09       Impact factor: 2.976

8.  Network Analysis for Signal Detection in Spontaneous Adverse Event Reporting Database: Application of Network Weighting Normalization to Characterize Cardiovascular Drug Safety.

Authors:  Bence Ágg; Péter Ferdinandy; Mátyás Pétervári; Bettina Benczik; Olivér M Balogh; Balázs Petrovich
Journal:  Drug Saf       Date:  2022-10-06       Impact factor: 5.228

9.  Cardiac miRNA Expression and their mRNA Targets in a Rat Model of Prediabetes.

Authors:  Éva Sághy; Imre Vörös; Bence Ágg; Bernadett Kiss; Gábor Koncsos; Zoltán V Varga; Anikó Görbe; Zoltán Giricz; Rainer Schulz; Péter Ferdinandy
Journal:  Int J Mol Sci       Date:  2020-03-20       Impact factor: 5.923

10.  Ischaemic post-conditioning in rats: Responder and non-responder differ in transcriptome of mitochondrial proteins.

Authors:  Rolf Schreckenberg; Johann Klein; Hanna Sarah Kutsche; Rainer Schulz; Kamilla Gömöri; Péter Bencsik; Bettina Benczik; Bence Ágg; Éva Sághy; Péter Ferdinandy; Klaus-Dieter Schlüter
Journal:  J Cell Mol Med       Date:  2020-04-16       Impact factor: 5.310

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.