
Graph theoretic network analysis reveals protein pathways underlying cell death following neurotropic viral infection.

Sourish Ghosh1, G Vinodh Kumar1, Anirban Basu1, Arpan Banerjee1.   

Abstract

Complex protein networks underlie any cellular function. Certain proteins play a pivotal role in many network configurations, and disruption of their expression proves fatal to the cell. An efficient method to tease out such key proteins in a network is still unavailable. Here, we used graph-theoretic measures on protein-protein interaction data (interactome) to extract biophysically relevant information about individual protein regulation and network properties such as formation of function-specific modules (sub-networks) of proteins. We took 5 major proteins that are involved in neuronal apoptosis post Chandipura Virus (CHPV) infection as seed proteins in a database to create a meta-network of immediately interacting proteins (1st order network). Graph-theoretic measures were employed to rank the proteins in terms of their connectivity and the degree up to which they can be organized into smaller modules (hubs). We repeated the analysis on the 2nd order interactome, which includes proteins connected directly with proteins of the 1st order. FADD and Casp-3 were connected maximally to other proteins in both analyses, thus indicating their importance in neuronal apoptosis. Thus, our analysis provides a blueprint for the detection and validation of protein networks disrupted by viral infections.

Year:  2015        PMID: 26404759      PMCID: PMC4585883          DOI: 10.1038/srep14438

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


Metabolic functions are outcomes of interactions among various cellular proteins. An emerging concept in the field of proteomics is that understanding these interactions is critical for elucidating the mechanism of metabolic functions12. However, parsing the interactions important for a certain function or disease involves analyzing huge interactomes containing information about a large number of genes and proteins along with their interacting partners. Mathematical modelling has been instrumental in analyzing these huge datasets and in systematically understanding the interplay between various proteins and the metabolic functions involved345. Recent technical developments treat the huge protein interactome as a complex graph wherein individual proteins are the nodes of the graph and the interactions are modelled as the edges6. Graph theoretic analysis provides an efficient handle to decipher various aspects of proteins in a network that interact with a specific functional objective. For example, is one protein more important than others, does a group of proteins exhibit more interactions (i.e., is it more densely connected) than other groups, and do some proteins act as hubs through which the majority of interactions are routed? Graph theory provides several parameters to study properties of constituent proteins in an interactome: degree centrality, clustering, betweenness, shortest path, modularity, etc., each of which may be meaningful for understanding function789. Based on a hypothesis about the operational structure of the interactome, researchers can decide which parameters to investigate. Modularity quantifies how the nodes of a network interact with each other to form “hubs”10. Hubs or modules are closely interacting groups of nodes with dense connections within the module and sparse connections between modules. Real world networks such as the Internet, power grids and brain networks exhibit such properties111213. 
Thus, using modularity, researchers can quantify how many “hubs” of proteins are formed within a given interactome and whether a particular module is the key facilitator of a specific function/disease. Another useful measure using graph theory on interactome data is degree centrality. Degree centrality quantifies the individual contribution of a node (protein) to the interactome14. Depending upon the degree centrality score, the most dominating protein in a particular network can be characterized. Chandipura Virus (CHPV) a member of the Rhabdoviridae family, has been ranked among the emerging viruses in the Indian subcontinent. CHPV was first identified in two patients in the year 1965 from the Chandipura village in Maharashtra (India)15. The first major outbreak took place in 2003 and resulted in death of 183 children. This was followed by sporadic attacks every year. Presently CHPV has a case-by-case fatality rate of around 55–77%161718. The virus has been reported to cause encephalitis along with neurodegeneration leading to death. Common symptoms which have been diagnosed are high grade fever, vomiting, altered sensorium, generalized convulsions, decerebrate posture and coma. CHPV, being an arbovirus with sand flies (Phlebotomus sps.) as the carrier (vector), enters the host system through the skin, penetrating into the circulatory system of the body (which is also referred to as peripheral circulatory system). CHPV is cleared off the peripheral circulatory system by the host immune system within a couple of days post infection as observed in a mouse model181920. But this virus finds a safe place to replicate in the brain. In an earlier article some of us have shown in a mouse model that CHPV induces neuronal death through a Fas-mediated extrinsic apoptosis pathway17. From there we identified 5 proteins pertaining to the extrinsic apoptotic pathway. 
However, that analysis did not give us information about all the proteins that may be involved in the apoptotic process following CHPV infection. In this article, we identified a large number of proteins (from an online database) that interact with the five proteins whose expressions were monitored in the earlier wet-lab experiment of CHPV infection17. The resultant network of proteins constituted a “1st order interactome”. Furthermore, we estimated a “2nd order interactome” by identifying the proteins that interact directly with members of the 1st order interactome. We calculated the modularity of the 1st order and 2nd order interactomes and the degree centrality of individual proteins. Together, these two measures provided a global measure of network segregation and an individual connectivity measure for candidate proteins. The 2nd order interactome results were used to test the robustness of the 1st order interactome results and to address the predictive validity of the model. Together they revealed the protein-protein network configuration underlying neuronal apoptosis following CHPV infection. Face validity was addressed by comparing the empirical measures with those computed on simulated random networks. The methods and results obtained here provide an operational blueprint for understanding the pivotal dependencies of the virus within the host system and will help in the conceptualization and design of effective therapeutics.

Results

From our results in an earlier study17 we concluded that CHPV induces neuronal apoptosis through the Fas-mediated extrinsic apoptotic pathway with the involvement of the following five proteins: Fas, Fas-associated Death Domain (FADD), Caspase-8 (Casp-8), cleaved Caspase-3 (Casp-3), and X-linked inhibitor of apoptosis (XIAP) (Fig. 1, see also Methods section for more details). These 5 proteins were submitted as queries to the STRING 9.1 online database (http://string-db.org/) for extraction of the 1st and 2nd order interactomes. The 1st order interactome contained 26 proteins while the 2nd order interactome contained 71 proteins (Fig. 1b,c). The names of the proteins in the 1st and 2nd order interactomes are presented in Table 1.
Figure 1

(a) Interactions between the monitored proteins Fas, FADD, Casp-8, Casp-3 and XIAP, estimated using the STRING 9.1 database. (b) Proteins interacting directly with Fas, FADD, Casp-8, Casp-3 and XIAP, estimated using the STRING 9.1 database. The nodes represent the proteins while the lines indicate interactions in this 1st order interactome. Only those proteins reported at a confidence level of 95% are considered. (c) The proteins interacting directly with the nodes of the 1st order interactome were extracted analogously to capture the 2nd order interactome.

Table 1

Protein names and community structure (Ci) module assignments in the 1st and 2nd order interactomes. A dash marks proteins that are not part of the 1st order interactome.

Protein Name | Protein | Module (1st) | Module (2nd)
Caspase-3 | Casp3 | 1 | 2
Caspase-8 | Casp8 | 2 | 3
Fas | Fas | 2 | 3
Fas-associated Death Domain | Fadd | 2 | 3
X-linked inhibitor of apoptosis protein | Xiap | 4 | 2
Fas Ligand | Fasl | 2 | 3
Tumor Necrosis Factor (TNF) receptor-associated factor 2 | Traf2 | 3 | 3
Tumor necrosis factor receptor type 1-associated DEATH domain | Tradd | 3 | 3
Receptor-interacting serine/threonine-protein kinase 1 | Ripk1 | 3 | 3
CASP8 and FADD-like apoptosis regulator | Cflar | 2 | 3
Tumor necrosis factor receptor superfamily, member 10b | Tnfrsf10b | 2 | 3
B-cell receptor associated protein 31 | Bcap31 | 2 | 3
Baculoviral IAP repeat-containing protein 2 | Birc2 | 4 | 8
Baculoviral IAP repeat-containing protein 3 | birc3 | 4 | 8
Gelsolin | Gsn | 1 | 10
DNA fragmentation factor subunit alpha | Dffa | 1 | 2
DNA fragmentation factor subunit beta | Dffb | 1 | 2
Apoptotic protease activating factor 1 | Apaf1 | 4 | 2
Nerve Growth Factor | Ngf | 1 | 11
RAC-alpha serine/threonine-protein kinase | Akt1 | 1 | 11
Death-associated protein 6 | Daxx | 2 | 10
Tumor Necrosis Factor | Tnf | 3 | 9
Tumor necrosis factor receptor superfamily member 1A | tnfrsf1a | 3 | 3
Caspase-9 | Casp9 | 4 | 2
Caspase-7 | Casp7 | 4 | 2
Direct IAP binding protein with low pI | Diablo | 4 | 2
Proto-oncogene tyrosine-protein kinase | Fyn | - | 2
Cylindromatosis | Cyld | - | 3
Cluster of Differentiation-40 | Cd40 | - | 3
TNF receptor-associated factor 3 | Traf3 | - | 8
Ubiquitin-C | Ubc | - | 10
Toll-like receptor adaptor molecule 1 | Ticam1 | - | 8
Death domain-containing protein | Cradd | - | 3
Inhibitor of nuclear factor kappa-B kinase subunit gamma | Ikbkg | - | 3
TNF receptor-associated factor 1 | Traf1 | - | 8
Toll-like receptor 4 | Tlr4 | - | 8
Mitogen-activated protein kinase kinase kinase 7 | map3k7 | - | 8
Profilin 1 | Pfn1 | - | 10
B cell leukemia/lymphoma 2 | Bcl2 | - | 1
BCL2-like 11 (apoptosis facilitator) | bcl2l11 | - | 1
Signal transducer and activator of transcription 3 | Stat3 | - | 8
Furin | Furin | - | 10
Transient receptor potential cation channel, subfamily V, member 1 | Trpv1 | - | 8
Fibroblast growth factor receptor substrate 2 | Frs2 | - | 3
Neurotrophic tyrosine kinase, receptor, type 2 | Ntrk2 | - | 3
Nerve growth factor receptor (TNFR superfamily, member 16) | Ngfr | - | 8
Neurotrophic tyrosine kinase, receptor, type 1 | Ntrk1 | - | 8
src homology 2 domain-containing transforming protein C1 | Shc1 | - | 8
Cyclin-dependent kinase inhibitor 1A (P21) | Cdkn1a | - | 1
Forkhead box O4 | Foxo4 | - | 1
cAMP responsive element binding protein 1 | Creb1 | - | 10
Transformed mouse 3T3 cell double minute 2 | Mdm2 | - | 10
Cyclin-dependent kinase inhibitor 1B | Cdkn1b | - | 1
Forkhead box O1 | Foxo1 | - | 1
Tuberous sclerosis 2 | Tsc2 | - | 1
Mechanistic target of rapamycin (serine/threonine kinase) | Mtor | - | 1
Phosphatase and tensin homolog | Pten | - | 1
RPTOR independent companion of MTOR, complex 2 | Rictor | - | 1
SMT3 suppressor of mif two 3 homolog 1 | Sumo1 | - | 6
Mitogen-activated protein kinase kinase kinase 5 | Map3k5 | - | 1
DNA methyltransferase 1-associated protein 1 | Dmap1 | - | 10
Homeodomain interacting protein kinase 1 | Hipk1 | - | 10
Alpha thalassemia/mental retardation syndrome X-linked homolog | Atrx | - | 10
v-rel avian reticuloendotheliosis viral oncogene homolog A | Rela | - | 9
FBJ osteosarcoma oncogene | Fos | - | 9
Adiponectin, C1Q and collagen domain containing | Adipoq | - | 5
Leptin | Lep | - | 9
Interleukin-6 | Il6 | - | 9
Interleukin-1a | Il1a | - | 9
Peroxisome proliferative activated receptor, gamma, coactivator 1 alpha | Ppargc1a | - | 7
Claspin | Clspn | - | 10

The proteins in the table are listed in the order in which they were queried from the STRING 9.1 database. The first 5 are the proteins whose expression was monitored empirically, the next 21 are their primary interacting partners, and the final 45 are the secondary interacting partners.

The MATLAB-based Visual Connectome Toolbox21 was used for graph-theoretic analysis of the 1st and 2nd order interactome data. We computed the degree centrality of each protein in the 1st and 2nd order networks and arranged the proteins in descending order of this score (Table 2). In both the 1st and 2nd order networks, FADD and Casp-3 are the common members among the top 5 proteins with the highest degree centrality values. This mutual cross-validation of the 1st and 2nd order network analyses supports FADD and Casp-3 as dominant players in the apoptotic pathway underlying CHPV infection in neurons. Modularity determines how well a network can be divided into subgroups (hubs). Generally the modularity score lies in the range [−0.5, 1), with more modular networks having a more positive score. A network with randomly assigned edges will have a modularity score of approximately zero. We computed the modularity scores of both the 1st and 2nd order networks. The modularity score of the 1st order network was 0.36 while that of the 2nd order network was 0.41. Theoretically, because the graph-theoretic algorithm is initiated with a random partitioning of nodes into modules, the results may vary from trial to trial unless the modular structure is unambiguous. In our data set the modularity score remained unchanged in all 50 repetitions of the analysis. Additionally, we evaluated the significance of the estimated modularity score by comparing it with the modularity scores of random networks with an identical number of nodes. To build each random network, we started with an adjacency matrix with all values set to zero for the given number of nodes. We then randomly assigned the value 1 to locations in the upper triangle of the matrix. Finally, the symmetric locations in the lower triangle were assigned the value 1 to complete the adjacency matrix for which network metrics were computed. Diagonal elements were always assigned the value 0 to avoid self-connections. 
The mean modularity score of a random network (50 repetitions) with 26 nodes was 0.13, whereas for a random network with 71 nodes, the score was 0.09. In both cases the estimated modularity values of the empirical networks were statistically significant at Bonferroni corrected p < 0.05 (χ2 = 20.67, df = 1 for 1st order and χ2 = 58.40, df = 1 for 2nd order interactome). In case of the 1st order network, our analysis indicated the presence of 4 modules whereas in case of 2nd order network 12 modules were identified.
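The random-network construction described above can be sketched in a few lines of Python. This is a minimal illustration and not the authors' MATLAB code: the function name `random_adjacency`, the edge probability of 0.2 and the fixed random seed are assumptions introduced here, because the text does not state how many matrix entries were set to 1.

```python
import random


def random_adjacency(n_nodes, edge_prob, rng):
    """Return a symmetric 0/1 adjacency matrix with a zero diagonal."""
    adj = [[0] * n_nodes for _ in range(n_nodes)]
    for i in range(n_nodes):
        for j in range(i + 1, n_nodes):  # visit the upper triangle only
            if rng.random() < edge_prob:
                adj[i][j] = 1
                adj[j][i] = 1  # mirror the entry into the lower triangle
    return adj


# Assumed settings: 50 repetitions for a 26-node network, as in the text;
# the edge probability (0.2) and the seed (0) are illustrative choices.
rng = random.Random(0)
null_networks = [random_adjacency(26, 0.2, rng) for _ in range(50)]
```

Each matrix in `null_networks` can then be passed to the same modularity routine as the empirical network, so that the empirical score is compared against a distribution of null scores.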
Table 2

Proteins arranged in decreasing order of their degree centrality (Z) scores for the 1st order (a) and 2nd order (b) interactomes.

(a) 1st Order
Protein | Z score
Casp3 | 1.9518
Casp8 | 1.569143
Casp9 | 1.224745
Casp7 | 1.224745
Fadd | 0.998545
Diablo | 0.612372
Fas | 0.427948
Traf2 | 0
Tradd | 0
Ripk1 | 0
Tnf | 0
tnfrsf1a | 0
Fasl | −0.14265
Cflar | −0.14265
Tnfrsf10b | −0.14265
Dffa | −0.24398
Dffb | −0.24398
Ngf | −0.24398
Akt1 | −0.24398
Xiap | −0.61237
Birc2 | −0.61237
birc3 | −0.61237
Gsn | −0.9759
Apaf1 | −1.22474
Bcap31 | −1.28384
Daxx | −1.28384

(b) 2nd Order
Protein | Z score
Akt1 | 3.1326
Casp8 | 2.0533
Ngf | 1.7889
Fadd | 1.7339
Casp3 | 1.6202
Tnf | 1.3618
Il6 | 1.3618
Ripk1 | 1.0951
Casp9 | 0.9001
Birc2 | 0.7771
birc3 | 0.7771
Cd40 | 0.7771
Tlr4 | 0.7771
Traf2 | 0.4563
Tsc2 | 0.2996
Mtor | 0.2996
Xiap | 0.18
Casp7 | 0.18
Diablo | 0.18
tnfrsf1a | 0.1369
Fos | 0.0908
Gsn | 0
Daxx | 0
Ubc | 0
Pfn1 | 0
Ngfr | 0
Ntrk1 | 0
Shc1 | 0
Mdm2 | 0
Sumo1 | 0
Map3k5 | 0
Dmap1 | 0
Hipk1 | 0
Atrx | 0
Adipoq | 0
Ppargc1a | 0
Clspn | 0
bcl2l11 | −0.0545
Stat3 | −0.0545
Foxo1 | −0.0545
Traf3 | −0.111
map3k7 | −0.111
Cflar | −0.1825
Ikbkg | −0.1825
Fyn | −0.4086
Bcl2 | −0.4086
Cdkn1b | −0.4086
Pten | −0.4086
Rictor | −0.4086
Furin | −0.4472
Trpv1 | −0.4472
Frs2 | −0.4472
Ntrk2 | −0.4472
Fas | −0.5019
Fasl | −0.5019
Tradd | −0.5019
Tnfrsf10b | −0.5019
Apaf1 | −0.5401
Rela | −0.5447
Lep | −0.5447
Il1a | −0.5447
Cdkn1a | −0.7627
Foxo4 | −0.7627
Cyld | −0.8213
Traf1 | −0.9991
Bcap31 | −1.1407
Cradd | −1.1407
Creb1 | −1.1802
Dffa | −1.2601
Dffb | −1.2601
Ticam1 | −1.8872
Using the Ci scores from Table 1 and Fig. 2, we color coded each module in Fig. 1(b,c). Modules 2 and 4 of the 1st order interactome and modules 3 and 2 of the 2nd order interactome, respectively, are presented in identical colors because they have multiple common members. The common members of module 2 from the 1st order and module 3 from the 2nd order are Casp-8, Tnfrsf10b, Cflar, Fas, FADD and TRADD. Module 4 from the 1st order and module 2 from the 2nd order share Casp-9, Casp-7, XIAP, Apaf-1 and Diablo. Extraction of a consistent network structure from the analysis of the 1st order and 2nd order interactomes provides confidence about the biological relevance of the key modules. Table 3 lists the UniProt IDs of all proteins identified in the 1st and 2nd order interactomes.
Figure 2

Representative plots of community structure (Ci) versus protein node number for the 1st (a) and 2nd (b) order interactomes.

The Ci values for each analysis were obtained by running the code 50 times. The mean Ci value of each protein, corresponding to the mean modularity score (Q), was then plotted against the corresponding protein node number.

Table 3

UniProt identifiers for all the proteins used in our analysis.

Protein Symbol | UniProt ID | Protein Symbol | UniProt ID
Casp3 | P70677 | map3k7 | Q923A8
Casp8 | O89110 | Pfn1 | P62962
Fas | P25446 | Bcl2 | P10417
Fadd | Q61160 | bcl2l11 | O54918
Xiap | Q60989 | Stat3 | P42227
Fasl | P41047 | Furin | P23188
Traf2 | P39429 | Trpv1 | Q704Y3
Tradd | Q3U0V2 | Frs2 | Q8C180
Ripk1 | Q60855 | Ntrk2 | P15209
Cflar | O35732 | Ngfr | Q8CFT3
Tnfrsf10b | Q9QZM4 | Ntrk1 | Q3UFB7
Bcap31 | Q61335 | Shc1 | P98083
Birc2 | Q62210 | Cdkn1a | P39689
birc3 | O08863 | Foxo4 | Q9WVH3
Gsn | P13020 | Creb1 | Q01147
Dffa | O54786 | Mdm2 | P23804
Dffb | O54788 | Cdkn1b | P46414
Apaf1 | O88879 | Foxo1 | Q9R1E0
Ngf | P01139 | Tsc2 | Q7TT21
Akt1 | P31750 | Mtor | Q9JLN9
Daxx | O35613 | Pten | O08586
Tnf | P06804 | Rictor | Q6QI06
tnfrsf1a | P25118 | Sumo1 | P63166
Casp9 | Q8C3Q9 | Map3k5 | Q14AY4
Casp7 | P97864 | Dmap1 | Q9JI44
Diablo | Q9JIQ3 | Hipk1 | O88904
Fyn | P39688 | Atrx | Q61687
Cyld | Q80TQ2 | Rela | Q04207
Cd40 | P27512 | Fos | P01101
Traf3 | Q60803 | Adipoq | Q60994
Ubc | P0CG50 | Lep | P41160
Ticam1 | Q80UF7 | Il6 | P08505
Cradd | O88843 | Il1a | P01582
Ikbkg | Q8VC91 | Ppargc1a | O70343
Traf1 | P39428 | Clspn | Q80YR7
Tlr4 | Q9QUK6 | |

Discussion

In this report we propose an analysis framework to compute the modular structure of a complex protein-protein interaction network (interactome). The choice of seed proteins, namely Fas, Fas-associated Death Domain (FADD), Caspase-8, Caspase-3 and X-linked Inhibitor of Apoptosis Protein (XIAP), for the construction of the interactome was guided by our previous experimental findings17. These were apoptotic proteins over-expressed in mouse neurons following Chandipura Virus infection. We used the STRING 9.1 database to compute the first order interactome. There are currently several bioinformatics toolboxes available, each with its own set of unique controls. We chose STRING 9.1 because, to the best of our knowledge, it was the only tool that allowed us to prune networks based on a statistical confidence level. However, it is pertinent to note that the database used to extract the interactome will strongly influence the estimation of any functional modular structure. A study comparing the interactomes extracted from several data sets may help future research in terms of data interpretation. Next, we computed the graph theory metrics Modularity Score (Q), Community Structure (Ci) and Degree Centrality (Z) to draw further inferences about the protein-protein interactions underlying apoptosis. To establish the predictive validity of our analysis, we constructed a second order interactome based on secondary interacting partners of the seed proteins using the STRING 9.1 database (at 95% confidence) and re-calculated the graph theory metrics. The consistent presence of key protein assemblies in the first order and second order interactomes provides confidence regarding the robustness of our approach. Finally, we compared the modularity and degree centrality computed in the empirical networks with those obtained for simulated random networks. 
Since no modular structures are expected in a random network, this addressed the issue of face validity, that is, whether the method is successful in extracting meaningful information, and helped us control false positives. The modularity score (Q) of a network lies in the range [−0.5, 1), with zero or negative Q scores signifying random interactions within the network. As the within-group interactions increase, the network becomes more modular and the Q value shifts towards the positive side, approaching 1. For every network there exists an optimal Q value beyond which the modularity score cannot be enhanced even if we increase the number of modules. In our case the Q values of the 1st order and 2nd order interactomes were 0.3911 and 0.4716, respectively. These scores were stable across 50 independent runs. The community structure also remained unchanged. These two findings give us the confidence to state that protein-protein interactions are indeed highly modular due to their inherent biological properties. Hence the interacting nodes of both networks have been classified into the maximum possible number of modules. Next we focus on each module to decipher its biological significance. In the first order interactome, 4 interactive modules were identified (Fig. 1b). The proteins clearly segregated into separate modules on the basis of their functional role in the apoptotic process. Modules 2 and 3 consist of proteins that are mostly known as components of the death domain (DD) and the death-inducing signalling complex (DISC). Proteins like FADD, TRADD, Cflar, RIPK1, Daxx, Bcap31, Tnfrsf1a and Tnfrsf10b have been reported to contribute to the DD2223, while Caspase-8 and FADD form the DISC2425. Other members like Fas (Module 2) and TNF (Module 3) are commonly known as the initiators of the death process. 
Module 3 consists of proteins that are co-stimulators of tumor necrosis factor (TNF) induced cell death whereas Module 2 consists of proteins that contribute to both Fas and TNF pathways. Module 4 is a heterogeneous group that consists of both apoptotic activators and inhibitors that belong to caspase group. XIAP has been previously reported both in our previous report and other researchers to be a Casp3 antagonist17 while Birc2 & Birc3 are well known to be in association with TNF to combat the apoptosis signalling2627. Surprisingly, TNF and Birc2 and Birc3 were not in the same module in our 1st order interactome. Other apoptotic activators of module 4 are Apaf-1 and Diablo along with the caspases like Casp 9 & 7. Overall this module represents proteins that are affecting the intermediate phase of apoptosis before the appearance of the final executioner of the apoptotic pathways. Module 1 is a classical cluster consisting of the close interactors of Casp3, the final executioner of the apoptotic pathway. This module consists of some of the targets of Casp3 which gets cleaved in order to bring about various changes in the cellular environment and to help in completion of the apoptotic process. Both Dffa and Dffb are cleaved by Casp3 to effect the DNA fragmentation28 while Gsn cleavage brings about morphological changes to the cell during apoptosis29. Ngf30 has been previously reported to be closely associated with Casp3. Akt-1 activation in response to cytokine receptor signalling has been associated with anti-apoptotic processes31. In our analysis we observed that although Akt-1 is linked with other modules, its association with Caspase-3 is strong, and as a result Akt-1 has been grouped in Module 1. However, the scenario changes drastically once we enhance the network including the primary interactors of each of the proteins in the 1st order interactome model to develop the 2nd order interactome. 
The 2nd order interactome segregated into 12 modules, among which 7 were larger groups, each containing 6 or more members while the rest were smaller groups with single nodes (Fig. 1c). The module configurations of the 2nd order interactome clearly indicate that most members of module 3 and 2 are also present in module 2 & 4 of the 1st order interactome, respectively. Module 3 in 2nd order interactome consists of Casp-8 and FADD, key players of the DD and DISC processes. Module 2 is now an integrated assembly formed from nodes of module 1 and 4 of 1st order interactome and consists of proteins taking part in the intermediate stage and the final execution of apoptosis. A closer look at the 2nd order interactome reveals the 4 major groups apart from 2 and 3. Modules 1, 8 and 9 have been built around few of the major anti-apoptotic proteins of the 1st order interactome for example Akt1, Birc 2, Birc3, Traf2 and TNF. We observed that in 1st order interactome TNF and Traf2 were included within module 3 whereas in 2nd order interactome TNF and Traf2 were placed in modules 8 and 9 respectively. TNF has been earlier reported to be involved in activation of apoptotic pathways3233. But from our analysis we propose TNF may have some anti-apoptotic function based on its interactions with cytokines IL-1a and IL-6, that have been reported to be involved in cell survival3435. The modules 8, 10 and 11 being influenced by the anti-apoptotic proteins form a significant part of this network that was not so prominent in the 1st order interactome. Other modules such as 4, 5, 6, 7 and 12 although consisting of fewer members in the context of our study, have the potential to embark into larger modules if an even bigger network is considered. This is simply because these modules consist of very important proteins that have been known to play pivotal roles in apoptosis. Degree centrality is simply defined as the interaction score of a particular node within a network. 
The more interactions a node has within a group of nodes which are mutually interacting among each other, the higher its chance will be to form a module. Hence the community structure formation largely depends upon degree centrality of the nodes within a complex network. Casp3 and FADD were ranked among top 5 proteins when nodes of 1st and 2nd order interactomes were sorted in terms of degree centrality (Table 2). This signifies the pivotal role played by these two proteins in apoptosis and also gives us confidence to interpret the biological significance of modules from graph-theoretic measures. In Table 2 we see an interesting pattern. Nodes in the 1st order interactome that have positive degree centrality scores remained to be in the positive side in the 2nd order interactome. However, degree centrality of nodes that had 0 or negative values in the 1st order connectome either enhanced or got depreciated in 2nd order. In order to explain this pattern we have to carefully analyze both the interactome models. Nodes having positive scores in the 1st order connectome interact not only maximally within their modules but also with other nodes in different modules. Hence, with the increase in number of interacting partners in the 2nd order interactome, the overall connectivity is enhanced for the constituent nodes. For example, Akt1 in the 1st order interactome interacts with several nodes of different modules but not consistently within one module. However, in the 2nd order interactome, the degree centrality of Akt1 increased and creation of a separate module involving Akt1 was observed3637. Other nodes like TNF and Traf2 that were in one module in 1st order, increased their interactive partners and gained entry to bigger modules in 2nd order interactome. Nodes that have a predominant role to play in apoptosis maintained their modules and their degree centrality scores across both 1st and 2nd order interactome models. 
In conclusion, we have outlined a robust method for studying the interactome underlying apoptosis following CHPV infection. This method may be used to study other metabolic pathways in order to yield important information about the strategic proteins of a specific network and the functionally important modules within the network. In the future, therapeutic targeting of particular proteins in case of various disease conditions needs to be investigated.

Methods

Empirical data

In an earlier study17, samples of Chandipura Virus were inoculated intraperitoneally (i.p.) into BALB/c mice at post-natal day 10, at 3 × 10^5 plaque forming units per ml (pfu/ml). The animals developed CHPV-related symptoms of hind limb paralysis, high grade fever and severe weight loss within 72–96 hours post infection, leading to death. From immunoblotting and immunostaining analyses performed on the extracted brain tissue, we found over-expression of 5 proteins of the extrinsic apoptosis pathway: Fas, FADD (Fas-associated Death Domain), Caspase-8, Caspase-3 and XIAP (X-linked inhibitor of apoptosis protein). Our results were further validated using RNAi studies, ELISA assays and flow-cytometric analyses17. Table 3 lists the protein names with their corresponding UniProt IDs.

Generation of meta-network

STRING (Search Tool for the Retrieval of Interacting Genes/Proteins) is an open data source providing information about protein-protein interactions based on experimental data, computational prediction methods and public databases38. The STRING 9.1 database contains information about more than 5.4 million proteins from >1100 organisms39. The database has two modes of application: Protein-mode (for protein interactions) and COG-mode (for gene interactions). STRING imports protein association information from databases of physical interaction and curated biological pathway knowledge (MINT, HPRD, BIND, DIP, BioGRID, KEGG, Reactome, IntAct, EcoCyc, NCI-Nature Pathway Interaction Database, GO). Proteins or genes are submitted as queries to the STRING database, which returns the associations as a graph network with nodes (proteins/genes) and edges (interactions). The edges are weighted and integrated, and a confidence score is assigned to each of them based upon the evidence for the association obtained from experimental data, computational prediction and public data collection methods. Based on these scores, edges are assigned various shades of color (blue)38. The prediction methods generally used in determining the interactions are:

Neighbourhood

This prediction method assumes that protein interactions validated in one or more species carry more weight and a higher confidence score.

Gene Fusion

Proteins fused in one genome are likely to be functionally linked and hence carry stronger association.

Co-occurrence

Two proteins that occur within the same metabolic pathway are predicted to be functionally linked with each other. Hence their co-occurrence strengthens their confidence score.

Co-expression

Two proteins that are expressed simultaneously are also predicted to interact strongly with each other.

Generation of 1st order interactome

The 5 proteins identified through molecular analyses were queried in STRING 9.1 against the Mus musculus database, which returned a network of 26 proteins (the 5 seed proteins and their 21 interacting partners). STRING 9.1 defines the significance of the interactions between queried proteins in terms of a confidence score. This confidence score is an empirical score defined by the number of citations and the experimental evidence for a particular interaction. The highest confidence score available in the database (0.95) was chosen to extract the interactomes in this study. Furthermore, we limited the maximum number of interacting partners to 1000 and used neighbourhood, gene fusion, co-occurrence and co-expression as the active prediction methods.

Generation of 2nd order network

In order to investigate the structure of an even larger network we queried for interacting partners of all the 26 proteins obtained from the previous analysis. The 2nd order interactome in Fig. 1c was generated from the STRING 9.1 database using the same confidence score (0.95) as for the 1st order interactome and the same limit of 1000 interacting partners.
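The two-step expansion from seed proteins to 1st and 2nd order interactomes can be illustrated with a small sketch. It assumes the interactions are available locally as (protein, protein, confidence) triples; the helper `expand`, the toy edge list, its confidence values and the placeholder names `ProteinX` and `ProteinY` are hypothetical and are not STRING output.

```python
def expand(seeds, edges, min_conf):
    """Return the seeds plus every protein linked to a seed by an edge
    whose confidence is at least min_conf."""
    members = set(seeds)
    for a, b, conf in edges:
        if conf < min_conf:
            continue  # prune low-confidence interactions
        if a in seeds:
            members.add(b)
        if b in seeds:
            members.add(a)
    return members


# Hypothetical toy edge list (protein_a, protein_b, confidence).
toy_edges = [
    ("Fas", "Fadd", 0.99),
    ("Fadd", "Casp8", 0.99),
    ("Casp8", "Casp3", 0.98),
    ("Casp3", "Xiap", 0.99),
    ("Fas", "Fasl", 0.99),
    ("Fasl", "ProteinX", 0.97),
    ("Casp3", "ProteinY", 0.60),  # below the 0.95 cut-off, so it is dropped
]

seeds = {"Fas", "Fadd", "Casp8", "Casp3", "Xiap"}
first_order = expand(seeds, toy_edges, 0.95)
second_order = expand(first_order, toy_edges, 0.95)
```

Applying the same function twice mirrors the procedure in the text: the output of the first query becomes the query set for the second.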

Graph theoretic analysis

The adjacency matrices for graph-theoretic analysis were created from the 1st and 2nd order interactomes. The Visual Connectome toolbox in MATLAB was used to compute the modularity score and the degree centrality of all the nodes40.
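As an illustration of how an interactome can be turned into an adjacency matrix, the sketch below builds a symmetric 0/1 matrix from an undirected edge list. The function name `adjacency_matrix` and the four-protein toy network are assumptions made for this example; the edges are illustrative and are not taken from the STRING output.

```python
def adjacency_matrix(nodes, edges):
    """Build a symmetric 0/1 adjacency matrix from an undirected edge list."""
    index = {name: i for i, name in enumerate(nodes)}
    adj = [[0] * len(nodes) for _ in nodes]
    for a, b in edges:
        if a == b:
            continue  # ignore self-interactions so the diagonal stays zero
        i, j = index[a], index[b]
        adj[i][j] = 1
        adj[j][i] = 1
    return adj


# Illustrative toy network (not STRING data).
toy_nodes = ["Fas", "Fadd", "Casp8", "Casp3"]
toy_edges = [("Fas", "Fadd"), ("Fadd", "Casp8"), ("Casp8", "Casp3")]
toy_adj = adjacency_matrix(toy_nodes, toy_edges)
```

Row and column order follow `toy_nodes`, so row 1 of `toy_adj` describes the interactions of Fadd.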

Degree Centrality

Degree centrality is the property that defines the connectivity of a particular node with other nodes of the same network. The more connections a node has with other nodes in the network, the higher its degree centrality. The node with the highest degree centrality is the one on which the maximum number of edges is incident. The degree centrality of a vertex v, for a given graph G = (V, E) with |V| vertices and |E| edges, is defined as

$$C_D(v) = \deg(v),$$

where deg(v) is the number of edges incident on v.
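A short sketch of the degree computation follows. The degree of a node is the sum of its row in the adjacency matrix. Table 2 reports Z scores, but the text does not spell out how the scores were standardised; the sketch standardises against all nodes of the network, which is an assumption and may differ from what the toolbox does. The function names and the star-shaped toy graph are likewise illustrative.

```python
from statistics import mean, pstdev


def degree_centrality(adj):
    """Degree of each node: the number of edges incident on it."""
    return [sum(row) for row in adj]


def z_scores(values):
    """Standardise values to zero mean and unit (population) variance."""
    mu = mean(values)
    sigma = pstdev(values)
    if sigma == 0:
        return [0.0] * len(values)  # all nodes equally connected
    return [(v - mu) / sigma for v in values]


# Toy star graph: node 0 is connected to nodes 1, 2 and 3.
star_adj = [
    [0, 1, 1, 1],
    [1, 0, 0, 0],
    [1, 0, 0, 0],
    [1, 0, 0, 0],
]
degrees = degree_centrality(star_adj)
z = z_scores(degrees)
```

In this toy graph the hub has degree 3 and the leaves have degree 1, so the hub receives the only positive Z score.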

Modularity

Modularity score is used to measure the community structure within a network. Modularity takes values in [−0.5, 1): zero and negative values correspond to a network whose edges are assigned at random, whereas positive values indicate increasingly communal structure. Consider a graph G = (V, E) partitioned into two communities, with a membership variable $s_v$ for each node v: $s_v = 1$ if v falls into community 1 and $s_v = -1$ otherwise. Let A denote the adjacency matrix, where $A_{vw} = 1$ means there is a connection between nodes v and w and $A_{vw} = 0$ means there is none. Modularity (Q) is then defined as the fraction of edges that fall within community 1 or 2, minus the expected fraction of such edges for a random graph with the same node degree distribution as the given graph. The expected number of edges is calculated using the concept of configuration models41. The configuration model is a randomized representation of a particular graph. Given a network with n nodes, where each node v has degree $k_v$, the configuration model cuts each edge into two halves; each half edge, called a stub, is rewired randomly with any other stub in the network, self-loops included. Hence, although the node degree distribution of the graph remains intact, the configuration model results in a completely random network. Let the total number of stubs be $l_n$:

$$l_n = \sum_{v} k_v = 2m$$

where m = |E| is the number of edges. For two nodes v and w with node degrees $k_v$ and $k_w$, respectively, the expected number of full edges between them is

$$\frac{k_v k_w}{l_n} = \frac{k_v k_w}{2m}$$

Modularity score is calculated as

$$Q = \frac{1}{2m} \sum_{v,w} \left[ A_{vw} - \frac{k_v k_w}{2m} \right] \frac{s_v s_w + 1}{2} \qquad (6)$$

The above equation is valid for a two-community structure and can be generalized to a c-community structure.
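The two-community modularity score can be evaluated directly from an adjacency matrix and a membership vector. The Python sketch below does so for a hypothetical toy network of two triangles joined by a single edge; the function names and the toy network are illustrative assumptions, and the code assumes an undirected network with at least one edge.

```python
def two_community_modularity(adjacency, s):
    """Two-community modularity of an undirected 0/1 network.

    Q = (1/2m) * sum_vw [A_vw - k_v*k_w/(2m)] * (s_v*s_w + 1)/2,
    where s_v is +1 for community 1 and -1 for community 2.
    """
    degrees = [sum(row) for row in adjacency]
    two_m = sum(degrees)  # total number of stubs, equal to 2m
    n = len(adjacency)
    total = 0.0
    for v in range(n):
        for w in range(n):
            expected = degrees[v] * degrees[w] / two_m
            same_community = (s[v] * s[w] + 1) / 2  # 1 if same, else 0
            total += (adjacency[v][w] - expected) * same_community
    return total / two_m


def toy_two_triangles():
    """Hypothetical network: triangles {0,1,2} and {3,4,5} joined by edge 2-3."""
    edges = [(0, 1), (0, 2), (1, 2), (3, 4), (3, 5), (4, 5), (2, 3)]
    adjacency = [[0] * 6 for _ in range(6)]
    for v, w in edges:
        adjacency[v][w] = 1
        adjacency[w][v] = 1
    return adjacency


A = toy_two_triangles()
q_natural = two_community_modularity(A, [1, 1, 1, -1, -1, -1])
q_mixed = two_community_modularity(A, [1, -1, 1, -1, 1, -1])
print(q_natural, q_mixed)
```

Splitting the toy network along its two triangles gives Q = 5/14 (about 0.357), whereas a partition that cuts across both triangles gives a negative score, in line with the interpretation of positive and non-positive values given above.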

Modularity Optimization

Equation (6) can be re-written as

$$Q = \frac{1}{4m} \sum_{v,w} B_{vw} s_v s_w = \frac{1}{4m} s^{T} B s \qquad (8)$$

where s is the column vector whose elements are $s_v$, and B is a symmetric matrix with elements

$$B_{vw} = A_{vw} - \frac{k_v k_w}{2m} \qquad (9)$$

B is also referred to as the modularity matrix42. Each of its rows and columns sums to 0, which is why the constant term of (6) drops out of (8), and it therefore always has an eigenvector (1, 1, 1, ...) with eigenvalue 0. The algorithm that we used initially divided the network into two communities, and in further iterations the community structure was subdivided. For a group g of size $n_g$, the contribution of a further division to modularity can be expressed as

$$\Delta Q = \frac{1}{2m} \left[ \frac{1}{2} \sum_{v,w \in g} B_{vw} (s_v s_w + 1) - \sum_{v,w \in g} B_{vw} \right]$$

which simplifies to

$$\Delta Q = \frac{1}{4m} \sum_{v,w \in g} \left[ B_{vw} - \delta_{vw} \sum_{u \in g} B_{vu} \right] s_v s_w$$

and can be expressed as

$$\Delta Q = \frac{1}{4m} s^{T} B^{(g)} s \qquad (11)$$

where δ stands for the Kronecker δ symbol and $B^{(g)}$ represents the $n_g \times n_g$ matrix, indexed by the vertices v, w in a particular group g, with elements

$$B^{(g)}_{vw} = B_{vw} - \delta_{vw} \sum_{u \in g} B_{vu} \qquad (12)$$

Equations (8) and (11) have the same form, and therefore the spectral approach42 was applied to the generalized modularity matrix to maximize ΔQ. For the complete network the row sums of B vanish, so $B^{(g)}$ reduces to the symmetric matrix B and (11) reduces to (8). Once ΔQ is no longer positive for a subgraph, that subgraph is indivisible: subdividing it further will not increase the modularity value Q. This condition was used to terminate community structure division.

The algorithm ran as follows. The modularity matrix (9) was constructed for both interactomes, and the most positive eigenvalue and the corresponding eigenvector were found in each case. The network was divided into two parts depending upon the signs of the elements of that eigenvector, and each part was then subdivided using the generalized modularity matrix (12). If ΔQ was 0 or negative at any stage of subdivision, the algorithm left the corresponding subgraph undivided. Hence, the algorithm ended once the optimal community structure had been estimated. To fine-tune this community structure optimization further, the Visual Connectome toolbox21 that we employed uses the Kernighan-Lin algorithm43.
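The first bisection step can be sketched in Python as follows. This is an illustrative approximation rather than the toolbox implementation: the leading eigenvector is obtained by shifted power iteration (a swapped-in technique chosen to keep the example dependency-free), recursive subdivision and Kernighan-Lin refinement are omitted, and the function names and the toy network of two triangles joined by one edge are hypothetical.

```python
import math


def modularity_matrix(adjacency):
    """B_vw = A_vw - k_v*k_w/(2m) for an undirected 0/1 network with edges."""
    degrees = [sum(row) for row in adjacency]
    two_m = sum(degrees)
    n = len(adjacency)
    return [[adjacency[v][w] - degrees[v] * degrees[w] / two_m
             for w in range(n)] for v in range(n)]


def leading_eigenvector(matrix, iterations=500):
    """Eigenvector of the most positive eigenvalue of a symmetric matrix.

    Power iteration is run on matrix + shift*I, where the shift (a bound on
    the eigenvalue magnitudes) makes every eigenvalue non-negative, so the
    iteration converges to the algebraically largest eigenvalue.
    """
    n = len(matrix)
    shift = max(sum(abs(x) for x in row) for row in matrix)
    vector = [float(i + 1) for i in range(n)]  # arbitrary fixed start vector
    for _ in range(iterations):
        product = [sum(matrix[i][j] * vector[j] for j in range(n))
                   + shift * vector[i] for i in range(n)]
        norm = math.sqrt(sum(x * x for x in product))
        vector = [x / norm for x in product]
    return vector


def spectral_bisection(adjacency):
    """Split by eigenvector signs; return (s, Q) with Q = s^T B s / (4m)."""
    B = modularity_matrix(adjacency)
    two_m = sum(sum(row) for row in adjacency)
    u = leading_eigenvector(B)
    s = [1 if x > 0 else -1 for x in u]
    n = len(s)
    q = sum(B[v][w] * s[v] * s[w]
            for v in range(n) for w in range(n)) / (2 * two_m)
    return s, q


# Hypothetical toy network: triangles {0,1,2} and {3,4,5} joined by edge 2-3.
edges = [(0, 1), (0, 2), (1, 2), (3, 4), (3, 5), (4, 5), (2, 3)]
A = [[0] * 6 for _ in range(6)]
for v, w in edges:
    A[v][w] = 1
    A[w][v] = 1

s, q = spectral_bisection(A)
print(s, q)
```

For this toy network the sign pattern separates the two triangles, and the resulting Q is positive, so the division would be accepted under the stopping rule described above. Which triangle receives the label +1 depends on the start vector and carries no meaning.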

Additional Information

How to cite this article: Ghosh, S. et al. Graph theoretic network analysis reveals protein pathways underlying cell death following neurotropic viral infection. Sci. Rep. 5, 14438; doi: 10.1038/srep14438 (2015).
  42 in total

1.  Identification of structural domains in proteins by a graph heuristic.

Authors:  L Wernisch; M Hunting; S J Wodak
Journal:  Proteins       Date:  1999-05-15

2.  Chandipura virus induces neuronal death through Fas-mediated extrinsic apoptotic pathway.

Authors:  Sourish Ghosh; Kallol Dutta; Anirban Basu
Journal:  J Virol       Date:  2013-09-11       Impact factor: 5.103

3.  Towards a proteome-scale map of the human protein-protein interaction network.

Authors:  Jean-François Rual; Kavitha Venkatesan; Tong Hao; Tomoko Hirozane-Kishikawa; Amélie Dricot; Ning Li; Gabriel F Berriz; Francis D Gibbons; Matija Dreze; Nono Ayivi-Guedehoussou; Niels Klitgord; Christophe Simon; Mike Boxem; Stuart Milstein; Jennifer Rosenberg; Debra S Goldberg; Lan V Zhang; Sharyl L Wong; Giovanni Franklin; Siming Li; Joanna S Albala; Janghoo Lim; Carlene Fraughton; Estelle Llamosas; Sebiha Cevik; Camille Bex; Philippe Lamesch; Robert S Sikorski; Jean Vandenhaute; Huda Y Zoghbi; Alex Smolyar; Stephanie Bosak; Reynaldo Sequerra; Lynn Doucette-Stamm; Michael E Cusick; David E Hill; Frederick P Roth; Marc Vidal
Journal:  Nature       Date:  2005-09-28       Impact factor: 49.962

4.  Weight-conserving characterization of complex functional brain networks.

Authors:  Mikail Rubinov; Olaf Sporns
Journal:  Neuroimage       Date:  2011-04-01       Impact factor: 6.556

5.  The role of nerve growth factor in caspase-dependent apoptosis in human BE(2)C neuroblastoma.

Authors:  Janette L Holub; Yi-Yong Qiu; Fei Chu; Mary Beth Madonna
Journal:  J Pediatr Surg       Date:  2011-06       Impact factor: 2.545

6.  Modular organization of the monkey presubiculum.

Authors:  S L Ding; K S Rockland
Journal:  Exp Brain Res       Date:  2001-08       Impact factor: 1.972

7.  A human-specific role of cell death-inducing DFFA (DNA fragmentation factor-alpha)-like effector A (CIDEA) in adipocyte lipolysis and obesity.

Authors:  Elisabet Arvidsson Nordström; Mikael Rydén; Emma C Backlund; Ingrid Dahlman; Maria Kaaman; Lennart Blomqvist; Barbara Cannon; Jan Nedergaard; Peter Arner
Journal:  Diabetes       Date:  2005-06       Impact factor: 9.461

8.  Gene network revealed involvements of Birc2, Birc3 and Tnfrsf1a in anti-apoptosis of injured peripheral nerves.

Authors:  Yongjun Wang; Xin Tang; Bin Yu; Yun Gu; Ying Yuan; Dengbing Yao; Fei Ding; Xiaosong Gu
Journal:  PLoS One       Date:  2012-09-17       Impact factor: 3.240

9.  From hub proteins to hub modules: the relationship between essentiality and centrality in the yeast interactome at different scales of organization.

Authors:  Jimin Song; Mona Singh
Journal:  PLoS Comput Biol       Date:  2013-02-21       Impact factor: 4.475

