Literature DB >> 29928007

A cytokine protein-protein interaction network for identifying key molecules in rheumatoid arthritis.

Venugopal Panga1,2, Srivatsan Raghunathan1.   

Abstract

Rheumatoid arthritis (RA) is a chronic inflammatory disease of the synovial joints. Though the current RA therapeutics such as disease-modifying antirheumatic drugs (DMARDs), nonsteroidal anti-inflammatory drugs (NSAIDs) and biologics can halt the progression of the disease, none of these would either dramatically reduce or cure RA. So, the identification of potential therapeutic targets and new therapies for RA are active areas of research. Several studies have discovered the involvement of cytokines in the pathogenesis of this disease. These cytokines induce signal transduction pathways in RA synovial fibroblasts (RASF). These pathways share many signal transducers and their interacting proteins, resulting in the formation of a signaling network. In order to understand the involvement of this network in RA pathogenesis, it is essential to identify the key transducers and their interacting proteins that are part of this network. In this study, based on a detailed literature survey, we have identified a list of 12 cytokines that induce signal transduction pathways in RASF. For these cytokines, we have built a signaling network using the protein-protein interaction (PPI) data that was obtained from public repositories such as HPRD, BioGRID, MINT, IntAct and STRING. By combining the network centrality measures with the gene expression data from the RA related microarrays that are available in the open source Gene Expression Omnibus (GEO) database, we have identified 24 key proteins of this signaling network. Two of these 24 are already drug targets for RA, and of the remaining, 12 have direct PPI links to some of the current drug targets of RA. Therefore, these key proteins seem to be crucial in the pathogenesis of RA and hence might be treated as potential drug targets.

Entities:  

Mesh:

Substances:

Year:  2018        PMID: 29928007      PMCID: PMC6013252          DOI: 10.1371/journal.pone.0199530

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

RA is a debilitating chronic inflammatory synovial joint disease that affects about 1% of the world’s population [1]. The disease usually affects the small joints of the hands and feet. The etiology of the disease is unknown. The chronic inflammation causes invasion of synovial membrane toward articular bone which results in the formation of a layer of granulation tissue, called pannus. Further, the inflammation would induce irreversible damage of the synovium, which leads to dysfunction of the joints [1-2]. The current RA therapeutics can suppress inflammation and the damage of cartilage and bone but cannot cure the disease. Further, hepatotoxicity, cardiotoxicity and gastrointestinal effects are the clinical side effects of some of these treatments [3-5]. The lack of a clear understanding of the pathogenesis of the disease remains an obstacle for discovering effective treatments for RA. Along with RASF, various immune cells, such as B cells, T cells, mast cells, macrophages, dendritic cells and natural killer (NK) cells are activated in RA [6]. Many of these cells produce cytokines, which are involved in the pathogenesis of the disease. Some of these cytokines are pro-inflammatory while others are anti-inflammatory. In RA, cytokines induce autoimmunity, chronic inflammation and eventual joint damage. Many cytokines such as tumor necrosis factor (TNF), interferon gamma (IFNγ), several interleukins—IL-1β, IL-4, IL-6, IL-7, IL-12, IL-13, IL-15, IL-18, IL-23 and transforming growth factor beta (TGFβ) are expressed in the synovial tissues [1]. Also, the cytokine levels in the synovial tissues are altered at various stages of the disease. For instance, in early RA the levels of IL-13 and IL-4 are elevated, whereas in the later stages they are present at low levels [7]. Some of these cytokines act in a synergistic fashion to augment inflammation. For instance, IL-17, IL-1β and TNF synergistically activate synovial fibroblasts leading to the production of inflammatory mediators [8]. Several of these cytokines induce signal transduction pathways in RASF, which lead to the activation of their respective transcription factors. The activated transcription factors induce the expression of the genes that encode inflammatory mediators such as C-X-C motif ligands—CXCL8, CXCL9, CXCL10, CXCL11, CXCL12 and C-C motif ligand 5 (CCL5)—and articular cartilage degrading enzymes such as matrix metalloproteinases MMP1 and MMP3 [9-33]. RA is a complex phenotype that can be caused by a combination of genetic and environmental stresses on complex biological networks [34-35]. The pro- and anti-inflammatory cytokines form one such complex network. The cytokines interact with one another via their signal transduction pathways. Many signal transducers are shared by these pathways, thus forming a network. However, which of these signal transducers can be targeted for the effective treatment of RA has not been fully established. Building the cytokine network and identifying the key molecules in it would be a major step toward the holistic understanding of the cytokine pathways in RA. The cytokine network can be built using protein-protein interaction (PPI) data that is available in the public repositories such as the human protein reference database (HPRD), the biological general repository for interaction datasets (BioGRID), the molecular interaction (MINT), IntAct, STRING and a tissue-specific human protein interaction dataset called CRG [36-41]. By combining network analysis methods and the RA gene expression profiling data that is present in the GEO database, we can come up with strategies for identifying certain key molecules that are part of the cytokine signaling network. In earlier studies, some efforts have been made to identify key molecules in complex diseases like tuberculosis and cancer using network analysis methods [42-43]. However, such studies are limited in RA. In order to fill this gap, we had earlier created a cytokine signaling network based on literature curation and evaluated the activity of the cytokine signaling pathways using gene expression profiling data [44]. Further, Hwang et al. have emphasized the use of existing gene expression data for pathway evaluation in RA [45]. In this study, using publicly available PPI data, we have built a signaling network associated with the cytokines and their transcription factors that are active in RASF. Using network centrality measures, we have identified some important molecules of this network. Further, the activity of these molecules was assessed using the RASF gene expression profiling data that is available in GEO. Finally, the proteins with higher centrality scores and with differential gene expression were considered the key molecules.

Methods

Creation of a human PPI database

We created a human PPI database by extracting the interactions from six publicly available databases, namely HPRD, BioGRID, IntAct, MINT, STRING and CRG [36-41]. From each database, only the experimentally determined physical interactions in human cells were considered. The experimental methods used for determining the protein interactions that are listed in each database are given in the S1 File. All these interactions were merged into a single database by converting the protein identifiers of individual databases into gene symbols. In HPRD and BioGRID the proteins are represented with their respective gene symbols. In other databases the different protein identifiers were converted into gene symbols before merging. For instance, in IntAct and MINT databases, the proteins are identified with their respective UniProtKB entries. They were converted into gene symbols by using a HUGO gene nomenclature committee (HGNC) custom downloaded file containing gene symbols and UniProtKB entries [46]. Similarly, in the STRING and CRG databases the proteins are represented with the Ensembl protein and Ensembl gene identifiers respectively, and were converted into gene symbols with the aid of the Ensembl Biomart project [47]. All these interactions have been merged based on their gene symbols to create a final human PPI database. The database is in the S2 File.

Creation of a synovial tissue-specific PPI database for the plasma membrane and cytoplasm

In this study, we have focused on creating a cytokine signaling network starting from the binding of the cytokines to their cell surface receptors and ending with the activation of the transcription factors in the cytoplasm. Therefore we aim to create a PPI interactome that is specific to the plasma membrane and the cytoplasm. For this, only the interactions of those proteins that get localized to the cytoplasm and the plasma membrane were extracted from the human PPI database using the subcellular localization data present in the mammalian protein subcellular localization database, LOCATE [48]. We also extracted the interactions of many cytokine receptors and transcription factors that are not listed in LOCATE. A list of the genes of this interactome, called the ‘G-list’, was prepared. Furthermore, to make this interactome specific to the synovial tissue, we computed the co-expression of the interacting partners of these interactions by analyzing the RA related microarray gene expression data obtained from GEO. All the microarray datasets chosen in this study are based on the Affymetrix platform. Further details about the microarray datasets and their analysis are provided at the end of the ‘Methods’ section. For every microarray dataset, we did the following; for a pair of genes in the G-list we computed the Pearson correlation coefficient of the gene expression values across all RA disease samples. The correlations were computed for all possible pairs in the G-list. This was repeated for all the datasets. Only those pairs that had experimentally determined PPI interactions and with a Pearson correlation coefficient > 0.7 in at least one microarray dataset were considered as co-expressed in the synovial tissues. Finally, a synovial tissue-specific PPI database was created by selecting the interactions of the co-expressed interacting proteins.

Creation of cytokine PPI network (CPPIN) in RASF

Using the created synovial tissue-specific PPI database, we have built a network with proteins as nodes and their interactions as edges. The open source ‘igraph’ package in ‘R’ and python scripts were used for this purpose. We named this network the ‘synovial PPI network’ (SPPIN). Based on a literature search in PubMed, we identified 12 active cytokines and 8 of their transcription factors in RASF. Details of these cytokines and their transcription factors are described in the Results section and the list of search terms used is in S3 File. Considering these cytokines and their transcription factors as sources and targets respectively, we extracted all the shortest paths from the sources to the targets from the SPPIN network using the Breadth-first search (BFS) algorithm. In addition to BFS, there are several other approaches for finding shortest paths in graphs. Some of them are Depth-first search (DFS), Dijkstra and A* [49]. DFS puts the visited vertices in a stack while BFS puts them in a queue. The BFS algorithm is generally used for finding shortest paths in unweighted PPI networks. In SPPIN, the sources and the targets are closer to one another, which is an ideal scenario for using the BFS algorithm. This algorithm is computationally faster for searching shortest paths in SPPIN. Each shortest path extracted from SPPIN contains the receptor of the source cytokine, the target transcription factor and the intermediate proteins that connect them. If a cytokine receptor is encoded by more than one gene, all the shortest paths between each of the cytokine receptor genes and their respective transcription factors were considered. Similarly, if a transcription factor is encoded by more than one gene, all the shortest paths between the cytokine receptor and each of these genes were considered. There were a total of 103 distinct intermediate proteins in the shortest paths. Each of these intermediates was treated as a focal node and all of the latter’s immediate neighborhood nodes were isolated. Finally, we formed the cytokine network by connecting the cytokines, their transcription factors, the intermediates that connect them and the neighborhood nodes of the intermediates [Fig 1]. We named this network the ‘cytokine PPI network’ (CPPIN).
Fig 1

Building cytokine PPI network (CPPIN) in RA synovial fibroblasts (RASF).

CPPIN was built using the publicly available PPI data. For building this network, 12 cytokines and eight of their target transcription factors that are active in RASF were considered. In the figure, the upper half shows the steps involved in the creation of CPPIN while the lower half is a representation of CPPIN. First, the network SPPIN was created by applying two filters to the collated interactions from the six PPI databases. The two filters are (i) plasma membrane and cytoplasm subcellular localization and (ii) synovial tissue co-expression using microarray data. For creating CPPIN from SPPIN, the shortest paths between cytokines and target transcription factors and the neighboring nodes of the shortest path intermediates were used. In the lower half of the figure, the rectangles represent cytokines, the Y-shaped symbols represent the cytokine receptors, the red colored vertical ovals represent the intermediate proteins that connect the cytokine receptors and the transcription factors, the green circles represent the neighborhood nodes of the intermediate proteins and the pentagons represent the target transcription factors of the cytokines.

Building cytokine PPI network (CPPIN) in RA synovial fibroblasts (RASF).

CPPIN was built using the publicly available PPI data. For building this network, 12 cytokines and eight of their target transcription factors that are active in RASF were considered. In the figure, the upper half shows the steps involved in the creation of CPPIN while the lower half is a representation of CPPIN. First, the network SPPIN was created by applying two filters to the collated interactions from the six PPI databases. The two filters are (i) plasma membrane and cytoplasm subcellular localization and (ii) synovial tissue co-expression using microarray data. For creating CPPIN from SPPIN, the shortest paths between cytokines and target transcription factors and the neighboring nodes of the shortest path intermediates were used. In the lower half of the figure, the rectangles represent cytokines, the Y-shaped symbols represent the cytokine receptors, the red colored vertical ovals represent the intermediate proteins that connect the cytokine receptors and the transcription factors, the green circles represent the neighborhood nodes of the intermediate proteins and the pentagons represent the target transcription factors of the cytokines.

Centrality measures

In order to identify the highly connected and central proteins in the network, we measured four important centralities of every node present in the network as described below.

Degree centrality

Degree centrality is the number of edges through which a node connects to other nodes within a network. Proteins with a high degree are connected to a large number of other proteins. In PPI networks, proteins with a higher degree are considered ‘essential proteins’ or ‘hubs’ as they are located at the center of the network [50]. Considering the pair-wise interactions of ‘n’ nodes, the degree centrality of a node pk is calculated using the following equation [51]. Where a(pi, pk) = 1 if and only if protein pi and protein pk are connected by an edge. Otherwise it is 0. ‘n’ is the total number of nodes present in the network.

Betweenness centrality

Betweenness centrality is the measure of the number of shortest paths that pass through a node within a network. Nodes with high betweenness, called ‘bottlenecks’, control the flow of information within a network [52]. The betweenness of a node ‘n’ takes the node pairs such as (n1, n2) and calculates all the shortest paths that go through ‘n’ to connect n1 and n2. The betweenness of a node ‘n’ is calculated using the following equation. Where, ‘gn1, n, n2’ is the number of shortest paths that pass through node ‘n’ and ‘gn1, n2’ is the total number of shortest paths.

Closeness centrality

Closeness centrality is a measure of the average distance of all the shortest paths between a node and every other node within a network. It gives how far a certain node is from all other nodes [51]. This is calculated using the following equation. Where, gni represents the shortest paths between the node ‘n’ and every other node i. Closeness centralities are measured for all the nodes present within the CPPIN network.

Eigenvector centrality

Eigenvector centrality measures the influence of a node within a network. The node with a high eigenvector is considered the central and influential node as it is connected to many other central nodes [53]. Eigenvector centrality scores are the elements of the first eigenvector of the adjacency matrix of a network.

Microarray data analysis

In this study, we have considered five Affymetrix RA related microarray datasets that are available in the NCBI GEO database. They are GSE7307, GSE55457, GSE55235, GSE12021 (HGU133A) and GSE12021 (HGU133B) (Table 1). These microarray experiments were carried out on RA and normal synovial fibroblasts by other workers. The RA samples used in these studies were obtained by tissue excision upon joint replacement/synovectomy surgery from RA patients whereas the control samples were obtained from either postmortem joints or traumatic joint injury cases. We re-analyzed these datasets using the R/Bioconductor statistical package. All the datasets were normalized using two algorithms, MAS5 and RMA, separately. The differential expression of the genes between RA and control groups was computed using the two sample independent t-test.
Table 1

Details of microarray datasets used in this study.

S.No.GEO AccessionPubMed IDPlatformProbe NumberNumber of Samples
RAControl
1GSE7307-Affymetrix Human Genome U133 Plus 2.0 Array5467559
2GSE1202118721452Affymetrix Human Genome U133A Array22283129
3GSE1202118721452Affymetrix Human Genome U133B Array22645124
4GSE5545724690414Affymetrix Human Genome U133A Array222831310
5GSE5523524690414Affymetrix Human Genome U133A Array222831010
For the co-expression analysis described above, we considered the RMA normalized expression values from the disease samples of each microarray dataset. A gene is said to be differentially expressed in a dataset if it satisfies the following criteria: (i) Gene should have a P-value < 0.05 and a fold-change > 1.5 for up or down regulation. (ii) A gene is said to be up-regulated if it shows up-regulation by both the normalization methods or up-regulation in one and below the fold-change threshold in the other. (iii) Similarly, a gene is said to be down-regulated if it shows down-regulation by both the normalization methods or down-regulation in one and below the fold-change threshold in the other. The same criteria were applied across the datasets to decide whether a gene is up/down regulated in each one of them.

Results

Construction of the CPPIN network

As explained in the ‘Methods’, we combined interactions from all the six resources to obtain a complete dataset. By doing this, we obtained 77218 interactions from CRG, 39042 from HPRD, 298802 from BioGRID, 89355 from STRING and 45896 from both the IntAct and MINT databases. Overall, we considered 363,476 non-redundant interactions from the six resources (S2 File). Interactions from all these six resources seem to be comprehensive. Then, two filters were applied to this data: (i) plasma membrane and cytoplasm subcellular localization and (ii) synovial tissue co-expression. This resulted in a synovial tissue-specific interactome with 7939 interactions. Using this, we built the synovial tissue-specific PPI network, SPPIN. In this network, the proteins and their interactions are represented as nodes and edges respectively. Based on a literature survey, we have identified 12 cytokines and eight of their target transcription factors that are active in RASF (Table 2). These cytokines stimulate the RASF and activate their respective transcription factors. We considered these cytokines as the sources and their transcription factors as the targets in SPPIN. We then extracted a total of 139 shortest paths that pass through the cytokine receptors between these sources and targets. The number of intermediates on all the shortest paths was 103. The number of times each of these intermediate proteins occurred was also determined (S4 File).
Table 2

Cytokines, their receptors, their transcription factors and shortest paths.

All the combinations of the shortest paths from each of the cytokine receptors to their target transcription factor are considered.

S.No.CytokineCytokine Receptor(s)Transcription FactorSubunits/forms of the transcription factorNo. of shortest pathsReference(s)
1TNFTNFRSF1A, TNFRSF1BNF-κBNFKB1, NFKB2, RELA, RELB, REL10[912, 5466]
2IL-18IL18R1, IL18RAPNF-κBNFKB1, NFKB2, RELA, RELB, REL10[13]
3IL-17IL17RA, IL17RCNF-κBNFKB1, NFKB2, RELA, RELB, REL10[11, 1415, 6768]
4IL-27IL27RA, IL6STSTAT1STAT12[16]
5IL-1α/βIL1R1NF-κBNFKB1, NFKB2, RELA, RELB, REL5[1112, 1718, 58, 62, 6975]
6TNFTNFRSF1A, TNFRSF1BAP-1JUN, JUNB, FOS, FOSB, FOSL1, FOSL212[11, 16, 17, 54]
7IL-1βIL1R1AP-1JUN, JUNB, FOS, FOSB, FOSL1, FOSL26[11, 16, 1820, 7172, 74]
8IL-17IL17RA, IL17RCAP-1JUN, JUNB, FOS, FOSB, FOSL1, FOSL212[11, 15]
9TNFTNFRSF1A, TNFRSF1BIRF3IRF32[19]
10LIGHT (TNFSF14)TNFRSF14NF-κBNFKB1, NFKB2, RELA, RELB, REL5[2122]
11IL-33IL1RL1NF-κBNFKB1, NFKB2, RELA, RELB, REL5[23]
12IL-17IL17RA, IL17RCSTAT3STAT32[24, 76]
13IL-1βIL1R1STAT1STAT11[16, 26]
14IFNα/βIFNAR1, IFNAR2STAT1STAT12[10, 2627]
15IFNγIFNGR1, IFNGR2STAT1STAT12[10, 26]
16TNFTNFRSF1A, TNFRSF1BSTAT1STAT12[16, 26]
17IL-1α/βIL1R1IRF3IRF31[28]
18IL-1α/βIL1R1IRF7IRF71[28]
19IFNα/βIFNAR1, IFNAR2IRF3IRF32[28]
20IFNα/βIFNAR1, IFNAR2IRF7IRF72[28]
21TNFTNFRSF1A, TNFRSF1BIRF1IRF12[12]
22IL-1βIL1R1IRF1IRF11[12]
23TGF-β1TGFBR1, TGFBR2NF-κBNFKB1, NFKB2, RELA, RELB, REL10[2930]
24TGF-β1TGFBR1, TGFBR2AP-1JUN, JUNB, FOS, FOSB, FOSL1, FOSL212[29]
25TGF-β1TGFBR1, TGFBR2SMADSMAD2, SMAD34[31, 77]
26IL-21IL2RGSTAT3STAT31[32, 78]
27IL6IL6RSTAT3STAT31[79]
28TNFTNFRSF1A, TNFRSF1BSTAT3STAT32[54]
29IL-27IL27RA, IL6STAP-1JUN, JUNB, FOS, FOSB, FOSL1, FOSL212[16]
Total = 139

TNFRSF1A and TNFRSF1B, tumor necrosis factor receptor superfamily member 1A and 1B; IL18R1, interleukin-18 receptor 1; IL18RAP, interleukin-18 receptor accessory protein; IL17RA, interleukin-17 receptor A; IL17RC, interleukin-17 receptor C; IL27RA, interleukin-27 receptor alpha; IL6ST, interleukin 6 signal transducer (glycoprotein 130); IL1R1, interleukin 1 receptor, type 1; TNFRSF14, tumor necrosis factor receptor superfamily member 14; IL1RL1, interleukin-1 receptor-like 1; IFNAR1, interferon-alpha receptor alpha chain; IFNAR2, interferon-alpha receptor beta chain; IFNGR1, interferon gamma receptor 1; IFNGR2, interferon gamma receptor 2; TGFBR1, TGF beta receptor 1; TGFBR2, TGF beta receptor 2; IL2RG, interleukin-2 receptor subunit gamma; IL6R, inteleukin-6 receptor; NF-κB, nuclear factor kappa-light-chain-enhancer of activated B cells; STAT1, signal transducer and activator of transcription 1; AP-1, activator protein 1; IRF3, interferon regulatory factor 3; STAT3, signal transducer and activator of transcription 3; IRF7, interferon regulatory factor 7; IRF1, interferon regulatory factor 1

Cytokines, their receptors, their transcription factors and shortest paths.

All the combinations of the shortest paths from each of the cytokine receptors to their target transcription factor are considered. TNFRSF1A and TNFRSF1B, tumor necrosis factor receptor superfamily member 1A and 1B; IL18R1, interleukin-18 receptor 1; IL18RAP, interleukin-18 receptor accessory protein; IL17RA, interleukin-17 receptor A; IL17RC, interleukin-17 receptor C; IL27RA, interleukin-27 receptor alpha; IL6ST, interleukin 6 signal transducer (glycoprotein 130); IL1R1, interleukin 1 receptor, type 1; TNFRSF14, tumor necrosis factor receptor superfamily member 14; IL1RL1, interleukin-1 receptor-like 1; IFNAR1, interferon-alpha receptor alpha chain; IFNAR2, interferon-alpha receptor beta chain; IFNGR1, interferon gamma receptor 1; IFNGR2, interferon gamma receptor 2; TGFBR1, TGF beta receptor 1; TGFBR2, TGF beta receptor 2; IL2RG, interleukin-2 receptor subunit gamma; IL6R, inteleukin-6 receptor; NF-κB, nuclear factor kappa-light-chain-enhancer of activated B cells; STAT1, signal transducer and activator of transcription 1; AP-1, activator protein 1; IRF3, interferon regulatory factor 3; STAT3, signal transducer and activator of transcription 3; IRF7, interferon regulatory factor 7; IRF1, interferon regulatory factor 1 We have also extracted the neighborhood nodes of the intermediates from the SPPIN network. The intermediates are considered seed nodes. All the nodes to which a seed node has a direct connection (path length 1) are extracted. Finally, we formed CPPIN by connecting the cytokines, their transcription factors, the 103 intermediates and the neighborhood nodes of the intermediates. This gave a network comprising 1204 nodes and 2155 edges. The edge list of this network is provided in S5 File.

Central nodes of the CPPIN network and their activity in the RA synovium

We have measured the four centralities of all the nodes present in the network and plotted their histograms (Fig 2). With the histograms as reference, we have extracted approximately 20% of the nodes with the highest centralities in each category. The degree distributions of scale free networks, such as many real networks and human PPI, have a power-law tail. As a consequence of this, a few highly connected nodes exist in the whole network. Researchers generally refer to the 20% of nodes with the highest degree in a network as the hubs [50]. The degree and eigenvector histograms have a power-law tail (Fig 2). We extended top 20% nodes to the betweenness and closeness centrality categories in order to increase the number of central nodes. This resulted in ~30% (354) of the total nodes in CPPIN as the central nodes. Then we proceeded to determine how many of these nodes are differentially expressed in the RA synovium. The nodes that are selected in (a) at least three of the four centrality measures and differentially expressed in three of the five microarray datasets or (b) that are selected in at least two centrality measures and differentially expressed in at least four microarray datasets were considered the key molecules. This resulted in a total of 24 molecules (Table 3). A concise interaction map that shows how these molecules are related to 12 cytokines and eight transcription factors, and among them, is shown in Fig 3. This map can also be visualized in cytoscape using the S6 File.
Fig 2

The histograms of the four centrality measures, degree, betweenness, closeness and eigenvector.

The distributions of the four centrality measures are plotted as histograms. Taking the histogram as a reference, approximately 20% of the proteins with high centrality scores (toward the right side of the histogram) are extracted for differential expression analysis.

Table 3

Centrality and differential expression of the CPPIN network proteins.

The proteins that are selected in at least (a) three centrality measures and three synovial microarray datasets or (b) two centrality measures and four synovial microarray datasets are considered as the key molecules. This resulted in 24 key molecules.

S.No.Key moleculeDegreeBetweennessClosenessEigenvectorNumber of microarray datasets in which the gene is differentially expressed
12345
1SYK
2PTPN6
3LCK
4PTPRC
5INPP5D
6PRKCB
7CD3E
8CSF2RB
9IL2RG
10EGFR
11JUN
12MYC
13FOS
14AR
15CCR5
16IFNAR2
17LRP1
18MX1
19LAP3
20CXCR4
21EGR1
22FOSB
23TNFRSF14
24CD48

Blue color indicates the presence of the protein in the top 20% centrality list. Green color indicates the up-regulation and red color indicates the down-regulation.

Fig 3

A concise interaction map for the 12 cytokines, eight transcription factors and 24 key molecules.

This map shows how the cytokines and transcription factors considered in this study interact with 24 key molecules. The key molecules, JUN, FOS and FOSB are represented with AP-1. Another key molecule, IL2RG is represented with IL-21R.

The histograms of the four centrality measures, degree, betweenness, closeness and eigenvector.

The distributions of the four centrality measures are plotted as histograms. Taking the histogram as a reference, approximately 20% of the proteins with high centrality scores (toward the right side of the histogram) are extracted for differential expression analysis.

A concise interaction map for the 12 cytokines, eight transcription factors and 24 key molecules.

This map shows how the cytokines and transcription factors considered in this study interact with 24 key molecules. The key molecules, JUN, FOS and FOSB are represented with AP-1. Another key molecule, IL2RG is represented with IL-21R.

Centrality and differential expression of the CPPIN network proteins.

The proteins that are selected in at least (a) three centrality measures and three synovial microarray datasets or (b) two centrality measures and four synovial microarray datasets are considered as the key molecules. This resulted in 24 key molecules. Blue color indicates the presence of the protein in the top 20% centrality list. Green color indicates the up-regulation and red color indicates the down-regulation.

Directionality of differential expression for cytokine-transcription factor shortest path molecules

Since all the 12 cytokines are known to induce signal transduction pathways in RASF, all the shortest paths—between a given cytokine and transcription factor pair—seem to be important. However, some of these paths have the molecules with the same directionality (up- or down-regulated) of differential expression between RA and normal samples in synovial microarray datasets (Figs 4–6). Some cytokine and transcription factor pair shortest paths have higher number of up-regulated molecules over down-regulated molecules (Figs 7 and 8). Some other paths have higher number of down-regulated molecules over up-regulated molecules (Figs 8–10). From this we can conclude that all these pathways are getting affected in RA. Some are active because they have a high proportion of up-regulated molecules; some others are inactive because they have a high proportion of down-regulated molecules; few of them are dysregulated because they have both up- and down-regulated molecules.
Fig 4

Exclusively up-regulated shortest path molecules for TNF and IFNα/β related pathways.

The figure shows the completely up-regulated shortest path molecules in TNF and NF-κB, IRF3, IRF1 and STAT1 pairs, and IFNα/β and STAT1 pair.

Fig 6

Exclusively down-regulated shortest path molecules for IL-1β and TGF-β related pathways.

The figure shows the completely down-regulated shortest path molecules in IL-1β and TGF-β cytokines and AP-1, SMAD and STAT1 transcription factor pairs.

Fig 7

Mostly up-regulated shortest path molecules in IL-18, IL-17, TNFSF14 and IL-27 related pathways.

The figure shows the mostly up-regulated shortest path molecules in IL-18, IL-17 and TNFSF14 cytokines and NF-κB transcription factor pairs, and IL-27 and STAT1 pair. The blue and red bars represent the up- and down-regulated genes respectively.

Fig 8

Up- and down-regulated shortest path molecules in IFNγ, IL-17 and TGF-β related pathways.

The figure shows the mostly up-regulated shortest path molecules in IFNγ and STAT1, IL-17 and STAT3 pairs. It also shows the mostly down-regulated molecules in the shortest path between TGF-β and NF-κB pair. The blue and red bars represent the up- and down-regulated genes respectively.

Fig 10

Mostly down-regulated shortest path molecules in IL-1α/β and IL-33 related pathways.

The figure shows mostly down-regulated shortest path molecules in IL-1α/β cytokine and IRF7, IRF3 and IRF1 transcription factor pairs, and IL-33 and NF-κB pair. The blue and red bars represent the up- and down-regulated genes respectively.

Exclusively up-regulated shortest path molecules for TNF and IFNα/β related pathways.

The figure shows the completely up-regulated shortest path molecules in TNF and NF-κB, IRF3, IRF1 and STAT1 pairs, and IFNα/β and STAT1 pair.

Exclusively up-regulated shortest path molecules for TNF and IFNα/β, IL-21 and IL6 related pathways.

The figure shows the completely up-regulated shortest path molecules in TNF, IL-21 and IL6 cytokines and STAT3 transcription factor pairs, and IFNα/β cytokine and IRF3 and IRF7 pairs.

Exclusively down-regulated shortest path molecules for IL-1β and TGF-β related pathways.

The figure shows the completely down-regulated shortest path molecules in IL-1β and TGF-β cytokines and AP-1, SMAD and STAT1 transcription factor pairs.

Mostly up-regulated shortest path molecules in IL-18, IL-17, TNFSF14 and IL-27 related pathways.

The figure shows the mostly up-regulated shortest path molecules in IL-18, IL-17 and TNFSF14 cytokines and NF-κB transcription factor pairs, and IL-27 and STAT1 pair. The blue and red bars represent the up- and down-regulated genes respectively.

Up- and down-regulated shortest path molecules in IFNγ, IL-17 and TGF-β related pathways.

The figure shows the mostly up-regulated shortest path molecules in IFNγ and STAT1, IL-17 and STAT3 pairs. It also shows the mostly down-regulated molecules in the shortest path between TGF-β and NF-κB pair. The blue and red bars represent the up- and down-regulated genes respectively.

Mostly down-regulated shortest path molecules in IL-17, TNF and IL-1α/β related pathways.

The figure shows the mostly down-regulated shortest path molecules in IL-17 and TNF cytokines and AP-1 transcription factor pairs, and IL-1α/β and NF-κB pair. The blue and red bars represent the up- and down-regulated genes respectively.

Mostly down-regulated shortest path molecules in IL-1α/β and IL-33 related pathways.

The figure shows mostly down-regulated shortest path molecules in IL-1α/β cytokine and IRF7, IRF3 and IRF1 transcription factor pairs, and IL-33 and NF-κB pair. The blue and red bars represent the up- and down-regulated genes respectively.

Effects of medical therapy initiation on gene expression

Except for GSE7307 and GSE55235, each RA patient belonging to other microarray studies considered in this study has undergone different combinations of medical therapies. The details of therapies are in Table 4. The therapies initiated on the patients are non-steroidal anti-rheumatic drug (NSARD), Azulfidine (AZ), Prednisolone (PS), Methotrexate (MTX), Cox-2 inhibitor (CX), Quensyl (QS), non-steroidal anti-inflammatory drug (NSAID) and Tilidin (T). Some patients within a dataset were treated with the same combination of therapies while others were treated with different combinations. To find out how these therapies could have affected the gene expression of 24 key molecules, the samples in each dataset were hierarchically clustered based on the expression values. In GSE7307, GSE55235 and except for one RA sample in GSE12021 (HGU133B), all RA and control samples were clustered into separate groups (Figs 11–13). In GSE12021 (HGU133A) and GSE55457, some RA samples were clustered into a separate group while others were clustered with controls (Figs 14 and 15), showing that there is a drug effect.
Table 4

Medical therapies initiated on RA patients.

DatasetPatientsMedical Therapies
GSE7307RA1Not treated
 RA2Not treated
 RA3Not treated
 RA4Not treated
 RA5Not treated
GSE12021 (HGU133A)RA1NSARD + Azulfidine + Prednisolone
 RA2NSARD + MTX + Prednisolone
 RA3NSARD + MTX+ Prednisolone
 RA4NSARD + Azulfidine + Prednisolone + MTX
 RA5NSARD + MTX + Prednisolone
 RA6NSARD + Azulfidine + Prednisolone
 RA7MTX + Prednisolone
 RA8NSARD
 RA9NSARD + Prednisolone
 RA10NSARD + Prednisolone
 RA11COX-2 inhibitor + Prednisolone + Quensyl
 RA12NSAID + Tilidin + Prednisolone
GSE12021 (HGU133B)RA1NSARD + Azulfidine + Prednisolone
 RA2NSARD + MTX + Prednisolone
 RA3NSARD + MTX+ Prednisolone
 RA4NSARD + Azulfidine + Prednisolone + MTX
 RA5NSARD + MTX + Prednisolone
 RA6NSARD + Azulfidine + Prednisolone
 RA7MTX + Prednisolone
 RA8NSARD
 RA9NSARD + Prednisolone
 RA10NSARD + Prednisolone
 RA11COX-2 inhibitor + Prednisolone + Quensyl
 RA12NSAID + Tilidin + Prednisolone
GSE55457RA1NSARD + Azulfidine + Prednisolone
 RA2NSARD + MTX + Prednisolone
 RA3NSARD + MTX+ Prednisolone
 RA4NSARD + Azulfidine + Prednisolone + MTX
 RA5NSARD + MTX + Prednisolone
 RA6NSARD + Azulfidine + Prednisolone
 RA7MTX + Prednisolone
 RA8NSARD
 RA9NSARD + Prednisolone
 RA10no therapy used
 RA11NSARD + Prednisolone
 RA12COX-2 inhibitor + Prednisolone + Quensyl
 RA13NSAID + Tilidin + Prednisolone
GSE55235RA1Therapies not mentioned
 RA2Therapies not mentioned
 RA3Therapies not mentioned
 RA4Therapies not mentioned
 RA5Therapies not mentioned
 RA6Therapies not mentioned
 RA7Therapies not mentioned
 RA8Therapies not mentioned
 RA9Therapies not mentioned
 RA10Therapies not mentioned
Fig 11

Hierarchical clustering of RA and control samples based on the gene expression of selected key molecules in GSE7307.

The RA patients belonging to this dataset were not treated with drugs. The RA and control samples were clustered into separate groups.

Fig 13

Hierarchical clustering of RA and control samples based on the gene expression of selected key molecules in GSE12021 (HGU133B).

The RA (except one) and control samples were clustered into separate groups. The figure also shows the combinations of drugs used for treating RA patients.

Fig 14

Hierarchical clustering of RA and control samples based on the gene expression of selected key molecules in GSE12021 (HGU133A).

Some RA samples were clustered into a separate group while others were clustered with control samples. The figure also shows the combinations of drugs used for treating RA patients.

Fig 15

Hierarchical clustering of RA and control samples based on the gene expression of selected key molecules in GSE55457.

Some RA samples were clustered into a separate group while others were clustered with control samples. The figure also shows the combinations of drugs used for treating RA patients.

Hierarchical clustering of RA and control samples based on the gene expression of selected key molecules in GSE7307.

The RA patients belonging to this dataset were not treated with drugs. The RA and control samples were clustered into separate groups.

Hierarchical clustering of RA and control samples based on the gene expression of selected key molecules in GSE55235.

The RA and control samples were clustered into separate groups.

Hierarchical clustering of RA and control samples based on the gene expression of selected key molecules in GSE12021 (HGU133B).

The RA (except one) and control samples were clustered into separate groups. The figure also shows the combinations of drugs used for treating RA patients.

Hierarchical clustering of RA and control samples based on the gene expression of selected key molecules in GSE12021 (HGU133A).

Some RA samples were clustered into a separate group while others were clustered with control samples. The figure also shows the combinations of drugs used for treating RA patients.

Hierarchical clustering of RA and control samples based on the gene expression of selected key molecules in GSE55457.

Some RA samples were clustered into a separate group while others were clustered with control samples. The figure also shows the combinations of drugs used for treating RA patients. In order to find the effect of medical therapies on the differential expression of genes, we removed the RA samples that were clustered with healthy controls from GSE12021 (HGU133A), GSE12021 (HGU133B) and GSE55457 datasets and repeated the differential expression analysis for the 354 genes which encode central proteins of CPPIN. With the same selection criteria of differential expression and centrality measures, all the 24 genes of key molecules were retained, and in addition, 10 other genes that encode central proteins were also selected (S1 Table). S1–S3 Figs show the heat maps of the expression levels of these 24 genes in these three data sets after eliminating the RA samples that clustered with healthy controls. We notice the complete separation of controls from RA samples in the clusters. We conclude that the 24 genes that were differentially expressed in both the cases were not affected by the therapy initiation while the 10 other genes that were selected in the second case might have been affected.

Direct PPI links of the CPPIN key molecules with current RA drug targets

In order to determine if there are any recognized RA drug targets in the CPPIN key molecules, we have downloaded a dataset from the Therapeutic Target Database (TTD) which contains information on drug target genes that are at various stages of the drug discovery process [80]. The dataset consists of successful drug targets as well as the ones that have been studied in research projects and clinical trials for several diseases. We extracted all the drug targets of RA from this dataset and combined them with the successful RA drug targets listed in Okada et al. [81]. The resulting ensemble of 48 RA drug targets is listed in Table 5. Two of the key CPPIN molecules, spleen tyrosine kinase (SYK) and c-Jun (JUN) are already established drug targets for RA. Among the remaining key CPPIN molecules, 12 have direct PPI links to some of the current RA drug targets (Table 6). So these 24 molecules can be considered potential drug targets for RA. The overall strategy that was used to come up with the key molecules is explained in Fig 16.
Table 5

An ensemble of current RA drug targets.

These are the current RA drug targets that are obtained from two sources namely TTD and a scientific article, Okada et al. [81].

Drug targetStatus of the targetSource
IKBKBClinical trialTTD, Okada et al.
CHUKClinical trialTTD
CFLARResearch projectTTD
JUNClinical trialTTD
ITGB1Clinical trialTTD
IL6RClinical trialTTD, Okada et al.
FGF2Clinical trialTTD
CCL2Clinical trialTTD
ITGA4successfulTTD
OSMResearch projectTTD
MIFClinical trialTTD
IL1R1successfulTTD, Okada et al.
LIFResearch projectTTD
MMP8successfulTTD
IL13Clinical trialTTD
PTGS2successfulTTD, Okada et al.
IL6STClinical trialTTD
IL15Clinical trialTTD
CTSKsuccessfulTTD
SYKClinical trialTTD
JAK3successfulTTD, Okada et al.
MAPK12Clinical trialTTD
F2RL1Research projectTTD
IL2Clinical trialTTD
DHODHsuccessfulTTD, Okada et al.
IKBKEClinical trialTTD
MYD88Research projectTTD
TLR9Clinical trialTTD, Okada et al.
TNFsuccessfulOkada et al.
CD80successfulOkada et al.
CD86successfulOkada et al.
MS4A1successfulOkada et al.
PRDX5successfulOkada et al.
HPRT1successfulOkada et al.
CAMLGsuccessfulOkada et al.
PPP3R2successfulOkada et al.
ELANEsuccessfulOkada et al.
DHFRsuccessfulOkada et al.
ALOX5successfulOkada et al.
PTGS1successfulOkada et al.
PPARGsuccessfulOkada et al.
FKBP1AsuccessfulOkada et al.
MTORsuccessfulOkada et al.
JAK1successfulOkada et al.
JAK2successfulOkada et al.
NR3C1successfulOkada et al.
NR3C2successfulOkada et al.
TLR7successfulOkada et al.

TTD, therapeutic target database.

Table 6

Direct PPI links of the key molecules with current RA drug targets.

S.No.Key moleculeDirect PPI interaction with the RA drug targets
1PTPN6JAK1, JAK2, SYK
2LCKMS4A1, SYK
3PTPRCJAK2
4INPP5DJAK1, SYK
5CD3ESYK
6CSF2RBJAK2
7IL2RGIL2
8MYCCHUK, JUN
9FOSJUN
10CCR5JAK1, JAK2
11EGR1JUN
12FOSBJUN
Fig 16

The overall strategy used for identifying the key molecules.

From the CPPIN network, approximately 20% of the proteins with high scores in each centrality category were extracted. This resulted in 354 proteins. The differential expression of the genes which encode these proteins was computed in five synovial microarray datasets. Finally, the genes which are selected in at least three centrality measures and three microarray datasets or two centrality measures and four microarray datasets were considered the key molecules. This resulted in 24 key molecules. Two of these, SYK and JUN, are already current RA drug targets. Among the remaining, 12 have direct PPI links to some of the current RA drug targets.

The overall strategy used for identifying the key molecules.

From the CPPIN network, approximately 20% of the proteins with high scores in each centrality category were extracted. This resulted in 354 proteins. The differential expression of the genes which encode these proteins was computed in five synovial microarray datasets. Finally, the genes which are selected in at least three centrality measures and three microarray datasets or two centrality measures and four microarray datasets were considered the key molecules. This resulted in 24 key molecules. Two of these, SYK and JUN, are already current RA drug targets. Among the remaining, 12 have direct PPI links to some of the current RA drug targets.

An ensemble of current RA drug targets.

These are the current RA drug targets that are obtained from two sources namely TTD and a scientific article, Okada et al. [81]. TTD, therapeutic target database.

Significance of overlap of key molecules with current RA drug targets

A random sample of 24 proteins was drawn from the CPPIN network and estimated their overlap with the current RA drug targets. After removing the overlaps from the 24, for the remaining proteins we determined the direct PPI links to the drug targets. We got the following results based on one million random samples: The probability of getting 2 or more drug targets out of 24 randomly selected proteins is 0.0478, giving a statistical significance of less than 5%. After removing overlaps, we counted the number of proteins among the remaining ones that are directly connected to drug targets in CPPIN. The probability of getting 12 or more proteins with direct connections to drug targets is found to be 2.6×10−5. Thus, the selection of 24 proteins by our analysis with two drug targets and at least 12 direct PPI links to drug targets has statistical significance.

Discussion

In this study, we have built a cytokine signaling network (CPPIN) in RASF using publicly available PPI data. The CPPIN network contains 12 cytokine pathways that are active in RASF. The cytokine receptors, their transcription factors, intermediate signal transducers that connect them and the direct interacting proteins of the intermediates are part of this network. The cytokines include TNF, interleukins—IL-1α/β, IL-6, IL-17, IL-18, IL-21, IL-27 and IL-33, tumor necrosis factor superfamily member 14 (TNFSF14), three interferons (IFNα/β and IFNγ) and transforming growth factor-beta1 (TGF-β1). The transcription factors include nuclear factor kappa-light-chain-enhancer of activated B cells (NF-κB), signal transducer and activator of transcription 1 (STAT1), STAT3, activator protein 1 (AP-1), interferon regulatory factor 1 (IRF1), IRF3, IRF7 and SMAD. Even though the number of cytokines considered in this study is low, they are reliable in the sense that they stimulate RASF and induce signal transduction pathways leading to the activation of their respective transcription factors. For building this comprehensive network, we have considered the PPI interactions only if the interacting participants of each interaction are co-expressed in the synovial tissues. To find the central proteins of the CPPIN network, four centrality measures, degree, closeness, betweenness and eigenvector have been measured for all the proteins of the CPPIN network. In each centrality category, approximately 20% of the proteins with high centrality scores were pulled out and the lists were merged for further analysis. To identify the differential expression of the genes that encode the proteins with high centrality values we analyzed five microarray datasets related to RASF in the GEO database. We have used the following methodology to come up with the key molecules. A protein is considered a key molecule only if it is selected in at least three centrality measures and three microarray datasets or two centrality measures and four microarray datasets. This gave 24 key molecules. Two of these 24, namely JUN and SYK, are already drug targets for RA. Among the remaining, 12 have direct PPI links to current RA drug targets. One of the key molecules in the list is epidermal growth factor receptor (EGFR). EGFR is down-regulated in at least five RA microarray datasets. It is also highly connected in the CPPIN network as it has high degree, betweenness and closeness centrality measures. Swanson et al. observed that inhibition of EGFR by erlotinib reduced pannus formation, synovitis, vascularisation, and cartilage and bone erosion in type-II collagen-induced arthritis (CIA) mouse models [82-83]. Further, an earlier topological analysis of a PPI network in RA has reported that EGFR is highly relevant to RA (Tieri et al.) [84]. Another key molecule, tyrosine-protein phosphatase non-receptor type 6 (PTPN6) is up-regulated in three microarray datasets and is selected in all the four centrality measures. It also interacts with some of the current RA drug targets, janus kinase 1 and 2 (JAK1 and JAK2) and SYK. Additionally, it is found to be enriched in the synovial fluid of RA patients [85]. Lymphocyte-specific protein tyrosine kinase (LCK) is up-regulated in four microarray datasets. It is also selected in all the four centrality measures. Further, Swanson et al. have affirmed that tyrosine kinases such as LCK are the predominant players in the cell signaling pathways that enhance inflammation and the formation of pannus in RA. They emphasized that LCK can be considered a drug target for RA [86]. Another key molecule known as colony stimulating factor 2 receptor beta (CSF2RB) is up-regulated in four microarray datasets and is selected in three centrality measures. It interacts with an RA drug target JAK2. Additionally, Fujikado et al. have observed the up-regulation of the CSF2RB gene in their mouse RA models [87]. Interleukin receptor subunit gamma (IL2RG), which interacts with at least six cytokine receptors including IL2RA, IL4RA, IL7RA, IL9RA, and IL15RA, is up-regulated in four microarray datasets and is selected in all the four centralities. In addition, Chang et al. have observed the up-regulation of IL2RG in their microarray studies on RA synovial tissues [88]. c-myc (MYC) is down-regulated in three microarray datasets and is selected in all the four centralities. It also interacts with two current RA drug targets, the conserved helix-loop-helix ubiquitous kinase (CHUK) and JUN. Hashiramoto et al. have found the involvement of c-myc in RA pathogenesis [89]. They have observed that c-myc antisense oligodeoxynucleotides (AS ODN) arrested cell proliferation and induced apoptosis in rheumatoid synoviocytes. Further, Pap et al. have found c-myc, in co-operation with c-Raf-1, controls the growth and invasiveness of RASFs in the SCID mouse model of RA [90]. A subunit of the transcription factor AP-1, c-Fos (FOS), is down-regulated in three microarray datasets and is selected in all the four centrality measures. Further, Aikawa et al. have reported the suppression of RA by a small molecule inhibitor of c-Fos in CIA mice models [91]. CCR5 is up-regulated in four microarray datasets and is also selected in three centrality measures. Further, it interacts with the RA drug targets, JAK1 and JAK2. However, in a preclinical study blocking CCR5 with its antagonists and subsequently testing the effects of CCR5 blocking in a clinical trial with RA patients has not reported any clinical benefit [92-93]. The C-X-C chemokine receptor type 4 (CXCR4) is up-regulated in four microarray datasets and is selected in two centrality measures. Schmutz et al. have also observed the up-regulation of CXCR4 in synovial tissues [94]. Further, the antagonists of CXCR4 have reduced angiogenesis in the CIA murine models of RA [95]. Some antagonists of CXCR4 such as AMD3100 and a T140 analog have also reduced joint inflammation and the severity of RA [96-97]. Furthermore, low-level laser irradiation has reduced the expression of CXCR4 in CIA rat models of RA [98]. Even the single nucleotide polymorphisms (SNPs) of some genes such as PTPRC and TNFRSF14 are associated with RA [99-100]. In the current study, microarray and human PPI data were combined to generate a cytokine signaling network. We identified key molecules, which are the central proteins of this network with differential expression in RA. We also identified how these key molecules are connected to some of the current RA drug targets. Our strategy is based on a two dimensional information involving PPI and gene expression data. This network-based strategy, which led to the identification of key molecules of the cytokine signaling network, may be used for identifying multiple biomarkers, which may have potential for monitoring therapy responses. Even though eight of the key molecules were down-regulated, their knowledge can be used for making strategies for drug discovery. For instance, designing drugs in such a way that they (i) enhance the expression of the down-regulated genes or (ii) inhibit the action or expression of a particular molecule which is known to cause the down-regulation of the key molecule (for instance, inhibition of a transcriptional repressor) can be a useful strategy in dealing with the down-regulated genes. In addition, the gene expression signatures which include both the up- and down-regulated genes can be used for screening a library of bioactive small molecules using the connectivity map (CMAP) database [101]. The molecules can further be explored for their exact targets and mechanism. This way, the gene expression signatures can essentially be used in discovering new knowledge from existing knowledge. Moreover, biological systems are robust because the perturbations caused by drug treatments can be restored. Overcoming robustness is likely to be the key factor for finding better drug targets. In this respect, both the up- and down-regulated genes may be leveraged for a multi-targeted approach. In building the CPPIN network, we did not look for the differential expression of the cytokines in the microarray data. Cytokines are autocrine, paracrine and endocrine signaling molecules and they might be secreted by a bunch of different populated cell types in the synovium and synovial fluid of the patients with RA. Since they come from a variety of sources, they may not be differentially expressed in the microarray data. If they are secreted by other cell types and are present in the microenvironment of RASF, they can induce signaling pathways. All of the reported 12 cytokines in this study are known to be elevated in RA and they are known to activate their corresponding eight transcription factors. Elevated levels of the transcription factors may contribute to the enhanced expression of their target genes. However, an activated transcription factor may induce the expression of its target genes even though it is not differentially expressed but present in enough concentration. Therefore, we did not look for the differentially expressed cytokine and transcription factor pairs for building CPPIN. In summary, we have built a cytokine signaling network in RA. A combination of network centrality measures and gene expression profiling data has identified 24 key molecules of this network. Two of these are already drug targets for RA while 12 others physically interact with some of the recognized drug targets. Some of these molecules such as EGFR, PTPN6, LCK, CSF2RB, IL2RG, MYC, FOS, CXCR4, PTPRC and TNFRSF14 are well studied in RA by other workers and are reported to play a role in the pathogenesis of RA. However, our strategy herein was to develop a proof-of-principle method for identifying key molecules probably involved in the pathogenesis of RA. Though some of these molecules are well studied in RA, yet their crucial involvement in the disease and their amenability for drug discovery needs to be established.

Conclusions

The present study was focused on developing an approach that can maximize the use of the publicly available PPI and gene expression profiling data from GEO for identifying key molecules for RA. In this study, a comprehensive RA synovial-specific cytokine signaling network with 12 cytokines and 8 of their respective transcription factors has been built. Using a novel approach that combines network centrality measures and differential expression in microarray datasets, we identified 24 key molecules of this network probably involved in the pathogenesis of RA. Two of these molecules, JUN and SYK, are already known drug targets for RA. Of the remaining, 12 have direct PPI links to some of the current drug targets of RA. The scientific literature also provides evidences for the prominence of some of these 24 molecules in the pathogenesis of RA. These molecules, seemingly important to the cytokine signaling network, need to be further studied in order to establish their involvement in the pathogenesis of RA and to explore their potential for developing new therapeutics.

The list of experimental methods used for determining protein-protein interactions in the six databases.

(DOCX) Click here for additional data file.

The human PPI database created from the six publicly available protein-protein interaction databases namely HPRD, BioGRID, IntAct, MINT, STRING and CRG.

(TXT) Click here for additional data file.

The list of search terms used in PubMed for retrieving articles which provided information on active cytokines and their target transcription factors.

(DOC) Click here for additional data file.

The intermediate proteins of the shortest paths.

The number of occurrences of each of the intermediates is also mentioned in the file. (TXT) Click here for additional data file.

The edge list of the CPPIN network.

(TXT) Click here for additional data file.

An interaction map for 12 cytokines, eight transcription factors and 24 key molecules.

(XGMML) Click here for additional data file.

In addition to 24, 10 other genes showed up in microarray data analysis after removing RA samples clustered with healthy controls.

(XLSX) Click here for additional data file.

Hierarchical clustering of RA and control samples based on the gene expression of selected key molecules in GSE12021 (HGU133B) after removing the RA sample clustered with the healthy controls.

(TIF) Click here for additional data file.

Hierarchical clustering of RA and control samples based on the gene expression of selected key molecules in GSE12021 (HGU133A) after removing the RA samples clustered with the healthy controls.

(TIF) Click here for additional data file.

Hierarchical clustering of RA and control samples based on the gene expression of selected key molecules in GSE55457 after removing the RA samples clustered with the healthy controls.

(TIF) Click here for additional data file.
  95 in total

1.  Synoviocyte innate immune responses: II. Pivotal role of IFN regulatory factor 3.

Authors:  Susan E Sweeney; Trevor B Kimbler; Gary S Firestein
Journal:  J Immunol       Date:  2010-05-17       Impact factor: 5.422

2.  AMD3100, a potent and specific antagonist of the stromal cell-derived factor-1 chemokine receptor CXCR4, inhibits autoimmune joint inflammation in IFN-gamma receptor-deficient mice.

Authors:  P Matthys; S Hatse; K Vermeire; A Wuyts; G Bridger; G W Henson; E De Clercq; A Billiau; D Schols
Journal:  J Immunol       Date:  2001-10-15       Impact factor: 5.422

3.  Pathogenic role of CXCR7 in rheumatoid arthritis.

Authors:  Kaori Watanabe; Mark E T Penfold; Akio Matsuda; Naho Ohyanagi; Kayoko Kaneko; Yoshishige Miyabe; Kenji Matsumoto; Thomas J Schall; Nobuyuki Miyasaka; Toshihiro Nanki
Journal:  Arthritis Rheum       Date:  2010-11

4.  Anti-malarial agent artesunate inhibits TNF-alpha-induced production of proinflammatory cytokines via inhibition of NF-kappaB and PI3 kinase/Akt signal pathway in human rheumatoid arthritis fibroblast-like synoviocytes.

Authors:  H Xu; Y He; X Yang; L Liang; Z Zhan; Y Ye; X Yang; F Lian; L Sun
Journal:  Rheumatology (Oxford)       Date:  2007-02-21       Impact factor: 7.580

5.  A pro-inflammatory role for A20 and ABIN family proteins in human fibroblast-like synoviocytes in rheumatoid arthritis.

Authors:  Hideya Igarashi; Ayano Yahagi; Taro Saika; Jun Hashimoto; Tetsuya Tomita; Hideki Yoshikawa; Katsuhiko Ishihara
Journal:  Immunol Lett       Date:  2011-11-09       Impact factor: 3.685

6.  Preclinical and clinical investigation of a CCR5 antagonist, AZD5672, in patients with rheumatoid arthritis receiving methotrexate.

Authors:  Daniëlle M Gerlag; Sally Hollis; Mark Layton; Jiří Vencovský; Zoltán Szekanecz; Martin Braddock; Paul P Tak
Journal:  Arthritis Rheum       Date:  2010-11

7.  Galectin 3 induces a distinctive pattern of cytokine and chemokine production in rheumatoid synovial fibroblasts via selective signaling pathways.

Authors:  Andrew Filer; Magdalena Bik; Greg N Parsonage; John Fitton; Emily Trebilcock; Katherine Howlett; Michelle Cook; Karim Raza; David L Simmons; Andrew M C Thomas; Mike Salmon; Dagmar Scheel-Toellner; Janet M Lord; Gabriel A Rabinovich; Christopher D Buckley
Journal:  Arthritis Rheum       Date:  2009-06

8.  Identification of arthritis-related gene clusters by microarray analysis of two independent mouse models for rheumatoid arthritis.

Authors:  Noriyuki Fujikado; Shinobu Saijo; Yoichiro Iwakura
Journal:  Arthritis Res Ther       Date:  2006       Impact factor: 5.156

9.  Anti-Inflammatory Effects of TRAF-Interacting Protein in Rheumatoid Arthritis Fibroblast-Like Synoviocytes.

Authors:  Qing-Zhu Kong; Li-Tao Guo; Jia-Ning Yang; Yan-Fei Wang; Jing-Xin Zhao; Su-Hong Kong; Meng Zhang; Shi Yan; Yu Jin
Journal:  Mediators Inflamm       Date:  2016-10-26       Impact factor: 4.711

10.  Methotrexate hepatotoxicity in patients with rheumatoid arthritis.

Authors:  R Sotoudehmanesh; B Anvari; M Akhlaghi; S Shahraeeni; S Kolahdoozan
Journal:  Middle East J Dig Dis       Date:  2010-09
View more
  7 in total

Review 1.  In silico Methods for Identification of Potential Therapeutic Targets.

Authors:  Xuting Zhang; Fengxu Wu; Nan Yang; Xiaohui Zhan; Jianbo Liao; Shangkang Mai; Zunnan Huang
Journal:  Interdiscip Sci       Date:  2021-11-26       Impact factor: 3.492

2.  Promising targets and drugs in rheumatoid arthritis: a module-based and cumulatively scoring approach.

Authors:  Xingyan Li; Yejing Yang; Guili Sun; Wanwu Dai; Xuri Jie; Yongjun Du; Runjie Huang; Jiaming Zhang
Journal:  Bone Joint Res       Date:  2020-08-02       Impact factor: 5.853

3.  Cytokine Expression Pattern and Protein-Protein interaction network analysis of Leucocyte Rich Platelet Rich Fibrin and Injectable Form of Platelet Rich Fibrin.

Authors:  Sharmila Jasmine; Annamalai Thangavelu; Rajapandiyan Krishnamoorthy; Mohammed A Alshuniaber; Ali A Alshatwi
Journal:  Oral Maxillofac Surg       Date:  2020-09-11

4.  Analysis of Protein-Protein Interactions by Split Luciferase Complementation Assay.

Authors:  Yuekun Lang; Zhong Li; Hongmin Li
Journal:  Curr Protoc Toxicol       Date:  2019-12

5.  Mitochondrial dysfunction in rheumatoid arthritis: A comprehensive analysis by integrating gene expression, protein-protein interactions and gene ontology data.

Authors:  Venugopal Panga; Ashwin Adrian Kallor; Arunima Nair; Shilpa Harshan; Srivatsan Raghunathan
Journal:  PLoS One       Date:  2019-11-08       Impact factor: 3.240

6.  Serum proteomic networks associate with pre-clinical rheumatoid arthritis autoantibodies and longitudinal outcomes.

Authors:  Liam J O'Neil; Xiaobo Meng; Caitlin Mcfadyen; Marvin J Fritzler; Hani S El-Gabalawy
Journal:  Front Immunol       Date:  2022-09-08       Impact factor: 8.786

7.  Reconstruction of Intercellular Signaling Network by Cytokine-Receptor Interactions.

Authors:  Somayeh Azadian; Javad Zahiri; Seyed Shahriar Arab; Reza Hassan Sajedi
Journal:  Iran J Biotechnol       Date:  2021-01-01       Impact factor: 1.671

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.