The essential and downstream common proteins of amyotrophic lateral sclerosis: A protein-protein interaction network analysis.

Yimin Mao1,2, Su-Wei Kuo2, Le Chen1, C J Heckman2,3,4, M C Jiang2.   

Abstract

Amyotrophic Lateral Sclerosis (ALS) is a devastating neurodegenerative disease characterized by selective loss of motoneurons. While several breakthroughs have been made in identifying ALS genetic defects, the detailed molecular mechanisms are still unclear. These genetic defects are involved in numerous biological processes, which converge on a common destiny: motoneuron degeneration. In addition, the commonly comorbid Frontotemporal Dementia (FTD) further complicates the investigation of ALS etiology. In this study, we aimed to explore the protein-protein interaction network built on known ALS-causative genes to identify essential proteins and common downstream proteins between classical ALS and ALS+FTD (classical ALS + ALS/FTD) groups. The results suggest that classical ALS and ALS+FTD share a similar essential protein set (VCP, FUS, TDP-43 and hnRNPA1) but have distinctive functional enrichment profiles. Thus, disruptions to these essential proteins might render motoneurons susceptible to cellular stresses and eventually vulnerable to proteinopathies. Moreover, we identified a common downstream protein, ubiquitin-C, which is extensively interconnected with ALS-causative proteins (22 out of 24) and was not previously linked to ALS. Our in silico approach provides a computational background for identifying ALS therapeutic targets and points to the potential downstream common ground of ALS-causative mutations.

Year:  2017        PMID: 28282387      PMCID: PMC5345759          DOI: 10.1371/journal.pone.0172246

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Amyotrophic Lateral Sclerosis (ALS) is a neurodegenerative disorder characterized by progressive and selective loss of upper and lower motoneurons, with no effective treatment available[1]. Clinical symptoms include tremor, muscle weakness, spasticity and paralysis, and patients usually die from respiratory failure within five years[2]. The majority (90–95%) of ALS cases are sporadic (sALS); only a small cohort of patients (5–10%) are familial cases (fALS) associated with autosomal dominant inheritance[3, 4]. The incidence rate was 2.7/100,000 in a ten-year study in Ireland[5]. To date, 26 subtypes of ALS are listed in the Online Mendelian Inheritance in Man (OMIM) database, with varied disease onset time and site of symptom onset, and 15–52% of ALS patients are comorbid with Frontotemporal Dementia (FTD)[6, 7], the second most common dementia, which comprises a series of neurological symptoms involving frontotemporal lobar degeneration. FTD can exist alone without developing into ALS, while certain forms of FTD and ALS share some clinical and genetic features[8]. Although the pathogenic mechanisms of ALS are not fully clear, a number of gene mutations linked to ALS have been discovered over the past 20 years, in genes such as superoxide dismutase 1 (SOD1), TAR DNA-binding protein (TARDBP), fused in sarcoma (FUS), optineurin (OPTN), valosin-containing protein (VCP), sequestosome-1 (SQSTM1), ubiquilin-2 (UBQLN2), C9ORF72, and heterogeneous nuclear ribonucleoproteins A1 and A2/B1 (HNRNPA1 and HNRNPA2B1)[9-18]. In particular, SOD1, TDP-43, FUS, optineurin and ubiquilin-2 proteins were identified in aggregates from the autopsies of many patients[19]. These ALS-causative genes encode proteins with divergent functions, many of which fall into several categories, such as cellular transport (axonal/vesicle transport, or proteins whose aggregation would impede transport), RNA processing and the ubiquitin proteasome system.
However, it remains unclear how these minimally related proteins, once impaired, all result in motoneuron degeneration, which eventually leads to the various ALS subtypes. One explanation is that some of the ALS-causative proteins are “essential proteins” that, when acted on inappropriately by other ALS-causative mutations, would have profound effects on motoneuron survival. An alternative hypothesis is that there may be common downstream proteins which maximally connect with those ALS-causative proteins through either direct or indirect interaction. The original idea of “essential proteins” is that these proteins are crucial for survival and that their deletion confers a lethal phenotype[20]. The conventional experimental approaches for identifying essential proteins, such as gene knock-out or RNA interference, are usually time-consuming and costly. Several previous studies have shown the feasibility of computational approaches to predict gene essentiality and morbidity[21-24]. For example, topological properties of protein-protein interaction (PPI) networks have been employed to identify essential proteins in various organisms[25, 26]. The main idea is the “centrality-lethality rule”, in which highly connected hub proteins in a PPI network are more essential to survival[22]. Although there is still significant debate regarding the rule, several studies suggest a correlation between topological centrality and protein essentiality[27-29]. To identify crucial ALS-causative proteins, we applied this same concept to calculate the essentiality of each protein in a PPI network. This approach also allowed us to investigate the roles in the PPI network of ALS/FTD mutations such as C9ORF72, which has been shown to involve pathways distinct from those of sALS in the brain transcriptome[30].
Given the divergent functional backgrounds of these ALS mutations, it is tempting to hypothesize that there might be a common downstream interaction, either via individual proteins or convergent pathways, which would render motoneurons vulnerable to toxicity. To evaluate this possibility, we fully explored the ALS PPI network to analyze those proteins capable of interacting with ALS-causative proteins directly. If such proteins do exist, they must maximally interact with ALS-causative proteins and play vital roles in maintaining cellular activities. Thus, upstream ALS-causative mutations might impair or disrupt the function of downstream proteins and cause long-term toxic effects. In the current study, we constructed a PPI network based on ALS-causative genes imported from the OMIM database. Genes linked only to classical ALS were analyzed with or without genes implicated in ALS/FTD, giving the ALS+FTD group and the classical ALS group respectively. Our integration of network topological properties and protein cluster information revealed that the classical ALS and ALS+FTD groups showed similar essential protein sets (VCP, FUS, TDP-43 and hnRNPA1) but distinct patterns in functional enrichment analysis. These essential proteins might represent a set of proteins susceptible to disruption. Moreover, we identified an interconnected common protein, ubiquitin-C, which extensively interacts with almost all ALS-causative proteins (22 out of 24 proteins). To the best of our knowledge, ubiquitin-C has not yet been linked to ALS. Our results, based on computational analyses, suggest potential novel disease mechanisms that may underlie various forms of ALS and shed light on new directions in ALS research.

Materials and methods

The overall workflow used in this study is shown in Fig 1. The process involves four main steps: construction, processing, identification and producing. Construction consisted of obtaining ALS causal genes from the OMIM database (labeled (1) in Fig 1) and constructing the protein-protein interaction network of ALS based on the I2D database (2). Processing consisted of detecting clusters by analyzing the PPI network (3); assessing topological properties by measuring degree centrality in the network based on global PPI (4); and finally, analyzing functional enrichment and pathways (5). Identification consisted of finding essential proteins for ALS with the adopted edge clustering coefficient (ECC) method (6) and identifying biological functions and pathways by GO and KEGG pathway enrichment analysis (7). Producing consisted of analyzing downstream common proteins with the purpose-designed DFloyd algorithm (8).
Fig 1

Overall work flow.

2.1 Genetic defects in ALS and construction of PPI network

The OMIM database is a disease phenotype database and a catalogue of human genes and genetic disorders[31]. In OMIM, the list of hereditary disease genes is described in the OMIM morbid map. The ALS-causative genes were imported from the OMIM morbid map (https://www.omim.org/phenotypicSeries/PS105400). Corresponding protein names and IDs were obtained from the mapping scheme of the UniProt database[32]. We considered only those loci with known encoded protein profiles in UniProt; thus ALS3 and ALS7 were excluded from this research. We referred to genes linked to ALS1-22 in OMIM, except ALS3 and ALS7, as classical ALS (20 genes); FTDALS 1–4 were regarded as ALS/FTD (4 genes) (Table 1). The total set of ALS-causative genes is referred to as ALS+FTD (20+4 genes) in this research. In order to investigate the differences between the classical ALS and ALS+FTD groups, we explored experimentally validated interactions from the Interologous Interaction Database (I2D) (http://ophid.utoronto.ca/ophidv2.204/)[33, 34]. I2D is an integrated protein-protein interaction database of known experimental and predicted human protein interaction data sets (including HPRD, BIND, and BioGRID). The gene names were entered into the I2D database with human as the only chosen target organism. I2D contains more than 290,000 experimental interactions from 38 databases of human source[34]. Thus, choosing I2D instead of a single database might help minimize bias[35]. All homology-predicted protein interactions in the I2D database were excluded to increase the reliability of the protein interaction data. The remaining experiment-based PPIs were then used to construct the classical ALS and ALS+FTD PPI networks, with ALS-causative proteins and their interacting partners as nodes and interactions as edges. The comprehensive interaction data published in the I2D database enabled our analysis of a complete network of disease proteins.
Identifiers of proteins were unified using the protein IDs defined in the UniProt database[32]. Because some proteins have multiple names, the results in tables and figures are presented as gene name and UniProt ID to avoid ambiguity. Our study used the I2D database version 2.9, released in September 2015.
Table 1

ALS-causative genes from OMIM.

| Subtype | UniProt ID | Gene |
| --- | --- | --- |
| ALS1 | P00441 | SOD1 |
| ALS2 | Q96Q42 | ALS2 |
| ALS3 | - | - |
| ALS4 | Q7Z333 | SETX |
| ALS5 | Q96J17 | SPG11 |
| ALS6 | P35637 | FUS |
| ALS7 | - | - |
| ALS8 | O95292 | VAPB |
| ALS9 | P03950 | ANG |
| ALS10 | Q13148 | TARDBP |
| ALS11 | Q92562 | FIG4 |
| ALS12 | Q96CV9 | OPTN |
| ALS13 | Q99700 | ATXN2 |
| ALS14 | P55072 | VCP |
| ALS15 | Q9UHD9 | UBQLN2 |
| ALS16 | Q99720 | SIGMAR1 |
| ALS17 | Q9UQN3 | CHMP2B |
| ALS18 | P07737 | PFN1 |
| ALS19 | Q15303 | ERBB4 |
| ALS20 | P09651 | HNRNPA1 |
| ALS21 | P43243 | MATR3 |
| ALS22 | P68366 | TUBA4A |
| FTD-ALS1 | Q96LT7 | C9ORF72 |
| FTD-ALS2 | Q8WYQ3 | CHCHD10 |
| FTD-ALS3 | Q13501 | SQSTM1 |
| FTD-ALS4 | Q9UHD2 | TBK1 |
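The network construction described in Section 2.1 (keep experiment-based interactions, drop homology-predicted ones, treat interactions as undirected edges) can be illustrated with a minimal Python sketch. The record layout, the `evidence` labels and the node names `P1`...`P4` are hypothetical placeholders, not the actual I2D export format or real interaction data; skipping self-interactions is also an assumption made here for simplicity.

```python
from collections import defaultdict

# Hypothetical records: (protein_a, protein_b, evidence).
# Real I2D exports use their own column layout; this is toy data only.
records = [
    ("P1", "P2", "experimental"),
    ("P3", "P4", "experimental"),
    ("P3", "P2", "predicted"),      # homology-predicted, excluded
    ("P4", "P3", "experimental"),   # same pair in reversed order
    ("P1", "P1", "experimental"),   # self-interaction, skipped (assumption)
]

def build_network(records):
    """Return an undirected adjacency map built from experimental records only."""
    adjacency = defaultdict(set)
    for a, b, evidence in records:
        if evidence != "experimental" or a == b:
            continue
        adjacency[a].add(b)
        adjacency[b].add(a)
    return dict(adjacency)

network = build_network(records)
# Each undirected edge is stored once as an unordered pair.
edges = {frozenset((a, b)) for a, partners in network.items() for b in partners}
print(len(network), "nodes,", len(edges), "edges")
```

Storing neighbours in sets means duplicated or reversed records collapse into a single edge.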

2.2 Computing topological properties of protein interaction network

We evaluated the centrality of proteins in the PPI network to investigate their essentiality. Many studies have indicated that PPI networks show “small-world behavior” and “centrality-lethality”[22, 36]. Removal of nodes with high centrality makes the PPI network collapse into isolated clusters, which might imply the collapse of the biological system. The degree centrality (DC) of a protein indicates how many interactions that protein has with other proteins, and we applied this topological measure to each node (protein) to assess its role in the network. A PPI network is represented as an undirected graph G(V, E) with proteins as nodes and interactions as edges. The degree centrality is calculated by:

$$DC(v) = \sum_{u \in V,\ u \neq v} e(u, v)$$

where v represents a node in the PPI network and u is any node other than v in the network. e(u, v) represents the interaction between v and u: if such an interaction exists, the value of e(u, v) is one; if not, e(u, v) is zero. DC(v) is therefore the total number of interactions between v and the other nodes u.
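As an illustration of the definition above, the following minimal Python sketch counts the distinct interaction partners of each node in a small undirected edge list. The node names and edges are toy placeholders, not data from the ALS network.

```python
def degree_centrality(edges):
    """Count distinct interaction partners of every node in an undirected graph."""
    neighbours = {}
    for u, v in edges:
        if u == v:
            continue  # a self-loop does not add a partner u != v
        neighbours.setdefault(u, set()).add(v)
        neighbours.setdefault(v, set()).add(u)
    return {node: len(partners) for node, partners in neighbours.items()}

# Toy edge list; ("C", "A") repeats the A-C interaction in reversed order.
toy_edges = [("A", "C"), ("A", "B"), ("B", "D"), ("B", "C"), ("C", "A")]
dc = degree_centrality(toy_edges)
for node, value in sorted(dc.items(), key=lambda item: (-item[1], item[0])):
    print(node, value)
```

Node `B` has three partners (`A`, `C`, `D`), so it ranks first in this toy graph.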

2.3 Identification of clusters

A unique feature of the clique percolation clustering method (CPM) is that it can uncover the overlapping community structure of complex networks, i.e., one node can belong to several communities[37]. In order to detect the densely connected, possibly overlapping regions of the network and their functions, the PPI data were imported into CFinder-2.0.6 (an open source software platform for the CPM method), and the clustering analysis was performed. Each cluster was identified as a k-clique community, defined as the union of all k-cliques (complete sub-graphs of size k) that can be reached from each other through a series of adjacent k-cliques, where two k-cliques are adjacent if they share k-1 nodes.
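The k-clique community definition can be illustrated with a small, brute-force Python sketch. This is not CFinder's implementation: it enumerates k-cliques exhaustively (practical only for toy graphs) and merges cliques that share k-1 nodes with a union-find structure. The graph and node names are hypothetical.

```python
from itertools import combinations

def to_adjacency(edges):
    """Build an undirected adjacency map from an edge list."""
    adjacency = {}
    for u, v in edges:
        adjacency.setdefault(u, set()).add(v)
        adjacency.setdefault(v, set()).add(u)
    return adjacency

def k_clique_communities(adjacency, k):
    """Clique percolation: union of k-cliques reachable via shared (k-1)-node overlaps."""
    nodes = sorted(adjacency)
    # Brute-force enumeration of all complete sub-graphs of size k.
    cliques = [
        frozenset(candidate)
        for candidate in combinations(nodes, k)
        if all(b in adjacency[a] for a, b in combinations(candidate, 2))
    ]
    parent = list(range(len(cliques)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    # Two k-cliques are adjacent when they share exactly k-1 nodes.
    for i, j in combinations(range(len(cliques)), 2):
        if len(cliques[i] & cliques[j]) == k - 1:
            parent[find(i)] = find(j)

    communities = {}
    for index, clique in enumerate(cliques):
        communities.setdefault(find(index), set()).update(clique)
    return sorted(sorted(members) for members in communities.values())

toy_edges = [("A", "B"), ("A", "C"), ("B", "C"), ("B", "D"),
             ("C", "D"), ("D", "E"), ("D", "F"), ("E", "F")]
communities = k_clique_communities(to_adjacency(toy_edges), k=3)
print(communities)
```

In this toy graph, node `D` belongs to both communities, which is the overlapping behaviour that distinguishes CPM from partition-based clustering.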

2.4 Finding essential proteins

The use of global centrality measures based on network topology has become an important method for identifying essential proteins. However, recent research has pointed out that many essential proteins have low connectivity and are difficult to identify by centrality measures[37-41]. Hart et al. pointed out that clusters are highly correlated with essential proteins[42]. A new method, ECC, was proposed to identify essential proteins by integrating PPI network topology and cluster information. The unabridged ECC method can be found in Ren et al.[43]. Its basic concepts are as follows. The in-degree K(i, C) of a protein i in a cluster C was defined as the number of interactions which connect i to other proteins in C. The complex centrality of a protein i, Complex_C(i), was defined as the sum of the in-degree values of i in all clusters which include it, so that Complex_C(i) reflects how many protein complexes overlap at i:

$$Complex\_C(i) = \sum_{C_i \in CS,\ i \in C_i} K(i, C_i)$$

where CS was the cluster set, and C_i was a cluster which included i. The global centrality adopted subgraph centrality (DC) as it had better performance in identifying essential proteins among centrality methods[29]. To integrate DC(i) and Complex_C(i), a harmonic centrality (HC) of protein i was defined as follows:

$$HC(i) = \alpha \cdot \frac{DC(i)}{DC_{max}} + (1 - \alpha) \cdot \frac{Complex\_C(i)}{Complex\_C_{max}}$$

where α was a proportionality coefficient taking a value in the range of 0 to 1, generally set to 0.5; DC_max was the maximum DC value; and Complex_C_max was the maximum Complex_C value.
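The combination of a global centrality with cluster membership can be sketched in Python as follows. This is an illustration under stated assumptions, not the code of Ren et al.: the global centrality DC is taken here to be the degree centrality of Section 2.2, α is set to 0.5, and the graph and clusters are toy placeholders.

```python
def complex_centrality(protein, clusters, adjacency):
    """Sum of in-degrees K(i, C) of `protein` over every cluster C containing it."""
    total = 0
    for cluster in clusters:
        if protein in cluster:
            total += len(adjacency[protein] & (cluster - {protein}))
    return total

def harmonic_centrality(adjacency, clusters, alpha=0.5):
    """HC(i) = alpha * DC(i)/DC_max + (1 - alpha) * Complex_C(i)/Complex_C_max."""
    dc = {p: len(partners) for p, partners in adjacency.items()}  # assumption: DC = degree
    cc = {p: complex_centrality(p, clusters, adjacency) for p in adjacency}
    dc_max = max(dc.values()) or 1  # guard against division by zero
    cc_max = max(cc.values()) or 1
    return {
        p: alpha * dc[p] / dc_max + (1 - alpha) * cc[p] / cc_max
        for p in adjacency
    }

# Toy undirected graph and two overlapping clusters (hypothetical).
adjacency = {
    "A": {"B", "C"},
    "B": {"A", "C", "D"},
    "C": {"A", "B", "D"},
    "D": {"B", "C", "E", "F"},
    "E": {"D", "F"},
    "F": {"D", "E"},
}
clusters = [{"A", "B", "C", "D"}, {"D", "E", "F"}]

hc = harmonic_centrality(adjacency, clusters)
for protein in sorted(hc, key=lambda p: (-hc[p], p)):
    print(protein, round(hc[protein], 3))
```

Node `D` sits in both clusters and has the highest degree, so both normalised terms equal one for it.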

2.5 Functional enrichment analysis

Functional enrichment analysis was performed to further study the functions and enriched pathways of the clusters, based on the GO database (Version No.2010.09.03) (http://www.geneontology.org/)[44] and KEGG pathway (http://www.genome.jp/kegg/)[45], respectively. In functional analysis, P < 0.01 was considered statistically significant. This analysis was performed by using the Database for Annotation, Visualization, and Integrated Discovery (DAVID, http://david.abcc.ncifcrf.gov/tools.jsp)[46, 47], which is an online platform providing functional annotation tools to analyze the biological meaning behind large lists of genes.
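The idea behind an enrichment P value can be illustrated with a plain hypergeometric tail probability. This sketch is only an approximation of the concept: DAVID reports an EASE score, which is a modified Fisher's exact test, so the numbers below would not match DAVID output. All counts are hypothetical.

```python
from math import comb

def hypergeom_enrichment_p(population, annotated, sample, hits):
    """P(X >= hits) when `sample` genes are drawn from `population` genes,
    of which `annotated` carry the term of interest."""
    upper = min(annotated, sample)
    total = comb(population, sample)
    tail = sum(
        comb(annotated, x) * comb(population - annotated, sample - x)
        for x in range(hits, upper + 1)
    )
    return tail / total

# Hypothetical counts: 20 background genes, 5 annotated with a term,
# a cluster of 4 genes of which 2 carry the term.
p_value = hypergeom_enrichment_p(population=20, annotated=5, sample=4, hits=2)
print(p_value)
```

A term would then be kept only if its P value falls below the chosen threshold (P < 0.01 in this study).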

2.6 Downstream common protein analysis

We carried out five steps to analyze the downstream common proteins by designing a deformation Floyd (DFloyd) algorithm[34] based on the shortest path. Given the source set S and the target set T, S is the protein set composed of two sources: (1) k-clique cluster proteins under the highest possible k value, excluding ALS-causative proteins; (2) proteins present in significantly enriched GO terms and KEGG pathways. T is the ALS-causative protein set. The proteins in S and T are all part of the ALS PPI network. The DFloyd algorithm is shown below.
1. Calculate the relation edge set W from source set S to target set T by enumeration: W = {<s_1, t_1>, <s_1, t_2>, …, <s_1, t_n>, …, <s_i, t_j>, …, <s_m, t_n>}.
2. Compute distance(s, t) for all <s, t> ∈ W. If there is no edge between s and t, then distance(s, t) = ∞; otherwise distance(s, t) = 1.
3. Remove from W all <s, t> with distance(s, t) = 1.
4. For any <s, t> in W: Label = False. If there is a vertex v such that distance(s, v) + distance(v, t) < distance(s, t), then distance(s, t) = distance(s, v) + distance(v, t) and Label = True. Endfor.
5. Repeat step 4 until Label = False.
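The steps above can be sketched in Python as a Floyd-style repeated relaxation. This is one possible reading of the description, not the authors' implementation: to make the distances through intermediate vertices well defined, the sketch relaxes every node pair of the network (not only the pairs in W) until nothing changes, and then reports the source-target pairs. The function name `dfloyd_sketch` and the toy graph are hypothetical.

```python
from itertools import product

INF = float("inf")

def dfloyd_sketch(adjacency, sources, targets):
    """Return (direct pairs, {indirect pair: shortest distance}) for S x T."""
    nodes = sorted(adjacency)
    # Step 2: distance 1 for an edge, infinity otherwise (0 for a node to itself).
    dist = {
        (a, b): 0 if a == b else (1 if b in adjacency[a] else INF)
        for a, b in product(nodes, repeat=2)
    }
    # Step 1: enumerate the relation set W = S x T.
    pairs = [(s, t) for s in sources for t in targets]
    # Step 3: set aside the pairs that interact directly.
    direct = [pair for pair in pairs if dist[pair] == 1]
    # Steps 4-5: relax through intermediate vertices until no distance changes.
    changed = True
    while changed:
        changed = False
        for a, b in product(nodes, repeat=2):
            for v in nodes:
                through = dist[(a, v)] + dist[(v, b)]
                if through < dist[(a, b)]:
                    dist[(a, b)] = through
                    changed = True
    indirect = {pair: dist[pair] for pair in pairs if pair not in direct}
    return direct, indirect

# Toy network: s1 touches t1 and t2 directly and reaches t3 via t2 and x.
toy = {
    "s1": {"t1", "t2"},
    "t1": {"s1"},
    "t2": {"s1", "x"},
    "x": {"t2", "t3"},
    "t3": {"x"},
}
direct, indirect = dfloyd_sketch(toy, sources=["s1"], targets=["t1", "t2", "t3"])
print(direct)
print(indirect)
```

Counting, for each source protein, how many targets appear in `direct` gives the kind of ranking by number of direct interactions with ALS-causative proteins that is reported in the Results.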

2.7 Statistics

The Thompson Tau test was performed to detect whether a value deviated significantly from the mean. The modified Thompson Tau was calculated as below:

$$\tau = \frac{t \cdot (n - 1)}{\sqrt{n}\,\sqrt{n - 2 + t^{2}}}$$

where t = Student’s t value, α = 0.05, df = n-2, and n is the number of values. ** >mean ±τ SD; * >mean ±(τ SD)/2
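A minimal Python sketch of this test is given below. It assumes that the caller supplies the critical Student's t value (the standard library has no inverse t distribution), that the sample standard deviation is used, and that the two marker levels follow the legend above. The input values are toy numbers; 3.182 is the two-sided critical t for α = 0.05 with df = 3.

```python
from math import sqrt
from statistics import mean, stdev

def modified_thompson_tau(t_crit, n):
    """tau = t * (n - 1) / (sqrt(n) * sqrt(n - 2 + t^2))."""
    return t_crit * (n - 1) / (sqrt(n) * sqrt(n - 2 + t_crit ** 2))

def flag_values(values, t_crit):
    """Mark each value with '**', '*' or '' according to its deviation from the mean."""
    tau = modified_thompson_tau(t_crit, len(values))
    centre, spread = mean(values), stdev(values)  # sample SD (assumption)
    flags = []
    for value in values:
        deviation = abs(value - centre)
        if deviation > tau * spread:
            flags.append("**")
        elif deviation > tau * spread / 2:
            flags.append("*")
        else:
            flags.append("")
    return flags

toy_values = [10, 12, 11, 13, 40]
flags = flag_values(toy_values, t_crit=3.182)
print(list(zip(toy_values, flags)))
```

Only the value 40 exceeds mean ± τ SD in this toy sample, so it alone receives the `**` marker.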

Results

3.1 Genes and PPI

Genetic deficits accounting for various classical ALS (ALS1-2, 4–6 and 8–22; 20 genes) and ALS/FTD phenotypes (ALS/FTD1-4; 4 genes) were obtained from the OMIM database as shown in Table 1. We studied their interactions based on known protein-protein interactions by exploring the I2D database. To investigate the roles of ALS/FTD mutations in ALS, mutations linked to classical ALS phenotypes were separated from those causing ALS/FTD. The PPI network was constructed from proteins encoded by classical ALS (20) or ALS+FTD gene set (20+4). The I2D contains more than 230,000 experiment-based interactions and around 70,000 predicted interactions from human source. Our input of ALS-causative proteins into I2D yielded 4,144 interactions in classical ALS group and 5,454 interactions in ALS+FTD group. After removal of homologous predicted interactions, 3,023 (S1 Text) and 3,764 (S2 Text) interactions with 1,932 and 2,288 interacting nodes were used to construct the PPI networks of classical ALS and ALS+FTD respectively.

3.2 Network centrality degree analysis

The topological properties of the protein interactions were calculated to identify essential proteins in the network. The degree centrality of each protein in the PPI network is ranked in Table 2. As expected, the majority of proteins with high degree centrality were encoded by the ALS-causative genes listed in Table 1. Interestingly, ubiquitin-C (UBC) and YWHAE appeared in the ALS+FTD group as the only two proteins not encoded by ALS-causative genes. Neither UBC nor YWHAE has been linked to ALS or FTD yet. In both groups, VCP, hnRNPA1 and FUS were all in the top five, with markedly higher degree centrality. The results suggested the importance and extensive involvement of VCP, hnRNPA1 and FUS in ALS pathogenesis. Moreover, our results revealed an intriguing role of UBC in the ALS+FTD PPI network, a link that might otherwise have been overlooked. It is not surprising to see similar results between the two groups, because degree centrality alone carries little topological information. Thus, we performed cluster analysis to further examine the PPI network.
Table 2

Degree centrality ranking in ALS PPI network.

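| Rank | Classical ALS UniProt | Classical ALS protein | Classical ALS DC | ALS+FTD UniProt | ALS+FTD protein | ALS+FTD DC |
| --- | --- | --- | --- | --- | --- | --- |
| 1 | P55072 | VCP | 729** | P55072 | VCP | 729** |
| 2 | P35637 | FUS | 385* | Q13501 | SQSTM1 | 596** |
| 3 | P09651 | HNRNPA1 | 351 | P35637 | FUS | 385* |
| 4 | Q13148 | TARDBP | 319 | P09651 | HNRNPA1 | 351 |
| 5 | P68366 | TUBA4A | 223 | Q13148 | TARDBP | 319 |
| 6 | P00441 | SOD1 | 216 | P68366 | TUBA4A | 223 |
| 7 | P43243 | MATR3 | 149 | P00441 | SOD1 | 216 |
| 8 | P07737 | PFN1 | 110 | P43243 | MATR3 | 149 |
| 9 | Q96CV9 | OPTN | 101 | Q9UHD2 | TBK1 | 144 |
| 10 | Q99720 | SIGMAR1 | 90 | P07737 | PFN1 | 110 |
| 11 | Q9UHD9 | UBQLN2 | 78 | Q96CV9 | OPTN | 101 |
| 12 | O95292 | VAPB | 73 | Q99720 | SIGMAR1 | 90 |
| 13 | Q15303 | ERBB4 | 68 | Q9UHD9 | UBQLN2 | 78 |
| 14 | Q99700 | ATXN2 | 55 | O95292 | VAPB | 73 |
| 15 | | | | Q15303 | ERBB4 | 68 |
| 16 | | | | Q99700 | ATXN2 | 55 |
| 17 | | | | Q9UQN3 | CHMP2B | 27 |
| 18 | | | | Q7Z333 | SETX | 25 |
| 19 | | | | P0CG48 | UBC | 22 |
| 20 | | | | Q96Q42 | ALS2 | 14 |
| 21 | | | | Q96LT7 | C9orf72 | 13 |
| 22 | | | | P62258 | YWHAE | 11 |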

** >mean ±τ SD;

* >mean ±(τ SD)/2

3.3 Cluster analysis

To further scrutinize the complex ALS PPI network, we conducted a cluster analysis to obtain a clearer picture of the interactions between proteins in the network. The clusters (cliques) with densely connected nodes in the PPI network were detected using the ClusterOne plug-in of the CFinder 2.0.6 software. In the classical ALS group, 393 and 136 clusters were identified with the minimum clique size set to three (k = 3) and four (k = 4) respectively, while in the ALS+FTD group, 578 clusters were detected at k = 3 and 59 clusters at k = 5. In the current study, VCP, FUS, hnRNPA1 and TDP-43 were the cores of the network in the classical ALS group (k = 4); the ALS+FTD group (k = 5) shared similar features but included SQSTM1 instead of hnRNPA1 and FUS (Fig 2A and 2B). In the classical ALS group, however, a few proteins not encoded by the classical ALS gene set appeared in the list: UBC, YWHAE, YWHAZ and sequestosome-1/p62 (SQSTM1). Mutations in SQSTM1 are known to be associated with ALS/FTD. In this research, SQSTM1 was categorized in the ALS+FTD group, but it is also involved in some cases of ALS without FTD[48]; its appearance in the classical ALS network therefore supports the validity of our algorithm and highlights the importance of SQSTM1/p62 in pathology across the ALS-FTD spectrum. More importantly, the surrounding proteins which interact with the cluster cores provide important clues for further investigating the downstream pathways of ALS pathogenesis. The involvement of each protein was assessed by calculating the number of clusters in which it participated. VCP, TDP-43 and hnRNPA1 were at the top of the list in both groups (Table 3). Notably, UBC and YWHAE again appeared in both groups as proteins not encoded by ALS-causative genes.
Fig 2

The clusters of (a) classical ALS, 136 clusters at k = 4; (b) ALS+FTD, 59 clusters at k = 5. The yellow highlights the core clusters identified by significant involvement ranking calculated in Table 3 based on Thompson Tau test.

Table 3

Protein involvement based on participated cluster number.

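| Rank | Classical ALS UniProt | Classical ALS protein | Classical ALS clusters | ALS+FTD UniProt | ALS+FTD protein | ALS+FTD clusters |
| --- | --- | --- | --- | --- | --- | --- |
| 1 | P55072 | VCP | 296** | Q13501 | SQSTM1 | 460** |
| 2 | Q13148 | TDP-43 | 269** | P55072 | VCP | 351* |
| 3 | P09651 | HNRNPA1 | 214* | Q13148 | TDP-43 | 294* |
| 4 | P35637 | FUS | 185* | P09651 | HNRNPA1 | 241 |
| 5 | P43243 | MATR3 | 81 | P35637 | FUS | 228 |
| 6 | P07737 | PFN1 | 65 | P43243 | MATR3 | 88 |
| 7 | Q96CV9 | OPTN | 25 | P07737 | PFN1 | 65 |
| 8 | O95292 | VAPB | 20 | P68366 | TUBA4A | 62 |
| 9 | Q99700 | ATXN2 | 16 | P00441 | SOD1 | 47 |
| 10 | P0CG48 | UBC | 11 | Q96CV9 | OPTN | 47 |
| 11 | Q96Q42 | ALS2 | 9 | O95292 | VAPB | 20 |
| 12 | P00441 | SOD1 | 8 | Q99700 | ATXN2 | 16 |
| 13 | Q13501 | SQSTM1 | 8 | P0CG48 | UBC | 14 |
| 14 | Q9UHD9 | UBQLN2 | 6 | Q9UHD2 | TBK1 | 12 |
| 15 | P62258 | YWHAE | 6 | Q96Q42 | ALS2 | 8 |
| 16 | P63104 | YWHAZ | 6 | P62258 | YWHAE | 8 |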

** >mean ±τ SD;

* >mean ±(τ SD)/2

3.4 Essential proteins

All of the above analyses were based on network topological properties. In order to take the relevance between interactions and protein essentiality into account, we used the ECC method to identify essential proteins in the network. The harmonic centrality results are shown in Fig 3, in which VCP, hnRNPA1, FUS and TDP-43 were the top-ranking hub proteins in both the classical ALS and ALS+FTD PPI networks, based on the Thompson Tau test. In addition, SQSTM1/p62 had the highest harmonic centrality in the ALS+FTD group, and UBC appeared in both groups as the only non-ALS-causative protein. Interestingly, the widely present C9ORF72 mutation (34% of fALS and 7% of sALS[48, 49]) was not top ranked in either group. However, its interacting protein profile was distinct from those of other ALS-causative proteins (Fig 4A).
Fig 3

Essential proteins ranked by HC (a) classical ALS; (b) ALS+FTD. ** >mean ±τ SD; * >mean ±(τ SD)/2

Fig 4

Profiles of the interacting proteins.

A) Profile of C9orf72 with only direct interaction B) The profiles include both direct and indirect interactions of important downstream proteins. Red, blue and green denote direct, secondary and tertiary contact proteins respectively. Yellow highlights ALS/FTD proteins.

3.5 Functional enrichment analysis

GO term and KEGG pathway enrichment analyses were performed on the 393 clusters (k = 3) in the classical ALS group as well as the 578 clusters (k = 3) in the ALS+FTD group. GO term analysis was carried out in three categories: biological process (BP), molecular function (MF), and cellular component (CC). Tables 4 and 5 list the top five GO terms of BP, MF, and CC as well as the top five KEGG pathways (GO terms with P values ≥ 0.01 were discarded). In the classical ALS group, the most significant terms in BP and CC were all related to RNA processing; translation control was also highlighted in BP (Table 4). In accordance with the results of the GO term analysis, the ribosome and spliceosome pathways were both significant in the KEGG pathway analysis (Table 5). Consistently, the ALS+FTD group showed high functional enrichment in translation control, ribosome and RNA processing (Table 4). Autophagy, ribosome, spliceosome and neurotrophin signaling pathways were shared by the classical ALS and ALS+FTD groups in the KEGG enrichment analysis (Table 5).
Table 4

Most significantly enriched GO terms.

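| Category | Classical ALS term | P value | ALS+FTD term | P value |
| --- | --- | --- | --- | --- |
| Biological Process | mRNA metabolic process | 7.12E-23 | Translational elongation | 9.54E-45 |
| Biological Process | mRNA processing | 5.40E-21 | Translation | 8.27E-27 |
| Biological Process | RNA processing | 5.50E-21 | mRNA metabolic process | 5.95E-25 |
| Biological Process | Translational elongation | 1.00E-19 | RNA processing | 5.80E-24 |
| Biological Process | RNA splicing | 1.17E-17 | mRNA processing | 2.17E-23 |
| Cellular Component | Ribonucleoprotein complex | 6.71E-45 | Ribonucleoprotein complex | 4.76E-62 |
| Cellular Component | Organelle lumen | 4.56E-28 | Cytosol | 1.74E-54 |
| Cellular Component | Intracellular organelle lumen | 1.15E-27 | Cytosolic part | 3.09E-38 |
| Cellular Component | Membrane-enclosed lumen | 2.60E-27 | Intracellular organelle lumen | 3.82E-34 |
| Cellular Component | Cytosol | 7.40E-27 | Organelle lumen | 4.39E-34 |
| Molecular Function | RNA binding | 1.06E-38 | RNA binding | 3.40E-48 |
| Molecular Function | Nucleotide binding | 9.80E-16 | Structural constituent of ribosome | 1.34E-29 |
| Molecular Function | Enzyme binding | 3.33E-15 | Nucleotide binding | 6.21E-22 |
| Molecular Function | Structural constituent of ribosome | 2.57E-12 | Structural molecule activity | 2.23E-20 |
| Molecular Function | Unfolded protein binding | 6.83E-11 | Enzyme binding | 4.20E-18 |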
Table 5

Most significantly enriched KEGG pathways.

| Rank | Classical ALS term | P value | ALS+FTD term | P value |
| --- | --- | --- | --- | --- |
| 1 | Ribosome | 3.94E-14 | Ribosome | 2.81E-29 |
| 2 | Regulation of autophagy | 1.33E-07 | Neurotrophin signaling pathway | 4.73E-10 |
| 3 | Spliceosome | 1.62E-07 | Prostate cancer | 9.69E-09 |
| 4 | mTOR signaling pathway | 6.97E-06 | Regulation of autophagy | 1.91E-08 |
| 5 | Neurotrophin signaling pathway | 6.39E-05 | Spliceosome | 1.26E-06 |

3.6 Common proteins analysis

To investigate how these functionally diverse pathogenic proteins all lead to motoneuron degeneration, we analyzed their PPI profiles with the aim of identifying common ground downstream of ALS-causative proteins. For the classical ALS group, in addition to the 78 interacting proteins at k = 4 (Fig 2A), we included proteins involved in significant GO terms and KEGG pathways to form protein set S1. For the ALS+FTD group, 27 proteins from k = 5 (Fig 2B) along with proteins from significant GO terms and KEGG pathways formed protein set S2. The DFloyd algorithm was employed to investigate the interactions between the ALS-causative protein set (T) and the downstream protein sets (S1 and S2). The results suggested that UBC was the most interconnected protein in both the classical ALS and ALS+FTD groups. Other top-ranking common proteins shared by both groups were YWHAZ, PARK2, GBAS and GABARAP (Table 6). The detailed interaction profiles of selected proteins are shown in Fig 4B, in which UBC is connected to all ALS-causative proteins except ANG and CHCHD10.
Table 6

Downstream proteins ranked by number of direct interactions with ALS-causative proteins.

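| Rank | Classical ALS UniProt | Classical ALS protein | Classical ALS interacting # | ALS+FTD UniProt | ALS+FTD protein | ALS+FTD interacting # |
| --- | --- | --- | --- | --- | --- | --- |
| 1 | P0CG48 | UBC | 19** | P0CG48 | UBC | 22** |
| 2 | Q13501 | SQSTM1 | 11* | P63104 | YWHAZ | 10 |
| 3 | P63104 | YWHAZ | 9 | O60260 | PARK2 | 9 |
| 4 | Q00987 | MDM2 | 8 | O75323 | GBAS | 9 |
| 5 | O00443 | PIK3C2A | 8 | O95166 | GABARAP | 9 |
| 6 | O60260 | PARK2 | 8 | P11021 | GRP78 | 9 |
| 7 | O75323 | GBAS | 8 | P55854 | SUMO3 | 9 |
| 8 | O95166 | GABARAP | 8 | Q13618 | CUL3 | 9 |
| 9 | P55854 | SUMO3 | 8 | Q9H492 | MAP1LC3A | 9 |
| 10 | Q13618 | CUL3 | 8 | Q00987 | MDM2 | 9 |
| 11 | P60520 | GABARAPL2 | 8 | Q9Y4P8 | WIPI2 | 8 |
| 12 | Q86VP9 | PRKAG2 | 7 | Q676U5 | ATG16L1 | 8 |
| 13 | Q9BSB4 | ATG101 | 7 | Q14999 | CUL7 | 8 |
| 14 | Q9H1Y0 | ATG5 | 7 | Q15843 | NEDD8 | 8 |
| 15 | Q9H492 | MAP1LC3A | 7 | P60520 | GABARAPL2 | 8 |
| 16 | P11021 | GRP78 | 7 | Q9H0R8 | GABARAPL1 | 8 |
| 17 | Q676U5 | ATG16L1 | 7 | O00443 | PIK3C2A | 8 |
| 18 | Q9H0R8 | GABARAPL1 | 7 | Q9H1Y0 | ATG5 | 8 |
| 19 | P61956 | SUMO2 | 7 | P61956 | SUMO2 | 8 |
| 20 | Q14999 | CUL7 | 7 | Q13616 | CUL1 | 8 |
| 21 | Q15843 | NEDD8 | 7 | Q9BSB4 | ATG101 | 7 |
| 22 | Q9Y4P8 | WIPI2 | 7 | Q9UGJ0 | PRKAG1 | 7 |
| 23 | Q13573 | SNW1 | 6 | P07437 | TUBB | 7 |
| 24 | Q99459 | CDC5L | 6 | P07900 | HSP90AA1 | 7 |
| 25 | P15297 | NOF | 6 | Q13573 | SNW1 | 7 |
| 26 | P54619 | PRKAG1 | 6 | Q86VP9 | PRKAG2 | 7 |
| 27 | Q13286 | CLN3 | 6 | P54619 | PRKAG1 | 7 |
| 28 | Q9GZZ9 | UBA5 | 6 | Q13286 | CLN3 | 7 |
| 29 | Q9NR46 | SH3GLB2 | 6 | Q9GZZ9 | UBA5 | 7 |
| 30 | Q9Y484 | WDR45 | 6 | Q9NR46 | SH3GLB2 | 7 |
| 31 | Q9UGJ0 | PRKAG2 | 6 | Q99459 | CDC5L | 6 |
| 32 | O75147 | OBSL1 | 5 | P15297 | NOF | 6 |
| 33 | O75530 | EED | 5 | Q9Y484 | WDR45 | 6 |
| 34 | P02751 | FN1 | 5 | O75147 | OBSL1 | 6 |
| 35 | P13612 | ITGA4 | 5 | O75530 | EED | 6 |
| 36 | P19320 | VCAM1 | 5 | P02751 | FN1 | 6 |
| 37 | P27694 | RPA1 | 5 | P13612 | ITGA4 | 6 |
| 38 | P35244 | RPA3 | 5 | P19320 | VCAM1 | 6 |
| 39 | P35638 | DDIT3 | 5 | P35638 | DDIT3 | 6 |
| 40 | P54646 | PRKAA2 | 5 | P54646 | PRKAA2 | 6 |
| 41 | Q15831 | STK11 | 5 | Q15831 | STK11 | 6 |
| 42 | Q13616 | CUL1 | 5 | P62993 | GRB2 | 6 |
| 43 | O15530 | PDPK1 | 5 | P27694 | RPA1 | 6 |
| 44 | P62993 | GRB2 | 5 | P27361 | MAPK3 | 6 |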

** >mean ± τ ∙ SD.

* >mean ± (τ ∙ SD)/2.

Discussion

Essential proteins

The ECC method was used to identify essential proteins in PPI network that integrated global topological properties and cluster information[43]. Our results showed that VCP, SQSTM1/p62, hnRNPA1, FUS and TDP-43 were proteins with significantly high harmonic centrality in ECC method. VCP is known to associate with some forms of FTD, Paget's disease of the bone (PDB) and inclusion body myopathy (IBM) before the discovery that mutations in these same genes account for approximately 1–2% fALS patients[50, 51]. The finding of VCP mutations in ALS gives rise to an interesting phenomenon that the mutations on single gene can affect multiple tissues and result in distinctive diseases. VCP is associated with nucleocytoplasmic transport and putative ATP binding protein[52]. Bartolome et al. showed that VCP mutations were likely resulted in reduced ATP level in energy production due to dysregulated mitochondria[53], which might partially explain the profound effects of mutations in diverse tissues. The idea of “multisystem proteinopathy” is further reinforced by the discovery of HNRNPA1/HNRNPA2B1 mutations in some forms of ALS patients[54]. Firstly the involvement of hnRNPA1 brings up the importance of RNA processing in motoneuron degeneration that was previously highlighted by discovery of mutations of FUS and TDP-43 being implicated in both ALS and FTD[55-58]. Secondly the results suggest that certain ALS subtypes indeed have wide-spreading disease effects on not only motoneuron but also on muscles, brain and bones. These wide-spread effects imply that motoneuron degeneration in ALS, at least in some subtypes, might be the final outcome of a series of genetic deficits causing multisystem dysregulations instead of a single disease. 
Given above rationale that ALS might not be a single disease as well as the complexity of motoneuron degeneration etiology, it is reasonable to integrate diverse causative proteins to identify a hub network in which essential proteins strongly interact with each other and downstream proteins (Fig 2A and 2B). Such a hub network would ideally incorporate as many causative proteins as possible. Thus, essential proteins in the network provide important clues to understand mechanisms underlying motoneuron degeneration. SQSTM1/p62 was identified as having involvement in ALS, FTD and PDB[18, 59, 60]. P62, SQSTM1 encoding protein, has been reported to be strongly associated with ubiquitination and involve in autophagy, oxidative stress and NF-ƙB pathway[59, 61, 62]. In ALS and FTD, it is frequently found in inclusion bodies containing polyubiquitinated proteins along with UBQLN2[63]. The common pathological features of ALS and FTD strongly imply the overlapping between diseases spectrum[64]. TARDBP encoded protein, TDP-43, is found in the common pathological hallmark, ubiquitin-positive inclusion bodies, in both ALS and FTD[56]. TDP-43 is an important regulator of RNA metabolism and its association with ALS evokes major interest in the role of RNA processing in ALS[65]. Soon after the discovery, FUS mutations were also found in small cohort of fALS patients (~4%)[55, 58]. More importantly the majority of mutants are clustered at RNA-binding domain rich C-terminus, a feature similar to TDP-43 in ALS. However, the FUS mutation induced ALS seems to lack of TDP-43 positive inclusions. Further evidence showed that overexpressed wild-type FUS rescued TARDBP knockdown, but not vice versa, suggesting TDP-43 might be upstream of FUS[66]. In current work, we calculated the essential proteins based on PPI network built from ALS causative proteins. 
hnRNPA1, TDP-43 and FUS are all associated with RNA processing, while VCP and SQSTM1/p62 are involved in nucleocytoplasmic transport and autophagy, respectively. We therefore appear to have a chain of essential proteins involved in cellular activities ranging from the nucleus/RNA level to the cytoplasmic/protein level. This chain tightly regulates protein turnover, from upstream RNA metabolism to downstream protein chaperoning and clearance. Conceivably, disruption in any part of the chain would lead to catastrophic results. However, certain mutations affect only a very small number of ALS patients, which does not fit the so-called “centrality-lethality” theory in the field of PPI network analysis. The contradiction is likely due to redundancy among regulatory proteins, compensatory pathways or feedback regulation. Moreover, it is unclear why the same mutation would cause different levels of damage across tissues and result in distinct phenotypes, onset times and disease severity. One possible explanation is that each tissue has its own protein homeostasis/turnover profile, such that each mutation is likely to have a different level of impact on the PPI network. Thus, identifying essential proteins and the hub network, together with detailed profiling of the tissue of interest, is worth further investigation in order to precisely assess the level of damage that certain mutations cause in a specific tissue. In addition, the fact that mutations of essential proteins with various functions all lead to motoneuron degeneration in ALS suggests that there might be common downstream proteins where different pathogenic mechanisms converge.

Downstream common proteins

In the present work, we analyzed both classical ALS and ALS+FTD PPI networks and identified the aforementioned essential proteins. Further calculation revealed a set of common downstream proteins with strong interactivity with causative proteins. In the classical ALS group, UBC and SQSTM1/p62 stood out as the most interconnected downstream proteins. SQSTM1 as a causative gene is categorized in the ALS+FTD group but not in the classical ALS group; this result supports the ability of our algorithm to identify potential targets associated with motoneuron degeneration. A set of widely interconnected downstream proteins is shared by both groups: UBC, YWHAZ, PARK2, GBAS and GABARAP (Table 6). It seems reasonable that UBC is the most interconnected downstream protein in both the classical ALS and ALS+FTD groups, since it is a polyubiquitin precursor protein. However, it is intriguing that ubiquitin A-52 (UBA-52) and polyubiquitin B (UBB) are not at the top of the list, especially as UBB is involved in several neurodegenerative diseases. A UBB frameshift mutation (UBB+1) impedes proteasomal proteolysis and has been extensively identified as a pathological hallmark in Alzheimer disease (AD), FTD and Huntington disease (HD)[67]. However, UBB+1 transgenic mice with mutant huntingtin showed no aberrant phenotype except increased inclusion bodies, although they were more sensitive to the toxicity[68]. In contrast to the well-studied role of UBB in neurodegeneration, to the best of our knowledge there is no strong molecular evidence linking UBC to neurodegeneration to date. However, the same observation, namely high interactivity between UBC and causative proteins in neuronal degeneration, was also made in an in silico network analysis of AD[69]. UBC (not to be confused with ubiquitin-conjugating enzyme, also called UBC or more frequently E2) and UBB encode polyubiquitin precursor proteins with nine and three tandem ubiquitin repeats, respectively.
Transcription of both UBB and UBC has been shown to be induced and upregulated in response to various cellular stresses[70-74], in addition to their constitutive expression under normal conditions[75]. The presumed redundancy of the two genes seems insufficient to compensate for the loss of either one. Ubc knockout in mice is lethal at the embryonic stage due to disrupted fetal liver development[75, 76]. On the other hand, Ubb-null mice show damaged neurons within the arcuate nucleus of the hypothalamus[77]. These results strongly suggest that UBC and UBB are not functionally redundant. Since UBB and UBC are nonetheless functionally and structurally similar, the role and importance of UBB regulation in AD might reveal the importance of UBC in neurodegeneration. UBB+1 and UCHL1 are both important regulators, likely with opposite effects, of beta-amyloid production and amyloid precursor protein processing in AD[78]. Although UCHL1 is not known to directly cause ALS, UCHL1-null mice showed upper motoneuron vulnerability[79]. This stresses the importance of protein quality control, which involves both UCHL1 and polyubiquitin precursors. Moreover, UBC copes with various cellular stresses in a tissue-specific manner rather than through a generalized response[72, 80]. UBC has been shown to be positively regulated by Sp1, MEK1 and FOXO3a in rat L6 muscle cells[80-82], and FOXO3a is neuroprotective in models of motoneuron diseases[83]. Additionally, TDP-43 regulates protein quality control through a FOXO-dependent pathway[84]. Thus, it is plausible that FOXO-mediated UBC expression contributes to motoneuron proteasome regulation, which eventually affects the ubiquitin-proteasome system. Given that UBC is able to interact with almost all ALS-causative proteins (22 out of 24), further investigation is needed to profile UBC in motoneurons to better understand the behavior of this common downstream protein in ALS.
Another interesting finding is the presence of a 14-3-3 protein family member (YWHAZ) in the downstream protein analysis. 14-3-3 protein is known to interact with TDP-43 and SOD1 in ALS to modulate neurofilament light chain mRNA stability in G93A and A4T mSOD1 mice[85]; 14-3-3 protein was also found in Lewy bodies in sALS[86]. YWHAZ also interacts with FUS in ALS, whereas its isoform YWHAQ was reported to have significantly elevated mRNA levels in sALS patients[86, 87]. 14-3-3 protein is extensively involved in apoptosis and in protein and mRNA stabilization; however, much remains unknown about its role in ALS pathology. Notably, another isoform, YWHAE, is also present in the results of the cluster analysis and ECC, which suggests a deep involvement of this protein family in ALS.
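The downstream ranking discussed in this section (for example, UBC interacting with 22 of 24 causative proteins) can be read as a simple count of distinct causative partners per non-causative protein. The sketch below illustrates that reading under stated assumptions: the function names, the threshold `min_partners`, the placeholder proteins (`C1` to `C4`, `D1`, `D2`) and the edge list are all hypothetical, and the study's own calculation may differ in detail.

```python
def causative_partner_counts(edges, causative):
    """Count distinct causative partners for every non-causative protein."""
    partners = {}
    for a, b in edges:
        # Check the edge in both directions; skip causative-causative edges.
        for protein, other in ((a, b), (b, a)):
            if protein not in causative and other in causative:
                partners.setdefault(protein, set()).add(other)
    return {protein: len(found) for protein, found in partners.items()}


def shared_top_downstream(counts_a, counts_b, min_partners):
    """Proteins reaching the partner threshold in both groups, sorted by name."""
    top_a = {p for p, n in counts_a.items() if n >= min_partners}
    top_b = {p for p, n in counts_b.items() if n >= min_partners}
    return sorted(top_a & top_b)


# Hypothetical toy interactions, invented for illustration only.
# For simplicity the same edge list is reused for both groups.
TOY_EDGES = [
    ("C1", "D1"), ("D1", "C2"), ("C3", "D1"), ("C4", "D1"),
    ("C1", "D2"), ("C4", "D2"), ("C1", "C2"),
]
GROUP_A = {"C1", "C2", "C3"}        # stands in for one causative set
GROUP_B = {"C1", "C2", "C3", "C4"}  # stands in for the larger causative set

counts_a = causative_partner_counts(TOY_EDGES, GROUP_A)
counts_b = causative_partner_counts(TOY_EDGES, GROUP_B)
shared = shared_top_downstream(counts_a, counts_b, min_partners=2)
print(counts_a, counts_b, shared)
```

Counting distinct partners rather than raw edges keeps duplicated database entries for the same interaction from inflating a protein's rank.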

C9ORF72

Although considerable progress has been made in identifying ALS-causative genes over the past few years (TARDBP, FUS, VCP, UBQLN2, SQSTM1 and C9ORF72, for example), the underlying mechanisms remain elusive, as does the clinical spectrum of phenotypes. C9ORF72 is not only associated with a large portion of ALS patients (7% sALS/34% fALS) but is also implicated in FTD (25%). The functions of the C9ORF72-encoded protein are not fully clear; however, it has been shown to interact with hnRNPA1, hnRNPA2/B1 and ubiquilin-2 in immunoprecipitation from cell line models[88]. The GGGGCC repeats in C9ORF72 are translated into dipeptide repeat proteins (DPRs) in a repeat-associated non-ATG manner, which could impair RNA processing and lead to cell death[89, 90]. A recent study showed that C9orf72 mutation led to SQSTM1 (p62) pathology, as seen in ALS/FTD patients, through the Rab1a and ULK1 autophagy initiation complex[91]. Our analysis of the C9ORF72 interaction profile is based on the I2D database, which does not reflect the DPR proteins and PPIs described above. This should be taken as a caveat when interpreting our results. Nevertheless, the functional enrichment analysis suggests that ALS+FTD differs from classical ALS in GO terms and KEGG pathways. Specifically, unfolded protein response (UPR), membrane transport, splicing and RNA metabolism pathways are unique to the ALS+FTD group compared to classical ALS, which is highly enriched in autophagy, survival and nutrient-sensing pathways (Table 6). This result is consistent with the brain transcriptome study in ALS conducted by Prudencio et al., in which C9ALS mainly affected intracellular transport/localization and UPR pathways, and sALS involved cytoskeleton organization, defense response and synaptic transmission, while alternative splicing and RNA processing defects were found in both[30].
In addition, the surprisingly low ranking of C9ORF72 in the centrality and topology analysis suggests that certain forms of ALS associated with FTD might not involve exactly the same molecular mechanisms as classical ALS, although they share many pathological hallmarks and the same final outcome. If this holds true, future ALS therapeutic development might take individual genetic profiles into consideration to tailor treatment.
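The functional enrichment comparison in this section rests on asking whether a GO term or KEGG pathway is over-represented in a protein set. A common way to score this is a one-sided hypergeometric test; the sketch below assumes that test for illustration only, since this section does not state which statistic or tool was used. The function name, parameter names and the numbers in the example are hypothetical.

```python
from math import comb


def enrichment_p_value(population, annotated, sample, hits):
    """One-sided hypergeometric tail P(X >= hits).

    population: number of proteins in the background set
    annotated:  background proteins carrying the term or pathway
    sample:     number of proteins in the network being tested
    hits:       proteins in the network carrying the term or pathway
    """
    upper = min(annotated, sample)
    total = comb(population, sample)
    tail = sum(
        comb(annotated, k) * comb(population - annotated, sample - k)
        for k in range(hits, upper + 1)
    )
    return tail / total


# Hypothetical numbers, invented for illustration only.
p_value = enrichment_p_value(population=20, annotated=5, sample=4, hits=2)
print(round(p_value, 4))
```

In practice the resulting p-values would also need multiple-testing correction across all tested terms before two groups' enrichment profiles are compared.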

Conclusion

The PPI network analysis highlights a set of ALS-causative proteins as essential proteins, which form a complete regulatory chain of protein turnover. The result emphasizes the importance of protein turnover in motoneuron degeneration. More importantly, the hub network formed by essential proteins provides a converging point connecting other ALS-causative proteins to common downstream proteins. It might help to explain why these functionally diverse ALS mutations all lead to motoneuron degeneration. Our in silico analysis suggests a more active role for UBC in motoneuron degeneration, which has been overlooked. Considering its active regulatory roles in ubiquitination and transcription under various conditions, UBC is likely the common protein connecting most causative proteins to proteasome regulation. UBC itself might not be sufficient to cause motoneuron degeneration, but it can serve as a useful starting point for exploring UBC-related pathways that might shed light on the common mechanism underlying motoneuron degeneration.
  81 in total

1.  KEGG: kyoto encyclopedia of genes and genomes.

Authors:  M Kanehisa; S Goto
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  Virtual identification of essential proteins within the protein interaction network of yeast.

Authors:  Ernesto Estrada
Journal:  Proteomics       Date:  2006-01       Impact factor: 3.984

3.  Identification of functional modules in a PPI network by clique percolation clustering.

Authors:  Shihua Zhang; Xuemei Ning; Xiang-Sun Zhang
Journal:  Comput Biol Chem       Date:  2006-11-13       Impact factor: 2.877

4.  SQSTM1 mutations in frontotemporal lobar degeneration and amyotrophic lateral sclerosis.

Authors:  Elisa Rubino; Innocenzo Rainero; Adriano Chiò; Ekaterina Rogaeva; Daniela Galimberti; Pierpaola Fenoglio; Yakov Grinberg; Giancarlo Isaia; Andrea Calvo; Salvatore Gentile; Amalia Cecilia Bruni; Peter Henry St George-Hyslop; Elio Scarpini; Salvatore Gallone; Lorenzo Pinessi
Journal:  Neurology       Date:  2012-09-12       Impact factor: 9.910

5.  Ubiquitin mRNA is a major stress-induced transcript in mammalian cells.

Authors:  A J Fornace; I Alamo; M C Hollander; E Lamoreaux
Journal:  Nucleic Acids Res       Date:  1989-02-11       Impact factor: 16.971

6.  Ubiquitin (UbC) expression in muscle cells is increased by glucocorticoids through a mechanism involving Sp1 and MEK1.

Authors:  Anne C Marinovic; Bin Zheng; William E Mitch; S Russ Price
Journal:  J Biol Chem       Date:  2002-02-28       Impact factor: 5.157

Review 7.  FTD and ALS: a tale of two diseases.

Authors:  R Ferrari; D Kapogiannis; E D Huey; P Momeni
Journal:  Curr Alzheimer Res       Date:  2011-05       Impact factor: 3.498

8.  The mouse polyubiquitin gene UbC is essential for fetal liver development, cell-cycle progression and stress tolerance.

Authors:  Kwon-Yul Ryu; René Maehr; Catherine A Gilchrist; Michael A Long; Donna M Bouley; Britta Mueller; Hidde L Ploegh; Ron R Kopito
Journal:  EMBO J       Date:  2007-05-10       Impact factor: 11.598

9.  Tar DNA binding protein of 43 kDa (TDP-43), 14-3-3 proteins and copper/zinc superoxide dismutase (SOD1) interact to modulate NFL mRNA stability. Implications for altered RNA processing in amyotrophic lateral sclerosis (ALS).

Authors:  Kathryn Volkening; Cheryl Leystra-Lantz; Wenchang Yang; Howard Jaffee; Michael J Strong
Journal:  Brain Res       Date:  2009-10-06       Impact factor: 3.252

10.  Corticospinal Motor Neurons Are Susceptible to Increased ER Stress and Display Profound Degeneration in the Absence of UCHL1 Function.

Authors:  Javier H Jara; Barış Genç; Gregory A Cox; Martha C Bohn; Raymond P Roos; Jeffrey D Macklis; Emel Ulupınar; P Hande Özdinler
Journal:  Cereb Cortex       Date:  2015-01-16       Impact factor: 5.357

