Literature DB >> 33036560

Does conserved domain SOD1 mutation has any role in ALS severity and therapeutic outcome?

Surinder Pal1, Abha Tiwari2, Kaushal Sharma3, Suresh Kumar Sharma4.   

Abstract

BACKGROUND: Amyotrophic lateral sclerosis (ALS) is a progressive neurodegenerative fatal disease that can affect the neurons of brain and spinal cord. ALS genetics has identified various genes to be associated with disease pathology. Oxidative stress induced bunina and lewy bodies formation can be regulated through the action of SOD1 protein. Hence, in the present study we aim to analyse the structural and functional annotation of various reported SOD1 variants throughout and their putative correlation with the location of mutation and degree of ALS severity by inferring the structural and functional alterations in different SOD1 variants. <br> METHODS: We have retrieved around 69 SNPs of SOD1 gene from Genecards. Structural annotation of SOD1 variants were performed using SWISS Model, I-Mutant 2.0, Dynamut, ConSurf. Similarly, the functional annotation of same variants were done using SIFT, PHP-SNP, PolyPhen2, PROVEAN and RegulomeDB. Ramachandran plot was also obtained for six synonymous SNPs to compare the amino acid distribution of wild-type SOD1 (WT SOD1protein. Frequency analysis, Chi square analysis, ANOVA and multiple regression analysis were performed to compare the structural and functional components among various groups. RESULTS AND <br> CONCLUSION: Results showed the mutations in conserved domain of SOD1 protein are more deleterious and significantly distort the tertiary structure of protein by altering Gibb's free energy and entropy. Moreover, significant changes in SIFT, PHP-SNP, PolyPhen2, PROVEAN and RegulomeDB scores were also observed in mutations located in conserved domain of SOD1 protein. Multiple regression results were also suggesting the significant alterations in free energy and entropy for conserved domain mutations which were concordant with structural changes of SOD1 protein. Results of the study are suggesting the biological importance of location of mutation(s) which may derive the different disease phenotypes and must be dealt accordingly to provide precise therapy for ALS patients.

Entities:  

Keywords:  Amyotrophic lateral sclerosis; PHP-SNP; PolyPhen2; QMEAN; Ramachandran plot; SOD1

Year:  2020        PMID: 33036560      PMCID: PMC7547430          DOI: 10.1186/s12868-020-00591-3

Source DB:  PubMed          Journal:  BMC Neurosci        ISSN: 1471-2202            Impact factor:   3.288


Introduction

Amyotrophic lateral sclerosis (ALS, also known as Lou Gehrig’s disease), a term coined by Jean Martin Charcot in 1874, is considered to be one of the most fatal neurodegenerative diseases. It can be characterized by progressive degeneration of both upper and lower motor neurons (amyotrophic) leading to hardening of lateral columns and creeping paralytic condition (lateral sclerosis). For ALS patients, maximum duration of survival is around 3–5 years [1, 2]. As the disease progresses, phosphorylation of neurofilaments stimulates the accumulation of bunina and lewy bodies in pathophysiology of ALS patients consequently induce inflammation in perikarya and proximal axons. Spheroids and ubiquitinylated strands can also stimulate the inflammation in these patients later which can be mediated through triggered response of microglia and astrocytes [3]. Terminal stage in ALS patients can feature failure of cardio-respiratory system after 2-5 years of onset of disease, resulting into death [2]. Three phenotypic variants of ALS have been described which include: pacific (associated with dementia), familial (mostly autosomal dominant, fALS), and sALS (sporadic ALS) [1]. Approximate, 10% of familial ALS cases demonstrate the Mendelian inheritance. Interestingly, 20% of fALS cases have exhibited the SOD1 mutation (superoxide dismutase 1 contains Cu and Zn in catalytic site). Additionally, mutation in TARDBP has also accounted for 5-10% of fALS cases along with the mutations in FUS (5%) and ANG (1%) [4]. However, sALS has also shown 5% cases with SOD1 mutation. Most of SOD1 mutations showed dominant clinical phenotype in ALS disease [5] except D90A mutation which could exhibit either recessive [6] or dominant [7, 8] phenotype. Mutations in 13 genes and their loci have been identified as the causal genetic factor for typical ALS clinical phenotype which primarily involves SOD1, TARDBP, ANG, FUS, and OPTN etc. [4]. Recent genome wide association studies (GWAS) have also suggested and associated seven novel genes with ALS pathology including C21orf2, TUBA4A, CHCHD10, NEK1, TBK1, MATR3, and CCNF [9]. Intriguingly, hexa-nucleotide expansion (G4C2) of C9ORF72 gene (by mean of loss or gain of function mutations leading to develop somatic mosaicism) has also been associated with fALS (with 3.2% prevalence rate in Indian ALS) which may stimulate pathology by TDP-43 accumulation, impaired RNA metabolism and defective proteosomal degradation mechanism [10-12]. C9ORF72 association with fALS is signifying the biological importance of non-coding mutation, though it’s not theme of current study. The present study aims to examine the changes in functional and structural domains of SOD1 protein for the reported SNPs in various studies. Additionally, we also attempted to investigate significance of mutation location in maintaining the free energy and entropy of protein’s tertiary structure. The impact of synonymous SNPs on protein structure as they do not exert any changes in the tertiary structure has also been explored to demonstrate their biological significance in SOD1 protein structure. Moreover, we also tried to decipher the biological significance of individual scores obtained from various bioinformatics tools to show their impact on structural and functional aspects of SOD1 protein and their implication in ALS severity which can be used to differentiate transmittant ALS phenotypes and can provide the substrate for development of personalized medicine.

Methodology

We have retrieved 69 clinically associated exonic SNPs of SOD1 with ALS pathology in various populations from Genecards [13] (Accession number for protein from NCBI NP_000445.1, Uniprot ID: P00441) (Additional file 1: Table S2). Structural and functional parameters of SOD1 genetic variations were collected from different web servers (Additional file 1: Table S1). Consurf [14] was utilized to identify the position of evolutionary conserved amino acid based on their respective SNPs (Additional file 1: Figure S1). FASTA format has been taken into consideration to annotate SOD1 protein structure and its comparison with different variants along with the severity of ALS pathology. Protein Data Bank (PDB) structure models were generated using Swiss Model [15]. All the structural components of protein including the Cβ, all atoms, salvation and torsion angles were retrieved from Swiss Model web server. Collectively these values define the Q mean (Z-score) of protein and describe the degree of naiveness of the protein model based on genetic polymorphism and ultimately signifying the protein stability. The corresponding free energy (Gibb’s free energy) and the entropy with changes in protein sequence were also obtained from Dynamut [16] and I-Mutant 2.0 [17] (Additional file 1: Table 1). Functional association of free energy and entropy changes can be correlated with protein stability and molecular flexibility of mutated protein. Functional annotation of studied mutations has also been analysed through algorithms derived from SIFT [18], PROVEAN [19] and PolyPhen-2 [20] to express the effect of non-synonymous SNPs (nsSNP) to define deleterious and/or neutral nature of mutation. Moreover, reliability index (RI) were also retrieved through PhD-SNP [21] to demonstrate the deleterious effect of nucleotide variation (SNPs) to define the nature of the mutation as neutral or diseased (Tables 2, 3 and 4). Predictive functional annotation of SNPs was also determined by using RegulomeDB [22]. Scoring of SNPs through RegulomeDB can be represented as transcription factor (TF) binding site or regulatory regions (promoter, operator and enhancer sequences) or DNase hypersensitive region etc. Synonymous mutations were scored using RegulomeDB where structural and functional annotations cannot be predicted for such variations (Tables 2, 3 and 4). String [23] was used to see the biological interaction of SOD1 with other biomolecules which may collectively regulate the oxidative stress and the processes mediated by them. Moreover, to identify the evolutionary conserved pattern of SOD1, cladogram among various species was analyzed using MEGAX [24].
Table 1

Tabular representation QMEAN, Cβ, solubility and torsion obtained from SWISS model algorithm and domain location of studied SOD1 variants

MutationConSurfNatureQMAll atomSolvTorMutationConSurfNatureQMAll atomSolvTor
L39VCon.NP2.672.71−0.890.382.15V149GCon.NP2.462.98−0.8502.02
G38RConPB1.772.45−0.970.331.28G148DCon.PA2.532.7−1.020.392.01
Q23RCon.PB2.562.68−0.960.362.06N140LCon.NP2.372.56−1.090.371.88
Q23LCon.NP2.892.9−0.960.372.36V149ICon.NP2.692.82−0.90.422.14
N87SCon.P2.692.75−0.960.332.19T55RCon.PB2.722.71−0.90.412.2
N87ICon.NP2.52.64−0.870.461.96S135NCon.P2.552.73−0.870.332.04
G42SCon.P2.042.9−0.920.381.45S135TCon.P2.722.43−0.930.322.29
G42DCon.PA2.132.87−0.970.381.56L85VCon.NP2.672.72−0.930.372.16
H44RCon.PB2.362.51−0.930.331.89G17SCon.P2.522.65−1.040.232.07
G86SCon.P2.012.79−0.90.281.48G17CCon.P2.552.57−1.070.212.13
G86RCon.PB2.352.39−1.060.321.92L127SCon.P2.93.11−0.80.332.33
L107VCon.NP2.732.8−0.880.352.21I114TAvg.P2.752.69−0.920.292.28
L107FCon.NP2.452.28−0.890.332.03I105FAvg.NP2.652.57−0.940.272.21
A5VCon.NP2.552.79−0.920.232.07C7TAvg.P2.442.75−0.90.271.95
H47RCon.PB2.722.71−0.910.242.26C7FAvg.NP2.432.41−0.960.12.07
A5TCon.P2.562.56−0.970.242.13I152TAvg.P2.592.71−0.880.342.08
A5SCon.P2.822.65−0.930.322.34G13RAvg.PB2.523.09−0.870.591.84
A146TCon.P2.522.84−1.050.262.02S106LAvg.NP2.492.82−0.980.361.96
G73SCon.P2.422.46−0.920.261.99S106LAvg.NP2.492.82−0.980.361.96
G73CCon.P2.542.45−0.910.32.1I113TAvg.P2.72.76−0.970.342.19
F46SCon.P2.762.84−0.930.352.24I36FAvg.NP2.642.61−1.080.292.19
F46CCon.P2.792.75−0.960.372.29F65LAvg.NP2.712.89−0.890.362.17
H81RCon.PB2.692.79−0.890.32.18E101GVarNP2.532.49−1.01−0.052.21
I150TCon.P2.472.69−0.920.341.97L145SVarP2.612.71−0.860.322.12
V119LCon.NP2.572.71−0.880.352.06E22KVarPB2.612.47−1.120.372.16
A90VCon.NP2.822.8−1.030.222.36D97NVarP2.672.71−0.870.32.18
R116GCon.NP2.822.59−0.930.412.33L145FVarNP2.662.85−0.910.362.12
R116CCon.P2.562.57−0.980.472.04G74RVarPB2.492.59−0.980.242.04
L85FCon.NP2.62.71−0.930.282.12D91AVarNP2.522.99−0.780.271.98
V88ACon.NP2.572.68−0.890.222.11N20SVarP2.72.76−0.810.362.18
I150TCon.P2.472.69−0.920.341.97
Table 2

Tabular representation of PROVEAN, SIFT and PolyPhen-2 scores along with Gibb’s free energy and entropy values for studied variants which fall in conserve domain of SOD1 protein

MutationDDG (Kcal/mol)StabilityΔΔG ENCoM (kcal/mol)StabilityΔΔSVib ENCoM (kcal.mol1.K1)Mol flexProveanEff.RIEff.SIFTEff.Pphen2Eff.Reg
L39V−0.84Dec.0.01D−0.013Dec.−2.77Del6Dis0.02APF0.998Pro. D5
G38R−1.14Dec.0.32D−0.406Dec.−7.17Del8Dis0APF1Pro. D5
Q23R−1.26Dec.−0.01D0.007Inc.−3.18Del7Dis0APF0.018Ben4
Q23L−0.01Dec.−0.144D0.180Inc.−4.76Del5Dis0.01APF0.117Ben4
N87S−0.47Dec.−0.05D0.066Inc.−4.94Del6Dis0APF1Pro. D4
N87I1.44Inc.0.19D−0.241Dec.−8.91Del7Dis0APF1Pro. D4
G42S−0.27Dec.−0.03D0.032Inc.−5.57Del7Dis0.01APF0.999Pro. D5
G42D−1.09Dec.0.03D−0.038Dec.−6.52Del6Dis0APF0.939Pos. D5
H44R−0.86Dec.0.17D−0.206Dec.−7.40Del7Dis0.26Tol1Pro. D5
G86S−2.02Dec.0.43D−0.541Dec.−5.97Del8Dis0APF1Pro. D4
G86R−1.71Dec.0.97S−1.209Dec.−7.91Del8Dis0.03APF1Pro. D4
L107V−1.01Dec.−0.40D0.499Inc.−2.96Del6Dis0.01APF0.997Pro. D4
L107F−0.47Dec.0.581S−0.726Dec.−3.93Del8Dis0APF1Pro. D4
A5V−0.62Dec.0.95S−1.186Dec.−3.20Del7Dis0APF0.999Pro. D4
H47R−0.75Dec.0.53S−0.663Dec.−7.52Del7Dis0APF1Pro. D5
A5T−0.55Dec.0.39D−0.485Dec.−3.08Del7Dis0APF0.993Pro. D4
A5S−0.84Dec.0.04D−0.045Dec.−2.30Neu7Dis0APF0.393Ben4
A146T−0.68Dec.0.384D−0.48Dec.−3.39Del7Dis0APF1Pro. D4
G73S−0.05Dec.0.036D−0.045Dec.−2.80Del7Dis0APF1Pro. D4
G73C0.07Inc.−0.14D0.18Inc.−2.78Del5Dis0.02APF0.999Pro. D4
F46S−2.78Dec.−0.08D0.096Inc.−2.70Del7Dis0APF0.749Pos. D4
F46C−1.44Dec.0.26D−0.326Dec.−5.06Del7Dis0APF0.998Pro. D4
H81R−1.45Dec.0.21D−0.265Dec.−7.33Del7Dis0APF1Pro. D4
I150T−2.84Dec.−0.22D0.274Inc.−5.76Del8Dis0APF1Pro. D2b
V119L−0.43Dec.0.28D−0.345Dec.−5.55Del8Dis0APF0.97Pro. D4
A90V−0.7Dec.0.33D−0.407Dec.−8.41Del8Dis0APF1Pro. D4
R116G−2.02Dec.−0.83D1.033Inc.−7.50Del8Dis0APF0.999Pro. D5
R116C−2.12Dec.−0.61D0.759Inc.−7.51Del7Dis0APF1Pro. D5
L85F−1.03Dec.−0.25D0.313Inc.−7.87Del8Dis0APF1Pro. D4
V88A−3.18Dec.−0.12D0.154Inc.−4.17Del8Dis0APF1Pro. D3a
I150T−1.53Dec.0.396D−0.495Dec.−2.95Del5Dis0APF1Pro. D4
V149G−1.13Dec.0.58S−0.724Dec.−3.88Del7Dis0.21Tol1Pro. D3a
G148D−0.88Dec.−0.41D0.51Inc.−6.92Del9Dis0APF1Pro. D4
N140L−1.13Dec.−0.15D0.188Inc.−7.91Del7Dis0APF1Pro. D4
V149I−0.73Dec.0.39D−0.482Dec.−3.83Del6Dis0APF1Pro. D4
T55R−1.96Dec.−0.38D0.476Inc.−3.96Del0Neu0APF0.999Pro. D3a
S135N−3.18Dec.−0.12D0.154Inc.−4.17Del8Dis0APF1Pro. D3a
S135T−2.62Dec.−0.33D0.414Inc.−6.08Del7Dis0APF1Pro. D3a
L85V−1.11Dec.0.043D−0.054Dec.−5.92Del7Dis0APF1Pro. D
G17S1.67Inc.−0.17D0.214Inc.−8.30Del5Dis0APF1Pro. D4
G17C−0.55Dec.−0.05D0.064Inc.−0.87Neu5Neu0.43Tol1Pro. D3a
L127S0.55Inc.0.105D−0.132Dec.−5.66Del5Dis0.02APF0.869Pos. D2b
Table  3

Tabular representation of PROVEAN, SIFT and PolyPhen-2 scores along with Gibb’s free energy and entropy values for studied variants which fall in average domain of SOD1 protein

MutationDDG (Kcal/mol)StabilityΔΔG ENCoM kcal/molStabilityΔΔSVib ENCoM (kcal.mol−1.K−1)Mol flexProveanEff.RIEff.SIFTEff.Pphen2Eff.Reg.
I114T−1.82Dec−0.345Destab0.43Inc−4.85Del4Dis0APF0.999Pro. D2b
I105F−3.1Dec0.395Destab−0.49Dec−3.61Del8Dis0APF0.999Pro. D4
C7T−0.13Dec0.114Destab−0.14Dec−7.92Del4Dis0.06Tol1Pro. D4
C7F0.7Inc1.306Stab−1.63Dec−8.79Del8Dis0.01APF1Pro. D4
I152T−3.12Dec−0.177Destab0.221Inc−3.98Del4Dis0.03APF0.969Pro. D4
G13R−1.96Dec0.171Destab−0.21Dec−2.99Del6Dis0.03APF0.968Pro. D4
S106L−2.1Dec−0.153Destab0.19Inc−4.89Del5Dis0.08Tol0.924POS. D4
S106L−2.1Dec−0.153Destab0.19Inc−4.89Del2Dis0.08Tol0.924POS. D4
I113T−2.42Dec−0.209Destab0.26Inc−4.72Del6Dis0APF1Pro. D2b
I36F−0.51Dec0.517Stab−0.65Dec−3.46Del7Dis0.02APF0.998Pro. D4
F65L−1.44Dec−0.396Destab0.495Inc−5.70Del5Dis0APF0.995Pro. D5
Table 4

Tabular representation of PROVEAN, SIFT and PolyPhen-2 scores along with Gibb’s free energy and entropy values for studied variants which fall in variable domain of SOD1 protein

MutationDDG (Kcal/mol)StabilityΔΔG ENCoM kcal/molStabilityΔΔSVib ENCoM (kcal.mol−1.K−1)Mol flexProveanEff.RIEff.SIFTEff.Pphen2Eff.Reg.
E101G−1.46Dec−0.381Destab0.476Inc−3.072Del2Dis0.18Tol0.439Ben4
L145S−2.75Dec−0.523Destab0.654Inc−4.146Del6Dis0.05Tol0.999Pro d4
E22K−1.14Dec0.084Destab−0.104Dec−2.154Neu2Neu0.32Tol0.9POS D4
D97N−2.41Dec0.108Destab−0.135Dec−0.31Neu5Neu0.44Tol0Ben4
L145F−1.13Dec0.382Destab−0.478Dec−2.951Del8Dis0.06Tol0.999Pro d4
G74R−0.71Dec1.48Stab−1.85Dec−5.79Del4Dis0.06Tol1Pro d
D91A−2.46Dec−0.148Destab0.185Inc−2.437Neu6Dis0.13Tol0Ben3a
N20S−0.39Dec−0.031Destab0.038Inc−1.081Neu9Neu0.4Tol0.003Ben4
Tabular representation QMEAN, Cβ, solubility and torsion obtained from SWISS model algorithm and domain location of studied SOD1 variants Tabular representation of PROVEAN, SIFT and PolyPhen-2 scores along with Gibb’s free energy and entropy values for studied variants which fall in conserve domain of SOD1 protein Tabular representation of PROVEAN, SIFT and PolyPhen-2 scores along with Gibb’s free energy and entropy values for studied variants which fall in average domain of SOD1 protein Tabular representation of PROVEAN, SIFT and PolyPhen-2 scores along with Gibb’s free energy and entropy values for studied variants which fall in variable domain of SOD1 protein

Statistical analysis

Frequencies and their association were calculated using Fisher’s test analysis. One-way ANOVA was employed to examine the changes in the structural and functional parameters including QMEAN, Gibb’s free energy (ΔΔG), entropy (ΔS), SIFT and PROVEAN scores. Independent t test was also carried out to calculate the mean difference of the above mentioned parameters between the groups classified based on the location of the amino acid (e.g. conserved and variable region). Logistic regression analysis was also conducted to identify the changes in various parameters by considering one of them as dependent factor and to demonstrate their diagnostic efficacy in identifying ALS cases more precisely. We calculated fold changes in both structural and functional parameters by deducing the values of mutant SNPs from wild type (WT).

Results

Frequency and association of SOD1 variants

Studied exonic mutations are mostly falling in conserved domain (49 variants) (by ConSurf) of SOD1 protein which have been found to affect the protein structure (52 variants) (by SIFT) and show deleterious effect (62 variants) (by PROVEAN) on SOD1 protein in ALS pathology (Table 5). Gibb’s free energy (EcoDDG) of these variants has also suggested that 60 variants are destabilizing the protein structure, though entropy changes have shown both decreasing (n = 38) and increasing (n = 30) trends in almost equal proportions. Results of RegulomeDB has also indicated that the most of these variants are falling under the score 4 (n = 46) which represents the binding site for TF and DNAse peak, on the contrary to lesser frequencies of 2b (TF binding + matched TF motif + matched DNase Footprint + DNase peak), 3a (TF binding + any motif + DNase peak) (n = 11) and 5 (TF binding or DNase peak) (n = 9).
Table 5

Frequency of studied SOD1 variants based on their structural and functional annotation parameters by various bioinformatics tools

ParameterComponent(s)Frequency
Domain locationVariable8
Average11
Conserve49
PROVEAN phenotypeDeleterious62
Neutral6
SIFT phenotypeAffect protein52
Tolerant16
DDG iMutant stabilityDecrease63
Increase5
EcoDDG stabilityDestabilize60
Stabilize8
Eco entropy dynamut FlexibilityDecrease38
Increase30
HumDiv phenotypeBenign7
Possibly damaging8
Probably damaging53
RegulomeDB2b + 3a11
446
59
Frequency of studied SOD1 variants based on their structural and functional annotation parameters by various bioinformatics tools Interestingly, Fisher’s test analysis has revealed that PROVEAN (p = <0.0001), SIFT (p = <0.0001) and PHD-SNP (p = 0.002) significantly associated with the conserved domain of SOD1 protein which may define the severity of ALS phenotype. Additionally, HumDiv phenotype to predict the PolyPhen-2 model has demonstrated that around 41 SOD1 variants are falling in conserved domain of protein and signifying most of SOD1 variants are probably damaging in nature (Table 6).
Table 6

Frequency distribution of various structural and functional parameters among groups classified based on mutation location (conserved or variable domains) and their association by Fisher’s exact test

ParametersNumberDomain locationP
VariableAverageConserve
Nature of MutationPolar36180.806
Non Polar3421
Charged2110
PROVEAN PhenotypeDeleterious41147<0.0001
Neutral402
PHD-SNP PhenotypeDisease511470.002
Neutral302
SIFT PhenotypeAffect Protein0844<0.0001
Tolerant835
DDG iMutant StabilityDecrease810450.694
Increase014
EcoDDG StabilityDestabilize79440.757
Stabilize125
EcoEntropy Dynamut FlexibilityDecrease45290.666
Increase4620
HumDiv PhenotypeBenign4030.002
Possibly Damaging125
Probably Damaging3941
RegulomeDB2b + 3a1280.769
46832
5018
Frequency distribution of various structural and functional parameters among groups classified based on mutation location (conserved or variable domains) and their association by Fisher’s exact test Phylogenetic relationship through Cladogram has revealed that human SOD1 is closely related to Pongo abelli, Similarly SOD1 of Rattus norvegicus has also showed phyologenetic proximity to Mus musculus, however, both showed some divergence from the human SOD1 (Additional file 1: Figure S2).

Alterations of structural and functional parameters

Mutation in conserved domain influences PROVEAN score

Comparative analysis of structural and functional values derived from various annotating software like Dynamut, PolyPhen-2, Reliability index (RI) of PHD-SNP, PROVEAN and SIFT have indicated significant alterations based on the location and nature of mutations (both in nucleotide and amino acid) in SOD1 protein. PROVEAN score has been found to be significantly varied in HumDiv phenotypes i.e. benign, possibly damaging and probably damaging (Fig. 1a) and conserved domains (Fig. 1b). Significant alteration in PROVEAN score has been found (between probably damaging and benign) among HumDiv phenotypes derived from PolyPhen-2 algorithm in SOD1 variants. Similarly, PROVEAN score has also significantly changed in both average and conservative domains as compared to benign domain. Results suggest that biological significance of mutations located in conserved domain which could lead to drastic changes in structural and functional components of SOD1 protein can be correlated with the phenotypic severity of ALS patients.
Fig. 1

PROVEAN score alterations among a HumDiv phenotypes and b conserved domain based groups. Var: variable domain (n = 8); Avg: average domain (n = 11) analysed by ANOVA; Con: conserved domain (n = 49). Bar is representing SEM; P value * < 0.05, ** < 0.001, *** < 0.0001

PROVEAN score alterations among a HumDiv phenotypes and b conserved domain based groups. Var: variable domain (n = 8); Avg: average domain (n = 11) analysed by ANOVA; Con: conserved domain (n = 49). Bar is representing SEM; P value * < 0.05, ** < 0.001, *** < 0.0001

SOD1 Mutation in conserved domain enhances ALS severity by disrupting structural and functional parameters of protein

To compare the effect of SOD1 mutant variants in conserved domain versus variable-average region of protein, ANOVA results have significantly changed in PROVEAN and SIFT scores. Functional annotation of SOD1 mutations through PROVEAN and SIFT algorithms are suggesting that changes in structural and functional parameters of SOD1 protein can be correlated with the ALS severity and distorted version of protein which can be corresponding to the location of mutation (conserved versus variable-average) (Fig. 2). Marginally significant alteration reliability index (RI) of PolyPhen-2 has also been observed between conserved versus variable-average domain. Significant alterations in HumDiv sensitivity and specificity derived from PolyPhen-2 algorithm has also suggested highly distorted structure of SOD1 protein when mutation was located in conserved domain as compared to variable-average (Fig. 2).
Fig. 2

Structural and functional parameters comparison of SOD1 variants fall in conserved and variable-average domains analyzed by independent T-test. a PROVEAN and PolyPhen-2, b Comparison of SIFT, HumDiv sensitivity and HumDiv specificity in between conserved and variable-average domains. Var-avg: variable-average domains (n = 19); Conserved domain (n = 49); Bar is representing SEM; P value * < 0.05, ** < 0.001, *** < 0.0001

Structural and functional parameters comparison of SOD1 variants fall in conserved and variable-average domains analyzed by independent T-test. a PROVEAN and PolyPhen-2, b Comparison of SIFT, HumDiv sensitivity and HumDiv specificity in between conserved and variable-average domains. Var-avg: variable-average domains (n = 19); Conserved domain (n = 49); Bar is representing SEM; P value * < 0.05, ** < 0.001, *** < 0.0001

SOD1 variants alter structural component Cβ

Results of Pearson’s chi- square have shown that most of the SOD1 variants are located in conserved domain of protein. ANOVA results have demonstrated that the structural component of SOD1 protein, namely Cβ, derived from SWISS model has significantly altered in RegulomeDB 4 score as compare to score 2b +3a (Fig. 3a). Alterations in ΔΔG ENCoM (kcal/mol) and ΔΔSVib ENCoM (kcal.mol−1.K−1) have also been observed among RegulomeDB groups, though not of statistical significance (Fig. 3b, c). Results are indicating that the conserved domain mutation(s) can alter the structural component Cβ of SOD1 protein and may influence the binding of TF and DNase hypersensitivity.
Fig. 3

SOD1 variants fall in conserved domain can alter the action of regulatory sequences by distorting the protein structure. a Significant changes in Cβ between 2b + 3a and 4 RegulomeDB score. (b, c) Gibb’s free energy and entropy alterations among RegulomeBD groups. p-value computed by employing ANOVA. Bar is representing SEM; P value * < 0.05, ** < 0.001, *** < 0.0001. RegulomeDB 2b + 3a (n = 11); RegulomeDB 4 (n = 46); RegulomeDB 5 (n = 9)

SOD1 variants fall in conserved domain can alter the action of regulatory sequences by distorting the protein structure. a Significant changes in Cβ between 2b + 3a and 4 RegulomeDB score. (b, c) Gibb’s free energy and entropy alterations among RegulomeBD groups. p-value computed by employing ANOVA. Bar is representing SEM; P value * < 0.05, ** < 0.001, *** < 0.0001. RegulomeDB 2b + 3a (n = 11); RegulomeDB 4 (n = 46); RegulomeDB 5 (n = 9) The impact of various algorithms to predict the functional and structural components was further correlated with SOD1 variants located in different domains including variable and conserved etc. Multiple regression analysis has shown that SWISS model component QMEAN has been found to be associated with PROVEAN score and can alter the same by 3.299 (p = 0.003). Moreover, ΔΔG ENCoM (kcal/mol) has also exhibited the association with QMEAN (p = 0.016; B = −0.156). Importantly, standardized coefficient beta (β) has also demonstrated the difference of 0.358. Entropy changes (ΔΔSVib ENCoM (kcal.mol−1K−1) of SOD1 variants showed the significant association with DDG value derived from iMutant (p = 0.02) and can alter the value by −0.582 unit. Results are suggesting the alterations in corresponding stability and flexibility of SOD1 protein with respect to nature and location of mutation. Importantly, predicted structural and functional changes of SOD1 protein can be further correlated with expression levels in ALS pathology which may be useful for predicting precise ALS phenotype (Table 7).
Table 7

Multiple regression analysis to demonstrate the association of structural and functional parameters of SOD1 variants and their correlation with ALS

Coefficients
Dependent variable: PROVEAN Score
Unstandardized coefficientsStandardized coefficientsTP
BStd. Errorbeta
(Constant)−13.982.891−4.8360
QMEAN3.2991.0580.3583.1190.003
Dependent variable: QMEAN
 (Constant)2.9210.06544.7570
 PROVEAN Score0.0370.0120.3363.0250.004
 Eco DDG DynaMut−0.1560.063−0.274−2.470.016
Dependent variable: DDG iMutant value
 (Constant)−1.3760.127−10.8110
 Eco Entropy Dynamut−0.5820.244−0.281−2.3820.02
Dependent variable: PolyPhen-2 Score
 (Constant)0.6210.0768.140
 PROVEAN Score−0.0540.014−0.429−3.8630.02
Multiple regression analysis to demonstrate the association of structural and functional parameters of SOD1 variants and their correlation with ALS Structural and functional annotations of synonymous SNPs (sSNPs) cannot be done with existing tools to predict their structural and functional alterations. In current data set, six synonymous SOD1 variants were retrieved which were located in TF binding site and DNase hypersensitivity (Regulome 4 score). Interestingly, five out of six SOD1 sSNPs were falling in conserved domain and one was located in variable region. Results are suggesting the regulatory function of sSNPs by modulating the SOD1 expression and associated cellular mechanism (Table 8).
Table 8

Clinically associated synonymous SNPs of SOD1 in ALS pathology with their Consurf and RegulomeDB score

MutationCodonConSurfRegulome score
L85L253T > CConserve4
L85L255G > AConserve4
A141A423T > AConserve4
N140N420C > TConserve4
E22E66G > AVariable4
N132N396T > CConserve4
Clinically associated synonymous SNPs of SOD1 in ALS pathology with their Consurf and RegulomeDB score Moreover, it was found that SNP at 94 position of SOD1 protein has six different variants as reported by various studied. (Table 9), suggesting a potential hot spot in ALS pathology. Comparative analysis has indicated that structural and functional annotations of these variants have drastically changed as compared to WT protein structure of SOD1. Results have shown increased scores of QMEAN, Cβ, salvation and torsion angel parameters as compared to WT SOD1 protein. All structural factors derived from SWISS modeling in WT protein structure are required to maintain amino acids geometry, for necessary hydrogen and hydrophobic bonding to guide the secondary and tertiary structures of SOD1 protein. Predictive ΔΔG ENCoM (kcal/mol) and ΔΔSVib ENCoM (kcal.mol−1.K−1) by Dynamut also got altered which was concordant with structural changes in protein derived from these SOD1 variants. Results can indicate the distorted molecular flexibility and decreased stability of SOD1 protein derived from these six SNPs. Importantly, these variants were falling in the conserved region of SOD1 protein and primarily affect the TF binding and DNase hypersensitivity.
Table 9

Tabular representation of PROVEAN, SIFT and PolyPhen-2 scores along with Gibb’s free energy, entropy values and Ramachandran plot details for studied variants which fall at 94 position of SOD1 protein

MutationΔΔG ENCoM (kcal/mol)StabilityΔΔSVib ENCoM (kcal.mol1.K1)Mol flexProveanRIEff.SIFTEffPphen2Eff.Reg.QMAll AtomsSolvTor
Wild Type0.932.41−1.29−0.130.58
G94S0.27D−0.335Dec.−5.337Dis0.05Tol0.856Pos. D42.172.68−0.930.281.67
G94R0.25D−0.315Dec.−7.228Dis0APF1Pro. D42.212.69−0.960.41.67
G94C0.20D−0.245Dec.−8.189Dis0.05Tol1Pro. D42.322.8−0.910.311.8
G94N0.18D−0.222Dec.−5.168Dis0APF0.664Pos. D42.322.76−0.930.351.79
G94A−0.05D0.067Inc.−5.486Dis0.04APF0.991Pro. D42.282.53−0.51−01.91
G94V0.38D−0.471Dec.−8.278Dis0APF1Pro. D42.082.68−0.960.351.56
Tabular representation of PROVEAN, SIFT and PolyPhen-2 scores along with Gibb’s free energy, entropy values and Ramachandran plot details for studied variants which fall at 94 position of SOD1 protein Comparative Ramachandran plot analysis has also suggested the changes in distribution of amino acids in favored, allowed and outlier regions as shown in Fig. 4. Results of Table 9 demonstrating the increased number of amino acids fall in favored region in these six SNPs as compared to WT (286 amino acids). Similarly, drastic changes in number of amino acids can be seen in SOD1 variants in both allowed and outlier regions of these six SNPs in comparison to WT (14 and 2, respectively). Results signify changes in structural parameters in all six variants as compared to WT which can affect the bonding pattern of secondary structure SOD1 proteins.
Fig. 4

Ramachandran plot to demonstrate the distribution of Gly, Pre-Pro, and Pro in allowed and favoured regions for multiple variants at 94 position require to maintain the secondary structure SOD1 protein

Ramachandran plot to demonstrate the distribution of Gly, Pre-Pro, and Pro in allowed and favoured regions for multiple variants at 94 position require to maintain the secondary structure SOD1 protein

Discussion

Present study provides the comparative analysis of various bioinformatics tools used to predict structural and functional aspects of exonic variants of SOD1 in ALS pathology which may be used to decipher the clinical severity of disease and translational implications. Not much success has been achieved in the advancement of prognostic and diagnostic fields to predict ALS in early stages of pathology. ALS is one of the most devastating degenerative diseases which demands faster conversion of genetic data into its clinical and translational development. ALS patients have been prescribed Riluzole to offer symptomatic relief and retard the degenerative process by inhibiting release of glutamic acid and noncompetitive action with N-methyl-D-aspartate (NMDA) receptors [25, 26]. We have demonstrated that mutation(s) found in conserved domain of SOD1 protein are more deleterious and disease causing which can significantly distort the structure of SOD1 protein by altering Gibb’s free energy and entropy of naïve protein. PROVEAN, PHD-SNP and SIFT scores also got altered significantly in mutations located in the conserved domain of SOD1 protein. Interestingly, results of multiple regression analysis to see individual impact of different entities including SIFT, PROVEAN, Polyphen2, QMEAN etc. on structure and function of SOD1 protein have revealed significant changes in free energy and entropy (Delta G and Delta S) which were concordant with structural changes in SOD1 protein. Results are suggesting that conserved domain mutation may have pivotal role in balancing the free energy and entropy of SOD1 protein by maintaining homeostatic interactions. Multiple bioinformatics tools to predict the structural and functional analysis can enhance the possibility of identifying variant’s nature that can be missed by employing one tool to be specific. The resulted SNPs can induce the unfavorable conformational changes in SOD1 protein and may refuse to interact with other associated molecules which may hamper the mediated functions of downstream molecules (Fig. 5). Mutations in conserved domain of SOD1 protein have been found to stimulate the sedimentable aggregates [27], impair the activity of Na+/K+ATPase-α3 [28], reduce the affinity for Zn ion [29] and increase the Palmitoylation [30]. Therefore, it can also be argued that location and degree of mutation of SOD1 gene may have diverse impact on structural and functional aspects of SOD1 protein. This suggests that ALS pathology derived through various mutation may be dealt accordingly to the nature of SOD1 mutation and therapeutic regimen must be designed accordingly. It is evident from our results that mutation in phylogenetically conserved region is pronounced to be highly detrimental in nature because these alterations are located in functional domain of the effective protein and suggest maximum structural distortion of protein. Based on these results, it may be suggestive to provide the therapies or molecules which may assist in maximum structural restoration of the mutated SOD1 protein to provide the interactive interface for downstream molecules, may be beneficial for ALS patients.
Fig. 5

Schematic illustration of various biomolecules to reveal the interaction with SOD1 protein to perform the various cellular and molecular function to maintain oxidative homeostatis

Schematic illustration of various biomolecules to reveal the interaction with SOD1 protein to perform the various cellular and molecular function to maintain oxidative homeostatis Study has also provided comparative analysis of synonymous SNPs (six), though they do not exert any changes in tertiary structure of protein. Our results are suggesting the changes in QMEAN, Cβ, salvation and torsion angle of these six variants as compared to WT SOD1 protein and indicating to consider the same while making a clinical impression of ALS phenotype. Moreover, Ramachandran plot analysis has also showed the differential distribution of amino acids among favorable, allowed and outlier region require to maintain secondary structure of SOD1 protein suggesting the distorted molecular flexibility and stability of protein of variants at 94 position Such multiple variations at single location may lead to differential clinical phenotypes of ALS based on distribution of amino acid in Ramachandran plot due to varied degree of interactions with downstream molecules.

Conclusion

Study has indicated the biological significance of mutation fall on conserved domain mutation of SOD1 and can distort the structure of naïve protein. Such spectrum of mutation can confer the various intermittent phenotypes of ALS by exerting the varying degree of interaction with downstream molecules which may warrant the personalized therapy based on location of SOD1 mutation.

Limitation

Predictive genetic interactions between these SOD1 variants and molecular interaction with other genes have not been deciphered. Corresponding protein levels of SOD1 variants can precisely define consequence in ALS severity and can derive better representation of ALS phenotype which could be demonstrated by adopting cell culture or animal model based analysis. Additional file 1: Tables and Figures.
  30 in total

1.  Predicting the insurgence of human genetic diseases associated to single point protein mutations with support vector machines and evolutionary information.

Authors:  E Capriotti; R Calabrese; R Casadio
Journal:  Bioinformatics       Date:  2006-08-07       Impact factor: 6.937

Review 2.  Posttranslational modifications in Cu,Zn-superoxide dismutase and mutations associated with amyotrophic lateral sclerosis.

Authors:  Yoshiaki Furukawa; Thomas V O'Halloran
Journal:  Antioxid Redox Signal       Date:  2006 May-Jun       Impact factor: 8.401

3.  Realizing the gains and losses in C9ORF72 ALS/FTD.

Authors:  Cathleen Lutz
Journal:  Nat Neurosci       Date:  2020-05       Impact factor: 24.884

4.  C9orf72 hexanucleotide repeat expansion in Indian patients with ALS: a common founder and its geographical predilection.

Authors:  Uzma Shamim; Sakshi Ambawat; Jyotsna Singh; Aneesa Thomas; Chevula Pradeep-Chandra-Reddy; Varun Suroliya; Bharathram Uppilli; Shaista Parveen; Pooja Sharma; Shankar Chanchal; Saraswati Nashi; Veeramani Preethish-Kumar; Seena Vengalil; Kiran Polavarapu; Muddasu Keerthipriya; Niranjan Prakash Mahajan; Neeraja Reddy; Priya Treesa Thomas; Arun Sadasivan; Manjusha Warrier; Malika Seth; Sana Zahra; Aradhana Mathur; Deepti Vibha; Achal K Srivastava; Atchayaram Nalini; Mohammed Faruq
Journal:  Neurobiol Aging       Date:  2020-01-03       Impact factor: 4.673

Review 5.  Amyotrophic lateral sclerosis.

Authors:  Matthew C Kiernan; Steve Vucic; Benjamin C Cheah; Martin R Turner; Andrew Eisen; Orla Hardiman; James R Burrell; Margaret C Zoing
Journal:  Lancet       Date:  2011-02-04       Impact factor: 79.321

6.  Palmitoylation of superoxide dismutase 1 (SOD1) is increased for familial amyotrophic lateral sclerosis-linked SOD1 mutants.

Authors:  Sarah E Antinone; Ghanashyam D Ghadge; Tukiet T Lam; Lijun Wang; Raymond P Roos; William N Green
Journal:  J Biol Chem       Date:  2013-06-12       Impact factor: 5.157

7.  Mutations in Cu/Zn superoxide dismutase gene are associated with familial amyotrophic lateral sclerosis.

Authors:  D R Rosen; T Siddique; D Patterson; D A Figlewicz; P Sapp; A Hentati; D Donaldson; J Goto; J P O'Regan; H X Deng
Journal:  Nature       Date:  1993-03-04       Impact factor: 49.962

8.  Recessive amyotrophic lateral sclerosis families with the D90A SOD1 mutation share a common founder: evidence for a linked protective factor.

Authors:  A Al-Chalabi; P M Andersen; B Chioza; C Shaw; P C Sham; W Robberecht; G Matthijs; W Camu; S L Marklund; L Forsgren; G Rouleau; N G Laing; P V Hurse; T Siddique; P N Leigh; J F Powell
Journal:  Hum Mol Genet       Date:  1998-12       Impact factor: 6.150

Review 9.  C9orf72-mediated ALS and FTD: multiple pathways to disease.

Authors:  Rubika Balendra; Adrian M Isaacs
Journal:  Nat Rev Neurol       Date:  2018-09       Impact factor: 42.937

Review 10.  Peripheral and Central Nervous System Immune Response Crosstalk in Amyotrophic Lateral Sclerosis.

Authors:  Zhouyang Liu; Xi Cheng; Shanshan Zhong; Xiuchun Zhang; Chang Liu; Fangxi Liu; Chuansheng Zhao
Journal:  Front Neurosci       Date:  2020-06-16       Impact factor: 4.677

View more
  1 in total

1.  Spatiotemporal evolution of pyroptosis and canonical inflammasome pathway in hSOD1G93A ALS mouse model.

Authors:  Haoyun Zhang; Hao Li; Bingkun Huang; Shaoye Wang; Ying Gao; Fandi Meng; Yanchun Chen; Fenghua Zhou; Yingjun Guan; Xin Wang
Journal:  BMC Neurosci       Date:  2022-08-09       Impact factor: 3.264

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.