Dimos Kapetis1,2, Jenny Sassone3,4, Yang Yang5,6, Barbara Galbardi7, Markos N Xenakis8,9, Ronald L Westra8,9, Radek Szklarczyk8, Patrick Lindsey8, Catharina G Faber8,10, Monique Gerrits8, Ingemar S J Merkies10,11, Sulayman D Dib-Hajj5,6, Massimo Mantegazza12, Stephen G Waxman5,6, Giuseppe Lauria3. 1. Bioinformatics Unit, IRCCS Foundation "Carlo Besta" Neurological Institute, Milan, Italy. dimos.kapetis@gmail.com. 2. Neuroalgology Unit, IRCCS Foundation "Carlo Besta" Neurological Institute, Milan, Italy. dimos.kapetis@gmail.com. 3. Neuroalgology Unit, IRCCS Foundation "Carlo Besta" Neurological Institute, Milan, Italy. 4. Present address: San Raffaele Scientific Institute and Vita-Salute University, Milan, Italy. 5. Department of Neurology, Yale University School of Medicine, New Haven, USA. 6. Center for Neuroscience and Regeneration Research, Yale University School of Medicine, New Haven, USA. 7. Bioinformatics Unit, IRCCS Foundation "Carlo Besta" Neurological Institute, Milan, Italy. 8. Department of Clinical Genetics, Maastricht University Medical Center, Maastricht, The Netherlands. 9. Department of Knowledge Engineering, Maastricht University, Maastricht, The Netherlands. 10. Department of Neurology, Maastricht University Medical Center, Maastricht, The Netherlands. 11. Department of Neurology, Spaarne Hospital, Hoofddorp, The Netherlands. 12. Laboratory of Excellence Ion Channel Science and Therapeutics, Institute of Molecular and Cellular Pharmacology, CNRS UMR7275 & University of Nice-Sophia Antipolis, Valbonne, France.
Abstract
BACKGROUND: Gain-of-function mutations in SCN9A gene that encodes the voltage-gated sodium channel NaV1.7 have been associated with a wide spectrum of painful syndromes in humans including inherited erythromelalgia, paroxysmal extreme pain disorder and small fibre neuropathy. These mutations change the biophysical properties of NaV1.7 channels leading to hyperexcitability of dorsal root ganglion nociceptors and pain symptoms. There is a need for better understanding of how gain-of-function mutations alter the atomic structure of Nav1.7. RESULTS: We used homology modeling to build an atomic model of NaV1.7 and a network-based theoretical approach, which can predict interatomic interactions and connectivity arrangements, to investigate how pain-related NaV1.7 mutations may alter specific interatomic bonds and cause connectivity rearrangement, compared to benign variants and polymorphisms. For each amino acid substitution, we calculated the topological parameters betweenness centrality (B ct ), degree (D), clustering coefficient (CC ct ), closeness (C ct ), and eccentricity (E ct ), and calculated their variation (Δ value = mutant value -WT value ). Pathogenic NaV1.7 mutations showed significantly higher variation of |ΔB ct | compared to benign variants and polymorphisms. Using the cut-off value ±0.26 calculated by receiver operating curve analysis, we found that ΔB ct correctly differentiated pathogenic NaV1.7 mutations from variants not causing biophysical abnormalities (nABN) and homologous SNPs (hSNPs) with 76% sensitivity and 83% specificity. CONCLUSIONS: Our in-silico analyses predict that pain-related pathogenic NaV1.7 mutations may affect the network topological properties of the protein and suggest |ΔB ct | value as a potential in-silico marker.
BACKGROUND: Gain-of-function mutations in SCN9A gene that encodes the voltage-gated sodium channel NaV1.7 have been associated with a wide spectrum of painful syndromes in humans including inherited erythromelalgia, paroxysmal extreme pain disorder and small fibre neuropathy. These mutations change the biophysical properties of NaV1.7 channels leading to hyperexcitability of dorsal root ganglion nociceptors and pain symptoms. There is a need for better understanding of how gain-of-function mutations alter the atomic structure of Nav1.7. RESULTS: We used homology modeling to build an atomic model of NaV1.7 and a network-based theoretical approach, which can predict interatomic interactions and connectivity arrangements, to investigate how pain-related NaV1.7 mutations may alter specific interatomic bonds and cause connectivity rearrangement, compared to benign variants and polymorphisms. For each amino acid substitution, we calculated the topological parameters betweenness centrality (B ct ), degree (D), clustering coefficient (CC ct ), closeness (C ct ), and eccentricity (E ct ), and calculated their variation (Δ value = mutant value -WT value ). Pathogenic NaV1.7 mutations showed significantly higher variation of |ΔB ct | compared to benign variants and polymorphisms. Using the cut-off value ±0.26 calculated by receiver operating curve analysis, we found that ΔB ct correctly differentiated pathogenic NaV1.7 mutations from variants not causing biophysical abnormalities (nABN) and homologous SNPs (hSNPs) with 76% sensitivity and 83% specificity. CONCLUSIONS: Our in-silico analyses predict that pain-related pathogenic NaV1.7 mutations may affect the network topological properties of the protein and suggest |ΔB ct | value as a potential in-silico marker.
SCN9A gene encodes the alpha-subunit of voltage-gated sodium channel NaV1.7 that is expressed in dorsal root ganglion (DRG) nociceptors and in sympathetic neurons. NaV1.7 is folded into four homologous domains, each containing six transmembrane helices (S1-S6). S1–S4 helices form the voltage-sensing domain (VSD) and highly conserved basic residues in S4 sense the electric field across the membrane. S5–S6 helices with the re-entrant extracellular loop in between form the pore domain (PD) [1]. Membrane depolarisation induces a conformational change in the VSD that, through the S4-S5 linker, is transmitted to the PD and prompt the gate to open, allowing the passage of sodium ions through the pore [2]. Opening and closing of the channel modulate the subthreshold membrane potential of nociceptors and play a key role in regulating their firing.Missense mutations in SCN9A have been associated to a spectrum of painful conditions in humans [3], including inherited erythromelalgia (IEM), [4-13], paroxysmal extreme pain disorder (PEPD) [14-17], and small fibre neuropathy (SFN) [18, 19]. Voltage-clamp recording, performed in transfected cell lines and DRG neurons in vitro, showed that IEM-related mutations enhance the activation of NaV1.7 through a hyperpolarising shift and a slower deactivation that keeps the channel open longer once it is activated [3], thus generating a larger-than-normal inward sodium current, with greater biophysical changes at higher temperature [20]. PEPD-related NaV1.7 mutations impair channel inactivation and prolong action potentials and repetitive nociceptor firing in response to provoking stimuli, such as stretching and exposure to cold temperatures [14, 16, 21]. NaV1.7 mutations identified in SFNpatients display a spectrum of electrophysiological signatures, including impaired slow inactivation, depolarised slow and fast inactivation and enhanced resurgent currents [18].Overall, all the disease-related NaV1.7 mutations are pro-excitatory for the NaV1.7 channel, thus increasing nociceptor excitability. For those NaV1.7 mutations that have been studied by structural modelling, the gain-of-function effect stems from functionally significant changes in the biomolecular structure of NaV1.7 channel [22-24]. Accordingly, gain-of-function mutations found in IEM, PEPD, and SFNpatients might be expected to produce functionally significant changes in the protein structure of NaV1.7, whereas single nucleotide polymorphisms (SNPs) or variants not associated with disease would not be expected to modify the NaV1.7 protein structure in functionally significant ways. Previous NaV1.7 structural modelling, combined with functional studies, showed that the disruption of the hydrophobic ring by the F1449V [24] or the in-frame deletion Leu955Del [22] contribute to destabilizing the NaV1.7 closed-state. These studies suggest that homology modelling is a useful tool to predict functional changes in the biomolecular structure of Nav1.7. However, the nature and extent of interatomic bond variations in NaV1.7 protein structure caused by amino acid changes have not been examined over a spectrum of mutations and SNPs.Structural modelling combined with network theory has been widely exploited in studying protein structure to identify the emergent features of global connectivity. Indeed, several studies have used network theory to provide important insights in the local topology of interactions from a global prospective with examples from the field of allosteric communication pathways [25], protein-protein interactions [26], catalytic site residues in enzymes [27] and protein-folding mechanisms [28]. Several methods have been proposed in the literature to transform the protein structures into a network by considering: (a) the C-alpha/C-beta atoms in the amino acid residues, as in a protein backbone network [29] (b) description of the atomic contacts between residues that also feature correlated motions [30-32] or (c) weak and strong non-covalent protein structure network considering atom-atom interaction at the side chain level which has been proven to provide valuable biological insights [30, 33, 34]. These studies have shown that network analysis of a protein can yield a useful method to characterize the topology of the constituent amino acid residues. Protein topologies and interaction connectivity could often produce distinct small-world networks proprieties [28, 35, 36], thus having high local connectivity of residue nodes with a smaller number of long-range residue-residue interactions.In the present study, we aimed at elucidating specific interatomic bond variations caused by amino acid changes in NaV1.7 structure by using a network-based method. We tested the hypothesis that mutations associated with IEM, PEPD and SFN cause specific types of interatomic bonds variation of NaV1.7 that can be quantified by a network-based theoretical approach able to reduce the complexity of the three-dimensional protein architecture to one-dimensional graphs [28].
Methods
Protocol description
The overall method is summarized in Fig. 1. Our methodology can be encapsulated in a protocol that has two main components: homology modelling and topology analysis. The main steps of the current protocol are: (A) Homology modelling of NaV1.7 WT based on the bacterial NavAbsodium channel template. (B) Energy minimization and structure refinement of the protein structure (C) In-silico mutagenesis is performed for pathogenetic and control group (nABN/hSNPs) mutations (Table 1). (D) Construction of inter-residue network based on weak and strong noncovalent interactions (E) Network centrality calculation and (F) the difference between mutated and WT mutated (Δ = mutant -WT ).
Fig. 1
NaV1.7 computational protocol overview. A NaV1.7 WT homology modelling of based on the bacterial NavAb sodium channel template. B Energy minimization and structure refinement of the protein structure with YAMBER force field and FG-MD server. C
In-silico mutagenesis for pathogenetic and control group (nABN/hSNPs) mutations. D Transforming NaV1.7 structure into residue interaction graphs. The construction of inter-residue network was based on interatomic bonds (hydrophobic, hydrogen bonds, salt-bridges, cation-π and π-π stacking interactions) using the commands “ListIntAtom” and “ListIntBo” via YASARA software. The de novo network construction for each mutant and WT models is achieved considering the predicted binary interatomic bonds. E-F. Network centrality calculation and their relative variation between mutant and WT (Δ = mutant -WT )
Table 1
ΔB values of NaV1.7 mutations associated to IEM, SFN and PEPD
Disease
Mutation
Amino acidProperties
ChannelPart
ΔBct
Reference
IEM
I136V
=HΦO
VSD (S1;DI)
0.12
[12, 58, 69]
S211P
Polar → HΦO
VSD (S4;DI)
-1.09
[70]
F216S
HΦO → polar
VSD (S4;DI)
-1.71
[11, 57]
L823R
HΦO → charged
VSD (S4;DII)
1.23
[7, 71]
W1538R
HΦO → charged
VSD (S2-S3;DIV)
0.18
[72]
I234T
HΦO → polar
S4-S5 (DI)
2.33
[73]
S241T
=Polar
S4-S5 (DI)
0.34
[23, 74, 75]
I848T
HΦO → polar
S4-S5 (DII)
-5.83
[4, 9, 17, 47, 59]
L858H
HΦO → charged
S4-S5 (DII)
-1.85
[4, 9, 17]
L858F
=HΦO
S4-S5 (DII)
-1.74
[5, 11, 76]
G856Da
HΦO → charged
S4-S5 (DII)
-0.55
[19]
A863P
=HΦO
S4-S5 (DII)
-0.32
[6]
P1308L
=HΦO
S4-S5 (DIII)
0.04
[21]
V1316A
=HΦO
S4-S5 (DIIII)
0.36
[47, 77]
A1632Eb
HΦO → charged
S4-S5 (DIV)
0.27
[78]
N395K
polar → charged
Pore (S6;DI)
5.32
[11, 79]
V400M
=HΦO
Pore (S6;DI)
-0.68
[23, 80]
V872G
=HΦO
Pore (S5;DI)
-2.48
[81]
F1449V
=HΦO
Pore (S6;DIII)
-0.51
[10, 23]
A1746G
=HΦO
Pore (S6;DIV)
1.40
[72]
SFN
R185H
=charged
VSD (S2-S3;DI)
0
[18]
I228M
=HΦO
VSD (S4;DI)
2.04
[18, 82]
I739V
=HΦO
VSD (S1;DII)
0.54
[18, 83]
M1532I
=HΦO
VSD (S2-S3;DIV)
0.15
[18]
M932L
=HΦO
Loop Pore (DII)
0.46
[18]
PEPD
V1298D
HΦO → charged
S4-S5 (DIII)
-0.81
[14]
V1298F
=HΦO
S4-S5 (DIII)
-0.004
[14, 15, 21]
V1299F
=HΦO
S4-S5 (DIII)
0.07
[14, 15, 17]
G1607R
HΦO → charged
S4-S5 (DIII)
0.62
[84]
M1627K
HΦO → charged
S4-S5 (DIV)
1.22
[14, 16, 17, 85]
IEM Inherited erythromelalgia, SFN Small Fibre Neuropathy, PEPD Paroxysmal extreme pain disorder, nABN no biophysical abnormalities, H
O Hydrophobic. ΔB was calculated as (mutated B – Wild-type B) × 100
aThis mutation associates with clinical features of IEM and SFN
bThis mutation causes symptoms common both to IEM and PEPD
NaV1.7 computational protocol overview. A NaV1.7 WT homology modelling of based on the bacterial NavAbsodium channel template. B Energy minimization and structure refinement of the protein structure with YAMBER force field and FG-MD server. C
In-silico mutagenesis for pathogenetic and control group (nABN/hSNPs) mutations. D Transforming NaV1.7 structure into residue interaction graphs. The construction of inter-residue network was based on interatomic bonds (hydrophobic, hydrogen bonds, salt-bridges, cation-π and π-π stacking interactions) using the commands “ListIntAtom” and “ListIntBo” via YASARA software. The de novo network construction for each mutant and WT models is achieved considering the predicted binary interatomic bonds. E-F. Network centrality calculation and their relative variation between mutant and WT (Δ = mutant -WT )ΔB values of NaV1.7 mutations associated to IEM, SFN and PEPDIEMInherited erythromelalgia, SFN Small Fibre Neuropathy, PEPD Paroxysmal extreme pain disorder, nABN no biophysical abnormalities, H
O Hydrophobic. ΔB was calculated as (mutated B – Wild-type B) × 100aThis mutation associates with clinical features of IEM and SFNbThis mutation causes symptoms common both to IEM and PEPD
NaV1.7 homology modelling
A homology model of the closed-state pore domain of the NaV1.7 was generated using the crystal structure of the bacterial Arcobacter bultzeri NaV channel NaVAb [37] as a template with the human sequence NM_002977.3 through the MEMOIR server [38]. Gap region (269-340, DI) between template-target alignment and interdomain loop regions (416-726, DI-DII; 967-1175, DII-DIII; 1458-1498, DIII-DIV) were excluded from in-silico mutagenesis (Fig. 1A). The NaVAb template shared 28% sequence identity for DI, 24% for DII, 28% for DIII and 28% for DIV (overall 27% sequence identity). The four homologous domains were modelled in the clockwise direction viewed from the extracellular side as previously suggested [39, 40]. Ab-initio modelling was performed to extend the S6 helices of the PD using the Iterative Threading ASSEmbly Refinement (I-TASSER) server [41]. The final model was subjected to energy minimization and model refinement using the YAMBER force field [42] and the Fragment-Guided Molecular Dynamics (FG-MD) server [43]. The NaV1.7 WT model was subjected to stereochemical analysis with RAMPAGE server (http://services.mbi.ucla.edu/). RAMPAGE provides results in a graphical form that shows the number of residues falling in favoured region, allowed region and in outlier region.
In-Silico Mutagenesis of NaV1.7 pathogenetic and control mutations
We performed in-silico mutagenesis via WT domain replacement of NaV1.7 mutations found in IEM, PEPD or SFNpatients in which gain-of-function was demonstrated by cell electrophysiology assay and that do not alter the biophysical properties of the channel (nABN). To increase the number of control variants, we added missense SNPs identified between SCN9a homologous genes sharing >90% nucleotide sequence identity using the NCBI HomoloGene Database [44]. We constructed the phylogenetic tree of the multiple sequence alignment using ClustalW via neighbor joining method (Additional file 1: S1 Text; https://www.ebi.ac.uk/Tools/phylogeny/clustalw2_phylogeny/). The mutated models were further subjected to energy minimization and model refinement using the YAMBER force field [42] and the FG-MD server (Fig. 1B) [43]. Such hSNPs have previously been used in similar studies [45-47]. All the mutations and SNPs are reported in Additional file 2: Table S1.
Transforming NaV1.7 structure into residue interaction graphs
NaV1.7 structures were transformed into mathematical graphs by identifying interatomic bonds between the amino acids. The amino acid residues form the nodes and inter-node contact interaction form the edges of the graph (Fig. 1D). We identified the interatomic bonds (hydrophobic, hydrogen bonds, salt-bridges, cation-π and π-π stacking interactions) between two residues i and j as long as the atom-atom distance between them was less than 5.0 Å using the commands “ListIntAtom” and “ListIntBo” via YASARA software (Yet Another Scientific Artificial Reality Application, www.yasara.org). Hydrophobic contacts between residues were considered in the following atom groups: (a) the first carbon of CH3-, -CH2- and CHC3 (b) sp2carbons (phenolic rings). π-π stacking were considered between (a) sp2carbons with a hydrogen and (b) carbon, nitrogen, oxygen or sulphur atoms in planar phenolic rings. Cation-π formation was considered to be a π-π contact with the difference being that one of the interaction partners is a cation. The de novo network construction for each mutant and WT models is achieved considering the predicted binary interatomic bonds identified through YASARA software.
Topological metrics and network visualization
We computed some of the most well-known network centrality measures for each mutant and WT network NaV1.7 graph using the Cytoscape plugin NetworkAnalyzer [48], namely:Betweenness Centrality (B) and edge Betweenness centrality (EB): B [49] is defined as the fraction of shortest pathways between all pairs of nodes of the network that go through that node. Let G = (N, E) a graph, where N is the set of the nodes and E is the set of the edges. For each node n and m in N, let d (n, m) the distance between n and m. We definewhere s, t ∈N, σst (n) is the number of shortest paths from s to t that n lies on, and σst denotes the number of shortest paths from s to t. It accounts the importance of a node facilitating interactions between other nodes. For example, a node with high Bct can operate as a bridge on many shortest paths between other nodes in the network. It is a measure of how powerful a node is able to transfer (high B) or interrupt (low B) the spread of information on the fastest connection between two nodes. Similarly, the EB of an edge is the number of shortest paths between pairs of nodes that run along it. We define:Where N = set of nodes; E = set of edges; = number of shortest paths between ni and nj; = number of shortest paths between ni and nj which pass through e ∊ E;Degree (D): D [49] of a node (k) is defined as the total number of nodes that it is directly connected to;Clustering Coefficient (CC): Clustering Coefficient [49] is a metric commonly employed to identify well-connected sub-components in network which represents the interconnectivity of neighbors of the node. It measures the degree to which nodes tend to cluster together and is defined as the fraction of triangles around a node among the total number of possible triangles. We definewhere kn is the number of neighbors of n and en is the number of connected pairs between all neighbors of n;Closeness centrality (C): Cct is defined as the sum of the inverted distances, i.e. farness, to all other nodes in the graph. It captures the basic intuition that the closer a node is to all other nodes in terms of path length, the more important it is. Mathematically, C of a node n is defined as the inverse of the sum of shortest paths from n to all other nodes m in network. We defineEccentricity (E): Ect measures the distance between a node n and the most distance node m; if the E of the node n is low, this means that all other nodes are in proximity whereas a high E means that there is at least one node (and all its neighbors) that is far from node n. We define E maximum non-infinite length of a shortest path between n and another node in the network. We define
Network centrality measure variation
For each network centrality measures we calculated the difference between mutant and WT values defined as Δvalue (Δvalue = mutant value – WT value). The NaV1.7 amino acid network was visualized using Cytoscape’s Organic layout, which is a force-directed layout algorithm similar to the Fruchterman-Reingold approach [50].
Statistical analysis
Statistical analyses were performed using the R statistical Package [51]. Data are indicated as mean ± SD. Statistical significance was determined by the Wilcoxon signed-ranked test (p <0.05). The receiver operating characteristics (ROC) curve was used to assess the discriminatory power of centrality measure variations between pathogenetic NaV1.7 mutations and control groups (nABN and hSNPs). The upper-angle of ROC corresponding to the best sensitivity and specificity was used to identify the best cut-off value.
Results
NaV1.7 interatomic structure graph design
We performed homology modelling to construct the tertiary structure of the closed-state NaV1.7sodium channel (Fig. 1). We constructed the atomic model of NaV1.7sodium channel using the MEMOIR server [38] based on the crystal structure of the bacterial Arcobacter bultzeri NaV channel NaVAb as a template with the human sequence NM_002977.3. The first four helices S1–S4 form the VSD and the last two helices S5–S6 form the PD (Fig. 2a and b). Gap region (269-340, S5-6 extracellular linker in DI) between template-target alignment and interdomain loop regions (416-726, DI-DII; 967-1175, DII-DIII; 1458-1498, DIII-DIV) were excluded from in-silico mutagenesis. The four homologous domains were modelled in the clockwise direction viewed from the extracellular side as suggested previously [39, 40].
Ab-initio modelling was performed to extend the S6 helices of the PD using the Iterative Threading ASSEmbly Refinement (I-TASSER) server [41]. The final model was subjected to energy minimization and model refinement using the YAMBER force field [42] and the Fragment-Guided Molecular Dynamics (FG-MD) server [43] (Additional file 3: NaV1.7 pdb file). The RAMPAGE results for the NaV1.7 model showed 88.5% residues in most favored region (Additional file 4: Figure S1), 9% (90 residues) in allowed region and 2.5% (25 residues) in outlier region. A good quality Ramachandran plot has over 90% residues in the most favoured regions [52] therefore Ramachandran plot of NaV.17 it is close to a good quality model (88.5% residues in most favoured regions).
Fig. 2
NaV1.7 structure and inter-atomic network features. a View of the sodium channel α-subunit from the intracellular side of the membrane NaV1.7 is folded into four repeated domains (DI–DIV); helices S1–S4 comprise the voltage-sensing domain (VSD); helices S5–S6 and their intracellular linker comprise the pore domain (PD). b Intramembrane view of the folded model of NaV1.7. c The graph shows the topology of the mutations found in patients with inherited erythromelalgia (IEM; red), paroxysmal extreme pain disorder (PEPD; green), small-fibre neuropathy (SFN; purple) and the amino acid substitution with no biophysical abnormalities (nABN) and homologous SNPs (light blue). Nodes represent the residues and edges of the interatomic bonds. Red and black edges represent high (red) or low (grey) edge betweenness centrality (EB) values, respectively. Edge thickness are proportional to EB and reveal that a high number of shortest paths pass through few edges. *This mutation associates with clinical features of IEM and SFN. ǂThis mutation causes in vitro biophysics changes and in vivo symptoms common both to IEM and PEPD. The NaV1.7 amino acid network were visualized using Cytoscape’s Organic layout, which is a force-directed layout algorithm similar to the Fruchterman-Reingold approach
NaV1.7 structure and inter-atomic network features. a View of the sodium channel α-subunit from the intracellular side of the membrane NaV1.7 is folded into four repeated domains (DI–DIV); helices S1–S4 comprise the voltage-sensing domain (VSD); helices S5–S6 and their intracellular linker comprise the pore domain (PD). b Intramembrane view of the folded model of NaV1.7. c The graph shows the topology of the mutations found in patients with inherited erythromelalgia (IEM; red), paroxysmal extreme pain disorder (PEPD; green), small-fibre neuropathy (SFN; purple) and the amino acid substitution with no biophysical abnormalities (nABN) and homologous SNPs (light blue). Nodes represent the residues and edges of the interatomic bonds. Red and black edges represent high (red) or low (grey) edge betweenness centrality (EB) values, respectively. Edge thickness are proportional to EB and reveal that a high number of shortest paths pass through few edges. *This mutation associates with clinical features of IEM and SFN. ǂThis mutation causes in vitro biophysics changes and in vivo symptoms common both to IEM and PEPD. The NaV1.7 amino acid network were visualized using Cytoscape’s Organic layout, which is a force-directed layout algorithm similar to the Fruchterman-Reingold approachWe performed in-silico mutagenesis for 18 mutations causing IEM, 6 mutations causing SFN, 6 mutations causing PEPD (Additional file 2: Table S1), 4 mutations not causing biophysical abnormalities (nABN) in the channel (N1245S: [53]; L1267V: [53]; V1428I and T920N: Waxman, Dib-Hajj and Mantegazza, unpublished observations) and 49 SNPs identified among human and homologous mammalian (hSNPs) SCN9A genes with >90% sequence identity (Additional file 5: S2 Text). All the disease-related mutations had previously been characterized by electrophysiological assays, and found to confer gain-of-function changes to the NaV1.7 channel (Additional file 2: Table S1). The WT and mutant NaV1.7 structures were transformed into undirected graphs by the identification of hydrophobic, cation-π and π-π stacking interactions and hydrogen bonds (H-bonds) among the amino acids. In the resulting graph, amino acids are the nodes and their interactions are the edges (Fig. 2c).
Analyses of the interatomic variations caused by gain-of-function mutations
Previous studies showed that gain-of-function mutations change the biophysical properties of the channel NaV1.7 [4–9, 14–16, 18, 54] but the underlying interatomic variations are yet to be investigated. We analyzed the interatomic variations by calculating the network centrality parameters (B, D, CC, C, E; see methods for detailed definitions) of WT and mutated residues and the value of the variation (Δ = mutant - WT , ΔB, ΔD, ΔCC, ΔC, ΔE) associated with each gain-of-function NaV1.7 mutation, nABN and hSNP. B is a measure of the centrality of a node n defined as the fraction of shortest pathways between all pairs of nodes (s, t) of the network that go through that node n [49, 55]. D of a node n is defined as the total number of nodes that it is directly connected to [49, 55]. CC is a metric commonly employed to identify well-connected sub-components in network which represents the interconnectivity of neighbors of a node n [35, 49]. C is defined as the sum of the inverted distances of a node n, i.e. farness, to all other nodes in the graph. It captures the basic intuition that the closer a node is to all other nodes, the more important it is [56]. Eccentricity (E) of a node n is the greatest distance from a node n to any other node m [55].Figure 3a-e show the profile of the topological parameters B, D, CC, C, and E in WT and mutated residues. The graphs show that both gain-of-function mutations and nABN/hSNPs modify the D values (Fig. 3c and Additional file 6: Figure S2) and CC values (Fig. 3d and Additional file 7: Figure S3) in a wide range but without significant differences between the groups (gain-of-function mean ∆D = 4.30 ± 5.15; nABN and hSNP mean ∆D = 2.27 ± 2.1; p > 0.05 by Wilcoxon signed-ranked test; gain-of-function mean ∆CC = 0.15 ± 0.20; nABN and hSNP mean ∆CC = 0.20 ± 0.25; p > 0.05 by Wilcoxon signed-ranked test). Smaller variations were observed in C values (Fig. 3e and Additional file 8: Figure S4) and E values (Fig. 3f and Additional file 9: Figure S5) without significant differences between the groups (gain-of-function mean ∆C = 0.65 ± 0.94; nABN and hSNP mean ∆C = 0.71 ± 1.51; p > 0.05 by Wilcoxon signed-ranked test; gain-of-function mean ∆E = 1.53 ± 3.75; nABN and hSNP mean ∆E = 2.05 ± 4.62; p > 0.05 by Wilcoxon signed-ranked test). Overall, ΔD, ΔCC, ΔC, ΔE did not differ significantly between gain-of-function mutations and nABN and hSNPs.
Fig. 3
Topological parameter profiles of NaV1.7 gain-of-function mutations and nABN and hSNPs. a The upper panel shows the B profile of gain-of-function mutation; the lower panel show the B profile of nABN and hSNPs. Squares indicates B values of WT amino acids, circles indicate B values of mutated amino acids. The graphs highlight that the difference between B value of mutated amino acids and B value of WT amino acids is higher in the cohort of gain-of-function (GF) mutations (upper panel) compared to control (Ctrl) nABN and hSNPs (lower panel). B values are multipled by 100. b The box plot shows the |ΔB| difference between gain-of-function mutations and the cohort of nABN and hSNP variants (mean gain-of-function |∆B| = 1.14 ± 1.40; nABN and hSNP ∆B = 0.19 ± 0.28; ***p < 0.001 by Wilcoxon signed-ranked test). |∆B| values are multipled by 100; dark horizontal lines and the triangular symbol represent median and mean values respectively, with the box representing the 25th and 75th percentiles, the whiskers the 5th and 95th percentiles, and the dots the outliers. c The upper panel shows the Degree (D) profile of gain-of-function mutations; the lower panel show the D profile of nABN and hSNPs. Squares indicates D values of WT amino acids, circles indicate D values of mutated amino acids. The box plot shows the |ΔD| difference between gain-of-function (GF) mutations and the cohort of nABN and hSNP (Ctrl) variants (gain-of-function mean |∆D| = 4.3 ± 5.15; nABN and hSNP |∆D| = 2.37 ± 2.10; p > 0.05 by Wilcoxon signed-ranked test); dark horizontal lines and the triangular symbol represent median and mean values respectively, with the box representing the 25th and 75th percentiles, the whiskers the 5th and 95th percentiles, and the dots the outliers. d The upper panel shows the Clustering Coefficient (CC) profile of gain-of-function mutations; the lower panel show the CC profile of nABN and hSNPs. Squares indicates CC values of WT amino acids; circles indicate CC values of mutated amino acids. The box plot shows the |ΔCC| difference between gain-of-function (GF) mutations and the cohort of nABN and hSNP (Ctrl) variants (mean gain-of-function |∆CC| = 0.15 ± 0.20; nABN and hSNP |∆CC| = 0.20 ± 0.25; p > 0.05 by Wilcoxon signed-ranked test); dark horizontal lines and the triangular symbol represent median and mean values respectively, with the box representing the 25th and 75th percentiles, the whiskers the 5th and 95th percentiles, and the dots the outliers. e The upper panel shows the Closeness (Cct) profile of gain-of-function mutations; the lower panel show the Cct profile of nABN and hSNPs. Squares indicates Cct values of WT amino acids, circles indicate Cct values of mutated amino acids. The box plot shows the ΔC difference between gain-of-function (GF) mutations and the cohort of nABN and hSNP (Ctrl) variants (mean gain-of-function |∆C| = 0.6 ± 0.9; nABN and hSNP |∆C| = 0.7 ± 1.4; p > 0.05 by Wilcoxon signed-ranked test). C and ∆C values are multipled by 100; dark horizontal lines and the triangular symbol represent median and mean values respectively, with the box representing the 25th and 75th percentiles, the whiskers the 5th and 95th percentiles, and the dots the outliers. f The upper panel shows the Eccentricity (Ect) profile of gain-of-function mutations; the lower panel show the Ect profile of nABN and hSNPs. Squares indicates Ect values of WT amino acids, circles indicate Ect values of mutated amino acids. The box plot shows the |ΔE| difference between gain-of-function (GF) mutations and the cohort of nABN and hSNP (Ctrl) variants (mean gain-of-function |∆E| = 1.53 ± 3.75; nABN and hSNP |∆E| = 2.05 ± 4.62; p > 0.05 by Wilcoxon signed-ranked test); dark horizontal lines and the triangular symbol represent median and mean values respectively, with the box representing the 25th and 75th percentiles, the whiskers the 5th and 95th percentiles, and the dots the outliers
Topological parameter profiles of NaV1.7 gain-of-function mutations and nABN and hSNPs. a The upper panel shows the B profile of gain-of-function mutation; the lower panel show the B profile of nABN and hSNPs. Squares indicates B values of WT amino acids, circles indicate B values of mutated amino acids. The graphs highlight that the difference between B value of mutated amino acids and B value of WT amino acids is higher in the cohort of gain-of-function (GF) mutations (upper panel) compared to control (Ctrl) nABN and hSNPs (lower panel). B values are multipled by 100. b The box plot shows the |ΔB| difference between gain-of-function mutations and the cohort of nABN and hSNP variants (mean gain-of-function |∆B| = 1.14 ± 1.40; nABN and hSNP ∆B = 0.19 ± 0.28; ***p < 0.001 by Wilcoxon signed-ranked test). |∆B| values are multipled by 100; dark horizontal lines and the triangular symbol represent median and mean values respectively, with the box representing the 25th and 75th percentiles, the whiskers the 5th and 95th percentiles, and the dots the outliers. c The upper panel shows the Degree (D) profile of gain-of-function mutations; the lower panel show the D profile of nABN and hSNPs. Squares indicates D values of WT amino acids, circles indicate D values of mutated amino acids. The box plot shows the |ΔD| difference between gain-of-function (GF) mutations and the cohort of nABN and hSNP (Ctrl) variants (gain-of-function mean |∆D| = 4.3 ± 5.15; nABN and hSNP |∆D| = 2.37 ± 2.10; p > 0.05 by Wilcoxon signed-ranked test); dark horizontal lines and the triangular symbol represent median and mean values respectively, with the box representing the 25th and 75th percentiles, the whiskers the 5th and 95th percentiles, and the dots the outliers. d The upper panel shows the Clustering Coefficient (CC) profile of gain-of-function mutations; the lower panel show the CC profile of nABN and hSNPs. Squares indicates CC values of WT amino acids; circles indicate CC values of mutated amino acids. The box plot shows the |ΔCC| difference between gain-of-function (GF) mutations and the cohort of nABN and hSNP (Ctrl) variants (mean gain-of-function |∆CC| = 0.15 ± 0.20; nABN and hSNP |∆CC| = 0.20 ± 0.25; p > 0.05 by Wilcoxon signed-ranked test); dark horizontal lines and the triangular symbol represent median and mean values respectively, with the box representing the 25th and 75th percentiles, the whiskers the 5th and 95th percentiles, and the dots the outliers. e The upper panel shows the Closeness (Cct) profile of gain-of-function mutations; the lower panel show the Cct profile of nABN and hSNPs. Squares indicates Cct values of WT amino acids, circles indicate Cct values of mutated amino acids. The box plot shows the ΔC difference between gain-of-function (GF) mutations and the cohort of nABN and hSNP (Ctrl) variants (mean gain-of-function |∆C| = 0.6 ± 0.9; nABN and hSNP |∆C| = 0.7 ± 1.4; p > 0.05 by Wilcoxon signed-ranked test). C and ∆C values are multipled by 100; dark horizontal lines and the triangular symbol represent median and mean values respectively, with the box representing the 25th and 75th percentiles, the whiskers the 5th and 95th percentiles, and the dots the outliers. f The upper panel shows the Eccentricity (Ect) profile of gain-of-function mutations; the lower panel show the Ect profile of nABN and hSNPs. Squares indicates Ect values of WT amino acids, circles indicate Ect values of mutated amino acids. The box plot shows the |ΔE| difference between gain-of-function (GF) mutations and the cohort of nABN and hSNP (Ctrl) variants (mean gain-of-function |∆E| = 1.53 ± 3.75; nABN and hSNP |∆E| = 2.05 ± 4.62; p > 0.05 by Wilcoxon signed-ranked test); dark horizontal lines and the triangular symbol represent median and mean values respectively, with the box representing the 25th and 75th percentiles, the whiskers the 5th and 95th percentiles, and the dots the outliersWe next analysed B values and found that pathogenic NaV1.7 mutations are characterized by higher variations of ΔB compared with non-pathogenic mutations and polymorphisms (Fig. 3a; Table 1 and 2). Indeed, |ΔB| was significantly higher in gain-of-function mutations compared with nABN and hSNPs (gain-of-function mean ∆B = 1.14 ± 1.40; nABN and hSNP mean ∆B = 0.19 ± 0.28; p < 0.001 by Wilcoxon signed-ranked test; Fig. 3b). ΔB variations associated with Nav1.7 pathogenetic mutations and nABN variants are exemplified in the structural modeling shown in the Fig. 4.
Table 2
ΔB values of NaV1.7 nABN and hSNPs
Type
Mutation
Amino acidproperties
ChannelPart
ΔBct
hSNP
S126A
Polar → HΦO
VSD (S1;DI)
0
L127A
=HΦO
VSD (S1;DI)
0.12
M145L
=HΦO
VSD (S1;DI)
0
N146S
=Polar
VSD (S1;DI)
0.020
V194I
=HΦO
VSD (S3;DI)
-0.14
L201V
=HΦO
VSD (S2;DI)
0.259
N206D
Polar → Charged
VSD (S2;DI)
-0.037
E759D
=Charged
VSD (S1-S2;DII)
-0.18
A766T
HΦO → polar
VSD (S2;DII)
0.20
A766V
=HΦO
VSD (S2;DII)
0.06
I767V
=HΦO
VSD (S2;DII)
0.13
V795I
=HΦO
VSD (S3;DII)
-0.72
A815S
HΦO → polar
VSD (S3-S4;DII)
0.24
K1176R
=Charged
VSD (S1;DIII)
0.0004
R1207K
=Charged
VSD (S1-S2;DIII)
-0.19
T1210N
=Polar
VSD (S1-S2;DIII)
0
I1235V
=HΦO
VSD (S2;DIII)
0
A1505V
=HΦO
VSD (S1;DIV)
0.00026
S1509T
=Polar
VSD (S1;DIV)
0.092
S1509A
Polar → HΦO
VSD (S1;DIV)
0.0084
Q1530P
Polar → HΦO
VSD (S1-S2;DIV)
-0.166
Q1530K
Polar → Charged
VSD (S1-S2;DIV)
0.20
Q1530D
=Polar
VSD (S1-S2;DIV)
0.066
H1531Y
Charged → Polar
VSD (S1-S2;DIV)
0.12
M1532V
=HΦO
VSD (S1-S2;DIV)
0.66
E1534D
=Charged
VSD (S1-S2;DIV)
0.067
Y1537N
=Polar
VSD (S1-S2;DIV)
0.63
T1548S
=Polar
VSD (S2;DIV)
0.085
H1560Y
Charged → Polar
VSD (S2-S3;DIV)
-0.20
H1560C
=HΦO
VSD (S2-S3;DIV)
0.17
V1565I
=HΦO
VSD (S2-S3;DIV)
-0.45
I1577L
=HΦO
VSD (S3;DIV)
-0.07
D1586E
=Charged
VSD (S3;DIV)
0.07
T1590K
Polar → Charged
VSD (S3-S4;DIV)
0.07
T1590R
Polar → Charged
VSD (S3-S4;DIV)
0.29
V1613I
=HΦO
VSD (S4;DIV)
-0.67
nABN
N1245Sa
=polar
VSD (S2-S3;DIII)
-0.05
L1267Va
=HΦO
VSD (S3;DIII)
0
V1428I
=HΦO
Pore (S6;DIII)
0.19
T920N
=Polar
Loop-P (DII)
0.04
hSNP
D890E
=Charged
Loop-P (DII)
-0.13
D890V
Charged → HΦO
Loop-P (DII)
-0.13
T1398M
Polar → HΦO
Loop-P (DIII)
-0.14
I1399D
HΦO → Charged
Loop-P (DIII)
-1.63
D1411S
Charged → Polar
Loop-P (DIII)
-0.20
D1411N
Charged → Polar
Loop-P (DIII)
-0.16
K1412I
Charged → HΦO
Loop-P (DIII)
-0.0033
K1412E
=Charged
Loop-P (DIII)
-0.067
K1415I
Charged → HΦO
Loop-P (DIII)
0.038
S1419N
=Polar
Loop-P (DIII)
0.001
D1662A
Charged → HΦO
Loop-P (DIV)
-0.43
G1674A
=HΦO
Loop-P (DIV)
-0.71
K1700A
Charged → HΦO
Loop-P (DIV)
-0.22
hSNPs homologous Single nucleotide Polymorphisms ΔB was calculated as (mutated B – Wild-type B) × 100
aBrouwer et al. [53]; HO: Hydrophobic
Fig. 4
Structural modelling of NaV1.7 variants and their interatomic bonds. a The graph shows the NaV1.7 sodium channel topology and highlights the IEM associated mutation F216S and its intra-domain bond interaction (S3 and S4; depicted in red). b
Upper left inset shows the intramembrane view of the NaV1.7 channel and the amino acid F216. Upper right inset shows network view of the four NaV1.7 channel domains (DI, purple; DII, green; DIII, light blue; DIV, orange); the topology of amino acid F216 is showed as grey node. Lower insets show the bonds of WT amino acid F216 (left) and mutated amino acid S216 (right). Hydrophobic bonds are showed in green solid lines. H-bonds are showed with yellow dashed lines. F216 (DI, red) interacts with V194, V195, F198, T202 (S3, DI) and L219 (S4, DI) via hydrophobic bonds. F216 interact via H-bonds with L213 (formed by F216[NH] and L213[CO]) and L219 (F216[CO] with L219[NH]) located in S4, DI. The mutation F216S (right) interrupts all the hydrophobic interactions with S3 residues and created new H-bonds (S216[NH] with A212[CO]) causing a decrease of B and EB values. c The graph shows the NaV1.7 sodium channel topology and highlights the IEM associated mutation L858H and its inter-domain bond interaction (S4-S5 and S4; depicted in red). d
Upper left inset shows the intramembrane view of the NaV1.7 channel and the amino acid L858. Upper right insets show network view of the four NaV1.7 channeldomains (DI, purple; DII, green; DIII, light blue; DIV, orange); the topology of amino acid L858 is showed as grey node. Lower inset shows the bonds of WT amino acid L858 (left) and mutated amino acid H858 (right). Hydrophobic bonds are showed in green solid lines. H-bonds are indicated by yellow dashed lines. L858 residue (red, S4-S5; DII) interacts with I234 (DI; S4-S5), V861 (DII; S4-S5), N950, L951 and V947 (DII; S6) through hydrophobic bonds and through H-bonds with L862 (formed by L858[CO] and L862[NH]) located in DII; S4-S5. L858H mutation interrupts hydrophobic interaction with I234 (DI; S4-S5), V861 (DII; S4-S5), N950 and forms new H-bonds with A854 (formed by H858[NH] and A854[CO]) (DII; S4-S5) and V947 (formed by H858[NE2] and V947[CO]) (DII; S6). These changes decrease B value of amino acid 858 from 2.2 to 0.39. e The graph shows the NaV1.7 sodium channel topology and highlights the nABN mutation L1267V that is located in the domain DIII; S3 depicted in red. f
Upper left inset show the intramembrane view of the NaV1.7 channel and the amino acid L1267. Upper right inset shows network view of the four NaV1.7 channel domains (DI, purple; DII, green; DIII, light blue; DIV, orange); the topology of amino acid L1267 is showed as grey node. Lower inset show the bonds of WT amino acid L1267 (left) and mutated amino acid V1267 (right). Hydrophobic bonds are showed in green solid lines. H-bonds are indicated by yellow dashed lines. L1267[NH] (VSD in DIII) interacts through H-bonds with V1263[CO]. V1267mutation interacts with V1263 through a hydrophobic bond. This change does not modify B value of the residue 1267
ΔB values of NaV1.7nABN and hSNPshSNPs homologous Single nucleotide Polymorphisms ΔB was calculated as (mutated B – Wild-type B) × 100aBrouwer et al. [53]; HO: HydrophobicStructural modelling of NaV1.7 variants and their interatomic bonds. a The graph shows the NaV1.7sodium channel topology and highlights the IEM associated mutation F216S and its intra-domain bond interaction (S3 and S4; depicted in red). b
Upper left inset shows the intramembrane view of the NaV1.7 channel and the amino acid F216. Upper right inset shows network view of the four NaV1.7 channel domains (DI, purple; DII, green; DIII, light blue; DIV, orange); the topology of amino acid F216 is showed as grey node. Lower insets show the bonds of WT amino acid F216 (left) and mutated amino acid S216 (right). Hydrophobic bonds are showed in green solid lines. H-bonds are showed with yellow dashed lines. F216 (DI, red) interacts with V194, V195, F198, T202 (S3, DI) and L219 (S4, DI) via hydrophobic bonds. F216 interact via H-bonds with L213 (formed by F216[NH] and L213[CO]) and L219 (F216[CO] with L219[NH]) located in S4, DI. The mutation F216S (right) interrupts all the hydrophobic interactions with S3 residues and created new H-bonds (S216[NH] with A212[CO]) causing a decrease of B and EB values. c The graph shows the NaV1.7sodium channel topology and highlights the IEM associated mutation L858H and its inter-domain bond interaction (S4-S5 and S4; depicted in red). d
Upper left inset shows the intramembrane view of the NaV1.7 channel and the amino acid L858. Upper right insets show network view of the four NaV1.7 channeldomains (DI, purple; DII, green; DIII, light blue; DIV, orange); the topology of amino acid L858 is showed as grey node. Lower inset shows the bonds of WT amino acid L858 (left) and mutated amino acid H858 (right). Hydrophobic bonds are showed in green solid lines. H-bonds are indicated by yellow dashed lines. L858 residue (red, S4-S5; DII) interacts with I234 (DI; S4-S5), V861 (DII; S4-S5), N950, L951 and V947 (DII; S6) through hydrophobic bonds and through H-bonds with L862 (formed by L858[CO] and L862[NH]) located in DII; S4-S5. L858H mutation interrupts hydrophobic interaction with I234 (DI; S4-S5), V861 (DII; S4-S5), N950 and forms new H-bonds with A854 (formed by H858[NH] and A854[CO]) (DII; S4-S5) and V947 (formed by H858[NE2] and V947[CO]) (DII; S6). These changes decrease B value of amino acid 858 from 2.2 to 0.39. e The graph shows the NaV1.7sodium channel topology and highlights the nABN mutation L1267V that is located in the domain DIII; S3 depicted in red. f
Upper left inset show the intramembrane view of the NaV1.7 channel and the amino acid L1267. Upper right inset shows network view of the four NaV1.7 channel domains (DI, purple; DII, green; DIII, light blue; DIV, orange); the topology of amino acid L1267 is showed as grey node. Lower inset show the bonds of WT amino acid L1267 (left) and mutated amino acid V1267 (right). Hydrophobic bonds are showed in green solid lines. H-bonds are indicated by yellow dashed lines. L1267[NH] (VSD in DIII) interacts through H-bonds with V1263[CO]. V1267mutation interacts with V1263 through a hydrophobic bond. This change does not modify B value of the residue 1267Figure 4a and b shows the B topological proprieties of the F216S mutation associated to IEM [11, 57]. In the WT protein, F216 is located in VSD (S4) of DI and is predicted to mediates hydrophobic interactions with V194, V195, F198, T202 (S3, DI) and L219 (S4, DI). F216 is also predicted to mediate two H-bonds: F216[NH] with L213[CO] and F216[CO] with L219[NH] residues (S4, DI). Upon mutation, the hydrophobic interaction between F216S (S4) and the S3 residues (DI; VSD) are interrupted. The H-bonds F216[NH] with L213[CO] are interrupted. New H-bonds between S216[NH] and A212[CO] are created. All these changes yield negative B variation (ΔB = -1.71, Fig. 4a and b; Additional file 10: S1 YASARA; Additional file 11: S2 YASARA). L858H is another IEM-associated mutation [4, 9, 17]. In the WT protein, L858 is located in S4-S5 and is predicted to interacts with I234 (DI; S4-S5), V861 (DII; S4-S5), N950, L951 and V947 (DII; S6) through hydrophobic bonds and through H-bonds formed by L858[CO] and L862[NH]) (DII; S4-S5). L858H mutation interrupts hydrophobic interaction with I234 (DI; S4-S5), V861 (DII; S4-S5), N950 (DII; S6) and forms new H-bonds by H858[NH] and A854[CO] (DII; S4-S5) and by H858[CO] with V947[NH] (DII; S6) leading to a negative ΔB value (-1.85) (Fig. 4c and d; Additional file 12: S3 YASARA; Additional file 13: S4 YASARA). L1267V is an example of nABN variant that is located in the VSD of DIII which is highly conserved between human and SCN9A homologous genes (Additional file 5: S2 Text). L1267 interacts with V1263 through H-bonds formed by L1267[NH] and V1263[CO]. Upon mutation, V1267 forms new hydrophobic bond with V1263 which does not cause B variation (ΔB =0) (Fig. 4e and f; Additional file 14:S5 YASARA; Additional file 15:S6 YASARA).Figure 5 shows the network inter-residue connectivity of the IEM-associated mutations I848T and N395K, both characterized by very high ΔB values. I848 is located in S4-S5 (DII) and I848T causes a significant hyperpolarising shift in activation, a slow deactivation and an increased response to small-ramp depolarisations in DRG nociceptors [4, 9, 17, 58, 59]. I848 is predicted to interact with S4-S5 (DII) and pore (DIII; S6) through I845 and F1435, which have with very high B values (3.4 and 6.6, respectively). Upon mutation, the interatomic bond interactions between DII (S4-S5) and DIII (pore; S6) are interrupted and therefore ΔB shifts to a negative value (-5.83) showing lower EBct values (Fig. 5a and b, Additional file 16: Figure S6). Conversely, N395K mutation forms interdomain hydrophobic (S4-S5; DI and DIV, Pore; DIV) and H-bonds (S4-S5; DIV and S6; DIV), leading to a positive ΔB (5) and higher EBct values.
Fig. 5
Network inter-residue connectivity of the IEM-associated mutations I848T and N395K. a The graph shows the NaV1.7 sodium channel topology and highlights the amino acids I848 (DI; S4-S5) and N395 (DI; S6). Inter-domain bond interaction are depicted in red for the IEM associated mutation I848T and in green for the IEM associated mutation N395K. b
Upper panels show I848 and T848 networks, lower panels show N395 and K395 network. Bct and EBct evidence interatomic traffic over the network. Red-to-white color gradient of amino acids (nodes) represents Bct value (red represents high Bct and white low Bct). Red-to-black color gradient of edges (amino acid interatomic interactions) corresponds to EBct value (red represents high EBct and black low EBct). Hydrophobic bonds are showed in solid lines and H-bonds are indicated by dashed lines. I848 present high EB of connecting different parts of the NaV1.7 network. Upper right panels show that I848 interacts through hydrophobic interactions with S4-S5 (DII) and pore (DIII) through I845 and F1435 that are two residues having very high B values (3.4 and 6.6, respectively) and H-bonds with V852 and L844 (I848[CO] with V852[NH]; I848[NH] with L844[CO]). Note the difference of Bct of the upper left panel (I848Bct = 7.36) compared to the upper right panel (T848Bct = 1.52). I848T mutation interrupts the shortest paths within the network between DII (S4-S5) and DIII (pore) and therefore ΔB shifts to a negative value (-5.84). T848 interacts with F1435 through hydrophobic interactions and with S851 and L844 through H-bonds (T848[CO] with S851[NH]; T848[NH] with L844[CO]; T848[HG1] with L844[CO]). Lower panels show that N395 amino acid (red, S6 in pore module in DI) interacts with L1626 (S4-S5) via hydrophobic bond and via H-bonds formed by N395[CO] and A399[NH]and N395[NH]and F391[CO]. K395 mutation creates new hydrophobic bonds with V248 (S4-S5, DI), K398 (S6, DI), V1747 (S6, DIV), L1622 (S4-S5, DIV) and new H-bonds formed by K395[NZ] with N1751[CG] (S6, DIV) and K395[NZ] with A1625[CO] (S4-S5, DIV). These new bonds create a novel communication path within the network and thus increase Bct value of the residue 395 (K395Bct = 7.76 compared to the left panel N395Bct = 2.44). Edge thickness are proportional to EB and reveal that a high number of shortest paths pass through few edges
Network inter-residue connectivity of the IEM-associated mutations I848T and N395K. a The graph shows the NaV1.7sodium channel topology and highlights the amino acids I848 (DI; S4-S5) and N395 (DI; S6). Inter-domain bond interaction are depicted in red for the IEM associated mutation I848T and in green for the IEM associated mutation N395K. b
Upper panels show I848 and T848 networks, lower panels show N395 and K395 network. Bct and EBct evidence interatomic traffic over the network. Red-to-white color gradient of amino acids (nodes) represents Bct value (red represents high Bct and white low Bct). Red-to-black color gradient of edges (amino acid interatomic interactions) corresponds to EBct value (red represents high EBct and black low EBct). Hydrophobic bonds are showed in solid lines and H-bonds are indicated by dashed lines. I848 present high EB of connecting different parts of the NaV1.7 network. Upper right panels show that I848 interacts through hydrophobic interactions with S4-S5 (DII) and pore (DIII) through I845 and F1435 that are two residues having very high B values (3.4 and 6.6, respectively) and H-bonds with V852 and L844 (I848[CO] with V852[NH]; I848[NH] with L844[CO]). Note the difference of Bct of the upper left panel (I848Bct = 7.36) compared to the upper right panel (T848Bct = 1.52). I848T mutation interrupts the shortest paths within the network between DII (S4-S5) and DIII (pore) and therefore ΔB shifts to a negative value (-5.84). T848 interacts with F1435 through hydrophobic interactions and with S851 and L844 through H-bonds (T848[CO] with S851[NH]; T848[NH] with L844[CO]; T848[HG1] with L844[CO]). Lower panels show that N395 amino acid (red, S6 in pore module in DI) interacts with L1626 (S4-S5) via hydrophobic bond and via H-bonds formed by N395[CO] and A399[NH]and N395[NH]and F391[CO]. K395 mutation creates new hydrophobic bonds with V248 (S4-S5, DI), K398 (S6, DI), V1747 (S6, DIV), L1622 (S4-S5, DIV) and new H-bonds formed by K395[NZ] with N1751[CG] (S6, DIV) and K395[NZ] with A1625[CO] (S4-S5, DIV). These new bonds create a novel communication path within the network and thus increase Bct value of the residue 395 (K395Bct = 7.76 compared to the left panel N395Bct = 2.44). Edge thickness are proportional to EB and reveal that a high number of shortest paths pass through few edges
ΔBct distinguishes with high specificity pathogenic NaV1.7 mutations from variants not causing disease
The in-silico topological analyses described in Figures 4 and 5 was computed for all the pain disorder-related mutations (18 causing IEM, 6 causing SFN and 6 causing PEPD), and for all the 4 nABN and 49 hSNPs variants showed in Fig. 2c. The results showed that the only topological parameter that differs significantly between gain-of-function mutations and non-pathogenic amino acid changes is the |ΔB| value (Fig. 3). Indeed, 83% of nABN variants and hSNPs were characterized by |ΔB| values <0.26. The remaining 17% showed |ΔB| values >0.26 (42, V795I; 57, M1532V; 59, Y1537N; 63, V1565I; 68, V1613I; 73, I1399D; 67, T1590R; 81, D1662A; 82, G1674A) (Fig. 6a and Table 2). According to our NaV1.7 model structure, most of nABN and hSNPs, which are evolutionary variable, are located in VSD and P-loop domains and are predicted to be exposed to the lipid interface (Fig. 6b).
Fig. 6
Summary of ∆B values for all nABN and hSNP variants and gain-of-function mutations. a ∆B values of all the nABN and hSNP variants (light blue circles) and all the gain-of-function mutations (red circles) analysed in this study. Dashed lines indicates the cut-off value (ΔB ± 0.26) that maximizes sensitivity and specificity. b Intra- and extracellular view of the NaV1.7 and locations of amino acids affected by gain-of-function mutations (red) that are linked with IEM, SFN and PEPD and control group variants (nABN and hSNPs; light blue). Localization of IEM, SFN and PEPD related mutation showed in red (red). *The M1532 residue shares the same position with the SFN-related M1532I mutation which causes in vitro biophysics changes and M1532V variant belongs to the control group.c Receiver operating curve (ROC) of gain-of-function mutations and control mutations (nABN and hSNPs) as a function of ΔB. Using a cut-off value of ± 0.26, ΔB correctly classified 44 out of 53 controls and 23 of 30 gain-of-function mutations yielding 76% sensitivity and 83% specificity. The area under the curve is 0.81 (95% Confidence Interval = 0.70 to 0.91)
Summary of ∆B values for all nABN and hSNP variants and gain-of-function mutations. a ∆B values of all the nABN and hSNP variants (light blue circles) and all the gain-of-function mutations (red circles) analysed in this study. Dashed lines indicates the cut-off value (ΔB ± 0.26) that maximizes sensitivity and specificity. b Intra- and extracellular view of the NaV1.7 and locations of amino acids affected by gain-of-function mutations (red) that are linked with IEM, SFN and PEPD and control group variants (nABN and hSNPs; light blue). Localization of IEM, SFN and PEPD related mutation showed in red (red). *The M1532 residue shares the same position with the SFN-related M1532I mutation which causes in vitro biophysics changes and M1532V variant belongs to the control group.c Receiver operating curve (ROC) of gain-of-function mutations and control mutations (nABN and hSNPs) as a function of ΔB. Using a cut-off value of ± 0.26, ΔB correctly classified 44 out of 53 controls and 23 of 30 gain-of-function mutations yielding 76% sensitivity and 83% specificity. The area under the curve is 0.81 (95% Confidence Interval = 0.70 to 0.91)Twenty-three out of 30 (77%) gain-of-function NaV1.7 mutations had |ΔB| > 0.26 and are located in VSD, Pore and S4-S5 of DI, DII, DIII and DIV domains (Table 1). The remaining 7 mutations (23%) had |ΔB| <0.26 (1, I136V; 2, R185H; 8, M1532I; 9, W1538R; 17, V1298F; 19, V1299F; 20, P1308L) (Fig. 6a and Table 1). These pathogenetic mutations with small |ΔB| variation are located in VSD of DI (2, R185H) and DIII (8, M1532I; 9 W1538R) or in S4-S5 linker of DIII (17, V1298F; 19, V1299F; 20, P1308L), are highly evolutionary conserved residues (Additional file 5: S2 Text) and are predicted to be exposed outside the core of the channel (exception: I136V; Fig. 6b-c).According to these results, we hypothesized that ΔB might provide enough sensitivity and specificity to distinguish gain-of-function mutations from control variants. Using the cut-off value (ΔB ± 0.26) that maximizes sensitivity and specificity, ΔB correctly classified 44 out of 53 controls variants (nABN and hSNPs) and 23 out of 30 gain-of-function mutations, yielding 76% sensitivity and 83% specificity. The area under the ROC curve analysis for the ΔB scores was 0.81 (Fig. 6c, 95% confidence interval CI = 0.70–0.91).
Discussion
Many phenomena can be modelled as collections of elements that interact through a complex set of connections. Network theory has become one of the most successful frameworks for studying these phenomena [60] and has led to major advances in our understanding of ecological systems [61], social and communication networks [62], brain connectivity [63] and metabolic and gene regulatory pathways in living cells [64].Using network theory, protein structure can be described as mathematical graphs [28] that represent the interatomic connections. The topological features of amino acid residues, named nodes, can be described using centrality measures that define the reciprocal relationship in terms of connectivity and capability to influence other nodes within the network. We focused on the topological analysis of NaV1.7 gain-of-function mutations identified in patients with painful disorders. We considered a homology model of the NaV1.7 pore in the closed state and calculated the interaction of the nodes within the network through several measures of topology.Our findings show that ΔB values tend to be significantly higher in NaV1.7pain-related mutations than in control groups (nABN and hSNPs). B represents the influence that the shortest communication pathways have on the overall interatomic connections. Nodes with high B value could efficiently integrate signals (e.g. energy) and the reduction of B value caused by single amino acid substitutions suggests that the signalling transfer capability of the network is decreased. Conversely, the increase of B value suggests that a mutated node could facilitate the load transfer through the shortest communication pathways. Therefore, changes in ΔB reflect increased or decreased potential for connectivity of amino acid within the protein and provides numerical values about how single amino acid substitutions might act as a bottleneck for specific nodes linking different parts of the network. Previous studies of network topological parameters revealed that effective allosteric communications can be primarily provided by structurally stable residues that exhibit high B [65]. Therefore, B might provide a novel and useful tool for identifying allosteric hotspots in comparison with other centrality measures as previously suggested [25, 66].Using the cut-off value (ΔB ± 0.26) that maximizes sensitivity and specificity, our data show that ΔB correctly classified 44 out of 53 controls variants (nABN and hSNPs) and 23 out of 30 gain-of-function mutations, yielding 76% sensitivity and 83% specificity. The area under the ROC curve value for the ΔB scores was 0.81 (Fig. 6c, 95% confidence interval CI = 0.70–0.91). By contrast, our data show that none of other topological parameters (D, CC, C, and E) differ significantly between controls and gain-of-function NaV1.7 mutations. Although these data suggest that the pain-related NaV1.7 gain-of-function mutations do not have significant effects on the degree of connectivity, local clustering connectivity of the neighbour nodes (i.e. their tendency to cluster together) and eccentricity (i.e. how far is each node from any other node within the network), it is important to consider that our results derive from homology modelling constructed on the closed-state pore domain of NaV1.7. A given residue may have a number of distinct interaction networks within the channel protein throughout the gating cycle, thus our modeling captures a snap shot of these interactions, and future studies are needed to further investigate interaction networks within the channel protein throughout the gating cycle.Our NaV1.7 modeling also suggests a link between ΔB value and the buried or exposed nature of an amino acid substitution. Indeed, gain-of-function mutations predicted to be buried inside or close to the core of the channel have higher |ΔB| than the overall mean |ΔB| =1.14 (3, S211P; 4, F216S; 11, I234T; 5, I228M;12, S241T; 13, I848T; 15, L858H; 16, L858F; 24, N395K; 27, V872G; 30, A1746G) or the cut-off value (>0.26) (6, I739V; 25, V400M; 29, F1449V). Conversely, gain-of-function mutations predicted to face the lipid interface (exception: I136V) have lower |ΔB| than the overall mean |ΔB| (1.14) (14, G856D; 26, A863P; 28, M932L; 21, V1316A; 18, V1298D; 10, G1607R; 23, A1632E; exception: M1627K) or the cut-off value |ΔB| (<0.26) (17, V1298F; 19, V1299F; 20, P1308L; 2, R185H; 8, V1532I; 9, W1538R). Similarly, most of the control variants (nABN/hSNPs) predicted to face the lipid interface have low |ΔB| (<0.26) (Fig. 6b and c). Hence, in our NaV1.7 model, mutations predicted to be buried into the core of the channel show higher |ΔB| than those exposed at the interface of the membrane. This finding suggests that lipophilic interactions within the cell membrane may be disturbed by the mutations. Additional studies are required to more definitively assess the changes in lipophilic interactions that are produced by these mutations. Irrespective of the underlying mechanistic/molecular explanation, some pathogenic mutations would be missed by our method. Thus, ΔB should be regarded as a novel in-silico screening tool in addition to existing common predictive algorithms (e.g. Polyphen-2 [67], SIFT [68]) that could help in selecting pathogenetic mutations for functional testing.
Conclusions
Our findings demonstrate that most of the pathogenic NaV1.7 mutations identified in patients affected by severe painful disorders could be predicted, according to our homology modelling, to cause profound changes in the amino acid connectivity of the channel. Such modification may underpin the gain-of-function effects measurable in DRG nociceptors by electrophysiological assays. Based on these findings, we propose to consider Bct may therefore be a marker of pathogenic shift in the mutant channels, though prospective experimental studies will be required to validate its effectiveness and its biological meaning.
Authors: Janneke G J Hoeijmakers; Chongyang Han; Ingemar S J Merkies; Lawrence J Macala; Giuseppe Lauria; Monique M Gerrits; Sulayman D Dib-Hajj; Catharina G Faber; Stephen G Waxman Journal: Brain Date: 2012-01-26 Impact factor: 13.501
Authors: Hye-Sook Ahn; Sulayman D Dib-Hajj; James J Cox; Lynda Tyrrell; Frances V Elmslie; Antonia A Clarke; Joost P H Drenth; C Geoffrey Woods; Stephen G Waxman Journal: Eur J Pain Date: 2010-04-10 Impact factor: 3.931
Authors: Xiaoyang Cheng; Sulayman D Dib-Hajj; Lynda Tyrrell; Dowain A Wright; Tanya Z Fischer; Stephen G Waxman Journal: Mol Pain Date: 2010-04-29 Impact factor: 3.395
Authors: Tanya Z Fischer; Elaine S Gilmore; Mark Estacion; Emmanuella Eastman; Sean Taylor; Michel Melanson; Sulayman D Dib-Hajj; Stephen G Waxman Journal: Ann Neurol Date: 2009-06 Impact factor: 10.422
Authors: Sulayman D Dib-Hajj; Mark Estacion; Brian W Jarecki; Lynda Tyrrell; Tanya Z Fischer; Mark Lawden; Theodore R Cummins; Stephen G Waxman Journal: Mol Pain Date: 2008-09-19 Impact factor: 3.395
Authors: Jennifer A Bohn; Jamie L Van Etten; Trista L Schagat; Brittany M Bowman; Richard C McEachin; Peter L Freddolino; Aaron C Goldstrohm Journal: Nucleic Acids Res Date: 2018-01-09 Impact factor: 16.971
Authors: Julie I R Labau; Mark Estacion; Brian S Tanaka; Bianca T A de Greef; Janneke G J Hoeijmakers; Margot Geerts; Monique M Gerrits; Hubert J M Smeets; Catharina G Faber; Ingemar S J Merkies; Giuseppe Lauria; Sulayman D Dib-Hajj; Stephen G Waxman Journal: Brain Date: 2020-03-01 Impact factor: 13.501
Authors: Kim Le Cann; Jannis E Meents; Vishal Sudha Bhagavath Eswaran; Maike F Dohrn; Raya Bott; Andrea Maier; Martin Bialer; Petra Hautvast; Andelain Erickson; Roman Rolke; Markus Rothermel; Jannis Körner; Ingo Kurth; Angelika Lampert Journal: Channels (Austin) Date: 2021-12 Impact factor: 2.581
Authors: Makros N Xenakis; Dimos Kapetis; Yang Yang; Jordi Heijman; Stephen G Waxman; Giuseppe Lauria; Catharina G Faber; Hubert J Smeets; Patrick J Lindsey; Ronald L Westra Journal: J Biol Phys Date: 2021-03-18 Impact factor: 1.560
Authors: Alberto A Toffano; Giacomo Chiarot; Stefano Zamuner; Margherita Marchi; Erika Salvi; Stephen G Waxman; Catharina G Faber; Giuseppe Lauria; Achille Giacometti; Marta Simeoni Journal: Sci Rep Date: 2020-10-21 Impact factor: 4.379