| Literature DB >> 36051694 |
Mohammed Y Behairy1, Mohamed A Soltan2, Mohamed S Adam3, Ahmed M Refaat4, Ehab M Ezz5, Sarah Albogami6, Eman Fayad6, Fayez Althobaiti6, Ahmed M Gouda7, Ashraf E Sileem8, Mahmoud A Elfaky9,10, Khaled M Darwish11, Muhammad Alaa Eldeen12.
Abstract
The NRAS gene is a well-known oncogene that acts as a major player in carcinogenesis. Mutations in the NRAS gene have been linked to multiple types of human tumors. Therefore, the identification of the most deleterious single nucleotide polymorphisms (SNPs) in the NRAS gene is necessary to understand the key factors of tumor pathogenesis and therapy. We aimed to retrieve NRAS missense SNPs and analyze them comprehensively using sequence and structure approaches to determine the most deleterious SNPs that could increase the risk of carcinogenesis. We also adopted structural biology methods and docking tools to investigate the behavior of the filtered SNPs. After retrieving missense SNPs and analyzing them using six in silico tools, 17 mutations were found to be the most deleterious mutations in NRAS. All SNPs except S145L were found to decrease NRAS stability, and all SNPs were found on highly conserved residues and important functional domains, except R164C. In addition, all mutations except G60E and S145L showed a higher binding affinity to GTP, implicating an increase in malignancy tendency. As a consequence, all other 14 mutations were expected to increase the risk of carcinogenesis, with 5 mutations (G13R, G13C, G13V, P34R, and V152F) expected to have the highest risk. Thermodynamic stability was ensured for these SNP models through molecular dynamics simulation based on trajectory analysis. Free binding affinity toward the natural substrate, GTP, was higher for these models as compared to the native NRAS protein. The Gly13 SNP proteins depict a differential conformational state that could favor nucleotide exchange and catalytic potentiality. A further application of experimental methods with all these 14 mutations could reveal new insights into the pathogenesis and management of different types of tumors.Entities:
Keywords: NRAS gene; carcinogenesis; computational analysis; precision medicine; single nucleotide polymorphism
Year: 2022 PMID: 36051694 PMCID: PMC9424727 DOI: 10.3389/fgene.2022.872845
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.772
Prediction and scores of deleterious missense SNPs by six in silico tools.
| SNP Id | AA change | SIFT | PolyPhen-2 | PROVEAN | SNP&GO | PHD-SNP | SNAP2 | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Prediction | Score | Prediction | Score | Prediction | Score | Prediction | Ri score | Prediction | Ri score | Prediction | Score | |||
| 1 | rs1658971260 | R164C | deleterious | 0.00 | Probably damaging | 0.997 | Deleterious | −4.975 | Disease | 3 | Disease | 8 | effect | 43 |
| 2 | rs757968407 | V152F | deleterious | 0.00 | Probably damaging | 0.991 | Deleterious | −4.484 | Disease | 2 | Disease | 8 | effect | 30 |
| 3 | rs1163400692 | S145L | deleterious | 0.00 | Probably damaging | 0.994 | Deleterious | −5.529 | Disease | 1 | Disease | 7 | effect | 42 |
| 4 | rs754428086 | D119G | deleterious | 0.00 | Probably damaging | 0.994 | Deleterious | −6.472 | Disease | 2 | Disease | 7 | effect | 45 |
| 5 | rs1365635887 | R68G | deleterious | 0.00 | Probably damaging | 0.979 | Deleterious | −6.912 | Disease | 2 | Disease | 4 | effect | 62 |
| 6 | rs752508313 | Y64D | deleterious | 0.00 | Probably damaging | 0.999 | Deleterious | −8.841 | Disease | 7 | Disease | 8 | effect | 73 |
| 7 | rs267606920 | G60E | deleterious | 0.00 | Probably damaging | 1 | Deleterious | −7.492 | Disease | 6 | Disease | 7 | effect | 36 |
| 8 | rs1557982817 | G60R | deleterious | 0.00 | Probably damaging | 1 | Deleterious | −7.492 | Disease | 5 | Disease | 8 | effect | 37 |
| 9 | rs1465850103 | D57N | deleterious | 0.00 | Probably damaging | 0.975 | Deleterious | −4.517 | Disease | 2 | Disease | 8 | effect | 64 |
| 10 | rs1246727247 | I55R | deleterious | 0.00 | Probably damaging | 0.996 | Deleterious | −6.527 | Disease | 2 | Disease | 8 | effect | 33 |
| 11 | rs139287106 | S39F | deleterious | 0.00 | Probably damaging | 0.997 | Deleterious | −5.427 | Disease | 0 | Disease | 3 | effect | 30 |
| 12 | rs397514553 | P34R | deleterious | 0.00 | Probably damaging | 1 | Deleterious | −7.698 | Disease | 5 | Disease | 5 | effect | 19 |
| 13 | rs121913248 | A18P | deleterious | 0.00 | Probably damaging | 0.992 | Deleterious | −4.316 | Disease | 7 | Disease | 10 | effect | 15 |
| 14 | rs1308441238 | V14G | deleterious | 0.00 | Probably damaging | 0.999 | Deleterious | −5.856 | Disease | 2 | Disease | 9 | effect | 60 |
| 15 | rs121434596 | G13V | deleterious | 0.00 | Probably damaging | 0.975 | Deleterious | −7.649 | Disease | 8 | Disease | 9 | effect | 65 |
| 16 | rs121434595 | G13C | deleterious | 0.00 | Probably damaging | 0.994 | Deleterious | −7.719 | Disease | 7 | Disease | 9 | effect | 59 |
| G13R | deleterious | 0.00 | Probably damaging | 0.982 | Deleterious | −6.709 | Disease | 7 | Disease | 9 | effect | 58 | ||
Effect of missense variants on NRAS protein stability.
| SNP Id | AA change | I-mutant 2 prediction | Reliability index (RI) | DDG value (kcal/mol) | MUpro | |
|---|---|---|---|---|---|---|
| Prediction | Delta delta G | |||||
| rs1658971260 | R164C | Decrease | 5 | −1.19 | Decrease | −0.17 |
| rs757968407 | V152F | Decrease | 9 | −2.13 | Decrease | −0.85 |
| rs1163400692 | S145L | Increase | 3 | 0.24 | Increase | 0.09 |
| rs754428086 | D119G | Decrease | 7 | −0.53 | Decrease | −1.43 |
| rs1365635887 | R68G | Decrease | 6 | −0.91 | Decrease | −1.92 |
| rs752508313 | Y64D | Decrease | 4 | −0.99 | Decrease | −0.90 |
| rs267606920 | G60E | Decrease | 1 | −0.89 | Decrease | −0.46 |
| rs1557982817 | G60R | Decrease | 7 | −1.36 | Decrease | −0.63 |
| rs1465850103 | D57N | Increase | 1 | 0.18 | Decrease | −0.65 |
| rs1246727247 | I55R | Decrease | 7 | −2.31 | Decrease | −2.01 |
| rs139287106 | S39F | Increase | 3 | 0.06 | Decrease | −0.24 |
| rs397514553 | P34R | Decrease | 6 | −0.6 | Decrease | −0.98 |
| rs121913248 | A18P | Increase | 1 | −1.53 | Decrease | −1.32 |
| rs1308441238 | V14G | Decrease | 10 | −4.96 | Decrease | −2.02 |
| rs121434596 | G13V | Increase | 2 | −0.02 | Decrease | −0.25 |
| rs121434595 | G13C | Decrease | 5 | −1.18 | Decrease | −0.32 |
| G13R | Decrease | 6 | −1.27 | Decrease | −0.15 | |
Locations of NRAS variants on protein domains, phylogenetic conservation analysis, and secondary structure prediction.
| SNP Id | AA change | Location on protein | ConSurf conservation score | Functional/structural | Buried/exposed | Secondary structure |
|---|---|---|---|---|---|---|
| rs1658971260 | R164C | — | 5/intermediately conserved | — | exposed | Alpha helix |
| rs757968407 | V152F | Small GTP-binding protein domain | 9/highly conserved | structural | buried | Alpha helix |
| rs1163400692 | S145L | Small GTP-binding protein domain | 9/highly conserved | functional | exposed | Alpha helix |
| rs754428086 | D119G | Small GTP-binding protein domain | 9/highly conserved | functional | exposed | Random coil |
| rs1365635887 | R68G | Small GTP-binding protein domain | 7/highly conserved | — | exposed | Alpha helix |
| rs752508313 | Y64D | Small GTP-binding protein domain | 8/highly conserved | — | buried | Alpha helix |
| rs267606920 | G60E | Small GTP-binding protein domain | 9/highly conserved | functional | exposed | Random coil |
| rs1557982817 | G60R | Small GTP-binding protein domain | 9/highly conserved | functional | exposed | Random coil |
| rs1465850103 | D57N | Small GTP-binding protein domain | 9/highly conserved | functional | exposed | Alpha helix |
| rs1246727247 | I55R | Small GTP-binding protein domain | 8/highly conserved | — | buried | Alpha helix |
| rs139287106 | S39F | Small GTP-binding protein domain | 7/highly conserved | — | exposed | Alpha helix |
| rs397514553 | P34R | Small GTP-binding protein domain | 7/highly conserved | — | exposed | Random coil |
| rs121913248 | A18P | Small GTP-binding protein domain | 8/highly conserved | — | buried | Alpha helix |
| rs1308441238 | V14G | Small GTP-binding protein domain | 9/highly conserved | functional | exposed | Random coil |
| rs121434596 | G13V | Small GTP-binding protein domain | 7/highly conserved | — | exposed | Random coil |
| rs121434595 | G13C | Small GTP-binding protein domain | 7/highly conserved | — | exposed | Random coil |
| G13R | Small GTP-binding protein domain | 7/highly conserved | — | exposed | Random coil |
Scores of validation, structural similarity, and binding affinity of the predicted mutant models.
| No. | SNP | Ramachan assessment (favored) | ProSA Z score (%) | Tm-score | RMSD | GTP-binding score (kcal/mol) |
|---|---|---|---|---|---|---|
| 1 | G13R | 96.93 | −7.38 | 0.99384 | 0.06 | −12.3 |
| 2 | G13C | 96.93 | −7.36 | 0.99383 | 0.06 | −12 |
| 3 | G13V | 96.32 | −7.38 | 0.99383 | 0.06 | −12.2 |
| 4 | V14G | 96.93 | −7.30 | 0.99384 | 0.06 | −11.7 |
| 5 | A18P | 96.32 | −7.51 | 0.99365 | 0.09 | −11.6 |
| 6 | P34R | 96.32 | −7.10 | 0.99383 | 0.06 | −12 |
| 7 | S39F | 96.93 | −7.29 | 0.99384 | 0.06 | −11.8 |
| 8 | I55R | 96.32 | −7.12 | 0.99365 | 0.09 | −11.7 |
| 9 | D57N | 97.55 | −7.52 | 0.99384 | 0.06 | −11.6 |
| 10 | G60R | 96.32 | −7.32 | 0.99302 | 0.15 | −11.5 |
| 11 | G60E | 96.32 | −7.19 | 0.99344 | 0.11 | −10.6 |
| 12 | Y64D | 96.93 | −7.38 | 0.99383 | 0.06 | −11.6 |
| 13 | R68G | 96.93 | −7.42 | 0.99383 | 0.06 | −11.9 |
| 14 | D119G | 96.93 | −7.35 | 0.99383 | 0.06 | −11.8 |
| 15 | S145L | 97.55 | −7.40 | 0.99374 | 0.07 | −11.1 |
| 16 | V152F | 96.32 | −7.63 | 0.99348 | 0.11 | −12 |
| 17 | R164C | 96.93 | −7.32 | 0.99383 | 0.06 | −11.8 |
FIGURE 13D structure of GTP (red color sticks) docked in NRAS target receptor (cartoon and surface) with Mg2+ ion (green sphere). (A) Native model, (B) mutant model with the highest binding affinity (G13R SNP), and (C) mutant model with the lowest binding affinity (G60E SNP). NRAS cartoon/surface is colored differently according to its constitutive halves: effector lobe (blue) and allosteric lobe (magenta). P-loop (residue range: 10–17), switch-I (residue range: 30–40), and switch-II (residue range: 60–76) for the GTP-binding site. Conserved GTPase motifs reported essential for ligand recognition/binding and Mg2+ coordination are colored differently; GxxxxGK (yellow), DxxG and Thr35 (brown), NKxD (green), as well as ExSAK and Phe28 (cyan). Letters N and C denote the amine- and carboxy-protein terminals, respectively.
FIGURE 2Trajectory analysis for the simulated GTPase NRAS models. Monitored alpha-carbon trajectories of (A) protein’s RMSD, (B) ligand’s RMSD, (C) protein’s Rg, and (D) protein’s SASA were plotted against the entire molecular dynamics simulation timelines (100 ns).
FIGURE 3Relative ΔRMSF analysis for the simulated GTP-bounded NRAS proteins along the whole molecular dynamics simulations. The ΔRMSF values, in reference to protein backbone Cα-atoms, are represented in terms of constituting residue sequence numbers.
Monitored ΔRMSF [Å] for GTPase NRAS complexes along entire molecular dynamics runs.
| Canonical domains composing GTPase-binding/catalytic site | Residues | Model #1 | Model #2 | Model #3 | Model #6 | Model #16 | Native holo |
|---|---|---|---|---|---|---|---|
|
|
| 0.38 | 0.35 | 0.32 | 0.42 | 0.42 | 0.32 |
|
| 0.52 | 0.45 | 0.45 | 0.56 | 0.56 | 0.45 | |
|
| 1.67 | 1.58 | 1.59 | 1.68 | 1.72 | 1.64 | |
|
| 0.60 | 0.55 | 0.56 | 0.62 | 0.66 | 0.60 | |
|
| 0.43 | 0.41 | 0.40 | 0.45 | 0.46 | 0.40 | |
|
| 0.34 | 0.34 | 0.30 | 0.37 | 0.38 | 0.29 | |
|
| 0.35 | 0.36 | 0.30 | 0.40 | 0.37 | 0.33 | |
|
| 0.41 | 0.44 | 0.38 | 0.51 | 0.48 | 0.43 | |
| Switch-I vicinal residues |
| 0.35 |
| 0.34 | 0.42 | 0.27 | 0.38 |
|
| 0.81 | 0.66 | 0.73 | 0.76 | 0.69 | 0.80 | |
|
| 1.04 | 0.93 | 1.06 | 0.98 | 0.94 | 1.09 | |
|
| 1.57 | 1.23 | 1.11 | 1.52 | 1.43 | 1.52 | |
|
| 1.61 | 1.60 | 1.31 | 1.77 | 1.22 | 1.73 | |
| Switch-I |
| 2.88 | 2.53 | 2.23 | 2.73 | 2.83 | 2.90 |
|
| 2.71 | 2.33 | 1.95 | 2.58 | 2.61 | 2.76 | |
|
| 2.00 | 2.38 | 1.65 | 2.87 | 3.07 | 2.94 | |
|
| 1.01 | 1.51 | 0.91 | 2.08 | 2.64 | 1.83 | |
|
| 0.64 | 1.44 | 0.59 | 1.74 | 3.02 | 0.33 | |
|
|
| 1.21 |
| 0.64 | 2.52 |
| |
|
| 0.38 | 1.14 |
| 1.47 | 2.90 |
| |
|
|
|
|
| 0.76 | 2.71 |
| |
|
|
|
|
| 0.55 | 1.41 |
| |
|
|
|
|
| 0.38 | 1.18 |
| |
|
|
|
|
|
|
|
| |
| DxxG motif |
|
| 0.75 | 0.34 | 0.40 | 0.78 |
|
|
| 0.40 | 0.80 | 0.55 | 0.41 | 0.81 |
| |
|
|
| 0.57 | 0.42 |
| 0.74 |
| |
|
| 1.09 | 1.69 | 1.49 | 1.20 | 1.84 | 1.15 | |
| Switch-II |
| 1.71 | 1.79 | 1.95 | 1.95 | 2.17 | 1.45 |
|
| 2.08 | 1.74 | 2.18 | 1.87 | 2.30 | 1.53 | |
|
| 1.97 | 1.72 | 2.00 | 1.34 | 1.74 | 1.56 | |
|
|
|
|
| 0.40 | 1.18 |
| |
|
|
|
|
| 0.54 |
|
| |
|
|
|
|
|
|
|
| |
|
| 0.94 | 0.52 |
| 0.30 | 0.55 | 0.48 | |
|
| 0.45 |
| 0.43 | 0.34 | 0.31 | 0.40 | |
|
| 0.46 | 0.36 | 0.58 | 0.43 | 0.35 | 0.33 | |
|
| 0.46 | 0.56 | 0.50 | 0.57 | 0.33 | 0.47 | |
|
|
| 0.53 | 0.34 | 0.48 | 0.35 | 0.43 | |
|
|
| 0.33 |
| 0.31 | 0.34 | 0.32 | |
|
|
| 0.31 |
|
| 0.33 | 0.30 | |
|
|
| 0.24 |
|
|
|
| |
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
| |
| NKxD motif |
|
|
|
|
|
|
|
|
| 0.32 | 0.32 |
| 0.36 | 0.34 | 0.31 | |
|
|
|
|
| 0.28 |
|
| |
|
| 0.82 | 0.84 | 0.78 | 0.86 | 0.86 | 0.80 | |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
| 0.32 |
|
| |
|
| 0.34 | 0.32 |
| 0.38 |
| 0.33 | |
|
| 0.51 | 0.44 | 0.43 | 0.55 | 0.44 | 0.52 | |
| SNP in model #16 |
|
|
|
|
|
|
|
ΔRMSF ≤ 0.30 Å cut-off are in bold red numbers inferring residues showing significant mobility/flexibility.
FIGURE 4Conformational analysis of simulated GTPase NRAS-GTP complexes at the start and final molecular dynamics timeframes. Superimposed 0 and 100-ns shots of the simulated NRAS complexes; (A) model #1, (B) model #2, (C) model #3, (D) model #6, (E) model #16, (F) native holo protein. Complexes are shown in green and red cartoons respective to the initial and last extracted frames. Ligands (sticks) and SNP residues (lines) are presented in colors corresponding to extracted frames.
The MM-PBSA-calculated total binding-free energy and its constituting energy terms.
| Energy (kJ/mol ± SD) | GTP-NRAS complexes | |||||
|---|---|---|---|---|---|---|
| Model #1 | Model #2 | Model #3 | Model #6 | Model #16 | Native holo | |
| Δ |
|
|
|
|
|
|
| Δ |
|
|
|
|
|
|
| Δ | 316.7 ± 14.2 | 131.5 ± 29.0 | 113.9 ± 28.4 | 336.5 ± 39.1 | 174.5 ± 29.2 | 163.2 ± 12.3 |
| Δ |
|
|
|
|
|
|
| Δ |
|
|
|
|
|
|
FIGURE 5Residue-oriented free binding energies for the simulated GTP-NRAS systems.
Predicted SNP impacts on NRAS protein structure by HOPE server.
| SNP Id | AA change | Amino acid properties | Location/structure | SNP’s impact on the protein |
|---|---|---|---|---|
| rs757968407 | V152F | There is a difference in size as the mutant amino acid is bigger than the wild one. Therefore, the mutant amino acid could not fit as the wild residue is buried in the core | The wild residue is buried in the core. Therefore, the mutant amino acid could not fit due to its bigger size | The mutation affects a domain that is important in binding to other molecules and could disturb its contact with the important domain for the activity of our protein and disturb signal transfer between two domains |
| rs397514553 | P34R | There are differences in charge, size, and hydrophobicity between the mutant residue and the wild one, which could cause repulsions with neighboring residues, disturbance of interactions with protein parts or other molecules, and loss of hydrophobic interactions, respectively | The mutation occurs at a stretch of residues described as a special motif in UniProt, which could suffer from disturbance and loss of function. In addition, the special backbone conformation due to proline rigidity could be disturbed with the mutant residue | The mutation affects a domain that is important in binding to other molecules and could disturb its contact with the important domain for the activity of our protein and disturb signal transfer between two domains |
| rs121434596 | G13V | There is a difference in size as the mutant residue has a bigger size, which could lead to bumps. In addition, the unusual torsion angles by glycine could be lost with disturbance in the local structure, and the local backbone could be forced into incorrect conformation as well | The mutation could lead to a loss of the flexibility of glycine, which may be necessary for the function of the protein | The mutation affects a domain with importance for binding to other molecules. Therefore, this could lead to the disturbance of this function |
| rs121434595 | G13C | There is a difference in size as the mutant amino acid is bigger than the wild one. Therefore, the mutant one could not fit as the wild residue is buried in the core. In addition, the unusual torsion angles by glycine could be lost with disturbance in the local structure, and the local backbone could be forced into incorrect conformation as well | The mutation could lead to a loss of the flexibility of glycine, which may be necessary for the function of the protein | The mutation affects a domain that is important in binding to other molecules and could disturb its contact with the important domain for the activity of our protein and disturb signal transfer between two domains |
| G13R | There is a difference in charge between the mutant and wild residues, and the introduced charge in this buried residue could result in protein folding problems. Moreover, there is a difference in size as the mutant amino acid is bigger than the wild one. Therefore, the mutant one could not fit as the wild residue is buried in the core. In addition, the unusual torsion angles by glycine could be lost with disturbance in the local structure, and the local backbone could be forced into incorrect conformation as well | The mutation could lead to a loss of the flexibility of glycine, which may be necessary for the function of the protein | The mutation affects a domain that is important in binding to other molecules and could disturb its contact with the important domain for the activity of our protein and disturb signal transfer between two domains |
FIGURE 6Illustration of the substitution of wild type amino acid (green colored) by the mutant one (red colored) in different NRAS SNPs (A) G13C. (B) G13R. (C) P34R. (D) V152F.
FIGURE 7Post-translational modifications of NRAS produced by the MusiteDeep server.
FIGURE 8Network of NRAS gene–gene interactions produced by the GeneMANIA tool.