Literature DB >> 35386434

In-Silico and In-Vitro Analysis of Human SOS1 Protein Causing Noonan Syndrome - A Novel Approach to Explore the Molecular Pathways.

Vinoth Sigamani1, Sheeja Rajasingh1, Narasimman Gurusamy1, Arunima Panda2, Johnson Rajasingh1,3,4.   

Abstract

Aims: Perform in-silico analysis of human SOS1 mutations to elucidate their pathogenic role in Noonan syndrome (NS). Background: NS is an autosomal dominant genetic disorder caused by single nucleotide mutation in PTPN11, SOS1, RAF1, and KRAS genes. NS is thought to affect approximately 1 in 1000. NS patients suffer different pathogenic effects depending on the mutations they carry. Analysis of the mutations would be a promising predictor in identifying the pathogenic effect of NS.
Methods: We performed computational analysis of the SOS1 gene to identify the pathogenic nonsynonymous single nucleotide polymorphisms (nsSNPs) th a t cause NS. SOS1 variants were retrieved from the SNP database (dbSNP) and analyzed by in-silico tools I-Mutant, iPTREESTAB, and MutPred to elucidate their structural and functional characteristics.
Results: We found that 11 nsSNPs of SOS1 that were linked to NS. 3D modeling of the wild-type and the 11 nsSNPs of SOS1 showed that SOS1 interacts with cardiac proteins GATA4, TNNT2, and ACTN2. We also found that GRB2 and HRAS act as intermediate molecules between SOS1 and cardiac proteins. Our in-silico analysis findings were further validated using induced cardiomyocytes (iCMCs) derived from NS patients carrying SOS1 gene variant c.1654A>G (NSiCMCs) and compared to control human skin fibroblast-derived iCMCs (C-iCMCs). Our in vitro data confirmed that the SOS1, GRB2 and HRAS gene expressions as well as the activated ERK protein, were significantly decreased in NS-iCMCs when compared to C-iCMCs.
Conclusion: This is the first in-silico and in vitro study demonstrating that 11 nsSNPs of SOS1 play deleterious pathogenic roles in causing NS.
© 2021 Bentham Science Publishers.

Entities:  

Keywords:  Noonan syndrome; SOS1 gene; in-silico analysis; nonsynonymous SNP; pathogenic variants; post-translational modification

Year:  2021        PMID: 35386434      PMCID: PMC8905634          DOI: 10.2174/1389202922666211130144221

Source DB:  PubMed          Journal:  Curr Genomics        ISSN: 1389-2029            Impact factor:   2.689


INTRODUCTION

Noonan syndrome (NS) is an autosomal dominant genetic disorder characterized by short stature, congenital heart disease, bleeding problems, developmental delays, and skeletal malformation. The occurrence of NS is estimated to be between 1:1000 and 1:2500 live births, and it affects more males than females [1, 2]. The molecular defects of NS are related to the altered function of RAS-MAPK signaling caused by a mutation in four main genes - PTPN11, SOS1, RAF1, and KRAS. Among these genes, the PTPN11 is the dominant gene for NS [3, 4]. The Son of Sevenless Homolog 1 (SOS1) is the second dominant gene for NS, and the mutation of this gene causes a distinctive phenotype with keratosis pilaris and curly hair [5]. The substitution of Thr266Lys in SOS1 showed facial dysmorphisms and mild pulmonic stenosis [6]. Studies have shown that in NS patients with germline mutation of SOS1 developed tumors [7]. Individuals with NS display cardiac anomalies, such as non-syndromic pulmonic stenosis, atrial septal defects, and ventricular septal defects [8]. SOS1 is a guanine nucleotide exchange factor, which upregulates the RAS signaling pathway, leading to the changes in human development [9, 10], and also alters RAS and RAC1 pathways [11]. The CIIA act as a negative regulator of RAS-specific guanine nucleotide exchange factor activity of SOS1 [12]. The interaction domains of SOS1/GRB2 control the embryonic stem cell fate during mammalian development [13]. The human genome contains 1.42 million single nucleotide polymorphisms (SNPs). Among these 250,000 - 400,000 SNPs in the protein-coding region, not sequence of the genome are named as non-synonymous SNPs (nsSNPs), which alters the amino acid in their functional protein [14, 15]. About 26-32% of nsSNPs are functionally effective and lead to cause disease by changing post-translational modification (PTM), protein stability and protein-protein interaction [16, 17]. In this study, we computationally analyzed the SOS1 gene to identify the pathogenic nsSNPs responsible for NS. The 3D models of wildtype and mutant SOS1 proteins were analysed. Since NS is known to cause cardiac anomalies, the interactions of SOS1 with other cardiac proteins were studied using STRING, and were experimentally validated in-vitro by mRNA and protein expressions in NS patient-derived induced cardiomyocytes (NS-iCMCs). This is the first in-silico study of the SOS1 variants linked to NS, and to discover the molecular pathways associated with this disease.

MATERIALS AND METHODS

Datamining

The SOS1 variants were retrieved from the NCBI SNP database (dbSNP) (https://www.ncbi.nlm.nih.gov/snp/) that is a public domain archive for a broad collection of single genetic polymorphisms. The protein sequence of SOS1 was retrieved from UniProt (https://www.uniprot.org/), which provides free accessible resources of protein sequence and functional information.

Consequences of Variants

The retrieved variants were analysed by the variant effect predictor (VEP) tool (http://www.ensembl.org/Tools/ VEP) to determine the likely consequences of amino acid substitutions on protein function [18].

Identifying the Most Pathogenic nsSNPs with Noonan Syndrome

The pathogenic nsSNPs of SOS1 were filtered following a previous literature review on NS () on NS. Further, the nsSNPs were analysed by in-silico tools such as dbNSFP, sorting intolerant from tolerant (SIFT), polymorphism phenotyping (PolyPhen), protein variation effect analyser (PROVEAN), functional analysis through hidden markov model (FATHMM), mutation taster, consensus deleteriousness (condel), likelihood ratio test (LRT) (https://www.ensembl.org/Homo_sapiens/Tools/VEP?db=core), single nucleotide polymorphisms and gene ontology (SNP & GO) (http://snps.biofold.org/snps-and- go/snps-and-go.html), and predictor of human deleterious single nucleotide polymorphisms (PhD-SNP) (http://snps. biofold.org/phd-snp/ phd-snp.html). Studies have shown that the SNPs were considered as more pathogenic, when it had been predicted by more than eight in-silico tools [18, 19].

Analysing Protein Stability

The stability of the wild type and mutant SOS1 protein stability was predicted by I-Mutant 2.0 (http://folding.bio- fold.org/i-mutant/i-mutant2.0.html) and iPTREE-STAB, an interpretable decision tree-based method (http://203. 64.84.190:8080/IPTREEr/iptree.htm). I-Mutant 2.0 is a support vector machine-based web server that helps in the automatic prediction of protein stability changes upon single-site mutations by using a data set derived from ProTherm. It predicts the changes in free energy delta-delta-G (DDG), which predicts how a single-point mutation affects protein stability and the results are expressed as a positive or negative value in Kcal/mol. We submitted protein sequence by changing single-site mutations of SOS1 protein to predict the stability of proteins, while the conditions were set at the temperature 25º C and pH 7 [20, 21].

Analysis of Structural and Functional Properties of nsSNPs

The structural and functional properties of pathogenic nsSNPs were analyzed by submitting their amino acid substitution of the protein sequence (FASTA format) in MutPred2 (http://mutpred2.mutdb.org/about.html). It predicts the molecular pathogenicity of amino acid substitution and the altered molecular mechanism affecting the phenotype using the threshold P-value of ≤0.05 [22].

3D Protein Modeling and Visualization

To generate the 3D models of wild type and mutants of SOS1 protein, we submitted amino acid substitution of a protein sequence in the Iterative Threading ASSEmbly Refinement (I-TASSER) (https://zhanglab.ccmb.med.umich.edu/I- TASSER/), which is a hierarchical approach to predict the structure and function of a protein. It identifies the structural templates from the PDB, based on LOMETS approaches, and the function of the target is derived by BioLiP [23-25]. Later, these models were verified and selected based on C-Score, ERRAT score (https://servicesn.mbi.ucla.edu/ ERRAT/) [26], and also verified by Ramachandran Plot Analysis (http://mordred.bioc.cam.ac.uk/~rapper/rampage.php) [27]. Then, the verified structure was visualized on PyMOL 2.2.2, a 3D structure visualization software. The 3D structural effect on the mutation was analyzed by HOPE (http://www.cmbi.ru.nl/hope/input/), which is an automatic mutation analysis server [28]. Furthermore, to extend our structural analysis, TM-score and root mean square deviation (RMSD) between wild type and mutant was calculated using TM-align software (https://zhanglab.ccmb.med.umich.edu/TM-align/) [29].

SOS1 Protein Network

To predict the specific interaction of SOS1 with cardiac proteins, we submitted the protein in the STRING server (https://string-db.org/), which is a database that provides computational direct protein-protein interactions and its indirect functional associations [30].

In-vitro Validation of Protein Network

Finally, the findings from the in-silico gene interactions associated with NS were further validated by in-vitro conditions using induced pluripotent stem cells (iPSC)-derived cardiomyocytes (iCMCs) obtained from normal control individuals (C-iCMCs), NS patient (NS-iCMCs), and their respective parent cells; normal human skin fibroblast (C-SF), NS patient cardiac fibroblasts (NS-CF), and iPSCs derived from C-SF (C-iPSCs); NS-iCMCs, and iPSC-derived from NS-CF (NS-iPSCs) carrying SOS1 gene variant c.1654A>G (SNP_ID.rs137852814) as demonstrated in our previous studies [31, 32]. To perform qRT-PCR, we isolated total RNA from C-SF, C-iPSCs, C-iCMCs, NS-CF, NS-iPSCs and NS- iCMCs using trizol reagent (Ambion by Life Technologies) as described in the manufacturer protocol. Then, the first strand cDNA was synthesized from each RNA sample by using High-Capacity cDNA Reverse Transcription Kit (Applied Biosystems). The gene expressions for the protein-network interacted genes (ACTN2, GATA4, GRB2, HRAS, NKX2.5, TNNT2 and SOS1) were performed using PowerUp™ SYBR™ Green Master Mix (Applied Biosystems). Briefly, a total reaction volume of 10 µl was prepared with the composition of 0.5 µl of cDNA, 1 µl of 5 picomole forward and reverse primer mix, 5 µl of SYBR Green and 3.5 µl of DEPC water. The PCR amplification was carried out in a Quantstudio 6pro (Applied Biosystems) with fast thermal cycling condition, 2 minutes of UDG activation temperature at 50˚C, 2 minutes of dual-lock DNA polymerase temperature at 95˚C and followed with 40 cycles at 95˚C for 1 second and 60˚C for 30 seconds. The relative expression for the target genes was normalized with 18S rRNA as endogenous control. The results are shown in fold change expression, and the values were calculated as the ratio of induced expression to control expression. The mRNA expression was further supported by Western immunoblotting analysis of protein expression as described earlier [33]. GAPDH was used as a protein loading control. The sequence of primers used for qRT-PCR and the antibodies used for the Western immunoblotting are given in Supplementary Table .

Statistical Analysis

The statistical analyses were performed between the mean of each group by Bonferroni’s method of one-way ANOVA using GraphPad Prism 8 software. The P-value <0.05 is considered as statistically significant. All in-vitro experiments were performed with biological triplicates.

RESULTS

Variants of SOS1 Genes

We used NCBI - dbSNP and UniProt were used to retrieve variants of the SOS1 gene and the protein sequence of human SOS1 (ID: Q07889). There was a total of 38137 variants, including 634 deletions, 3767 deletions/insertions, 11 multiple nucleotide variants and 33725 single nucleotide variants in SOS1 gene (Fig. ).

Consequences of SOS1 Variants in Percentages

We have used the VEP tool to find the consequences of SOS1 variants [18]. From a total of 38137 variants, 34058 (89.1%) occurred in the intron; 1814 (4.8%) in upstream and downstream, 1066 (2.8%) in 3' UTR, 688 (1.8%) in nonsynonymous, 320 (0.9%) in synonymous, and 191 (0.5%) in other variant classes were identified Fig. (). The 688 nsSNPs were analysed further to examine their pathogenicity.

Prediction of Most Pathogenic nsSNPs in Noonan Syndrome

Based on the literature, we identified that 16 nsSNPs of SOS1 variants were pathogenic with NS (Table ). The functionality of these 16 nsSNPs was predicted using in-silico tools. The functional prediction and scores obtained from various in-silico tools are given in Supplementary Table . The SIFT score predicts a function of protein affected by amino acid substitution. The score ranges from 0.0 (deleterious) to 1.0 (tolerate), and the score 0.0 to 0.05 considered as deleterious. The PolyPhen score ranges from 0.0 (benign) to 1.0 (deleterious), and the results predicted to be benign, probably or possibly damaging. In the PROVEAN tool, if the score is below the threshold value of -2.5, it is considered as deleterious, and if the score is greater than the threshold, it is considered as neutral. FATHMM tool returns a p-value, and the variant scores ≤ 0.5 are considered deleterious. In Mutation Taster, a larger score is considered as deleterious, and based on the score ranges from 0 to 1; a variant is classified as A (disease-causing automatic), D (disease-causing), N (polymorphism), or P (polymorphism automatic). Condel tool evaluates missense single nucleotide variants and yields a score ranging from 0.0 to 1.0; the value < 0.5 is considered as neutral, and the value >0.5 is considered as deleterious. The score of LRT ranges from 0 to 1, and this can predict the SNP as damaging or neutral based on the rules described by Liu et al. [34]. For the SNPs & GO and PhD-SNP, the score ranges from 0.0 to 1.0; if the value of a variant is >0.5, it is predicted as disease causing, otherwise, it is considered neutral [18, 19]. Based on the results of in-silico analyses (Table and Supplementary Table ), we have chosen 11 SNPs as the most pathogenic predicted based on the number of in-silico tools (8 out of 9 tools) and shown as deleterious (Supplementary Table and Fig. 3A) and the remaining 5 SNPs were predicted as less pathogenic (Fig. ). The specificity of our analysis was confirmed with the cancer-associated SOS1 mutation D309Y [35], which showed pathogenicity in 7 out of 9 in-silico tools in our study (Supplementary Table 2).
Fig. (3)

The pathogenicity of SOS1 nsSNPs predicted by various in-silico tools. (A) The most pathogenic 11 nsSNPs and (B) the less pathogenic 5 SNPs were associated with NS, predicted by various in-silico tools. (A higher resolution / colour version of this figure is available in the electronic copy of the article).

Protein Stability Prediction

We used I-Mutant 2.0 and iPTREE-STAB tools to predict the stability of a protein resulting from the nsSNPs. Both of these tools gave a DGG value of a protein at the pH 7.0 and at the temperature at 25° C. The value of DGG can be positive or negative, and it is interpreted as the increase or decrease in protein stability, respectively [20, 21]. Our results showed that all of the 11 nsSNPs had a decreased protein stability, at least in one of the two tools (Table ). The protein stability scores predicted for the five less pathogenic SOS1 mutants from I-Mutant and iPTREE are given in Table .

Structural and Functional Properties of nsSNPs

The structural and functional properties of nsSNPs were evaluated by MutPred2, which predicted the probability of deleterious mutations and the alterations in molecular mechanism, if it obtains a P-value of ≤ 0.05 [22]. Our results showed that eight nsSNPs, listed in Table , have deleterious mutations with altered mechanisms with a P-value of ≤ 0.05. However, three nsSNP (IDs rs137852814, rs267607079, and rs397517149) had a score of <0.5 and classified as ‘tolerated’ by MutPred2.

3D Modelling and Visualization of SOS1 Protein

We have generated 3D protein models for the wild type and 11 nonsynonymous mutants of SOS1 by using I-TASSER, which produced five models for each variant [23-25]. These models were verified and one model was for each variant based on the model having a minimum C-score, maximum ERRAT score, and the most allowed region on Ramachandran plot [26, 27]. The results of the selected models are shown in Table . Furthermore, the chosen models of wild type and mutants were visualized by PyMOL 2.2.2 software; and the HOPE server [28] was used to mask their amino acid substitution (Fig. ( and Supplementary Fig. ). To extend our structural analysis, we calculated template modeling score (TM-score) and root-mean-square deviation of atomic positions (RMSD) for the 11 nonsynonymous mutant models, and compared them with wild type models [29]. The TM-score shows the topological similarity between wild type and mutant models. All the mutants had a score of > 0.5, which indicated that the models were similar. The RMSD value measures the average distance between the α- carbon backbones of the wild type and mutant models. The higher RMSD value indicated a greater deviation between the wild type and mutant models. The results of TM-align, along with its RMSD score, are shown in Table .

Interactions of SOS1 with Cardiac-Specific Proteins

The interactions of SOS1 with cardiac proteins were predicted by the STRING server [30], which predicted the physical and functional interactions of proteins. SOS1 was found to interact with many cardiac proteins, such as ACTN2, ACTN4, GATA4, NKX2.5, TNNI3 and TNNT2, mainly through GRB2 and HRAS (Fig. ).

In-vitro Validation Using NS-iCMCs in Comparison with C-iCMCs

We verified the mRNA and protein expressions of SOS1 interacting cardiac proteins in NS-iCMCs and compared those levels with C-iCMCs. qRT-PCR analyses have revealed that the mRNA expression of ACTN2, GATA4, TNNT2 (Fig. ) and GRB2, HRAS, SOS1 (Fig. ) were significantly decreased, and NKX2.5 (Fig. ) was significantly increased in NS-iCMCs compared with C-iCMCs. To support these mRNA expression profiles, we performed Western blot analyses, which showed that GATA4, GRB2, HRAS, and SOS1 proteins and ERK1/2 activation were significantly reduced in NS-iCMCs compared with C-iCMCs (Fig. ). SOS1 promotes the formation of active RAS, leading to the activation of RAF-MEK-ERK cascade, which plays a key role in cardiac physiology/pathophysiology and cancer [36-38]. In order to analyze the effect of SOS1 mutation in the downstream effector molecules, we studied the ERK1/2 activation by the phosphorylation of ERK1/2 at the Thr202/Tyr204 by Western blotting. Our results indicate that ERK1/2 activation was significantly decreased in NS-iCMCs compared to C-iCMCs (Figs. ). ERK1/2 were found to be activated in both control and NS parent cells; and interestingly, their activation was found to be higher in NS-iPSCs, compared to control iPSCs (Figs. ). Importantly, our GAPDH loading control showed an equal loading of proteins in all the lanes whereas, the total ERK1/2 expressions were unexpectedly varied among the samples.
Fig. (6)

In-vitro validation of cardiac protein expressions using NS-iCMCs in comparison with C-iCMCs. A-C, qRT-PCR analyses showing the mRNA expressions of (A) A decreased expression of cardiac structural genes ACTN2 and TNNT2, and a cardiac specific regulatory molecule GATA4, *P<0.05 and **P<0.01 C-iCMC vs NS-iCMC; (B) A decreased expression of SOS1 and RAS-MAPK pathway associated genes GRB2 and HRAS *P<0.05 C-iCMC vs NS-iCMC; (C) An increased NKX2.5 gene expression observed in NS-iCMCs when compared to Noonan syndrome (NS) patient-derived cardiac fibroblasts (NS-CF), NS-iPSCs, C-iPSCs and C-iCMCs. Each bar represents the mean ± SEM of three replicated experiments. Each gene expression was normalized with 18S rRNA. *P<0.001 vs C-iCMC. (D) Western blot analyses showing the protein expressions of GATA4, GRB2, HRAS, and SOS1 in NS-CF, NS-iPSCs and NS-iCMCs in comparison with C-SF, C-iPSCs and C-iCMC. GAPDH was used as a protein loading control. (E) Western immunoblotting for the phosphorylation of ERK1/2 at the Thr202/Tyr204 and the ERK1/2. GAPDH was used to identify the equal loading of protein samples. (F) The ratio of the phosphorylated ERK1/2 at the Thr202/Tyr204 and the total ERK1/2 was calculated. The activated ERK is significantly reduced in NS-iCMC when compared to N-iCMC *P<0.05 vs. NS-iCMC, whereas the activated ERK significantly increased in NS-iPSC when compared to C-iPSC #P<0.05 vs. C-iPSC. (A higher resolution / colour version of this figure is available in the electronic copy of the article).

DISCUSSION

Bioinformatic tools are capable of identifying the genetic variations that are associated with a patient’s disease genome. There are several bioinformatic tools available to acquire specific and accurate genetic information that is already present in online databases. Genome-wide association studies (GWAS) can be applied for testing millions of genetic variants and are also examined to identify genotype-phenotype associations across the genomes. These associations provide insights into the disease susceptibility through the identification of disease specific genes and mechanisms [39]. The SNPs in the coding region of human genes were associated with genetic disorders [14]. A large number of SNPs have been reported in the database, and it is difficult to screen all the SNPs for the particular phenotype [40, 41]. The computational analysis tools help to narrow down and examine the pathogenic SNPs for the specific genetic disorders and in minimizing the risk [41, 42]. NS is an autosomal dominant genetic disorder, where the alterations are predominantly present in four genes - PTPN11, SOS1, RAF1, and KRAS. The SOS1 is the second dominant gene for NS [1, 2, 5]. In the human SOS1 gene, a total of 38137 SNPs have been listed in the database. Among these, 688 nsSNPs were involved in the functional protein coding regions. In the present study, various computational tools based on different algorithms were utilized to screen the most pathogenic nsSNPs of the human SOS1 gene to induce NS. A similar approach has been carried out to test the functional nsSNPs of ARHGEF6, BRAF, TAGAP, and UTY gene [41-44]. The 688 nsSNPs of the SOS1 gene were clinically filtered and it was found that 16 nsSNPs were pathogenic with NS [3-13, 45-56]. Using various in-silico tools such as, SIFT, PolyPhen, PROVEAN, FATHMM, Mutation Taster, Condel, LRT, SNPs&GO, and PhD-SNP, we identified that 11 out of 16 nsSNPs, such as p.Met269Arg, (rs137852813), p.Arg552Gly (rs137852814), p.Arg552Ser (rs267607079), p.Trp432Arg (rs267607080), p.Glu433Lys (rs397517147), p.Gly434Arg (rs397517148), p.Ser548Arg (rs397517149), p.Ile437Thr (rs397517150), p.Leu550Pro (rs397517153), p.Arg552Thr (rs397517154), and p.Cys441Tyr (rs727504295) were more pathogenic to induce NS. Protein stability analyses that using I-Mutant 2.0 and iPTREE-STAB revealed that all of the 11 nsSNPs had a decreased protein stability, except rs727504295. Decreased protein stability results in altered protein structure through increased aggregation, degradation, ubiquitination, and misfolding of proteins, leading to initiation of diseases [57-59]. Similarly, the prediction of the altered molecular mechanism by MutPred2 revealed a gain of intrinsic disorder in rs137852813 and rs267607080; gain of acetylation at K427 in rs267607080; altered coiled-coil in rs267607080, rs397517147, and rs397517148; gain of the helix in rs397517147 and rs397517148; altered metal binding in rs397517150 and rs727504295; loss of helix in rs397517153, rs397517154 and rs727504295; loss of proteolytic cleavage at D555 and altered transmembrane protein in rs397517153 and rs397517154; and gain of relative solvent accessibility and strand in rs727504295. Our study results did not reveal the details of the domain or the type of mutation involved with the SOS1 variants. A study has shown that most of the pathogenic SOS1 mutations were clustered in pleckstrin-homology (PH), Dbl-homology (DH) or in the helical linker between PH domain and RAS exchanger motif (REM) domain. Furthermore, the SOS1 variants p.Met269Arg, p.Arg552Gly, p.Arg552Ser, p.Ser548Arg, p.Ile437Thr, p.Leu550Pro, p.Arg552Thr, and p.Cys441Tyr were associated with Class 1 mutation, where they participated in the auto-inhibitory interaction of the DH and REM domains blocking RAS access [8]. The SOS1 variants p.Trp432Arg, p.Glu433Lys and p.Gly434Arg were associated with Class 2 mutation, where they have been shown to modify the surface of the PH domain [8]. The 3D models were generated using I-TASSER for the wild type and mutant SOS1 proteins, and their structural variations were analysed. Similar computational structural analysis have been carried out for various protein’s models like ARHGEF6, BRAF, TAGAP, and UTY [41-44]. Besides, the HOPE tool was utilized to study the amino acid changes between the wild type and mutants. The analysis of interaction of proteins will be helpful to elucidate the network of functional proteomics [60, 61]. In this study, using STRING server, we found that SOS1 interacts with cardiac structural and functional proteins, such as ACTN2, ACTN4, GATA4, NKX2.5, TNNI3 and TNNT2 mainly through interactions with GRB2 and HRAS. ACTN-4 is a newly discovered nonmuscle alpha-actinin isoform that requires further investigation in cardiac physiology/pathophysiology [62]. Our in-vitro study results have demonstrated that NS-iCMCs containing SOS1 gene variant (rs137852814) had a significantly decreased expression of SOS1 mRNA and protein, when compared with normal C-iCMCs. NS-iCMCs also showed decreased mRNA and protein expressions of cardiac specific regulatory molecules GATA4, RAS-MAPK pathway associated GRB2 and HRAS, and decreased mRNA expressions of cardiac structural genes like ACTN2 and TNNT2, when compared with C-iCMCs. In contrast, the mRNA expression level of NKX2.5 was significantly upregulated in NS-iCMCs, when compared with C-iCMCs. The increased expression of NKX2.5 in NS-iCMCs could be associated with cardiac hypertrophy and is in agreement with other published studies [63-65]. This increased expression of cardiac homeobox gene NKX2.5 activated the NPPA/ANF and HAND2 and resulted in the induction of cardiac hypertrophy. Similarly, in our previous study, we have found an increased expression of NPPA and HAND2 in NS-iCMCs, compared to C-iCMCs [32]. SOS1 mutation has been shown to affect the stability of the protein [8], and our study results showed that SOS1 mRNA and protein expressions were decreased in NS-iCMCs relative to the wildtype iCMCs. Our results are in agreement with the other study, where somatic mutations resulted in altered mRNA and protein expressions [66]. Studies have shown that the role of SOS1-mediated RAS-RAF-MEK-ERK cascade plays an important role in preserving cardiac function during physiological or pathological cardiac hypertrophy [36-38, 67, 68]. Although the role of ERK1/2 pathway during cardiac development is not fully understood, our study results indicated that ERK1/2 activation was decreased in NS-iCMCs compared to C-iCMCs, even though the NS-iCMCs had a significant amount of total ERK1/2 and it was mostly present as an inactive form. These results emphasise the importance of ERK1/2 activation in normal cardiac development. On the other hand, suppression of ERK activation facilitated the reprogramming of somatic cells [69], and accordingly, the ERK1/2 activation was reduced in normal C-iPSCs, and increased in NS-iPSCs. This results further indicated the pathophysiology associated with NS. It is also intriguing to argue whether the increased ERK1/2 activation in NS-iPSCs impairs its cardiac differentiation ability. Further studies are needed to explore the role of ERK in NS-iPSCs and NS-iCMCs during differentiation.

CONCLUSION

In conclusion, our in-silico analyses have identified that 11 variants of SOS1 gene nsSNPs were more pathogenic to cause NS. Furthermore, our in-vitro studies using the NS patient-derived iCMCs carrying SOS1 mutation showed a specific pattern of gene expression that results in a reduced expression of genes involved in cardiac development and an increased expression of genes involved in cardiac hypertrophy. However, further studies are required to elucidate the specific roles of SOS1 and its interacting proteins in inducing cardiomyopathy in NS. Our current study results will be helpful for the screening of NS patients with SOS1 mutation for the expression of more pathogenic variants. This will facilitate exploring SOS1 interacting proteins in order to target and thereby discover a possible treatment option for NS.
Table 1

List of Pathogenic nsSNPs associated with NS.

SNP IDcDNA ChangesCDS ChangesAmino Acid ChangesDisordersReferences
rs137852812c.884C>Ac.797C>Ap.Thr266LysNS Type 4, NS, Rasopathy, Gingival fibromatosis 1[3-13, 45-56]
rs137852813c.893T>G/Cc.806T>G/Cp.Met269Arg/ThrNS Type 4, NS, Rasopathy, Inborn genetic diseases
rs137852814c.1741A>Gc.1654A>Gp.Arg552GlyNS Type 4, NS, Rasopathy, Gingival fibromatosis 1
rs267607079c.1743G>T/Cc.1656G>C/Tp.Arg552Ser
rs267607080c.1381T>Cc.1294T>Cp.Trp432ArgNS Type 4, NS
rs397517147c.1384G>Ac.1297G>Ap.Glu433LysNS, Rasopathy
rs397517148c.1387G>Ac.1300G>Ap.Gly434ArgNS, Abnormality of the sternum, Ptosis, Pulmonic stenosis, Short stature, Rasopathy
rs397517149c.1729A>Cc.1642A>Cp.Ser548ArgNS Type 4, NS, Rasopathy, Inborn genetic diseases
rs397517150c.1397T>Cc.1310T>Cp.Ile437ThrNS, Rasopathy
rs397517153c.1736T>Cc.1649T>Cp.Leu550Pro
rs397517154c.1742G>C/Ac.1655G>C/Ap.Arg552Thr/LysNS, NS Type 3, Rasopathy, Abnormality of the aortic valve
rs397517156c.2270A>Tc.2183A>Tp.Lys728IleNS
rs397517159c.2623G>Ac.2536G>Ap.Glu846LysNS Type 4, NS, Rasopathy, Gingival fibromatosis 1
rs397517164c.409G>Ac.322G>Ap.Glu108LysNS, Rasopathy
rs397517180c.1012G>Tc.925G>Tp.Asp309TyrNS
rs727504295c.1409G>Ac.1322G>Ap.Cys441TyrNS, Rasopathy
Table 2

SOS1 mutant proteins stability prediction by I-mutant 2.0 and iPTREE-STAB.

SNP ID Amino Acid Changes I-Mutant 2.0 iPTREE-STAB
DDG Protein Stability DDG Protein Stability
rs137852813p.Met269Arg-0.78Decrease-2.46Decrease
rs137852814p.Arg552Gly-1.17Decrease-0.60Decrease
rs267607079p.Arg552Ser-2.14Decrease-5.10Decrease
rs267607080p.Trp432Arg-1.82Decrease-0.11Decrease
rs397517147p.Glu433Lys-0.81Decrease-1.85Decrease
rs397517148p.Gly434Arg-1.79Decrease-1.77Decrease
rs397517149p.Ser548Arg-0.26Decrease-1.55Decrease
rs397517150p.Ile437Thr-2.15Decrease-5.10Decrease
rs397517153p.Leu550Pro-0.08Decrease-0.60Decrease
rs397517154p.Arg552Thr-1.12Decrease-5.10Decrease
rs727504295p.Cys441Tyr0.83Increase-0.02Decrease
rs137852812p.Thr266Lys-0.28Decrease0.1725Increase
rs397517156p.Lys728Ile0.27Increase-1.8289Decrease
rs397517159p.Glu846Lys-0.82Decrease-1.3925Decrease
rs397517164p.Glu108Lys-0.41Decrease-0.6080Decrease
rs397517180p.Asp309Tyr-0.77Decrease-0.4600Decrease
Table 3

The protein stability scores predicted for the five less pathogenic SOS1 mutants by I-mutant 2.0 and iPTREE-STAB.

SNP ID Amino Acid Changes I-Mutant iPTREE-STAB
DDG Protein Stability DDG Protein Stability
rs137852812p.Thr266Lys-0.28Decrease0.1725Increase
rs397517156p.Lys728Ile0.27Increase-1.8289Decrease
rs397517159p.Glu846Lys-0.82Decrease-1.3925Decrease
rs397517164p.Glu108Lys-0.41Decrease-0.6080Decrease
rs397517180p.Asp309Tyr-0.77Decrease-0.4600Decrease
Table 4

Prediction of altered molecular mechanisms of SOS1 nsSNPs by mutPred2.

SNP ID Amino Acid Changes MutPred2 Score Molecular Mechanisms with P-Values <= 0.05
Prediction Probability P-Value
rs137852813p.Met269Arg0.782Gain of Intrinsic disorder0.30.05
rs137852814p.Arg552Gly0.464---
rs267607079p.Arg552Ser0.447---
rs267607080p.Trp432Arg0.896Gain of Intrinsic disorder0.410.0075
Gain of Acetylation at K4270.30.0039
Altered Coiled coil0.280.02
rs397517147p.Glu433Lys0.528Altered Coiled coil0.530.004
Gain of Helix0.270.04
rs397517148p.Gly434Arg0.573Gain of Helix0.280.02
Altered Coiled coil0.250.02
rs397517149p.Ser548Arg0.349---
rs397517150p.Ile437Thr0.644Altered Metal binding0.230.04
rs397517153p.Leu550Pro0.849Loss of Helix0.330.0016
Loss of Proteolytic cleavage at D5550.210.0017
Altered Transmembrane protein0.160.01
rs397517154p.Arg552Thr0.519Loss of Helix0.290.01
Loss of Proteolytic cleavage at D5550.20.0024
Altered Transmembrane protein0.150.01
rs727504295p.Cys441Tyr0.922Loss of Helix0.330.0016
Gain of Relative solvent accessibility0.280.02
Gain of Strand0.270.02
Altered Metal binding0.260.02
Table 5

Structural verification scores for the wild type and selected mutant models of SOS1.

SNP ID Amino Acid Changes C - Score ERRAT Value Ramachandran Plot TM-Score RMSD (Å)
Favoured Region Allowed Region Outlier Region
Wild Type - -2.482.45978.40%12.50%9.10%--
rs137852813p.Met269Arg-2.4996.64175.40%12.70%11.90%0.78230.55
rs137852814p.Arg552Gly-2.2385.83275.40%12.50%12.10%0.77650.47
rs267607079p.Arg552Ser-2.585.14677.20%11.30%11.50%0.94511.29
rs267607080p.Trp432Arg-2.1684.78377.50%12.90%9.60%0.78560.49
rs397517147p.Glu433Lys-2.1584.63378.10%11.40%10.40%0.94481.37
rs397517148p.Gly434Arg-2.1181.55975.80%12.80%11.30%0.77760.48
rs397517149p.Ser548Arg-2.2184.62177.80%10.40%11.80%0.78210.48
rs397517150p.Ile437Thr-2.3485.00878.30%11.80%9.90%0.94651.26
rs397517153p.Leu550Pro-2.3586.70279.10%10.80%10.10%0.93191.25
rs397517154p.Arg552Thr-1.9581.09575.40%11.60%13.10%0.77090.51
rs727504295p.Cys441Tyr-2.6784.44176.40%13.20%10.40%0.78640.5
  68 in total

1.  In silico analysis of Single Nucleotide Polymorphisms (SNPs) in human BRAF gene.

Authors:  Muhammad Ramzan Manwar Hussain; Noor Ahmad Shaik; Jumana Yousuf Al-Aama; Hani Z Asfour; Fatima Subhani Khan; Tariq Ahmad Masoodi; Muhammad Akhtar Khan; Nazia Sultana Shaik
Journal:  Gene       Date:  2012-07-21       Impact factor: 3.688

2.  A map of human genome sequence variation containing 1.42 million single nucleotide polymorphisms.

Authors:  R Sachidanandam; D Weissman; S C Schmidt; J M Kakol; L D Stein; G Marth; S Sherry; J C Mullikin; B J Mortimore; D L Willey; S E Hunt; C G Cole; P C Coggill; C M Rice; Z Ning; J Rogers; D R Bentley; P Y Kwok; E R Mardis; R T Yeh; B Schultz; L Cook; R Davenport; M Dante; L Fulton; L Hillier; R H Waterston; J D McPherson; B Gilman; S Schaffner; W J Van Etten; D Reich; J Higgins; M J Daly; B Blumenstiel; J Baldwin; N Stange-Thomann; M C Zody; L Linton; E S Lander; D Altshuler
Journal:  Nature       Date:  2001-02-15       Impact factor: 49.962

Review 3.  Noonan syndrome: a review.

Authors:  H M Mendez; J M Opitz
Journal:  Am J Med Genet       Date:  1985-07

4.  Identification and Characterization of Oncogenic SOS1 Mutations in Lung Adenocarcinoma.

Authors:  Diana Cai; Peter S Choi; Maya Gelbard; Matthew Meyerson
Journal:  Mol Cancer Res       Date:  2019-01-11       Impact factor: 5.852

5.  Verification of protein structures: patterns of nonbonded atomic interactions.

Authors:  C Colovos; T O Yeates
Journal:  Protein Sci       Date:  1993-09       Impact factor: 6.725

6.  Clinical and molecular characterization of 40 patients with Noonan syndrome.

Authors:  Giovanni Battista Ferrero; Giuseppina Baldassarre; Angelo Giovanni Delmonaco; Elisa Biamino; Elena Banaudi; Claudio Carta; Cesare Rossi; Margherita Cirillo Silengo
Journal:  Eur J Med Genet       Date:  2008-07-17       Impact factor: 2.708

7.  Protein structure analysis of mutations causing inheritable diseases. An e-Science approach with life scientist friendly interfaces.

Authors:  Hanka Venselaar; Tim A H Te Beek; Remko K P Kuipers; Maarten L Hekkelman; Gert Vriend
Journal:  BMC Bioinformatics       Date:  2010-11-08       Impact factor: 3.169

8.  Improvement of cardiac function in mouse myocardial infarction after transplantation of epigenetically-modified bone marrow progenitor cells.

Authors:  Johnson Rajasingh; Jayakumar Thangavel; Mohammad R Siddiqui; Ignatius Gomes; Xiao-pei Gao; Raj Kishore; Asrar B Malik
Journal:  PLoS One       Date:  2011-07-22       Impact factor: 3.240

9.  COFACTOR: an accurate comparative algorithm for structure-based protein function annotation.

Authors:  Ambrish Roy; Jianyi Yang; Yang Zhang
Journal:  Nucleic Acids Res       Date:  2012-05-08       Impact factor: 16.971

10.  I-TASSER server: new development for protein structure and function predictions.

Authors:  Jianyi Yang; Yang Zhang
Journal:  Nucleic Acids Res       Date:  2015-04-16       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.