Literature DB >> 35966807

Screening of Pathogenic Missense Single Nucleotide Variants From LHPP Gene Associated With the Hepatocellular Carcinoma: An In silico Approach.

Malik Siddique Mahmood^1,2, Maryam Afzal¹, Hina Batool³, Amara Saif³, Tahreem Aqdas¹, Naeem Mahmood Ashraf⁴, Mahjabeen Saleem¹.

Abstract

LHPP gene encodes a phospholysine phosphohistidine inorganic pyrophosphate phosphatase, which functions as a tumor-suppressor protein. The tumor suppression by this protein has been confirmed in various cancers, including hepatocellular carcinoma (HCC). LHPP downregulation promotes cell growth and proliferation by modulating the PI3K/AKT signaling pathway. This study identifies potentially deleterious missense single nucleotide variants (SNVs) associated with the LHPP gene using multiple computational tools based on different algorithms. A total of 4 destabilizing mutants are identified as L22P, I212T, G227R, and G236R, from the conserved region of the phosphatase. The 3-dimensional (3D) modeling and structural comparison of variants with the native protein reveals significant structural and conformational variations after mutations, suggesting disruption in the function of phospholysine phosphohistidine inorganic pyrophosphate phosphatase. The identified mutations might, therefore, participate in the cause of HCC.

Entities: Chemical

Keywords: Hepatocellular carcinoma; LHPP; phospholysine phosphohistidine inorganic pyrophosphate phosphatase; single nucleotide variants; tumor-suppressor protein

Year: 2022 PMID： 35966807 PMCID： PMC9373111 DOI： 10.1177/11779322221115547

Source DB: PubMed Journal: Bioinform Biol Insights ISSN： 1177-9322

Introduction

Hepatocellular carcinoma (HCC) is among the most assertive form of liver morbidities worldwide. The mortality rates associated with the HCC makes it, the fourth leading cause of cancer deaths globally, with a significant load on the developing countries.[2,3] Hepatocellular carcinoma is a multifactorial disorder with variable etiological agents distributed in different geographical regions. The primary underlying factors include persistent liver cirrhosis, long-term liver diseases, diabetes mellitus, obesity, aflatoxins, hepatitis B, and C viruses. Despite the remarkable advances in therapeutics, the late diagnosis of this condition results in the continuous expansion in a disease incidence and mortality. Therefore, detailed knowledge of HCC initiation and prognosis is essential for developing the early diagnosis and treatment strategies for this type of cancer. Recently, the contribution of many genetic loci in the incidence of HCC has been well established, including 1q21, which harbors potential tumor-suppressor and oncogenes. In addition, many genetic elements are known to implicate in various cancers. LHPP is a tumor-suppressor gene encoding inorganic phosphatase, which negatively correlates with the cell cycle and metastasis. The overexpression of LHPP suppresses the expression of oncogenes, which suggests an anti-cancerous effect of this gene. The down-regulation of LHPP is therefore reported to induce hepato-carcinogenesis in humans.[6-8] Till now, the single nucleotide variations in LHPP are well characterized with multiple disorders, including rs35936514 for major depressive disorders (MDDs), rs34997829 for alcohol-dependent risky sexual behaviors, and rs201982221 for oropharyngeal carcinomas.[9-12] The single nucleotide variants (SNVs) ensuring in the coding regions of the proteins affect the functional integrity of the protein and, therefore, increase the susceptibility toward many diseases, including cancer. The screening of SNVs associated with specific phenotypes is a point of concern as it requires comprehensive testing of the mutated gene. A possible solution is prioritizing the mutations based on their functional characteristics using computational tools. The in silico approaches offer significant advantages over the experimental methods in terms of speed, reliability, convenience, and cost. This study is designed for the computational screening of the missense SNVs from the LHPP gene, regulating the structure, function, and stability of its protein. Furthermore, the impact of pathogenic mutations on the structure of the protein is evaluated via geometrical simulations. This study would be a significant addition to the existing literature by revealing the association between LHPP mutations and HCC. The study would, therefore, contribute to the development of early diagnostic and management strategies for this disease.

Methodology

Retrieval of protein sequence and SNVs

The amino acid sequence of the LHPP was retrieved from the National Center for Biotechnology and Information (NCBI). The missense SNVs reported for this protein were retrieved using Ensemble dbSNP.

Screening of the potentially deleterious missense SNVs through different in silico algorithms

To evaluate the functional impact of selected missense SNVs, a diverse set of 19 prediction tools based on sequence homology, machine learning, sequence to structure, and consensus-derived algorithms, were used. The selection of multiple algorithms and multiple computational tools from each algorithm helped to screen the potentially deleterious SNVs involved in the prognosis of HCC. The missense mutations commonly marked as deleterious by computational tools based on distinct algorithms were considered, for the downstream analysis, excluding all other SNVs.

Sequence homology-based approaches

All missense mutations, retrieved from Ensemble dbSNP, were first evaluated using 4 sequence homology-based tools; SIFT, PROVEAN, Mutation Accessor, and PANTHER. Computational tools based on sequence homology algorithm identify the significantly pathogenic mutations based on their alignment with the known pathogenic mutations. For screening, the pathogenic mutations from SIFT, a prediction score of less than 0.05, was applied. Likewise, in PROVEAN and PANTHER delta alignment score ⩽ −2.5 and substitution position-specific evolutionary conservation (subPSEC) score ⩾ −3 was considered.[18-22] The missense mutations, commonly marked as deleterious via 4 homology-based computational tools, were selected for further analysis.

Machine learning–based approaches

The pathogenic missense mutations identified using sequence homology-based computational tools were further evaluated by the 7 computational tools based on the machine learning algorithm. These computational tools include; SNAP2, SAAP, MutPred, SusPect, PMut, SNP&GO, and PhD-SNP servers. All these computational tools utilize random forest, artificial neural network (ANN), and support vector machine (SVM) to classify the nonsynonymous single nucleotide variants (nsSNVs) into deleterious or tolerant substitutions. In SNAP2, the prediction score ranging from +1 to +100 indicates the deleterious missense mutations. Similarly, the prediction scores of >50, was applied in SusPect and SNP & GO. The screening of missense mutations from the PMut and MutPred was carried out at the cut-off value of >0.5 and 0.8, respectively.[23-29] Again, the missense SNVs, commonly marked as deleterious via supervised learning approaches, were selected for the next step of the analysis.

Sequence to structure-based approaches

PolyPhen-2, Site-Directed Mutator (SDM), PoPMuSiC, and Fold-X are the 4 computational tools, which were used for the evaluation of all missense SNVs selected from the sequence homology and machine learning–based algorithms. These tools consider the sequence or structural parameters for screening the disease-causing variations. Among these, PolyPhen-2 makes binary predictions, with 0 indicating the neutral substitutions and 1 to the deleterious substitutions.[30-33]

Consensus-based approaches

The further assortment of nsSNVs was performed, using 4 consensus-based computational tools, that is, Condel, Meta-SNP, PON-P2, and Predict-SNP methods. These computational tools integrate multiple algorithms to determine the potentially deleterious point mutations from neutral mutations.[34-37]

Evolutionary conservation analysis of deleterious missense SNVs

The amino acid substitutions in the evolutionarily conserved regions of the proteins can alter the protein stability, folding, and structure. Therefore, after the screening of missense mutations from the combination of sequence homology, machine learning, and consensus-based approaches, the mutations present in the evolutionarily conserved regions of the protein were traced via the ConSurf server. This tool identifies the evolutionarily conserved mutations by multiple sequence alignments with the homologous sequences. The conservation scores range from 1 (extremely variable) to 9 (highly conserved).

Structural modeling and active site analysis

For structural comparison of the native and mutated protein models, the 3D structure of the LHPP protein was retrieved from protein data bank (PDB) using PDB-ID of 2X4D. However, the 3D models of the proteins with selected SNVs were built using Fold-X. The protein models were validated based on the Ramachandran plot that were designed using PROCHECK server.[39-41] As the mutations altering the amino acid residues in the active site of the protein are significantly deleterious, therefore, after structural modeling, the CASTp server was used to analyze whether the selected mutations are the part of active protein or not.

Comparison of native and mutated protein models

The native and mutated protein models were compared using Discovery Studio that provides extensive insight into the protein structure and folding.[32,43]

Geometric simulations analysis

The deviation of a mutated confirmation from the native structure impacts the functional integrity of the respective protein. The 3D structures of native and mutated LHPP proteins were therefore compared using geometric structural simulations to identify the extent of fluctuations in the structural conformation after mutation. The geometric simulation approach, using NMSim, was used to predict the biologically related conformational transitions in the mutated LHPP protein models via different parameters including, root mean square deviation (RMSD), root mean square fluctuation (RMSF), radius of gyration (Rg), and polar surface area (PSA).

Molecular dynamics simulations

The molecular dynamics (MD) simulation analysis of the wild-type and mutated protein model was performed to assess the dynamic stability and structural features of the mutated proteins compared to the wild-type protein. The root mean square deviations of the mutated and wild-type proteins were checked for the period of 20 ns using VMD software.

Results

The protein sequence of the LHPP protein (Accession ID: NP_071409.3) contains 270 amino acids. For this protein, a total of 40,862 SNVs, are reported in the Ensemble dbSNP (Figure 1). Most of these variants were non-coding including; 35,821 variants in intron region, 30 in 5′-UTR, and 166 in 3′-UTR. Likewise, coding variants included 238 mutations as missense, 108 as synonymous, 166 splice acceptor, and 166 splice donor. In addition, there were 2 start-loss and 13 stop-gain variants (Figure 1). Since the missense SNVs are the main benefactors behind rare genetic disorders, therefore, this study considered only 238 missense SNVs for further analysis.

Figure 1.

Distribution frequency of SNVs in the LHPP gene. Most variants are present in noncoding regions: 35,821 in introns, 166 in 3′ UTR, and 30 in 5′ UTR. In coding regions, most of the SNVs are missense (238), then comes synonymous (108). Other 347 mutations are present in a very low-distribution frequency including: splice acceptor and splice donor are in the same amount (166), along with the start lost (2) and stop-gained (13) variants. SNV indicates single nucleotide variant.

Screening of potentially deleterious missense SNVs through different in silico methods

Out of 238 missense SNVs associated with LHPP, the pathogenic missense mutations were screened by using the combination of multiple in silico algorithms. The variants commonly marked as deleterious by all prediction methods were selected, ignoring the neutral substitutions at each step. The selected mutations were likely to affect the functions of the candidate protein. These computational algorithms are listed below in detail.

Sequence homology–based approaches

First, the evaluation of selected missense mutations via SIFT, PROVEAN, Mutation Assessor, and PANTHER, resulted in the screening of 87, 116, 140, and 152 variants as deleterious, respectively. Finally, a total of 52 missense mutations commonly marked as pathogenic by all these homology-based computational tools and therefore were selected for the next step (Supplemental File 1). The analysis of 52 variants, selected from the sequence homology algorithm, via the computational tools based on machine learning algorithms (SNAP2, SAAP, MutPred, SusPect, PMut, SNP & GO, and PhD-SNP), resulted in the exploration of 13 missense variants commonly by all these tools. Among these, SNAP2, SAAP, MutPred, SusPect, PhD-SNP, PMut, and SNP & GO individually predicted 42, 41, 45, 20, 39, and 29 missense variants as pathogenic, respectively. The selected 13 missense variants were common in the prediction results of all the above-mentioned tools (Supplemental File 2). PolyPhen-2, SDM, PoPMuSiC, and Fold-X are the protein sequence and structure-based protein stability predictors, which further evaluated the 13 missense SNVs. Among these, PolyPhen-2, SDM, and PoPMuSiC evaluated 12 variants as deleterious, whereas Fold-X predicted only 9 mutations as pathogenic. However, 6 missense mutations, that is, L22P, G27R, L91P, I212T, G227R, and G236R were nominated for the next step because of their combined predictions as deleterious by all the above-mentioned tools (Supplemental File 3).

Consensus-based methods

Furthermore, the 6 missense SNVs with the consensus-based in silico tools, Condel, Meta-SNP, PON-P2, and Predict-SNP, also assorted the pathogenic missense mutations. Three of these tools, Condel, Meta-SNP, and Predict-SNP, labeled all 6 variants as deleterious while, PON-P2 marked 1 variant (G27R) as neutral (Supplemental File 4). Thus, the 5 missense mutations, that is, L22P, G227R, L91P, I212T, and G236R were nominated from the LHPP protein, as pathogenic mutations. All these mutations were likely to associate with the cause of HCC (Figure 2).

Figure 2.

Evaluation of missense SNVs by sequence homology-based, supervised learning–based, sequence to structure-based, and consensus-based algorithms. First, 52 missense SNVs are labeled as pathogenic by sequence homology–based tools. Out of these 52 deleterious missense SNVs, 13 were predicted as damaging by supervised learning–based tools, 6 by sequence to structure-based tools, and 5 by consensus-based tools. SNV indicates single nucleotide variant. ConSurf web server highlights the evolutionarily conserved regions from the LHPP protein using the empirical Bayesian approach. This server marked 4 missense SNVs, that is, L22P, I212T, G227R, and G236R, to be present in the evolutionarily conserved regions of the LHPP protein. The 4 selected missense mutations were likely to have a substantial impact on the structure and function of the protein and therefore were filtered for further analysis (Table 1).

Table 1.

The evolutionarily conservation analysis of deleterious missense mutations from LHPP using ConSurf web server.

Variant ID	Amino acid position	Conservation scores	Normalized values	ConSurf prediction
rs754022892	L22P	8	−0.814	Highly conserved
rs766371253	L91P	6	−0.427	Average
rs199534407	I212T	8	−0.770	Highly conserved
rs142386969	G227R	9	−1.172	Highly conserved
rs759928988	G236R	9	−1.356	Highly conserved

The evolutionarily conservation analysis of deleterious missense mutations from LHPP using ConSurf web server. The 3D structures of 4 mutants were built using Fold-X and compared with the native model of LHPP protein (PDB ID = 2X4D). The Ramachandran plot showed that more than 90% residues of each of these models reside in the most favored regions of the plot indicating the validity of these models (Additional file 5). The CASTp analysis showed that all 4 mutations were the part of active protein (Figure 3).

Figure 3.

Active site analysis of LHPP protein using CASTp server: (A) the active portion of protein is indicated in red and (B) the G227, L22, I212, and G236 are the parts of active site of protein.

Active site analysis of LHPP protein using CASTp server: (A) the active portion of protein is indicated in red and (B) the G227, L22, I212, and G236 are the parts of active site of protein. The comparative modeling suggested the altered profile of the molecular interactions among the native and mutated models. In L22P, amino acid proline replaced the nonpolar leucine at position 22. Likewise, in I212T, a polar amino acid threonine was substituting the nonpolar amino acid isoleucine at position 212. In 2 other variants, G227R and G236R, a positively charged arginine replaced a small nonpolar glycine at 227 and 236 positions, respectively. The amino acid substitutions in these mutants resulted in the significant alterations in the molecular interactions, which were responsible for the decreased stability of mutants and their contributions to the pathogenicity mechanism (Table 2).

Table 2.

The structural comparison of native and mutated LHPP models. The molecular interactions of native residues are shown in blue while the red color designate the mutated residue.

Mutations	Wild-type models	Mutated models
L22P
I212T
G227R
G236R

The structural comparison of native and mutated LHPP models. The molecular interactions of native residues are shown in blue while the red color designate the mutated residue. The conformational and geometrical deviations among wild-type LHPP protein and its 4 potentially pathogenic mutants were analyzed by using a web-based server, NMSims. The comparison was based on RMSD, RMSF, the Rg, and PSA of the protein models. The RMSD value of the native LHPP model was 3.1 Å, while I212T, L22P, G227R, and G236R mutants were having RMSD values of 2.7, 3.5, 3.29, and 3.0 Å, respectively. Mutant L22P showed maximum deviations from the native model (Figure 4A).

Figure 4.

Trajectory analysis for native LHPP and its mutants: (A) RMSD, (B) RMSF, (C) Rg, and (D) PSA of the variants are identified by different colored trajectories showed that the L22P variant displayed higher variations from its wild-type structure. RMSD indicates root mean square deviation; RMSF, root mean square fluctuation; Rg, radius of gyration; PSA, polar surface area. Moreover, the RMSF values for L22P, I212T, G227R, and G236R mutants were 4.05, 2.64, 4.7, and 3.2 Å, respectively, compared to the native LHPP conformation, 2.4 Å. The RMSF values indicated that L22P and G227R have significantly fluctuated models. G227R presented an extensive fluctuation range from 4.75 to 1.27 Å (Figure 4B). Furthermore, Rg evaluated the spatial packing, compactness, and geometrical size of the respective protein. The higher the Rg, higher would be the compactness of the mutated model. The calculated Rg values for L22P, I212T, G227R, and G236R, were 17.8, 17.3, 17.4, and 17.84 Å, respectively. However, the native protein was having Rg value of 17.92 Å, suggesting decreased compactness of all mutated models (Figure 4C). Polar surface area is the sum of protein’s surface area having polar atoms (O, H). The PSA analysis of the mutant structures revealed a slight deviation between the native and the mutants. However, G227R showed higher divergence (5625.41-6686.85 Å) in this property when compared with other variants. The values for L22P, I212T, and G236R lied within the range of native ensemble (Figure 4D).

MD simulations

The NMSim-based RMSD analysis of wild-type LHPP protein with the 4 destabilizing revealed L22P as the most conformationally deviated mutant compared to the native model. To confirm this 20 ns MD, simulations of these 2 proteins were performed, which indicate higher fluctuations in RMSD values of L22P indicating it an unstable structure (Figure 5). Therefore, the substitution of leucine with proline at 22 positions of LHPP proteins, is likely to have a significant effect on the conformation of this protein, making this mutant an important pathogenic factor for HCC.

Figure 5.

The MD simulation analysis for native LHPP protein and L22P mutant. The L22P mutant showed significant variations in RMSD values compared to the native protein. MD indicates molecular dynamics; RMSD, root mean square deviation.

Discussion

Hepatocellular carcinoma is among the most pervasive forms of liver cancer that initiates gradually after the long-term inflammations. The correlation between genetic polymorphisms in multiple cancer-mediated genes and HCC initiation needs to be established. LHPP gene encodes histidine phosphatase protein, which is a tumor-suppressor protein but the unregulated histidine-phosphorylation of LHPP expected to have oncogenic outcomes.[6,9] Currently, various computational tools are available to investigate the mutations that might induce alterations in structure, folding, conformation, or stability of the proteins and hence contribute to numerous genetic diseases. This study employs multiple in silico tools based on different algorithms for the screening of potentially deleterious missense SNVs from the conserved regions of LHPP protein. The predicted variants are likely to associate with the incidence of HCC in various populations. The study starts with the selection of 238 missense SNVs, reported for the LHPP in dbSNP. The selected missense SNVs were sequentially analyzed using multiple computational algorithms to predict their pathogenic impacts (Figure 1). To increase the prediction accuracy, more than one computational tools from each algorithm were considered. Among these algorithms, the first was a sequence homology–based approach, which employs sequence homology information to mark the effect of a particular substitution on the protein function.[30,48] The missense mutations marked as deleterious by the sequence homology algorithm that were further filtered by using the machine learning algorithm, which marked the pathogenic substitutions based on statistical values. The next approach was the sequence to the structure-based approach, which evaluated the protein’s sequence and structural features while classifying the missense mutations as neutral or deleterious.[36,49] The further evaluation of the missense SNVs via the consensus-based approach offered an accurate and robust functional assessment as compared to the individual prediction methods. The application of these multiple algorithms helped to eliminate the false positive hits from the analysis, with the selection of 5 potentially pathogenic missense SNVs (Table 1). The evolutionary conservation analysis through ConSurf marked 4 missense substitutions, that is, L22P, I212T, G227R, and G236R, to be part of the evolutionarily conserved regions of LHPP protein. These evolutionarily conserved mutations might increase the risk of cancer by deregulating the histidine phosphatase (Table 2). Fold-X marked all these 4 mutations as destabilizing mutations for the LHPP protein. As protein stability is essential for maintaining the conformation and functionality of a protein, and variations in the protein stability can cause misfolding of the proteins, resulting in the structural and functional disruption.[51-53] Furthermore, the active site analysis showed that selected mutations occur in the active site of protein and therefore may have significant contributions in altering the protein’s function (Figure 3). Moreover, the comparative analysis of the native and mutated protein revealed an altered pattern of molecular interactions, with the neighboring residues inducing detrimental effects on the protein. The mutant I212T retained H-bonding with the same residues, but after mutation, the alkyl interactions with leucine and alanine became distorted. The resulting unbound residues in the mutated model might attract other amino acids causing conformational distortions (Table 2). In L22P mutant, the substitution of aliphatic leucine with aromatic proline imparts rigidity to the polypeptide chain by imposing certain torsion angels. Moreover, the formation of tertiary amide and unfavorable bumps interactions might break the alpha helices and beta sheets that can affect the protein-protein or protein-ligand interactions.[54,55] In G227R and G236R mutants, nonpolar amino acid glycine was substituted by a polar and charged amino acid; arginine resulted in the development of hydrogen bonding and various additional interactions in the mutated models (Table 2). These deformed or distorted interactions of all mutated models could cause structural destabilization in one way or the other and would ultimately disturb the enzymatic function of the LHPP protein. Geometric simulation analysis determines the stability and functionality of any protein by generating conformational trajectories. The results are represented in terms of RMSD, RMSF, Rg, and PSA, among which RMSD and RMSF are the main parameters for determining the protein’s stability. The RMSD of the 4 mutants marked that L22P is a significantly destabilizing mutant with the highest deviations in the RMSD values. The RMSD fluctuations in the other mutants were as follows, L22P > I212T > G227R > G236R (Figure 4A). Likewise, high RMSF was observed in mutant G227R (Figure 4B). The other 2 structural parameters used to evaluate LHPP stability including Rg and PSA. Furthermore, the computed Rg and PSA values of the mutants were not significantly divergent from the native LHPP protein. The mutant G236R displayed the least divergence behavior from the wild-type LHPP protein. The other 3 variants, L22P, I212T, and G227R, were suggested to have more pronounced structural and functional effects (Figure 4). Among the 4 mutants, L22P (LHPP, rs754022892) presented considerable deviation and rigidity as compared to the native LHPP protein. This divergence might be because of the distortion in the secondary conformation and folding of the protein.[15,44] The same results were also confirmed by the MD simulation analysis (Figure 5). Hence, these findings suggest a significant decrease in the stability of the LHPP protein by L22P mutant, which might contribute to the pathogenesis of HCC by inducing tumorigeneses. Thus, more research based on static model analysis and MD simulations is required to unveil the contribution of LHPP mutant in the pathogenesis of HCC.

Conclusion

The computational pipeline employed in this study identifies 4 potentially deleterious missense mutations from the evolutionarily conserved regions of phospholysine phosphohistidine inorganic pyrophosphate phosphatase. The comparison of these mutants with the native protein model reveals significant alterations in the interaction profile after mutation. These mutations might, therefore, lead to a protein instability and contributes to HCC. All these mutants need further experimental validation via wet-laboratory practices. Click here for additional data file. Supplemental material, sj-docx-5-bbi-10.1177_11779322221115547 for Screening of Pathogenic Missense Single Nucleotide Variants From LHPP Gene Associated With the Hepatocellular Carcinoma: An In silico Approach by Malik Siddique Mahmood, Maryam Afzal, Hina Batool, Amara Saif, Tahreem Aqdas, Naeem Mahmood Ashraf and Mahjabeen Saleem in Bioinformatics and Biology Insights Click here for additional data file. Supplemental material, sj-xlsx-1-bbi-10.1177_11779322221115547 for Screening of Pathogenic Missense Single Nucleotide Variants From LHPP Gene Associated With the Hepatocellular Carcinoma: An In silico Approach by Malik Siddique Mahmood, Maryam Afzal, Hina Batool, Amara Saif, Tahreem Aqdas, Naeem Mahmood Ashraf and Mahjabeen Saleem in Bioinformatics and Biology Insights Click here for additional data file. Supplemental material, sj-xlsx-2-bbi-10.1177_11779322221115547 for Screening of Pathogenic Missense Single Nucleotide Variants From LHPP Gene Associated With the Hepatocellular Carcinoma: An In silico Approach by Malik Siddique Mahmood, Maryam Afzal, Hina Batool, Amara Saif, Tahreem Aqdas, Naeem Mahmood Ashraf and Mahjabeen Saleem in Bioinformatics and Biology Insights Click here for additional data file. Supplemental material, sj-xlsx-3-bbi-10.1177_11779322221115547 for Screening of Pathogenic Missense Single Nucleotide Variants From LHPP Gene Associated With the Hepatocellular Carcinoma: An In silico Approach by Malik Siddique Mahmood, Maryam Afzal, Hina Batool, Amara Saif, Tahreem Aqdas, Naeem Mahmood Ashraf and Mahjabeen Saleem in Bioinformatics and Biology Insights Click here for additional data file. Supplemental material, sj-xlsx-4-bbi-10.1177_11779322221115547 for Screening of Pathogenic Missense Single Nucleotide Variants From LHPP Gene Associated With the Hepatocellular Carcinoma: An In silico Approach by Malik Siddique Mahmood, Maryam Afzal, Hina Batool, Amara Saif, Tahreem Aqdas, Naeem Mahmood Ashraf and Mahjabeen Saleem in Bioinformatics and Biology Insights

52 in total

Screening of Pathogenic Missense Single Nucleotide Variants From LHPP Gene Associated With the Hepatocellular Carcinoma: An In silico Approach.

Introduction

Methodology

Retrieval of protein sequence and SNVs

Screening of the potentially deleterious missense SNVs through different in silico algorithms

Sequence homology-based approaches

Machine learning–based approaches

Sequence to structure-based approaches

Consensus-based approaches

Evolutionary conservation analysis of deleterious missense SNVs

Structural modeling and active site analysis

Comparison of native and mutated protein models

Geometric simulations analysis

Molecular dynamics simulations

Results

Screening of potentially deleterious missense SNVs through different in silico methods

Sequence homology–based approaches

Consensus-based methods

MD simulations

Discussion

Conclusion

1. SNP characteristics predict replication success in association studies.

Review 2. Hepatitis-related hepatocellular carcinoma: Insights into cytokine gene polymorphisms.

3. SDM: a server for predicting effects of mutations on protein stability.

4. The FoldX web server: an online force field.

5. In silico analysis of single nucleotide polymorphism (SNP) in human TNF-α gene.

6. Unraveling the Deleterious Effects of Cancer-Driven STK11 Mutants Through Conformational Sampling Approach.

7. PhD-SNPg: a webserver and lightweight tool for scoring single nucleotide variants.

8. PMut: a web-based tool for the annotation of pathological variants on proteins, 2017 update.

Review 9. FoldX as Protein Engineering Tool: Better Than Random Based Approaches?

10. CASTp 3.0: computed atlas of surface topography of proteins.