| Literature DB >> 35788771 |
Abstract
The mammalian/mechanistic target of rapamycin (mTOR) protein is an important growth regulator and has been linked with multiple diseases including cancer and diabetes. Non-synonymous mutations of this gene have already been found in patients with renal clear cell carcinoma, melanoma, and acute lymphoid leukemia among many others. Such mutations can potentially affect a protein's structure and hence its functions. In this study, therefore, the most deleterious SNPs of mTOR protein have been determined to identify potential biomarkers for various disease treatments. The aim is to generate a structured dataset of the mTOR gene's SNPs that may prove to be an asset for the identification and treatment of multiple diseases associated with the target gene. Both sequence and structure-based approaches were adopted and a wide variety of bioinformatics tools were applied to analyze the SNPs of mTOR protein. In total 11 nsSNPs have been filtered out of 2178 nsSNPs along with two non-coding variations. All of the nsSNPs were found to destabilize the protein structure and disrupt its function. While R619C, A1513D, and T1977R mutations were shown to alter C alpha distances and bond angles of the mTOR protein, L509Q, R619C and N2043S were predicted to disrupt the mTOR protein's interaction with NBS1 protein and FKBP1A/rapamycin complex. In addition, one of the non-coding SNPs was shown to alter miRNA binding sites. Characterizing nsSNPs and non-coding SNPs and their harmful effects on a protein's structure and functions will enable researchers to understand the critical impact of mutations on the molecular mechanisms of various diseases. This will ultimately lead to the identification of potential targets for disease diagnosis and therapeutic interventions.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35788771 PMCID: PMC9255762 DOI: 10.1371/journal.pone.0270919
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.752
List of web-based bioinformatics tools used in the study for the identification of most deleterious nsSNPs along with their description, input parameters, and accuracy rate.
| Prediction tool | Description | Input parameters | Accuracy rate |
|---|---|---|---|
| SIFT | Predicts nsSNP impact on protein function based on sequence homology and the physical properties of amino acids | chromosome positions (coordinates and orientations) and alleles/ dbSNP rsIDs | 76.99% |
| PolyPhen2 | Predicts nsSNP impact on protein structure and function based on sequence, phylogenetic, and structural features | Protein identifier and amino acid substitutions | 75.56% |
| PROVEAN | Predicts nsSNP impact on protein function based on an alignment-based scoring method | Protein query sequence and amino acid variations | 79.19% |
| Mutation Assessor | Predicts nsSNP impact on protein function based on evolutionary conservation of the affected amino acid in protein homologs | Query protein ID and variant | 78.15% |
| SNAP2 | Predicts functional effects of nsSNPs based on a neural network involving evolutionary information, predicted secondary structure, and solvent accessibility | Query protein sequence | 82% |
| SuSPect | Predicts phenotypic effects of nsSNPs using a support vector machine (SVM) method integrating sequence-, structure- and systems biology-based features | UniProt IDs and amino acid variations | 82% |
| PhD-SNP | Predicts pathogenicity of nsSNPs by an SVM-based method using sequence and profile information | Query protein sequence and amino acid variations | 78% |
| PMut | Predicts pathology of mutations based on a neural network and sequence conservation information | Protein UniProt ID and amino acid variations | 80% |
| CADD | Predicts deleteriousness of nsSNPs using a machine learning model based on sequence context, gene model annotations, evolutionary constraint, epigenetic measurements, and functional predictions. | Chromosomal coordinates and allele information | - |
| Meta-SNP | Detects disease-associated nsSNPs by integrating four methods: PANTHER, PhD-SNP, SIFT and SNAP | Query protein sequence and amino acid variations | 79% |
Fig 1A flowchart depicting the entire study plan along with the tools that have been applied in the study.
Fig 2Retrieval and Identification of most deleterious nsSNPs.
A) Retrieval of nsSNPs from three different databases: dbSNP, ClinVar, and DisGenet. The numbers in brackets indicate how many nsSNPs were retrieved from each database and how many overlapped between different databases. B) Identification of the most potentially harmful and pathogenic nsSNPs through bioinformatics analyses. The numbers in the bracket after each tool represent how many deleterious nsSNPs resulted from the previous analysis overlapped with the results from the current analysis. After 10 different tools were applied, eleven nsSNPs were selected as potentially most harmful.
The predictions and scores of the most deleterious nsSNPs determined by 10 bioinformatics tools along with their dbSNP IDs if available.
| AA variation | dbSNP ID | SIFT Score | PolyPhen2 | PROVEAN | Mutation Assessor | SNAP2 | SuSPect | PhD-SNP | PMut | CADD (PHRAD) | Meta-SNP (score) |
|---|---|---|---|---|---|---|---|---|---|---|---|
| L509Q | 28730691 | 0.023 | 0.997 | -4.7 | medium | 20 | 84 | disease | 0.58 | 29.6 | 0.53 |
| R619C | 199712134 | 0 | 1 | -7.8 | high | 45 | 97 | disease | 0.76 | 31 | 0.83 |
| D944V | - | 0 | 0.977 | -5.9 | medium | 54 | 58 | disease | 0.59 | 28.9 | 0.75 |
| Y1151C | 151082401 | 0.017 | 0.983 | -6.3 | Medium | 57 | 87 | disease | 0.64 | 24.8 | 0.78 |
| R1161G | 202197441 | 0.002 | 0.976 | -6.3 | medium | 46 | 72 | disease | 0.64 | 25.8 | 0.69 |
| K1452N | - | 0 | 0.981 | -4.1 | medium | 21 | 57 | disease | 0.80 | 23.5 | 0.68 |
| A1513D | 374529391 | 0.001 | 0.999 | -4.1 | medium | 40 | 86 | disease | 0.75 | 28.6 | 0.70 |
| E1610K | 199612643 | 0.022 | 0.996 | -3.5 | medium | 37 | 43 | disease | 0.59 | 32 | 0.55 |
| R1616C | 17848545 | 0 | 1 | -5.4 | medium | 20 | 83 | disease | 0.67 | 32 | 0.69 |
| T1977R | - | 0 | 1 | -5.2 | medium | 59 | 66 | disease | 0.80 | 28.9 | 0.72 |
| N2043S | 371511548 | 0.005 | 1 | -3.9 | medium | 16 | 75 | disease | 0.75 | 25.7 | 0.56 |
Fig 3Conservancy and solvent accessibility analysis of selected amino acid positions within mTOR protein.
Position of the most deleterious SNPs along with their relative surface accessibility predicted by NetSurfP-2.0 with a threshold of 25% which represents SNPs having >25% RSA are predicted to be exposed on the protein surface. The conservation state of the SNP positions predicted by ConSurf is represented with color codes.
The effect of the nsSNPs on mTOR protein’s stability as determined by I-Mutant 3.0 and MUpro and on structural features of the protein as determined by HOPE.
| I-Mutant 3.0 | MUpro | Summary of HOPE report | |||||
|---|---|---|---|---|---|---|---|
| Amino acid variation | ΔΔG value (Kcal/mol) | ΔΔG value (Kcal/mol) | Affects size and charge | Affects hydrophobicity | Disrupt hydrogen bond/Salt bridge | Interferes with protein function | Interferes with other protein interaction |
| L509Q | -1.86 | -1.84 | Affects size | Yes | No | Yes | No |
| R619C | -0.93 | -0.66 | Yes | Yes | Disrupts hydrogen bond | Yes | Yes |
| D944V | -0.26 | -0.55 | Yes | Yes | Disrupts salt bridge | Yes | Yes |
| Y1151C | -1.15 | -0.12 | Affects size | Yes | Disrupts hydrogen bond | Yes | No |
| R1161G | -1.46 | -1.61 | Yes | Yes | Disrupts salt bridge | Yes | Yes |
| K1452N | -0.17 | -0.91 | Yes | No | Disrupts hydrogen bond and salt bridge | Yes | Yes |
| A1513D | -0.85 | -0.99 | Yes | Yes | No | Yes | No |
| E1610K | -0.29 | -1.25 | Yes | No | Disrupts salt bridge | Yes | Yes |
| R1616C | -0.72 | -0.56 | Yes | Yes | Disrupts hydrogen bond and salt bridge | Yes | Yes |
| T1977R | -0.23 | -0.67 | Yes | Yes | Disrupts hydrogen bond | Yes | No |
| N2043S | -0.53 | -1.12 | Affects size | Yes | Disrupts hydrogen bond | Yes | Yes |
Fig 4Structural images of the nine nsSNPs created by the HOPE report.
Images of the modified structures of mTOR protein were generated using Project HOPE. The whole protein is pictured in grey colour, the side chain of the wild-type residue is indicated in green and of the mutant residue is shown in red. In A, B, C, E, F, and I, the mutated residue which is in red is smaller than the wild-type residue presented in green. On the other hand, in D, G, and H the mutant residues are bigger than the wild type residues. All of these structural changes will disrupt the protein’s shape, conformation, and function. Note: (A) D944V (B) Y1151C (C) R1161G (D) E1610K (E) R1616C (F) K1452N (G) A1513D (H) T1977R and (I) N2043S.
Association study of the high-risk nsSNPs with different types of cancer obtained from Cscape, cBioPortal and canSAR Black databases.
| Amino acid variation | Cscape | cBioPortal | canSAR Black |
|---|---|---|---|
| L509Q | 0.92 | ||
| R619C | 0.86 | Renal clear cell carcinoma, melanoma, uterine endometrioid carcinoma | Uterine endometrial |
| D944V | 0.91 | ||
| Y1151C | 0.88 | Lung adenocarcinoma | Lung |
| R1161G | 0.85 | Brain | |
| K1452N | 0.58 | Kidney | |
| A1513D | 0.91 | Uterine endometrioid carcinoma | Uterine endometrial |
| E1610K | 0.92 | Prostate adenocarcinoma, colorectal adenocarcinoma | |
| R1616C | 0.78 | Cutaneous melanoma | |
| T1977R | 0.95 | Acute lymphoid leukemia, lung adenocarcinoma, renal clear cell carcinoma, colorectal carcinoma | Lymphoma, Leukemia, Uterine endometrial, Kidney, Prostate |
| N2043S | 0.86 |
(Yellow values denote low-confidence oncogenic predictions and red values denote high-confidence oncogenic predictions made by Cscape)
Structure validation of the 3D protein models of mTOR protein and TM-score and RMSD values of mutated mTOR and wild-type mTOR.
| Protein model | Template and sequence identity (%) | SWISS Model Ramachandran favored region | QMEAN score | MolProbity score | ERRAT score | TM-score | RMSD value |
|---|---|---|---|---|---|---|---|
| Wt | 6zwm.1.A (100) | 92.93% | -2.01 | 1.33 | 92.0821 | - | - |
| L509Q | 6zwm.1.A (99.96) | 93.56% | -1.96 | 1.27 | 92.4473 | 0.9747 | 0.045 |
| R619C | 6zwm.1.A (99.96) | 93.13% | -2.02 | 1.45 | 91.63 | 0.9768 | 0.094 |
| D944V | 6zwm.1.A (99.96) | 93.13% | -2.05 | 1.3 | 92.719 | 0.9728 | 0.011 |
| Y1151C | 6zwm.1.A (99.96) | 93.13% | -1.99 | 1.31 | 91.7826 | 1.0000 | 0.087 |
| R1161G | 6zwm.1.A (99.96) | 92.93% | -2.04 | 1.3 | 92.1816 | 1.0000 | 0.052 |
| K1452N | 6zwm.1.A (99.96) | 92.89% | -2.06 | 1.39 | 91.9983 | 1.0000 | 0.047 |
| A1513D | 6zwm.1.A (99.96) | 93.20% | -2.05 | 1.41 | 89.8113 | 0.98 | 0.139 |
| E1610K | 6zwm.1.A (99.96) | 92.97% | -2.03 | 1.33 | 92.1207 | 1.0000 | 0.011 |
| R1616C | 6zwm.1.A (99.96) | 92.97% | -2.10 | 1.43 | 92.4204 | 1.0000 | 0.062 |
| T1977R | 6zwm.1.A (99.96) | 93.44% | -2.10 | 1.37 | 90.3725 | 0.9800 | 0.123 |
| N2043S | 6zwm.1.A (99.96) | 93.40% | -2.22 | 1.39 | 91.288 | 0.9671 | 0.078 |
Alteration of H bond interactions and their lengths due to amino acid substitutions.
| Protein model | Wild type | Altered | ||
|---|---|---|---|---|
| H bond | Bond length (Å) | H bond | Bond length (Å) | |
| R619C | R619-H615 | 3.294 | C619-His615 | 3.238 |
| R619-A623 | 3.06 | C619-A623 | 2.964 | |
| A1513D | A1513-Q1509 | 3.006 | D1513-Q1509 | 3.073 |
| A1513-A1517 | 2.897 | D1513-A1517 | 2.810 | |
| T1977R | T1977-I1973 | 3.005 | R1977-I1973 | 3.113 |
Fig 5Domains and interaction sites within the mTOR protein.
The positions of the final 11 nsSNPs are shown within mTOR protein with respect to the NBS1 interacting domain, FAT domain, PI3K/PI4K catalytic domain, and FATC domain. The domains are shown in grey horizontal bars and the nsSNPs are depicted by blue arrows. The exons (58 exons) are shown by green vertical bars while the introns are shown by green horizontal lines. The scaling of the exon positions and the domains is approximate.