| Literature DB >> 34921182 |
Shu Wen Lim1, Kennet JunKai Tan1, Osman Mohd Azuraidi2, Maran Sathiya3, Ee Chen Lim1, Kok Song Lai4, Wai-Sum Yap5, Nik Abd Rahman Nik Mohd Afizan6.
Abstract
MYB proteins are highly conserved DNA-binding domains (DBD) and mutations in MYB oncoproteins have been reported to cause aberrant and augmented cancer progression. Identification of MYB molecular biomarkers predictive of cancer progression can be used for improving cancer management. To address this, a biomarker discovery pipeline was employed in investigating deleterious non-synonymous single nucleotide polymorphisms (nsSNPs) in predicting damaging and potential alterations on the properties of proteins. The nsSNP of the MYB family; MYB, MYBL1, and MYBL2 was extracted from the NCBI database. Five in silico tools (PROVEAN, SIFT, PolyPhen-2, SNPs&GO and PhD-SNP) were utilized to investigate the outcomes of nsSNPs. A total of 45 nsSNPs were predicted as high-risk and damaging, and were subjected to PMut and I-Mutant 2.0 for protein stability analysis. This resulted in 32 nsSNPs with decreased stability with a DDG score lower than - 0.5, indicating damaging effect. G111S, N183S, G122S, and S178C located within the helix-turn-helix (HTH) domain were predicted to be conserved, further posttranslational modifications and 3-D protein analysis indicated these nsSNPs to shift DNA-binding specificity of the protein thus altering the protein function. Findings from this study would help in the field of pharmacogenomic and cancer therapy towards better intervention and management of cancer.Entities:
Mesh:
Substances:
Year: 2021 PMID: 34921182 PMCID: PMC8683427 DOI: 10.1038/s41598-021-03624-x
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
High risk nsSNPs identified in MYB family genes by in silico tools.
| SNP ID | AA change | PROVEAN | SIFT | PolyPhen-2 | SNPs&GO | PhD-SNP | |||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Preda | Sc | Predb | Sc | Effectc | Sc | Predd | RI | Prede | RI | ||
| rs1302072057 | R73L | Del | − 3.80 | Dmg | 0 | Pro.dmg | 1 | Disease | 4 | Disease | 7 |
| rs1302072057 | R73Q | Del | − 6.65 | Dmg | 0 | Pro.dmg | 1 | Disease | 5 | Disease | 8 |
| rs866246271 | C78Y | Del | − 10.45 | Dmg | 0 | Pro.dmg | 0.999 | Disease | 7 | Disease | 9 |
| rs1246761830 | K84E | Del | − 3.80 | Dmg | 0.001 | Pro.dmg | 0.998 | Disease | 5 | Disease | 4 |
| rs1231582413 | P94R | Del | − 8.55 | Dmg | 0.001 | Pos.dmg | 0.608 | Disease | 3 | Disease | 1 |
| rs1776412940 | W95L | Del | − 12.35 | Dmg | 0 | Pro.dmg | 1 | Disease | 4 | Disease | 7 |
| rs1361650612 | G111S | Del | − 5.69 | Dmg | 0 | Pro.dmg | 1 | Disease | 5 | Disease | 7 |
| rs1316378738 | A158E | Del | − 4.74 | Dmg | 0 | Pos.dmg | 0.577 | Disease | 7 | Disease | 6 |
| rs1583273308 | R176Q | Del | − 3.80 | Dmg | 0 | Pro.dmg | 1 | Disease | 6 | Disease | 8 |
| rs1179275735 | N183S | Del | − 4.58 | Dmg | 0 | Pro.dmg | 1 | Disease | 7 | Disease | 5 |
| rs1335964521 | R191Q | Del | − 3.75 | Dmg | 0 | Pro.dmg | 1 | Disease | 2 | Disease | 5 |
| rs1444007668 | K192E | Del | − 3.75 | Dmg | 0 | Pro.dmg | 0.999 | Disease | 7 | Disease | 6 |
| rs1777011832 | D286Y | Del | − 6.18 | Dmg | 0 | Pro.dmg | 1 | Disease | 4 | Disease | 7 |
| rs1247338239 | W406C | Del | − 5.30 | Dmg | 0.001 | Pro.dmg | 0.985 | Disease | 0 | Disease | 5 |
| rs1247579811 | P574S | Del | − 6.02 | Dmg | 0 | Pro.dmg | 1 | Disease | 0 | Disease | 5 |
| rs775717051 | A594P | Del | − 3.37 | Dmg | 0.001 | Pro.dmg | 1 | Disease | 1 | Disease | 6 |
| rs756830286 | G603S | Del | − 4.53 | Dmg | 0.007 | Pro.dmg | 1 | Disease | 0 | Disease | 5 |
| rs1367098628 | R68C | Del | − 7.60 | Dmg | 0 | Pro.dmg | 1 | Disease | 3 | Disease | 7 |
| rs767355502 | R68H | Del | − 4.75 | Dmg | 0 | Pro.dmg | 0.996 | Disease | 1 | Disease | 6 |
| rs1472109411 | P83T | Del | − 7.60 | Dmg | 0 | Pos.dmg | 0.811 | Disease | 3 | Disease | 3 |
| rs766676175 | G122S | Del | − 5.69 | Dmg | 0.002 | Pro.dmg | 0.975 | Disease | 3 | Disease | 5 |
| rs768073245 | R156W | Del | − 6.55 | Dmg | 0.002 | Pro.dmg | 1 | Disease | 2 | Disease | 5 |
| rs1299187617 | R160H | Del | − 4.74 | Dmg | 0 | Pro.dmg | 0.990 | Disease | 2 | Disease | 5 |
| rs866260709 | S175Y | Del | − 3.62 | Dmg | 0 | Pro.dmg | 1 | Disease | 0 | Disease | 2 |
| rs1281804000 | R185Q | Del | − 3.30 | Dmg | 0.003 | Pro.dmg | 0.978 | Disease | 0 | Disease | 3 |
| rs1809689475 | E265K | Del | − 3.34 | Dmg | 0.005 | Pro.dmg | 0.957 | Disease | 2 | Disease | 6 |
| rs1225867137 | M276T | Del | − 4.79 | Dmg | 0.006 | Pro.dmg | 0.992 | Disease | 2 | Disease | 6 |
| rs1808814297 | P512R | Del | − 6.44 | Dmg | 0 | Pro.dmg | 1 | Disease | 1 | Disease | 6 |
| rs1361362325 | C514W | Del | − 5.98 | Dmg | 0.002 | Pro.dmg | 1 | Disease | 1 | Disease | 6 |
| rs770108773 | A562E | Del | − 3.54 | Dmg | 0.001 | Pro.dmg | 1 | Disease | 0 | Disease | 6 |
| rs777095803 | A565P | Del | − 2.90 | Dmg | 0.003 | Pro.dmg | 1 | Disease | 1 | Disease | 7 |
| rs777150665 | G596R | Del | − 3.07 | Dmg | 0.001 | Pro.dmg | 1 | Disease | 0 | Disease | 5 |
| rs1281394929 | G721E | Del | − 5.73 | Dmg | 0.007 | Pro.dmg | 1 | Disease | 5 | Disease | 6 |
| rs748449655 | L57R | Del | − 4.37 | Dmg | 0.001 | Pro.dmg | 0.999 | Disease | 8 | Disease | 2 |
| rs1228232756 | R64C | Del | − 6.64 | Dmg | 0 | Pro.dmg | 1 | Disease | 8 | Disease | 5 |
| rs867195152 | G102D | Del | − 6.10 | Dmg | 0 | Pro.dmg | 0.995 | Disease | 8 | Disease | 6 |
| rs1164247754 | R116Q | Del | − 3.35 | Dmg | 0.022 | Pro.dmg | 1 | Disease | 8 | Disease | 6 |
| rs1323182096 | R124C | Del | − 7.06 | Dmg | 0 | Pro.dmg | 1 | Disease | 8 | Disease | 7 |
| rs1300383239 | N127D | Del | − 4.52 | Dmg | 0.006 | Pro.dmg | 0.971 | Disease | 2 | Disease | 4 |
| rs1295676923 | G166V | Del | − 8.00 | Dmg | 0 | Pos.dmg | 0.946 | Disease | 8 | Disease | 7 |
| rs968286439 | R167K | Del | − 2.63 | Dmg | 0 | Pro.dmg | 0.999 | Disease | 7 | Disease | 5 |
| rs1271670254 | D169G | Del | − 5.87 | Dmg | 0 | Pro.dmg | 1 | Disease | 7 | Disease | 7 |
| rs1438994955 | S178C | Del | − 4.01 | Dmg | 0 | Pro.dmg | 1 | Disease | 3 | Disease | 3 |
| rs781229138 | G188C | Del | − 6.21 | Dmg | 0.003 | Pro.dmg | 0.966 | Disease | 6 | Disease | 5 |
| rs1171631148 | E552V | Del | − 5.52 | Dmg | 0.001 | Pro.dmg | 1 | Disease | 7 | Disease | 2 |
| rs776972688 | G530R | Del | − 5.97 | Dmg | 0.001 | Pro.dmg | 1 | Disease | 7 | Disease | 2 |
| rs779332836 | G669R | Del | − 6.13 | Dmg | 0 | Pro.dmg | 1 | Disease | 6 | Disease | 3 |
| rs776117094 | R682W | Del | − 5.78 | Dmg | 0 | Pro.dmg | 1 | Disease | 5 | Disease | 1 |
AA amino acid, Pred prediction, TI tolerance index, Sc score, Del deleterious, Dmg damaging, Pro.dmg probably damaging, Pos.dmg possibly damaging, RI reliability index.
aPROVEAN: Del (Sc < − 2.5).
bSIFT: Dmg (Sc ≤ 0.05).
cPolyPhen-2: Pos.dmg (0.453 ≤ Sc ≤ 0.956), Pro.dmg (0.957 ≤ Sc ≤ 1.0).
dSNPs&GO: Disease (Probability > 0.5).
ePhD-SNP: Disease (Probability > 0.5).
Predictions of high risk nsSNPs in MYB oncoproteins by PMut, I-Mutant 2.0, and ConSurf.
| Protein | nsSNP ID | Mutation | PMut | I-Mutant 2.0 | ConSurf | ||||
|---|---|---|---|---|---|---|---|---|---|
| Score and percentage | Predictiona | Stability | RI | DDG (kcal/mol) | Conservation scoreb | Prediction | |||
| MYB | rs1302072057 | R73L | 0.86 (91%) | Disease | Decrease | 9 | − 0.90 | 9 | Highly conserved and exposed (f) |
| rs1302072057 | R73Q | 0.73 (87%) | Disease | Decrease | 9 | − 1.38 | 9 | Highly conserved and exposed (f) | |
| rs866246271 | C78Y | 0.79 (89%) | Disease | Decrease | 2 | − 0.07 | 9 | Highly conserved and buried (s) | |
| rs1246761830 | K84E | 0.52 (79%) | Disease | Decrease | 1 | − 0.30 | 9 | Highly conserved and exposed (f) | |
| rs1231582413 | P94R | 0.72 (86%) | Disease | Decrease | 7 | − 0.42 | 9 | Highly conserved and exposed (f) | |
| rs1776412940 | W95L | 0.86 (91%) | Disease | Decrease | 7 | − 0.99 | 9 | Highly conserved and exposed (f) | |
| rs1361650612 | G111S | 0.80 (89%) | Disease | Decrease | 9 | − 1.03 | 9 | Highly conserved and exposed (f) | |
| rs1316378738 | A158E | 0.82 (90%) | Disease | Decrease | 7 | − 1.28 | 9 | Highly conserved and buried (s) | |
| rs1583273308 | R176Q | 0.86 (91%) | Disease | Decrease | 8 | − 0.51 | 9 | Highly conserved and exposed (f) | |
| rs1179275735 | N183S | 0.82 (90%) | Disease | Decrease | 4 | − 0.04 | 9 | Highly conserved and exposed (f) | |
| rs1335964521 | R191Q | 0.77 (88%) | Disease | Decrease | 9 | − 1.13 | 9 | Highly conserved and exposed (f) | |
| rs1444007668 | K192E | 0.55 (80%) | Disease | Decrease | 5 | − 0.95 | 9 | Highly conserved and exposed (f) | |
| rs1777011832 | D286Y | 0.64 (84%) | Disease | Increase | 2 | 0.11 | 2 | Exposed | |
| rs1247338239 | W406C | 0.65 (84%) | Disease | Decrease | 6 | − 1.49 | 7 | Buried | |
| rs1247579811 | P574S | 0.63 (84%) | Disease | Decrease | 9 | − 1.01 | 9 | Highly conserved and exposed (f) | |
| rs775717051 | A594P | 0.65 (84%) | Disease | Increase | 2 | − 1.01 | 9 | Highly conserved and buried (s) | |
| rs756830286 | G603S | 0.50 (82%) | Neutral | Decrease | 6 | − 1.52 | 9 | Highly conserved and exposed (f) | |
| MYBL1 | rs767355502 | R68C | 0.39 (86%) | Neutral | Decrease | 3 | − 0.73 | 9 | Highly conserved and exposed (f) |
| rs767355502 | R68H | 0.86 (91%) | Disease | Decrease | 7 | − 1.31 | 9 | Highly conserved and exposed (f) | |
| rs1472109411 | P83T | 0.81 (89%) | Disease | Decrease | 4 | 0.08 | 9 | Highly conserved and exposed (f) | |
| rs766676175 | G122S | 0.76 (88%) | Disease | Decrease | 7 | − 0.52 | 9 | Highly conserved and exposed (f) | |
| rs768073245 | R156W | 0.82 (90%) | Disease | Decrease | 7 | − 1.04 | 9 | Highly conserved and exposed (f) | |
| rs1299187617 | R160H | 0.77 (88%) | Disease | Decrease | 9 | − 1.17 | 9 | Highly conserved and exposed (f) | |
| rs866260709 | S175Y | 0.76 (88%) | Disease | Increase | 2 | − 0.64 | 9 | Highly conserved and buried (s) | |
| rs1281804000 | R185Q | 0.67 (85%) | Disease | Decrease | 9 | − 1.44 | 4 | Highly conserved and exposed (f) | |
| rs1809689475 | E265K | 0.63 (83%) | Disease | Decrease | 5 | − 0.38 | 6 | Exposed | |
| rs1225867137 | M276T | 0.32 (89%) | Neutral | Decrease | 5 | − 0.62 | 9 | Exposed | |
| rs1808814297 | P512R | 0.86 (91%) | Disease | Decrease | 9 | − 1.25 | 6 | Highly conserved and exposed (f) | |
| rs1361362325 | C514W | 0.63 (84%) | Disease | Decrease | 5 | − 1.06 | 9 | Buried | |
| rs770108773 | A562E | 0.78 (88%) | Disease | Decrease | 5 | − 0.76 | 9 | Highly conserved and buried (s) | |
| rs777095803 | A565P | 0.53 (80%) | Disease | Increase | 0 | − 1.85 | 8 | Highly conserved and exposed (f) | |
| rs777150665 | G596R | 0.53 (80%) | Disease | Decrease | 7 | − 1.87 | 9 | Highly conserved and exposed (f) | |
| rs1281394929 | G721E | 0.82 (90%) | Disease | Increase | 2 | − 0.27 | 9 | Highly conserved and exposed (f) | |
| MYBL2 | rs748449655 | L57R | 0.66 (85%) | Disease | Decrease | 6 | − 0.76 | 9 | Highly conserved and buried (s) |
| rs1228232756 | R64C | 0.78 (88%) | Disease | Decrease | 4 | − 0.66 | 9 | Highly conserved and exposed (f) | |
| rs867195152 | G102D | 0.86 (91%) | Disease | Decrease | 5 | − 1.45 | 9 | Highly conserved and exposed (f | |
| rs1164247754 | R116Q | 0.88 (92%) | Disease | Decrease | 7 | 0.13 | 9 | Highly conserved and exposed (f) | |
| rs1323182096 | R124C | 0.89 (92%) | Disease | Decrease | 6 | 0.41 | 9 | Highly conserved and exposed (f) | |
| rs1300383239 | N127D | 0.85 (91%) | Disease | Decrease | 6 | − 0.73 | 9 | Highly conserved and exposed (f) | |
| rs1295676923 | G166V | 0.88 (92%) | Disease | Increase | 0 | − 0.89 | 9 | Highly conserved and exposed (f) | |
| rs968286439 | R167K | 0.87 (91%) | Disease | Decrease | 9 | − 0.98 | 9 | Highly conserved and exposed (f) | |
| rs1271670254 | D169G | 0.79 (89%) | Disease | Decrease | 6 | − 1.04 | 9 | Highly conserved and exposed (f) | |
| rs1438994955 | S178C | 0.53 (80%) | Disease | Decrease | 2 | − 1.33 | 9 | Highly conserved and exposed (f) | |
| rs781229138 | G188C | 0.80 (89%) | Disease | Decrease | 8 | − 2.28 | 5 | Exposed | |
| rs1171631148 | G530R | 0.76 (88%) | Disease | Decrease | 4 | − 1.69 | 9 | Highly conserved and exposed (f) | |
| rs776972688 | E552V | 0.71 (86%) | Disease | Increase | 2 | 0.08 | 9 | Highly conserved and exposed (f) | |
| rs779332836 | G669R | 0.82 (90%) | Disease | Decrease | 7 | − 1.24 | 9 | Highly conserved and exposed (f) | |
| rs776117094 | R682W | 0.82 (90%) | Disease | Decrease | 5 | 0.65 | 9 | Highly conserved and exposed (f) | |
RI reliability index, DDG free energy change value, (f) predicted functional residue (highly conserved and exposed), (s) predicted structural residue (highly conserved and buried).
aPMut: Disease (Score > 0.5), Neutral (Score ≤ 0.5).
bConSurf: highly variable (1), highly conserved (9).
Figure 1Putative PTM sites and high risk nsSNPs in MYB family proteins. (a) MYB had 90 common phosphorylation sites (Ser:56, Thr:29, Tyr:5), of which 22 were functional sites. (b) MYBL1 had 1 common methylation site and 90 common phosphorylation sites (Ser:50, Thr:33, Tyr:7), of which 21 were functional sites. (c) MYBL2 had 81 common phosphorylation sites (Ser:48, Thr:31, Tyr:2), of which 24 were functional si.
High risk nsSNPs selected by considering ConSurf and PTM predictions.
| Protein | SNP ID | Mutation | ConSurf | PTM | |
|---|---|---|---|---|---|
| Conservation scorea | Prediction | ||||
| MYB | rs1361650612 | G111S | 9 | e, f | Phosphorylation |
| rs1179275735 | N183S | 9 | e, f | Phosphorylation | |
| rs1247579811 | P574S | 9 | e, f | Phosphorylation | |
| MYBL1 | rs766676175 | G122S | 9 | e, f | Phosphorylation |
| MYBL2 | rs1438994955 | S178C | 9 | e, f | Phosphorylation |
b buried residue, s predicted structural residue (highly conserved and buried), e exposed residue, f predicted functional residue (highly conserved and exposed).
aHighly variable (1), highly conserved (9).
TM-score, RMSD value, GMQE score, QMEAN Z-score, ERRAT value, and PROCHECK Ramachandran plot analysis of the selected protein models.
| Protein | Model | TM-align | SWISS-MODEL | ERRAT | PROCHECK Ramachandran plot analysis | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| TM-scorea | RMSD | GMQE score | QMEAN Z-scoreb | ERRAT value (overall quality factor)c | Residues in most favoured regionsd | Residues in additional allowed regionsd | Residues in generously allowed regionsd | Residues in disallowed regionsd | ||
| MYB | + | Nil | Nil | Nil | Nil | 97.222 | 120 (87.6%) | 17 (12.4%) | 0 (0.0%) | 0 (0.0%) |
| G111S | 1 | 0 | 0.15 | 0.55 | 99.3056 | 122 (88.4%) | 15 (10.9%) | 0 (0.0%) | 1 (0.7%) | |
| N183S | 1 | 0 | 0.16 | 0.72 | 98.6111 | 122 (89.1%) | 15 (10.9%) | 0 (0.0%) | 0 (0.0%) | |
| MYBL1 | + | Nil | Nil | Nil | Nil | 97.9167 | 123 (89.1%) | 15 (10.9%) | 0 (0.0%) | 0 (0.0%) |
| G122S | 1 | 0 | 0.15 | − 0.02 | 98.6014 | 123 (89.1%) | 15 (10.9%) | 0 (0.0%) | 0 (0.0%) | |
| MYBL2 | + | Nil | Nil | Nil | Nil | 96.5278 | 119 (86.2%) | 19 (13.8%) | 0 (0.0%) | 0 (0.0%) |
| S178C | 1 | 0 | 0.16 | 0.19 | 96.5278 | 118 (85.5%) | 20 (14.5%) | 0 (0.0%) | 0 (0.0%) | |
TM-score template modelling-score, RMSD root-mean-square deviation, GMQE global model quality estimation, QMEAN qualitative model energy analysis, “+” Wildtype.
aRandom structural similarity (0.0 < TM-score < 0.30), both structures are within the same fold (0.50 < TM-score < 1.00).
bLow quality model (QMEAN Z-score ≤ − 4.0).
cReliable model (ERRAT value > 85%).
dNumber of residues (percentage of residues).
Figure 2Structural comparison of wild-type MYB family proteins with their mutant forms. (a) 3D model of wild-type MYB protein. (b) Superimposed structures of wild-type MYB protein and its mutant having mutation from Glycine to Serine at position 111. (c) Superimposed structures of wild-type MYB protein and its mutant having mutation from Asparagine to Serine at position 183. (d) 3D model of wild-type MYBL1 protein. (e) Superimposed structures of wild-type MYBL1 protein and its mutant having mutation from Glycine to Serine at position 122. (f) 3D model of wild-type MYBL2 protein. (g) Superimposed structures of wild-type MYBL2 protein and its mutant having mutation from Serine to Cysteine at position 178.This figure was generated using UCSF Chimera 1.15 (https://www.cgl.ucsf.edu/chimera/download.html).
Figure 3Diagrammatic representation of methodology.