| Literature DB >> 31971949 |
Mirza Jawad Ul Hasnain1, Muhammad Shoaib2, Salman Qadri3, Bakhtawar Afzal4, Tehreem Anwar1, Syed Hassan Abbas1, Amina Sarwar1, Hafiz Muhammad Talha Malik5, Muhammad Tariq Pervez1.
Abstract
Single Nucleotide Polymorphisms (SNPs) are the most common candidate mutations in human beings that play a vital role in the genetic basis of certain diseases. Previous studies revealed that Solute Carrier Family 26 Member 4 (SLC26A4) being an essential gene of the multi-faceted transporter family SLC26 facilitates reflexive movement of Iodide into follicular lumen through apical membrane of thyrocyte. SLC26A4 gene encodes Pendred protein, a membrane glycoprotein, highly hydrophobic in nature, present at the apical membrane of thyrocyte functioning as transporter of iodide for thyroid cells. A minor genetic variation in SLC26A4 can cause Pendred syndrome, a syndrome associated with thyroid glands and deafness. In this study, we performed in-silico analysis of 674 missense SNPs of SLC26A4 using different computational platforms. The bunch of tools including SNPNEXUS, SNAP-2, PhD-SNP, SNPs&GO, I-Mutant, ConSurf, and ModPred were used to predict 23 highly confident damaging and disease causing nsSNPs (G209V, G197R, L458P, S427P, Q101P, W472R, N392Y, V359E, R409C, Q235R, R409P, G139V, G497S, H723R, D87G, Y127H, F667C, G334A, G95R, S427C, R291W, Q383H and E384G) that could potentially alter the SLC26A4 gene. Moreover, protein structure prediction, protein-ligand docking and Molecular Dynamics simulation were performed to confirm the impact of two evident alterations (Y127H and G334A) on the protein structure and function.Entities:
Year: 2020 PMID: 31971949 PMCID: PMC6977751 DOI: 10.1371/journal.pone.0225368
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Schematic representation of the overall schema followed in the study.
Fig 2Distribution of SNPs.
Impacts of changes in SLC26A4 gene due to amino acid substitution predicted by SNPNEXUS and SNAP2.
| dbSNP | Variant | Protein | Mutation | SIFT Score | Prediction | PolyPhen Score | Prediction | SNAP2 | Expected Accuracy % |
|---|---|---|---|---|---|---|---|---|---|
| rs111033303 | G|T | ENSP00000265715 | G209V | 0 | deleterious | 1 | probably damaging | effect | 95 |
| rs111033380 | G|A | ENSP00000265715 | G197R | 0 | deleterious | 1 | probably damaging | effect | 95 |
| rs773076588 | T|C | ENSP00000265715 | L458P | 0 | deleterious | 1 | probably damaging | effect | 85 |
| rs758015694 | T|C | ENSP00000265715 | S427P | 0 | deleterious | 0.999 | probably damaging | effect | 85 |
| rs529182720 | A|C | ENSP00000265715 | Q101P | 0 | deleterious | 0.999 | probably damaging | effect | 91 |
| rs373738509 | T|C | ENSP00000265715 | W472R | 0 | deleterious | 0.999 | probably damaging | effect | 91 |
| rs201562855 | A|T | ENSP00000265715 | N392Y | 0 | deleterious | 1 | probably damaging | effect | 95 |
| rs776063528 | T|A | ENSP00000265715 | V359E | 0 | deleterious | 0.999 | probably damaging | effect | 80 |
| rs147952620 | C|T | ENSP00000265715 | R409C | 0 | deleterious | 1 | probably damaging | effect | 95 |
| rs752485540 | A|G | ENSP00000265715 | Q235R | 0 | deleterious | 1 | probably damaging | effect | 91 |
| rs111033305 | G|C | ENSP00000265715 | R409P | 0 | deleterious | 1 | probably damaging | effect | 95 |
| rs756272252 | G|T | ENSP00000265715 | G139V | 0 | deleterious | 1 | probably damaging | effect | 80 |
| rs111033308 | G|A | ENSP00000265715 | G497S | 0 | deleterious | 1 | probably damaging | effect | 85 |
| rs121908362 | A|G | ENSP00000265715 | H723R | 0 | deleterious | 1 | probably damaging | effect | 85 |
| rs111033344 | A|G | ENSP00000394760 | D87G | 0 | deleterious | 1 | probably damaging | effect | 95 |
| rs773173756 | T|C | ENSP00000265715 | Y127H | 0 | deleterious | 1 | probably damaging | effect | 85 |
| rs121908360 | T|G | ENSP00000265715 | F667C | 0 | deleterious | 1 | probably damaging | effect | 85 |
| rs759264261 | G|T | ENSP00000265715 | G334A | 0 | deleterious | 1 | probably damaging | effect | 85 |
| rs111033398 | G|A | ENSP00000265715 | G95R | 0 | deleterious | 1 | probably damaging | effect | 95 |
| rs765939287 | C|G | ENSP00000265715 | S427C | 0 | deleterious | 0.999 | probably damaging | effect | 57 |
| rs775610413 | C|T | ENSP00000265715 | R291W | 0 | deleterious | 0.999 | probably damaging | effect | 95 |
| rs565382433 | G|C | ENSP00000265715 | Q383H | 0 | deleterious | 1 | probably damaging | effect | 75 |
| rs111033244 | A|G | ENSP00000394760 | E384G | 0 | deleterious | 1 | probably damaging | effect | 95 |
Disease association of the SNPs as predicted by SNP & GO and PhD-SN.
| dbSNP | Variant | Mutation | PhD-SNP | Prediction | SNP&GO | Prediction |
|---|---|---|---|---|---|---|
| rs111033303 | G|T | G209V | 0.955 | Disease | 0.87 | Disease |
| rs111033380 | G|A | G197R | 0.953 | Disease | 0.834 | Disease |
| rs773076588 | T|C | L458P | 0.926 | Disease | 0.881 | Disease |
| rs758015694 | T|C | S427P | 0.922 | Disease | 0.926 | Disease |
| rs529182720 | A|C | Q101P | 0.911 | Disease | 0.841 | Disease |
| rs373738509 | T|C | W472R | 0.904 | Disease | 0.828 | Disease |
| rs201562855 | A|T | N392Y | 0.899 | Disease | 0.87 | Disease |
| rs776063528 | T|A | V359E | 0.892 | Disease | 0.911 | Disease |
| rs147952620 | C|T | R409C | 0.882 | Disease | 0.95 | Disease |
| rs752485540 | A|G | Q235R | 0.879 | Disease | 0.872 | Disease |
| rs111033305 | G|C | R409P | 0.876 | Disease | 0.802 | Disease |
| rs756272252 | G|T | G139V | 0.87 | Disease | 0.812 | Disease |
| rs111033308 | G|A | G497S | 0.865 | Disease | 0.801 | Disease |
| rs121908362 | A|G | H723R | 0.856 | Disease | 0.855 | Disease |
| rs111033344 | A|G | D87G | 0.855 | Disease | 0.864 | Disease |
| rs773173756 | T|C | Y127H | 0.846 | Disease | 0.885 | Disease |
| rs121908360 | T|G | F667C | 0.839 | Disease | 0.979 | Disease |
| rs759264261 | G|T | G334A | 0.839 | Disease | 0.862 | Disease |
| rs111033398 | G|A | G95R | 0.836 | Disease | 0.852 | Disease |
| rs765939287 | C|G | S427C | 0.807 | Disease | 0.84 | Disease |
| rs775610413 | C|T | R291W | 0.805 | Disease | 0.83 | Disease |
| rs565382433 | G|C | Q383H | 0.804 | Disease | 0.81 | Disease |
| rs111033244 | A|G | E384G | 0.801 | Disease | 0.80 | Disease |
Effects of SNPs on the Protein Stability, Increasing or Decreasing.
| dbSNP | Protein | Mutation | I-Mutant | Mu-Pro |
|---|---|---|---|---|
| rs111033303 | ENSP00000265715 | G209V | Increase | Decrease |
| rs111033380 | ENSP00000265715 | G197R | Decrease | Decrease |
| rs773076588 | ENSP00000265715 | L458P | Decrease | Decrease |
| rs758015694 | ENSP00000265715 | S427P | Decrease | Decrease |
| rs529182720 | ENSP00000265715 | Q101P | Decrease | Decrease |
| rs373738509 | ENSP00000265715 | W472R | Decrease | Decrease |
| rs201562855 | ENSP00000265715 | N392Y | Decrease | Decrease |
| rs776063528 | ENSP00000265715 | V359E | Decrease | Decrease |
| rs147952620 | ENSP00000265715 | R409C | Decrease | Decrease |
| rs752485540 | ENSP00000265715 | Q235R | Decrease | Decrease |
| rs111033305 | ENSP00000265715 | R409P | Decrease | Decrease |
| rs756272252 | ENSP00000265715 | G139V | Decrease | Decrease |
| rs111033308 | ENSP00000265715 | G497S | Decrease | Decrease |
| rs121908362 | ENSP00000265715 | H723R | Decrease | Decrease |
| rs111033344 | ENSP00000394760 | D87G | Decrease | Decrease |
| rs773173756 | ENSP00000265715 | Y127H | Decrease | Decrease |
| rs121908360 | ENSP00000265715 | F667C | Decrease | Decrease |
| rs759264261 | ENSP00000265715 | G334A | Decrease | Decrease |
| rs111033398 | ENSP00000265715 | G95R | Decrease | Decrease |
| rs765939287 | ENSP00000265715 | S427C | Decrease | Decrease |
| rs775610413 | ENSP00000265715 | R291W | Decrease | Decrease |
| rs565382433 | ENSP00000265715 | Q383H | Decrease | Decrease |
| rs111033244 | ENSP00000394760 | E384G | Decrease | Decrease |
Detailed summary of most important SNPs identified.
| dbSNP | Mutation | Functional Consequence | Disease Association | Phylogenetics | PTMs | |||
|---|---|---|---|---|---|---|---|---|
| SIFT | PolyPhen | SNAP2 | SNP& | PhD-SNP | ConSurf | ModPred | ||
| rs111033303 | G209V | 9,9,b,s | - | |||||
| rs111033380 | G197R | 9,9,b,s | - | |||||
| rs773076588 | L458P | 7,6,b | - | |||||
| rs758015694 | S427P | 8,8,b | O-linked glycosylation | |||||
| rs529182720 | Q101P | 9,9,e,f | - | |||||
| rs373738509 | W472R | 3,1,e | - | |||||
| rs201562855 | N392Y | 9,9,b,s | - | |||||
| rs776063528 | V359E | 9,9,b,s | - | |||||
| rs147952620 | R409C | 9,9,e,f | Proteolytic cleavage | |||||
| rs752485540 | Q235R | 9,9,e,f | - | |||||
| rs111033305 | R409P | 9,9,e,f | Proteolytic cleavage | |||||
| rs756272252 | G139V | 9,9,b,s | - | |||||
| rs111033308 | G497S | 8,7,b | - | |||||
| rs121908362 | H723R | 8,8,b | - | |||||
| rs111033344 | D87G | 9,9,b,s | Proteolytic cleavage | |||||
| rs773173756 | Y127H | 9,9,b,s | Proteolytic cleavage | |||||
| rs121908360 | F667C | 6,5,b | - | |||||
| rs759264261 | G334A | 9,9,e,f | Proteolytic cleavage | |||||
| rs111033398 | G95R | 9,9,b,s | - | |||||
| rs765939287 | S427C | 8,8,b | O-linked glycosylation | |||||
| rs775610413 | R291W | 9,9,e,f | Proteolytic cleavage | |||||
| rs565382433 | Q383H | 9,9,e,f | - | |||||
| rs111033244 | E384G | 9,9,e,f | - | |||||
functional consequences, disease association, PTMs, phylogenetic conservation and minor allele frequency.
(*deleterious
** disease associated, b-buried, e-exposed, f-functional, s-structural).
Fig 3ConSurf results for residues conservation.
Colors of ConSurf results showing the level of confidence for the sequence conservation where sky-blue color indicates variables, and dark purple color indicates highly conserved residues.
Fig 4Model structure of SLC26A4.
(Pendrin) protein containing 780 a.a. with solid ribbon display style (red, blue and green color: alpha helix, beta sheets and coil, respectively).
Fig 5Structure superimposition.
Superimposition of native and mutant structures for the two hotspot mutations; native with mutant-1 at point G334A, native with mutant-2 at point Y127H.
The binding sites predicted at different amino acid residues along with their position and binding affinities.
| Position | Amino Acid | Binding Affinity |
|---|---|---|
| 2 | ALA | -0.258 |
| 4 | PRO | -0.397 |
| 320 | GLU | -0.307 |
| 321 | LYS | -0.026 |
| 373 | THR | -0.039 |
| 560 | ASP | 0.214 |
| 564 | LYS | -0.044 |
| 586 | ARG | 0.015 |
| 590 | LYS | -0.114 |
| 622 | GLU | -0.053 |
| 626 | GLU | 0.058 |
| 674 | ARG | 0.062 |
| 704 | GLU | -0.392 |
| 707 | GLY | -0.177 |
Fig 6Graphical depiction of protein ligand docking: Native, mutant-Y127H and mutant-G334A protein structures with ligands DMU, MG, 4KU, BCT, XAN show the interactive deviations of mutant1 and mutant 2 from the native structure.
Interacting residues obtained from docking.
Protein structures Wild, MutantY127H and MutantG334A with ligands 4KU, BCT, DMU, MG and XAN including their binding residues and surrounding hydrophobic interactions.
| Receptor—Ligand | Hydrogen Bond Interactions | Hydrophobic Interactions | |
|---|---|---|---|
| Receptor | Ligand | ||
| Wild- 4KU | Glu701, Ile713, Lys715, Asp716 | 4KU | Asp697, Ile700, Phe719, Leu763, Arg719, Asp711, Asp710 |
| MutantY127H – 4KU | Ser28, Leu13, Pro11 | 4KU | Gly5, Glu29, Ala31, Leu30, Tyr27, Pro14, Gln12, Cys54, Gly6 |
| MutantG334A – 4KU | Ile71, Gln41, Lys66 | 4KU | Arg47, Glu42, Pro76, Leu72, Trp74, Lys419, Tyr377. |
| Wild- BCT | Gln383, Arg134, Tyr377 | BCT | Thr132 |
| MutantY127H –BCT | Arg39, Phe128, Gly131 | BCT | Phe130, Ile129, Trp74, Gln421 |
| MutantG334A –BCT | Asn712, Asp710, Asp711. | BCT | Phe708, Asp495, Leu492, Leu496. |
| Wild- DMU | Leu108, Ser234, Ser154 | DMU | Ala451, Lys447, Gln235, Ile238, Leu107, Lys237, Leu153, Ser150, Gly149, Gln230 |
| MutantY127H –DMU | Phe719, Arg674, Arg677 | DMU | Phe719, Leu727, Val673, Val670, Ile487, Leu492, Lys715, Val488, Phe709, Arg714, Ile491. |
| MutantG334A –DMU | Leu146, Gln230 | DMU | Ile713, Gly493, Phe719, Phe708, Leu108, Ser234, Lys237, Leu153, Arg714, Ser150, Asn712, Cys706, Leu492, Gly707, Ala351, Asp495. |
| Wild- MG | Gln732, Tyr728, Arg674 | MG | ----- |
| MutantY127H –MG | Ile713, Asn712, Asp710 | MG | ----- |
| MutantG334A –MG | Asp711, Asp710, Asn712, Ile713 | MG | ----- |
| Wild- XAN | Glu625, Glu622, Asp573 | XAN | Leu627, Arg576, Ala574 |
| MutantY127H –XAN | Glu73, Gln383, Gly131, Arg39 | XAN | Gln41, Ile129, Trp74, Gln421, Phe128 |
| MutantG334A –XAN | Gln705 | XAN | Phe709, Phe718, His723, Ile700, Glu704, Leu703. |
Fig 7Temperature, pressure and density versus time graph of wildtype and mutants of SLC simulated through GROMACS.
Wildtype is shown in blue, G334A mutant is shown in grey and Y127H is shown in orange.
Fig 8RMSD and RMSF graphs of backbone atoms of wildtype and mutants of SLC simulated through GROMACS version 5.1.2.
Wildtype is shown in black, G334A mutant is shown in red and Y127H is shown in green.
Fig 9A) Rg of Cα atoms of native and mutant type SLC protein. B) Solvent accessible surface area of wildtype and mutant type SLC protein Wildtype is shown in black, G334A mutant is shown in red and Y127H is shown in green.