| Literature DB >> 20981267 |
Raphael D Isokpehi1, Hari H P Cohly, Matthew N Anyanwu, Rajendram V Rajnarayanan, Paul B Tchounwou, Udensi K Udensi, Barbara E Graham-Evans.
Abstract
Arsenic is a toxic metalloid that causes skin cancer and binds to cysteine residues-a property that could be used to infer arsenic responsiveness of a target protein. Non-synonymous Single Nucleotide Polymorphisms (nsSNPs) result in amino acid substitutions and may alter arsenic binding with cysteine residues. Thus, the objective of this investigation was to identify and analyze nsSNPs that lead to substitutions to or from cysteine residues as an indication of increased or decreased arsenic responsiveness. We hypothesize that integration of data on molecular impacts of nsSNPs and arsenic-gene relationships will identify nsSNPs that could serve as arsenic responsiveness markers. We have analyzed functional and structural impacts data for 5,811 nsSNPs linked to 1,224 arsenic-annotated genes. In addition to the identified candidate nsSNPs for increased or reduced arsenic responsiveness, we observed i) a nsSNP that results in the breakage of a disulfide bond, as candidate marker for reduced arsenic responsiveness of KLK7, a secreted serine protease participate in normal shedding of the skin; and ii) 6 pairs of vicinal cysteines in KLK7 protein that could be binding sites for arsenic. In summary, our analysis identified non-synonymous SNPs that could be used to evaluate responsiveness of a protein target to arsenic. In particular, an epidermal expressed serine protease with crucial function in normal skin physiology was prioritized on the basis of abundance of vicinal cysteines for further research on arsenic-induced keratinocyte carcinogenesis.Entities:
Keywords: arsenic; keratinocytes; non-synonymous single nucleotide polymorphisms; skin cancer; toxicogenomics; vicinal cysteines
Year: 2010 PMID: 20981267 PMCID: PMC2964045 DOI: 10.4137/BBI.S5498
Source DB: PubMed Journal: Bioinform Biol Insights ISSN: 1177-9322
Figure 1.Screenshot of a SNPs3D page for a gene. The functional and structural impacts, molecular effect and frequency of non-synonymous SNPs associated with the protein isoforms (RefSeq accession) is documented on the page. The negative SVM score (value in red) indicates a deleterious substitution.
Figure 2.Plot of frequencies of unique Support Vector Machine (SVM) scores for dataset of non-synonymous SNPs linked to arsenic-annotated genes. The SVM score predicted for each nsSNP substitution was extracted from the SNPs3D page.
Summary of datasets.
| Arsenic-annotated genes Retrieved from CTD | 1,604 |
| Arsenic-annotated genes with nsSNPs in SNPs3D | 1,224 |
| Non-synonymous SNPs | 5,811 |
| Arsenic-annotated protein isoforms | 1,872 |
| Amino acid substitutions | 8,992 |
| Deleterious substitutions | 3,700 |
| Non-deleterious substitutions | 5,292 |
| High confidence deleterious substitutions | 2,739 |
| High confidence non-deleterious substitutions | 4,191 |
SVM sequence profile score:
SVM <−0.5;
SVM >0.5
Distribution of SVM scores for nsSNP substitutions.
| 0.0–0.5 | 1083 | 20.46% | 977 | 26.41% |
| 0.5–1.0 | 1186 | 22.41% | 831 | 22.46% |
| 1.0–1.5 | 1170 | 22.11% | 627 | 16.95% |
| 1.5–2.0 | 826 | 15.61% | 448 | 12.11% |
| 2.0–2.5 | 560 | 10.58% | 348 | 9.41% |
| 2.5–3.0 | 246 | 4.65% | 255 | 6.89% |
| 3.0–3.5 | 126 | 2.38% | 132 | 3.57% |
| 3.5–4.0 | 49 | 0.93% | 63 | 1.70% |
| 4.0–4.5 | 25 | 0.47% | 18 | 0.49% |
| 4.5–5.0 | 12 | 0.23% | 1 | 0.03% |
| 5.0–5.5 | 2 | 0.04% | ||
| 5.5–6.0 | 3 | 0.06% | ||
| 6.0–6.5 | 3 | 0.06% | ||
| 6.5–7.0 | 1 | 0.02% | ||
Note:
Absolute values.
Categories of nsSNPs observed in analysis of arsenic-annotated genes.
| From Cys, deleterious | 45 | 53 | 67 |
| From Cys, non-deleterious | 36 | 39 | 58 |
| To Cys, deleterious | 101 | 130 | 145 |
| To Cys, non-deleterious | 63 | 72 | 108 |
Candidate nsSNPs that increase protein’s arsenic responsiveness through amino acid change to cysteine.
| 834 | CASP1 | 3203613 | NP_150635 | S33C | 2.76 | 1.33 | PS |
| 834 | CASP1 | 3203613 | NP_150634 | S126C | 2.36 | 1.33 | PS |
| 11200 | CHEK2 | 28909981 | NP_665861 | S471C | 2.20 | 1.13 | PS; HBL |
| 834 | CASP1 | 3203613 | NP_150636 | S33C | 1.99 | 1.33 | PS |
| 8644 | AKR1C3 | 35575889 | NP_003730 | R170C | 1.82 | 1.59 | |
| 55713 | ZNF334 | 41283032 | NP_955473 | R237C | 1.71 | 1.08 | PS |
| 55713 | ZNF334 | 41283032 | NP_060572 | R275C | 1.68 | 1.08 | PS |
| 6389 | SDHA | 1041948 | NP_004159 | S346C | 1.62 | 0.92 | HBL |
| 2877 | GPX2 | 17880492 | NP_002074 | R146C | 1.61 | 0.84 | SBL |
| 983 | CDC2 | 8755 | NP_203698 | R59C | 1.53 | 1.22 | PS |
| 10935 | PRDX3 | 11554910 | NP_054817 | Y53C | 1.19 | 0.99 | PS |
| 10935 | PRDX3 | 11554910 | NP_006784 | Y71C | 1.14 | 0.99 | PS |
| 5265 | SERPINA1 | 1802962 | NP_000286 | S325C | 0.88 | 1.37 | PS |
| 5265 | SERPINA1 | 1802962 | NP_001002235 | S325C | 0.88 | 1.37 | PS |
| 5265 | SERPINA1 | 1802962 | NP_001002236 | S325C | 0.88 | 1.37 | PS |
| 983 | CDC2 | 8755 | NP_001777 | R59C | 0.81 | 1.22 | PS |
| 4233 | MET | 34589476 | NP_000236 | R970C | 0.77 | 0.99 | PS |
Abbreviations: PS, On the protein surface; HBL, hydrogen bond lost; SBL, saltbridge lost.
Candidate nsSNPs that reduce protein’s arsenic responsiveness through amino acid change from cysteine.
| 23660 | ZKSCAN5 | 28411998 | NP_055384 | C579W | −3.63 | −0.84 | BP; OP |
| 23660 | ZKSCAN5 | 28411998 | NP_659570 | C579W | −3.63 | −0.84 | BP; OP |
| 7465 | WEE1 | 17854721 | NP_003381 | C379R | −3.62 | −1.18 | |
| 9753 | ZSCAN12 | 2232432 | NP_001034732 | C332R | −3.59 | −1.11 | ESR |
| 9446 | GSTO1 | 45529437 | NP_004823 | C32Y | −3.53 | −1.20 | OP |
| 5650 | KLK7 | 17855561 | NP_644806 | C226W | −2.44 | −1.20 | OP; BDB |
| 5650 | KLK7 | 17855561 | NP_005037 | C226W | −2.43 | −1.20 | OP; BDB |
| 5879 | RAC1 | 7673785 | NP_008839 | C157R | −2.06 | −1.24 | BC; OP |
| 3945 | LDHB | 3575 | NP_002291 | C294Y | −1.83 | −1.05 | BP; OP |
| 5879 | RAC1 | 7673785 | NP_061485 | C176R | −1.77 | −1.52 | OP |
| 5265 | SERPINA1 | 8350 | NP_000286 | C256W | −1.76 | −0.51 | BP |
| 5265 | SERPINA1 | 8350 | NP_001002235 | C256W | −1.76 | −0.51 | BP |
| 5265 | SERPINA1 | 8350 | NP_001002236 | C256W | −1.76 | −0.51 | BP |
| 2868 | GRK4 | 35824641 | NP_892027 | C215R | −1.66 | −0.79 | ESR; OP |
| 2868 | GRK4 | 35824641 | NP_001004057 | C215R | −1.55 | −0.79 | ESR; OP |
| 835 | CASP2 | 11551881 | NP_116764 | C370F | −1.54 | −0.99 | OP |
| 2868 | GRK4 | 35824641 | NP_001004056 | C183R | −1.51 | −0.80 | ESR; OP |
| 10465 | PPIH | 11550298 | NP_006338 | C131R | −1.28 | −1.41 | BC; ESR; OP |
| 3107 | HLA-C | 41563216 | NP_002108 | C125R | −1.15 | −1.28 | BC; OP; BDB |
| 3107 | HLA-C | 41543517 | NP_002108 | C188W | −0.81 | −0.86 | BP; OP; BDB |
Abbreviations: BP, buriedpolar; BC, buriedcharged; BDB, breakage of a disulfide bond; ESR, electrostaticrepulsion; OP, overpacking; SBL, saltbridge lost.
Non-synonymous SNPs that predict potential region of arsenic-binding to vicinal cysteines.
| FST | 1127760 | NP_006341 | C239S |
| FST | 1127760 | NP_037541 | C239S |
| HLA-C | 41563216 | NP_002108 | C125R |
| HLA-C | 41543517 | NP_002108 | C188W |
| HLA-C | 41562916 | NP_002108 | C188F |
| IL4 | 4986964 | NP_000580 | C27R |
| IL4 | 4986964 | NP_758858 | C27R |
| KLK7 | 17855561 | NP_005037 | C226W |
| KLK7 | 17855561 | NP_644806 | C226W |
| TFRC | 9852079 | NP_003225 | C363W |
| TLR4 | 2770145 | NP_612564 | C306W |
Figure 3.Screenshot of predicted structural impact of nsSNP rs17855561 on human kallikrein-7 preproprotein (NP_005037). The nsSNP results in steric strain (over-packing) and breakage of a disulfide bond by changing Cysteine (C) residue in position 226 to a Tryptophan (W).
Figure 4.Predicted structure of human kallikrein-7 preproprotein (NP_005037). The vicinal cysteine pairs are 36–165; 55–71; 137–239; 144–211; 176–190; 201–226. The homology structure shows that the Cys201 pairs with Cys226. Arsenic is known to reduce the expression of KLK7 in an epidermal cell line.
Cluster of sentences from PubMed abstracts on GSTO1 polymorphism.
| 18216717_10 | The percent of inorganic arsenic in the urine of 205 Chilean participants showed a bimodal distribution that was not associated with the Ala140Asp, Glu155del or Ala236Val polymorphisms in GSTO1-1. |
| 19686770_5 | Genotyping of CYP2E1, GSTO1 and GSTO2 was determined by polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP). |
| 16638819_4 | We identified 31 and 66 polymorphisms in GSTO1 and GSTO2, respectively, with four nonsynonymous-coding single nucleotide polymorphisms (cSNPs) in each gene. |
| 12928150_5 | We screened two genes responsible for arsenic metabolism, human purine nucleoside phosphorylase (hNP), which functions as an arsenate reductase converting arsenate to arsenite, and human glutathione S-transferase omega 1-1 (hGSTO1-1), which functions as a monomethylarsonic acid (MMA) reductase, converting MMA(V) to MMA(III), to develop a comprehensive catalog of commonly occurring genetic polymorphisms in these genes. |
| 12928150_9 | In hGSTO1-1, 33 polymorphisms were observed. |
| 12928150_11 | In contrast to hNP, in which the IA group was more polymorphic than the EA group, in hGSTO1-1 the EA group was more polymorphic than the IA group, which had only 1 polymorphism with a frequency >10%. |
| 19635583_4 | This study was conducted to investigate the relationship between polymorphisms in the GSTO1 and GSTO2 genes and arsenic metabolism and oxidative stress status in Chinese populations chronically exposed to different levels of arsenic in drinking water. |
| 19635583_5 | Two polymorphisms (GSTO1 |
| 19635583_8 | Multivariate analysis revealed that there was no association between the urinary profile or oxidative stress status and the polymorphism of GSTO1 |
| 19635583_9 | Collectively, polymorphisms in GSTO1 or GSTO2 do not appear to contribute to the large individual variability in arsenic metabolism or susceptibility to arsenicosis. |
| 14680363_3 | To understand this variability, we studied the relationship between polymorphisms in the gene for human monomethylarsonic acid (MMA(V)) reductase/hGSTO1 and the urinary arsenic profiles of individuals chronically exposed to arsenic in their drinking water. |
| 14680363_9 | These two subjects had the same unique polymorphisms in hGSTO1 in that they were heterozygous for E155del and Glu208 Lys. |
| 18414634_9 | RESULTS: Among four candidate genes, PNP, As3MT, GSTO1, and GSTO2, we found that distribution of three exonic polymorphisms, His20His, Gly51Ser, and Pro57Pro of PNP, was associated with arsenicism. |
Note:
Sentence identifier consists of PubMed identifier (PMID) and the location of the sentence in the abstract with abstract title as the first sentence.