| Literature DB >> 26516624 |
Mikhail Ponomarenko1, Dmitry Rasskazov2, Olga Arkova2, Petr Ponomarenko3, Valentin Suslov2, Ludmila Savinkova2, Nikolay Kolchanov1.
Abstract
The use of biomedical SNP markers of diseases can improve effectiveness of treatment. Genotyping of patients with subsequent searching for SNPs more frequent than in norm is the only commonly accepted method for identification of SNP markers within the framework of translational research. The bioinformatics applications aimed at millions of unannotated SNPs of the "1000 Genomes" can make this search for SNP markers more focused and less expensive. We used our Web service involving Fisher's Z-score for candidate SNP markers to find a significant change in a gene's expression. Here we analyzed the change caused by SNPs in the gene's promoter via a change in affinity of the TATA-binding protein for this promoter. We provide examples and discuss how to use this bioinformatics application in the course of practical analysis of unannotated SNPs from the "1000 Genomes" project. Using known biomedical SNP markers, we identified 17 novel candidate SNP markers nearby: rs549858786 (rheumatoid arthritis); rs72661131 (cardiovascular events in rheumatoid arthritis); rs562962093 (stroke); rs563558831 (cyclophosphamide bioactivation); rs55878706 (malaria resistance, leukopenia), rs572527200 (asthma, systemic sclerosis, and psoriasis), rs371045754 (hemophilia B), rs587745372 (cardiovascular events); rs372329931, rs200209906, rs367732974, and rs549591993 (all four: cancer); rs17231520 and rs569033466 (both: atherosclerosis); rs63750953, rs281864525, and rs34166473 (all three: malaria resistance, thalassemia).Entities:
Mesh:
Substances:
Year: 2015 PMID: 26516624 PMCID: PMC4609514 DOI: 10.1155/2015/359835
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Figure 1How to use the Web service SNP_TATA_Comparator [62] to find a significant change in gene expression caused by SNPs of this gene's promoter via a change in affinity of the TATA-binding protein (TBP) for this promoter in the cases of (a) a known biomedical SNP marker and (b) a nearby candidate SNP marker. Solid, dotted, and dashed arrows are the gene, transcript, and sequence lists, respectively, from Ensembl [5] and GENCODE [6] databases of the reference human genome, hg19. Dash-and-dot arrows are an estimate of the statistical significance (Z-score, p value) of deviation of the gene expression in patients carrying minor alleles, relative to the ancestral allele, (1)–(4) and Algorithm 1.
Algorithm 1Known disease-related SNP markers increasing affinity of the TATA-binding protein (TBP) for human gene promoters, their SNP neighbors.
| Gene | RNA | dbSNP [ | SNP | 5′-flank |
| 3′-flank | min versus hg19 | Δ |
|
| Known [reference] diseases or | [Reference], [this work] |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
| #2 | rs1143627 | −31c→t | ttttgaaagc |
| ataaaaacag | 2 versus 5 | ↑ | 15 | 10−7 | Gastric cancer in | [ |
|
| −28a→t | tgaaagccat |
| aaaacagcga |
| ↓ |
|
|
| [This work], [ | ||
|
| ||||||||||||
|
| #1 | rs563763767 | −21c→t | ccctttatag |
| gcgcggggca | 2 versus 3 | ↑ | 6 | 10−7 | Myocardial infarction and venous thromboembolism | [ |
|
| ||||||||||||
|
| #1 | ND, see [ | −51t→c | gtataaatac |
| tcttggctgc | 1 versus 2 | ↑ | 3 | 10−2 | Resistance to malaria, epilepsy risk | [ |
|
| ||||||||||||
|
| #3 | rs10168 | −26g→a | ctgcacaaat |
| gggacgaggg | 9 versus 15 | ↑ | 9 | 10−7 | Resistance to methotrexate therapy for leukemia | [ |
|
| ||||||||||||
|
| #2 | rs10895068 | −26g→a | gggagataaa |
| gagccgcgtg | 6 versus 10 | ↑ | 8 | 10−7 | Endometrial cancer caused by a | [ |
|
|
|
|
|
|
| ↓ |
|
|
| [This work] | ||
|
| ||||||||||||
|
| #2 | ND, see [ | −20a→t | gtcattccag |
| aaagggccac | 13 versus 24 | ↑ | 9 | 10−7 | A healthy Hungarian blood donor participating in a health check-up program | [ |
|
| ||||||||||||
|
| #3 | rs111426889 | −25c→t | gtgctataaa |
| gccgccccct | 2 versus 4 | ↑ | 8 | 10−7 | A healthy individual in the “Control” cohort selected for comparison with the “Autoimmune Diseases” cohort | [ |
Note: N SNP, total number of SNPs processed; RNA, item number of mRNA in GENCODE v.19 [6]; TSS, transcription start site; hg19, ancestral allele; min, minor allele; K , an estimate [55] of the dissociation constant (K ) of the TBP-DNA complex in vitro [50]; ND, not documented; Δ, the expression change in comparison with the norm: overexpression (↑), deficient expression (↓), and norm (=); Z, Z-score; α = 1 − p, significance (p, probability; Figure 1); TF, transcription factor; EMSA, electrophoretic mobility shift assay; CAT, chloramphenicol acetyl transferase activity; LUC, bioluminescence.
Known disease-related SNP markers insignificantly changing TBP affinity for human gene promoters, their SNP neighbors.
| Gene | RNA | dbSNP [ | SNP | 5′-flank |
| 3′-flank | min versus hg19 | Δ |
|
| Known [reference] or hypothetical [this work] diseases | [Reference], [this work] |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
| #2 | rs1394205 | −29g→a | gcaaatgcag |
| aagaaatcag | 7.3 versus 7.3 | = | 0 | >0.05 | No differences between proven fathers and infertile men | [ |
|
| ||||||||||||
|
| #1 | ND, see [ | −48g→c | agctcagctt |
| tactttggta | 6.4 versus 6.4 | = | 0 | >0.05 | Leiden hemophilia B, |
[ |
| ND, see [ | −42t→a | gcttgtactt |
| ggtacaacta | 6.4 versus 6.4 | = | 0 | >0.05 | ||||
|
|
|
|
|
|
| ↓ |
|
|
| [This work] | ||
|
| ||||||||||||
|
| #3 | rs16887226 | −33c→t | cagccttcag |
| gggggacatt | 10.3 versus 10.3 | = | 0 | >0.05 | Hypertensive diabetic patients, | [ |
|
|
|
|
|
|
| ↓ |
|
|
| [This work], [ | ||
|
| ||||||||||||
|
| #1 | rs28399433 | del-50g | aggggccagg |
| tataaaaagg | 1.4 versus 1.5 | = | 1 | >0.05 | Short stature, | [ |
|
| ||||||||||||
|
| #4 | rs1332018 | −49c→a | ccccttatgt |
| gggtataaag | 3.1 versus 3.6 | = | 1.9 | >0.05 | Risk of brain, lung, testicular, and renal cell carcinomas, | [ |
|
|
|
|
|
|
| ↓ |
|
|
| [This work] | ||
|
| ||||||||||||
|
| #1 | rs7586110 | −57t→g | cttcttccac |
| tactatatta | 1.48 versus 1.54 | = | 1 | >0.05 | Oral cancer risk, | [ |
|
|
|
|
|
|
| ↓ |
|
|
| [This work] | ||
|
|
|
|
|
|
| ↓ |
|
| ||||
|
| ||||||||||||
|
| #1 | rs10465885 | −55g→a | caactaagat |
| tattaaacac | 3.1 versus 3.4 | = | 1 | >0.05 | Arrhythmia, cardiovascular events | [ |
| #2 | rs35594137 | −39g→a | gaggagggaa |
| gcgacagata | 5.7 versus 5.7 | = | 0 | >0.05 | Arrhythmia, cardiovascular events | [ | |
|
|
|
|
|
|
| ↓ |
|
|
| [This work] | ||
|
| ||||||||||||
|
| #1 | rs13306848 | −33g→a | agggagggcc |
| ggcacttata | 2.3 versus 2.1 | = | 1 | 10−7 | Thrombophlebitis risk | [ |
|
| ||||||||||||
|
| #201 | rs34983651 | ins-55at | ggtttttgcc |
| atatatatat | 0.65 versus 0.67 | = | 1 | >0.05 | Necessary but not sufficient in hyperbilirubinemia and jaundice | [ |
| rs398048306 | del-51(at)1;2 | ggtttttgcc |
| atatatatat | 0.71 versus 0.67 | = | 1 | >0.05 | Ethnic differences such as | [ | ||
Known disease-related SNP markers decreasing affinity of the TATA-binding protein (TBP) for human gene promoters, their SNP neighbors.
| Gene | RNA | dbSNP [ | SNP | 5′-flank |
| 3′-flank | min versus hg19 | Δ |
|
| Known [reference] or hypothetical [this work] diseases (observations) | [Reference], [this work] |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
| #2 | rs397509430 | del-29t | gggctgggca |
| atacaacagt | 29 versus 5 | ↓ | 34 | 10−7 |
Malaria resistance and |
[ |
| rs33980857 | −29t→a,g,c | gggctgggca |
| atacaacagt | 21 versus 5 | ↓ | 27 | 10−7 | ||||
| rs34598529 | −28a→g | ggctgggcat |
| aaagtcaggg | 18 versus 5 | ↓ | 24 | 10−7 | ||||
| rs33931746 | −27a→g,c | gctgggcata |
| aagtcagggc | 11 versus 5 | ↓ | 14 | 10−7 | ||||
| rs33981098 | −30a→g,c | agggctgggc |
| taaaagtcag | 9 versus 5 | ↓ | 10 | 10−7 | ||||
| rs34500389 | −31c→a,t,g | cagggctggg |
| ataaaagtca | 6 versus 5 | ↓ | 3 | 10−2 | ||||
| ND, see [ | −27a→t | gctgggcata |
| aagtcagggc | 3 versus 5 | ↑ | 8 | 10−2 | Health, well-known so-called “silent SNP” | [ | ||
|
|
|
|
|
|
| ↓ |
|
|
|
[This work], [ | ||
|
|
|
|
|
|
| ↓ |
|
| ||||
|
| ||||||||||||
|
| #3 | rs35518301 | −31a→g | caggaccagc |
| taaaaggcag | 8 versus 4 | ↓ | 11 | 10−7 | Malaria resistance and | [ |
|
|
|
|
|
|
| ↓ |
|
|
| [This work], [ | ||
|
| ||||||||||||
|
| #3 | rs2814778 | −26t→c | ttggctctta |
| cttggaagca | 12 versus 10 | ↓ | 4 | 10−3 | Low white-blood-cell count and resistance to malaria | [ |
|
|
|
|
|
|
| ↓ |
|
|
| [This work] | ||
|
| ||||||||||||
|
| #3 | rs28399433 | −34t→g | tcaggcagta |
| aaaggcaaac | 9 versus 2 | ↓ | 21 | 10−7 | Lower risk of lung cancer in smokers | [ |
|
| ||||||||||||
|
| #1 | rs55999272 | −28t→c | tcctgctata |
| agccccgccg | 5 versus 2 | ↓ | 11 | 10−7 | For “−28 | [ |
|
| ||||||||||||
|
| #1 | rs2276109 | −27a→g | gatatcaact |
| tgagtcactc | 14 versus 11 | ↓ | 3 | 10−2 | Low risk of chronic asthma, systemic sclerosis, and psoriasis | [ |
|
|
|
|
|
|
| ↓ |
|
|
| [This work] | ||
|
| ||||||||||||
|
| #4 | ND, see [ | del-54 | cgtgggggct |
| gggctccagg | 7 versus 4 | ↓ | 7 | 10−7 | Hyperalphalipoproteinemia reduces atherosclerosis risk | [ |
|
|
|
|
|
|
|
|
|
|
|
[This work], [ | ||
|
|
|
|
|
|
|
|
|
| ||||
|
| ||||||||||||
|
| #1 | rs34223104 | −28t→c | gatgaaattt |
| ataacagggt | 10 versus 4 | ↓ | 15 | 10−7 | Better bioactivation of anticancer prodrug cyclophosphamide | [ |
|
|
|
|
|
|
| ↓ |
|
|
| [This work] | ||
|
| ||||||||||||
|
| #4 | rs7277748 | −32a→g | ggtctggcct |
| taaagtagtc | 7 versus 2 | ↓ | 17 | 10−7 | Familial amyotrophic lateral sclerosis | [ |
|
| ||||||||||||
|
| #201 | rs1800202 | −24t→g | gcgctctata |
| aagtgggcag | 4 versus 1 | ↓ | 17 | 10−7 | Hemolytic anemia and neuromuscular diseases | [ |
|
| ||||||||||||
|
| #1 | rs35036378 | −43t→g | cctctcggtc |
| ttaaaaggaa | 8 versus 6 | ↓ | 5 | 10−3 | ESR2-low pT1 tumor | [ |
|
| ||||||||||||
|
| #2 | rs201739205 | −36a→c | aggtgatatc |
| agcccagagc | 18 versus 13 | ↓ | 5 | 10−3 | Breast cancer | [ |
|
| ||||||||||||
|
| #1 | rs72661131 | −39t→c | tctatttcta |
| atagcctgca | 4 versus 2 | ↓ | 12 | 10−7 | Variable immunodeficiency, stroke, and preeclampsia | [ |
|
|
|
|
|
|
| ↓ |
|
|
| [This work] | ||
|
|
|
|
|
|
|
|
|
|
| [This work], [ | ||
|
| ||||||||||||
|
| #3 | rs17537595 | −36t→c | gctgctgtta |
| atacaacaga | 3 versus 1 | ↓ | 13 | 10−7 | Esophageal cancer | [ |
|
|
|
|
|
|
| ↓ |
|
|
| [This work] | ||
|
| ||||||||||||
|
| #3 | ND, see [ | −35a→c | tgcagacata |
| ataggccctg | 4 versus 3 | ↓ | 5 | 10−3 | Hematuria, fatty liver, obesity | [ |
|
| ||||||||||||
|
| #1 | ND, see [ | −33a→c | ccttggaggc |
| gagaactttg | 62 versus 53 | ↓ | 3 | 10−2 | Moderate bleeding tendency | [ |
|
|
|
|
|
|
|
|
|
|
|
[This work], [ | ||
|
|
|
|
|
|
|
|
|
| ||||
Note: hereinafter, can be seen under Table 1.