| Literature DB >> 26043787 |
Yul Kim, Chiyong Kang, Bumki Min, Gwan-Su Yi.
Abstract
Post-translational modification (PTM) plays a crucial role in biological functions and corresponding disease developments. Discovering disease-associated non-synonymous SNPs (nsSNPs) altering PTM sites can help to estimate the various PTM candidates involved in diseases, therefore, an integrated analysis between SNPs, PTMs and diseases is necessary. However, only a few types of PTMs affected by nsSNPs have been studied without considering disease-association until now. In this study, we developed a new database called PTM-SNP which contains a comprehensive collection of human nsSNPs that affect PTM sites, together with disease information. Total 179,325 PTM-SNPs were collected by aligning missense SNPs and stop-gain SNPs on PTM sites (position 0) or their flanking region (position -7 to 7). Disease-associated SNPs from GWAS catalogs were also matched with detected PTM-SNP to find disease associated PTM-SNPs. Our result shows PTM-SNPs are highly associated with diseases, compared with other nsSNP sites and functional classes including near gene, intron and so on. PTM-SNP can provide an insight about discovering important PTMs involved in the diseases easily through the web site. PTM-SNP is freely available at http://gcode.kaist.ac.kr/ptmsnp.Entities:
Mesh:
Year: 2015 PMID: 26043787 PMCID: PMC4460713 DOI: 10.1186/1755-8794-8-S2-S7
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Figure 1Detection of a PTM-SNP by alignment of a PTM site sequence and a SNP site sequence. Start from -7 to 7 residues of PTM site sequences, each residue is matched with nsSNP site of the SNP sequence by sliding one-by-one. From the best aligned position of SNP sequence, the location of PTM-SNP is identified as the location of the PTM site sequence that is aligned with the nsSNP.
PTM-SNP statitsics based on SNP locations
| Post-translational Modification | dbPTM-dbSNP matching | PhosSNP | PTM-SNP | ||
|---|---|---|---|---|---|
| Experimental | Predicted | Experimental | Predicted | ||
| On the PTM site | 4,385 | 5,098 | 172 | 16,954 | 26,100 |
| On the flank of the PTM site | 52,249 | 76,654 | 1,836 | 59,340 | 163,254 |
| Stop-gain affected PTM site | 7,757 | 10,432 | 48 | 442 | 11,940 |
| 60,380 | 88,181 | 2,004 | 6,4035 | 179,325 | |
Statistics of PTM Site based on PTM types with the Top 10 ratio
| Post-translational Modification | Number of PTM sites | Number of PTM Sites of PTM-SNPs | Ratio |
|---|---|---|---|
| Disulfide bond | 1,750 | 1,230 | 0.703 |
| Ubiquitination | 34,507 | 21,824 | 0.633 |
| Proteolytic Cleavage | 1,569 | 987 | 0.629 |
| Acetylation | 11,612 | 6,760 | 0.582 |
| Phosphorylation | 355,203 | 203,830 | 0.574 |
| Sumorylation | 1,180 | 673 | 0.570 |
| S-nitrosylation | 1,286 | 695 | 0.540 |
| N-linked Glycosylation | 65,121 | 21,215 | 0.326 |
| Methylation | 6,166 | 1,761 | 0.286 |
| Sulfation | 8,614 | 2,433 | 0.282 |
| O-linked Glycosylation | 91,678 | 21,335 | 0.233 |
| Other PTMs | 20,696 | 7,670 | 0.371 |
| Total | 599,382 | 290,413 | 0.485 |
Statistics of disease-associated SNPs based on Functional Categories
| Functional classification | Number of SNPs | Number of Disease-associated SNPs | Coverage Ratio | P-value |
|---|---|---|---|---|
| Missense & Stop-gain (PTM-SNP) | 179,325 | 186 | 0.00114 | 2.72E-67 |
| NearGene-5 | 744,086 | 251 | 0.00033 | 7.59E-13 |
| UTR-3: MirSNP | 414,510 | 151 | 0.00036 | 2.76E-10 |
| UTR-3 | 513,249 | 170 | 0.00033 | 1.61E-08 |
| NearGene-3 | 189,105 | 71 | 0.00037 | 2.54E-06 |
| UTR-5 | 80,250 | 35 | 0.00043 | 3.33E-05 |
| Cds-synon | 312,479 | 100 | 0.00032 | 4.54E-05 |
| Missense & Stop-gain (Non PTM-SNP) | 361,401 | 95 | 0.00026 | 0.014 |
| Intergenic | 20,492,263 | 4,357 | 0.00021 | 0.136 |
| Frameshift | 30,578 | 5 | 0.00016 | 0.768 |
| Intron | 19,248,959 | 3,678 | 0.00019 | 0.999 |
| 41,740,143 | 8,771 | 0.00021 | - | |
Statistics of disease-associated SNPs based on PTMs
| Post-translational Modification | Number of Disease Associated PTM-SNP | Number pf Total PTM-SNP | P-value |
|---|---|---|---|
| Phosphorylation | 179 | 165489 | 0.021 |
| Proteolytic Cleavage | 2 | 1145 | 0.333 |
| Disulfide bond | 2 | 1253 | 0.373 |
| S-palmitoyl Cysteine | 1 | 1797 | 0.846 |
| N-linked Glycosylation | 7 | 9400 | 0.860 |
| O-linked Glycosylation | 3 | 5925 | 0.947 |
| Acetylation | 1 | 5579 | 0.997 |
| Ubiquitination | 4 | 15862 | 0.999 |
Statistics of trait or disease-associated PTM-SNPs from GWAS catalogs
| Disease | Number of SNP | Number of PTM-SNP | Ratio |
|---|---|---|---|
| Coronary heart disease | 122 | 7 | 0.057 |
| Diabetes Mellitus, Type 1 | 293 | 11 | 0.038 |
| Lupus Erythematosus, Systemic | 345 | 11 | 0.032 |
| Diabetes Mellitus, Type 2 | 279 | 6 | 0.022 |
| Macular Degeneration | 661 | 11 | 0.017 |
| Stroke | 1490 | 17 | 0.011 |
| Coronary Artery Disease | 700 | 7 | 0.010 |
| Heart Failure | 1127 | 10 | 0.009 |
| Cholesterol, LDL | 831 | 7 | 0.008 |
| Iron deficiency | 961 | 7 | 0.007 |
Type 2 diabetes-associated PTM-SNPs
| SNP | PTM | p-value | Gene | Known T2D gene |
|---|---|---|---|---|
| 6235 | Phosphorylation | 1.00E-26 | PCSK1 | |
| 5219 | Phosphorylation | 7.00E-11 | KCNJ11 | O |
| 5215 | Phosphorylation | 5.00E-11 | KCNJ11 | O |
| 1801214 | N-linked Glycosylation | 3.00E-08 | WFS1 | O |
| 1048886 | Phosphorylation | 3.00E-08 | C6orf57 | |
| 1801282 | Phosphorylation | 2.00E-06 | PPARG | O |
Figure 2Schematic illustration of data collection, analysis, and representation of PTM-SNPs.