| Literature DB >> 28176830 |
Yang Pan1, Cheng Yan1, Yu Hu1, Yu Fan1, Qing Pan2, Quan Wan1, John Torcivia-Rodriguez1, Raja Mazumder1,3.
Abstract
Single nucleotide variations (SNVs) can result in loss or gain of protein functional sites. We analyzed the effects of SNVs on enzyme active sites, ligand binding sites, and various types of post translational modification (PTM) sites. We found that, for most types of protein functional sites, the SNV pattern differs between germline and somatic mutations as well as between synonymous and non-synonymous mutations. From a total of 51,138 protein functional site affecting SNVs (pfsSNVs), a pan-cancer analysis revealed 142 somatic pfsSNVs in five or moreEntities:
Mesh:
Substances:
Year: 2017 PMID: 28176830 PMCID: PMC5296879 DOI: 10.1038/srep42169
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Flowchart of the distribution bias analysis of protein functional site affecting single nucleotide variations (pfsSNVs).
Position based* summary of comparison between the previous and current datasets.
| Somatic Mutation | Germline Mutation | Functional Site | Somatic pfsSNV Mapped | Germline pfsSNV Mapped | Total Mutations Mapped | |
|---|---|---|---|---|---|---|
| 994,339 | 710,946 | 259,216 | 25,390 | 13,159 | 38,549 | |
| 1,840,711 | 1,501,666 | 268,478 | 30,848 | 18,619 | 49,467 |
*Statistics summarized in Table 1 is amino acid position based where different functional types occupying the amino acid position are counted as one.
pfsSNVs based* summary of the previous and current datasets.
| Previous Version of Dataset | Current Version of Dataset | %Increase | |||||
|---|---|---|---|---|---|---|---|
| Somatic Mutation | Germline Mutation | Total | Somatic Mutation | Germline Mutation | Total | Increases by | |
| 512 | 351 | 863 | 691 | 432 | 1,123 | 30.1% | |
| 1,214 | 841 | 2055 | 1562 | 1052 | 2,614 | 27.2% | |
| 5,466 | 3917 | 9383 | 7373 | 5282 | 12,655 | 34.9% | |
| 2,375 | 1,997 | 4,372 | 3,217 | 2,630 | 5,847 | 33.7% | |
| 97 | 108 | 205 | 126 | 115 | 241 | 17.6% | |
| 163 | 61 | 224 | 208 | 74 | 282 | 25.9% | |
| 42 | 10 | 52 | 57 | 22 | 79 | 52.0% | |
| 32 | 43 | 75 | 51 | 48 | 99 | 32.0% | |
| 1,574 | 811 | 2385 | 2,084 | 1,040 | 3,124 | 31.0% | |
| 12,286 | 6,395 | 18,681 | 16,630 | 8,444 | 25,074 | 34.2% | |
| 23,761 | 14,534 | 38,295 | 31,999 | 19,139 | 51,138 | 33.5% | |
*Statistics summarized in Table 2 is pfsSNVs based where different functional types occupying the amino acid position are counted separately.
Figure 2Synonymous and non-synonymous SNV occurrence ratio among different types of protein functional site.
The values on each axis show, for each PTM type, the percentage of its site occupied by SNVs. X-axis shows the somatic mutation percentage and Y-axis shows germline mutation percentage. Dot and triangle markings represent non-synonymous and synonymous mutations respectively. Each protein functional site type was shown in different color as per the legend. Linear lines in the figure show global ratio for each mutation type.
Figure 3Occurrence ratio of SNV on the protein functional site neighboring region.
Occurrence ratio of synonymous somatic (SS), synonymous germline (SG), non-synonymous somatic (non-SS) and non-synonymous germline (non-SG) mutations +/−20 amino acid from protein functional sites. Y-axis shows fold of change of SNV occurrence on corresponding amino acid position. Different SNV types are represented as different colors. Value 0 on X-axis indicates the PTM site. T and P represent one sample t-test of the PTM site comparing with its neighboring. P represents the p-value of corresponding one sample t-test.
Figure 4Pan-caner hierarchical clustering of non-synonymous (A) and synonymous (B) somatic mutation occurrence on protein functional site region. Figure shows cancer SNV occurrence at PTM site vs somatic SNV occurrence at a neighboring region for different cancer types. Color indicates fold of change of somatic SNV occurrence. Red color indicates overrepresentation while blue indicates under-representation. Grey color means that there is no detected somatic SNV on corresponding PTM type for corresponding cancer.
Top 20 pfsSNVs1 based on the number of associated cancer type count.
| Gene Name | UnProtKB AC | Variation | Functional Site | Cancer Type Count |
|---|---|---|---|---|
| TP53 | P04637 | R273C | Binding Site | 31 |
| TP53 | P04637 | R248Q | Binding Site | 28 |
| TP53 | P04637 | R248W | Binding Site | 28 |
| TP53 | P04637 | R273H | Binding Site | 27 |
| TP53 | P04637 | H179Y | Binding Site | 26 |
| NRAS | P01111 | Q61K | Binding Site | 24 |
| TP53 | P04637 | C176F | Binding Site | 23 |
| TP53 | P04637 | C275Y | Binding Site | 22 |
| NRAS | P01111 | Q61R | Binding Site | 21 |
| CTNNB1 | P35222 | T41A | Phosphorylation | 21 |
| TP53 | P04637 | C176Y | Binding Site | 20 |
| TP53 | P04637 | H179R | Binding Site | 20 |
| TP53 | P04637 | K132N | Ubiquitylation | 19 |
| TP53 | P04637 | C238F | Binding Site | 19 |
| TP53 | P04637 | C242F | Binding Site | 19 |
| TP53 | P04637 | R248L | Binding Site | 19 |
| TP53 | P04637 | S241F | Binding Site | 18 |
| TP53 | P04637 | C242Y | Binding Site | 18 |
| CTNNB1 | P35222 | S33C | Phosphorylation | 18 |
| TP53 | P04637 | C238Y | Binding Site | 17 |
1pfsSNV: Protein functional site affecting SNV.
Figure 5Circos plot of gene level summarization of 142 key pfsSNVs across five and more cancer types.
Bands are colored by genes, and connect between gene and various types of protein functional sites. Note that, in 142 key pfsSNVs, all key pfsSNVs on CTNNB1 occur on phosphorylation site and all key pfsSNVs on RAS subfamily occur on binding site.
Top 20 pfsSNVs based on the number of associated cancer type count (TP53 excluded).
| Gene Name | UnprotKB ID | Variation | Functional Site | Cancer Type Count |
|---|---|---|---|---|
| NRAS | P01111 | Q61K | Binding Site | 24 |
| CTNNB1 | P35222 | T41A | Phosphorylation | 21 |
| NRAS | P01111 | Q61R | Binding Site | 21 |
| CTNNB1 | P35222 | S33C | Phosphorylation | 18 |
| GNAS | P63092 | R201C | Binding Site | 16 |
| GNAS | Q5JWF2 | R844C | Binding Site | 16 |
| KRAS | P01116 | Q61H | Binding Site | 16 |
| HRAS | P01112 | Q61L | Binding Site | 15 |
| NRAS | P01111 | Q61L | Binding Site | 15 |
| PTEN | P60484 | R130Q | Active Site | 15 |
| CTNNB1 | P35222 | S33F | Phosphorylation | 14 |
| CTNNB1 | P35222 | S37C | Phosphorylation | 14 |
| CTNNB1 | P35222 | S37F | Phosphorylation | 14 |
| CTNNB1 | P35222 | S45F | Phosphorylation | 14 |
| GNAS | P63092 | R201H | Binding Site | 14 |
| GNAS | Q5JWF2 | R844H | Binding Site | 14 |
| CTNNB1 | P35222 | T41I | Phosphorylation | 13 |
| CTNNB1 | P35222 | S45P | Phosphorylation | 13 |
| KRAS | P01116 | Q61K | Binding Site | 13 |
| KRAS | P01116 | Q61L | Binding Site | 13 |
Figure 6Manhattan plot of pfsSNVs enriched in patients with specific cancer types.
X-axis indicates chromosome from 1 to 23 and X, Y in different colors. Each dot in the figure represents a pfsSNV with –log10 (p-value) calculated from a binomial test. Cutoff was set as -log10 (5e-8). A total number of 77 pfsSNVs are statistically significant in specific cancer type. [L] and [G] indicate loss of PTM/active/binding site and gain of PTM/active/binding site respectively. As marked in the figure, [L]NRAS-61-Binding Site and [G]PIK3CA-545/542-Phosphorylation significantly associate with multiple cancer type.
Top 20 pfsSNVs enriched in patients with specific cancer type (full list in Supplementary Table 5).
| PfsSNVs | Cancer Type | PfsSNVs Associated Sample | Total Sample | P-Value |
|---|---|---|---|---|
| [G]2PIK3CA-E545K-Phosphorylation | DOID:1612/breast cancer | 63 | 973 | 9.62E-85 |
| [L]NRAS-Q61R-Binding Site | DOID:4159/skin cancer | 44 | 370 | 9.94E-51 |
| [G]PIK3CA-E542K-Phosphorylation | DOID:1612/breast cancer | 41 | 973 | 4.43E-48 |
| [G]CDC27-A274D-Phosphorylation | DOID:1793/pancreatic cancer | 44 | 210 | 1.59E-41 |
| [G]PIK3CA-E545K-Phosphorylation | DOID:4362/cervical cancer | 33 | 198 | 5.25E-36 |
| [G]KRTAP4-L161V-Phosphorylation | DOID:1793/pancreatic cancer | 39 | 210 | 7.70E-35 |
| [L]NRAS-Q61K-Binding Site | DOID:4159/skin cancer | 33 | 370 | 2.91E-34 |
| [G]ANKRD36-T998S-Phosphorylation | DOID:1793/pancreatic cancer | 37 | 210 | 2.94E-32 |
| [G]EVPL-R336S-Phosphorylation | DOID:1793/pancreatic cancer | 36 | 210 | 5.47E-31 |
| [G]PIK3CA-E545K-Phosphorylation | DOID:11934/head and neck cancer | 28 | 508 | 1.92E-29 |
| [L]PTEN-R130G-Active Site | DOID:363/uterine cancer | 28 | 305 | 4.59E-29 |
| [G]NCOR1-Y20S-Phosphorylation | DOID:1793/pancreatic cancer | 34 | 210 | 1.72E-28 |
| [L]TP53-R273C-Binding Site | DOID:1319/brain cancer | 31 | 287 | 8.97E-27 |
| [L]NRAS-Q61R-Binding Site | DOID:1781/thyroid cancer | 27 | 390 | 3.68E-26 |
| [G]UPF3A-V70L-Phosphorylation | DOID:1793/pancreatic cancer | 31 | 210 | 7.35E-25 |
| [G]KRT8-R23C-Phosphorylation | DOID:1793/pancreatic cancer | 29 | 210 | 1.61E-22 |
| [L]MEF2A-Y105C-Phosphorylation | DOID:1793/pancreatic cancer | 27 | 210 | 3.02E-20 |
| [G]ZNF814-A337V-Phosphorylation | DOID:1793/pancreatic cancer | 27 | 210 | 3.02E-20 |
| [G]SALL1-S159G-Phosphorylation | DOID:1793/pancreatic cancer | 26 | 210 | 3.88E-19 |
[L]1: Loss of protein functional site.
[G]2: Gain of protein functional site.
132 pfsSNVs that satisfy key pfsSNVs identification criteria.
| Gene Name | UniProtAC | Variation | Functional site | Gene Name | UniProtAC | Variation | Functional site |
|---|---|---|---|---|---|---|---|
| BCOR | Q6W2J9 | N1459S | [G]Phosphorylation | TP53 | P04637 | N239D | [L]Binding Site |
| BRAF | P15056 | L597R | [L]Binding Site | TP53 | P04637 | C242S | [L]Binding Site |
| CTNNB1 | P35222 | T41A | [L]Phosphorylation | TP53 | P04637 | R273L | [L]Binding Site |
| CTNNB1 | P35222 | S33C | [L]Phosphorylation | TP53 | P04637 | R280T | [L]Binding Site |
| CTNNB1 | P35222 | S33F | [L]Phosphorylation | TP53 | P04637 | N239S | [L]Binding Site |
| CTNNB1 | P35222 | S37C | [L]Phosphorylation | TP53 | P04637 | C275F | [L]Binding Site |
| CTNNB1 | P35222 | S37F | [L]Phosphorylation | TP53 | P04637 | C176S | [L]Binding Site |
| CTNNB1 | P35222 | S45F | [L]Phosphorylation | TP53 | P04637 | H179L | [L]Binding Site |
| CTNNB1 | P35222 | T41I | [L]Phosphorylation | TP53 | P04637 | R273S | [L]Binding Site |
| CTNNB1 | P35222 | S45P | [L]Phosphorylation | TP53 | P04637 | C238S | [L]Binding Site |
| CTNNB1 | P35222 | S33Y | [L]Phosphorylation | TP53 | P04637 | R273P | [L]Binding Site |
| CTNNB1 | P35222 | S37Y | [L]Phosphorylation | TP53 | P04637 | A276P | [L]Binding Site |
| CTNNB1 | P35222 | S33P | [L]Phosphorylation | TP53 | P04637 | R280G | [L]Binding Site |
| CTNNB1 | P35222 | S37A | [L]Phosphorylation | TP53 | P04637 | C238R | [L]Binding Site |
| CTNNB1 | P35222 | S37P | [L]Phosphorylation | TP53 | P04637 | S241Y | [L]Binding Site |
| CTNNB1 | P35222 | S33A | [L]Phosphorylation | TP53 | P04637 | R280S | [L]Binding Site |
| CTNNB1 | P35222 | S45C | [L]Phosphorylation | TP53 | P04637 | H179Q | [L]Binding Site |
| CTNNB1 | P35222 | S45Y | [L]Phosphorylation | TP53 | P04637 | S241C | [L]Binding Site |
| EGFR | P00533 | T790M | [L]Binding Site | TP53 | P04637 | C242R | [L]Binding Site |
| HRAS | P01112 | Q61L | [L]Binding Site | TP53 | P04637 | C277F | [L]Binding Site |
| HRAS | P01112 | Q61K | [L]Binding Site | TP53 | P04637 | H179D | [L]Binding Site |
| HRAS | P01112 | Q61R | [L]Binding Site | TP53 | P04637 | H179N | [L]Binding Site |
| HRAS | P01112 | Q61H | [L]Binding Site | TP53 | P04637 | R273G | [L]Binding Site |
| IDH1 | O75874 | R132H | [L]Binding Site | TP53 | P04637 | C277Y | [L]Binding Site |
| IDH1 | O75874 | R132C | [L]Binding Site | TP53 | P04637 | S241A | [L]Binding Site |
| IDH1 | O75874 | R132G | [L]Binding Site | TP53 | P04637 | C242W | [L]Binding Site |
| IDH1 | O75874 | R132L | [L]Binding Site | TP53 | P04637 | R248G | [L]Binding Site |
| IDH1 | O75874 | R132S | [L]Binding Site | TP53 | P04637 | R248P | [L]Binding Site |
| IDH2 | P48735 | R172K | [L]Binding Site | TP53 | P04637 | R280I | [L]Binding Site |
| IDH2 | P48735 | R172S | [L]Binding Site | TP53 | P04637 | C176R | [L]Binding Site |
| KRAS | P01116 | Q61H | [L]Binding Site | TP53 | P04637 | C275W | [L]Binding Site |
| KRAS | P01116 | Q61K | [L]Binding Site | TP53 | P04637 | C176W | [L]Binding Site |
| KRAS | P01116 | Q61L | [L]Binding Site | TP53 | P04637 | C275R | [L]Binding Site |
| KRAS | P01116 | Q61R | [L]Binding Site | TP53 | P04637 | A276D | [L]Binding Site |
| NCOR1 | O75376 | Y20S | [G]Phosphorylation | TP53 | P04637 | A276T | [L]Binding Site |
| NRAS | P01111 | Q61R | [L]Binding Site | TP53 | P04637 | R337C | [L]Methylation |
| NRAS | P01111 | Q61K | [L]Binding Site | TP53 | P04637 | R213L | [L]Methylation |
| NRAS | P01111 | Q61L | [L]Binding Site | TP53 | P04637 | R110L | [L]Methylation |
| NRAS | P01111 | Q61H | [L]Binding Site | TP53 | P04637 | R213Q | [L]Methylation |
| NRAS | P01111 | Q61E | [L]Binding Site | TP53 | P04637 | R110P | [L]Methylation |
| PIK3CA | P42336 | E545K | [G]Phosphorylation | TP53 | P04637 | R213P | [L]Methylation |
| PIK3CA | P42336 | E542K | [G]Phosphorylation | TP53 | P04637 | R337L | [L]Methylation |
| PIK3CA | P42336 | N345K | [G]Phosphorylation | TP53 | P04637 | R110C | [L]Methylation |
| PIK3R1 | P27986 | N564D | [L]Binding Site | TP53 | P04637 | R209K | [L]Methylation |
| PTEN | P60484 | R130G | [L]Active Site | TP53 | P04637 | R337H | [L]Methylation |
| PTEN | P60484 | R130Q | [L]Active Site | TP53 | P04637 | S215R | [L]Phosphorylation |
| PTEN | P60484 | D92E | [L]Active Site | TP53 | P04637 | T155N | [L]Phosphorylation |
| PTEN | P60484 | Y155C | [L]Phosphorylation | TP53 | P04637 | T155I | [L]Phosphorylation |
| PTEN | P60484 | Y68H | [L]Phosphorylation | TP53 | P04637 | S215I | [L]Phosphorylation |
| RB1 | P06400 | R661W | [L]Binding Site | TP53 | P04637 | T211I | [L]Phosphorylation |
| SF3B1 | O75533 | K700E | [L]Ubiquitylation | TP53 | P04637 | T155P | [L]Phosphorylation |
| SMAD4 | Q13485 | R361H | [L]Binding Site | TP53 | P04637 | S215G | [L]Phosphorylation |
| SMAD4 | Q13485 | R361C | [L]Binding Site | TP53 | P04637 | S215N | [L]Phosphorylation |
| TP53 | P04637 | R273H | [L]Binding Site | TP53 | P04637 | T155A | [L]Phosphorylation |
| TP53 | P04637 | H179Y | [L]Binding Site | TP53 | P04637 | T284P | [L]Phosphorylation |
| TP53 | P04637 | C176F | [L]Binding Site | TP53 | P04637 | K132N | [L]Ubiquitylation |
| TP53 | P04637 | C275Y | [L]Binding Site | TP53 | P04637 | K132R | [L]Ubiquitylation |
| TP53 | P04637 | C176Y | [L]Binding Site | TP53 | P04637 | K132E | [L]Ubiquitylation |
| TP53 | P04637 | H179R | [L]Binding Site | TP53 | P04637 | K132M | [L]Ubiquitylation |
| TP53 | P04637 | C238F | [L]Binding Site | TP53 | P04637 | K132Q | [L]Ubiquitylation |
| TP53 | P04637 | C242F | [L]Binding Site | TP53 | P04637 | K139N | [L]Ubiquitylation |
| TP53 | P04637 | R248L | [L]Binding Site | TP53 | P04637 | K132T | [L]Ubiquitylation |
| TP53 | P04637 | S241F | [L]Binding Site | TP53 | P04637 | K164E | [L]Acetylation |
| TP53 | P04637 | C242Y | [L]Binding Site | TP53 | P04637 | R273C | [L]Binding Site |
| TP53 | P04637 | C238Y | [L]Binding Site | TP53 | P04637 | R248Q | [L]Binding Site |
| TP53 | P04637 | R280K | [L]Binding Site | TP53 | P04637 | R248W | [L]Binding Site |
Key pfsSNVs identification criteria: gene present in the list of 260 significantly mutated gene (SMG) set; gene present in the list of 573 cancer gene consensus (CGC) gene set; pfsSNV either exists across multiple cancer types or significantly associates with specific cancer type.
Figure 7Kaplan-Meier plot of pancreatic cancer patient survival based on the existence of MEF2A-105-Y-C-Phosphorylation.
X-axis indicate days of survival and Y-axis indicates survival probability. Red and blue lines indicate survival time of pancreatic cancer patients with and without such mutation respectively. Log-rank test shows that, comparing with patients with MEF2A-105-Y-C-Phosphorylation, patients without this pfsSNV survive significantly longer with adjusted p-value of 0.0255. The hazard ratio is 2.348.