| Literature DB >> 31511885 |
Yanbo Yang1, Qiong Zhang2, Ya-Ru Miao2, Jiajun Yang1, Wenqian Yang1, Fangda Yu1, Dongyang Wang1, An-Yuan Guo2, Jing Gong1,3.
Abstract
Alternative polyadenylation (APA) is an important post-transcriptional regulation that recognizes different polyadenylation signals (PASs), resulting in transcripts with different 3' untranslated regions, thereby influencing a series of biological processes and functions. Recent studies have revealed that some single nucleotide polymorphisms (SNPs) could contribute to tumorigenesis and development through dysregulating APA. However, the associations between SNPs and APA in human cancers remain largely unknown. Here, using genotype and APA data of 9082 samples from The Cancer Genome Atlas (TCGA) and The Cancer 3'UTR Altas (TC3A), we systematically identified SNPs affecting APA events across 32 cancer types and defined them as APA quantitative trait loci (apaQTLs). As a result, a total of 467 942 cis-apaQTLs and 30 721 trans-apaQTLs were identified. By integrating apaQTLs with survival and genome-wide association studies (GWAS) data, we further identified 2154 apaQTLs associated with patient survival time and 151 342 apaQTLs located in GWAS loci. In addition, we designed an online tool to predict the effects of SNPs on PASs by utilizing PAS motif prediction tool. Finally, we developed SNP2APA, a user-friendly and intuitive database (http://gong_lab.hzau.edu.cn/SNP2APA/) for data browsing, searching, and downloading. SNP2APA will significantly improve our understanding of genetic variants and APA in human cancers.Entities:
Mesh:
Year: 2020 PMID: 31511885 PMCID: PMC6943033 DOI: 10.1093/nar/gkz793
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Simplified schematic showing the workflow of SNP2APA database. (A) Collection of genotype and clinical data. (B) Collection of APA data and GWAS data. (C) Database content in SNP2APA. (D) The online PAS predict tool in SNP2APA. (E) Main functions in SNP2APA.
Summary of apaQTLs in SNP2APA
|
|
| ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Cancer type | No. of amples | No. of enotypes | No. of PA events | Pairs | APA events | apaQTLs | Pairs | APA events | apaQTLs |
| ACC | 77 | 3 567 954 | 3114 | 3026 | 135 | 2864 | 1566 | 158 | 1422 |
| BLCA | 408 | 4 190 525 | 3780 | 17 072 | 218 | 16 472 | 883 | 82 | 819 |
| BRCA | 1 091 | 2 746 335 | 5379 | 11 941 | 212 | 11 376 | 501 | 7 | 470 |
| CESC | 299 | 4 291 784 | 3268 | 14 767 | 211 | 14 358 | 773 | 114 | 745 |
| CHOL | 36 | 4 012 152 | 3564 | 1 710 | 54 | 1610 | 1980 | 34 | 1153 |
| COAD | 285 | 4 499 815 | 3356 | 15 797 | 231 | 15 264 | 1341 | 231 | 1280 |
| DLBC | 48 | 4 845 461 | 3658 | 1630 | 67 | 1580 | 2640 | 126 | 2171 |
| ESCA | 184 | 4 457 611 | 4510 | 27 484 | 615 | 26 009 | 665 | 122 | 644 |
| GBM | 150 | 4 556 998 | 5353 | 36 614 | 801 | 34 381 | 575 | 126 | 539 |
| HNSC | 518 | 4 254 665 | 4646 | 19 960 | 254 | 19 162 | 715 | 18 | 655 |
| KICH | 66 | 3 771 774 | 4477 | 3047 | 136 | 3010 | 1477 | 128 | 1313 |
| KIRC | 525 | 4 577 720 | 4906 | 20 978 | 240 | 19 596 | 905 | 25 | 867 |
| KIRP | 290 | 4 895 360 | 4355 | 19 494 | 280 | 18 258 | 2390 | 330 | 2156 |
| LAML | 122 | 5 143 663 | 3754 | 7675 | 159 | 7588 | 517 | 81 | 501 |
| LGG | 515 | 4 634 138 | 5251 | 29 267 | 330 | 27 826 | 1150 | 41 | 1008 |
| LIHC | 369 | 4 158 963 | 3127 | 10 779 | 159 | 10 511 | 842 | 131 | 738 |
| LUAD | 511 | 4 384 429 | 4471 | 19 628 | 241 | 18 763 | 1210 | 23 | 1160 |
| LUSC | 500 | 3 744 419 | 5126 | 21 804 | 296 | 20 915 | 718 | 14 | 673 |
| MESO | 87 | 4 784 882 | 3999 | 9077 | 237 | 8447 | 1082 | 120 | 1019 |
| OV | 291 | 2 963 431 | 6174 | 21 159 | 382 | 19 702 | 285 | 57 | 285 |
| PAAD | 178 | 4 996 008 | 4466 | 20 351 | 462 | 19 177 | 1065 | 178 | 951 |
| PCPG | 178 | 4 721 561 | 3696 | 25 042 | 571 | 23 185 | 1133 | 131 | 1130 |
| PRAD | 494 | 4 828 721 | 4704 | 30 998 | 332 | 29 312 | 1842 | 15 | 1796 |
| SARC | 258 | 4 088 267 | 3910 | 13 158 | 232 | 12 582 | 897 | 320 | 536 |
| SKCM | 103 | 4 854 570 | 4179 | 12 811 | 310 | 11 672 | 1766 | 144 | 1702 |
| STAD | 414 | 4 310 492 | 6978 | 23 045 | 334 | 21 499 | 478 | 97 | 465 |
| TGCT | 150 | 4 825 013 | 4616 | 20 876 | 487 | 19 369 | 1118 | 204 | 1068 |
| THCA | 503 | 4 877 853 | 519 | 2999 | 35 | 2896 | 10 | 9 | 9 |
| THYM | 120 | 4 940 146 | 3773 | 12 939 | 325 | 12 255 | 971 | 117 | 957 |
| UCEC | 176 | 4 950 486 | 2588 | 8903 | 288 | 8788 | 987 | 212 | 920 |
| UCS | 56 | 3 888 385 | 3733 | 2206 | 99 | 1999 | 1185 | 143 | 1112 |
| UVM | 80 | 4 737 552 | 3149 | 8021 | 186 | 7516 | 552 | 66 | 457 |
Figure 2.The interface of SNP2APA database. (A) Browser bar in SNP2APA. (B) Modules of cis- and trans-apaQTL, survival apaQTL and GWAS apaQTL. (C) An example of survival apaQTL. KM-plot indicated that rs10247994 in KIRC was highly association with patient survival time, and box plot indicated that rs10247994 in KIRC was highly associated with PDUI values of the APA event in PUSH gene. (D) An example of GWAS apaQTL. Box plot indicated that GWAS associated apaQTL rs370151 in BRCA was highly associated with PDUI values of the APA event in AMFR. (E) Search results of cis-apaQTL dataset. (F) The heatmap displaying the correlation coefficient (r) of apaQTLs in the ‘Pancan-apaQTL’ page. The label for y-axis contains SNP ID, gene symbol of APA and APA event. (G) The input of online PAS prediction tool.