| Literature DB >> 29106599 |
Daejin Hyung1,2, Jihyun Kim1, Soo Young Cho1, Charny Park1.
Abstract
Alternative splicing confers the human genome complexity by increasing the diversity of expressed mRNAs. Hundreds or thousands of splicing regions have been identified through differential alternative splicing analysis of high-throughput datasets. However, it is hard to explain the functional impact of each splicing event. Protein domain formation and nonsense-mediated decay are considered the main functional features of splicing. However, other functional features such as miRNA target sites, phosphorylation sites and single-nucleotide variations are directly affected by alternative splicing and affect downstream function. Hence, we established ASpedia: a comprehensive database for human alternative splicing annotation, which encompasses a range of functions, from genomic annotation to isoform-specific function (ASpedia, http://combio.snu.ac.kr/aspedia). The database provides three features: (i) genomic annotation extracted from DNA, RNA and proteins; (ii) transcription and regulation elements analyzed from next-generation sequencing datasets; and (iii) isoform-specific functions collected from known and published datasets. The ASpedia web application includes three components: an annotation database, a retrieval system and a browser specialized in the identification of human alternative splicing events. The retrieval system supports multiple AS event searches resulting from high-throughput analysis and the AS browser comprises genome tracks. Thus, ASpedia facilitates the systemic annotation of the functional impacts of multiple AS events.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29106599 PMCID: PMC5753336 DOI: 10.1093/nar/gkx1014
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.(A) Database construction overview of ASpedia. All annotation database components are showed in left and right side. Each annotation item associated with AS event were selected by workflow step 1∼3 in the center. (B) Sequence filtering strategy is classified into three cases and is applied in workflow step 4 of (A).
Summary statistics of alternative splicing events and annotation items for each gene model
| Status | Ensembl | RefSeq | |||
|---|---|---|---|---|---|
| Gene | AS event (or isoform) | Gene | AS event (or isoform) | ||
| Gene model | Total gene | 57 773 | 27 778 | ||
| AS gene/AS | 22 183 | 39 804 | 15 058 | 26 918 | |
| Annotation database | NMD | 2849 | 5120 | 1754 | 2722 |
| miRNA binding site | 1118 | 1911 | 1034 | 1582 | |
| Repeat | 4823 | 9017 | 3019 | 5208 | |
| Protein domain | 7303 | 19 162 | 5556 | 11 060 | |
| PTM | 9888 | 32 516 | 8505 | 20 752 | |
| Protein interaction | 2445 | 43 172 | 2355 | 44 145 | |
| Subcellular localization | 1105 | 1711 | 1327 | 3384 | |
All annotations were assigned distinct gene IDs and matched with AS events. Protein interactions and subcellular localizations were assigned unique transcript IDs.
RNA-binding protein dataset status analyzed from ENCODE CLIP-Seq and RIP-Seq
| Platform | RBP | Total sample | Splicing factor (SF) | SF sample |
|---|---|---|---|---|
| eCLIP-Seq | 112 | 322 | 33 | 102 |
| iCLIP-Seq | 5 | 10 | 3 | 6 |
| RIP-Seq | 23 | 93 | 5 | 25 |
| Total | 135 | 425 | 38 | 133 |
Figure 2.(A) System overview of ASpedia. Our system allows two input types both gene names and BED file. Retrieval system and browser are specialized to confirm each AS event. Final results can be downloaded in a text file format. (B) User interface of ASpedia search results. Left panel shows filtering options and the matched AS event list. Right panel presents AS event genomic profiles, text result and AS browser.