| Literature DB >> 34554251 |
Yangjun Zhang1,2, Xiangyang Yao1,2, Hui Zhou1,2, Xiaoliang Wu1,2, Jianbo Tian3, Jin Zeng4, Libin Yan5, Chen Duan1,2, Haoran Liu6, Heng Li1,2, Ke Chen1,2, Zhiquan Hu1,2, Zhangqun Ye1,2, Hua Xu2,7,8,9.
Abstract
Alternative splicing (AS) represents a crucial method in mRNA level to regulate gene expression and contributes to the protein complexity. Abnormal splicing has been reported to play roles in several diseases, including cancers. We developed the OncoSplicing database for visualization of survival-associated and differential alternative splicing in 2019. Here, we provide an updated version of OncoSplicing for an integrative view of clinically relevant alternative splicing based on 122 423 AS events across 33 cancers in the TCGA SpliceSeq project and 238 558 AS events across 32 cancers in the TCGA SplAdder project. The new version of the database contains several useful features, such as annotation of alternative splicing-associated transcripts, survival analysis based on median and optimal cut-offs, differential analysis between TCGA tumour samples and adjacent normal samples or GTEx normal samples, pan-cancer views of alternative splicing, splicing differences and results of Cox'PH regression, identification of clinical indicator-relevant and cancer-specific splicing events, and downloadable splicing data in the SplAdder project. Overall, the substantially updated version of OncoSplicing (www.oncosplicing.com) is a user-friendly and registration-free database for browsing and searching clinically relevant alternative splicing in human cancers.Entities:
Mesh:
Year: 2022 PMID: 34554251 PMCID: PMC8728274 DOI: 10.1093/nar/gkab851
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Splice types and quantification method in the SpliceSeq and SplAdder project. (A) Splice model for splice types detected in the SpliceSeq and the SplAdder projects. Exon number was illustrated in the box and splice junctions and alternatively spliced exon were indicated in red lines and boxes respectively. (B) Diagram of alternative splicing detection and PSI calculation by the SpliceSeq and SplAdder software. The PSI-calculation method of intron retention in both SpliceSeq and SplAdder is similar to that of exon skipping described for SpliceSeq software, except that the alternate exon is replaced by retained intron.
Figure 2.Data processing and database construction pipeline. (A) Splicing data collection and processing in the SpliceSeq project. (B) Data collection and processing in the SplAdder project. (C) Analyses for alternative splicing associated to survival, sample type, cancer specificity, clinical indicators and annotated transcripts. (D) Pan-cancer analyses and views for PSI distribution, survival association and sample types difference of alternative splicing.
Figure 3.The landscape of alternative splicing in OncoSplicing. (A) Statistics of alternative splicing of seven splice types in the SpliceSeq project. The full names of splice types and cancer types were listed respectively in the Figure 1A and the Supplementary Table S1. (B) Statistics of alternative splicing of five splice types in the SplAdder project. The cancer type LAML was not included in this project. (C) Statistics of AS associated transcripts in the SpliceSeq project. Blue, green and red bar indicate respectively the number of AS events that both, either and neither of exons splice in and splice out that are associated with at least one annotated transcript. (D) Statistics of AS associated transcripts in the SplAdder project. (E) The number of overlapped AS events in five splice types detected in both the SpliceSeq and SplAdder projects. (F) Consistency of AS events detected in the two projects. Spearman correlation analyses were performed by median PSI values of the overlapped AS events across each cancer type. For each splice type, error bar indicate mean plus or minus standard deviation of correlation coefficients of different cancer types. (G) The number of DASEs detected from the overlapped AS events in the SpliceSeq and SplAdder projects. The five colours in each cancer type indicate different splice types described in (B). (H) Overlap similarity of DASEs detected in these two project. Overlap similarity is defined as ratio that equal to the intersection size of two sets divided by the minimum size. For each splice type, error bar indicate mean plus or minus standard deviation of overlap similarity of different cancer types. ASEs: alternative splicing events. DASEs: Differential alternative splicing events.
Figure 4.Overview of the OncoSplicing database. (A) Browser bar in OncoSplicing and an example of search results on the ‘SplAdder’ page. After querying gene ‘SYK’ in cancer type ‘KIRC’, a result table responses with detailed information of the query gene, alternative splicing, statistic results and function buttons. (B) An example of customized tracks in the UCSC genome browser for the query gene, presenting explanatory diagrams of alternative splicing events and annotated structure of transcripts in the SplAdder and SpliceSeq projects.