| Literature DB >> 24267658 |
Abstract
BACKGROUND: Alternative splicing is an important and widespread mechanism for generating protein diversity and regulating protein expression. High-throughput identification and analysis of alternative splicing in the protein level has more advantages than in the mRNA level. The combination of alternative splicing database and tandem mass spectrometry provides a powerful technique for identification, analysis and characterization of potential novel alternative splicing protein isoforms from proteomics.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24267658 PMCID: PMC3850988 DOI: 10.1186/1471-2105-14-S14-S13
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
current statistics of database
| Alternative Splicing Events | Count |
|---|---|
| EXON_NM | 1,200,494 |
| E_E_NM | 1,005,388 |
| E_I_AS | 1,005,368 |
| I_E_AS | 1,005,344 |
| E_E_AS | 6,709,352 |
| INTRON_AS | 993,833 |
| Total | 11,919,779 |
| Genes | 56,630 (Ensembl gene ids) |
| Transcripts | 95,260 (Ensembl transcript ids) |
a comparison of alternative splicing in SASD against several common alternative splicing data sources
| SASD | |||||
|---|---|---|---|---|---|
| AS coverage | 9405 | 62,474 | 185,174 | 5,324,542 | 11,919,779 |
| gene coverage | 16,710 | 11,242 | 37,204 | 23,516 | 56,630 |
| Last Updated | 2008 | 2000 | July 2007 | Feb 2010 | May 2013 |
| Curation Type | Manual | Synthetic | Synthetic | Synthetic | Synthetic |
| Query by single gene | No | No | No | Yes | Yes |
| Query by Pathway | No | No | No | No | Yes |
| Query by Disease | No | No | No | No | Yes |
| Query by Drug | No | No | No | No | Yes |
| Query by Organ | No | No | No | No | Yes |
| Query by custom gene set | No | No | No | No | Yes |
| Query by gene sequence | No | No | No | Yes | Yes |
| Gene view | No | No | No | Yes | Yes |
| Transcript view | No | No | No | No | Yes |
| Region view | No | No | No | Yes | Yes |
| Peptide sequence | No | No | No | Yes | Yes |
Figure 1Web interface structure. a) Query by genes or proteins. For example, Ensembl Gene ID, Ensembl Transcript ID, UniGene IDs, Entrez gene IDs, gene names, UniProt IDs, UniProt Accessions or IPI IDs are all supported. To enter multiple values, delimit them by comma, semi-colon, line or space. b) search result. In the alternative splicing analysis table, it shows Query ID, SASD ID, Ensembl Gene ID, Ensembl Transcript ID, Mode, and Type. For each alternative splicing event, users can further browse its region view, gene view and transcript view by clicking on the links at the right corners of Column ID, Column Gene, and Column Transcript. c) gene view d) transcript view e) region view. f) molecule inter-association. It shows molecule, Gene Symbol, Pathway ID (Disease ID, Drug ID, Organ ID), and Pathway Name(Disease Name, Drug Name, Organ Name).
Figure 2Six types of as events. The thick boxes represent exons and the thin boxes stand for introns. The orange lines stand for combination of exon-exon or exon-intron.
17 novel peptide isoforms identified in human fetal liver project
| hits | sequence | gene | transcript | mode | type |
|---|---|---|---|---|---|
| 6 | ENSG00000243910 | ENST00000473885 | i3E4 | i_E_AS | |
| 3 | ENSG00000149273 | ENST00000525690 | E1i1 | E_i_AS | |
| 2 | ENSG00000173163 | ENST00000427417 | i2E3 | i_E_AS | |
| 2 | ENSG00000069329 | ENST00000299138 | i2E3 | i_E_AS | |
| 2 | ENSG00000110955 | ENST00000547250 | i2E3 | i_E_AS | |
| 2 | ENSG00000110955 | ENST00000552919 | E6i6 | E_i_AS | |
| 2 | GAVLGAERPR | ENSG00000120251 | ENST00000507898 | i1 | INTRON_AS |
| 2 | GTLYIIKLSADIR | ENSG00000115593 | ENST00000419482 | i8 | INTRON_AS |
| 2 | ENSG00000170289 | ENST00000320005 | i11E12 | i_E_AS | |
| 2 | IGGIGTVPVGR | ENSG00000172244 | ENST00000306862 | i6 | INTRON_AS |
| 2 | ENSG00000183091 | ENST00000397345 | E102E182 | E_E_AS | |
| 2 | ENSG00000119139 | ENST00000377245 | E15E21 | E_E_AS | |
| 2 | LPLQDVYK | ENSG00000172244 | ENST00000306862 | i6 | INTRON_AS |
| 2 | SPGAWEGGREDR | ENSG00000160111 | ENST00000291440 | i2 | INTRON_AS |
| 2 | ENSG00000087274 | ENST00000264758 | E2E8 | E_E_AS | |
| 2 | ENSG00000137177 | ENST00000259711 | E17E23 | E_E_AS | |
| 2 | WPDSQLAWFLR | ENSG00000119844 | ENST00000238855 | i8 | INTRON_AS |
Figure 3Spectrum of LISQIVSSIT(A)SLR. The blue lines represent b-ions. And the red lines represent y-ions.
8 cancer-specific peptide markers identified in breast cancer
| Peptide sequence | gene | transcript | mode | type | pvalue | h | c |
|---|---|---|---|---|---|---|---|
| SWGGRPQRMGAVPGGVWSAVLMGGAR | ERBB2 | ENST00000269571 | i18 | INTRON_AS | 9.48E-05 | 4 | 20 |
| BRCA2 | ENST00000380152 | E7_E11 | E_E_AS | 8.57E-04 | 1 | 12 | |
| NTRK3 | ENST00000317501 | i2_E3 | i_E_AS | 1.22E-02 | 2 | 10 | |
| ERBB2 | ENST00000269571 | E1_E16 | E_E_AS | 1.22E-02 | 2 | 10 | |
| LSWNHVARALTLTQSLVSSVTSGK | NTRK3 | ENST00000559764 | i2 | INTRON_AS | 1.39E-02 | 4 | 13 |
| BAP1 | ENST00000460680 | E3_E9 | E_E_AS | 3.33E-02 | 9 | 18 | |
| PBRM1 | ENST00000296302 | E9_E29 | E_E_AS | 3.89E-02 | 6 | 14 | |
| EP300 | ENST00000263253 | E22_E31 | E_E_AS | 4.50E-02 | 4 | 11 |
Figure 4Relational metadata model. The datasets were derived by the data generation pipeline.