| Literature DB >> 23118479 |
Matteo Giulietti1, Francesco Piva, Mattia D'Antonio, Paolo D'Onorio De Meo, Daniele Paoletti, Tiziana Castrignanò, Anna Maria D'Erchia, Ernesto Picardi, Federico Zambelli, Giovanni Principato, Giulio Pavesi, Graziano Pesole.
Abstract
A comprehensive knowledge of all the factors involved in splicing, both proteins and RNAs, and of their interaction network is crucial for reaching a better understanding of this process and its functions. A large part of relevant information is buried in the literature or collected in various different databases. By hand-curated screenings of literature and databases, we retrieved experimentally validated data on 71 human RNA-binding splicing regulatory proteins and organized them into a database called 'SpliceAid-F' (http://www.caspur.it/SpliceAidF/). For each splicing factor (SF), the database reports its functional domains, its protein and chemical interactors and its expression data. Furthermore, we collected experimentally validated RNA-SF interactions, including relevant information on the RNA-binding sites, such as the genes where these sites lie, their genomic coordinates, the splicing effects, the experimental procedures used, as well as the corresponding bibliographic references. We also collected information from experiments showing no RNA-SF binding, at least in the assayed conditions. In total, SpliceAid-F contains 4227 interactions, 2590 RNA-binding sites and 1141 'no-binding' sites, including information on cellular contexts and conditions where binding was tested. The data collected in SpliceAid-F can provide significant information to explain an observed splicing pattern as well as the effect of mutations in functional regulatory elements.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23118479 PMCID: PMC3531144 DOI: 10.1093/nar/gks997
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Distribution of the sites collected in SpliceAid-F, reporting the number of binding sites, including conditional- and no-binding sites, which have been shown unable to bind a splicing factor in some or any conditions, respectively
| Splicing factor | Binding sites | Conditional-binding sites | No-binding sites |
|---|---|---|---|
| 9G8 | 70 | 1 | 29 |
| CUG-BP1 | 42 | 3 | 32 |
| DAZAP1 | 12 | 0 | 3 |
| ESRP1 | 1 | 0 | 1 |
| ESRP2 | 1 | 0 | 1 |
| ETR-3 | 31 | 4 | 36 |
| FMRP | 43 | 4 | 4 |
| Fox-1 | 12 | 0 | 2 |
| Fox-2 | 13 | 0 | 3 |
| hnRNP A0 | 1 | 0 | 0 |
| hnRNP A1 | 143 | 17 | 39 |
| hnRNP A2/B1 | 42 | 1 | 8 |
| hnRNP A3 | 2 | 0 | 1 |
| hnRNP C | 21 | 7 | 13 |
| hnRNP C1 | 11 | 2 | 19 |
| hnRNP C2 | 10 | 0 | 16 |
| hnRNP D | 29 | 2 | 14 |
| hnRNP D0 | 1 | 0 | 0 |
| hnRNP DL | 34 | 0 | 0 |
| hnRNP E1 | 43 | 17 | 14 |
| hnRNP E2 | 39 | 13 | 23 |
| hnRNP F | 67 | 8 | 26 |
| hnRNP G | 1 | 0 | 0 |
| hnRNP H1 | 85 | 8 | 45 |
| hnRNP H2 | 101 | 8 | 42 |
| hnRNP H3 | 60 | 8 | 44 |
| hnRNP I (PTB) | 129 | 13 | 53 |
| hnRNP J | 1 | 0 | 12 |
| hnRNP K | 58 | 15 | 13 |
| hnRNP L | 172 | 4 | 9 |
| hnRNP LL | 13 | 0 | 2 |
| hnRNP M | 1 | 0 | 3 |
| hnRNP P (TLS) | 16 | 4 | 8 |
| hnRNP Q | 10 | 0 | 7 |
| hnRNP U | 19 | 3 | 0 |
| HTra2α | 7 | 0 | 3 |
| HTra2β1 | 20 | 1 | 14 |
| HuB | 44 | 0 | 1 |
| HuC | 2 | 0 | 2 |
| HuD | 51 | 6 | 5 |
| HuR | 72 | 25 | 26 |
| KSRP | 22 | 2 | 7 |
| MBNL1 | 92 | 11 | 34 |
| Nova-1 | 25 | 4 | 18 |
| Nova-2 | 12 | 4 | 9 |
| nPTB | 3 | 0 | 1 |
| PSF | 32 | 0 | 7 |
| QKI | 1 | 0 | 0 |
| RBM25 | 1 | 0 | 1 |
| RBM4 | 8 | 0 | 2 |
| RBM5 | 7 | 0 | 2 |
| Sam68 | 16 | 0 | 5 |
| SAP155 | 1 | 0 | 0 |
| SC35 | 172 | 5 | 47 |
| SF1 | 24 | 1 | 5 |
| SF2/ASF | 248 | 15 | 52 |
| SLM-1 | 1 | 0 | 0 |
| SLM-2 | 6 | 0 | 0 |
| SRm160 | 1 | 0 | 0 |
| SRp20 | 74 | 0 | 23 |
| SRp30c | 25 | 9 | 6 |
| SRp38 | 10 | 0 | 0 |
| SRp40 | 68 | 7 | 27 |
| SRp54 | 1 | 0 | 0 |
| SRp55 | 64 | 7 | 24 |
| SRp75 | 8 | 0 | 18 |
| TDP43 | 22 | 1 | 8 |
| TIA-1 | 39 | 2 | 7 |
| TIAL1 | 37 | 2 | 2 |
| YB-1 | 21 | 1 | 14 |
| ZRANB2 | 19 | 0 | 4 |
| Total | 2590 | 245 | 896 |
RNA-binding proteins collected in SpliceAid-F, including information on their functional domains
| Proteins | RNA-binding domain | ||||
|---|---|---|---|---|---|
| Class | Name | RRM | KH | Zinc finger | Other |
| SRs | SC35, SRp20, SRp38, SRp54, HTra2α, HTra2β1 | 1 | |||
| 9G8 | 1 | 1 CCHC | |||
| SF2/ASF, SRp30c, SRp40, SRp55, SRp75 | 2 | ||||
| SRm160 | 1 PWI | ||||
| hnRNPs | hnRNP C, C1, C2, G | 1 | |||
| hnRNP P | 1 | 1 RanBP2 | |||
| hnRNP A0, A1, A2/B1, A3, D, D0, DL, H3 | 2 | ||||
| hnRNP F, H1, H2, L, LL, M, Q | 3 | ||||
| hnRNP I (PTB), nPTB | 4 | ||||
| hnRNP E1, E2, J, K | 3 | ||||
| hnRNP U | 1 SAP | ||||
| Others | Fox-1, Fox-2 | 1 | |||
| RBM25 | 1 | 1 PWI | |||
| DAZAP1, PSF, TDP43 | 2 | ||||
| RBM4 | 2 | 1 CCHC | |||
| RBM5 | 2 | 1 C2H2, 1 RanBP2 | 1 G-patch | ||
| CUG-BP1, ESRP1, ESRP2, ETR-3, HuB, HuC, HuD, HuR, TIA-1, TIAL1 | 3 | ||||
| QKI, Sam68, SLM-1, SLM-2 | 1 | ||||
| SF1 | 1 | 1 CCHC | |||
| FMRP | 2 | ||||
| Nova-1, Nova-2 | 3 | ||||
| KSRP | 4 | ||||
| ZRANB2 | 2 RanBP2 | ||||
| MBNL1 | 4 C3H1 | ||||
| YB-1 | 1 CSD | ||||
| SAP155 | 11 HEAT repeats | ||||
RRM, RNA Recognition Motif; KH, K homology; CCHC, Cys-Cys-His-Cys; RanBP2, Ran-binding protein 2; PWI, Pro-Trp-Ile; C2H2, Cys-Cys-His-His; G-patch, Glycine rich; SAP, SAF-A/B, Acinus and PIAS; C3H1, Cys-Cys-Cys-His; HEAT, Huntingtin, EF3, PP2A, TOR1; CSD, cold-shock domain.
The activity of some splicing proteins is context dependent
| Activity | SR proteins | hnRNP proteins | Other proteins |
|---|---|---|---|
| Exon definition | 9G8, HTra2α, HTra2β1, SRm160, SRp55, SRp75 | ESRP1, ESRP2, Nova-1, Nova-2, SLM-1, SLM-2 | |
| Mainly exon definition | SC35, SF2/ASF, SRp20, SRp30c, SRp38, SRp40 | ETR-3, FMRP, RBM4, Sam68, TIA-1, TIAL1, YB-1 | |
| Mainly intron definition | hnRNP F, H1, H2, H3, I (PTB), L, Q | Fox-1, Fox-2, MBNL1 | |
| Intron definition | SRp54 | hnRNP A0, A1, A2/B1, A3, C, C1, C2, D, D0, DL, E1, E2, G, J, K, LL, M, P, U, nPTB | CUG-BP1, DAZAP1, HuB, HuC, HuD, HuR, KSRP, PSF, QKI, RBM25, RBM5, SAP155, SF1, TDP43, ZRANB2 |
Exon definition: the proteins of this class have been shown so far to contribute exclusively to exon retention in mature transcripts. Mainly exon definition: usually these proteins promote exon retention, but in rare contexts might promote exon skipping. Mainly intron definition: proteins mostly causing exon skipping. Intron definition: proteins causing exon skipping in all contexts found.