| Literature DB >> 27048167 |
Aino I Järvelin1, Marko Noerenberg1, Ilan Davis1, Alfredo Castello2.
Abstract
RNA-binding proteins play a key role in the regulation of all aspects of RNA metabolism, from the synthesis of RNA to its decay. Protein-RNA interactions have been thought to be mostly mediated by canonical RNA-binding domains that form stable secondary and tertiary structures. However, a number of pioneering studies over the past decades, together with recent proteome-wide data, have challenged this view, revealing surprising roles for intrinsically disordered protein regions in RNA binding. Here, we discuss how disordered protein regions can mediate protein-RNA interactions, conceptually grouping these regions into RS-rich, RG-rich, and other basic sequences, that can mediate both specific and non-specific interactions with RNA. Disordered regions can also influence RNA metabolism through protein aggregation and hydrogel formation. Importantly, protein-RNA interactions mediated by disordered regions can influence nearly all aspects of co- and post-transcriptional RNA processes and, consequently, their disruption can cause disease. Despite growing interest in disordered protein regions and their roles in RNA biology, their mechanisms of binding, regulation, and physiological consequences remain poorly understood. In the coming years, the study of these unorthodox interactions will yield important insights into RNA regulation in cellular homeostasis and disease.Entities:
Keywords: Arginine-rich motif; Basic patch; Co- and post-transcriptional RNA regulation; GAR repeat; Intrinsically disordered protein; Poly-K patch; RGG-box; RNA granule; RNA-binding protein; RS repeat
Mesh:
Substances:
Year: 2016 PMID: 27048167 PMCID: PMC4822317 DOI: 10.1186/s12964-016-0132-3
Source DB: PubMed Journal: Cell Commun Signal ISSN: 1478-811X Impact factor: 5.712
Fig. 1Three classes of disordered protein regions involved in direct RNA-interactions. Blue oval indicates the disordered region of each protein involved in RNA binding. Sequence is shown below the protein model, and typical sequence characteristics are indicated by boxes. Disorder profile was calculated using IUPred [172]. Values above 0.4 are considered disordered
Examples of RNA binding proteins where a disordered, non-classical region is involved in direct RNA binding. Additional details for each protein are presented in Additional file 1: Figure S1. Disorder prediction was calculated using IUPred [172]
| Protein | Properties of disorder involved in RNA binding | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ID | Name | Aliases | Species | Canonical domains | Function | Class | Sequence | Disorder assignment | Target RNA preference | Regulation at disordered region | Interaction with other biomolecules | Ref |
| SRSF1 | Serine/arginine-rich splicing factor 1 | ASF, SF2, SF2P33, SFRS1 | Homo sapiens | 2xRRM | RNA splicing. Essential for heart development. | RS |
196−GPRSPSYGRSRSRSR | Experimental | - | Serine phosphorylated. Becomes more structured upon phosphorylation. | Protein | [ |
| U2AF2 | Splicing factor U2AF 65 kDa subunit | U2AF65 | Homo sapiens | 3xRRM | RNA splicing. | RS |
1−MSDFDEFERQLNENK | Predicted | No specificity | Serine phosphorylation, lysine acetylation, lysine hydroxylation a | Protein | [ |
| NKAP | NF-kappa-B-activating protein | - | Homo sapiens | None | RNA splicing, transcriptional repression. | RS |
1−MAPVSGSRSPDREAS | Predicted | poly (U) | Lysine acetylation a | Protein | [ |
| Nucleo-capsid protein | - | Nucleoprotein, NC, N | Severe acute respiratory syndrome coronavirus (SARS-CoV) | None | Major structural component of virions that associates with genomic RNA to form a long, flexible, helical nucleo-capsid. | Other, RS, polyK/ other |
1−MSDNGPQSNQRSAPR | Experimental | poly (U) ssRNA | - | - | [ |
| ALYREF | Aly/REF export factor 2 | Alyref | Mus musculus | 1xRRM | RNA export. | RG |
22−VNRGGGPRRNRPAIA | Experimental | - | TAP displaces RNA from ALYREF | Protein | [ |
| Aven | Cell death regulator Aven | - | Homo sapiens | None | Positive translational regulator. | RG |
1−MQAERGARGGRGRRP | Predicted | RNA G-quadruplex | Methylated (no influence on RNA binding; influences protein interactions and polysome association). Alternative transcript (mouse) | Protein | [ |
| Caprin-1 | - | GPIAP1, GPIP137, M11S1, RNG105 | Homo sapiens, Xenopus | None | Regulation of localised translation, synaptic plasticity, cell proliferation and migration. | RG |
612−RGGSRGARGLMNGYR | Predicted | - | The end of the human sequence (RGGPPRPNRGMPQMNTQQV) is in an alternative isoform a | - | [ |
| DDX4 | Probable ATP-dependent RNA helicase DDX4 | Vasa | Homo sapiens | None | RNA helicase. | RG |
1−MGDEDWEAEINPHMS | Experimental | Single-stranded DNA. | Arginine methylation. Alternative isoforms a | - | [ |
| EWS | RNA-binding protein EWS | EWSR1 | Homo | 1xRRM | Transcription, splicing. | RG |
288−PGENRSMSGPDNRGR | Predicted | G-quadruplex (RGG3, not RGG1 or RGG2) | Alternative splicing a. Arginine dimethylation at RGG repeats affects protein sub cellular localization | DNA (via RGG3). All three RGG repeats bind SMN protein. | [ |
| FMRP | Fragile X mental retardation protein 1 | FMR1 | Homo sapiens, mouse | 2xKH | Regulation of translation (repressor). | RG |
527−RRGDGRRRGGGGRGQ | Experimental | G quartets, G-quadruplex | Arg methylation. Alternative splicing at regions flanking the RGG-box alters FMRP’s capacity to bind RNA, to be methylated, and associate with polysomes. | C-terminal part of this protein that also includes the RG region is involved in protein-protein interactions. | [ |
| FUS | RNA-binding protein FUS | TLS | Homo sapiens, Drosophila melanogaster | 1xRRM | Splicing, poly-adenylation. | RG |
213−RGGRGRGG−220, 241−PRGRGGGRGGRGG−253, 377−RGGGNGRGGRGRGGP | Predicted | G-quadruplex | Arginine methylation. | - | [ |
| hnRNP U | Heterogeneous nuclear ribonucleoprotein U | HNRPU, SAFA, U21.1 | Homo sapiens | None | RNA stability, U2 snRNP maturation, DNA binding. | RG |
714−MRGGNFRGGAPGNRG | Predicted | Poly (U and poly (G) homopolymers, UGUGG | - | DNA | [ |
| ICP27 | Infected cell protein 27, Immediate-early protein IE63 | - | Herpes simplex virus | None | RNA export. | RG | 138−RGGRRGRRRGRGRGG−152 | Predicted | poly (G) and poly (U) homopolymers, GC-rich sequences | Methylated | - | [ |
| LAF1 | - | DDX3 | C. elegans | None | RNA | RG |
1−MESNQSNNGGSGNAA | Experimental | - | Region 43–106 containing RG-repeat is alternative. | - | [ |
| NXF1 | Nuclear RNA export factor 1 | TAP | Mus musculus, homo sapiens | None | Nuclear export. | RG |
2−ADEGKSYSEHDDERV | Predicted | Non-specific | - | Protein. Overlaps a nuclear localisation and export signals. | [ |
| Nucle- | - | NCL, Protein C23 | Hamster | 4xRRM | Chromatin decondensation, pre-rRNA transcription, ribosome assembly. | RG |
630−MEDGEIDGNKVTLDW | Experimental. Suggested to form a flexible β-spiral. | None | - | Protein (in human) | [ |
| RBMX | RNA-binding motif protein, X chromosome | HNRPG, RBMXP1 | Homo sapiens, Xenopus laevis | 1xRRM | Regulation of transcription, splicing. | RG |
333−DLYSSGRDRVGRQER | Predicted | C-terminal regions binds structured (hairpin) RNA | Identical C-terminal sequence is mouse RBMX is alternatively spliced. | - | [ |
| Foamy virus Gag | - | - | Human foamy virus | None | Viral genome binding, capsid formation. | RG | 485−RPSRGRGRGQN−495 | Predicted | - | - | - | [ |
| TERF2 | Telomeric repeat-binding factor 2 | TRBF2, TRF2 | Homo sapiens | None | Presynaptic plasticity, axonal mRNA transport, telomere maintenance | RG | 43−MAGGGGSSDGSGRAAGRRASRSSGRARRGRHEPGLGGPAERGAG- 86 | Predicted | G-rich, TERRA | Arginine methylation | Protein | [ |
| XTUT7 | - | - | Xenopus laevis | Zinc finger | RNA polyuridylat-ion, translational repression. | Basic patch (poly R) |
453−MRRNRVRRRNNENAG | Predicted | - | - | - | [ |
| Tat | Transactivating regulatory protein | - | Human immuno-deficiency virus (HIV) | None | transcriptional activator, transcription elongation. | Basic patch (poly R) | 49−RKKRRQRRR−57 | Experimental | Structured RNA (HIV-1 Trans-activation response element, TAR) | Arginine methylation (with impact on RNA binding). Lysine acetylation (impact on TAR binding, through an effect on Tat-TAR-CyclinT1 ternary complex formation). | Protein | [ |
| Rev | Regulator of expression of viral proteins | - | Human immuno-deficiency virus (HIV) | None | RNA export. | Basic patch (poly R) |
34−TRQARRNRRRRWRER | Experimental | Structured RNA (HIV-1 Rev response element, RRE) | Arginine methylation. | Protein | [ |
| Tat | Transactivating regulatory protein | S ORF, bTat | Bovine immunodeficiency virus | None | Transcriptional activator | Basic patch (polyR) | 70−RGTRGKGRRIRR−81 | Experimental | Structured RNA (TAR) | - | Protein | [ |
| Coat protein | - | - | Alfalfa mosaic virus | None | Capsid protein, viral RNA. Translation initiation. | Basic patch (poly K) |
6−KKAGGKAGKPTKRSQ | Experimental | - | - | - | [ |
| PAPD5 | Non-canonical poly (A) RNA polymerase PAPD5 | - | Homo sapiens | None | RNA oligoadenylation, RNA stability | Basic patch (poly K) | 557−KKRKHKR−563 | Predicted | May have a preference for structured RNA | Alternative splicing a | - | [ |
| SDAD1 | Protein SDA1 homolog | - | Homo sapiens | None | Protein transport, ribosomal large subunit export from nucleus. | Basic patch (poly K) |
244−RDLLVQYATGKKSSK | Predicted | - | - | - | [ |
| HMGA1 | High mobility group protein HMG-I/HMG-Y | - | Homo sapiens | None | - | (e) AT | 21−TEKRGRGRPRK−31 | Experimental | Binds structured RNA. | Arginine methylation. | DNA | [ |
| Tip5 | Bromodomain adjacent to zinc finger domain protein 2A | BAZ2A | Homo sapiens | None | Epigenetic rRNA gene silencing. | (e) AT | 650−GKRGRPRNTEK−660, 670−KRGRGRPPKVKIT−682 | Experimental | Exhibits preferential binding towards dsRNA | - | DNA | [ |
| PTOV1 | Prostate tumor-overexpressed gene 1 protein | ACID2, PP642 | Homo sapiens | None | Regulation of transcription. | (e) AT |
1−MVRPRRAPYRSGAGG | Predicted | Exhibits preferential binding towards dsRNA | Alternative splicing a | DNA | [ |
| GPBP1 | - | Vasculin, GPBP, SSH6 | Homo sapiens | None | Transcription factor, positive regulation of transcription | e (AT) |
38−NRYDVNRRRHNSSDG | Predicted | Exhibits preferential binding towards dsRNA | Alternative splicing a | DNA | [ |
| SRSF2 | Serine/arginine-rich splicing factor 2 | SFRS2 | Homo sapiens | 1xRRM | RNA splicing. | Other (GRP) |
1−MSYGRPPP−8, | Experimental | UCCA/UG, UGGA/UG | - | [ | |
| Tra2-β1 | Transformer-2 protein homolog beta | TRA2B, SFRS10 | Homo sapiens | 1xRRM | RNA splicing. | Other |
110−NRANPDPNCC−119, | Experimental | GAAGAA (primary), AGAAG (primary), GACUUCAACA AGUC (structured) | - | - | [ |
| hnRNPA1 | Heterogeneous nuclear ribonucleoprotein A1 | HNRPA1 | Human, Xenopus tropical | 2xRRM | hnRNP particle formation, nucleo-cytoplasmic transport, splicing. | Other/RG |
186−MASASSSQRGRSGSG | Predicted | - | Region containing the RG- and FG-repeat peptides is alternatively spliced. | - | [ |
| LUZP4 | Leucine zipper protein 4 | CT-28, | Homo sapiens | None | Nuclear export. | 51−RQNHSKKESPSRQQSKAHRHRHRRGYSRCR−80, 238−LVDTQSDLIATQRDLIATQKDLIATQRDLIATQRDLIVTQRDLVATERDL−287 | Predicted | - | Alternative splicing affecting the first, R-rich region a | Protein | [ | |
| ORF57 | 52 kDa immediate-early phosphoprotein, mRNA export factor ICP27 homolog | - | Herpes-virus saimiri | None | Viral RNA regulation. | Other |
64−RQRSPITWEHQSPLS | Experimental | Viral RNA: GAAGAGG, CAGUCGCGAAGAGG | RNA binding region partially overlaps with ALYREF binding site. | Protein | [ |
| APC | Adenomatous polyposis coli protein | - | Mus musculus | None | Microtubule binding, negative regulator of Wnt signaling. | Other |
2223−SISRGRTMIHIPGLR | Predicted | G-rich motif | - | - | [ |
| CTCF | Transcriptional repressor CTCF | - | Homo sapiens | 11x Zn finger (3 according to Pfam) | - | Other |
575−DNCAGPDGVEGENGG | Predicted | - | Serine phosphorylation a | - | [ |
| Df31 | Decondensation factor 31 | Anon1A4 | D. melanogaster | None | Regulation of higher-order chromatin structure, maintenance of open chromatin. | Other |
1−MADVAEQKNETPVVE | Experimental | Non-specific but does not bind ssDNA or dsDNA. Preferentially binds snoRNA. | - | - | [ |
| Ezh2 | Histone-lysine N-methyltransferase EZH2 | Enx1h | Mus musculus | None | Polycomb group protein. Involved in H3 methylation (H3K9me and H3K27me). | Other |
342−RIKTPPKRPGGRRRG | Predicted | May have a preference for RNA stem loops. | 1st Thr is phosphorylated in a cell cycle dependent manner. Phosphorylation increases RNA binding. | This region overlaps a region involved in protein-protein interactions in human, however, RNA and protein binding regions may be distinct from one another. | [ |
| Nrep | Neuronal regeneration-related protein | P311 | Mus musculus | None | Axonal regeneration, cell differentiation. | Other |
27−KGRLPVPKEVNRKKM | Experimental | - | - | Protein | [ |
| Gemin5 | Gem-associated protein 5 | - | Homo sapiens | None | snRNP assembly, splicing, IRES-mediated translation initiation. | Other |
1297−PNSSVWVRAGHRTLS | Experimental | - | - | - | [ |
| Nup153 | - | - | Homo sapiens | None | Component of the nucleopore, RNA trafficking. | Other |
250−KTSQLGDSPFYPGKT | Predicted | Single-stranded RNA with little sequence preference | Serine and threonine phosphorylation a | - | [ |
| SCML2 | Sex comb on midleg-like protein 2 | - | Homo sapiens | None | Binds Polycomb Repressive Complex 1 and histones. Involved in epigenetic silencing. | Other |
256−SPSEASQHSMQSPQK | Predicted | No specificity, but discriminates between RNA and DNA. | Alternative isoform a , Serine phosphorylation a | - | [ |
| KDM4D | Lysine-specific demethylase 4D | JMJD2D | Homo sapiens | None | Demethylates lysine 9 on histone H3. | Other |
348−MEPRVPASQELSTQK | Experimental | - | - | - | [ |
| - | - | - | Synthetic | None | Bind HIV RNA (RRE) | Other/polyR | SRSSRRNRRRRRRR, | Experimental | Structured RNA (HIV-1 Rev response element, RRE) | - | - | [ |
a According to uniprot, from a large-scale study but no detailed experimental confirmation available
Fig. 2Structural examples RNA-bound disordered regions. a The RGG-peptide of the human FMRP bound to a in vitro-selected guanine-rich sc1 RNA determined by NMR (PDB 2LA5) [78] b Basic patch of disordered bovine immunodeficiency virus (BIV) Tat forms a β–turn when interacting with its target RNA, TAR. Structure determined by NMR (PDB 1MNB) [91] c Dimer of the basic patch containing Rev protein of the human immunodeficiency virus (HIV) in complex with target RNA, RRE, determined by crystallography [102] (PDB 4PMI). Red, peptide; yellow, RNA. Illustrations were created using PyMol
Fig. 3Models for properties of protein disorder in RNA binding. a Attributes of disordered protein regions in RNA interactions. b Post-translational modification and alternative splicing can modulate RNA-binding