| Literature DB >> 19812790 |
Susan M Corley1, Jill E Gready.
Abstract
Using comparative genomics and in-silico analyses, we previously identified a new member of the prion-protein (PrP) family, the gene SPRN, encoding the protein Shadoo (Sho), and suggested its functions might overlap with those of PrP. Extended bioinformatics and conceptual biology studies to elucidate Sho's functions now reveal Sho has a conserved RGG-box motif, a well-known RNA-binding motif characterized in proteins such as FragileX Mental Retardation Protein. We report a systematic comparative analysis of RGG-box containing proteins which highlights the motif's functional versatility and supports the suggestion that Sho plays a dual role in cell signaling and RNA binding in brain. These findings provide a further link to PrP, which has well-characterized RNA-binding properties.Entities:
Keywords: RGG motif; RNA-binding protein; Shadoo; comparative genomics; conceptual biology; methylation; phosphorylation; prion protein
Year: 2008 PMID: 19812790 PMCID: PMC2735946 DOI: 10.4137/bbi.s1075
Source DB: PubMed Journal: Bioinform Biol Insights ISSN: 1177-9322
Proteins with RGG-box domains Selected on Criteria explained in Methods.
| Database name, name (ID) | # AA | RGG domain (residue numbers) | Other RNA binding motifs | Functions/Comments | R | |
|---|---|---|---|---|---|---|
| 1 | SHO_HUMAN, Shadoo (Q5BIV9) | 151 | PrP family member. Likely attached to cell membrane by GPI anchor ( | |||
| 2 | ROA0_HUMAN, hnRNP A0 (Q13151) | 305 | 2 RRM | Found in splicesome C; expected involvement in splicing, pre-mRNA processing. Similar to hnRNP A/B but less abundant. Component of ribonucleosomes. ( | R | |
| 3 | ROA1_HUMAN, hnRNP Al (P09651) | 372 | 2 RRM | Found in splicesome C; expected involvement in splicing, pre-mRNA processing. Transport of poly (A) mRNA from nucleus to cytoplasm. ( | R | |
| 4 | ROA2_HUMAN, hnRNP A2 (P22626) | 353 | 2 RRM | Found in splicesome C; expected involvement in splicing, pre-mRNA processing. Trafficking of RNAs containing the cis-acting A2 response element (A2RE). ( | R | |
| 5 | ROA3_HUMAN hnRNP A3 (P51991) | 378 | 2 RRM | Found in splicesome C; expected involvement in splicing, pre-mRNA processing. Trafficking of RNAs containing the cis-acting A2 response element (A2RE). ( | R | |
| 6 | HNRPD_HUMAN, hnRNPD0 (Q14103) | 355 | 2 RRM | Binds to mRNA with AU-rich elements (AREs) in 3′-UTR. Transcription regulator; binds to ds and ss DNA sequences. Possibly Involved in translationally coupled mRNA turnover. ( | R | |
| 7 | HNRPG_HUMAN, hnRNP G (P38159) | 391 | 1 RRM | Found in splicesome C; expected involvement in splicing, pre-mRNA processing. ( | R | |
| 8 | ||||||
| 9 | HNRPK, hnRNP K (P61978) | 463 | 3 KHC | Found in splicesome C; expected involvement in splicing, pre-mRNA processing. Major poly(C) RNA binding hnRNP. Also binds poly(C) ssDNA. ( | R | |
| 10 | HNRPQ_HUMAN, hnRNP Q (060506) | 623 | 3 RRM | 3 isoforms. Pre-mRNA processing. Associated with splicing intermediates and mature mRNA. Interacts preferentially with poly(A) and poly (U) RNA sequences. Region 518–549 (RGRAGYSQRGGPGSARGVRGAR GGAQQQRGRG) sufficient to bind RNA. ( | R | |
| 11 | HNRPR_HUMAN, hnRNP R (043390) | 633 | 3 RRM | Found in splicesome C; expected involvement in splicing, pre-mRNA processing. ( | R | |
| 12 | HNRPU_HUMAN, hnRNP U (Q00839) | 824 | First discovered that hnRNP U contains a 26-residue peptide (M | R | ||
| 13 | HNRL1, hnRNP U like protein 1 (Q9BUJ2) | 856 | Pre-mRNA processing and transport. Binds poly(G) and poly(C) RNA. Represses transcription driven by viral and cellular promoters. Associated with RBRD7, activates transcription. ( | R | ||
| 14 | PURG_HUMAN, Purine-rich element-binding protein gamma (Q9UJV8) | 347 | In purine-rich element binding protein family (PUR). Binds ssDNA and RNA. Highly expressed in many tumor lines. ( | R | ||
| 15 | DDX4_HUMAN, DEAD box protein 4 (Q9NQI0) | 724 | In DEAD box helicase family. Helicase activity, RNA unwinding, needed in splicing, ribosome biogenesis and RNA degradation. ( | R | ||
| 16 | THOC4_HUMAN, Tho complex subunit 4 (Q86V81) | 257 | 1 RRM | In THO/TREX complex, promotes transcriptional activation, recruited to RNA polymerase during elongation. Associated with spliced mRNA; roles in mRNA export and decay. May mediate interactions of proteins and/or RNA. ( | R/P | |
| 17 | NOLA1_HUMAN, Nucleolar protein family A member 1 (Q9NY12) | 217 | Aka GAR1. Required for ribosome biogenesis and telomere maintenance. Processing or intranuclear trafficking of TERC, the RNA component of the telomerase reverse transcriptase (TERT). RGG box accessory to RNA binding. Interaction with SMN1 requires at least one of the RGG-box regions. ( | R | ||
| 18 | ||||||
| 19 | SFPQ_HUMAN, Splicing factor proline- and glutamine-rich (P23246) | 707 | 2 RRM | Pre-mRNA splicing factor. Binds to intronic polypyrimidine tracts. Possible role in nuclear retention of defective RNAs. Regulates basal and cAMP-dependent transcription. ( | R | |
| 20 | FBRL_HUMAN, Fibrillarin (P22087) | 321 | Involved in pre-rRNA processing. Component of box C/D small nucleolar ribonucleoprotein (snoRNP) particles. ( | R | ||
| 21 | HABP4_HUMAN, Hyaluronan binding protein (HAPB4, Ki-1/57) (Q5JVS0) | 413 | This sequence also constitutes a hyaluronan binding motif, (R/K-X(7)-R/K) where X is not acidic ( | R/P | ||
| 22 | PAIRB_HUMAN, Plasminogen activator inhibitor 1 RNA binding protein (Q8NC51) | 408 | Aka CG1–55. Regulation of mRNA stability/decay. Interacts with CHD3, similar to HABP4. ( | R/P | ||
| 23 | FUS_HUMAN, RNA binding protein FUS (P35637) | 526 | 1 RRM | Component of nuclear riboprotein complexes. Binds ds and ss DNA. Promotes annealing of complementary ssDNAs. ( | R/P | |
| 24 | ||||||
| 25 | ||||||
| 26 | EWS_HUMAN, Ewing sarcoma (EWS) protein (Q01844) | 656 | 1 RRM | Found on cell surface as well as in the nucleus and cytoplasm. Binds RNA. Is a transcriptional activator but this activity can be repressed by the RGG box. May be involved in pre mRNA splicing and transport. | R/P | |
| 27 | It has been suggested that EWS protein may act as a receptor or binding protein for ligands on the cell surface, such as nucleic acids, and thus might mediate extracellular and nuclear events. Interacts with PTK2B/FAK2 then relocates from cytoplasm to ribosomes. ( | |||||
| 28 | RB56_HUMAN, TATA-binding protein-associated factor 2N (Q92804) | 592 | 1 RRM | Binds RNA and ssDNA. Transcription regulation. In RNA polymerase II transcriptional multiprotein complex. Similar to EWS and FUS/TLS. ( | R/P | |
| 29 | ||||||
| 30 | CIRPB_HUMAN, Cold-inducible RNA-binding protein (Q14011) | 172 | 1 RRM | Cold-induced suppression of cell proliferation. Activates the ERK pathway. ( | ||
| 31 | PP1RA_HUMAN, Serine/threonine- protein phosphatase 1 regulatory subunit (Q96QC0) | 940 | Aka p99. Binds mRNA, ssDNA, poly(A) and poly(G). Inhibits phosphatase activities when phosporylated. ( | R | ||
| 32 | FMR1_HUMAN, Fragile X Mental Retardation Protein (FRMP) (Q06787) | 632 | 2 KH | Binds many mRNA transcripts. Transports mRNA from nucleus to cytoplasm. Involved in neural plasticity through translational repression. ( | R/P | |
| 33 | NUCL_HUMAN, Nucleolin (P19338) | 710 | 4 RRM | Found on cell surface as well as in the nucleus and cytoplasm. RGG box is necessary for efficient RNA binding and possibly operates by unstacking RNA bases, but the RRMs are required for specific RNA recognition. Duplex DNA, ssDNA and RNA are all effective ligands for nucleolin. Associated with intranucleolar chromatin and preribosomal particles. Binds to histone HI to induce chromatin decondensation. When attached to the cell surface, nucleolin binds the proteins cytokine MK and HB-19 through its RGG box and acts as cell surface receptor. ( | R/P | |
| 34 | G3BP1_HUMAN, Ras GTPase-activating protein-binding protein 1 (Q13283) | 466 | 1 RRM | G3BP has a role in the ras-signaling pathway affecting cell proliferation and survival as well as being involved in RNA metabolism. Cleaves MYC mRNA And has Helicase activity—unwinds DNA/DNA, RNA/DNA and RNA/RNA. Combining these two functions, it has been suggested the G3BPs are members of a novel subclass of RNA-binding proteins which act at the level of RNA metabolism in response to cell signaling allowing the cell to rapidly control protein activity at a stage after transcription. Also involved in formation of stress granules. ( | R/P | |
| 35 | RGMC_HUMAN, Hemojuvelin (precursor) (Q6ZVN8) | 426 | In repulsive guidance molecule (RGM) family; RGMa and RGMb involved in neural development. GPI anchored. Interacts with neogenin which regulates shedding of GPI anchor. Binding cytokines BMP2 and BMP4 affects BMP signaling pathway and expression of hepcidin. Function of RGG domain unknown. ( | P | ||
| 36 | ZNH14_HUMAN, Zinc finger HIT domain-containing protein 4 (Q9C086) | 343 | Aka PAPA-1. Induces growth and cell cycle arrest at Gl phase. Interacts with splicing factors altering pre-mRNA splicing. Complexes with other nucleolar proteins. ( | P | ||
| 37 | K1C9_HUMAN, Keratin type 1 cytoskeletal 9 (P35527) | 623 | Cytoskeletal and microfibrillar keratin. Function in mature or developing palmar and plantar skin ( | |||
| 38 | MRE11_HUMAN, Double strand break repair protein MRE 11A (P49959) | 708 | In MRN complex, role in dsDNA repair, recombination, maintenance of telomere integrity and meiosis. ( | |||
| 39 | WBP7_HUMAN WW domain-binding protein 7 (Q9UMN6) | 2715 | WW domain-binding (Trithorax homolog 2). Possible transcriptional regulator. ( | |||
| 40 | BRWD3_HUMAN, Bromodomain and WD repeat-containing protein 3 (Q6RI45) | 1802 | In WD repeat protein family involved in cell-cycle progression, signal transduction, apoptosis, gene regulation. Possible transcription factor with 2 bromodomains and 9 WD repeats. May be involved in Jak/Stat pathway. ( | |||
| 41 | CA077_HUMAN, Uncharacterised protein Clorf77 (Q9Y3Y2) | 248 | NA | |||
| 42 | FA98A_HUMAN, Protein FAM98A | 519 | NA | |||
| 43 | (Q8NCA5) FA98A_HUMAN (Q8NCA5) | 519 | NA | |||
| 44 | LS14A_HUMAN, LSM14 protein homolog A (Q8ND56) | 463 | Putative alpha synuclein binding protein. |
RNA-binding motifs in addition to the RGG box.
RRM = 80–90 amino acid sequence containing RNP-1 (octapeptide) and RNP-2 (6 amino acid) consensus sequences.
K homology region as in hnRNP K.
RNA binding.
Protein binding.
Figure 1Alignment of the RGG-box sequence at the N-terminal end of Shos from fish to mammals. LHS are sequence numbers. Mdl, Monodelphis domestica; Xl, Xenopus laevis; Xt, Xenopus tropicalis; Danio, Danio rerio; Fugu, Fugu rubripes, Tetraodon, Tetraodon nigroviridis. Note that region starts with completely conserved KGG triplet. Complete RGG triplets are bolded.
Subset of RGG-box proteins (see Table S1 in Supplementary Information for full list).
| No. (#) | Name (ID) | RGG domain (residue numbers) | Other RNA-binding motifs | Functions/Comments R |
|---|---|---|---|---|
| 1 | Shadoo Q5BIV9 | RGGARGSARGGVRGG (28–42) | PrP family member. Likely attached to cell membrane by GPI anchor. | |
| 12 | hnRNP U Q00839 | RGGGHRGRGGFNMRGGNFRGGAPGNRGG (701–728) | RGG box first identified when a 26-residue sequence (M | |
| 21 | HABP4 Hyaluronan binding protein 4 Q5JVS0 | RGGPRGGMRGRGRGG (185–199) | Also constitutes a hyaluronan binding motif, (R/K–X(7)-R/K) where X is not acidic. Binds strongly and specifically to hyaluronan and weakly to RNA. Involved in mRNA transport, chromatin remodeling, regulation of transcription. | |
| 26 | EWS Ewing sarcoma Q01844 | RGGFDRGGMSRGGRGGGRGGMGSAGERGG (304–332) and RGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGG (565–617) | 1 RRM | Found on cell surface, nucleus and cytoplasm. Is a transcriptional activator but this activity can be repressed by RGG box. May be involved in pre-mRNA splicing and transport. Suggested that EWS protein acts as a receptor or binding protein for ligands on cell surface, such as nucleic acids, and thus might mediate extracellular and nuclear events. |
| 32 | FMRP Fragile X Mental Retardation Protein Q06787 | RGGGGRGQGGRGRGG (534–548) | 2 KH | Binds many mRNA transcripts. Transports mRNA from nucleus to cytoplasm. Involved in neural plasticity through translational repression. |
| 33 | Nucleolin P19338 | RGGGRGGFGGRGGGRGGRGGFGGRGRGGFGGRGGFRGGRGG (656–696) | 4 RRM | Found on cell surface, nucleus and cytoplasm. RGG box is necessary for efficient RNA binding but the RRMs are required for specific RNA recognition. Duplex DNA, ssDNA and RNA are all effective ligands. Acts as cell surface receptor—binds cytokine MK and HB-19 through its RGG box. |
| 34 | G3BP1 Ras GTPase-activating protein-binding protein 1 Q13283 | RGGLGGGMRGPPRGG (435–449) | 1 RRM | Role in ras-signaling pathway affecting cell proliferation and survival as well as involved in RNA metabolism. Cleaves MYC mRNA and has helicase activity. Combining these functions, suggested to be member of novel sub-class of RBPs which act at level of RNA metabolism in response to cell signaling, thus allowing cell to rapidly control protein activity at a stage after transcription. |
number of protein as appears in Table S1.
RNA-binding motifs in addition to the RGG box.
RRM = 80–90 amino acid sequence containing a RNP-1 (octapeptide) and RNP-2 (6 amino acid) consensus sequences.
K homology region as in hnRNP K.
RNA binding.
Protein binding.
Figure 2Frequency histograms of structural and compositional features of the 45 RGG sequences surveyed (Table S1). A) Position of the RGG box region in the proteins. N-terminal (within the first 35% of the protein sequence), C-terminal (last 35% of the protein), Middle, region between. B) Length of the RGG box. C) Spacing between RGG repeats where X may be any residue including Arg and Gly. D) Amino acid composition in terms of type: basic (R, K, H); acidic (D, E); Gly: aromatic (F, Y, W); non-polar amino acids (A, V, L, I, M, P) and polar (S, T, N, Q, C).
Figure 3Alignment of the RGG box of proteins with RGG-X9-RGG spacing. The number of the residue at the start and end of the sequence is given, as well as the total number of exact residue matches (#) to Sho.
Figure S1Alignment of Sho sequences of Eutherian mammals. The 3 PKC phosphorylation sites are indicated by boxes. The N-terminal and C-terminal cleavage sites are shown by arrows.
Phosphorylation sites in RGG box proteins surveyed in this studya.
| Protein | Id | PKC | CK2 | TYR | Expt |
|---|---|---|---|---|---|
| SHO_HUMAN | Q5BIV9 | 3 | 0 | 0 | |
| ROAO_HUMAN | Q13151 | 4 | 3 | 0 | 2 |
| ROA1_HUMAN | P09651 | 10 | 10 | 0 | 9 |
| ROA2_HUMAN | P22626 | 9 | 4 | 0 | 6 |
| ROA3_HUMAN | P51991 | 9 | 7 | 0 | 6 |
| HNRPD_HUMAN | Q14103 | 9 | 6 | 1 | 7 |
| HNRPG_HUMAN | P38159 | 18 | 16 | 1 | 7 |
| HNRPK | P61978 | 7 | 12 | 1 | 6 |
| HNRPQ_HUMAN | O60506 | 7 | 3 | 2 | 2 |
| HNRPR_HUMAN | O43390 | 5 | 5 | 2 | |
| HNRPU_HUMAN | Q00839 | 10 | 5 | 0 | 5 |
| HNRL1 | Q9BUJ2 | 6 | 9 | 0 | 3 |
| PURG_HUMAN | Q9UJV8 | 6 | 2 | 0 | 1 |
| DDX4_HUMAN | Q9NQI0 | 16 | 14 | 0 | |
| THOC4_HUMAN | Q86V81 | 4 | 5 | 0 | 1 |
| NOLA1_HUMAN | Q9NY12 | 3 | 1 | 0 | |
| SFPQ_HUMAN | P23246 | 7 | 4 | 2 | 1 |
| FBRL_HUMAN | P22087 | 5 | 3 | 0 | |
| HABP4_HUMAN | Q5JVS0 | 5 | 8 | 2 | 2 |
| PAIRB_HUMAN | Q8NC51 | 6 | 10 | 0 | 12 |
| FUS_HUMAN | P35637 | 7 | 6 | 1 | |
| EWS_HUMAN | Q01844 | 4 | 5 | 0 | |
| RB56_HUMAN | Q92804 | 7 | 12 | 2 | 1 |
| CIRPB_HUMAN | Q14011 | 4 | 2 | 1 | |
| PP1RA_HUMAN | Q96QC0 | 10 | 12 | 2 | 4 |
| FMR1_HUMAN | Q06787 | 9 | 12 | 1 | 1 |
| NUCL_HUMAN | P19338 | 8 | 23 | 0 | 14 |
| G3BP1_HUMAN | Q13283 | 2 | 6 | 1 | 5 |
| RGMC_HUMAN | Q6ZVN8 | 11 | 2 | 0 | |
| ZNH14_HUMAN | Q9C086 | 3 | 0 | 0 | |
| K1C9_HUMAN | P35527 | 7 | 14 | 3 | |
| MRE11_HUMAN | P49959 | 16 | 17 | 0 | 4 |
| WBP7_HUMAN | Q9UMN6 | 40 | 36 | 3 | 4 |
| BRWD3_HUMAN | Q6RI45 | 34 | 42 | 4 | 4 |
| CA077_HUMAN | Q9Y3Y2 | 4 | 2 | 0 | 1 |
| FA98A_HUMAN | Q8NCA5 | 5 | 9 | 1 | |
| LS14A_HUMAN | Q8ND56 | 4 | 7 | 1 | 11 |
Searches were conducted using the ScanProsite program available on the ExPASy Proteomics Server of the Swiss Institute of Bioinformatics website http://au.expasy.org/.
Number of protein kinase C phosphorylation sites (PS00005).
Number of casein kinase II phosphorylation sites (PS00006).
Number of tyrosine kinase phosphorylation sites (PS00007).
As annotated in the SwissProt database.
Mazroui, R., Huot, M.E., Tremblay, S., Boilard, N., Labelle, Y. and Khandjian, E.W. (2003) Fragile X Mental Retardation protein determinants required for its association with polyribosomal mRNPs. Hum Mol Genet, 12;3087–96.