Literature DB >> 27503141

Advances in the characterization of RNA-binding proteins.

Domenica Marchese^1,2, Natalia Sanchez de Groot^1,2, Nieves Lorenzo Gotor^1,2, Carmen Maria Livi^1,2,3, Gian G Tartaglia^4,5,6.

Abstract

From transcription, to transport, storage, and translation, RNA depends on association with different RNA-binding proteins (RBPs). Methods based on next-generation sequencing and protein mass-spectrometry have started to unveil genome-wide interactions of RBPs but many aspects still remain out of sight. How many of the binding sites identified in high-throughput screenings are functional? A number of computational methods have been developed to analyze experimental data and to obtain insights into the specificity of protein-RNA interactions. How can theoretical models be exploited to identify RBPs? In addition to oligomeric complexes, protein and RNA molecules can associate into granular assemblies whose physical properties are still poorly understood. What protein features promote granule formation and what effects do these assemblies have on cell function? Here, we describe the newest in silico, in vitro, and in vivo advances in the field of protein-RNA interactions. We also present the challenges that experimental and computational approaches will have to face in future studies. WIREs RNA 2016, 7:793-810. doi: 10.1002/wrna.1378 For further resources related to this article, please visit the WIREs website.

Entities: CellLine Chemical Disease Gene Species

Mesh：

Substances：
RNA-Binding Proteins
RNA

Year: 2016 PMID： 27503141 PMCID： PMC5113702 DOI： 10.1002/wrna.1378

Source DB: PubMed Journal: Wiley Interdiscip Rev RNA ISSN： 1757-7004 Impact factor: 9.957

INTRODUCTION

Since ‘the central dogma’ was formulated (the genetic information flows from DNA to protein passing through RNA), our knowledge on the transcriptome has progressed enormously.1, 2 We now know that genes do not code for just a single protein but produce a number of variants: the activity of nuclear RNAs (snRNAs and snoRNAs) and several splicing factors generates several messenger RNAs (mRNAs) with different lengths and exon compositions (Figure 1).2 In addition to protein production, RNA participates in other essential processes such as mRNAs expression regulation (microRNAs, miRNA; small interfering RNAs, siRNA; long noncoding RNAs, lncRNAs)1, 2 and genome protection by transposon silencing (PIWI‐interacting RNA, piRNA).3 RNAs also perform a number of structural and functional tasks, as in the case of rRNAs, which constitute 60% of the ribosome,4 and tRNAs, which carry the amino acids to the ribosome during the translation process.1, 2

Figure 1

RNA‐binding proteins (RBPs) and RNA life. RNA birth is regulated by RBPs (1) that are responsible for maturation (2) and modification (3). RBPs protect (4) and transport (5) RNA around the cell to specific sites (6). Interactions are regulated through a diverse set of binding sites that allows the formation of dynamic complexes sustained by reversible contacts and involving multiple partners (7, 8). When RNA is not required, it can be stored for future needs (7) or degraded (8). The last process of the RNA life cycle is the release of nucleotides that will be employed to build new RNA molecules. When RBPs are impaired (e.g., protein mutation, concentration deregulation, etc.), half‐life, arrangement and location of RNA are affected (9). Every aspect of RNA life, from birth (polymerases) to degradation (nucleases), involves protein binding (Figure 1). A correct interplay between RNAs and RNA‐binding proteins (RBPs) is crucial for the development of cellular processes: miRNAs, for instance, require Argonaute proteins to reach target mRNAs as well as piRNAs associate with PIWI proteins to form silencing complexes that protect the germline genome from transposons.1, 2 Moreover, the interplay between RBPs and RNAs can promote formation of membrane‐less organelles (e.g., P‐bodies, stress granules),5, 6, 7 whose composition can be rapidly adapted to the cell state and the environment conditions (e.g., stress conditions).8 Mutations in RBPs, aberrant interactions or altered RNA processing have been related to a number of human diseases, from neurological disorders to cancer.9, 10, 11, 12 Protein–RNA interactions have been studied using a range of quantitative approaches such as electrophoretic mobility shift assay (EMSA),13 fluorescent anisotropy/polarization,14 Förster resonance energy transfer (FRET),15 surface plasmon resonance (SPR)16 or, more recently, microscale thermophoresis (MST)17, 18 and stochastic optical reconstruction microscopy (STORM).19 Although these approaches have proved to be powerful to assess the specificity and affinity of protein–RNA interactions, they only allow the study of single or few molecular interactions at a time. The recent development of large‐scale quantitative methods, exploiting next‐generation sequencing20, 21 and protein mass spectrometry,22 contributes to the genome‐wide identification of RBPs, RNA targets and cofactors.2 Deep‐sequencing approaches combined with RBPs immunoprecipitation as well as in vitro methods, such as Systematic Evolution of Ligands by Exponential Enrichment SELEX,23, 24 revealed the binding ability of a number of RBPs and showed that many RBPs bind to thousands of transcripts. In parallel to experimental advances, a number of in silico methods have been developed to predict protein–RNA interactions and to detect binding‐sites. Computational tools are particularly useful to predict potential ribonucleoprotein associations and to narrow down a list of interaction partners for experimental validation. For instance, some RBPs recognize particular nucleotide sequences, whereas others bind to the backbone or double‐stranded RNA.25 When modeling protein–RNA interactions, many factors should be considered, including secondary structures, folding and characteristics of binding interfaces.26 Especially physicochemical properties of amino acids, such as structural disorder and polarity, are relevant to characterize the RNA‐binding ability of proteins.27, 28 Indeed, recent studies reported that in addition to classical RNA‐binding domains other regions found in ribosomal proteins, translation elongation factors, zinc fingers as well as structurally disordered parts participate in contacting transcripts.29 Here we review the most recent experimental and computational advances for the detection of protein–RNA interactions and introduce new challenges for future developments in the field.

EXPERIMENTAL METHODS FOR DETECTION OF PROTEIN–RNA INTERACTIONS

Transcripts are never naked and form complexes with partner proteins in ribonucleoprotein particles from their birth to their degradation (Figure 1).30 The assembly of functional complexes and the delivery to final destination involves progression through a series of intermediate complexes and subcellular localizations. For instance, Cajal bodies are sites of noncoding ribonucleoprotein particles maturation, where assembly factors gather to accelerate complicated biochemical reactions. Complexes containing mRNAs often undergo remodeling as they travel from the site of transcription to the cytoplasm where they are translated. The advent of sequencing technologies, together with the introduction of various cross‐linking chemistries, has enabled the development of new high‐throughput methods for the simultaneous detection of hundreds to thousands interactions in a single experiment. The methods can be classified into ‘protein‐centric’ approaches, which reveal RNAs bound to a known protein, and ‘RNA‐centric’ approaches, which characterize proteins interacting with an RNA of interest.

Protein‐Centric Approaches

The predominant protein‐centric methods are based on protein immunoprecipitation from cell lines or tissues, and detection of the co‐purified RNAs (Table 1 and Figure 2).

Table 1

List of Experimental Methods for the Identification of Protein–RNA Interactions

Method			Advantages	Challenges	Disease‐Related? RBP/?RNA	References
Protein‐centric	In vivo	RIP	Genome‐wide Applicable to tissues	High background noise Possible artifacts Low resolution	ELAVL1 (epilepsy, cancer) CELF1 (myotonic dystrophy) IGF2BP1 (cancer) ADAR (Dyschromatosis symmetrica, spastic paraplegia)	Tenenbaum et al.31
		HiTS‐CLIP PAR‐CLIP iCLIP eCLIP	Genome‐wide High specificity High resolution (binding sites)	False negative due to low cross‐linking efficiency Time consuming Challenging set‐up Not applicable to tissues (PAR‐CLIP)	CELF4 (epilepsy, hyperactivity) ELAVL1 (epilepsy, cancer) FMR1 (Fragile‐X mental retardation, autism spectrum disorders) FUS (FTLD, ALS) MBNL1/2 (myotonic dystrophy) NOVA1/2 (POMA) PARK7 (Parkinon's disease) TARDBP (FTLD, ALS) RBFOX (autism)	Licatalosi et al.32 Hafner et al.33 Konig et al.20 Van Nostrand et al.34
	In vitro	RNA‐compete SEQRS RBNS RNA‐MaP HiTS‐RAP MITOMI	Large‐scale Measurement of interaction affinity and specificity Analysis of multiple proteins	Nonphysiological conditions	ELAVL4 NOVA1 (POMA) HNRNPA0 (cancer) MBNL1 (myotonic dystrophy) CELF1 (myotonic dystrophy)	Ray et al.35 Campbell et al.24 Lambert et al.36 Buenrostro et al.37 Tome et al.38 Martin et al.39
RNA‐centric	In vitro	TRAP/RAT RaPID RiboTrap RNA‐assisted chromatography	Relatively easy and flexible Easy purification protocols	High background noise Possible perturbation of RNA folding Large amounts of starting material		Hogg and Collins40 Slobodin and Gerst41 Michlewski and Cáceres42
		Protein microarray	Large‐scale Relatively easy and fast	Possible artifacts due to nonphysiological conditions (protein folding, accessibility, post‐translational modifications, etc.,)	TP53 HRAS MYC (cancer) BCL2 PWRN1 (Prader Willi syndrome)	Scherrer et al.43 Siprashvili et al.44
	In vivo	MS2‐BioTRAP	Fast and easy set‐up Large‐scale Low amounts of starting material	Challenges associated with cell transfection Possible artifacts due to tag		Tsai et al.45
		ChIRP CHART RAP‐MS	Study of protein–RNA interactions under physiological conditions	Time and cost consuming Large amounts of starting material required Challenging set‐up	MALAT1 (cancer) NEAT1 (cancer)	Chu et al.46 Simon47
		Interactome capture	Study of protein–RNA interactions under physiological conditions Identification of unknown RBPs	Possible artifacts (positive and false negative) due to cross‐linking Time consuming Challenging set‐up		Castello et al.29

RIP, RNA Immunoprecipitation, HiTS‐CLIP, High‐throughput sequencing of RNA isolated by cross‐linking immunoprecipitation, PAR‐CLIP, Photoactivable ribonucleoside enhanced CLIP, iCLIP, individual‐nucleotide resolution Cross‐Linking and ImmunoPrecipitation, SEQRS, in?vitro selection high‐throughput sequencing of RNA and sequence specificity landscape, RBNS, RNA Bind‐n‐Seq, RNA‐Map, RNA on a massively parallel array, HiTS‐RAP, High‐throughput sequencing RNA affinity profiling, RNA‐MITOMI, RNA mechanically induced trapping of molecular interactions, TRAP, Tandem RNA affinity purification, RAT, RNA affinity in tandem, RaPID, RNA‐binding protein purification and identification, MS2‐BioTRAP, MS2 in vivo biotin tagged RNA affinity purification, ChIRP, Chromatin isolation by RNA purification, CHART, Capture hybridization analysis of RNA targets, RAP‐MS, RNA antisense purification‐mass spectrometry.

Figure 2

Schematic representation of experimental methods for the identification of protein–RNA interactions. Protein‐centric methods. (a) In vivo approaches include native purification protocols (RNA Immunoprecipitation, RIP) and denaturing protocols (Cross‐linking and Immunoprecipitation, CLIP). In the fist case, RNAs bound to a specific protein are immunoprecipitated from cell lysate in native conditions by using a protein‐specific antibody and, after wash and protein removal by proteinase K treatment, RNAs are reverse transcribed and identified through RNA sequencing.48 In the second case, cells are UV cross‐linked to ‘freeze’ protein–RNA complexes. RNA is digested to obtain fragments of a defined size and the obtained complexes are immunoprecipitated and resolved on an SDS‐PAGE. After isolation from membrane and proteinase K digestion, the RNA fragments are reverse transcribed and sequenced.49 In red and yellow are represented simulated enrichment values of a target and nontarget RNA respectively (lower part of the panel). (b) As an example of in vitro approaches, the schematic workflow of RNA compete is represented.35 RNA libraries are generated by in vitro transcription. Transcripts are incubated with the protein of interest immobilized on an affinity matrix (e.g., streptavidin‐biotin tag system) and the bound fragments are then fluorescently labeled and detected by hybridization on a microarray platform. RNA‐centric methods. (c) In vivo approaches for the identification of proteins bound to an RNA of interest often derive from methods used for the identification of genomic DNA loci targeted by noncoding RNAs. Cells are cross‐linked and lysed. Chromatin is sheared and protein–RNA–DNA complexes are pull‐down by using biotinylated oligos complementary to the sequence of the RNA. After RNA digestion, proteins can be identified by western blot or mass spectrometry analysis.46 (d) In vitro approaches commonly exploit the use of RNA tags to immobilize the RNA of interest onto an affinity matrix. Upon incubation with cell lysate, proteins bind to the immobilized RNA. After washes, the protein–RNA complexes are eluted from the matrix and proteins are characterized by western blot or mass spectrometry.50

List of Experimental Methods for the Identification of Protein–RNA Interactions RIP, RNA Immunoprecipitation, HiTS‐CLIP, High‐throughput sequencing of RNA isolated by cross‐linking immunoprecipitation, PAR‐CLIP, Photoactivable ribonucleoside enhanced CLIP, iCLIP, individual‐nucleotide resolution Cross‐Linking and ImmunoPrecipitation, SEQRS, in?vitro selection high‐throughput sequencing of RNA and sequence specificity landscape, RBNS, RNA Bind‐n‐Seq, RNA‐Map, RNA on a massively parallel array, HiTS‐RAP, High‐throughput sequencing RNA affinity profiling, RNA‐MITOMI, RNA mechanically induced trapping of molecular interactions, TRAP, Tandem RNA affinity purification, RAT, RNA affinity in tandem, RaPID, RNA‐binding protein purification and identification, MS2‐BioTRAP, MS2 in vivo biotin tagged RNA affinity purification, ChIRP, Chromatin isolation by RNA purification, CHART, Capture hybridization analysis of RNA targets, RAP‐MS, RNA antisense purification‐mass spectrometry. Schematic representation of experimental methods for the identification of protein–RNA interactions. Protein‐centric methods. (a) In vivo approaches include native purification protocols (RNA Immunoprecipitation, RIP) and denaturing protocols (Cross‐linking and Immunoprecipitation, CLIP). In the fist case, RNAs bound to a specific protein are immunoprecipitated from cell lysate in native conditions by using a protein‐specific antibody and, after wash and protein removal by proteinase K treatment, RNAs are reverse transcribed and identified through RNA sequencing.48 In the second case, cells are UV cross‐linked to ‘freeze’ protein–RNA complexes. RNA is digested to obtain fragments of a defined size and the obtained complexes are immunoprecipitated and resolved on an SDS‐PAGE. After isolation from membrane and proteinase K digestion, the RNA fragments are reverse transcribed and sequenced.49 In red and yellow are represented simulated enrichment values of a target and nontarget RNA respectively (lower part of the panel). (b) As an example of in vitro approaches, the schematic workflow of RNA compete is represented.35 RNA libraries are generated by in vitro transcription. Transcripts are incubated with the protein of interest immobilized on an affinity matrix (e.g., streptavidin‐biotin tag system) and the bound fragments are then fluorescently labeled and detected by hybridization on a microarray platform. RNA‐centric methods. (c) In vivo approaches for the identification of proteins bound to an RNA of interest often derive from methods used for the identification of genomic DNA loci targeted by noncoding RNAs. Cells are cross‐linked and lysed. Chromatin is sheared and protein–RNA–DNA complexes are pull‐down by using biotinylated oligos complementary to the sequence of the RNA. After RNA digestion, proteins can be identified by western blot or mass spectrometry analysis.46 (d) In vitro approaches commonly exploit the use of RNA tags to immobilize the RNA of interest onto an affinity matrix. Upon incubation with cell lysate, proteins bind to the immobilized RNA. After washes, the protein–RNA complexes are eluted from the matrix and proteins are characterized by western blot or mass spectrometry.50 RNA immunoprecipitation (RIP) implies the purification of RNA–protein interactions in native conditions by using a protein‐specific antibody, and detection of interacting RNAs by either microarray (RIP‐chip) or sequencing (RIP‐seq).31, 51, 52 Despite the genome‐wide potential, the method has practical limitations. Indeed, it is prone to detect nonspecific interactions due to the nonphysiological formation of protein–RNA complexes in solution. As a consequence, interactions identified using native purification methods often require additional validation. RIP limitations have been overcome with the introduction of cross‐linking and denaturing methods, namely cross‐linking and immunoprecipitation (CLIP). CLIP combines UV cross‐linking of RBPs to their cognate RNA molecules with stringent purification of protein–RNA complexes that are resolved and size‐selected on an SDS‐PAGE before proceeding to high‐throughput sequencing of cDNA library (HITS‐CLIP).32 Photoactivable ribonucleoside enhanced CLIP (PAR‐CLIP) and individual‐nucleotide resolution CLIP (iCLIP) are approaches derived from modified cross‐linking or library‐preparation protocols that allow the identification of cross‐linking sites at a single‐nucleotide resolution.32, 33, 53, 54 CLIP has enabled the characterization of RNA binding profiles of several RBPs involved in neurologic disorders and cancer, thus helping in understanding their role in disease. The first CLIP experiment, published by Ule et al., shed light on the role of NOVA 1 and 2 proteins in paraneoplastic opsoclonus‐myoclonus‐ataxia (POMA), an autoimmune neurologic disease characterized by abnormal motor inhibition.55 NOVA proteins were found to regulate alternative splicing of RNAs encoding multiple components of inhibitory synapses, such as GABAβ 2 receptor and GIRK2, which mediate slow inhibitory postsynaptic potentials, the K+ voltage‐gated channel KCNQ3 or the nicotinic acetylcholine receptors β2 and α2, highly represented in the GABAergic interneurons.55 The combination of RIP‐chip and CLIP approaches has allowed the identification of targets of the mammalian ELAVl family whose four members, ELAVL 1,2,3 and 4, (also known as HuR, HuB, HuC, and HuD) are implicated in different cancers and neurological diseases.56, 57 In particular, ELAVL1, the only member of the family expressed in both neuronal and non neuronal cells, has been found particularly abundant in breast, ovary, colon, and brain cancers58 associated with poor prognosis.59 RIP‐chip and CLIP have identified a transcriptome‐wide list of targets (including Cyclin D1, E1, A2 and B1, EGF, c‐Myc, p27, COX‐2, and BRCA1) and indicated that ELAVL1 functions as a major hub for regulating RNA metabolism in the cell at different levels, from pre‐mRNA alternative splicing to mature mRNA stability and microRNA biogenesis.57, 60 The other three members, ELAVL2–4 (nELAVl), are highly enriched in neurons and show a unique hierarchical expression during cortical development.61 HITS‐CLIP on mice cortical tissue has revealed that nELAVl regulated transcripts are mainly involved in synaptic cytoskeleton assembly and disassembly, amino acid and sugar biosynthetic pathways. In particular, nELAVl have been found to regulate the alternative splicing of the gene coding for the glutaminase enzyme, the major responsible for the synthesis of the excitatory neurotransmitter glutamate.56 The complex protocol and bioinformatics required for data analysis represent the main disadvantage of CLIP approaches (Figure 2). As a matter of fact, high experimental failure rates are reported. Recently, an enhanced CLIP (eCLIP) protocol has been developed.34 In addition to reducing hands‐on time to as few as 4 days, eCLIP dramatically decreases required library amplification by 1000 fold and enhances the rate of success at generating libraries with high usable reads percentages across diverse RBPs, maintaining the single‐nucleotide resolution of previous methods. In addition, paired‐matched input controls improve the signal‐to‐noise ratio for the discovery of authentic binding sites. 102 eCLIP experiments for 73 diverse RBPs have been generated (available at https://www.encodeproject.org), providing an unprecedented source of data for the study of protein–RNA functional networks. Despite its successful applications, a major concern when using CLIP is that UV‐induced cross‐linking is still poorly understood at the biophysical level. Only a very small percentage (1–5%) of protein–RNA complexes present in the cells can be efficiently cross‐linked, and it is not clear which types of interactions might be unseen.62 For instance, several RBP families do not directly associate with nucleic acid bases but interact with other elements (i.e., the sugar phosphate backbone) showing low cross‐linking efficiency. Moreover, UV only cross‐links direct protein–RNA interactions but it does not capture interactions occurring with protein complexes, thus providing just part of the information. The study of in vivo interactions is limited to contacts formed in a certain cell type and at a specific time point. To better understand the physicochemical properties controlling protein–RNA interactions, a number of new methods allow the in vitro screening of interactions between proteins and libraries of randomly generated RNA sequences by combining the use of microarray and microfluidic platforms with molecule fluorescent labeling and RNA sequencing technologies. While in vitro evolution SELEX has bias towards highest‐affinity targets,36 other methods enable the characterization of lower‐specificity and medium‐range affinities with single proteins or multiprotein complexes.63 Incubation of RNA libraries with a protein of interest immobilized on an affinity matrix is followed by fluorescent labeling of selected RNAs and hybridization to a microarray, in the case of RNA‐compete (Figure 2), or deep sequencing in the case of in vitro selection high‐throughput sequencing of RNA and sequence specificity landscape (SEQRS) and RNA Bind‐n‐Seq (RBNS).25, 36 These methods are often applied to identify consensus elements of RBPs (see also Computational Methods for Detection of Protein–RNA Interactions). Quantitative analysis of RNA on a massively parallel array (RNA‐MaP) and high‐throughput sequencing RNA affinity profiling (HiTS‐RAP), consist in the in situ synthesis of RNA libraries inside an Illumina sequencing flow cell followed by incubation with a fluorescently labeled protein and quantification of molecular interactions.37, 38 These approaches, together with RBNS, provide quantitative measurements of dissociation constants (Kd) through the use of multiple protein concentrations. Moreover, methods relying on the use of fluorescent proteins, such as the previously mentioned RNA‐MaP and HiTS‐RAP, as well as the RNA mechanically induced trapping of molecular interactions (RNA‐MITOMI),39 provide effective visualization of multiple proteins simultaneously, thus revealing the effects of protein partners. These approaches have great range of applicability, especially considering that RNA metabolism is regulated by multiple RBPs to form functional particles (e.g., small nuclear ribonucleoproteins, telomerases, ribosomal subunits, UTR‐regulatory complexes, etc).

RNA‐Centric Approaches

RNA‐centric methods aim to identify RBPs targeting a single RNA of interest. The majority of the existing methods exploit tagged RNAs as a bait to capture and characterize all proteins bound to it by mass spectrometry analysis (Table 1 and Figure 2). In vitro synthesized RNA can be chemically tagged through the incorporation of modified ribonucleotides that contain biotin, fluorescent dyes, or, alternatively, natural or artificial aptamers (e.g., S1, D8, MS2 hairpin loop, etc.,) can be incorporated.40, 41, 42 In addition to their in vitro application (e.g., TRAP, RAT, RaPID, RiboTrap, RNase‐assisted RNA chromatography), RNA tagging system can be applied in vivo, as in the example of MS2‐BioTRAP method, where a MS2 hairpin loop tagged RNA and the bacteriophage MS2 coat protein fused to a protein tag (e.g., HB or streptavidin), are co‐expressed to capture the RNA of interest by exploiting the high affinity interaction between the MS2 protein and the MS2 hairpin loop.45 All these methods are relatively flexible, and their in vivo applicability allows the study of protein–RNA interactions in physiological conditions. Nevertheless, the incorporation of a tag into the RNA bait may alter its secondary structures and possibly the formation of ribonucleoprotein complexes. In addition, MS2‐BioTRAP is only applicable in easy‐to‐transfect cells and the overexpression of at least one of the two interacting molecules might lead to experimental artifacts. Capture hybridization analysis of RNA targets (CHART), Chromatin isolation by RNA purification (ChIRP) and RNA antisense purification (RAP) are methods designed to identify DNA and proteins targeted by noncoding‐RNAs in vivo (Figure 2). These approaches involve cell cross‐linking and pull‐down of the RNA of interest using short biotinylated oligodeoxyribonucleotides that are complementary to the endogenous RNA. After reversion of cross‐linking, the RNA‐associated DNA and proteins are identified by sequencing and mass spectrometry (MS) analysis, respectively.46, 47, 64, 65 The design of antisense oligonucleotides with high affinity to accessible single‐stranded regions of RNAs is often a challenging step in these approaches. Moreover, these protocols are time consuming and usually require large amounts of starting material in order to generate sufficient product for MS. CHART was first applied for the characterization of protein interactors of the human long noncoding RNAs NEAT1 (nuclear‐enriched abundant transcript 1) and MALAT1 (metastasis‐associated lung adenocarcinoma transcript 1) that localize to nuclear speckles and paraspeckles, respectively.66 Speckles and paraspeckles are nuclear bodies in close proximity to each other, comprising many RBPs, including splicing factors. Interestingly, paraspeckles have been implicated in sequestering RNAs that respond to cellular stress. Although NEAT1 and MALAT1 have specific binders, the overlap of their protein interactome suggested potential redundancy or cooperation in regulating nuclear organization around nuclear bodies. ChiRP and RAP‐MS have been applied to the discovery of the protein interactome of Xist (X‐inactive specific transcript), a lncRNA required for X chromosome inactivation (XCI) of one of the two X chromosomes in female cells, enabling dosage compensation between XX females and XY males.66 Through an RAP‐MS approach, McHugh and colleagues recently characterized ten Xist‐specific interactors in mouse embryonic stem cells (mESC).67 One of them, SAF‐A (scaffold attachment factor‐A, also known as HNRNPU) was previously shown to be required for tethering Xist to the inactive X chromosome in differentiated cells, while five of these proteins are implicated in transcriptional repression, chromatin regulation, and nuclear organization. These results partially overlap with a ChiRP‐MS based study conducted on the same year by Chu and colleagues, in which 81 Xist interactors were described.68 These proteins are mainly involved in chromatin modification, nuclear matrix and RNA remodeling pathways. Notably, this analysis reveals two sets of proteins that interact with Xist in a developmentally regulated manner, shedding light on a potential step‐wise assembly of Xist binding proteins from the pluripotent state to cell differentiation. A number of other methods based on the in vitro or in vivo screening of protein libraries have been developed. As an example, protein microarrays have been recently used to test the binding of proteins with a specific RNA in vitro.43, 44 The RNA is fluorescently labeled and hybridized on a protein chip. This method allows the screening of thousands of proteins in less than a day, and relatively small amounts of RNA are required. On the other hand, the quality of the data is less robust because it strongly depends on the proteins conditions on the array (i.e., folding, RBDs accessibility, post‐translational modifications, etc). Finally, the interactome capture, a method developed in 2012 by Castello et al., allows the simultaneous recovery and characterization of the whole proteome associated with mRNAs in living cells.29 This approach, originally applied to HeLa and Hek293 cells,69 and lately to mouse embryonic stem cells70 and yeast,71 led to the discovery of new potential RBPs lacking the canonical RNA‐binding domains but enriched in features such as protein disorder and repetitive low complexity amino acid regions. The majority of these candidate RBPs have also enzymatic activity, and, while the RNA binding capacity of some of them has been validated, further evidence should be provided for the large majority of these proteins.

Future Experimental Challenges

Despite the enormous progress done in the last few decades, it is clear that each of the experimental methods offers a partial, and in some cases potentially erroneous, view on protein–RNA interactions. In this scenario, two main questions arise: to what extent in vitro interactions are relevant in vivo? Does in vivo binding imply functionality? As regulatory regions interact with competing RBPs and RNAs, only a small percentage of in vitro determined binding sites is actually occupied in vivo. Very often, binding regions become inaccessible due to mRNA localization into granular assemblies (see section Beyond the Protein–RNA Complex: Membrane‐Less Organelles).72 It should be also mentioned that RNAs lacking annotated biding sites are found to interact in vivo because of indirect interactions or as a result of molecule sequestration in subcellular compartments, where high concentrations favor low affinity or unspecific interactions.71 Most importantly, in vivo occupancy not always indicates functionality. As a matter of fact, to unravel the complexity of the RNA biology, it will be essential to develop new tools for the integration of interaction data with global functional assays, such as for instance Ribosome profiling.73

COMPUTATIONAL METHODS FOR DETECTION OF PROTEIN–RNA INTERACTIONS

A large number of computational methods address the problem of characterizing RNA partners of specific RBPs. MEME,74 RBPmap,75 SeAMotE,76 and RNAcontext 77 identify motifs enriched in RNA targets.26 The use of sequence motifs allows discovery of targets in RNA datasets, but ab initio predictions of RNA interactions are not possible without previous experimental knowledge on the RBPs of interest. A different class of computational approaches aims to identify RNA‐binding residues78, 79, 80 and RNA‐binding regions using primary‐, secondary‐ or tertiary‐structure information.81, 82, 83 Methods such as Struct‐NB,84 PRIP,85 SPOT‐Seq,86 and OPRA 87 predict the RNA‐binding ability by identifying regions in the protein surface that accommodate nucleotide chains. As three‐dimensional structures are needed to perform the calculations, these approaches are limited by the existence of available templates. Yet, using a library of 1164 nonredundant protein–RNA complexes (95% sequence identity cutoff) and the folding recognition technique SPARKS X88 the SPOT‐Seq‐RNA approach89 has been used to characterize 2418 novel RBPs in the human proteome of which 291 are reported in interactome capture experiments29 (see RNA‐centric methods in section Experimental Methods for Detection of Protein–RNA Interactions). Algorithms relying on primary‐structure features have a clear advantage over tertiary‐structure methods that require three‐dimensional references to compare structures87 and atomistic details90 to study interactions. Methods based on primary‐structure exploit evolutionary information (i.e., conservation of specific residues in sequence alignments), secondary‐structure propensities and information on physicochemical properties of amino acids (e.g., hydrophobicity). The binding elements are often classified using machine‐learning methods such as Support Vector Machine (SVM), Random Forest (RF), and Naïve Bayes (NB). For further information about the algorithms, we refer the Reader to reviews describing features and techniques in detail.81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93 In the following text we will provide a short description of the most used algorithms (Table 2) and their published performances.

Table 2

List of Computational Methods for the Identification of Protein–RNA Interactions

Prediction	Examples	Advantages	Disadvantages	References
Binding motif (RNA)	MEME	de novo binding site discovery	High‐throughput data are required as input	Bailey et al.94
Binding motif (RNA)	SeAMotE	de novo binding site discovery	Sequence complexity is a limitation	Agostini et al.76
Binding residue	Pprint	Evolutionary information	RNA‐binding domains cannot be identified	Kumar et al.79
	BindN+			Wang et al.78
	RNAbindR+			Walia et al.80
Domain (protein)	HMMER	Domain recognition	Annotation of RNA‐binding domains are required	Finn et al.95
Domain (protein)	catRAPID signature	Annotation of RNA‐binding domains are not required	Single amino acid resolution has not been implemented	Livi et al.83
RNA–protein interaction	catRAPID	Runs on high‐throughput data	RNA < 1200 nt	Bellucci et al.96 Agostini et al.97
	catRAPID	Runs on high‐throughput data	Protein < 750 aa	Bellucci et al.96 Agostini et al.97
	RPISeq	High sensitivity	Low specificity	Muppirala et al.98
	RPISeq	High sensitivity	Max 100 sequences per run	Muppirala et al.98

List of Computational Methods for the Identification of Protein–RNA Interactions BindN 68 predicts DNA‐ and RNA‐binding residues through a SVM trained on hydrophobicity, dissociation constant (pKa), and molecular mass of amino acids. The method was validated on a dataset of 100 protein–RNA complexes characterized through X‐ray crystallography and nuclear magnetic resonance (NMR), and shows a cross‐validation accuracy of 0.69. Using a slightly different codification of physicochemical properties and integrating them with evolutionary information, the BindN+ algorithm78 reaches an accuracy to of 0.78. The evolutionary information is derived from a position‐specific scoring matrix (PSSM) generated by PSI‐BLAST,99 that searches against a nonredundant database of protein sequences. With the same approach, Pprint 100 combines evolutionary information from PSSMs with SVMs, thus predicting RNA‐binding sites with an accuracy of 0.81. Similarly, RNAProB 101 employs a smoothed PSSM encoding scheme to predict RNA‐binding sites on proteins and shows an accuracy of 0.89. Slightly different types of physicochemical properties are combined in RNABindR,27 that exploits accessible surface area, sequence specific entropy, hydrophobicity, secondary structure propensities, and electrostatics. In the testing phase, RNABindR identifies interface residues with an accuracy of 0.85. Adding the contribution of HomPRIP,55 a sequence homology‐based method for prediction of RNA‐binding sites, RNABindR+ shows an accuracy of 0.83 on a larger benchmark set of 200 proteins.80 The working principles of these algorithms are based on different assumptions: the conservation, physicochemical properties or topological properties of the binding site. Hence, it is challenging to compare the algorithms; their applicability depends on the kind of dataset that needs to be analyzed. For more details, the Reader can refer to comparative studies that describe the tools in more detail.92, 102 While BindN,68 Pprint,100 RNAProB,101 and RNABindR27 predict the RNA‐binding ability of individual amino acids without considering the sequence context, HMMER 82 is designed to perform homology searches in protein sequences using Hidden Markov Models and multiple sequence alignments of domain families. HMMER functionality is limited to previously annotated domain‐families, as the tool does not perform de novo detection of binding‐domains. A recent method, ,83 identifies RNA‐binding regions by considering physicochemical properties that are present in known RBPs.83 catRAPID signature exploits properties such as hydrophobicity, secondary structure, disorder, and burial.83 Each feature defines a unique signature, or profile, containing position‐specific information arranged in sequential order from the N‐ to the C‐terminus.28, 103 In addition to the RNA‐binding score, catRAPID signature predicts regions contacting RNA. On a test set of mouse proteins harboring noncanonical RNA‐binding domains,70 catRAPID signature shows an accuracy of 0.71 in predicting the RNA‐binding ability. When applied to newly discovered RBPs, the algorithm discriminates RBPs from nonRBPs with an area under the receiver operating curve of 0.76.29, 69 In Table 3, we show the performances of catRAPID signature,83 BindN+,78 Pprint,100 and RNABindR+80 to identify nonclassical RNA‐binding domains.70

Table 3

Performances on Detecting RNA‐Binding Regions

Method	ACC^a	sens^b	spec^c	prec^d
catRAPID	0.67	0.76	0.60	0.65
BindN+	0.38	0.37	0.39	0.38
PPrint	0.47	0.49	0.45	0.49
RNAbindR+	0.48	0.53	0.42	0.48

We analyzed 102 proteins containing nonclassical RNA‐binding domains and 102 without annotated RNA‐binding domains with catRAPID signature and three other algorithms.27, 68, 83, 100 The performances are measured using a. accuracy, b. sensitivity, c. specificity, and d. precision.

Performances on Detecting RNA‐Binding Regions We analyzed 102 proteins containing nonclassical RNA‐binding domains and 102 without annotated RNA‐binding domains with catRAPID signature and three other algorithms.27, 68, 83, 100 The performances are measured using a. accuracy, b. sensitivity, c. specificity, and d. precision. We also compared the prediction performances of BindN,93 BindN+,78 Pprint,79 RNAproB,101 RNABindR+,80 and catRAPID signature 83 using structural data from 90 folds, 126 families and 100 superfamilies retrieved from the SCOP database102 (Figure 3). In all the classifications, catRAPID signature and BindN+ show the best performances102 recognizing contacts identified at a distance cutoff of 3.5 Å. We note that while BindN,93 BindN+,78 Pprint,79 RNAproB,101 and RNABindR+80 have been designed to predict the RNA‐binding ability of individual residues in protein sequences, catRAPID signature identifies binding regions (Table 2).

Figure 3

Methods to predict RNA binding sites. We calculated performances of BindN,93 BindN+,78 Pprint,79 RNAproB,101 RNABindR+,80 and catRAPID signature 83 on a set of proteins whose RNA‐binding sites have been validated through X‐ray and NMR techniques. As in a recent work,102 three protein classes ‘fold,’ ‘family,’ and ‘superfamily’ were retrieved from SCOP.102 Performances were estimated using the formula (sensitivity + specificity)/2 on 90 folds, 126 families, and 100 superfamilies (details at http://service.tartaglialab.com/static_files/shared/documentation_signature.html). Only a few methods address the question whether a protein of interest interacts with a specific RNA exploiting the properties of both interaction partners. catRAPID96, 97 estimates the interaction potential through van der Waals, hydrogen bonding and secondary structure propensities of both protein and RNA sequences (total of 10 features per molecule)81 and allows identification of binding partners with high confidence (training accuracy = 0.89; testing accuracy = 0.78; see also Cirillo et al.91; Table 2). catRAPID identified the interaction of the N‐terminus of Fragile Mental Retardation Protein FMRP104 with dendritic nontranslatable brain cytoplasmic RNA BC1105 before crystallographic studies determined the presence of tandem Agenet and KH domains.106 Similarly, catRAPID predictions for the physical binding of heterogeneous nuclear ribonucleoprotein U SAF‐A with the long noncoding Xist 107 have been recently confirmed by ChiRP‐MS experiments.68 In addition, catRAPID can be used to perform proteome‐ and transcriptome‐wide calculations,97 which is useful to complement experimental approaches, such as interactome capture29 (see also Experimental Methods for Detection of Protein–RNA Interactions) and to identify target partners.108 Owing to the complexity of the conformational space, a fragmentation procedure,107, 109 based on division of polypeptide and nucleotide sequences into overlapping regions, is used for RNAs longer than 1000 nucleotides and proteins longer than 750 amino acids. Another method called RPISeq predicts interactions combining protein and RNA features in SVM and RF classifiers.98 In RPISeq, RNA sequences are encoded by frequency of nucleotide tetrads (i.e., 4‐mer combinations of [A,C,G,U]), while protein sequences are represented using 3‐mer of 7 amino acid types ([A,G,V], [I,L,F,P], [Y,M,T,S], [H,N,Q,W], [R,K], [D,E], and [C]). More specifically, the RNA is represented by a 4 × 4 × 4 or 256‐dimensional vector, in which each feature represents the 4‐mer normalized frequency appearing in a RNA sequence (e.g., CCAU, AUUG, and GACA). The protein instead is encoded by a 7 × 7 × 7 or 343‐dimensional vector, where each element of the vector corresponds to the normalized frequency of the corresponding triple of amino acids in the sequence. The protein and RNA vectors serve as input for the SVM and RF to predict whether the protein–RNA pair interacts. RPISeq shows significantly high performances in the training phase (accuracy = 0.89) and high sensitivity / low specificity when applied to other sets.81, 92, 106

Future Computational Challenges

catRAPID and other computational tools96, 98 predict the interaction propensity of a protein–RNA pair to interact, dismissing other proteins involved in the physical binding. As a matter of fact, RBPs often form complexes and bind together to their target RNAs (see also ‘RIP’ section in Experimental Methods for Detection of Protein–RNA Interactions).110 A recent computational approach developed to predict transcription factor binding sites suggests that additional information retrieved from protein–protein interactions improves the performances of current predictions.111 Thus, as done for DNA‐binding proteins, integration of protein–protein network layers112 could boost considerably interaction predictions. We envisage that network‐based methods will shed light on the complex life of newly discovered RBPs, their dynamics and role in human diseases.113 A number of reports indicate that RBPs with many partners are enriched in structural disorder.114, 115 The lack of stable tertiary structure increases the ability of RBPs to interact with multiple partners116 and promotes the formation of transient ribonucleoprotein complexes, which are discussed in section Beyond the Protein–RNA Complex: Membrane‐Less Organelles. Moreover, some RBPs have been found to be enriched in glycine, arginine, lysine, and tyrosine motifs29 as well as a number of other patterns71 that could promote transient interactions. Thus, to reach a complete and biologically relevant understanding of protein–RNA interactions, complex formation should be studied as a dynamic process. Data based on techniques such as surface plasmon resonance could allow measurement of interactions in real time providing both equilibrium and kinetic information for the development of new methods.16

BEYOND THE PROTEIN–RNA COMPLEX: MEMBRANE‐LESS ORGANELLES

RBPs and RNAs interact through different types of contacts that facilitate not only the formation of oligomeric complexes but also other types of dynamic assemblies.31 Interestingly, a number of RBPs associate through weak contacts present in structurally disordered regions117 to trigger a process known as liquid‐to‐liquid phase separation.118 Through liquid‐liquid demixing, ribonucleoprotein assemblies form distinct compartments in the cytoplasm and nucleoplasm119 that in vivo can act as membrane‐less organelles.5, 6, 7 The liquid nature of these assemblies allows rapid diffusion favoring chemical reactions on biological timescales. Two of the most common macromolecular assemblies are processing‐bodies (P‐bodies) and stress granules.7, 120 These assemblies are associated with mRNA degradation and storage, respectively, and are built in response to different environmental stimuli.8 P‐bodies and stress granules are conserved throughout evolution and appear in yeast, plant, nematode, fly, and mammalian cells.121 Ribonucleoprotein assemblies have special physicochemical properties that can be investigated using computational approaches such as the multicleverMachine.122, 123 With respect to the yeast proteome (globular proteins with sequence redundancy < 40%), the multicleverMachine predicts that proteins forming P‐bodies and stress granules are depleted in hydrophobicity and enriched in structural disorder and RNA‐binding ability (p‐values < 10−5; Fisher's exact test; Figure 4), which is in agreement with experimental evidence.72, 126 Hydrophobicity and structural disorder highly discriminate the two sets (areas under the ROC curve AUC in the range of 0.70–0.80; calculations available at http://www.tartaglialab.com/cs_multi/confirm/1207/c51ef6cff3/).

Figure 4

Features of yeast proteins forming ribonucleoprotein assemblies. Using physicochemical properties, the multicleverMachine approach122, 123 discriminates P‐bodies and stress granules from other globular proteins. The datasets employed in this analysis comprise 52 P‐body and 62 stress granule proteins, previously reported in experimental works,124 as well as five random sets of 100 globular proteins from Astral SCOPe 2.05 (<40% sequence identity).125 Three specific properties are reported: (a) nucleic acid binding, (b) hydrophobicity, and (c) disorder propensity. For each feature, enrichment or depletion of a set is indicated with a specific color: green indicates that proteins contained in P‐bodies or stress granules are enriched with respect to proteome subsamples; red means depletion with respect to the proteome subsamples; yellow indicates no significant differences between the sets. As observed in previous experimental works,63, 104 proteins found in P‐bodies and stress granules are more structurally disordered and prone to bind nucleic acids as well as less hydrophobic (p‐values <10−5; Fisher's exact test). Sets and results can be accessed at http://www.tartaglialab.com/cs_multi/confirm/1207/c51ef6cff3/ where additional statistical analyses are also reported. An important property of the membrane‐less organelles is the wide spectrum of structural states,126 which is promoted by the different type of interactions between protein and RNA components. The RNA content influences viscoelasticity of assemblies, as well as exchange rate with the surrounding milieu, fusion and fission kinetics.127 As a matter of fact, the protein–RNA contacts can be quite diverse: strong to stabilize RNAs from synthesis to degradation or weak to bind RNAs at a precise moment and place.128 A better understanding of the physical characteristics of ribonucleoprotein components is crucial to investigate their function and dysfunction.

RBP Assemblies and Disease

Current literature indicates that the structure of a ribonucleoprotein assembly depends on a delicate equilibrium between protein and RNA components that is regulated by the quality control machinery (e.g., chaperones).129 Incorrect assembly/disassembly of such macrostructures can jeopardize molecular cell homeostasis. Intriguingly, RBPs found in liquid phase‐separated assemblies are enriched in structural disorder that is also associated with a high risk of misfolding and the formation of toxic protein aggregates.126, 129 Neurons appear to be especially susceptible to failures in proteostasis, a major source of neurodegenerative diseases. Indeed, several dementias and motor‐neuron diseases are associated with accumulation of disordered RBPs, such as TAR‐DNA‐binding protein 43 in Amyotrophic Lateral Sclerosis (ALS) or Ataxin 1 in Ataxia.130 In this context, chaperones perform the important task to keep protein homeostasis and prevent disease. They are upregulated upon cellular stress (e.g., diabetes‐induced [glyc]oxidation stresses) to limit the accumulation of damage131 via folding assistance, blocking aberrant interactions, disaggregating proteins, and facilitating protein degradation.132, 133, 134, 135, 136 We speculate that the ability to form phase‐separated ribonucleoprotein assemblies requires a tightly regulated control machinery, that in stress conditions and aging can be compromised and drive to formation of toxic aggregates. Thus, chaperones can be applied to develop therapeutic interventions, from controlling the initial aggregative events in the ER, to blocking the assembly of cytotoxic aggregates.

Future Challenges in the Characterization of RBP Assemblies

A major challenge in the future will be to identify the components of each ribonucleoprotein assembly and pinpoint the defects associated with RBP dysfunction.128, 137 Currently the characterisation of solid phase separated aggregates is possible.138 However, the study of the liquid phase assemblies is more complex due to their fast exchange with the cell milieu, i.e., their rapid dissolution impeding its isolation. Solid aggregates appear in many diseases, including Huntington's, Creutzfeldt‐Jakob, and Alzheimer's disease. This suggests a common hallmark139 and the existence of common pathways for pharmacotherapeutic targeting.137, 140 In this context, it is crucial to understand the cellular mechanisms controlling health conditions as well as the factors triggering toxicity. Interestingly, the metastable nature of the liquid assemblies offers the possibility to use endogenous biochemical pathways to reverse pathological states.137 As a matter of fact, prion protein neurodegeneration can be reduced by inhibition of RNA granules pathway and stimulation of protein synthesis.140

CONCLUSION

RBPs functions include protection, modification, and transport of transcripts to their translation or degradation sites (Figure 1). The scenario is particularly complex considering the fact that not only RBPs regulate RNAs but also RNAs regulate RBPs.141, 142 Experimental techniques aiming to reveal RNA–protein contacts will have to uncover more details of such interactions (see ‘protein‐centric’ and ‘RNA‐centric’ experimental methods). The study of RBPs will benefit from new computational approaches taking into account information of protein–protein networks (see section Computational Methods for Detection of Protein–RNA Interactions). An important aspect to consider while building computational models is that many proteins exhibit RNA‐binding activity without having canonical RNA‐binding domains.30, 71, 117 As observed for protein–protein interactions, structurally disordered regions bind with high specificity and low affinity.143 Hence, noncanonical domains, which are enriched in structural disorder, could promote low affinity interactions with RNA molecules.108 Accordingly, computational methods predict that the RNA‐binding ability of structural disorder is more pronounced in noncanonical RBPs, indicating that unfolded regions promote RNA interactions.108 With respect to the RNA‐binding ability of full‐length proteins, the contribution of disorder is high at low‐interaction propensities, which suggests that a large number of transcripts can be targeted by noncanonical domains.108 How many other RBPs lacking canonical binding domains exist in nature?69, 83 The repertoire of RBPs needs to be expanded in order to link transcriptomic with metabolomic properties,144 which will involve the development of new computational strategies.80, 83 Most importantly, future computational and experimental investigations will have to focus on noncanonical protein–RNA complexes.145 Indeed, transient interactions, promoted by structurally disordered regions in RBPs, often induce phase‐separation in the nucleus or cytoplasm (see section Beyond the Protein–RNA Complex: Membrane‐Less Organelles).6 The functionality of phase‐separated protein–RNA assemblies is still under investigation but appearance of solid protein–RNA aggregates has been linked with devastating diseases, such as for instance ALS (see section RBP Assemblies and Disease).137, 140 Thus, studying protein–RNA assemblies will be essential to better understand the etiopathogenesis of specific diseases and to design new therapeutic strategies.

143 in total

1. SVM based prediction of RNA-binding proteins using binding residues and evolutionary information.

Authors: Manish Kumar; M Michael Gromiha; Gajendra P S Raghava
Journal: J Mol Recognit Date: 2011 Mar-Apr Impact factor: 2.137

Review 2. Spliceosome structure and function.

Authors: Cindy L Will; Reinhard Lührmann
Journal: Cold Spring Harb Perspect Biol Date: 2011-07-01 Impact factor: 10.005

10. Denaturing CLIP, dCLIP, Pipeline Identifies Discrete RNA Footprints on Chromatin-Associated Proteins and Reveals that CBX7 Targets 3' UTRs to Regulate mRNA Expression.

Authors: Michael Rosenberg; Roy Blum; Barry Kesner; Verena K Maier; Attila Szanto; Jeannie T Lee
Journal: Cell Syst Date: 2017-10-25 Impact factor: 10.304

Advances in the characterization of RNA-binding proteins.

INTRODUCTION

EXPERIMENTAL METHODS FOR DETECTION OF PROTEIN–RNA INTERACTIONS

Protein‐Centric Approaches

RNA‐Centric Approaches

Future Experimental Challenges

COMPUTATIONAL METHODS FOR DETECTION OF PROTEIN–RNA INTERACTIONS

Future Computational Challenges

BEYOND THE PROTEIN–RNA COMPLEX: MEMBRANE‐LESS ORGANELLES

RBP Assemblies and Disease

Future Challenges in the Characterization of RBP Assemblies

CONCLUSION

1. SVM based prediction of RNA-binding proteins using binding residues and evolutionary information.

Review 2. Spliceosome structure and function.

Review 3. A census of human RNA-binding proteins.

Review 4. lincRNAs: genomics, evolution, and mechanisms.

5. Struct-NB: predicting protein-RNA binding sites using structural features.

6. Human FMRP contains an integral tandem Agenet (Tudor) and KH motif in the amino terminal domain.

7. Extracellular chaperones prevent Aβ42-induced toxicity in rat brains.

8. Native 3D structure of eukaryotic 80s ribosome: morphological homology with E. coli 70S ribosome.

9. HMMER web server: 2015 update.

10. Global analysis of yeast mRNPs.

Review 1. Approaches for measuring the dynamics of RNA-protein interactions.

Review 2. Emerging roles of RNA-binding proteins in diabetes and their therapeutic potential in diabetic complications.

3. Detection of nucleic acid-protein interactions in plant leaves using fluorescence lifetime imaging microscopy.

Review 4. Gain-of-Function Mutations: An Emerging Advantage for Cancer Biology.

5. HyPR-MS for Multiplexed Discovery of MALAT1, NEAT1, and NORAD lncRNA Protein Interactomes.

6. Capturing Endogenous Long Noncoding RNAs and Their Binding Proteins Using Chromatin Isolation by RNA Purification.

Review 7. High throughput approaches to study RNA-protein interactions in vitro.

8. Mapping transcriptome-wide protein-RNA interactions to elucidate RNA regulatory programs.

Review 9. The search for RNA-binding proteins: a technical and interdisciplinary challenge.

10. Denaturing CLIP, dCLIP, Pipeline Identifies Discrete RNA Footprints on Chromatin-Associated Proteins and Reveals that CBX7 Targets 3' UTRs to Regulate mRNA Expression.