| Literature DB >> 25364746 |
Erik Dassi1, Alessandro Quattrone1.
Abstract
The recent explosion of high-throughput sequencing methods applied to RNA molecules is allowing us to go beyond the description of sequence variants and their relative abundances, as measured by RNA-seq. We can now probe for RNA engagement in polysomes, for ribosomes, RNA binding proteins and microRNAs binding sites, for RNA secondary structure and for RNA methylation. These descriptors produce a steadily growing multidimensional array of positional information on RNA sequences, whose effective integration only would bring to decipher the regulatory interplay occurring between proteins, RNAs and their modifications on the transcriptome. This interplay ultimately dictates the degree of mRNA availability to translation, and thus the occurrence of cell phenotypes. However, several issues in data presentation are slowing down effective integration. A standardization effort for new dataset types produced should be urgently undertaken to solve these issues. Providing uniformed experimental details along with datasets processed to be directly usable and employing shared formats would greatly simplify integration efforts, strengthening hypotheses stemming from correlative observations and eventually bringing to mechanistic understanding.Entities:
Keywords: RNA-seq; data format; integration; mRNA; post-transcriptional control; standards; transcriptome; translation
Year: 2014 PMID: 25364746 PMCID: PMC4207014 DOI: 10.3389/fcell.2014.00039
Source DB: PubMed Journal: Front Cell Dev Biol ISSN: 2296-634X
Figure 1Techniques for positional whole-transcriptome probing. The figure displays techniques allowing to study transcriptomes at various observational levels, with particular regard to positional information; all techniques, indicated by their representative feature on transcripts, are based on RNA-seq.
Current approaches for positional information integration on the transcriptome.
| Integrated databases | Collecting and presenting available datasets of heterogeneous types and biological sources; allowing users to mine the data types in combination | Global over a vast number of different data types | Data quality and processing assessment not always possible; achieving database completeness and constant content update is particularly time-intensive | Anders et al., |
| Multi-level profiling | Performing various types of measurements (i.e., mRNA levels, RNA secondary structure, RNA methylation) in the same system of interest (e.g., cell line) to derive correlative patterns | Global over a limited number of data types | Need very different experimental and data analysis expertise; results applicability is limited to the studied system | Genolet et al., |
| Measurements & public data exploitation | Performing a small number of measurements (i.e., mRNA levels only) in the system of interest, and exploiting public data to study genes derived from these measurements (i.e., presence of translational regulation) to infer and validate potential regulatory mechanisms and patterns | Over a small number (dozens) of interesting genes | Publicly available data on the system one wants to use may not be available; further validation and/or mechanistic experiments may be needed | Mazza et al., |
The table describes currently applied approaches to the integration of position-aware RNA datasets. Scope of the various approaches and associated potential issues are outlined along with the references of works employing them.