Literature DB >> 30103474

The State of Long Non-Coding RNA Biology.

John S Mattick1.   

Abstract

Transcriptomic studies have demonstrated that the vast majority of the genomes of mammals and other complex organisms is expressed in highly dynamic and cell-specific patterns to produce large numbers of intergenic, antisense and intronic long non-protein-coding RNAs (lncRNAs). Despite well characterized examples, their scaling with developmental complexity, and many demonstrations of their association with cellular processes, development and diseases, lncRNAs are still to be widely accepted as major players in gene regulation. This may reflect an underappreciation of the extent and precision of the epigenetic control of differentiation and development, where lncRNAs appear to have a central role, likely as organizational and guide molecules: most lncRNAs are nuclear-localized and chromatin-associated, with some involved in the formation of specialized subcellular domains. I suggest that a reassessment of the conceptual framework of genetic information and gene expression in the 4-dimensional ontogeny of spatially organized multicellular organisms is required. Together with this and further studies on their biology, the key challenges now are to determine the structure⁻function relationships of lncRNAs, which may be aided by emerging evidence of their modular structure, the role of RNA editing and modification in enabling epigenetic plasticity, and the role of RNA signaling in transgenerational inheritance of experience.

Entities:  

Keywords:  epigenetic plasticity; evolution; lncRNA structure-function; regulatory architecture

Year:  2018        PMID: 30103474      PMCID: PMC6162524          DOI: 10.3390/ncrna4030017

Source DB:  PubMed          Journal:  Noncoding RNA        ISSN: 2311-553X


1. A Surprising New Entrant into the Repertoire of Expressed Genes

A role for RNA as regulatory molecules, rather than just as templates (mRNAs) and components of the machinery (ribosomal RNAs, transfer RNAs, spliceosomal RNAs) for the production of proteins, was established in the 1990s with the surprising discovery of the RNA interference (RNAi) and related pathways, which utilize small RNA guides to regulate mRNA stability and translation, and to control transposons [1,2,3,4,5,6,7]. At the same time, seminal long non-protein-coding RNAs (lncRNAs), involved notably in X inactivation (Xist) [8,9,10,11] and in genomic imprinting (H19) [12](later shown to play a role in cancer [13])were also discovered. The subsequent revelation from high-throughput cDNA sequencing (RNAseq) in the 2000s that tens of thousands of long intronic, intergenic and antisense lncRNAs are transcribed from the genomes of mammals [14,15,16,17,18,19,20,21,22,23,24,25,26,27] and other complex organisms [28] was also surprising. The range of cellular non-coding RNAs may be underestimated by sequencing strategies that target polyadenylated RNA, as many are derived from retrotransposons [29,30], introns [24] and/or derived by processing longer transcripts [31,32], including circular lncRNAs [33,34]. Some lncRNAs are extremely long [35,36]. Indeed, it appears that the vast majority of the genomes of all organisms, irrespective of the proportion that is protein-coding, is transcribed, mainly to produce non-coding RNAs [28].

2. An Unwelcome Player

However, despite the RNAi precedent, and with few exceptions [37,38,39], the existence of these uncharacterized lncRNAs was initially ignored or dismissed as “transcriptional noise”. Not only was it unclear how they might fit into the existing conceptual framework of genetic information and gene regulation, assumed to be transacted by proteins acting in combinatoric fashion, their sheer number, if functional, threatened the primacy of this framework, which has long been an article of faith in molecular, cellular and developmental biology. The possibility that lncRNAs might be functional also contradicted the widely held belief, dating from the late 1970s, that the intronic and ‘intergenic’ sequences from which they are transcribed, and which dominate the real estate of the mammalian genome, are largely evolutionary debris (‘junk’) [40], comprised of hangovers from the prebiotic assembly of “genes” [41] expanded by accumulation of retrotransposon parasites (“selfish DNA”) [42]. The idea that the most non-coding RNAs are noise from biologically inert regions of the genome was superficially bolstered by the observation that most are expressed at low levels and are generally less conserved than protein-coding sequences (although there are notable exceptions [43]) similar to ancient retrotransposon-derived sequences, assumed to be non-functional and evolving ‘neutrally’ [44]. This is a circular argument of dubious merit [45], since there is increasing evidence that retrotransposon-derived sequences have been exapted for various functions and coopted as mobile modules to alter the patterns of gene expression [30,45,46,47,48,49]. The comparison of the rate of divergence of an extant set of ancient repeats also does not include the (unknown) number that have diverged to the point of unrecognizability, which therefore underestimates the rate of their presumed neutral evolution, and the extent of evolutionary selection on the genome [45]. The (lack of a high) conservation argument also fails to take into account the fact that adaptive radiation occurs mainly by the relatively rapid evolution of the regulatory sequences under positive selection, that such sequences have quite different structure–function constraints to proteins, and that they are subject to rapid turnover [45,50,51]. Thus, many lncRNAs are likely to be lineage-specific. At this point it is important to remember that the metazoan proteome is remarkably static. Both the nematode and human genomes contain ~20,000 protein-coding genes, most of which are functionally orthologous, despite orders of magnitude difference in their developmental (and cognitive) complexity. By contrast, the proportion of the genome that is non-protein-coding, and the number and range of non-coding RNAs expressed therefrom, increases with developmental complexity [28,52], raising the obvious possibility that these sequences are responsible for specifying developmental complexity and phenotypic diversity.

3. Evidence of Long Non-Coding RNA Functionality

Indeed, there are many different rate classes of sequence evolution in mammals, indicating that at least 45% of the alignable regions of mammalian genomes are not evolving neutrally [53], with at least 18% of the mammalian genome conserved at the level of predicted RNA structure [54]. There are also many indices of lncRNA functionality [55,56], including conservation of promoters [16,57], regulation by canonical transcription factors [58,59], and chromatin signatures of active gene expression [57,59]. Moreover, lncRNA exons have been found to be more conserved than neutrally evolving ancestral repeat sequences, albeit at lower levels than protein-coding genes [57]. The case for lncRNA functionality is also supported by their dynamic expression patterns in differentiating cells and their highly specific spatial (including subcellular) localization [57,60,61,62], especially in the brain [63,64], which also explains their low abundance in RNAseq analyses of whole tissues [26]. Indeed, high-resolution analyses using RNA capture technologies have revealed an extraordinary diversity of lncRNAs, most of which are likely to be cell-specific, and which have yet to be catalogued or characterized [27,65] Perhaps the most intriguing are the 3’UTR-derived lncRNAs that are expressed separately from, and appear to convey differentiation signals independently of, their normally associated mRNAs [66,67]. There have been many studies examining lncRNA biology over the past decade (too many to reference here, but see http://www.lncrnadb.org [68,69], https://lncipedia.org [70] and http://www.noncode.org [71]) linking lncRNAs with cellular processes, including the formation of specialized subnuclear organelles [72,73,74], chromatin domains [75], regulation of splicing [76,77], enhancer action [78,79,80], and binding to chromatin-modifying proteins such as polycomb [81,82,83,84,85], trithorax [61] and Dnmt1 [86,87]. Some of these functions may not be mediated by the lncRNA itself, but through mechanisms associated with their biogenesis [88,89,90,91]. There are also many studies linking lncRNAs with differentiation and development [92,93], and with diseases, including coronary artery disease and diabetes [33,94,95], schizophrenia [96] and cancer (see e.g., [43,97,98]). Knockdown of lncRNAs by small interfering RNA (siRNA)-related methodologies frequently results in observable changes in cellular behavior or characteristics in culture [55]. On the other hand, chromosomal deletion of lncRNA sequences often do not show overt phenotypic consequences. For example, only 5 of 18 lncRNA mouse knockouts resulted in lethality or growth defects [99,100]. However, most phenotypic screens do not examine behavioral or cognitive effects. For example, deletion of the widely brain expressed non-coding RNA BC1 showed no developmental consequences [101], but later tests showed the mutant mice, although having normal brain morphology and no obvious neurological deficits, exhibited decreased exploratory ability and increased anxiety [102]. In this context, it is worth noting that deletion of a subset of the most highly conserved sequences in the mammalian genome, ultraconserved elements (UCEs) [103,104], which are surely functional on the evolutionary evidence, also did not result in obvious abnormalities [105], although a later study showed subtle neurological alterations [106]. While skeptics remain, the most likely interpretation is that the documented functional examples are emblematic of an army of regulatory RNAs that guide epigenetic trajectories and specify cell state during a very complex and precise developmental ontogeny—from a single fertilized cell to a mobile, cognizant adult—and that most of the human genome is devoted to this purpose [37,38,39,107,108,109,110,111,112]. Indeed, the proportion of the mammalian genome devoted to cognitive function, rather than body plan development, may be considerably underestimated, given the preponderance of lncRNA expression in the brain [63]. Not surprisingly then, many lncRNAs are primate-specific [57,113,114]. Indeed, the growing body of evidence is now leading to a general acceptance of the relevance of (many or most) lncRNAs to cell and developmental biology [92], and increasingly neurobiology [63,64,96,115,116,117], with the debate, such as it remains, shifted to the proportion of lncRNAs that may be biologically relevant. For me, the best indicator, although by no means proof, is their precise expression patterns [26], on which basis one can project that most are likely to be functional. If so, the current protein-centric framework for understanding the genetic programming of differentiation and development is incomplete, a legacy of the mechanical worldview that held sway at the birth of molecular biology. Reconsideration of this framework to incorporate not only proteins but also structural and regulatory RNAs [109,111,118,119] is overdue.

4. Long Non-Coding RNA Structure–Function Relationships

The most pressing challenges now are to determine the structure–function relationships in lncRNAs and to parse their functional repertoire. This should resolve lingering questions and place lncRNAs into an integrated conceptual framework, together with small regulatory RNAs, transcription factors and signaling pathways, among others, for understanding the decisional hierarchies that control the 4-dimensional ontogeny of complex multicellular organisms [119]. There is logic and experimental evidence to suggest that lncRNAs have a modular architecture, given their likely role as scaffolds and epigenetic guide molecules [26,82,120]. This is strengthened by a recent high depth sequencing study that found, unexpectedly and in contrast to the limited information that had been previously available [57], that the internal exons of lncRNAs are almost universally alternatively spliced [27], which clearly implies modularity. If this is correct, the establishment of the exon as the primary unit of lncRNA structure–function, combined with the observation of conservation of lncRNA structure [54] and the presence of structural orthologs around the genome [121], should provide a framework for determining which structural RNA modules associate with which effector proteins [121]. It is envisaged that such studies will lead to expanded structure–function databases [122,123,124] whereby specific protein (e.g., polycomb) binding domains in regulatory RNAs can be identified genome- and transcriptome-wide, and thereby the roles of and effector pathways for different lncRNAs and their alternatively spliced isoforms. It may be much harder, as exemplified by snoRNAs, to determine the RNA and DNA targets of lncRNAs, and which modules impart this function. This framework should also allow parsing of the different types and roles of lncRNAs in establishing chromatin territories, enhancer looping, guidance of epigenetic modifier proteins that impose DNA and histone modifications, and the formation of subcellular domains, among others. In addition, while most lncRNAs are nuclear and associated with chromatin, some are cytoplasmically localized [57,63] with functions yet to be discovered. There is increasing evidence that RNAs are involved in the nucleation of liquid crystal domains in conjunction with disordered RNA-binding proteins [125], potentially an entirely new dimension of cell biology beyond that of the well-characterized membrane-bound organelles. High-resolution imaging will be required, along with high-resolution RNA sequencing, and oligonucleotide or antibody capture to dissect the components of the structures where lncRNAs are localized.

5. From Hard- to Soft-Wiring

A new and rapidly emerging frontier is the role of RNA editing [126,127] and RNA modification [128,129] in modulating RNA signaling pathways in response to developmental cues and environmental signals, which may lie at the heart of the epigenetic plasticity seen in physiological adaption, complex diseases such as cancer and diabetes, and brain function [130]. There is still much to do to understand the role of small RNAs, especially the ti/spliRNAs that are derived from transcription start sites and exonic borders [131,132], and fragments of tRNAs [133,134,135] and snoRNAs [136], some of which may function as miRNAs [137,138,139], as well as to decipher their evolutionary links and the regulatory networks in which they participate. Finally, and most intriguingly, is the role of RNA in intercellular and transgenerational inheritance (soft-wired inheritance), for which there is not only evolutionary logic [140,141,142] but also increasing evidence [143,144,145,146,147,148,149]. The emerging picture is not (simply) of RNA as a transient intermediate between ‘gene’ and protein, but rather as the central computational engine of cell biology, differentiation and development, brain function and perhaps even evolution itself. Many textbooks may have to be rewritten once the full dimensions of regulatory RNA biology are revealed.
  149 in total

Review 1.  MicroRNAs: small RNAs with a big role in gene regulation.

Authors:  Lin He; Gregory J Hannon
Journal:  Nat Rev Genet       Date:  2004-07       Impact factor: 53.242

2.  The eukaryotic genome as an RNA machine.

Authors:  Paulo P Amaral; Marcel E Dinger; Tim R Mercer; John S Mattick
Journal:  Science       Date:  2008-03-28       Impact factor: 47.728

Review 3.  Principles of Transgenerational Small RNA Inheritance in Caenorhabditis elegans.

Authors:  Oded Rechavi; Itamar Lev
Journal:  Curr Biol       Date:  2017-07-24       Impact factor: 10.834

4.  So much "junk" DNA in our genome.

Authors:  S Ohno
Journal:  Brookhaven Symp Biol       Date:  1972

5.  Pseudouridylation of tRNA-Derived Fragments Steers Translational Control in Stem Cells.

Authors:  Nicola Guzzi; Maciej Cieśla; Phuong Cao Thi Ngoc; Stefan Lang; Sonali Arora; Marios Dimitriou; Kristyna Pimková; Mikael N E Sommarin; Roberto Munita; Michal Lubas; Yiting Lim; Kazuki Okuyama; Shamit Soneji; Göran Karlsson; Jenny Hansson; Göran Jönsson; Anders H Lund; Mikael Sigvardsson; Eva Hellström-Lindberg; Andrew C Hsieh; Cristian Bellodi
Journal:  Cell       Date:  2018-04-05       Impact factor: 41.582

Review 6.  Noncoding RNAs in Long-Term Memory Formation.

Authors:  Tim R Mercer; Marcel E Dinger; Jean Mariani; Kenneth S Kosik; Mark F Mehler; John S Mattick
Journal:  Neuroscientist       Date:  2008-10       Impact factor: 7.519

7.  Circular intronic long noncoding RNAs.

Authors:  Yang Zhang; Xiao-Ou Zhang; Tian Chen; Jian-Feng Xiang; Qing-Fei Yin; Yu-Hang Xing; Shanshan Zhu; Li Yang; Ling-Ling Chen
Journal:  Mol Cell       Date:  2013-09-12       Impact factor: 17.970

8.  The transcriptional landscape of the mammalian genome.

Authors:  P Carninci; T Kasukawa; S Katayama; J Gough; M C Frith; N Maeda; R Oyama; T Ravasi; B Lenhard; C Wells; R Kodzius; K Shimokawa; V B Bajic; S E Brenner; S Batalov; A R R Forrest; M Zavolan; M J Davis; L G Wilming; V Aidinis; J E Allen; A Ambesi-Impiombato; R Apweiler; R N Aturaliya; T L Bailey; M Bansal; L Baxter; K W Beisel; T Bersano; H Bono; A M Chalk; K P Chiu; V Choudhary; A Christoffels; D R Clutterbuck; M L Crowe; E Dalla; B P Dalrymple; B de Bono; G Della Gatta; D di Bernardo; T Down; P Engstrom; M Fagiolini; G Faulkner; C F Fletcher; T Fukushima; M Furuno; S Futaki; M Gariboldi; P Georgii-Hemming; T R Gingeras; T Gojobori; R E Green; S Gustincich; M Harbers; Y Hayashi; T K Hensch; N Hirokawa; D Hill; L Huminiecki; M Iacono; K Ikeo; A Iwama; T Ishikawa; M Jakt; A Kanapin; M Katoh; Y Kawasawa; J Kelso; H Kitamura; H Kitano; G Kollias; S P T Krishnan; A Kruger; S K Kummerfeld; I V Kurochkin; L F Lareau; D Lazarevic; L Lipovich; J Liu; S Liuni; S McWilliam; M Madan Babu; M Madera; L Marchionni; H Matsuda; S Matsuzawa; H Miki; F Mignone; S Miyake; K Morris; S Mottagui-Tabar; N Mulder; N Nakano; H Nakauchi; P Ng; R Nilsson; S Nishiguchi; S Nishikawa; F Nori; O Ohara; Y Okazaki; V Orlando; K C Pang; W J Pavan; G Pavesi; G Pesole; N Petrovsky; S Piazza; J Reed; J F Reid; B Z Ring; M Ringwald; B Rost; Y Ruan; S L Salzberg; A Sandelin; C Schneider; C Schönbach; K Sekiguchi; C A M Semple; S Seno; L Sessa; Y Sheng; Y Shibata; H Shimada; K Shimada; D Silva; B Sinclair; S Sperling; E Stupka; K Sugiura; R Sultana; Y Takenaka; K Taki; K Tammoja; S L Tan; S Tang; M S Taylor; J Tegner; S A Teichmann; H R Ueda; E van Nimwegen; R Verardo; C L Wei; K Yagi; H Yamanishi; E Zabarovsky; S Zhu; A Zimmer; W Hide; C Bult; S M Grimmond; R D Teasdale; E T Liu; V Brusic; J Quackenbush; C Wahlestedt; J S Mattick; D A Hume; C Kai; D Sasaki; Y Tomaru; S Fukuda; M Kanamori-Katayama; M Suzuki; J Aoki; T Arakawa; J Iida; K Imamura; M Itoh; T Kato; H Kawaji; N Kawagashira; T Kawashima; M Kojima; S Kondo; H Konno; K Nakano; N Ninomiya; T Nishio; M Okada; C Plessy; K Shibata; T Shiraki; S Suzuki; M Tagami; K Waki; A Watahiki; Y Okamura-Oho; H Suzuki; J Kawai; Y Hayashizaki
Journal:  Science       Date:  2005-09-02       Impact factor: 47.728

9.  The reality of pervasive transcription.

Authors:  Michael B Clark; Paulo P Amaral; Felix J Schlesinger; Marcel E Dinger; Ryan J Taft; John L Rinn; Chris P Ponting; Peter F Stadler; Kevin V Morris; Antonin Morillon; Joel S Rozowsky; Mark B Gerstein; Claes Wahlestedt; Yoshihide Hayashizaki; Piero Carninci; Thomas R Gingeras; John S Mattick
Journal:  PLoS Biol       Date:  2011-07-12       Impact factor: 8.029

10.  An update on LNCipedia: a database for annotated human lncRNA sequences.

Authors:  Pieter-Jan Volders; Kenneth Verheggen; Gerben Menschaert; Klaas Vandepoele; Lennart Martens; Jo Vandesompele; Pieter Mestdagh
Journal:  Nucleic Acids Res       Date:  2014-11-05       Impact factor: 16.971

View more
  23 in total

1.  Nuclear export mechanisms of circular RNAs: size does matter.

Authors:  Asfar S Azmi
Journal:  Noncoding RNA Investig       Date:  2018-09-14

2.  Physiological roles of non-coding RNAs.

Authors:  Shizuka Uchida; Josephine C Adams
Journal:  Am J Physiol Cell Physiol       Date:  2019-05-15       Impact factor: 4.249

Review 3.  Invited Review: Long non-coding RNAs: important regulators in the development, function and disorders of the central nervous system.

Authors:  R Cuevas-Diaz Duran; H Wei; D H Kim; J Q Wu
Journal:  Neuropathol Appl Neurobiol       Date:  2019-03-04       Impact factor: 8.090

4.  SILCS-RNA: Toward a Structure-Based Drug Design Approach for Targeting RNAs with Small Molecules.

Authors:  Abhishek A Kognole; Anthony Hazel; Alexander D MacKerell
Journal:  J Chem Theory Comput       Date:  2022-08-01       Impact factor: 6.578

5.  LINC00152 expression in normal and Chronic Lymphocytic Leukemia B cells.

Authors:  Serena Matis; Martina Rossi; Lorenzo Brondolo; Martina Cardillo; Daniele Reverberi; Rosanna Massara; Monica Colombo; Adalberto Ibatici; Emanuele Angelucci; Tiziana Vaisitti; Silvia Bruno; Sonia Fabris; Antonino Neri; Massimo Gentile; Fortunato Morabito; Giovanna Cutrona; Paola Briata; Roberto Gherzi; Franco Fais
Journal:  Hematol Oncol       Date:  2021-10-28       Impact factor: 4.850

Review 6.  Long Noncoding RNA HCP5, a Hybrid HLA Class I Endogenous Retroviral Gene: Structure, Expression, and Disease Associations.

Authors:  Jerzy K Kulski
Journal:  Cells       Date:  2019-05-20       Impact factor: 6.600

Review 7.  Non-Coding RNAs in Kidney Diseases: The Long and Short of Them.

Authors:  Juan Antonio Moreno; Eya Hamza; Melania Guerrero-Hue; Sandra Rayego-Mateos; Cristina García-Caballero; Mercedes Vallejo-Mudarra; Laurent Metzinger; Valérie Metzinger-Le Meuth
Journal:  Int J Mol Sci       Date:  2021-06-04       Impact factor: 5.923

8.  Long Noncoding RNAs CARMN, LUCAT1, SMILR, and MALAT1 in Thoracic Aortic Aneurysm: Validation of Biomarkers in Clinical Samples.

Authors:  Vaiva Patamsytė; Giedrius Žukovas; Dovydas Gečys; Diana Žaliaduonytė; Povilas Jakuška; Rimantas Benetis; Vaiva Lesauskaitė
Journal:  Dis Markers       Date:  2020-06-17       Impact factor: 3.434

9.  LNCipedia 5: towards a reference set of human long non-coding RNAs.

Authors:  Pieter-Jan Volders; Jasper Anckaert; Kenneth Verheggen; Justine Nuytens; Lennart Martens; Pieter Mestdagh; Jo Vandesompele
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

Review 10.  Deciphering miRNAs' Action through miRNA Editing.

Authors:  Marta Correia de Sousa; Monika Gjorgjieva; Dobrochna Dolicka; Cyril Sobolewski; Michelangelo Foti
Journal:  Int J Mol Sci       Date:  2019-12-11       Impact factor: 5.923

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.