
Platforms for Investigating LncRNA Functions.

John Lalith Charles Richard^1,2,3, Pieter Johan Adam Eichhorn^1,2,4.

Abstract

Prior to the sequencing of the human genome, it was presumed that most of the DNA coded for proteins. However, with the advent of next-generation sequencing, it has now been recognized that most complex eukaryotic genomes are in fact transcribed into noncoding RNAs (ncRNAs), including a family of transcripts referred to as long noncoding RNAs (lncRNAs). LncRNAs have been implicated in many biological processes ranging from housekeeping functions such as transcription to more specialized functions such as dosage compensation or genomic imprinting, among others. Interestingly, lncRNAs are not limited to a defined set of functions but can regulate varied activities such as messenger RNA degradation, translation, and protein kinetics or function as RNA decoys or scaffolds. Although still in its infancy, research into the biology of lncRNAs has demonstrated the importance of lncRNAs in development and disease. However, the specific mechanisms through which these lncRNAs act remain poorly defined. Focused research into a small number of these lncRNAs has provided important clues into the heterogeneous nature of this family of ncRNAs. Due to the complex diversity of lncRNA function, in this review, we provide an update on the platforms available for investigators to aid in the identification of lncRNA function.


Keywords: lncRNA interactome; long noncoding RNA; tools for lncRNA functional annotation


Substances：
RNA, Long Noncoding

Year: 2018 PMID： 29945466 PMCID： PMC6249642 DOI： 10.1177/2472630318780639

Source DB: PubMed Journal: SLAS Technol ISSN： 2472-6303 Impact factor: 3.047

Introduction

Commencing with the initial discovery that coding exons of genes account for only 1.5% to 2% of the entire genome, a veritable revolution has been undertaken to understand the functional relevance of the nonprotein coding part of the genome.[1,2] Undoubtedly, these studies have been greatly simplified by the advent of novel DNA sequencing technologies, which permit accurate whole-genome transcriptome analysis.[3] This sustained effort has resulted in the identification and cataloging of thousands of noncoding RNAs (ncRNAs).[4] In general, ncRNAs can be divided into two broad classes based on their functions, as either housekeeping ncRNAs or regulatory ncRNAs. Housekeeping ncRNAs primarily regulate generic cellular functions such as messenger RNA (mRNA) translation (rRNA/tRNA), splicing (snRNAs), or rRNA modification (snoRNA). Regulatory ncRNAs can be further classified based on transcript length, with short noncoding transcripts comprising fewer than 200 nucleotides and long noncoding RNAs comprising transcripts greater than 200 nucleotides. A number of short ncRNA subclasses exist, including microRNA (miRNA), small interfering RNA (siRNA), piwi-interacting RNA (piRNA), transcription initiation RNA, and small Cajal body-specific RNA (scaRNA).[5-7] Long noncoding RNAs (lncRNAs) represent the largest class of ncRNAs. However, in contrast to short ncRNAs, which are mostly attributed to gene regulation, the mechanistic function of lncRNAs is highly diverse, adding to the complexity of this family of genes.[8] In addition, the lack of insight into the function of lncRNAs may also be attributed to the low expression levels and tissue specificity of lncRNAs, resulting in an incomplete understanding of lncRNA regulation.
Nevertheless, through genomic initiatives like ENCODE, FANTOM, GTEx, and GENCODE, over 60,000 lncRNAs have been predicted, a number of which have been demonstrated to be altered in certain diseases, underscoring the importance of this set of transcripts.[9-12] However, to date, only a small percentage of these lncRNAs has been described in the literature, with an even smaller number being attributed to a specific mechanistic function. Furthermore, like proteins, many lncRNAs can employ more than one mechanism of action. Increasing evidence points toward an important role played by lncRNAs in regulating multiple processes of gene expression, with instances of their transcription leading to gene silencing or gene activation. Studies have also found that regulation of lncRNAs can affect mRNA transcription, splicing, translation, export, import, and stability.[13] LncRNAs have been demonstrated to function as transcription factor recruiters through their interaction with transcriptional start sites, act as transcriptional coactivators, or function as scaffolds for proteins in general. Conversely, lncRNAs may function as molecular decoys, trapping transcription factors and thus limiting the ability of transcription factors to associate with DNA binding sites.[14,15] Furthermore, lncRNAs have been shown to be involved in chromatin looping, nuclear body formation and function, or transcriptional read-through.[15] Apart from transcriptional regulation, lncRNAs also play a role in mRNA processing, maturation, and stability through the regulation of mRNA splicing, inhibiting translation, operating as miRNA sponges, or competing for miRNA binding sites on mRNA.[15] Some lncRNAs can also code for small peptides.[16,17] This fact does not necessarily disqualify a subset of lncRNAs as nonprotein coding transcripts but rather indicates that these lncRNAs can act as bifunctional transcripts, serving as either an lncRNA or a protein template.
Last, lncRNAs can regulate protein and transcript trafficking and shuttling. The overall outcome of this is the association of lncRNAs with a myriad of biological processes, including imprinting, cell cycle regulation, pluripotency, dosage compensation, retro-transposon silencing, and telomere lengthening. In this review, we outline the platforms required to help define and catalogue newly discovered lncRNAs and discuss relevant techniques required to ascertain mechanistic functions of lncRNAs.

Platforms for Annotating lncRNA Functions

Owing mainly to recent developments in sequencing technologies, lncRNAs have overturned the long-held notion of junk DNA and have been shown to be essential for a number of physiological processes.[18-20] In addition, lncRNAs interface with DNA, RNA, and proteins to exert their functions, raising their complexity to a higher level. The conventional methods used to study mRNA functions are inefficient for lncRNA studies. LncRNAs have unique features: they show developmental and tissue-specific peaks in expression and are present at very low copy numbers. These properties make the detection of lncRNAs extremely difficult and leave some lncRNAs more amenable to investigation than others. Furthermore, certain lncRNAs provide for allele-specific epigenetic modification of gene expression in cis, which is possible through the limited spatial exposure at the site of transcription. While their tissue-specific occurrence can aid the development of biomarkers, their occurrence inside the nucleus can, in certain cases, pose a problem when employing RNA interference for loss-of-function studies. Therefore, methods used to investigate lncRNAs should be highly efficient, with enhanced targeting, higher resolution, and increased maneuverability at the molecular level. We address here some of the strategies used to study the functions of lncRNAs.

RNA Sequencing

RNA sequencing (RNA-Seq) has seen rapid growth over the past decades, allowing for the generation of quality in-depth sequencing and providing extensive information in a short time. Presently, a single run on any of the mainstream RNA-Seq platforms can yield up to a billion reads, with individual read lengths ranging from about 10 to 300 bp up to longer read lengths of 10 to 20 kb. RNA-Seq provides a quantitative as well as descriptive picture of the complex cellular content. It is important to note here that the data obtained are of high resolution compared to microarray technologies.[21] In addition to yielding high-quality RNA transcriptome profiles, RNA sequencing also provides additional information on 3′ end processing, alternative splicing regions, and RNA editing sites. RNA-Seq technologies have likewise helped to annotate lncRNAs, revealing global properties and even specific subclasses of lncRNAs. RNA sequencing has been advancing at a very rapid pace, with newer sequencing technologies serving different applications. RNA sequencing and next-generation sequencing technologies are a whole realm in themselves and have not been included in this review as a result of space constraints (refer to reviews on next-generation sequencing[22-25]). A short overview of the databases and tools used to study long noncoding RNA is presented in Table 1.
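To make the quantitative side of RNA-Seq concrete, the following is a minimal sketch of one widely used normalization, transcripts per million (TPM). It is offered as an illustration only and is not a method described in this review; the transcript names, counts, and lengths are hypothetical, and real pipelines would obtain them from an aligner or quantifier.

```python
def tpm(counts, lengths_bp):
    """Convert raw read counts to transcripts per million (TPM).

    counts and lengths_bp are dicts keyed by transcript name
    (hypothetical inputs used purely for illustration).
    """
    # Reads per kilobase: normalize each count by transcript length.
    rpk = {t: counts[t] / (lengths_bp[t] / 1000.0) for t in counts}
    total = sum(rpk.values())
    if total == 0:
        raise ValueError("no reads assigned to any transcript")
    # Rescale so that all values sum to one million.
    return {t: value / total * 1e6 for t, value in rpk.items()}


# Hypothetical example: a lowly expressed lncRNA next to two mRNAs.
counts = {"lncRNA_A": 50, "mRNA_B": 5000, "mRNA_C": 950}
lengths = {"lncRNA_A": 2000, "mRNA_B": 2500, "mRNA_C": 1000}
values = tpm(counts, lengths)
for name, value in sorted(values.items()):
    print(f"{name}\t{value:.1f}")
```

Because lncRNAs are often present at very low copy numbers, their TPM values tend to sit near the bottom of such a table, which is one reason sequencing depth matters for lncRNA annotation.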

Table 1.

Databases and Tools Used to Study Long Noncoding RNA.

Database/Tools	Application	Reference
LncRNAdb v2.0 (lncRNA Database)	Reference database for functional long noncoding RNAs and provides comprehensive annotations of eukaryotic lncRNAs.	[106,107]
FANTOM (Functional Annotation of the mammalian Genome)	Database as a resource for experimentally supported lncRNA-disease association data. Database also has platform with integrated tools for predicting novel lncRNA-disease associations.	[108]
ENCODE (Encyclopaedia of DNA Elements)	The ENCODE database is a comprehensive collection of functional elements in the human genome, including elements that act at the protein and RNA levels and regulatory elements that control cells and circumstances in which a gene is active.	[109]
The GENCODE Project	The repository contains comprehensive gene annotations on reference chromosomes, scaffolds, assembly patches, and alternate loci. There is also comprehensive gene annotation of lncRNA genes.	[10]
lncRNAMap	A repository to investigate the putative regulatory functions of human lncRNAs and expression profiles for lncRNAs and their homologous protein coding genes. In addition, information regarding miRNA regulators of lncRNA is also available.	[110]
LNCipedia 3.0	A repository for annotated human lncRNA sequences.	[111,112]
The LncRNA and Disease Database	A repository for curated and experimentally supported lncRNA-disease association data. The database also hosts integrated tools for predicting novel disease associations. Interactions at various levels such as protein, RNA, miRNA, and DNA are also available.	[113,114]
lnCeDB	A database of human lncRNAs that can act as ceRNAs. Database also provides information on lncRNA-mRNA pairs having common targeting miRNAs. The expression of lncRNA can be compared across 22 human tissues to estimate the chances of the pair for actually being ceRNAs.	[115]
starBASE v2.0	Database designed for decoding pan-cancer and interaction networks of lncRNAs, miRNAs, ceRNAs, RNA binding proteins, and mRNAs from large-scale CLIP-Seq (HITS-CLIP, PAR-CLIP, iCLIP, CLASH) data and tumor samples comprising 14 cancer types spanning more than 6000 samples. Starbase also provides miR and ceRNA function web tools to predict the function of ncRNAs and protein coding genes from the miRNA-mediated (ceRNA) regulatory networks.	[116]
DIANA TOOLS LncBase v.2	Tool for determining experimentally verified and computationally predicted miRNA targets on long noncoding RNAs. The experimental module engages miRNA and lncRNA interactions pertaining to the experimental validation and outcomes. The prediction module contains information for more than 10 million interactions and provides information of interaction sites, graphical representation of their binding, and the predicted score.	[117]
GeneCards	A human gene database that provides comprehensive information on all annotated and predicted human genes, including lncRNAs. An overall integrated data comprising linked genomic, transcriptomic proteomic, genetic, clinical, and functional information.	[118]
LincSNP2.0	Database that stores and annotates disease-associated single-nucleotide polymorphisms in human long noncoding RNA and their transcription factor binding sites.	[119]
LncRNA2Target	A repository for differentially expressed genes after lncRNA knockdown or overexpression.	[120]
ChIP Base v2.0	Open database for studying transcription factor binding sites and motifs and decoding the transcriptional regulatory networks of lncRNAs, miRNAs, other noncoding RNAs, and protein coding genes.	[121]
NRED	Database for lncRNA expression from microarray and in situ hybridization data. In addition, provides information on the evolutionary conservation, secondary structure, genomic context links, and antisense relationships.	[122]
NONCODE	An integrated database dedicated to noncoding RNA and in particular long noncoding RNA with more accurate annotations. The recent update provides additional features such as conservation annotation, lncRNA-disease relationships, and an interface to choose high-quality data sets through predicted scores, literature support, and long-read sequencing method support.	[123]
HGNC (HUGO Gene Nomenclature Committee)	Database aimed at approving unique names and symbols for human loci, including protein coding genes, noncoding genes, and pseudogenes, to allow unambiguous scientific communication.	[124]
PhyloCSF (Phylogenetic Codon Substitution Frequency)	Tool used to distinguish between protein coding and noncoding regions based on a formal statistical comparison of phylogenetic codon models.	[125]

ceRNA, competing endogenous RNA; CLASH, cross-linking, ligation, and sequencing hybrids; CLIP-Seq, crosslinked immunoprecipitation sequencing; HITS-CLIP, high-throughput sequencing of RNA isolated by crosslinking immunoprecipitation; iCLIP, individual nucleotide resolution crosslinked immunoprecipitation; lncRNA, long noncoding RNA; miRNA, microRNA; mRNA, messenger RNA; ncRNA, noncoding RNA; PAR-CLIP, photoactivable ribonucleoside-enhanced crosslinked immunoprecipitation.


RNA Interference (RNAi)

Conventional gene modifications involving RNA interference (RNAi) techniques such as small interfering RNA (siRNA), which suppress RNA expression by cleaving the RNA molecules, have been used extensively to study the functions of lncRNAs.[26] These strategies are transient and can be used to study rapid effects of lncRNA depletion. LncRNAs are found to localize in various cellular compartments such as the nucleus, cytoplasm, or both, and targeting lncRNAs in the nucleus using these strategies has been less effective and also highly debated.[27,28] Recently, knockdown of lncRNAs with higher efficiencies and reduced off-target effects has been achieved through the preparation of endoribonuclease-prepared siRNA (esiRNA) transcripts.[29-31] Alternative approaches include chemically synthesized siRNAs, which have been demonstrated to exhibit increased off-target effects despite comparable suppression levels.[29] Short hairpin RNA (shRNA) is another class of molecules used in RNAi. shRNAs overcome the limitation of siRNAs in their transfection ability. The introduction of shRNA through viral vectors allows for its stable integration and long-term knockdown of the target gene. Another advantage of using shRNAs is that they can be made inducible and used for functional studies demanding tight regulation of the gene. Evidence of successful RNAi knockdown of lncRNA has been documented in many reports such as siRNA-mediated knockdown of second chromosome locus associated with prostate 1, UCA1, and hnRNP1.[32-34]

CRISPR/Cas9

The introduction of the CRISPR/Cas9 (clustered regularly interspaced short palindromic repeats) nucleases has drastically propelled scientific research. CRISPR tools can be employed to ablate lncRNA expression or function by direct mutagenesis of the DNA sequence. However, it must be noted that unlike protein coding mRNA, changes made to the sequences coding ncRNA tend to be ineffective as mutations in the 5′ end of the ncRNA may not affect lncRNA activity, while in protein coding genes, this may result in loss of expression. Furthermore, as in many cases, lncRNAs overlap protein coding genes, and therefore the use of CRISPR may affect the function of both the lncRNA and the protein coding gene. As such, stringent designing needs to be considered in particular to alter the lncRNA structure or the specific lncRNA binding sites. Nevertheless, CRISPR technology has been used to efficiently delete large fragments of lncRNA as well.[35] CRISPR can also be used as a targeting module to recruit activators and inhibitors to affect the transcription of lncRNAs. Furthermore, it also aids in the modulation of chromatin structure at specific regions neighboring lncRNAs, ultimately perturbing its expression. CRISPR can also be used to target the lncRNA under investigation to particular gene loci or cellular compartments to study their spatial influence.[36]
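As a rough illustration of how candidate CRISPR target sites might be enumerated around an lncRNA locus, the sketch below scans a DNA sequence for 20-nt protospacers followed by an NGG protospacer-adjacent motif. The NGG requirement is that of the commonly used Streptococcus pyogenes Cas9 and is an assumption here, since the text does not name a specific nuclease; the function name and example sequence are hypothetical. Only the forward strand is scanned, and no off-target or overlap-with-coding-gene checks are attempted, although the text stresses that such checks matter.

```python
import re


def find_protospacers(seq, guide_len=20):
    """Return (start, protospacer, PAM) tuples for NGG PAM sites.

    Scans the forward strand only; the NGG PAM is an assumption
    (SpCas9) and is not specified in the surrounding text.
    """
    seq = seq.upper()
    # A lookahead is used so that overlapping sites are all reported.
    pattern = re.compile(r"(?=([ACGT]{%d})([ACGT]GG))" % guide_len)
    return [(m.start(), m.group(1), m.group(2)) for m in pattern.finditer(seq)]


# Hypothetical locus fragment for illustration.
locus = "A" * 20 + "TGG" + "CCC"
for start, protospacer, pam in find_protospacers(locus):
    print(start, protospacer, pam)
```

For a deletion experiment, a pair of such sites flanking the region of interest would be chosen, ideally away from any overlapping protein coding gene.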

CRISPR Interference (CRISPRi) and CRISPR Activation (CRISPRa)

The explosion of CRISPR technology has seen the transition of CRISPR from genome editing to genome regulation. CRISPR technologies modulate the expression of genes from their endogenous promoter and have been extensively demonstrated to activate or silence lncRNAs.[37-39] A modified CRISPR system, referred to as CRISPR interference/activation (CRISPR i/a), can effectively downregulate or upregulate gene expression by blocking or activating transcription, respectively.[40-42] CRISPRi comprises a catalytically dead Cas9 protein (dCas9) and a guide RNA (gRNA) directed at the gene to be knocked down.[40] It is important to note here that gRNAs targeting the nontemplate DNA strand in the promoter or −35 regions exhibited greater downregulation than guides targeting the template strand 100 bp upstream of the promoter. Further modifications of CRISPRi include fusions of the dCas9 protein with KRAB (Krüppel-associated box), which promotes heterochromatin formation and is also used to bring about epigenetic silencing.[40] This strategy leads to the error-prone nonhomologous end-joining pathway in incorporating mutations in the form of frameshift INDELs (insertions/deletions) and disrupting the gene function. CRISPRi technology has been used successfully to repress the expression of green fluorescent protein (GFP) in HEK293 cells and even endogenous genes like CXCR4 and CD71.[41] Similarly, CRISPRa covers gain-of-function strategies by the overexpression of open reading frames (ORFs) and functions by deploying transcriptional activators through single guide RNAs (sgRNAs) and dCas9 to the transcriptional start sites (TSSs).
This involves the fusion of dCas9 to transcriptional activator domains such as VP64, p65, and RTA (VPR-tripartite fusion of VP64 and the activation domains of the p65 subunit of NFκB and Epstein-Barr virus R transactivator, Rta).[43-45] A second form of transcriptional activator module is a protein tagging system for signal amplification and fluorescence imaging, consisting of an array of repeating peptides and an antibody fusion protein, and is referred to as the SunTag.[42,46] This system comprises VP64 fused to superfolder GFP (sfGFP) and an antibody single-chain variable fragment (scFv) that targets GCN4 epitope and is recruited in a tandem array of 10 copies of GCN4 epitope. A third variant of the activation systems includes an RNA scaffold and is referred to as the synergistic activation mediator (SAM) approach. The components of this system include p65 and HSF1 transcriptional activation domains fused to the MS2 coat protein. Dimers of these MS2 coat proteins are recruited to MS2 RNA hairpins.[37] A recent genome-wide approach using a dual protein coding and noncoding integrated CRISPRa screen (DICaS) to identify functional coding and long noncoding RNA using the CaLR library (CRISPR activation of long noncoding RNA) revealed putative resistance genes toward cytarabine (Ara-C), a chemotherapeutic used in the treatment of patients with acute myeloid leukemia (AML).[47]

Antisense Oligonucleotides (ASO)

Antisense oligonucleotides (ASOs) are highly effective in depleting lncRNAs present in the nucleus.[48,49] ASOs comprise modified or unmodified single-stranded deoxyribonucleotides that hybridize to their complementary transcript targets, after which RNase H degrades the RNA component of the RNA-DNA duplex.[50] LncRNAs have a wide range of cellular localization, with a predominant portion of them residing in the nucleus (such as MALAT1 and NEAT1), some in the cytoplasm (such as DANCR), and some in both (such as HOTAIR and TUG1). ASOs rank above siRNAs and shRNAs in their ability to access lncRNAs inside the nucleus.[27] However, ASOs also suffer from lower stability inside cells due to their single-stranded nature and the action of nucleases. To overcome this, an advanced version of the ASOs, consisting of 15 to 20 nucleotides with a phosphorothioate modification in the backbone, can limit degradation by cellular nucleases. A further modification of the ASOs at the 2′ position with an O-methoxyethyl group, which confers drug-like properties, showed improved binding affinity and sustained pharmacokinetics.[51,52] While these second-generation ASOs confer resistance to nucleases, their modifications lead to inefficient binding of targets, and hence using higher concentrations leads to off-target effects.[27] ASOs have been successfully used in loss-of-function studies.
MALAT1 is one such lncRNA that was systematically knocked down using two different gapmers targeting two different parts of MALAT1, suggesting a potential therapy for inhibiting breast cancer progression.[53] A few ASOs, such as nusinersen and mipomersen, also have been clinically approved.[54,55] While the former is used to correct a splicing switch in SMN2 (survival of motor neuron 2) in patients with spinal muscular atrophy, the latter is used to knock down APOB100 mRNA to treat patients with familial hypercholesterolemia.[54,55] Studying the functionality of lncRNAs is complicated owing to their complex genomic architecture and the added complexity of structure and shape they might harbor. Despite available techniques to uncover the functionality of lncRNAs, major limitations and challenges still remain such as the specificity of Cas9-sgRNAs and their ability to affect the neighboring or overlapping genes in the targeted loci.[34] Strikingly, discrepancies have been observed between RNAi-based techniques and CRISPR-based techniques, underlining the necessity to pay attention to genes sharing promoters or overlapping transcripts to obtain biologically significant and relevant results for the targeted genes.[42,56] Furthermore, the functionality of lncRNAs can be assigned with higher confidence when RNAi techniques such as siRNAs, siPOOLs (short interfering RNA pools, comprising a pool of 30 siRNAs), shRNAs, ASOs, and GapmeRs are complemented with CRISPR-based experiments.
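The hybridization principle behind ASOs described above, in which a single-stranded DNA oligonucleotide of 15 to 20 nucleotides is complementary to its RNA target, can be sketched as a simple tiling exercise. The helper names, the step size, and the example sequence below are hypothetical, and the sketch ignores chemical modifications, target accessibility, and off-target screening, all of which matter in practice.

```python
# Base-pairing table for building a DNA oligo against an RNA (or DNA) target.
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "U": "A", "T": "A"}


def antisense_dna(rna_segment):
    """Return the antisense DNA oligo (written 5'->3') for an RNA segment."""
    return "".join(COMPLEMENT[base] for base in reversed(rna_segment.upper()))


def tile_asos(rna, length=20, step=5):
    """Tile candidate ASOs along a transcript.

    length is restricted to the 15-20 nt range mentioned in the text;
    step is an arbitrary illustrative choice.
    """
    if not 15 <= length <= 20:
        raise ValueError("ASO length expected to be 15-20 nucleotides")
    return [
        (start, antisense_dna(rna[start:start + length]))
        for start in range(0, len(rna) - length + 1, step)
    ]


# Hypothetical 30-nt target fragment for illustration.
target = "AUGGCUAGCUAGGAUCCGAUUACGGAUCCA"
for start, oligo in tile_asos(target):
    print(start, oligo)
```

Targeting two distinct regions of the same transcript with independent oligos, as in the MALAT1 gapmer example, helps distinguish on-target from off-target effects.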

Investigating lncRNA-Protein Interaction

RNA binding proteins (RBPs), ribonucleoproteins (RNPs), and a number of RNA species, including lncRNAs, are involved in this complex regulatory network.[57,58] The spatiotemporal arrangement of the mRNA transcripts and the structural dynamicity of the RNPs are precisely correlated inside the cell.[59] To identify the diverse regulatory interactions between RNA and proteins or other genetic elements, a combination of genetic, biochemical, and computation techniques can be applied to identify the complex RNA interactome.

RNA Immunoprecipitation (RIP)

The association of proteins with specific RNA species in vivo can be studied with the help of approaches such as RNA immunoprecipitation (RIP). Through an extension of protein-protein immunoprecipitation and techniques using an antibody of choice, proteins complexed with RNA can be pulled down. The association of the RNA with proteins or other associated RNA species can be quantified with real-time PCR or extensively with RNA sequencing.[60] However, depending on the mode of protein interaction with RNA, RNA pulldown can be achieved through native RNA precipitation methods or by RNA crosslinking methods, with each technique having its own advantages. While the native RNA immunoprecipitation is indicative of a strong and direct RNA-protein binding, the crosslinking approach may be used to investigate indirect or weak binding of proteins to RNA. Importantly, one must not neglect that a binding event between two components could take place even after the lysis of the cells. The native RIP helps pull down kinetically stable interactions, but it is not conclusive if the interaction is still direct or indirect through a complex binding with the RNA.[61] Furthermore, the signature binding motifs of the proteins cannot be determined owing to the long stretches of RNA in the antibody targeting the protein pulldown.[62] In addition, native RIPs require multiple biological replicates because of their reduced reproducibility and the myriad of complex reactions taking place.[63]
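Where RIP is read out by real-time PCR, enrichment is commonly summarized as percent of input or as fold enrichment over a control immunoprecipitation. The sketch below shows both calculations under the assumption of 100% amplification efficiency (a doubling per cycle); the function names, the IgG control, and the Ct values are illustrative assumptions and are not taken from the cited studies.

```python
import math


def percent_input(ct_ip, ct_input, input_fraction=0.1):
    """Percent of input recovered in the IP, assuming doubling per cycle.

    input_fraction is the share of lysate saved as input (hypothetical
    default of 10%); the input Ct is adjusted to represent 100% of lysate.
    """
    adjusted_input = ct_input - math.log2(1.0 / input_fraction)
    return 100.0 * 2.0 ** (adjusted_input - ct_ip)


def fold_enrichment(ct_ip, ct_control):
    """Fold enrichment of the specific IP over a control IP (e.g., IgG)."""
    return 2.0 ** (ct_control - ct_ip)


# Hypothetical Ct values for one lncRNA amplicon.
print(percent_input(ct_ip=26.0, ct_input=24.0, input_fraction=0.25))
print(fold_enrichment(ct_ip=24.0, ct_control=27.0))
```

Reporting both values across several biological replicates is one way to address the reduced reproducibility of native RIP noted above.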

Crosslinked Immunoprecipitation (CLIP)

Crosslinked immunoprecipitation (CLIP), on the other hand, engages both the RNA and protein via crosslinking. Crosslinking is mainly achieved with the help of ultraviolet light (UV), forming strong and specific crosslinks, followed by RNase treatment to shorten the RNA fragments. In addition to the increased specificity, the covalent crosslinks permit stringent washes that reduce background signals.[64] Even though the UV-crosslinking in CLIP increases the specificity of protein and RNA interaction, false positives are frequent, and determining the exact binding consensus remains difficult.[65] Modifications of the CLIP protocol, such as iCLIP (individual nucleotide resolution CLIP) and PAR-CLIP (photoactivable ribonucleoside-enhanced CLIP), help evade such drawbacks.[66-70] iCLIP helps identify RNA binding motifs by identifying the exact crosslinking site and allows mapping of the RNA and protein contacts at nucleotide resolution. This is mainly achieved by the introduction of an adapter at the 5′ end by the primer used for reverse transcription, wherein the complementary DNA (cDNA) is circularized and subsequently linearized in the following steps, capturing both truncated cDNAs and read-through cDNAs.[69] PAR-CLIP, on the other hand, uses 4-SU (4-thiouridine) or 6-SG (6-thioguanosine) added to the culture media, wherein these moieties are incorporated into the RNA. The advantage of this approach is the elimination of nonspecific targets and improved identification of exact binding sites at single-nucleotide resolution.[71] Nevertheless, the photoreactive ribonucleoside analogues 4-SU and 6-SG might prove toxic to the cells, and their concentrations need to be optimized for the cell line used. Recently, a new technique, digestion-optimized RIP sequencing (DO-RIP Seq), has offered the added advantage of quantifying the binding at both the whole transcript level and the binding site level.
DO-RIP Seq employs micrococcal nuclease and helps generate global protein-RNA interaction maps with added information on binding strength in cells or tissues.[72] Many groups have used RNA immunoprecipitation to understand the relationship between lncRNAs and their protein binding partners.[73-75] Zhao and colleagues[75] showed that the polycomb repressive complex binds to the RepA of Xist along with other lncRNAs involved in X chromosome inactivation, including Tsix (a lncRNA lying antisense of Xist). Another lncRNA, Fendrr, was also identified in a similar fashion to be associated with the PRC2 complex and WDR5 protein.[74] With advances in technology and with native RIPs coupled to RNA sequencing, it has become possible to uncover many PRC2-interacting RNAs, including some already reported RNAs and some unannotated RNAs in embryonic stem cells.[75] CLIP also has been used extensively to study lncRNA and protein associations. One such example is the association of the lncRNA Air with the methyltransferase G9a.[76] In combination with high-throughput sequencing, targets of a number of RNA binding proteins like Nova, TDP-43, Ago2, and Piwi proteins were identified.[77-79] iCLIP coupled with deep sequencing revealed the global regulatory roles of the hnRNP L protein.[80] Furthermore, the PAR-CLIP approach has been successfully applied to RNA binding proteins like HuR, Ataxin2, AUF1, and FMRP.[81-84]
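One way PAR-CLIP data reach single-nucleotide resolution is that crosslinked 4-SU residues are typically read as T-to-C transitions in the sequenced cDNA; this detail is not stated in the text above and is included here as background. The sketch below counts such conversions in a toy set of ungapped alignments. The input format, the threshold, and all names are hypothetical simplifications; real analyses work from aligned read files and model sequencing error and SNPs.

```python
from collections import Counter


def t_to_c_sites(reference, aligned_reads, min_conversions=2):
    """Return reference positions with at least min_conversions T-to-C reads.

    aligned_reads is a list of (start, read_sequence) tuples representing
    ungapped forward-strand alignments (a toy format for illustration).
    """
    conversions = Counter()
    for start, read in aligned_reads:
        for offset, base in enumerate(read):
            pos = start + offset
            # A reference T read as C marks a candidate crosslink site.
            if pos < len(reference) and reference[pos] == "T" and base == "C":
                conversions[pos] += 1
    return sorted(pos for pos, n in conversions.items() if n >= min_conversions)


# Hypothetical reference and reads.
reference = "ACGTACGTAC"
reads = [(0, "ACGCAC"), (2, "GCACGT"), (4, "ACGCAC")]
print(t_to_c_sites(reference, reads))
```

Requiring more than one supporting read is a crude guard against sequencing errors; position 7 in this toy example has a single conversion and is therefore not reported.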

RAP, ChIRP, and CHART

Other robust methods of studying the lncRNA and protein interaction are through approaches that target the RNA directly. These include the RNA pulldown approaches like RNA antisense purification (RAP), chromatin isolation by RNA purification (ChIRP), and capture hybridization analysis of RNA targets (CHART). These methods can provide us information depending on the experiment used, and investigators can look at the DNA, RNA, or the proteins it is interacting with coupled with an added extension of a Western blot, quantitative real-time PCRs, mass spectrometry, and high-throughput sequencing. These RNA pulldown approaches use probes targeting the lncRNA under investigation, coupled with an affinity tag, such as biotin, subsequently pulled down using streptavidin-coated agarose or magnetic beads. One such lncRNA that was discovered using the RNA pulldown method was HOTAIR, which has been attributed for its role in cancers and importantly shown to interact with the PRC2 complex.[85,86] ChIRP aids in understanding the interactions between proteins and chromatin by using biotinylated oligo probes as a bait, designed antisense to the lncRNA under investigation. It should be noted here that the cell is crosslinked, resulting in a snapshot of the interactions at that precise moment. Following up with the same experiment performed over a course of time on the cells or drug effects on the cells would help us to get a concise understanding of how the proteins interact on the given stretch of nucleic acids. Pulling down the whole chromatin using this approach provides immense information on the state of chromatin and the regions it interacts with when subsequently followed by quantitative PCRs, sequencing or mass spectrometry, or even simple Western blots when suspecting particular proteins involved in the process. 
A slight yet significant variation of ChIRP is domain-specific chromatin isolation by RNA purification (dChIRP), which was developed with the idea of investigating specific domains of the target RNA. This approach uses biotinylated probe pools targeting a specific domain, which in turn improves the signal-to-noise ratio and enhances localization and specificity, subsequently helping in the characterization of lncRNA architecture and function.[15,87,88] The same authors showed the direct binding of the MSL protein to the 3D structure of the roX1 lncRNA.[87] Importantly, ChIRP coupled with mass spectrometry has been instrumental in the discovery of the proteome associated with the lncRNA.[89] This approach helped pave the way to understand the spread of the lncRNA Xist and the silencing flow-through mechanism by the identification of 81 endogenous proteins. Proteins such as hnRNPK were shown to participate in chromatin modification and Xist-mediated gene silencing but not to play a role in the localization of Xist or its biogenesis.[89] Another approach that enables the localization of the lncRNA in the chromatin and its association with proteins, based on the hybridization purification strategy, is capture hybridization analysis of RNA targets (CHART). Although highly similar to ChIRP, it differs from the former approach in its design criteria for the probes. While the probes for ChIRP span the complete lncRNA, CHART requires specially designed capture oligonucleotides capable of hybridizing to accessible regions of the lncRNA. This is mainly achieved when formaldehyde-crosslinked nuclear lysates are subjected to RNase H treatment, exposing potential hybridization regions.
Follow-up mass spectrometry helps identify the associated proteins and, similar to ChIP, CHART helps identify the regions of the genome where the RNA is bound.[90,91] CHART coupled with high-throughput sequencing was used to identify binding sites for MALAT1 and NEAT1, and CHART coupled with mass spectrometry helped unveil a repertoire of proteins associated with nuclear speckle and paraspeckle components and with active chromatin.[92]
RAP is another approach for capturing the lncRNA under investigation. RAP requires a crosslinking step but is not restricted to a particular kind of crosslinking: psoralens, formaldehyde, or UV light can be used. While psoralens are more suitable for RNA-RNA interactions, formaldehyde and UV crosslinking are preferred for studying interactions between proteins and nucleic acids (both RNA and DNA). RAP differs from the other RNA-centric methods in its use of long biotinylated capture probes, usually greater than 60 nucleotides, which form very stable RNA-DNA hybrids.[93] Using RAP, the lncRNA FIRRE was shown to associate with the nuclear matrix factor hnRNPU.[94] The authors were also able to show that upon genetic deletion of FIRRE and hnRNPU, the localization of this lncRNA at the transchromosomal interacting loci was lost, highlighting that RAP can be used to study nuclear architecture even across chromosomes.[94] Like the previous approaches, RAP can be coupled with mass spectrometry (MS) to study interacting proteins and with RNA/DNA sequencing to identify potential interacting or binding regions. Recently, RAP-MS helped identify about 10 proteins associated with Xist, in particular elucidating its direct interaction with SHARP to silence transcription through the HDAC3 complex and subsequently mediate the recruitment of PRC2 in a SHARP- and HDAC3-dependent manner.[95]
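The probe design shared by ChIRP and RAP, antisense oligonucleotides tiled along the target lncRNA, can be sketched in a few lines of Python. This is a minimal illustration, not a published design tool: the function names, the default probe length, and the tiling step are assumptions chosen for the example, and real probe sets are additionally filtered for repeats and off-target homology, which is not shown here.

```python
# Minimal sketch of antisense tiling-probe design (hypothetical helper names).
# Probe length and step are illustrative; adjust them to the method in use.

_DNA_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def antisense_dna(rna: str) -> str:
    """Return the DNA oligo that is antisense (reverse complement) to an RNA."""
    as_dna = rna.upper().replace("U", "T")
    return as_dna.translate(_DNA_COMPLEMENT)[::-1]


def tile_antisense_probes(rna: str, probe_len: int = 120, step: int = 120):
    """Tile antisense probes along the target RNA.

    Returns (start, probe) pairs, where start is the 0-based position of the
    targeted window on the RNA. A trailing window shorter than probe_len is
    dropped rather than padded.
    """
    probes = []
    for start in range(0, len(rna) - probe_len + 1, step):
        window = rna[start:start + probe_len]
        probes.append((start, antisense_dna(window)))
    return probes


if __name__ == "__main__":
    # Toy 10-nt "lncRNA" with short probes so the result is easy to inspect.
    for start, probe in tile_antisense_probes("AUGGCCAUUG", probe_len=4, step=3):
        print(start, probe)
```

Setting `step` smaller than `probe_len` gives overlapping probes; setting it larger leaves gaps between probes. The affinity tag (for example biotin) is added during oligonucleotide synthesis and is not represented in the sequence.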

Identification of lncRNA Structural-Functional Relationships

Biochemical strategies provide information on the structure-function relationships of lncRNAs by probing their structures with techniques such as dimethyl sulfate sequencing (DMS-Seq), selective 2′-hydroxyl acylation analyzed by primer extension sequencing (SHAPE-Seq), fragmentation sequencing (FRAG-Seq), and parallel analysis of RNA structure (PARS). SHAPE is a widely used technique based on the reactivity of the ribose 2′-hydroxyl moiety. The main SHAPE reagents, 1-methyl-7-nitroisatoic anhydride (1M7) and N-methylisatoic anhydride (NMIA), acylate this moiety, forming a 2′-O-adduct that blocks reverse transcription when the RNA is subsequently subjected to cDNA synthesis. On its own, SHAPE can be applied only to a limited number of RNAs or single-stranded regions, but it has become powerful with the addition of next-generation sequencing, enabling genome-wide structure probing.[96,97] Another recent advance is SHAPE-MaP (mutational profiling), which is less cumbersome and does not involve any RNA ligation steps or library preparation steps.[98] RNA interaction groups by mutational profiling (RING-MaP), another technique used for RNA structure-function studies, also aids in understanding 3D RNA structure. DMS, the reagent used in RING-MaP, has the limitation that it modifies only adenine and cytosine nucleotides, which could bias the interpretation of the results. PARS is another technique for genome-wide analysis of RNA structure. This method uses RNase V1 and S1 nuclease digestion followed by RNA sequencing. In addition, this method suggested a stark difference between coding regions and untranslated regions.
The coding regions undergo fewer conformational changes owing to their more structured nature, whereas the less structured UTRs expose their functional elements.[99-101] Frag-Seq is another technique that employs nuclease P1 to digest single-stranded RNA, followed by high-throughput sequencing and bioinformatic analysis of the generated fragments.[102] Table 2 summarizes some of the techniques used in understanding the functions of lncRNAs, their respective probes, and the advantages of using the techniques. Figure 1 provides a snapshot of the diverse lncRNA interactome and the protein-, RNA-, and DNA-centric approaches that can be used to further investigate the long noncoding RNA.
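To make the read-outs of these structure-probing methods concrete, the sketch below computes two per-nucleotide scores in Python. Neither formula is stated in this review; both are assumptions taken from common practice. For SHAPE-MaP, a background-corrected reactivity is often computed as (modified rate − untreated rate) / denatured rate, and for PARS a score is commonly defined as the log2 ratio of RNase V1 reads (paired) to S1 nuclease reads (unpaired). Function names and the pseudocount are hypothetical.

```python
import math


def shape_map_reactivity(modified, untreated, denatured):
    """Per-nucleotide SHAPE-MaP reactivity from mutation rates.

    Assumed formula: (modified - untreated) / denatured.
    Positions without a usable denatured control (rate <= 0) yield None.
    """
    reactivities = []
    for m, u, d in zip(modified, untreated, denatured):
        reactivities.append((m - u) / d if d > 0 else None)
    return reactivities


def pars_score(v1_reads, s1_reads, pseudocount=1.0):
    """Per-nucleotide PARS-style score: log2(V1 / S1) with a pseudocount.

    Positive values suggest paired nucleotides (V1 cuts double-stranded RNA);
    negative values suggest unpaired nucleotides (S1 cuts single-stranded RNA).
    """
    return [
        math.log2((v + pseudocount) / (s + pseudocount))
        for v, s in zip(v1_reads, s1_reads)
    ]


if __name__ == "__main__":
    print(shape_map_reactivity([0.05, 0.01], [0.01, 0.01], [0.04, 0.0]))
    print(pars_score([7, 0], [1, 3]))
```

In practice both scores are usually normalized across the transcript and filtered by read depth before being used as constraints for structure prediction; those steps are omitted here.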

Table 2.

Techniques Used to Investigate lncRNAs.

Technique	Bait	Crosslinking	Interaction	Technical Concept	Scope	Reference
nRIP	Protein	No	Direct/indirect	Captures transcriptome and its targets. RNA and protein components associated with the protein of interest.	Genome-wide	[75]
CLIP-Seq	Protein	UV 254 nm	Direct	Captures protein-RNA interactions in vivo. RNA components associated with protein of interest.	Genome-wide	[64]
CLIP–mass spectrometry	Protein	UV 254 nm	Direct/indirect	Captures protein-RNA interactions in vivo. Proteins complexes associated with protein of interest and the RNA targets it interacts with.	Genome-wide	[126,127]
PAR-CLIP	Protein	UV 365 nm	Direct; T→C (4-SU) or G→A (6-SG) transitions	Captures protein-RNA covalent binding enabled by efficient crosslinking of incorporated 4-SU or 6-SG.	Genome-wide	[71,127]
iCLIP	Protein	UV 254 nm	Direct; bound to a barcode sequence	Circularization of reverse transcribed products after the ligation of cleavable adaptors.	Genome-wide	[68,72]
RNA pulldown	lncRNA	Optional	Direct	Affinity tags such as biotin or MS2 aptamers fused to the lncRNA pull down the lncRNA interactome, including the targets of the lncRNA and interacting complexes. Proteins can be studied by immunoblotting or mass spectrometry.	lncRNA-specific interactions	[85,128,129]
RAP[50,83]	Antisense-RNA	Disuccinimidyl glutarate-formaldehyde-aminomethyl-trioxsalen	Direct/indirect	120-nt probes antisense to the target RNA are tiled across the entire RNA target. The biotinylated probes capture and enrich the lncRNA despite protein-RNA interactions, RNA degradation, and RNA secondary structure.	Genome-wide	[93,130]
ChIRP	DNA	Glutaraldehyde	Direct	Antisense DNA probes that hybridize to target RNA. Pulls down endogenous RNA and associated genomic DNA.	Genome-wide	[131]
ChIRP-domain	DNA	Glutaraldehyde-formaldehyde	Direct	Enables the pulldown of endogenous RNA-chromatin interactions in living cells. Similar to ChIRP, also provides functional information on the architecture and domains of the RNA under investigation.	Genome-wide	[88]
SHAPE-Seq	RNA	1-Methyl-7-nitroisatoic anhydride (1M7), N-methylisatoic anhydride (NMIA)	Architecture/structure	Selective 2′-hydroxyl acylation analyzed by primer extension sequencing (SHAPE-Seq) measures nucleotide-resolution flexibility information for RNAs in vitro and in vivo.	Structural information	[97]
SHAPE-MaP	RNA	1M7, NMIA, 1-methyl-6-nitroisatoic anhydride (1M6)	Architecture/structure	Similar to SHAPE, but SHAPE-MaP provides additional information from mutations, yields accurate and high-resolution secondary-structure models, and disentangles sequence polymorphisms.	Structural and mutation information	[98]
DMS-Seq	RNA	Dimethyl sulfate	Architecture/structure	Can be performed in vivo and in vitro. DMS modifies unpaired adenine and cytosine residues, and the modifications are identified by deep sequencing.	Structural modifications	[132]
FRAG-Seq	RNA	Nuclease P1	Architecture/structure	High-throughput RNA structure probing method that sequences the fragments generated by digestion with nuclease P1, which specifically cleaves single-stranded nucleic acids.	Genome-wide	[133]
PARS, PARTE	RNA	RNase V1, S1 nuclease	Architecture/structure	High-throughput deep sequencing of RNA fragments that are treated with structure-specific enzymes, providing in vitro profiling of secondary structures at single-nucleotide resolution.	Genome-wide	[99,134]
icSHAPE	RNA	2-methylnicotinic acid imidazolide N3	Architecture/structure	Living cells are treated with the icSHAPE chemical NAI-N3 followed by selective chemical enrichment of NAI-N3–modified RNA, which provides an improved signal-to-noise ratio compared with similar methods leveraging deep sequencing. Purified RNA is then reverse-transcribed to produce cDNA, with SHAPE-modified bases leading to truncated cDNA.	Genome-wide	[135,136]

4-SU, 4-thiouridine; 6-SG, 6-thioguanosine; cDNA, complementary DNA; ChIRP, chromatin isolation by RNA purification; CLIP, crosslinked immunoprecipitation; CLIP-Seq, crosslinked immunoprecipitation sequencing; iCLIP, individual nucleotide resolution crosslinked immunoprecipitation; DMS-Seq, dimethyl sulfate sequencing; FRAG-Seq, fragmentation sequencing; icSHAPE, in vivo click selective 2′-hydroxyl acylation analyzed by primer extension; lncRNA, long noncoding RNA; nRIP, native RNA immunoprecipitation; PAR-CLIP, photoactivatable ribonucleoside-enhanced crosslinked immunoprecipitation; PARS, parallel analysis of RNA structure; PARTE, parallel analysis of RNA structure with temperature elevation; RAP, RNA antisense purification; SHAPE-MaP, selective 2′-hydroxyl acylation analyzed by primer extension mutational profiling; SHAPE-Seq, selective 2′-hydroxyl acylation analyzed by primer extension sequencing; UV, ultraviolet.

Figure 1.

Long noncoding RNA (lncRNA) interactome and strategies. The lncRNA interactome is complex and involves DNA, RNA, and/or proteins. It is important to understand the mechanisms and functions of lncRNAs and the roles they play in normal and diseased states. lncRNAs can be studied with any of the following techniques. Protein-lncRNA interactions broadly represent the protein partners of lncRNAs and suggest their functional mechanisms and pathways. RNA immunoprecipitation (RIP) and crosslinked immunoprecipitation (CLIP) techniques provide clues to the associated RNAs when ribonucleoprotein complexes are pulled down with an antibody against the protein of interest. Coupling these techniques with high-throughput RNA sequencing or mass spectrometry could help identify, respectively, the interactions of the protein with lncRNAs genome-wide or the other proteins associated with the RNA-binding protein complex or the protein of interest. Techniques that report on structural features, such as the secondary and tertiary structure of lncRNAs, ultimately aid in understanding lncRNA function. These features can be probed with techniques and chemical reagents that cleave RNA at specific nucleotides or attack solvent-exposed regions while sparing regions that are buried or covered by proteins. Crosslinking can also reveal intramolecular interactions that extend over a long range. Ribonucleases with different cleavage specificities can be used to obtain an RNase footprint of regions potentially covered by proteins. Methods such as selective 2′-hydroxyl acylation analyzed by primer extension (SHAPE) and in-line probing aim to provide information on local nucleotide flexibility. Coupling SHAPE with sequencing could provide details of binding regions. Fragment sequencing (FragSeq) and parallel analysis of RNA structure (PARS), on the other hand, employ RNase digestion to provide information on RNA structure. Several techniques have been developed to identify the genomic DNA targets of lncRNAs. Based on the workflow backbone of chromatin immunoprecipitation (ChIP), chromatin isolation by RNA purification (ChIRP) helps identify lncRNAs associated with unique chromatin marks, whereas techniques such as chromatin oligo-affinity purification (ChOP) and capture hybridization analysis of RNA targets (CHART) are used to identify the complementary DNA regions that interact with the RNA of interest. In addition, coupling with RNA sequencing and quantitative PCR or with mass spectrometry could yield important information regarding the RNA and protein interactomes, respectively.


Future Directions

Since the majority of lncRNAs are present in cells at very low levels, studying their interactions with proteins or nucleic acids poses a major challenge for techniques used to study the lncRNA interactome, such as RIP, RAP, CLIP, and related techniques. Initially, the low abundance of lncRNAs was considered a limitation, but with advances in sequencing technologies and in molecular and biochemical techniques, it is now possible to investigate lncRNAs that are present at low copy number in cells. Harnessing CRISPRa could help overcome this challenge through endogenous overexpression of the lncRNA. While recent studies focus primarily on lncRNA expression in total cell populations, lncRNAs expressed in different subcellular compartments can also be investigated, bearing in mind the need to scale up the experiment. With single-cell transcriptomics on the rise, expression of a lncRNA at the single-cell level could provide substantial information on its function.[103] While substantial information on the genetic annotation of lncRNAs has been obtained from advanced sequencing technologies, their subcellular localization remains elusive. High-resolution quantification and spatial positioning of lncRNAs are possible with single-molecule RNA fluorescence in situ hybridization, which could aid in functionally classifying them based on subcellular localization.[104] In addition, these noncoding RNAs are present at low levels, possibly because they are unstable and rapidly degraded after transcription.[105] The functionality of lncRNAs in cells needs to be thoroughly analyzed through gene perturbation experiments such as overexpression and knockdown, followed by real-time quantitative PCR or deep sequencing to detect differential gene expression.
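As an illustration of how a perturbation experiment followed by real-time quantitative PCR is typically quantified, the sketch below computes relative expression in Python with the widely used 2^(−ΔΔCt) calculation. This method is not named in the review and is brought in here as an assumption; it also assumes roughly 100% amplification efficiency for both the target and the reference gene. The function name and the Ct values are hypothetical.

```python
def relative_expression(ct_target_treated, ct_ref_treated,
                        ct_target_control, ct_ref_control):
    """Fold change of a target transcript by the 2^(-ddCt) calculation.

    Each Ct is first normalized to a reference (housekeeping) gene measured
    in the same sample, then the treated sample is compared with the control.
    Assumes about 100% PCR efficiency (a doubling per cycle).
    """
    delta_treated = ct_target_treated - ct_ref_treated
    delta_control = ct_target_control - ct_ref_control
    delta_delta_ct = delta_treated - delta_control
    return 2.0 ** (-delta_delta_ct)


if __name__ == "__main__":
    # Hypothetical knockdown: the lncRNA crosses threshold 2 cycles later
    # in treated cells while the reference gene is unchanged.
    fold = relative_expression(26.0, 18.0, 24.0, 18.0)
    print(f"relative expression after knockdown: {fold:.2f}")
```

A value below 1 indicates reduced expression relative to the control and a value above 1 indicates overexpression. Because lncRNAs are often low in abundance, Ct values near the detection limit should be interpreted with caution.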
Importantly, not all lncRNAs demonstrate functions in both cells and laboratory model animals, as most preliminary screens are performed in cell lines rather than under the physiological conditions of living organisms. Further challenges can arise from limited sensitivity of the technique used or simply from failure to knock down or overexpress the lncRNA because of its genomic architecture. A combination of computational and functional analyses such as those mentioned earlier is required to identify novel lncRNAs involved in both normal physiological conditions and diseased states. With the identification of regulatory ncRNAs, it has become readily apparent that the regulation of proteins has been greatly underestimated. We are only now beginning to understand that signal transduction depends on what appears today to be an almost incalculable level of regulation within a signaling cascade, if not at the level of the individual protein. This continuous dynamic regulation is necessary for cells to fine-tune external signaling cues into appropriate transcriptional responses. The finding that loss-of-function mutations within ncRNAs contribute to the genesis and progression of human disorders further highlights their importance. Nevertheless, defining the specific modes of action of lncRNAs will be a daunting task, possibly even greater than defining the functional relevance of the approximately 20,000 predicted proteins. Using next-generation sequencing technology, large genome efforts such as ENCODE have begun to uncover an outline of the genome and, equally important, the corresponding transcriptional profiles. This has permitted the identification of not only lncRNAs but also other overlapping sense transcripts. Although in these cases sequence overlap may indicate that these lncRNAs function as transcriptional activators or repressors, sequence homology is also essential for identifying miRNA binding partners or 3′-UTR overlaps.
Arguably, identifying the functions of these lncRNAs is relatively simple compared with elucidating elaborate mechanisms involving protein complexes or subcellular localization. Understanding the mechanism of action of these lncRNAs will remain the key challenge in identifying therapeutics that target these molecules. Presently, the best method to target lncRNAs is through oligonucleotide-based therapies. While major progress has been made in oligonucleotide design, through either in silico–based crystallography or the generation of chemically modified analogues, effective targeting of these molecules remains inefficient. As mechanisms of action, oligonucleotide design, and targeting methods continue to be researched, it is expected that a number of compounds targeting lncRNAs will be included in a physician’s vademecum for the treatment of a wide variety of diseases.

