| Literature DB >> 26509058 |
James Ding1, Stephen Eyre1, Jane Worthington1.
Abstract
Genome-wide association studies (GWASs) have been used to great effect to identify genetic susceptibility loci for complex disease. A series of GWAS and meta-analyses have informed the discovery of over 100 loci for rheumatoid arthritis (RA). In common with findings in other autoimmune diseases the lead signals for the majority of these loci do not map to known gene sequences. In order to realise the benefit of investment in GWAS studies it is vital we determine how disease associated alleles function to influence disease processes. This is leading to rapid development in our knowledge as to the function of non-coding regions of the genome. Here we consider possible functional mechanisms for intergenic RA-associated variants which lie within lncRNA sequences.Entities:
Keywords: Arthritis; Autoimmune Diseases; Inflammation; Rheumatoid Arthritis
Year: 2015 PMID: 26509058 PMCID: PMC4612696 DOI: 10.1136/rmdopen-2014-000028
Source DB: PubMed Journal: RMD Open ISSN: 2056-5933
Intergenic RA-associated regions that overlap lincRNA
| RA-associated region | Lead SNP | Gene | Expl.H2 | OR | Overlapping lincRNA |
|---|---|---|---|---|---|
| chr1:38614865–38644861 | rs12140275 | POU3F1 | 0.039 | 1.11 | lnc-UTP11L-2 |
| chr2:100806512–100835734 | rs9653442 | AFF3 | 0.064 | 1.11 | lnc-NMS-1 |
| chr4:26085478–26128710 | rs11933540 | SMIM20 | 0.101 | 1.15 | lnc-RBPJ-1 |
| chr6:137959233–138006504 | rs17264332 | TNFAIP3 | 0.100 | 1.17 | lnc-OLIG3-1 and lnc-TNFAIP3-3 |
| chr6:159489789–159515309 | rs2451258 | TAGAP | 0.041 | 1.10 | lnc-FNDC1-7 |
| chr8:102451261–102469182 | rs678347 | GRHL2 | 0.034 | 1.08 | lnc-ZNF706-5 |
| chr8:129540462–129571140 | rs1516971 | TMEM75 | 0.044 | 1.15 | lncTMEM75-2 |
| chr15:69984460–70010647 | rs8026898 | RPLP1 | 0.079 | 1.15 | lnc-RPLP1-2 |
| chr16:86005836–86021627 | rs13330176 | IRF8 | 0.064 | 1.12 | lnc-IRF8-5 |
Included is the associated region (r2=0.8), lead SNP, closest protein-coding gene (Gene), explained heritability (Expl.H2, %) OR and overlapping lincRNA (LNCipedia nomenclature).19 Explained heritability and ORs are taken from Okada et al.5
RA, rheumatoid arthritis; SNP, single nucleotide polymorphism.
Online lncRNA resources
| Database | Nomenclature | size | Description of key features | Provenance of lncRNAs | Reference |
|---|---|---|---|---|---|
| lncRBase | hsaLB_AN_74656 | 133 361 | Comprehensive database categorising lncRNA according to genomic context. Integrates, coding potential, expression data, associated genomic elements and publications | lncRNAdb, Broad Institute, Ensembl, NONCODE, H-InvDB | |
| lncipedia | lnc-SMUG1-7 | 113 513 | Grouping of isoforms that share at least one exon. Integrates prediction of secondary structure, locus conservation and coding potential | lncRNAdb, Ensembl, Gencode, RefSeq, NONCODE, Broad Institute and two further RNA Seq publications | |
| NONCODE | n343067/NONHSAT028510 | 95 135 | ncRNA database with detailed information regarding provenance and expression | RefSeq, literature mining and specialised databases | |
| lncRNASNP | lnc-SMUG1-7 | 32 108 | Lists overlapping miRNA binding sites and SNPs, predicting the influence of SNPs on secondary structure and miRNA binding and providing corresponding miRNA expression data | lncipedia, dbSNP and stringency filtering | |
| Ensembl | ENSG00000228630 | 23 498 | Integrative database of genome annotation with detailed information regarding provenance | Computational prediction from ESTs and chromatin marks | |
| Gencode | ENSG00000228630 | 15 877 | Integrative database including information on DNA methylation, occupancy and chromatin state, along with RNA expression and binding | HAVANA manual annotation and Ensembl annotation pipeline, mostly validated | |
| Havana | OTTHUMT 00000328662 | 14 396 | Manual genomic annotation supported by transcriptional evidence | Manual annotation | |
| Broad Institute | TCONS_00079054 | 14 353 | Expression levels in 24 tissues and cell types, stringent set of 4662 lincRNA | RNA Seq in 24 tissues and cell types | |
| ChIPBase | lncRNA2314-1 | 10 559 | Integrates chromatin immunoprecipitation sequencing data with lncRNA occupancy | Ensembl, refseq, UCSC, lncRNAdb and various publications | |
| RefSeq | NR_047517.1 | 6917 | Non-redundant database of annotated sequences | Data submitted to the International Nucleotide Sequencing Database | |
| lincPoly | XLOC_000000* | 4662 | Similar to lncRNASNP, with SNPs categorised according to phenotype and conservation data integrated in the place of miRNAs | Stringent set from Broad Institute | |
| lncRNAdb | HOTAIR | 166 | Manually curated database ofexperimentally validated, functional lncRNAs | Literature mining |
This selection of resources is not exhaustive, but presents a broad spectrum of available and up to date tools. Included is an example of the nomenclature preferred by each database, with all entries referring to the lncRNA HOTAIR.
*HOTAIR is not found in lincPoly.
Immunoprecipitation techniques for investigating lncRNA
| Acronym | Technique | Discriminating features | Reference |
|---|---|---|---|
| CHART | Capture hybridisation analysis of RNA targets | Tiled oligonucleotides against lncRNA, yielding bound DNA and protein | |
| ChIRP | Chromatin isolation by RNA purification | Pool of ∼20nt probes and gluteraldehyde crosslinking. Yields bound DNA and protein | |
| CLASH | Crosslinking, ligation and sequencing of hybrids | Small pool of ∼20nt probes. Partial RNaseH treatment during elution. Yields bound RNA | |
| iCLIP | Individual nucleotide resolution UV-crosslinking and immunoprecipitation | Antibody used against protein of interest, with interacting lncRNA nucleotide interrogated by sequencing | |
| RAP | RNA antisense purification | Large pool of ∼120nt probes. Yields bound DNA and RNA | |
| RIP | RNA immunoprecipitation | Antibody used against protein of interest, with presence of lncRNA interrogated. No cross-linking | |
| ChRIP | Chromatin RNA immunoprecipitation | Antibody used against modified chromatin, with presence of lncRNA interrogated | |
| ChOP | Chromatin oligo-affinity precipitation | Single ∼80nt probe, yielding interacting DNA |
All methods use biotinylated oligonucleotides as probes against the lncRNA of interest and formaldehyde cross-linking, unless otherwise stated.
Figure 1Graphical summary. (A) Genome-wide association studies single nucleotide polymorphism (GWAS SNPs) for complex diseases are mainly found outside of protein coding regions, and often overlay lncRNAs and/or enhancer elements. In this example, SNPs are found overlapping and proximal to three transcripts (curved lines); including two lncRNA, one of which is enhancer derived (red) and a mRNA (blue). (B) A detailed investigation of how these variants contribute to increased disease risk will have a profound impact on our understanding of the disease and is likely to implicate enhancers and lncRNA. lncRNAs (eg, green curved line) often function by regulating the transcription or translation of protein coding genes. One model of how this may occur is the mediation of chromatin interactions (eg, red curved line), which may involve the recruitment of chromatin modifiers, transcriptional activators or other proteins (green and orange forms, RNA pol II is represented by the purple form).