| Literature DB >> 29307144 |
Mohammadreza Hajjari1, Saghar Rahnama1.
Abstract
HOTAIR is an lncRNA that has been known to have an oncogenic role in different cancers. There is limited knowledge of genetic and epigenetic elements and their interactions for the gene encoding HOTAIR. Therefore, understanding the molecular mechanism and its regulation remains to be challenging. We used different in silico analyses to find genetic and epigenetic elements of HOTAIR gene to gain insight into its regulation. We reported different regulatory elements including canonical promoters, transcription start sites, CpGIs as well as epigenetic marks that are potentially involved in the regulation of HOTAIR gene expression. We identified repeat sequences and single nucleotide polymorphisms that are located within or next to the CpGIs of HOTAIR. Our analyses may help to find potential interactions between genetic and epigenetic elements of HOTAIR gene in the human tissues and show opportunities and limitations for researches on HOTAIR gene in future studies.Entities:
Keywords: CpG Islands; HOTAIR; bioinformatics; epigenetics; gene expression
Year: 2017 PMID: 29307144 PMCID: PMC5769859 DOI: 10.5808/GI.2017.15.4.170
Source DB: PubMed Journal: Genomics Inform ISSN: 1598-866X
Softwares and databases utilized in this article
| Type of analysis | Usage | Software/database | Reference/address | |
|---|---|---|---|---|
|
| ||||
| Genetic features | Epigenetic features | |||
| O | O | Finding different transcripts | Ace view | |
| UCSC | ||||
| Ensembl | ||||
|
| ||||
| O | - | Promoter detection | HMM | |
| Promoter scan | ||||
| Promoter 2.0 | ||||
| Ensembl | ||||
| Ace view | ||||
|
| ||||
| O | - | Alternative transcription start sites | Eponine | |
| SwitchGear | ||||
|
| ||||
| O | - | CpGIs detection | UCSC | |
| Bona fides CGIs | ||||
| CpGProD | ||||
| Weizmann evolutionary CGIs | ||||
|
| ||||
| O | - | DNase I hypersensitivity peak clusters | UCSC | |
|
| ||||
| - | O | CpGIs methylation status | ENCODE | |
|
| ||||
| O | O | Gene expression analysis | Ace view | |
| GTEX RNA-SEQ | ||||
|
| ||||
| O | O | Finding CTCF | ENCODE | |
|
| ||||
| O | - | Finding motifs | MEME | |
| Mast program | ||||
|
| ||||
| O | O | Transcription factor binding sites | PreMode | |
|
| ||||
| O | - | Detection of enhancers | HMM | |
|
| ||||
| O | - | Finding repeated sequences | Repeat masker | |
| Tandem repeat by TRF | ||||
|
| ||||
| O | - | Single nucleotide polymorphism | dbSNP | |
|
| ||||
| - | O | Detection of histone marks | UCSC | |
UCSC, University of California, Santa Cruz; HMM, Hidden Markov Model; CGI, CpG Island; ENCODE, Encyclopedia of DNA Elements; TRF, tandem repeat finder.
Fig. 1The flowchart of the methods used in the study. TSS, transcription start site; SNP, single nucleotide polymorphism.
Fig. 2Transcript variants of HOTAIR gene derived from the GENCODE, Ensemble and Refseq.
Fig. 3Integrated regulatory elements of HOTAIR gene structure. The schematic diagram shows a summary of results from different databases and software which are described in the text.
Fig. 4CpG Islands in the HOTAIR gene. The data are derived from databases and prediction software. CGI, CpG Island.
The positions of regulatory sequences which are near or within CpG165 of HOTAIR
| Position of CpG165 | Promoter (active) | Other CpGIs | Tandem repeat (strand+) | CTCF | Strong enhancer | DNase I hypersensitivity | Module and TSSs |
|---|---|---|---|---|---|---|---|
| CpG165: 54366816–54369103 | HSMM cells: 54365934–54370733 | Bona fide 1437: 54366623–54367999 | (GGCGGA)n: 54367601–54367637 | 54366799–54367314 | NHEK cells: 4 strong enhancers: 54365934–54367133 | 41: 54366785–54367814 | 025610: 54366634–54366977 |
|
| |||||||
| NHEK cells: 54367139–54369133 | CpG2 (WE): 54366684–54366909 | (GGGA)n: 54367731–54367801 | NHEK cells DNase I hotspot: 75095: 54366045–54370999 | 025613: 54367707–54368584 | |||
|
| |||||||
| First active promoter based on ensembl: 54365691–54370092 | CpG1 (CpGProD): 54366456–54368740 | TSSs: CHR12-P039 7-R1: 54366912–54366912 | |||||
|
| |||||||
| CpG2.4 (WE): 54368334–54368964 | CHR12-P0397-R2: 54367584–54367584 | ||||||
|
| |||||||
| Bona fide 1438: 54368166–54369840 | |||||||
Positions are based on UCSC hg19.
TSS, transcription start site; WE, Weizmann evolutionary.