| Literature DB >> 24478791 |
Mohammadreza Hajjari1, Atefeh Khoshnevisan2, Bernardo Lemos3.
Abstract
Limited understanding of the Rb1 locus hinders genetic and epigenetic analyses of Retinoblastoma, a childhood cancer of the nervous systems. In this study, we used in silico tools to investigate and review putative genetic and epigenetic elements of the Rb1 gene. We report transcription start sites, CpG islands, and regulatory moieties that are likely to influence transcriptional states of this gene. These might contribute genetic and epigenetic information modulating tissue-specific transcripts and expression levels of Rb1. The elements we identified include tandem repeats that reside within or next to CpG islands near Rb1's transcriptional start site, and that are likely to be polymorphic among individuals. Our analyses highlight the complexity of this gene and suggest opportunities and limitations for future studies of retinoblastoma, genetic counseling, and the accurate identification of patients at greater risk of developing the malignancy.Entities:
Keywords: CpG islands; epigenetics; in silico analysis; retinoblastoma
Year: 2014 PMID: 24478791 PMCID: PMC3904107 DOI: 10.3389/fgene.2014.00002
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Databases and softwares used in this study.
| Finding mRNA isoforms | Ace view | |
| UCSC | ||
| Expression analysis | Ace view | |
| Affymetrix exon array GNF Gene Expression Atlas2 | ||
| Promoter detection | Hidden Markov Model | UCSC ( |
| CoreBoost_HM Promoter Prediction | UCSC ( | |
| Promoter scan | ||
| Promoter2 | ||
| Alternative transcription start sites | DBTSS | |
| Eponine | UCSC ( | |
| SwithGear | UCSC ( | |
| Detection of CpGIs | UCSC | |
| Bona fide CGIs | ||
| CpGProD | ||
| CpGcluster | ||
| CpG-MI tool | ||
| Weizmann Evolutionary CpG Islands | UCSC ( | |
| Estimation of the CGI's methylation status | Bona fide CGIs | |
| Finding repeated sequences | Estimation of repeat variability | |
| Repeat masker | ||
| Inspecting histone marks | UCSC | |
| DNase I hypersensitive sites | UCSC | |
| Transcription factor binding sites | CisRed | |
| PReMode | ||
| ENCODE | UCSC ( | |
| Prediction of insulator sites | CTCFBSDB |
Figure 1. Transcribed RNAs from this locus identified by UCSC genome browser. The exons are represented by black blocks. Different promoters and transcription start sites of Rb1 locus are shown. The diagram shows a schematic representation of results from different databases and programs which are described in the text. The yellow circles show CpG islands identified by UCSC genome browser. There are two red boxes which show the promoters identified by HMM-Promoter prediction algorithm. TSSs (Transcription start sites) are recognized by different algorithms such as CoreBoost, Eponine, and SwitchGear. LPAR is a gene within Rb1.
Transcription start sites (TSSs) identified in the DBTSS database for different cell lines.
| Hela | 48878016 |
| DLD1 | 48877884 |
| Beas2B | 48877877 |
| Ramos | 48877983 |
| 48876242 | |
| MCF7 | 48877937 |
Table shows the cell lines (left column) and the position of TSSs in the Rb1 gene.
Figure 2The positions of CpG islands in the . The first and last blocks in the schematic gene represent the first and fourth exons of Rb1, respectively. “Bona fide” strategy accounts for a number of functional CGIs and estimates their strengths (see scores in the figure). Also, CpGProD program predicts promoter- overlapping CGIs. “Weizmann CpG islands” predicts highly conserved CGIs. Although different methods were used, the results are largely concordant.
Comparison of CpGs identified with different programs.
| 1–3.5 | CGI106: 1578-2619 | CGI775:1429-2956 | CG1:1370-3076 | #1:1540-1710 |
| #2:1759-2619 | ||||
| #3:2673-2898 | ||||
| 10–10.5 | No | No | #4:10015-10235 | |
| 12–12.5 | No | No | #5:12159-12302 | |
| 14.5–16 | CGI42:15076-15667 | No | CG2:14857-15723 | #6:15038-15446 |
| #7:15560-15667 | ||||
| #8:15839-16103 | ||||
| 16–17 | CGI85:16754-17975 | CGI779:16336-16550 | CG3:16486-20182 | #9:16592-16986 |
| 17–18 | No | #10:17039-17211 | ||
| #11:17254-17430 | ||||
| #12:17494-17738 | ||||
| #13:17786-17975 | ||||
| 18–19 | No | No | #14:18458-18645 | |
| #15:18807-19080 | ||||
| 19–20.5 | No | CGI782: 19195-19545 | #16:19167-19443 | |
| #17:19596-19672 | ||||
| #18:19823-20089 | ||||
| 155–165 | No | No | CG4:163702-164409 | #19:155929-156023 |
| #20:156294-156415 | ||||
| #21:163774-164177 |
Although each algorithm has its own strategy, there are some concordances between the results. For simplicity, we have divided the Genomic Region under Analysis (GRA) into smaller segments (First column).
Tandem repeats in .
| GCC | 2194–2246 |
| CA | 14974–15038 |
| TG | 44625–44668 |
| TG | 104353–104389 |
| AGTCATCTTCTACCAAACCTCACCTCCAGCATTGGGGAGCACACTTCAACACG | 125368–126744 |
| AAAC | 128996–129033 |
| TTCT | 158141–158239 |
Repeats were recognized by the “Estimation of Repeat variability” toolkit and have Var score above 0.5. Positions are relative to the nucleotide in −2 kb of the canonical Rb1 transcription start site.
Overlapped with CpG # 775.
Neighborhood with CpG #6 identified by CpG cluster.
CTCF binding motifs within the Genomic Region under Analysis.
| CCGGCCTGGAGGGGGTGGTT | 1796 |
| GGAACTGCA | 2597 |
The positions are relative to −2 kb of the canonical transcription start site of the Rb1 gene.