| Literature DB >> 16381948 |
Hideya Kawaji1, Takeya Kasukawa, Shiro Fukuda, Shintaro Katayama, Chikatoshi Kai, Jun Kawai, Piero Carninci, Yoshihide Hayashizaki.
Abstract
Cap-analysis gene expression (CAGE) Basic and Analysis Databases store an original resource produced by CAGE, which measures expression levels of transcription starting sites by sequencing large amounts of transcript 5' ends, termed CAGE tags. Millions of human and mouse high-quality CAGE tags derived from different conditions in >20 tissues consisting of >250 RNA samples are essential for identification of novel promoters and promoter characterization in the aspect of expression profile. CAGE Basic Database is a primary database of the CAGE resource, RNA samples, CAGE libraries, CAGE clone and tag sequences and so on. CAGE Analysis Database stores promoter related information, such as counts of related transcripts, CpG islands and conserved genome region. It also provides expression profiles at base pair and promoter levels. Both databases are based on the same framework, CAGE tag starting sites, tag clusters for defining promoters and transcriptional units (TUs). Their associations and TU attributes are available to find promoters of interest. These databases were provided for Functional Annotation Of Mouse 3 (FANTOM3), an international collaboration research project focusing on expanding the transcriptome and subsequent analyses. Now access is free for all users through the World Wide Web at http://fantom3.gsc.riken.jp/.Entities:
Mesh:
Year: 2006 PMID: 16381948 PMCID: PMC1347397 DOI: 10.1093/nar/gkj034
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1An overview of the CAGE supporting systems and data flow among them.
Main contents of the database systems
| Database | Contents |
|---|---|
| CAGE Basic Database | RNA sample information |
| CAGE library information | |
| CAGE clone plate/spot | |
| CAGE clone sequence | |
| CAGE clone sequence quality | |
| CAGE tag sequence | |
| CAGE tag mapping status | |
| Associations of CAGE tags with CTSS | |
| Associations of CTSS with TCs | |
| Associations of TCs with transcript and TUs | |
| CAGE Analysis Database | Base pair level expression profile |
| TC expression profile within TUs | |
| Statistical significance expression fluctuations | |
| Presence of predicted core promoter elements | |
| Presence of conserved genome region between human and mouse (axtNet) | |
| Presence of CpG islands | |
| Counts of TC related mRNA, 5′-EST, GIS/GSC ditags | |
| Genomic Elements Database | TC |
| Predicted core promoter elements | |
| mRNA | |
| GIS/GSC ditag | |
| 5′- and 3′-ESTs | |
| Candidates of imprited transcripts in EICO DB | |
| Transcription factors listed in TFdb | |
| Gene prediction | |
| CpG islands | |
| Repeat detected by repeatmasker and tandem repeats finder | |
| Assemble gap | |
| Conserved genome region between human and mouse (axtNet) |
aTATA box, CCAAT box, GC box and initiator.
bRetrieved from the UCSC Genome Browser Database.
Figure 2Screenshots of CAGE Analysis Database: (A) a view of base pair scale expression within a TC, where CAGE tag count of each genome position is displayed in histogram, (B) a view of TC expressions within a TU, in which expression levels are represented by a heat map like representation, (C) a view of statistical significances of expression fluctuations between RNA samples, and their E-values are displayed in a matrix.