| Literature DB >> 25723102 |
Marina Lizio, Jayson Harshbarger, Hisashi Shimoji, Jessica Severin, Takeya Kasukawa, Serkan Sahin, Imad Abugessaisa, Shiro Fukuda, Fumi Hori, Sachi Ishikawa-Kato, Christopher J Mungall, Erik Arner, J Kenneth Baillie, Nicolas Bertin, Hidemasa Bono, Michiel de Hoon, Alexander D Diehl, Emmanuel Dimont, Tom C Freeman, Kaori Fujieda, Winston Hide, Rajaram Kaliyaperumal, Toshiaki Katayama, Timo Lassmann, Terrence F Meehan, Koro Nishikata, Hiromasa Ono, Michael Rehli, Albin Sandelin, Erik A Schultes, Peter A C 't Hoen, Zuotian Tatum, Mark Thompson, Tetsuro Toyoda, Derek W Wright, Carsten O Daub, Masayoshi Itoh, Piero Carninci, Yoshihide Hayashizaki, Alistair R R Forrest, Hideya Kawaji.
Abstract
The FANTOM5 project investigates transcription initiation activities in more than 1,000 human and mouse primary cells, cell lines and tissues using CAGE. Based on manual curation of sample information and development of an ontology for sample classification, we assemble the resulting data into a centralized data resource (http://fantom.gsc.riken.jp/5/). This resource contains web-based tools and data-access points for the research community to search and extract data related to samples, genes, promoter activities, transcription factors and enhancers across the FANTOM5 atlas.Entities:
Mesh:
Year: 2015 PMID: 25723102 PMCID: PMC4310165 DOI: 10.1186/s13059-014-0560-6
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Figure 1FANTOM5 assay flow and the data archive. Sample collection and data processing are indicated in a schematic view (green boxes). The resulting data and the analysis are collected into corresponding directories in the data archive (orange). GO, gene ontology.
Data files available in the data archive
|
|
|
|
|---|---|---|
| Sample, RNA, and CAGE library information (metadata) | SDRF | /basic/*sdrf.txt |
| /basic/*CAGE/00_*assay_sdrf.txt | ||
| Ribosomal RNA hitting reads | FASTA | /basic/*CAGE/*nobarcode.rdna.fa.gz (1,385 files) |
| Mapping results (including unmapped reads) | BAM | /basic/*CAGE/*nobarcode.bam (1,385 files) |
| TSS profiles (counts of obtained 5'-end reads at 1 bp resolution) | BED | /basic/*CAGE/*ctss.bed.gz (1,385 files) |
| Sample classification based on the FANTOM Five Sample Ontology | OBO | /extra/Ontology/ff-phase1-*.obo |
| CAGE peaks (TSS clusters) | BED | /extra/CAGE_peaks/*.bed.gz |
| CAGE peak annotation (descriptions and gene association) | OSC | /extra/CAGE_peaks/*.ann.txt.gz |
| Expression of the CAGE peaks | OSC | /extra/CAGE_peaks/*.osc.txt.gz |
| Co-expression clustering | OSC | /extra/Co-expression_clusters/*_co-expression_modules.tar.gz |
|
| TXT | /extra/Motifs/novel_pwms.txt |
| Sample enrichment analysis | TXT | /extra/Sample_ontology_enrichment_of_CAGE_peaks /*.txt.gz |
| Gene ontology enrichment analysis of co-expression clusters | OSC | /extra/Co-expression_clusters/*co-expression_GOstats.tar.gz |
Figure 2Interfaces to FANTOM analyses. Scope and contents of the database systems.
Figure 3Access to the sample details. Detailed information of a sample, including regulatory information produced by computational analysis of its transcriptome, is summarized in a page (dotted box on the left). This page can be found by examining pages of listed samples or tissue cell types, or by looking at (dis)similarities of samples in a transcriptome space defined by expression clustering (right boxes).
Figure 4Inspection of a transcription factor and its activity. Detailed information on the most active CAGE peak at the SPI1 promoter is summarized in one page (dotted box, bottom left). This page can be found by examining the SPI1 gene on ZENBU (top left panel) or SSTAR (top right panel).