| Literature DB >> 26527727 |
Cricket A Sloan1, Esther T Chan1, Jean M Davidson1, Venkat S Malladi1, J Seth Strattan1, Benjamin C Hitz1, Idan Gabdank1, Aditi K Narayanan1, Marcus Ho1, Brian T Lee2, Laurence D Rowe1, Timothy R Dreszer1, Greg Roe1, Nikhil R Podduturi1, Forrest Tanaka1, Eurie L Hong1, J Michael Cherry3.
Abstract
The Encyclopedia of DNA Elements (ENCODE) Project is in its third phase of creating a comprehensive catalog of functional elements in the human genome. This phase of the project includes an expansion of assays that measure diverse RNA populations, identify proteins that interact with RNA and DNA, probe regions of DNA hypersensitivity, and measure levels of DNA methylation in a wide range of cell and tissue types to identify putative regulatory elements. To date, results for almost 5000 experiments have been released for use by the scientific community. These data are available for searching, visualization and download at the new ENCODE Portal (www.encodeproject.org). The revamped ENCODE Portal provides new ways to browse and search the ENCODE data based on the metadata that describe the assays as well as summaries of the assays that focus on data provenance. In addition, it is a flexible platform that allows integration of genomic data from multiple projects. The portal experience was designed to improve access to ENCODE data by relying on metadata that allow reusability and reproducibility of the experiments.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26527727 PMCID: PMC4702836 DOI: 10.1093/nar/gkv1160
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Overview of the ENCODE Portal ‘dashboard-style’ home page. (a) The Data section provides links into search pages that allow filtering of the ENCODE data. (b) The Methods section contains standard documents, software and analysis pipeline information. (c) The About ENCODE section provides context to the ENCODE project, the site and access to relevant publications. (d) The Help section lists tutorials and guide pages. (e) The free-text search box for querying any metadata or text item. (f) The Quick Start guide has popular links to browsing and download information. (g) The News feed tracks updates and data released. Items (a) to (e) are part of a persistent menu bar that appears on every page to provide stable navigation throughout the site.
Figure 2.Using search and filtering to select relevant experiments. The organization of key experiment metadata into facets allows the user to quickly filter from nearly 5000 to only eight experiments with a few simple terms. Entering ‘heart’ in the search box reduced the list to 105 experimental datasets. Filtering further on the terms ‘RNA-seq’ and ‘Homo sapiens’ using the facets limited the list even further down to 8 experiments.
Figure 3.The ENCODE Portal Experiment Summary page. (A) Overview and details of the experiment; what assay was performed in what species using which biosamples. (B) Array of protocol documents (usually PDF files) describing the intricate details of experimental techniques performed. (C) Replicate structure indicating the number of biological and technical replicates with links to specifics about the growth or procurement of the sample. (D) The interactive graph of the file relationships with the software and pipeline provenance. Clicking on nodes of the graph will change the details displayed in the details section. (E) File listing of all files associated with the experiment with details and download links.
Figure 4.The Visualize button on the Assay search page and the individual experiment page generates a UCSC Genome Browser track hub of all appropriate files with labels generated from the metadata.