| Literature DB >> 23875683 |
Quang M Trinh1, Fei-Yang Arthur Jen, Ziru Zhou, Kar Ming Chu, Marc D Perry, Ellen T Kephart, Sergio Contrino, Peter Ruzanov, Lincoln D Stein.
Abstract
BACKGROUND: Funded by the National Institutes of Health (NIH), the aim of the Model Organism ENCyclopedia of DNA Elements (modENCODE) project is to provide the biological research community with a comprehensive encyclopedia of functional genomic elements for both model organisms C. elegans (worm) and D. melanogaster (fly). With a total size of just under 10 terabytes of data collected and released to the public, one of the challenges faced by researchers is to extract biologically meaningful knowledge from this large data set. While the basic quality control, pre-processing, and analysis of the data has already been performed by members of the modENCODE consortium, many researchers will wish to reinterpret the data set using modifications and enhancements of the original protocols, or combine modENCODE data with other data sets. Unfortunately this can be a time consuming and logistically challenging proposition.Entities:
Mesh:
Year: 2013 PMID: 23875683 PMCID: PMC3734164 DOI: 10.1186/1471-2164-14-494
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1modENCODE DCC data flow from submission to release.
modENCODE data sets, by category
| Chromatin structure | 50 |
| Copy Number Variation | 16 |
| Gene Structure | 425 |
| Histone modification and replacement | 475 |
| Metadata only | 4 |
| Other chromatin binding sites | 641 |
| RNA expression profiling | 930 |
| Replication | 39 |
| TF binding sites | 363 |
Figure 2(a) modENCODE tools can be installed via Galaxy administrator interface by clicking on ‘Admin’ and ‘Search and browse tool sheds’ (indicated by red boxes). (b) modENCODE Galaxy after installations of modENCODE tools and their dependencies.
Figure 3Data can be imported directly into Galaxy from our faceted browser.
Figure 4Running the 2-replicate uniform processing/peak calling workflow.
Figure 5Galaxy visualization of peak call output for chromosome II from the workflow for the two sample replicates.