Literature DB >> 26527727

ENCODE data at the ENCODE portal.

Cricket A Sloan¹, Esther T Chan¹, Jean M Davidson¹, Venkat S Malladi¹, J Seth Strattan¹, Benjamin C Hitz¹, Idan Gabdank¹, Aditi K Narayanan¹, Marcus Ho¹, Brian T Lee², Laurence D Rowe¹, Timothy R Dreszer¹, Greg Roe¹, Nikhil R Podduturi¹, Forrest Tanaka¹, Eurie L Hong¹, J Michael Cherry³.

Abstract

The Encyclopedia of DNA Elements (ENCODE) Project is in its third phase of creating a comprehensive catalog of functional elements in the human genome. This phase of the project includes an expansion of assays that measure diverse RNA populations, identify proteins that interact with RNA and DNA, probe regions of DNA hypersensitivity, and measure levels of DNA methylation in a wide range of cell and tissue types to identify putative regulatory elements. To date, results for almost 5000 experiments have been released for use by the scientific community. These data are available for searching, visualization and download at the new ENCODE Portal (www.encodeproject.org). The revamped ENCODE Portal provides new ways to browse and search the ENCODE data based on the metadata that describe the assays as well as summaries of the assays that focus on data provenance. In addition, it is a flexible platform that allows integration of genomic data from multiple projects. The portal experience was designed to improve access to ENCODE data by relying on metadata that allow reusability and reproducibility of the experiments.

Entities: Chemical Disease Species

Mesh：

Substances：
Proteins
RNA
DNA

Year: 2015 PMID： 26527727 PMCID： PMC4702836 DOI： 10.1093/nar/gkv1160

Source DB: PubMed Journal: Nucleic Acids Res ISSN： 0305-1048 Impact factor: 16.971

INTRODUCTION

The Encyclopedia of DNA Elements (ENCODE) Project began as a Pilot Project on 1% of the human genome (1,2). In 2007, the effort was scaled to whole-genome assays followed by expansion to performing similar assays in mouse (3–5). The ENCODE Project continues to create a comprehensive catalog of gene elements and functional elements in the human and mouse genomes by measuring RNA expression levels, identifying proteins that interact with RNA and DNA (such as modified histones, transcription factors and RNA-binding proteins), measuring the levels of DNA methylation and identifying regions of DNA hypersensitivity. These data generated by ENCODE Consortium members are submitted to the ENCODE Data Coordination Center (DCC) whose primary task is to curate, process and validate the data in preparation for release to the scientific community. To promote the sharing of data, the ENCODE consortium has updated its release policy (https://www.encodeproject.org/about/data-use-policy/). These data are made available at the ENCODE Portal (www.encodeproject.org), created by the DCC, and are also distributed through the UCSC genome browser, GEO and Ensembl and used extensively as a community resource (6–11). To increase the reproducibility, reusability and interoperability of the ENCODE data, the ENCODE DCC, in collaboration with the ENCODE consortium, has defined a new metadata standard. The experimental assays and computational methods used to generate the ENCODE data are represented by a structured data model to capture metadata that allows for maximal understanding and interpretation of these results (Hong et al., submitted). Integral to the metadata, persistent identifiers are assigned for experiments, files, antibody lots and biosamples to allow specific identification of the data. In addition, controlled vocabularies and ontologies are used to increase the interoperability of the ENCODE data with other projects (12). Specifically, the experiment metadata displayed on the portal utilize the Uber Anatomy Ontology (UBERON, uberon.org) for tissue biosamples; Cell Ontology (CL, cellontology.org) for primary cells; Experimental Factor Ontology (EFO, www.ebi.ac.uk/efo/) for immortalized cell lines; Ontology for Biomedical Investigations (OBI, obi-ontology.org) for experimental assays; Chemical Entities of Biological Interest (ChEBI, www.ebi.ac.uk/chebi/) for chemical treatments; and Sequence Ontology (SO, sequenceontology.org) for the nucleic acids of interest. As discussions about the issues of reproducibility and reusability of genomic data are becoming more significant (13,14), the new ENCODE Portal serves to communicate to the scientific community the experimental standards and guidelines defined by the entire Consortium, as well as the details of how the assays were performed and the computational methods used to analyze the data.

NEW ENCODE DATA PORTAL

The new ENCODE data portal, released in August 2014, is the canonical, central source for ENCODE raw data, analysis data, methods, standards and experimental metadata. The user experience begins with a dashboard-style homepage (Figure 1), that includes a news feed, menus and a quick start guide providing instruction on the various ways to access the data via browsing, searching, visualizing and downloading. A persistent standard menu at the top of every page provides stable navigation throughout the site. Within the menu are the categories of ‘Data’, ‘Methods’, ‘About ENCODE’ and ‘Help’. The ‘Data’ section provides initial search access into the data as well as a link to the new Release Policy that describes the terms of data use. The primary access to the ENCODE data is through the ‘Assays’ search page. This page allows one to browse replicated experimental sets, by faceted search selection on the left to filter experiments by features such as assay type, biosample type, organism or treatment. The biosample page provides similar browsing access to a listing of all individual biosamples used in the experiments. The antibody page provides access to the antibodies screened or used in the ENCODE immunoprecipitation experiments along with the required antibody characterizations generated by the project (15). The annotation section links publications and analysis files generated by the consortium towards the goal of creating an encyclopedia of DNA elements. This section also contain links to multiple browsers to view annotations created from collations of information from the many ENCODE experiments.

Figure 1.

Overview of the ENCODE Portal ‘dashboard-style’ home page. (a) The Data section provides links into search pages that allow filtering of the ENCODE data. (b) The Methods section contains standard documents, software and analysis pipeline information. (c) The About ENCODE section provides context to the ENCODE project, the site and access to relevant publications. (d) The Help section lists tutorials and guide pages. (e) The free-text search box for querying any metadata or text item. (f) The Quick Start guide has popular links to browsing and download information. (g) The News feed tracks updates and data released. Items (a) to (e) are part of a persistent menu bar that appears on every page to provide stable navigation throughout the site. The ‘Methods’ section includes information on the data standards, software, analysis pipelines and experimental guidelines defined by the consortium. The ENCODE consortium develops internal standards on the quality and procedures for both the experimental methods and the computational analysis. Current standards and guidelines are available from the portal as well as an archive containing previous versions. Additionally, a large variety of software is used in the analysis of these datasets. To improve transparency, the ENCODE Portal is hosting version information and links to source code for the software used in ENCODE project analysis. The goal is to provide sufficient information to allow anyone in the community to easily repeat the analysis if so desired. The ENCODE consortium is also developing uniform analysis pipelines that map raw sequencing data for various assays, generate visualizable signal files from those mappings and produce a quantification or annotation result. Summaries of the these pipelines, along with general specifications, are linked from the ‘Pipelines' page. The ‘About ENCODE’ section gives context to the wealth of experiments found on this site, providing a project overview, access to all news items and information about data access at other sites. This section also has the ‘Publications’ page which provides guided entry into browsing a curated collection of publications that either use ENCODE data or are references for the methods used in creating the ENCODE data. Finally, the ‘Help’ menu has a variety of help documents and tutorials to facilitate differing user experiences. Included in this section are instructions on how to browse the data in the UI, using the REST API programmatic download.

DATA ACCESS FEATURES

Access to the ENCODE data from the new portal is driven by high-quality metadata that provide data provenance and transparency by describing how the assays were performed and how the data were analyzed. In order to import the data from previous phases of the ENCODE Project, a massive data review was undertaken to match files with controls, to include any information that was found in the documentation, to work with laboratory groups to bring their older metadata up to the newer standards (including capturing protocols, sequencing platforms used, replicate structure, and shared biosamples when it could be known), to assign specific ontology terms for the biosamples and capture library preparation details. With this depth and breadth of metadata, the new portal is able to provide a more advanced level of searching, a more comprehensive view of the assays performed and visualization of the ENCODE data using the selected subset of the data.

Browsing and searching

ENCODE data can be accessed both by browsing via faceted search and by searching via keywords. The faceted search page, accessible via the ‘Assay’ link underneath the ‘Data’ menu item in the toolbar, provides a simplified interface that allows the filtration of thousands of assays to the ones of interest in a few clicks. Using a few simple terms like heart, RNA-seq and Homo sapiens can reduce the experimental dataset list from nearly 5000 to 8 relevant experiments. The keyword search is a string search, in the upper right hand corner, for biosamples, assays, GEO identifiers, ENCODE identifiers, targets or primary investigators. Ontologies are used to further expand the search into synonyms and derived_from relationships. This feature allows users to enter a term like ‘heart’ or ‘K562’ into the search box and find all relevant biosamples, targets and experiments. The results of this search can be further filtered by the aforementioned faceted search (Figure 2). Both keyword search and faceted search are driven by our rich metadata and the use of ontologies and controlled vocabularies for identifying the biosamples, targets, treatments and assays. A large curation effort was undertaken to identify the appropriate ontologies to use and the specific term identifications for biosamples for the current and past ENCODE projects (12). For more complex data searches, queries can be built using the fields from the schemas of our metadata (https://www.encodeproject.org/help/rest-api/). Although these search features can be used on any of our search pages for any of the objects (antibodies, biosamples, etc.), the primary access to the data is through the assay page. The results on this page represent an experimental collection of replicates. At any point in the searching and filtering process, the selected metadata can be downloaded in table format using the ‘Download’ button.

Figure 2.

Using search and filtering to select relevant experiments. The organization of key experiment metadata into facets allows the user to quickly filter from nearly 5000 to only eight experiments with a few simple terms. Entering ‘heart’ in the search box reduced the list to 105 experimental datasets. Filtering further on the terms ‘RNA-seq’ and ‘Homo sapiens’ using the facets limited the list even further down to 8 experiments.

Experiment pages

For any one of these replicated experimental sets, there is a dynamically generated page that collects all of the experiment's details and data. This page has five sections: the assay details, the protocol documents, replicate details, the files list and (if available) an interactive graph displaying the provenance of the data files. The assay details and the protocol documents provide a mixture of structured and unstructured metadata to fully contextualize the experiment and provide any important details required for interpretation and reproduction (Figure 3A and B). The replicate section is intended to give insight into the replicate structure of the experiment (Figure 3C). The standard replicate structure for most ENCODE experiments is two biological replicates per experiment (15). However, the portal is flexible enough to handle variation in the replicate structure. In addition, the biosample links in the replicate section, provide further details about donors, strains, growth or tissue excision details and treatments. If a cell line includes some type of transgenic construct or other genetic alterations, those details and protocols can also be found through the biosample page. The files list section has the raw data separated from any processed data associated with the experiment (Figure 3E). Details like the sequencing read length and run-type are associated with the raw data. Details such as which assembly and genome annotations were used during the generation of those processed files are found with those files in the processed data subsection. Any individual file can be downloaded from this page. When associated pipeline metadata are present for the data files, the pages additionally have a dynamically-generated graph that maps out the file relationships along with links to software and pipeline details that lead from one file to the next (Figure 3D). This graph includes all of the related files in the current experiment set and and any contributing files such as those from controls or reference sequence files. Each node in the graph is interactive and provides further details about the file or processing step and the graph structure allows a user to track the software provenance of each file. In addition to these five sections which allow context for the metadata and structure of the experiment, there is a ‘Visualize Data’ button that will create UCSC Genome browser tracks via ‘track hubs’ (16) for appropriate files.

Figure 3.

The ENCODE Portal Experiment Summary page. (A) Overview and details of the experiment; what assay was performed in what species using which biosamples. (B) Array of protocol documents (usually PDF files) describing the intricate details of experimental techniques performed. (C) Replicate structure indicating the number of biological and technical replicates with links to specifics about the growth or procurement of the sample. (D) The interactive graph of the file relationships with the software and pipeline provenance. Clicking on nodes of the graph will change the details displayed in the details section. (E) File listing of all files associated with the experiment with details and download links.

Dynamic track hubs

The UCSC Genome Browser provides a mechanism called a track hub to remotely host genomic data for integration into the Genome Browser's visualization tools. The majority of the released experiments at the ENCODE portal have visualizable signal or annotation files aligned to genomic coordinates in either bigWig or bigBed format (the primary file formats supported in the UCSC Genome Browser track hub feature) (17). When the user selects the Visualize Data button, all files of these formats are included in an automatically generated track hub that is immediately launched in the Genome Browser (Figure 4). These track hubs automatically includes a summary of the metadata describing the assay based on the relationship between the file and the experimental metadata. These summaries include items such as the target of the antibody in a ChIP-seq assay, a summary of the biosample and biological replicate number. This feature allows the user to easily integrate ENCODE data with their own custom tracks or the many reference tracks available at the Genome Browser. As well as being on the individual experiment page, this feature is available on the Assay search page. Once the user has filtered the list to a reasonable subset (currently <500), the ‘Visualize’ button will appear and a track hub including all appropriate files from every experiment on the list is assembled.

Figure 4.

The Visualize button on the Assay search page and the individual experiment page generates a UCSC Genome Browser track hub of all appropriate files with labels generated from the metadata.

FUTURE WORK

Here, we have described the new features of the ENCODE Portal: improved access to the experiment standards and methods developed by the ENCODE Consortium, metadata-driven searches to find relevant datasets and visualization of only relevant datasets. The ENCODE Portal will be continuously updated with new experimental data, processed data and analysis metadata along with enhanced searching and visualization features as the project continues. Specifically, we are planning on introducing the concept of series organization of experiments into our Assay page. With this feature, we will be able to include a collection of assays in a series of time points or differentiation stages in the faceted search along with replicated experimental sets. And to further improve searching, we are implementing ways to search the data files themselves for genomic regions, in order to direct the scientific community to relevant assays based on whether a binding site is present near a gene of interest or a gene is expressed in a specific tissue. In addition to providing improved access to the ENCODE data, the ENCODE Portal can serve as a gateway for other genomic projects. We are currently integrating metadata and data from other epigenetics projects into the site. Specifically we will be hosting the data of the recently funded Genomics of Gene Regulation project (http://www.genome.gov/27561317), the published datasets of the modENCODE integration papers (18–20) and Roadmap Epigenomics Mapping Consortium (http://www.roadmapepigenomics.org/) (21). This will allow searching across projects for related assays. The basic framework for the site was built to allow complex relations of experimental metadata and it is made available for other projects. It is currently being adopted by the Clinical Genome Resource (ClinGen) (22). We have worked with GEO at NCBI to map our biosample schema to their Biosample standard (www.ncbi.nlm.nih.gov/biosample) (11,23). Currently, we are working with a method to not simply import these data into their database, but also to periodically synchronize the two databases to provide access to the most up-to-date data produced by the ENCODE consortium to the larger scientific community.

CONTACT INFORMATION

General questions about the ENCODE data portal should be directed to the mailing list. For any specific questions about laboratory details or methods, the producing laboratory information is available on each experimental page and in the metadata for each file. All of the source code used for the portal website is open source and available through our repository hosted at GitHub (www.github.com/ENCODE-DCC/). You can sign up for an ENCODE announcement mailing list at https://mailman.stanford.edu/mailman/listinfo/encode-announce. Data and portal updates are available on our news page at www.encodeproject.org/news/. You can also follow us on twitter at @encodedcc.

21 in total

1. The ENCODE (ENCyclopedia Of DNA Elements) Project.

Authors:
Journal: Science Date: 2004-10-22 Impact factor: 47.728

2. Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project.

Authors: Ewan Birney; John A Stamatoyannopoulos; Anindya Dutta; Roderic Guigó; Thomas R Gingeras; Elliott H Margulies; Zhiping Weng; Michael Snyder; Emmanouil T Dermitzakis; Robert E Thurman; Michael S Kuehn; Christopher M Taylor; Shane Neph; Christoph M Koch; Saurabh Asthana; Ankit Malhotra; Ivan Adzhubei; Jason A Greenbaum; Robert M Andrews; Paul Flicek; Patrick J Boyle; Hua Cao; Nigel P Carter; Gayle K Clelland; Sean Davis; Nathan Day; Pawandeep Dhami; Shane C Dillon; Michael O Dorschner; Heike Fiegler; Paul G Giresi; Jeff Goldy; Michael Hawrylycz; Andrew Haydock; Richard Humbert; Keith D James; Brett E Johnson; Ericka M Johnson; Tristan T Frum; Elizabeth R Rosenzweig; Neerja Karnani; Kirsten Lee; Gregory C Lefebvre; Patrick A Navas; Fidencio Neri; Stephen C J Parker; Peter J Sabo; Richard Sandstrom; Anthony Shafer; David Vetrie; Molly Weaver; Sarah Wilcox; Man Yu; Francis S Collins; Job Dekker; Jason D Lieb; Thomas D Tullius; Gregory E Crawford; Shamil Sunyaev; William S Noble; Ian Dunham; France Denoeud; Alexandre Reymond; Philipp Kapranov; Joel Rozowsky; Deyou Zheng; Robert Castelo; Adam Frankish; Jennifer Harrow; Srinka Ghosh; Albin Sandelin; Ivo L Hofacker; Robert Baertsch; Damian Keefe; Sujit Dike; Jill Cheng; Heather A Hirsch; Edward A Sekinger; Julien Lagarde; Josep F Abril; Atif Shahab; Christoph Flamm; Claudia Fried; Jörg Hackermüller; Jana Hertel; Manja Lindemeyer; Kristin Missal; Andrea Tanzer; Stefan Washietl; Jan Korbel; Olof Emanuelsson; Jakob S Pedersen; Nancy Holroyd; Ruth Taylor; David Swarbreck; Nicholas Matthews; Mark C Dickson; Daryl J Thomas; Matthew T Weirauch; James Gilbert; Jorg Drenkow; Ian Bell; XiaoDong Zhao; K G Srinivasan; Wing-Kin Sung; Hong Sain Ooi; Kuo Ping Chiu; Sylvain Foissac; Tyler Alioto; Michael Brent; Lior Pachter; Michael L Tress; Alfonso Valencia; Siew Woh Choo; Chiou Yu Choo; Catherine Ucla; Caroline Manzano; Carine Wyss; Evelyn Cheung; Taane G Clark; James B Brown; Madhavan Ganesh; Sandeep Patel; Hari Tammana; Jacqueline Chrast; Charlotte N Henrichsen; Chikatoshi Kai; Jun Kawai; Ugrappa Nagalakshmi; Jiaqian Wu; Zheng Lian; Jin Lian; Peter Newburger; Xueqing Zhang; Peter Bickel; John S Mattick; Piero Carninci; Yoshihide Hayashizaki; Sherman Weissman; Tim Hubbard; Richard M Myers; Jane Rogers; Peter F Stadler; Todd M Lowe; Chia-Lin Wei; Yijun Ruan; Kevin Struhl; Mark Gerstein; Stylianos E Antonarakis; Yutao Fu; Eric D Green; Ulaş Karaöz; Adam Siepel; James Taylor; Laura A Liefer; Kris A Wetterstrand; Peter J Good; Elise A Feingold; Mark S Guyer; Gregory M Cooper; George Asimenos; Colin N Dewey; Minmei Hou; Sergey Nikolaev; Juan I Montoya-Burgos; Ari Löytynoja; Simon Whelan; Fabio Pardi; Tim Massingham; Haiyan Huang; Nancy R Zhang; Ian Holmes; James C Mullikin; Abel Ureta-Vidal; Benedict Paten; Michael Seringhaus; Deanna Church; Kate Rosenbloom; W James Kent; Eric A Stone; Serafim Batzoglou; Nick Goldman; Ross C Hardison; David Haussler; Webb Miller; Arend Sidow; Nathan D Trinklein; Zhengdong D Zhang; Leah Barrera; Rhona Stuart; David C King; Adam Ameur; Stefan Enroth; Mark C Bieda; Jonghwan Kim; Akshay A Bhinge; Nan Jiang; Jun Liu; Fei Yao; Vinsensius B Vega; Charlie W H Lee; Patrick Ng; Atif Shahab; Annie Yang; Zarmik Moqtaderi; Zhou Zhu; Xiaoqin Xu; Sharon Squazzo; Matthew J Oberley; David Inman; Michael A Singer; Todd A Richmond; Kyle J Munn; Alvaro Rada-Iglesias; Ola Wallerman; Jan Komorowski; Joanna C Fowler; Phillippe Couttet; Alexander W Bruce; Oliver M Dovey; Peter D Ellis; Cordelia F Langford; David A Nix; Ghia Euskirchen; Stephen Hartman; Alexander E Urban; Peter Kraus; Sara Van Calcar; Nate Heintzman; Tae Hoon Kim; Kun Wang; Chunxu Qu; Gary Hon; Rosa Luna; Christopher K Glass; M Geoff Rosenfeld; Shelley Force Aldred; Sara J Cooper; Anason Halees; Jane M Lin; Hennady P Shulha; Xiaoling Zhang; Mousheng Xu; Jaafar N S Haidar; Yong Yu; Yijun Ruan; Vishwanath R Iyer; Roland D Green; Claes Wadelius; Peggy J Farnham; Bing Ren; Rachel A Harte; Angie S Hinrichs; Heather Trumbower; Hiram Clawson; Jennifer Hillman-Jackson; Ann S Zweig; Kayla Smith; Archana Thakkapallayil; Galt Barber; Robert M Kuhn; Donna Karolchik; Lluis Armengol; Christine P Bird; Paul I W de Bakker; Andrew D Kern; Nuria Lopez-Bigas; Joel D Martin; Barbara E Stranger; Abigail Woodroffe; Eugene Davydov; Antigone Dimas; Eduardo Eyras; Ingileif B Hallgrímsdóttir; Julian Huppert; Michael C Zody; Gonçalo R Abecasis; Xavier Estivill; Gerard G Bouffard; Xiaobin Guan; Nancy F Hansen; Jacquelyn R Idol; Valerie V B Maduro; Baishali Maskeri; Jennifer C McDowell; Morgan Park; Pamela J Thomas; Alice C Young; Robert W Blakesley; Donna M Muzny; Erica Sodergren; David A Wheeler; Kim C Worley; Huaiyang Jiang; George M Weinstock; Richard A Gibbs; Tina Graves; Robert Fulton; Elaine R Mardis; Richard K Wilson; Michele Clamp; James Cuff; Sante Gnerre; David B Jaffe; Jean L Chang; Kerstin Lindblad-Toh; Eric S Lander; Maxim Koriabine; Mikhail Nefedov; Kazutoyo Osoegawa; Yuko Yoshinaga; Baoli Zhu; Pieter J de Jong
Journal: Nature Date: 2007-06-14 Impact factor: 49.962

3. BigWig and BigBed: enabling browsing of large distributed datasets.

Authors: W J Kent; A S Zweig; G Barber; A S Hinrichs; D Karolchik
Journal: Bioinformatics Date: 2010-07-17 Impact factor: 6.937

4. Providing Access to Genomic Variant Knowledge in a Healthcare Setting: A Vision for the ClinGen Electronic Health Records Workgroup.

Authors: C L Overby; B Heale; S Aronson; J M Cherry; S Dwight; A Milosavljevic; T Nelson; A Niehaus; M A Weaver; E M Ramos; M S Williams
Journal: Clin Pharmacol Ther Date: 2015-11-09 Impact factor: 6.875

5. Comparative analysis of metazoan chromatin organization.

Authors: Joshua W K Ho; Youngsook L Jung; Tao Liu; Burak H Alver; Soohyun Lee; Kohta Ikegami; Kyung-Ah Sohn; Aki Minoda; Michael Y Tolstorukov; Alex Appert; Stephen C J Parker; Tingting Gu; Anshul Kundaje; Nicole C Riddle; Eric Bishop; Thea A Egelhofer; Sheng'en Shawn Hu; Artyom A Alekseyenko; Andreas Rechtsteiner; Dalal Asker; Jason A Belsky; Sarah K Bowman; Q Brent Chen; Ron A-J Chen; Daniel S Day; Yan Dong; Andrea C Dose; Xikun Duan; Charles B Epstein; Sevinc Ercan; Elise A Feingold; Francesco Ferrari; Jacob M Garrigues; Nils Gehlenborg; Peter J Good; Psalm Haseley; Daniel He; Moritz Herrmann; Michael M Hoffman; Tess E Jeffers; Peter V Kharchenko; Paulina Kolasinska-Zwierz; Chitra V Kotwaliwale; Nischay Kumar; Sasha A Langley; Erica N Larschan; Isabel Latorre; Maxwell W Libbrecht; Xueqiu Lin; Richard Park; Michael J Pazin; Hoang N Pham; Annette Plachetka; Bo Qin; Yuri B Schwartz; Noam Shoresh; Przemyslaw Stempor; Anne Vielle; Chengyang Wang; Christina M Whittle; Huiling Xue; Robert E Kingston; Ju Han Kim; Bradley E Bernstein; Abby F Dernburg; Vincenzo Pirrotta; Mitzi I Kuroda; William S Noble; Thomas D Tullius; Manolis Kellis; David M MacAlpine; Susan Strome; Sarah C R Elgin; Xiaole Shirley Liu; Jason D Lieb; Julie Ahringer; Gary H Karpen; Peter J Park
Journal: Nature Date: 2014-08-28 Impact factor: 49.962

6. ENCODE whole-genome data in the UCSC genome browser (2011 update).

Authors: Brian J Raney; Melissa S Cline; Kate R Rosenbloom; Timothy R Dreszer; Katrina Learned; Galt P Barber; Laurence R Meyer; Cricket A Sloan; Venkat S Malladi; Krishna M Roskin; Bernard B Suh; Angie S Hinrichs; Hiram Clawson; Ann S Zweig; Vanessa Kirkup; Pauline A Fujita; Brooke Rhead; Kayla E Smith; Andy Pohl; Robert M Kuhn; Donna Karolchik; David Haussler; W James Kent
Journal: Nucleic Acids Res Date: 2010-10-30 Impact factor: 16.971

7. ENCODE whole-genome data in the UCSC Genome Browser: update 2012.

Authors: Kate R Rosenbloom; Timothy R Dreszer; Jeffrey C Long; Venkat S Malladi; Cricket A Sloan; Brian J Raney; Melissa S Cline; Donna Karolchik; Galt P Barber; Hiram Clawson; Mark Diekhans; Pauline A Fujita; Mary Goldman; Robert C Gravell; Rachel A Harte; Angie S Hinrichs; Vanessa M Kirkup; Robert M Kuhn; Katrina Learned; Morgan Maddren; Laurence R Meyer; Andy Pohl; Brooke Rhead; Matthew C Wong; Ann S Zweig; David Haussler; W James Kent
Journal: Nucleic Acids Res Date: 2011-11-09 Impact factor: 16.971

8. BioProject and BioSample databases at NCBI: facilitating capture and organization of metadata.

Authors: Tanya Barrett; Karen Clark; Robert Gevorgyan; Vyacheslav Gorelenkov; Eugene Gribov; Ilene Karsch-Mizrachi; Michael Kimelman; Kim D Pruitt; Sergei Resenchuk; Tatiana Tatusova; Eugene Yaschenko; James Ostell
Journal: Nucleic Acids Res Date: 2011-12-01 Impact factor: 16.971

9. ENCODE whole-genome data in the UCSC Genome Browser.

Authors: Kate R Rosenbloom; Timothy R Dreszer; Michael Pheasant; Galt P Barber; Laurence R Meyer; Andy Pohl; Brian J Raney; Ting Wang; Angie S Hinrichs; Ann S Zweig; Pauline A Fujita; Katrina Learned; Brooke Rhead; Kayla E Smith; Robert M Kuhn; Donna Karolchik; David Haussler; W James Kent
Journal: Nucleic Acids Res Date: 2009-11-17 Impact factor: 16.971

10. Integrative analysis of 111 reference human epigenomes.

Authors: Anshul Kundaje; Wouter Meuleman; Jason Ernst; Misha Bilenky; Angela Yen; Alireza Heravi-Moussavi; Pouya Kheradpour; Zhizhuo Zhang; Jianrong Wang; Michael J Ziller; Viren Amin; John W Whitaker; Matthew D Schultz; Lucas D Ward; Abhishek Sarkar; Gerald Quon; Richard S Sandstrom; Matthew L Eaton; Yi-Chieh Wu; Andreas R Pfenning; Xinchen Wang; Melina Claussnitzer; Yaping Liu; Cristian Coarfa; R Alan Harris; Noam Shoresh; Charles B Epstein; Elizabeta Gjoneska; Danny Leung; Wei Xie; R David Hawkins; Ryan Lister; Chibo Hong; Philippe Gascard; Andrew J Mungall; Richard Moore; Eric Chuah; Angela Tam; Theresa K Canfield; R Scott Hansen; Rajinder Kaul; Peter J Sabo; Mukul S Bansal; Annaick Carles; Jesse R Dixon; Kai-How Farh; Soheil Feizi; Rosa Karlic; Ah-Ram Kim; Ashwinikumar Kulkarni; Daofeng Li; Rebecca Lowdon; GiNell Elliott; Tim R Mercer; Shane J Neph; Vitor Onuchic; Paz Polak; Nisha Rajagopal; Pradipta Ray; Richard C Sallari; Kyle T Siebenthall; Nicholas A Sinnott-Armstrong; Michael Stevens; Robert E Thurman; Jie Wu; Bo Zhang; Xin Zhou; Arthur E Beaudet; Laurie A Boyer; Philip L De Jager; Peggy J Farnham; Susan J Fisher; David Haussler; Steven J M Jones; Wei Li; Marco A Marra; Michael T McManus; Shamil Sunyaev; James A Thomson; Thea D Tlsty; Li-Huei Tsai; Wei Wang; Robert A Waterland; Michael Q Zhang; Lisa H Chadwick; Bradley E Bernstein; Joseph F Costello; Joseph R Ecker; Martin Hirst; Alexander Meissner; Aleksandar Milosavljevic; Bing Ren; John A Stamatoyannopoulos; Ting Wang; Manolis Kellis
Journal: Nature Date: 2015-02-19 Impact factor: 69.504

194 in total

1. Self-Reporting Transposons Enable Simultaneous Readout of Gene Expression and Transcription Factor Binding in Single Cells.

Authors: Arnav Moudgil; Michael N Wilkinson; Xuhua Chen; June He; Alexander J Cammack; Michael J Vasek; Tomás Lagunas; Zongtai Qi; Matthew A Lalli; Chuner Guo; Samantha A Morris; Joseph D Dougherty; Robi D Mitra
Journal: Cell Date: 2020-07-24 Impact factor: 41.582

2. TICA: Transcriptional Interaction and Coregulation Analyzer.

Authors: Stefano Perna; Pietro Pinoli; Stefano Ceri; Limsoon Wong
Journal: Genomics Proteomics Bioinformatics Date: 2018-12-19 Impact factor: 7.691

3. An Improved Method for Identifying Specific DNA-Protein-Binding Sites In Vitro.

Authors: Liangyan Wang; Huizhi Lu; Yunguang Wang; Su Yang; Hong Xu; Kaiying Cheng; Ye Zhao; Bing Tian; Yuejin Hua
Journal: Mol Biotechnol Date: 2017-03 Impact factor: 2.695

4. Genetic Regulatory Mechanisms of Smooth Muscle Cells Map to Coronary Artery Disease Risk Loci.

Authors: Boxiang Liu; Milos Pjanic; Ting Wang; Trieu Nguyen; Michael Gloudemans; Abhiram Rao; Victor G Castano; Sylvia Nurnberg; Daniel J Rader; Susannah Elwyn; Erik Ingelsson; Stephen B Montgomery; Clint L Miller; Thomas Quertermous
Journal: Am J Hum Genet Date: 2018-08-23 Impact factor: 11.025

5. Dynamic chromatin accessibility modeled by Markov process of randomly-moving molecules in the 3D genome.

Authors: Yinan Wang; Caoqi Fan; Yuxuan Zheng; Cheng Li
Journal: Nucleic Acids Res Date: 2017-06-02 Impact factor: 16.971

6. Modeling Steatohepatitis in Humans with Pluripotent Stem Cell-Derived Organoids.

Authors: Rie Ouchi; Shodai Togo; Masaki Kimura; Tadahiro Shinozawa; Masaru Koido; Hiroyuki Koike; Wendy Thompson; Rebekah A Karns; Christopher N Mayhew; Patrick S McGrath; Heather A McCauley; Ran-Ran Zhang; Kyle Lewis; Shoyo Hakozaki; Autumn Ferguson; Norikazu Saiki; Yosuke Yoneyama; Ichiro Takeuchi; Yo Mabuchi; Chihiro Akazawa; Hiroshi Y Yoshikawa; James M Wells; Takanori Takebe
Journal: Cell Metab Date: 2019-05-30 Impact factor: 27.287

Review 7. The RNA 3D Motif Atlas: Computational methods for extraction, organization and evaluation of RNA motifs.

Authors: Lorena G Parlea; Blake A Sweeney; Maryam Hosseini-Asanjan; Craig L Zirbel; Neocles B Leontis
Journal: Methods Date: 2016-04-25 Impact factor: 3.608

8. Chromium disrupts chromatin organization and CTCF access to its cognate sites in promoters of differentially expressed genes.

Authors: Andrew VonHandorf; Francisco Javier Sánchez-Martín; Jacek Biesiada; Hongxia Zhang; Xiang Zhang; Mario Medvedovic; Alvaro Puga
Journal: Epigenetics Date: 2018-05-03 Impact factor: 4.528

9. SliceIt: A genome-wide resource and visualization tool to design CRISPR/Cas9 screens for editing protein-RNA interaction sites in the human genome.

Authors: Sasank Vemuri; Rajneesh Srivastava; Quoseena Mir; Seyedsasan Hashemikhabir; X Charlie Dong; Sarath Chandra Janga
Journal: Methods Date: 2019-09-05 Impact factor: 3.608

10. Resources for the Comprehensive Discovery of Functional RNA Elements.

Authors: Balaji Sundararaman; Lijun Zhan; Steven M Blue; Rebecca Stanton; Keri Elkins; Sara Olson; Xintao Wei; Eric L Van Nostrand; Gabriel A Pratt; Stephanie C Huelga; Brendan M Smalec; Xiaofeng Wang; Eurie L Hong; Jean M Davidson; Eric Lécuyer; Brenton R Graveley; Gene W Yeo
Journal: Mol Cell Date: 2016-03-17 Impact factor: 17.970