Literature DB >> 27899657

The eukaryotic promoter database in its 30th year: focus on non-vertebrate organisms.

René Dreos1, Giovanna Ambrosini2,3, Romain Groux2, Rouaïda Cavin Périer2, Philipp Bucher2,3.   

Abstract

We present an update of the Eukaryotic Promoter Database EPD (http://epd.vital-it.ch), more specifically on the EPDnew division, which contains comprehensive organisms-specific transcription start site (TSS) collections automatically derived from next generation sequencing (NGS) data. Thanks to the abundant release of new high-throughput transcript mapping data (CAGE, TSS-seq, GRO-cap) the database could be extended to plant and fungal species. We further report on the expansion of the mass genome annotation (MGA) repository containing promoter-relevant chromatin profiling data and on improvements for the EPD entry viewers. Finally, we present a new data access tool, ChIP-Extract, which enables computational biologists to extract diverse types of promoter-associated data in numerical table formats that are readily imported into statistical analysis platforms such as R.
© The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Year:  2016        PMID: 27899657      PMCID: PMC5210552          DOI: 10.1093/nar/gkw1069

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

EPD is an old promoter resource first published as a table in a journal article (1) and shortly afterwards distributed in machine-readable form (first on magnetic tapes then via the internet). Promoters are conceptually and operationally defined as transcription start sites or initiation regions. EPD was initially a manually compiled and curated database, strictly relying on critical assessment of experimental data published in journal articles. From the beginning, it was a sequence annotation resource not a sequence collection. The representative TSS of a promoter was defined by an accession number and a sequence position in an EMBL Nucleotide Sequence Library entry. A detailed description of the scope, contents, format and maintenance procedures of the old, manually compiled part of EPD can be found in (2). The advent of ultra-high-throughput protocols for genome-wide TSS mapping forced us to completely revise our data acquisition and curation procedures. The result of this major redesign is EPDnew, a computationally generated database derived from electronically distributed primary data. EPD thus now consists of two parts: (i) the old, manually curated part containing promoters from more than 100 different species all contained in a single file and (ii) EPDnew, which consists of multiple files, each containing a comprehensive TSS collection for an important eukaryotic model organisms. These modules are independent entities conforming to minimal data representation standards. For instance, each model organism has its own entry viewer displaying different types of promoter-associated genomic features and hyperlinking to different external data resources. The design principles of EPDnew were already explained in (3). Here, we present only a short summary in form of a flowchart shown in Figure 1. The development, generation and quality control of an EPDnew module is shortly explained in the accompanying Figure legend.
Figure 1.

Schematic representation of the EPDnew development and production pipeline. (A) Download of authoritative gene catalogs and primary TSS mapping data from public databases, data repositories and consortium websites. (B) Quality control (QC) of incoming data (e.g. read mapping efficiency, contaminations, etc.). (C) Data passing QC are reformatted and incorporated into the MGA repository. (D) Selection of a subset of TSS mapping experiments for generating a new organism-specific TSS collection. (E) Input data for a new module of EPDnew. (F) Organism-specific automatic database assembly pipeline tailored to the input data, see (3) for a detailed description of the human EPDnew assembly pipeline. (G) Preliminary or final TSS collection (H) Manual sanity checks of individual randomly selected promoter entries using the corresponding entry viewer, see Figure 2D for an example of an entry view. (I) Automatic quality evaluation of the TSS collections as a whole by motif enrichment tests, see Figure 1A for an example and ref (22) for an explanation of the method. (L) Feedback is collected from quality evaluation steps H and I. This may lead to the exclusion, replacement or addition of source data sets or modifications (e.g. program parameter fine-tuning) of the computational database generation pipeline. Note that the development of a final, publicly released EPDNnew module typically involves several evaluation-modification cycles.

Schematic representation of the EPDnew development and production pipeline. (A) Download of authoritative gene catalogs and primary TSS mapping data from public databases, data repositories and consortium websites. (B) Quality control (QC) of incoming data (e.g. read mapping efficiency, contaminations, etc.). (C) Data passing QC are reformatted and incorporated into the MGA repository. (D) Selection of a subset of TSS mapping experiments for generating a new organism-specific TSS collection. (E) Input data for a new module of EPDnew. (F) Organism-specific automatic database assembly pipeline tailored to the input data, see (3) for a detailed description of the human EPDnew assembly pipeline. (G) Preliminary or final TSS collection (H) Manual sanity checks of individual randomly selected promoter entries using the corresponding entry viewer, see Figure 2D for an example of an entry view. (I) Automatic quality evaluation of the TSS collections as a whole by motif enrichment tests, see Figure 1A for an example and ref (22) for an explanation of the method. (L) Feedback is collected from quality evaluation steps H and I. This may lead to the exclusion, replacement or addition of source data sets or modifications (e.g. program parameter fine-tuning) of the computational database generation pipeline. Note that the development of a final, publicly released EPDNnew module typically involves several evaluation-modification cycles.
Figure 2.

EPDnew analysis and tools. (Instructions how to generate these figures via the EPD web server are given in Supplementary Data.) (A) TATA-box (continuous lines) and Initiatior (Inr, dotted lines) occurrence profiles in three H. sapiens promoters databases. This picture has been obtained with the use of OProf from the SSA program package for two EPDnew versions (3 and 4) and from a list gene starts from the UCSC Gene list, which was used as input for the generation of the EPDnew collections. (B) Distribution of nucleosomes around S. cerevisiae, H. sapiens and D. rerio promoters. The Figure is based on MNase-seq data from (13–15) and has been made with the ChIP-Cor tool from the ChIP-Seq server (5). The MNase-seq data are stored in the MGA repository and are directly accessible via a pull-down menu from the ChIP-Cor input form. This comparative analysis shows the differences in the position of the N+1 (+40 from the TSS for S. cerevisiae and +120 for H. sapiens and D. rerio), distance between two consecutive nucleosomes (+160 in S. cerevisiae and +180 in H. sapiens and D. rerio) and length of the nucleosome-free region for the three organisms. (C) An example of ChIP-Extract output to study nucleosome maps around S. cerevisiae promoters. Each row in the matrix represents a promoter whereas each column the counts of MNase-seq reads found at a specific distance from the TSS. In this example the ordering option in ChIP-Extract has been turned on, which orders rows according to their similarity with the average signal (shown in Figure 2B). This simple procedure shows that some yeast promoters do not have the expected chromatin organization. Data is from (15). (D) An example of D. rerio EPD Hub visualized at the UCSC Genome Browser for the promoter of the ccni gene. The single experiment CAGE tracks shows the promoter shifting from maternally induced TSS (top lines) to zygotic specific TSS (bottom lines) (21). The blue icon near the bottom shows the TSS assignment of the corresponding promoter entry in EPDnew. Note that the two narrow TSS clusters are represented by only one promoter entry since they are too close to each other. The minimum distance requirement for two separate alternative promoters in EPDnew is 100 bp. In such cases, the EPDviewer provides essential information to users interested in the very details of the transcription initiation patterns.

EPDnew is tightly integrated with two accessory bioinformatics resources, the Signal Search Analysis (SSA) (4) and ChIP-Seq servers (5). The former offers tools for DNA motif-oriented analysis, the latter for exploring and downloading promoter-associated functional genomics data. More about the use of these resources in conjunction with EPDnew can be found in (6). The reason why we keep these tools separate, is because they are useful in many other contexts, for instance for ChIP-seq data analysis.

RECENT DEVELOPMENTS

Extension of EPDnew to plants and fungi

The content of EPDnew has substantially increased over the last two years. In our previous paper (6), we presented promoter collections for five model organisms, all animals (Homo sapiens, Mus musculus, Drosophila melanogaster, Danio rerio and Caenorhabditis elegans), totaling together more than 75 thousand entries. In the meantime, the number of promoters for H. sapiens and D. melanogaster has increased; both databases are approaching complete coverage with >92% of protein coding genes covered by at least one validated promoter (Table 1). In addition, we were able to extend EPDnew to five new organisms: a new insect (Apis mellifera), two plant species (a dicotyledonous, Arabidopsis thaliana and a monocotyledonous, Zea mays) and two fungi (Saccharomyces cerevisiae and Schizosaccharomyces pombe). These five new databases greatly extend EPDnew to non-animal species allowing scientists to perform comparative studies of promoter features and organization (7). The expansion of EPDnew to novel organisms has prompted us to add links to species-specific databases such as TAIR for A. thaliana (8) or PomBase for S. pombe (9). Moreover, in order to facilitate the conversion of promoter lists to different genome assemblies, we recently added a genome coordinate conversion (liftOver) tool (10) to our promoter selection and download pages for all EPDnew databases corresponding to an organism that is supported by the UCSC Genome Browser.
Table 1.

Current contents of EPDnew

Organism, versionPromoters, genesaTSS librariesbChromatin data MNase–DnasecChIP-seq samples histones–PIC–TFsd
H. sapiens (4)25 503, 17 785 (95%)108823–9982231–491–3794
M. musculus (2)21 239, 17 565 (90%)3394–0174–60–384
D. melanogaster (2)15 073, 12 603 (92%)576–2329–12–189
D. rerio (1)10 728, 10 235 (43%)124–412–3–1
C. elegans (1)7120, 6363 (32%)86–62–1–3
A. mellifera (1)6493, 5712 (53%)160–00–0
A. thaliana (1)10 229, 10 177 (37%)10–00–0–32
Z. mays (1)17 081, 15 828 (59%)80–08–0–0
S. cerevisiae (2)5117, 5110 (88%)191–270–8–17
S. pombe (1)3440, 3438 (67%)18–86–0–51

aIn parenthesis is indicated the percentage of genes coverage.

bCAGE, GRO-cap and TSS-seq samples used to build the relative database.

cMNase-seq and DNase-seq samples that are present in the MGA repository.

dChIP-seq samples for histone marks and variants (such as H3K4me3, H2A.Z, H3), components of the PIC (such as Pol-II, TFIID, TFIIB, TBP, etc.) and Transcription Factors that are present in the MGA repository.

aIn parenthesis is indicated the percentage of genes coverage. bCAGE, GRO-cap and TSS-seq samples used to build the relative database. cMNase-seq and DNase-seq samples that are present in the MGA repository. dChIP-seq samples for histone marks and variants (such as H3K4me3, H2A.Z, H3), components of the PIC (such as Pol-II, TFIID, TFIIB, TBP, etc.) and Transcription Factors that are present in the MGA repository.

Increased precision of the human promoter collection

The H. sapiens database is at its fourth release and it has been generated using more than a thousand samples totaling >20 billion reads (data from ENCODE (11) and FANTOM5 (12) consortia). It has the largest collection of data among EPDnew databases and can be taken as a model on how other EPDnew databases will evolve in the near future. Although the number of samples used in this release is more than six times the previous, the database is reaching saturation (coverage of 95%) and as a consequence the increase in promoter numbers and gene coverage is not as significant (25 503 promoters for v004, 23 360 for v003). In this case, the addition of many more samples did not lead to the finding of many new promoters but to an overall increase in TSS mapping precision. This can be seen in the positional distributions of core promoter elements, which are expected to be found at fixed distance from the TSS (Figure 2A). The distribution of both the TATA-box and the Inr motifs within the different promoter collections show an increased frequency at the expected positions for the newer version compared to the other, indicating an increased quality for the latest version. We can predict that the M. musculus and D. melanogaster databases will soon follow the same trend. As their coverage surpasses the 90% limit, the addition of new samples will not lead to more promoters validated but to better estimates of TSS positions for existing promoters. Note that the curves shown in Figure 2A were generated with the OProf tool from the SSA server, which is directly accessible from the EPD web site. Detailed instructions how to reproduce the results are given in Supplemental Data. EPDnew analysis and tools. (Instructions how to generate these figures via the EPD web server are given in Supplementary Data.) (A) TATA-box (continuous lines) and Initiatior (Inr, dotted lines) occurrence profiles in three H. sapiens promoters databases. This picture has been obtained with the use of OProf from the SSA program package for two EPDnew versions (3 and 4) and from a list gene starts from the UCSC Gene list, which was used as input for the generation of the EPDnew collections. (B) Distribution of nucleosomes around S. cerevisiae, H. sapiens and D. rerio promoters. The Figure is based on MNase-seq data from (13–15) and has been made with the ChIP-Cor tool from the ChIP-Seq server (5). The MNase-seq data are stored in the MGA repository and are directly accessible via a pull-down menu from the ChIP-Cor input form. This comparative analysis shows the differences in the position of the N+1 (+40 from the TSS for S. cerevisiae and +120 for H. sapiens and D. rerio), distance between two consecutive nucleosomes (+160 in S. cerevisiae and +180 in H. sapiens and D. rerio) and length of the nucleosome-free region for the three organisms. (C) An example of ChIP-Extract output to study nucleosome maps around S. cerevisiae promoters. Each row in the matrix represents a promoter whereas each column the counts of MNase-seq reads found at a specific distance from the TSS. In this example the ordering option in ChIP-Extract has been turned on, which orders rows according to their similarity with the average signal (shown in Figure 2B). This simple procedure shows that some yeast promoters do not have the expected chromatin organization. Data is from (15). (D) An example of D. rerio EPD Hub visualized at the UCSC Genome Browser for the promoter of the ccni gene. The single experiment CAGE tracks shows the promoter shifting from maternally induced TSS (top lines) to zygotic specific TSS (bottom lines) (21). The blue icon near the bottom shows the TSS assignment of the corresponding promoter entry in EPDnew. Note that the two narrow TSS clusters are represented by only one promoter entry since they are too close to each other. The minimum distance requirement for two separate alternative promoters in EPDnew is 100 bp. In such cases, the EPDviewer provides essential information to users interested in the very details of the transcription initiation patterns.

New data in the MGA repository

As usual, the source data from which the current versions of EPDnew were derived is available in standardized format in the MGA repository (3), the back-end data archive used by EPD and the other tools developed by our group. This repository is not restricted to TSS-related data only (such as CAGE, GRO-cap, etc.) but can potentially contain any data set that can be represented as single coordinates in the genome. Examples are genome annotations (TSS, CDS, Intron-exon boundaries, transcripts ends, etc.), ChIP-seq samples (transcription factors, histones marks, etc.), MNase-seq samples, SNPs and conservation scores. Currently, it contains >11 000 samples. The recent addition of samples related to chromatin structure and promoter activity, such as ChIP-seq experiments on histone marks, Pol-II and components of the pre-initiation complex (PIC), gives substantial value to EPD as well (Table 1), as all these samples are accessible by the EPD accessory data analysis tools and can be used to study promoter function in greater details. One example of the use of MGA samples combined with EPDnew is shown in Figure 2B. It involves public MNase-seq data from human, zebra fish and yeast (13–15) and addresses the question whether the canonical nucleosome organization of promoters differs between eukaryotic species as has been reported previously (16,17). The most striking difference revealed by this analysis is the position of the first nucleosome downstream of the TSS: in vertebrate it is centered at about pos. +120 and does not cover the TSS whereas in yeast it occurs at +40 and thus includes the TSS. These results were generated with the ChIP-Cor tool from the ChIP-Seq server. ChIP-Cor computes the distribution of a chromatin feature (here MNase-seq reads) relative to a set of genomic positions (here TSSs). The analysis can be reproduced with a few mouse clicks starting from the EPDnew home page (see Supplementary Data).

ChIP-extract: a new tool to download promoter data in numerical table format

Recently we added a new tool called ChIP-Extract to the ChIP-Seq resource. ChIP-Extract enables computational biologists to extract promoter relevant data from the MGA repository in table format for downstream processing with other tools (e.g. R software). The output is a matrix with each row representing a promoter and each column a distance range relative to the TSS. Each cell then contains the number of sequence reads (or any other kind of genomic feature) that are found at a particular distance from the TSS in a particular promoter. In addition to a tab-delimited text file, the ChIP-Extract server returns a graphical representation of the data as a heatmap. Figure 2C shows the distribution of MNase-seq reads around S. cerevisiae promoters. Note that in this picture, the rows have been reordered according to their similarity to the average MNase-seq profile. However, the main purpose of the ChIP-Extract tool is to export the data for analysis with locally installed software tools. An example of such downstream analysis of promoter/MNase-seq data can be found in Figure 4 of (18). There, a probabilistic partitioning algorithm was used for the identification of human promoter subclasses based on nucleosome distribution.

Improvement and reorganization of the EPD viewer

In 2013 we first introduced the EPD viewer for H. sapiens (3) based on a careful selection of the tracks to be visualized in the UCSC Genome Browser. We developed it with the intent to provide a customizable visualization platform to explore promoter-relevant genomic features (experimental, computationally derived, and manually annotated) of individual promoters. As the number of EPDnew databases grew, we reorganized and extended the viewer to all other organisms that were supported by the UCSC Genome Browser. To achieve this, we developed a track hub (19) as a web-accessible directory tree containing the genomic data visualized in the Genome Browser. The hub has a minimal composition of 3 EPD-specific tracks: the combined TSS mapping samples used in the EPD assembly pipeline at single base pair resolution for the plus and minus strands separately and the EPD promoter track. Other computationally derived and annotation tracks are often present such as a gene track, the conservation scores and repetitive element tracks and, when available, a CpG island track. Additionally, other data might be visualized if the corresponding samples are present in the MGA repository such as promoter specific ChIP-seq samples (Pol-II and H3K4me3); enhancer specific markers (H3K4me1); selected CAGE samples from representative cell lines or tissues organized as a track set. Following these lines, the human viewer has been updated with new global CAGE tracks and single CAGE samples for several cell lines (11), the viewers for mouse and D. melanogaster with CAGE samples for different tissues (12,20), and the D. rerio viewer with CAGE samples for early embryonic developmental stages (21). Figure 2D shows an EPD viewer snapshot for D. rerio with chromatin and CAGE tracks displayed for the ccni_1 promoter. This example was adopted from a recent paper (21) reporting that TSS positions of some zebra fish genes shift during early development. As seen in the picture, zygotic transcripts of the ccni_1 initiate about 60 bp upstream of the maternal TSS.

ACCESS

EPD and EPDnew are freely accessible without need for preregistration. Web-based access is provided via the EPD web site at http://epd.vital-it.ch/. Data files can be downloaded via FTP from ftp://ccg.vital-it.ch/.
  20 in total

1.  The Eukaryotic Promoter Database EPD: the impact of in silico primer extension.

Authors:  Christoph D Schmid; Viviane Praz; Mauro Delorenzi; Rouaïda Périer; Philipp Bucher
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

2.  High-throughput mapping of the chromatin structure of human promoters.

Authors:  Fatih Ozsolak; Jun S Song; X Shirley Liu; David E Fisher
Journal:  Nat Biotechnol       Date:  2007-01-14       Impact factor: 54.908

3.  Compilation and analysis of eukaryotic POL II promoter sequences.

Authors:  P Bucher; E N Trifonov
Journal:  Nucleic Acids Res       Date:  1986-12-22       Impact factor: 16.971

4.  A promoter-level mammalian expression atlas.

Authors:  Alistair R R Forrest; Hideya Kawaji; Michael Rehli; J Kenneth Baillie; Michiel J L de Hoon; Vanja Haberle; Timo Lassmann; Ivan V Kulakovskiy; Marina Lizio; Masayoshi Itoh; Robin Andersson; Christopher J Mungall; Terrence F Meehan; Sebastian Schmeier; Nicolas Bertin; Mette Jørgensen; Emmanuel Dimont; Erik Arner; Christian Schmidl; Ulf Schaefer; Yulia A Medvedeva; Charles Plessy; Morana Vitezic; Jessica Severin; Colin A Semple; Yuri Ishizu; Robert S Young; Margherita Francescatto; Intikhab Alam; Davide Albanese; Gabriel M Altschuler; Takahiro Arakawa; John A C Archer; Peter Arner; Magda Babina; Sarah Rennie; Piotr J Balwierz; Anthony G Beckhouse; Swati Pradhan-Bhatt; Judith A Blake; Antje Blumenthal; Beatrice Bodega; Alessandro Bonetti; James Briggs; Frank Brombacher; A Maxwell Burroughs; Andrea Califano; Carlo V Cannistraci; Daniel Carbajo; Yun Chen; Marco Chierici; Yari Ciani; Hans C Clevers; Emiliano Dalla; Carrie A Davis; Michael Detmar; Alexander D Diehl; Taeko Dohi; Finn Drabløs; Albert S B Edge; Matthias Edinger; Karl Ekwall; Mitsuhiro Endoh; Hideki Enomoto; Michela Fagiolini; Lynsey Fairbairn; Hai Fang; Mary C Farach-Carson; Geoffrey J Faulkner; Alexander V Favorov; Malcolm E Fisher; Martin C Frith; Rie Fujita; Shiro Fukuda; Cesare Furlanello; Masaaki Furino; Jun-ichi Furusawa; Teunis B Geijtenbeek; Andrew P Gibson; Thomas Gingeras; Daniel Goldowitz; Julian Gough; Sven Guhl; Reto Guler; Stefano Gustincich; Thomas J Ha; Masahide Hamaguchi; Mitsuko Hara; Matthias Harbers; Jayson Harshbarger; Akira Hasegawa; Yuki Hasegawa; Takehiro Hashimoto; Meenhard Herlyn; Kelly J Hitchens; Shannan J Ho Sui; Oliver M Hofmann; Ilka Hoof; Furni Hori; Lukasz Huminiecki; Kei Iida; Tomokatsu Ikawa; Boris R Jankovic; Hui Jia; Anagha Joshi; Giuseppe Jurman; Bogumil Kaczkowski; Chieko Kai; Kaoru Kaida; Ai Kaiho; Kazuhiro Kajiyama; Mutsumi Kanamori-Katayama; Artem S Kasianov; Takeya Kasukawa; Shintaro Katayama; Sachi Kato; Shuji Kawaguchi; Hiroshi Kawamoto; Yuki I Kawamura; Tsugumi Kawashima; Judith S Kempfle; Tony J Kenna; Juha Kere; Levon M Khachigian; Toshio Kitamura; S Peter Klinken; Alan J Knox; Miki Kojima; Soichi Kojima; Naoto Kondo; Haruhiko Koseki; Shigeo Koyasu; Sarah Krampitz; Atsutaka Kubosaki; Andrew T Kwon; Jeroen F J Laros; Weonju Lee; Andreas Lennartsson; Kang Li; Berit Lilje; Leonard Lipovich; Alan Mackay-Sim; Ri-ichiroh Manabe; Jessica C Mar; Benoit Marchand; Anthony Mathelier; Niklas Mejhert; Alison Meynert; Yosuke Mizuno; David A de Lima Morais; Hiromasa Morikawa; Mitsuru Morimoto; Kazuyo Moro; Efthymios Motakis; Hozumi Motohashi; Christine L Mummery; Mitsuyoshi Murata; Sayaka Nagao-Sato; Yutaka Nakachi; Fumio Nakahara; Toshiyuki Nakamura; Yukio Nakamura; Kenichi Nakazato; Erik van Nimwegen; Noriko Ninomiya; Hiromi Nishiyori; Shohei Noma; Shohei Noma; Tadasuke Noazaki; Soichi Ogishima; Naganari Ohkura; Hiroko Ohimiya; Hiroshi Ohno; Mitsuhiro Ohshima; Mariko Okada-Hatakeyama; Yasushi Okazaki; Valerio Orlando; Dmitry A Ovchinnikov; Arnab Pain; Robert Passier; Margaret Patrikakis; Helena Persson; Silvano Piazza; James G D Prendergast; Owen J L Rackham; Jordan A Ramilowski; Mamoon Rashid; Timothy Ravasi; Patrizia Rizzu; Marco Roncador; Sugata Roy; Morten B Rye; Eri Saijyo; Antti Sajantila; Akiko Saka; Shimon Sakaguchi; Mizuho Sakai; Hiroki Sato; Suzana Savvi; Alka Saxena; Claudio Schneider; Erik A Schultes; Gundula G Schulze-Tanzil; Anita Schwegmann; Thierry Sengstag; Guojun Sheng; Hisashi Shimoji; Yishai Shimoni; Jay W Shin; Christophe Simon; Daisuke Sugiyama; Takaai Sugiyama; Masanori Suzuki; Naoko Suzuki; Rolf K Swoboda; Peter A C 't Hoen; Michihira Tagami; Naoko Takahashi; Jun Takai; Hiroshi Tanaka; Hideki Tatsukawa; Zuotian Tatum; Mark Thompson; Hiroo Toyodo; Tetsuro Toyoda; Elvind Valen; Marc van de Wetering; Linda M van den Berg; Roberto Verado; Dipti Vijayan; Ilya E Vorontsov; Wyeth W Wasserman; Shoko Watanabe; Christine A Wells; Louise N Winteringham; Ernst Wolvetang; Emily J Wood; Yoko Yamaguchi; Masayuki Yamamoto; Misako Yoneda; Yohei Yonekura; Shigehiro Yoshida; Susan E Zabierowski; Peter G Zhang; Xiaobei Zhao; Silvia Zucchelli; Kim M Summers; Harukazu Suzuki; Carsten O Daub; Jun Kawai; Peter Heutink; Winston Hide; Tom C Freeman; Boris Lenhard; Vladimir B Bajic; Martin S Taylor; Vsevolod J Makeev; Albin Sandelin; David A Hume; Piero Carninci; Yoshihide Hayashizaki
Journal:  Nature       Date:  2014-03-27       Impact factor: 49.962

5.  The Arabidopsis Information Resource (TAIR): improved gene annotation and new tools.

Authors:  Philippe Lamesch; Tanya Z Berardini; Donghui Li; David Swarbreck; Christopher Wilks; Rajkumar Sasidharan; Robert Muller; Kate Dreher; Debbie L Alexander; Margarita Garcia-Hernandez; Athikkattuvalasu S Karthikeyan; Cynthia H Lee; William D Nelson; Larry Ploetz; Shanker Singh; April Wensel; Eva Huala
Journal:  Nucleic Acids Res       Date:  2011-12-02       Impact factor: 16.971

6.  Dynamic regulation of the transcription initiation landscape at single nucleotide resolution during vertebrate embryogenesis.

Authors:  Chirag Nepal; Yavor Hadzhiev; Christopher Previti; Vanja Haberle; Nan Li; Hazuki Takahashi; Ana Maria M Suzuki; Ying Sheng; Rehab F Abdelhamid; Santosh Anand; Jochen Gehrig; Altuna Akalin; Christel E M Kockx; Antoine A J van der Sloot; Wilfred F J van Ijcken; Olivier Armant; Sepand Rastegar; Craig Watson; Uwe Strähle; Elia Stupka; Piero Carninci; Boris Lenhard; Ferenc Müller
Journal:  Genome Res       Date:  2013-09-03       Impact factor: 9.043

7.  Two independent transcription initiation codes overlap on vertebrate core promoters.

Authors:  Vanja Haberle; Nan Li; Yavor Hadzhiev; Charles Plessy; Christopher Previti; Chirag Nepal; Jochen Gehrig; Xianjun Dong; Altuna Akalin; Ana Maria Suzuki; Wilfred F J van IJcken; Olivier Armant; Marco Ferg; Uwe Strähle; Piero Carninci; Ferenc Müller; Boris Lenhard
Journal:  Nature       Date:  2014-02-16       Impact factor: 49.962

8.  EPD and EPDnew, high-quality promoter resources in the next-generation sequencing era.

Authors:  René Dreos; Giovanna Ambrosini; Rouayda Cavin Périer; Philipp Bucher
Journal:  Nucleic Acids Res       Date:  2012-11-27       Impact factor: 16.971

9.  Influence of Rotational Nucleosome Positioning on Transcription Start Site Selection in Animal Promoters.

Authors:  René Dreos; Giovanna Ambrosini; Philipp Bucher
Journal:  PLoS Comput Biol       Date:  2016-10-07       Impact factor: 4.475

10.  Landscape of transcription in human cells.

Authors:  Sarah Djebali; Carrie A Davis; Angelika Merkel; Alex Dobin; Timo Lassmann; Ali Mortazavi; Andrea Tanzer; Julien Lagarde; Wei Lin; Felix Schlesinger; Chenghai Xue; Georgi K Marinov; Jainab Khatun; Brian A Williams; Chris Zaleski; Joel Rozowsky; Maik Röder; Felix Kokocinski; Rehab F Abdelhamid; Tyler Alioto; Igor Antoshechkin; Michael T Baer; Nadav S Bar; Philippe Batut; Kimberly Bell; Ian Bell; Sudipto Chakrabortty; Xian Chen; Jacqueline Chrast; Joao Curado; Thomas Derrien; Jorg Drenkow; Erica Dumais; Jacqueline Dumais; Radha Duttagupta; Emilie Falconnet; Meagan Fastuca; Kata Fejes-Toth; Pedro Ferreira; Sylvain Foissac; Melissa J Fullwood; Hui Gao; David Gonzalez; Assaf Gordon; Harsha Gunawardena; Cedric Howald; Sonali Jha; Rory Johnson; Philipp Kapranov; Brandon King; Colin Kingswood; Oscar J Luo; Eddie Park; Kimberly Persaud; Jonathan B Preall; Paolo Ribeca; Brian Risk; Daniel Robyr; Michael Sammeth; Lorian Schaffer; Lei-Hoon See; Atif Shahab; Jorgen Skancke; Ana Maria Suzuki; Hazuki Takahashi; Hagen Tilgner; Diane Trout; Nathalie Walters; Huaien Wang; John Wrobel; Yanbao Yu; Xiaoan Ruan; Yoshihide Hayashizaki; Jennifer Harrow; Mark Gerstein; Tim Hubbard; Alexandre Reymond; Stylianos E Antonarakis; Gregory Hannon; Morgan C Giddings; Yijun Ruan; Barbara Wold; Piero Carninci; Roderic Guigó; Thomas R Gingeras
Journal:  Nature       Date:  2012-09-06       Impact factor: 49.962

View more
  82 in total

1.  A transcription factor IIA-binding site differentially regulates RNA polymerase II-mediated transcription in a promoter context-dependent manner.

Authors:  Juan Wang; Shasha Zhao; Wei He; Yun Wei; Yang Zhang; Henry Pegg; Paul Shore; Stefan G E Roberts; Wensheng Deng
Journal:  J Biol Chem       Date:  2017-05-24       Impact factor: 5.157

2.  Dissecting the sources of gene expression variation in a pan-cancer analysis identifies novel regulatory mutations.

Authors:  Anchal Sharma; Chuan Jiang; Subhajyoti De
Journal:  Nucleic Acids Res       Date:  2018-05-18       Impact factor: 16.971

3.  Evolution of replication origins in vertebrate genomes: rapid turnover despite selective constraints.

Authors:  Florian Massip; Marc Laurent; Caroline Brossas; José Miguel Fernández-Justel; María Gómez; Marie-Noelle Prioleau; Laurent Duret; Franck Picard
Journal:  Nucleic Acids Res       Date:  2019-06-04       Impact factor: 16.971

4.  Level of hydrogen peroxide affects expression and sub-cellular localization of Pax6.

Authors:  Sachin Shukla; Rajnikant Mishra
Journal:  Mol Biol Rep       Date:  2018-05-16       Impact factor: 2.316

5.  Allele-specific single-cell RNA sequencing reveals different architectures of intrinsic and extrinsic gene expression noises.

Authors:  Mengyi Sun; Jianzhi Zhang
Journal:  Nucleic Acids Res       Date:  2020-01-24       Impact factor: 16.971

6.  Pervasive and CpG-dependent promoter-like characteristics of transcribed enhancers.

Authors:  Robin Steinhaus; Tonatiuh Gonzalez; Dominik Seelow; Peter N Robinson
Journal:  Nucleic Acids Res       Date:  2020-06-04       Impact factor: 16.971

7.  TGFβ-induced fibroblast activation requires persistent and targeted HDAC-mediated gene repression.

Authors:  Dakota L Jones; Andrew J Haak; Nunzia Caporarello; Kyoung M Choi; Zhenqing Ye; Huihuang Yan; Xaralabos Varelas; Tamas Ordog; Giovanni Ligresti; Daniel J Tschumperlin
Journal:  J Cell Sci       Date:  2019-10-18       Impact factor: 5.285

8.  NFE2-Related Transcription Factor 2 Coordinates Antioxidant Defense with Thyroglobulin Production and Iodination in the Thyroid Gland.

Authors:  Panos G Ziros; Ioannis G Habeos; Dionysios V Chartoumpekis; Eleni Ntalampyra; Emmanuel Somm; Cédric O Renaud; Massimo Bongiovanni; Ioannis P Trougakos; Masayuki Yamamoto; Thomas W Kensler; Pilar Santisteban; Nancy Carrasco; Carrie Ris-Stalpers; Elena Amendola; Xiao-Hui Liao; Luciano Rossich; Lisa Thomasz; Guillermo J Juvenal; Samuel Refetoff; Gerasimos P Sykiotis
Journal:  Thyroid       Date:  2018-06       Impact factor: 6.568

9.  Auto-aggressive CXCR6+ CD8 T cells cause liver immune pathology in NASH.

Authors:  Michael Dudek; Dominik Pfister; Sainitin Donakonda; Pamela Filpe; Annika Schneider; Melanie Laschinger; Daniel Hartmann; Norbert Hüser; Philippa Meiser; Felix Bayerl; Donato Inverso; Jennifer Wigger; Marcial Sebode; Rupert Öllinger; Roland Rad; Silke Hegenbarth; Martina Anton; Adrien Guillot; Andrew Bowman; Danijela Heide; Florian Müller; Pierluigi Ramadori; Valentina Leone; Cristina Garcia-Caceres; Tim Gruber; Gabriel Seifert; Agnieszka M Kabat; Jan-Philipp Mallm; Simon Reider; Maria Effenberger; Susanne Roth; Adrian T Billeter; Beat Müller-Stich; Edward J Pearce; Friedrich Koch-Nolte; Rafael Käser; Herbert Tilg; Robert Thimme; Tobias Boettler; Frank Tacke; Jean-Francois Dufour; Dirk Haller; Peter J Murray; Ron Heeren; Dietmar Zehn; Jan P Böttcher; Mathias Heikenwälder; Percy A Knolle
Journal:  Nature       Date:  2021-03-24       Impact factor: 49.962

10.  Chronic Mild Stress and Venlafaxine Treatment Were Associated with Altered Expression Level and Methylation Status of New Candidate Inflammatory Genes in PBMCs and Brain Structures of Wistar Rats.

Authors:  Katarzyna Bialek; Piotr Czarny; Paulina Wigner; Ewelina Synowiec; Gabriela Barszczewska; Michal Bijak; Janusz Szemraj; Monika Niemczyk; Katarzyna Tota-Glowczyk; Mariusz Papp; Tomasz Sliwinski
Journal:  Genes (Basel)       Date:  2021-04-29       Impact factor: 4.096

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.