| Literature DB >> 31680159 |
Patrick Meylan1, René Dreos1, Giovanna Ambrosini1,2, Romain Groux1,2, Philipp Bucher1,2.
Abstract
The Eukaryotic Promoter Database (EPD), available online at https://epd.epfl.ch, provides accurate transcription start site (TSS) information for promoters of 15 model organisms plus corresponding functional genomics data that can be viewed in a genome browser, queried or analyzed via web interfaces, or exported in standard formats (FASTA, BED, CSV) for subsequent analysis with other tools. Recent work has focused on the improvement of the EPD promoter viewers, which use the UCSC Genome Browser as visualization platform. Thousands of high-resolution tracks for CAGE, ChIP-seq and similar data have been generated and organized into public track hubs. Customized, reproducible promoter views, combining EPD-supplied tracks with native UCSC Genome Browser tracks, can be accessed from the organism summary pages or from individual promoter entries. Moreover, thanks to recent improvements and stabilization of ncRNA gene catalogs, we were able to release promoter collections for certain classes of ncRNAs from human and mouse. Furthermore, we developed automatic computational protocols to assign orphan TSS peaks to downstream genes based on paired-end (RAMPAGE) TSS mapping data, which enabled us to add nearly 9000 new entries to the human promoter collection. Since our last article in this journal, EPD was extended to five more model organisms: rhesus monkey, rat, dog, chicken and Plasmodium falciparum.Entities:
Mesh:
Substances:
Year: 2020 PMID: 31680159 PMCID: PMC7145694 DOI: 10.1093/nar/gkz1014
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Screen shot of a UCSC Genome Browser window showing two divergent human promoters for genes EMC8 and COX4l1, as displayed by the EPD viewer. From top to bottom (track source in parentheses): high-resolution ChIP-seq tracks for histone modifications H3K4me1, H3K4me3 and RNA polymerase 2 (EPD); lower resolution ChIP-seq tracks for H3K27ac (ENCODE-UCSC); single base resolution CAGE tracks for ENCODE, RAMPAGE and FANTOM5, all libraries combined plus a few selected tissue-specific libraries (EPD); EPD promoter annotation track, thin and thick lines mark 50 bp upstream and10 bp downstream regions relative to the dominant TSS (EPD); GENCODE genes (UCSC); open chromatin/DNase I clusters (ENCODE-UCSC); ChIP-seq peaks for selected transcription factors and RNA polymerase 2 subunits (ENCODE-UCSC); sequence conservation tracks, common SNPs from dbSNP and repetitive elements from RepeatMasker (UCSC).
Figure 2.Frequency and positional distribution of core promoter motifs relative to the TSS of human coding and noncoding genes as defined in EPD.
Organisms covered in EPD and corresponding promoter totals (September 2019)
| Mammals | Other animals | Other eukaryotes | |||
|---|---|---|---|---|---|
|
| 29598 (2339) |
| 16972 |
| 22703 |
|
| 25111 (3077) |
| 10728 |
| 17081 |
|
| 12601 |
| 7120 |
| 5597 |
|
| 9575 |
| 6493 |
| 5113 |
|
| 7545 |
| 6127 |
| 4802 |
Numbers in parentheses relate to promoters of noncoding RNAs.