Christoph D Schmid, Rouaïda Perier, Viviane Praz, Philipp Bucher.
Abstract
The Eukaryotic Promoter Database (EPD) is an annotated non-redundant collection of eukaryotic POL II promoters, experimentally defined by a transcription start site (TSS). Access to promoter sequences is provided by pointers to positions in the corresponding genomes. Promoter evidence comes from conventional TSS mapping experiments for individual genes, or, starting from release 73, from mass genome annotation projects. Subsets of promoter sequences with customized 5' and 3' extensions can be downloaded from the EPD website. The focus of current development efforts is to reach complete promoter coverage for important model organisms as soon as possible. To speed up this process, a new class of preliminary promoter entries has been introduced as of release 83, which requires less stringent admission criteria. As part of a continuous integration process, new web-based interfaces have been developed, which allow joint analysis of promoter sequences with other bioinformatics resources developed by our group, in particular programs offered by the Signal Search Analysis Server, and gene expression data stored in the CleanEx database. EPD can be accessed at http://www.epd.isb-sib.ch.
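The idea of a promoter stored as a pointer to a genome position, from which a sequence with chosen 5' and 3' extensions is cut, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the `PromoterPointer` record, the `extract_promoter` function, the 1-based coordinate convention and the toy sequence are hypothetical and do not reproduce EPD's actual file format or download tool.

```python
from dataclasses import dataclass

# Translation table for complementing DNA (both cases, N left unchanged).
_COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")


@dataclass(frozen=True)
class PromoterPointer:
    """Hypothetical pointer to a TSS; not EPD's real record layout."""
    seq_id: str   # identifier of the genome sequence
    strand: str   # "+" or "-"
    tss: int      # 1-based coordinate of the transcription start site


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def extract_promoter(genome: dict, ptr: PromoterPointer,
                     upstream: int, downstream: int) -> str:
    """Cut `upstream` bases 5' of the TSS, the TSS itself, and
    `downstream` bases 3' of it, returned in transcript orientation."""
    chrom = genome[ptr.seq_id]
    if ptr.strand == "+":
        start, end = ptr.tss - upstream, ptr.tss + downstream
    elif ptr.strand == "-":
        # On the minus strand, 5' of the TSS means higher coordinates.
        start, end = ptr.tss - downstream, ptr.tss + upstream
    else:
        raise ValueError(f"unknown strand: {ptr.strand!r}")
    if start < 1 or end > len(chrom):
        raise ValueError("requested window exceeds sequence bounds")
    window = chrom[start - 1:end]  # convert 1-based inclusive to a slice
    return window if ptr.strand == "+" else reverse_complement(window)


# Toy genome, invented for the example.
genome = {"chr1": "GATTACAGGC"}
print(extract_promoter(genome, PromoterPointer("chr1", "+", 5), 2, 3))  # TTACAG
print(extract_promoter(genome, PromoterPointer("chr1", "-", 5), 2, 3))  # TGTAAT
```

Keeping only the pointer and cutting the sequence on demand means the same entry can yield windows of any size, which is presumably what makes customized extensions cheap to offer.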
Year: 2006 PMID: 16381980 PMCID: PMC1347508 DOI: 10.1093/nar/gkj146
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Summary of currently accessible mass genome annotation data for promoter mapping
| 5′ EST sequences from oligo-capped cDNA libraries | | | |
| Human | | 400 225 | Suzuki |
| Mouse | | 580 209 | Suzuki |
| | Sequences available from GenBank/EMBL, accession numbers extractable from UniGene (23), Unilib IDs 23941 or 23942 | 102 617 | Stapleton |
| | | 92 654 | Seki |
| 5′ sequence tags (5′SAGE, CAGE, GIS ditag) | | | |
| Human | | 22 546 | Hashimoto |
| Human | | 5 992 395 | Carninci |
| Mouse | | 11 567 973 | Carninci |
| Mouse | | 225 914 | Ng |
| Reference sequence collections from oligo-capped cDNA libraries | | | |
| Rice | | 30 598 | Kikuchi |
The third column indicates the number of available sequences or tags.
Figure 1. Graphical representation of the distribution of 5′ ends of full-length transcripts. The diagram is based on data from the Berkeley Drosophila Genome Project for gene ARF79F and is part of the ‘niceview’ display of EPD entry DM_ARF1_2.
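A distribution of transcript 5′ ends like the one in Figure 1 can be tabulated with a few lines of code. The sketch below is only an illustration under stated assumptions: the function names are hypothetical, the positions are invented rather than taken from the Berkeley Drosophila Genome Project, and choosing the most frequent position is one simple heuristic, not necessarily the criterion EPD applies.

```python
from collections import Counter


def five_prime_end_distribution(positions):
    """Count how many transcripts start at each genomic position,
    returned as a dict ordered by coordinate."""
    return dict(sorted(Counter(positions).items()))


def render_histogram(distribution, symbol="#"):
    """Draw one text row per position, bar length = transcript count."""
    return "\n".join(
        f"{pos:>10} {symbol * count} ({count})"
        for pos, count in distribution.items()
    )


def most_frequent_start(distribution):
    """Return the position with the most 5' ends; ties go to the
    smallest coordinate (an arbitrary choice for this sketch)."""
    return min(distribution, key=lambda pos: (-distribution[pos], pos))


# Invented 5' end coordinates, for illustration only.
ends = [100, 102, 102, 103, 102, 100]
dist = five_prime_end_distribution(ends)
print(render_histogram(dist))
print("most frequent 5' end:", most_frequent_start(dist))
```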