Abstract
The mapper(2) Database (http://genome.ufl.edu/mapperdb) is a component of mapper(2), a web-based system for the analysis of transcription factor binding sites in multiple genomes. The database contains predicted binding sites identified in the promoters of all human, mouse and Drosophila genes using 1017 probabilistic models representing over 600 different transcription factors. In this article we outline the current contents of the database and we describe its web-based user interface in detail. We then discuss ongoing work to extend the database contents to experimental data and to add analysis capabilities. Finally, we provide information about recent improvements to the hardware and software platform that mapper(2) is based on.
Year: 2011 PMID: 22121218 PMCID: PMC3245066 DOI: 10.1093/nar/gkr1080
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Table 1. Number of HMM models and factors represented in each of the three default model libraries
| Library | Models | Factors |
|---|---|---|
| TRANSFAC | 399 | 326 |
| MAPPER | 529 | 434 |
| JASPAR | 89 | 89 |
| Total | 1017 | 678 |
Table 2. Summary of the contents of the mapper2 database, by organism
| | Human | Mouse | Drosophila |
|---|---|---|---|
| Models | 832 | 829 | 819 |
| Genes | 21 510 | 21 736 | 14 003 |
| Promoters | 34 022 | 27 443 | 22 369 |
| Total bases (Mb) | 628 | 578.6 | 266 |
| Total hits | 33 122 746 | 24 313 246 | 19 248 318 |
| Hits/promoter | 973.5 | 886.0 | 860.5 |
| Hits/model | 1.17 | 1.06 | 1.05 |
| Hits spacing | 18.95 | 23.80 | (13.82) |
‘Models’ indicates the number of models that produced at least one hit in a genome. ‘Hits/model’ represents the average number of hits in a promoter for each model. ‘Hits spacing’ is the average number of nucleotides between hits (see text for an explanation of the Drosophila value).
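The derived rows of this table follow from the raw counts by simple arithmetic. The sketch below recomputes them, assuming that the third data column is Drosophila (as the footnote indicates) and that 1 Mb is 10^6 bases; the names `TABLE` and `derived_metrics` are illustrative and are not part of mapper(2).

```python
# Raw counts copied from the table above; "Total bases (Mb)" is converted to bases.
TABLE = {
    "Human":      {"models": 832, "promoters": 34_022, "bases": 628.0e6, "hits": 33_122_746},
    "Mouse":      {"models": 829, "promoters": 27_443, "bases": 578.6e6, "hits": 24_313_246},
    "Drosophila": {"models": 819, "promoters": 22_369, "bases": 266.0e6, "hits": 19_248_318},
}


def derived_metrics(row):
    """Recompute the three derived rows from the raw counts of one organism."""
    hits_per_promoter = row["hits"] / row["promoters"]
    return {
        # Average number of hits found in one promoter.
        "hits_per_promoter": hits_per_promoter,
        # Average number of hits in a promoter for each model.
        "hits_per_model": hits_per_promoter / row["models"],
        # Average number of nucleotides between consecutive hits.
        "hits_spacing": row["bases"] / row["hits"],
    }


for organism, row in TABLE.items():
    m = derived_metrics(row)
    print(f"{organism:<11} {m['hits_per_promoter']:8.1f} "
          f"{m['hits_per_model']:5.2f} {m['hits_spacing']:6.2f}")
```

The recomputed values agree with the tabulated ones to within about one unit in the last printed digit, which suggests that 'Hits/model' is 'Hits/promoter' divided by 'Models' and that 'Hits spacing' is 'Total bases' divided by 'Total hits'.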
Table 3. Summary of hits with scores above a threshold corresponding to a false discovery rate of 1%; the threshold was obtained by scanning a 5 Mb randomized promoter sequence
| | Human | Mouse | Drosophila |
|---|---|---|---|
| High quality hits | 12 354 505 | 8 849 004 | 6 828 654 |
| Hits/promoter | 363.1 | 322.5 | 305.2 |
| Hits/model | 0.44 | 0.49 | 0.37 |
| Hits distance | 50.8 | 65.39 | (39.0) |
The number of models and of promoters analyzed is the same as in Table 2.
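This record does not describe how the score threshold was derived from the randomized sequence. As a rough illustration only, the sketch below assumes that each model is scanned against the randomized sequence to collect a list of null scores, and that the threshold is set so that at most 1% of those null hits score above it. Strictly speaking this is a false-positive rate on the null sequence, used here as a stand-in for the false discovery rate named in the caption; the function names are hypothetical.

```python
import math


def score_threshold(null_scores, rate=0.01):
    """Return a score t such that at most `rate` of null_scores are strictly above t.

    null_scores: scores of hits found by scanning a randomized sequence
    (assumed input; the real mapper(2) procedure may differ).
    """
    if not null_scores:
        raise ValueError("need at least one null score")
    ranked = sorted(null_scores, reverse=True)
    # Number of null hits that may lie above the threshold.
    allowed = math.floor(rate * len(ranked))
    # Clamp so that rate=1.0 still returns a valid element.
    return ranked[min(allowed, len(ranked) - 1)]


def high_quality_hits(scores, threshold):
    """Keep only the hits whose score is strictly above the threshold."""
    return [s for s in scores if s > threshold]
```

For example, with 1000 null scores and `rate=0.01`, the function returns the 11th highest null score, so exactly the top 10 null hits (1%) lie above the threshold when there are no ties.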
Figure 1. The mapper2 page displaying results of a single-gene database query. The gray box shows detailed information for the hit in the line directly above it.
Figure 2. Example distance distribution plot for a pair of models. The graph represents the histogram of distances between binding sites for models M00158 (HNF-4) and M00724 (HNF-3α) in mouse. In the vast majority of cases, these binding sites are separated by 100 bp, while other distances occur at very low and almost constant frequencies.
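A distance distribution of the kind shown in Figure 2 can be approximated by counting, within each promoter, the distances between every hit of one model and every hit of the other. The sketch below is a minimal illustration under assumed inputs: each model's hits are given as a dictionary mapping a promoter identifier to a list of hit positions, and distance is the absolute difference between positions. The data layout, the `max_distance` cutoff and the function name are hypothetical, and mapper(2) may define distance differently (for example, signed or end-to-start).

```python
from collections import Counter


def distance_histogram(hits_a, hits_b, max_distance=500):
    """Count distances between hits of two models that fall in the same promoter.

    hits_a, hits_b: dict mapping promoter id -> list of hit positions (assumed layout).
    Returns a Counter mapping distance in bp -> number of hit pairs.
    """
    counts = Counter()
    for promoter, positions_a in hits_a.items():
        # Promoters with no hit for the second model contribute no pairs.
        for pos_a in positions_a:
            for pos_b in hits_b.get(promoter, []):
                distance = abs(pos_a - pos_b)
                if distance <= max_distance:
                    counts[distance] += 1
    return counts


# Toy input with made-up positions, for illustration only.
example_a = {"promoter1": [100, 400], "promoter2": [50]}
example_b = {"promoter1": [200, 500], "promoter3": [10]}
histogram = distance_histogram(example_a, example_b)
```

A sharp peak at a single distance in such a histogram, as in the 100 bp case described in the caption, would indicate that the two binding sites tend to occur at a fixed offset from each other.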