| Literature DB >> 19906716 |
Elodie Portales-Casamar1, Supat Thongjuea, Andrew T Kwon, David Arenillas, Xiaobei Zhao, Eivind Valen, Dimas Yusuf, Boris Lenhard, Wyeth W Wasserman, Albin Sandelin.
Abstract
JASPAR (http://jaspar.genereg.net) is the leading open-access database of matrix profiles describing the DNA-binding patterns of transcription factors (TFs) and other proteins interacting with DNA in a sequence-specific manner. Its fourth major release is the largest expansion of the core database to date: the database now holds 457 non-redundant, curated profiles. The new entries include the first batch of profiles derived from ChIP-seq and ChIP-chip whole-genome binding experiments, and 177 yeast TF binding profiles. The introduction of a yeast division brings the convenience of JASPAR to an active research community. As binding models are refined by newer data, the JASPAR database now uses versioning of matrices: in this release, 12% of the older models were updated to improved versions. Classification of TF families has been improved by adopting a new DNA-binding domain nomenclature. A curated catalog of mammalian TFs is provided, extending the use of the JASPAR profiles to additional TFs belonging to the same structural family. The changes in the database set the system ready for more rapid acquisition of new high-throughput data sources. Additionally, three new special collections provide matrix profile data produced by recent alternative high-throughput approaches.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19906716 PMCID: PMC2808906 DOI: 10.1093/nar/gkp950
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Examples of SELEX-derived matrix profiles replaced by ChIP-seq-derived profiles. (A) The previous MYCN matrix profile (MA0104.1) derived by DNA SELEX. (B) The new MYCN profile (MA0104.2) derived from ChIP-seq binding shows general agreement with the SELEX profile, with additional information derived from hundreds of sites at flanking positions. (C) The previous KLF4 profile (MA0039.1) is an example of SELEX-derived profile that did not correspond well to the handful of individually characterized KLF4 sites. (D) The new KLF profile derived from ChIP-seq data (13) shows a dramatic increase in information content and a good agreement with individually characterized binding sites.
Summary of the content and growth of the JASPAR database
| JASPAR | Brief description | Subset | Number of profiles in JASPAR 3.0 | New profiles in JASPAR 4.0 | Updated profiles | Removed profiles | Total profiles (including all versions) | Total profiles (non- redundant) |
|---|---|---|---|---|---|---|---|---|
| Core | ||||||||
| Non-redundant, literature-derived, curated models | Vertebrates | 101 | 29 | 16 | 1 | 145 | 130 | |
| Plants | 21 | – | – | 21 | 21 | |||
| Insects | 14 | 109 | 1 | – | 124 | 123 | ||
| Nematoda | – | 5 | – | – | 5 | 5 | ||
| Fungi | – | 177 | – | – | 177 | 177 | ||
| Urochordata | 1 | – | – | – | 1 | 1 | ||
| Total core | 137 | 321 | 17 | 1 | 474 | 457 | ||
| Collections | ||||||||
| POLII | Core promoter element profiles | – | 13 | – | – | 13 | 13 | |
| FAM | Familial ‘consensus’ profiles for major structural families of transcription factors | – | 11 | – | – | – | 11 | 11 |
| CNE | Profiles overrepresented in vertebrate highly conserved non-coding elements | – | 233 | – | – | – | 233 | 233 |
| PHYLOFACTS | Evolutionary conserved profiles in 5′ promoter regions | – | 174 | – | – | 174 | 174 | |
| SPLICE | Splice sites | – | 6 | – | – | – | 6 | 6 |
| PBM | Protein binding microarray profiles | – | – | 208 | – | – | 208 | 208 |
| PBM_HOMEO | Protein binding microarray profiles focused on homeodomain TFs | – | – | 176 | – | – | 176 | 176 |
| PBM_BHLH | Protein binding microarray profiles focused on bHLH domain TFs | – | – | 19 | – | – | 19 | 19 |
| Total collections | 437 | 403 | – | – | 840 | 840 |