| Literature DB >> 29140473 |
Aziz Khan1, Oriol Fornes2, Arnaud Stigliani3, Marius Gheorghe1, Jaime A Castro-Mondragon1, Robin van der Lee2, Adrien Bessy3, Jeanne Chèneby4,5, Shubhada R Kulkarni6,7,8, Ge Tan9,10, Damir Baranasic9,10, David J Arenillas2, Albin Sandelin11, Klaas Vandepoele6,7,8, Boris Lenhard9,10,12, Benoît Ballester4,5, Wyeth W Wasserman2, François Parcy3, Anthony Mathelier1,13.
Abstract
JASPAR (http://jaspar.genereg.net) is an open-access database of curated, non-redundant transcription factor (TF)-binding profiles stored as position frequency matrices (PFMs) and TF flexible models (TFFMs) for TFs across multiple species in six taxonomic groups. In the 2018 release of JASPAR, the CORE collection has been expanded with 322 new PFMs (60 for vertebrates and 262 for plants) and 33 PFMs were updated (24 for vertebrates, 8 for plants and 1 for insects). These new profiles represent a 30% expansion compared to the 2016 release. In addition, we have introduced 316 TFFMs (95 for vertebrates, 218 for plants and 3 for insects). This release incorporates clusters of similar PFMs in each taxon and each TF class per taxon. The JASPAR 2018 CORE vertebrate collection of PFMs was used to predict TF-binding sites in the human genome. The predictions are made available to the scientific community through a UCSC Genome Browser track data hub. Finally, this update comes with a new web framework with an interactive and responsive user-interface, along with new features. All the underlying data can be retrieved programmatically using a RESTful API and through the JASPAR 2018 R/Bioconductor package.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29140473 PMCID: PMC5753243 DOI: 10.1093/nar/gkx1126
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Overview of the growth of the number of PFMs in the JASPAR 2018 CORE collection compared to the JASPAR 2016 CORE collection
| Taxonomic group | Non-redundant PFMs in JASPAR 2016 | New non-redundant PFMs in JASPAR 2018 | Updated PFMs in JASPAR 2018 | Total PFMs (non-redundant) in JASPAR 2018 | Total PFMs (all versions) in JASPAR 2018 |
|---|---|---|---|---|---|
| Vertebrates | 519 | 60 | 24 | 579 | 719 |
| Plants | 227 | 262 | 8 | 489 | 501 |
| Insects | 133 | 0 | 1 | 133 | 140 |
| Nematodes | 26 | 0 | 0 | 26 | 26 |
| Fungi | 176 | 0 | 0 | 176 | 177 |
| Urochordata | 1 | 0 | 0 | 1 | 1 |
|
|
|
|
|
|
|
Figure 1.JASPAR PFM clustering. (A) Radial tree representing the clusterization of the JASPAR CORE vertebrate PFMs. (B) Zoom in view of the radial tree where the predicted clusters are highlighted at the branches and the TF classes are indicated with different colors at the leaves. (C) Clicking on a leaf in the radial tree will open a link to the corresponding motif description page on the JASPAR website (the MA0148.3 profile associated to FOXA1 is provided here as an example).
Figure 2.Overview of the JASPAR 2018 new web interface with interactive searching activity. (A) A quick and detailed search feature on the homepage. (B) A responsive table lists the searched profile(s), which can be further selected and added to the cart listed on the right panel for users to perform their own analyses. (C) A detailed page for the GATA3 matrix profile, which is divided into sub-panels including the profile summary, sequence logo, PFM, TF-binding information, external links, version information, ChIP-seq centrality, TFFM and other details. (D) The PFM for the GATA3 profile (MA0037.2) is downloaded in MEME format using the RESTful API.