| Literature DB >> 26531826 |
Anthony Mathelier1, Oriol Fornes1, David J Arenillas1, Chih-Yu Chen1, Grégoire Denay2, Jessica Lee1, Wenqiang Shi1, Casper Shyr1, Ge Tan3, Rebecca Worsley-Hunt1, Allen W Zhang1, François Parcy2, Boris Lenhard4, Albin Sandelin5, Wyeth W Wasserman6.
Abstract
JASPAR (http://jaspar.genereg.net) is an open-access database storing curated, non-redundant transcription factor (TF) binding profiles representing transcription factor binding preferences as position frequency matrices for multiple species in six taxonomic groups. For this 2016 release, we expanded the JASPAR CORE collection with 494 new TF binding profiles (315 in vertebrates, 11 in nematodes, 3 in insects, 1 in fungi and 164 in plants) and updated 59 profiles (58 in vertebrates and 1 in fungi). The introduced profiles represent an 83% expansion and 10% update when compared to the previous release. We updated the structural annotation of the TF DNA binding domains (DBDs) following a published hierarchical structural classification. In addition, we introduced 130 transcription factor flexible models trained on ChIP-seq data for vertebrates, which capture dinucleotide dependencies within TF binding sites. This new JASPAR release is accompanied by a new web tool to infer JASPAR TF binding profiles recognized by a given TF protein sequence. Moreover, we provide the users with a Ruby module complementing the JASPAR API to ease programmatic access and use of the JASPAR collection of profiles. Finally, we provide the JASPAR2016 R/Bioconductor data package with the data of this release.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26531826 PMCID: PMC4702842 DOI: 10.1093/nar/gkv1176
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Overview of the content growth in JASPAR 2016 (version 6.0) compared to JASPAR 2014 (version 5.0)
| Taxonomic group | Number of non-redundant profiles in JASPAR 5.0 | New non-redundant profiles in JASPAR 6.0 | Updated profiles | Removed profiles | Total profiles (including older versions of profiles) | Total profiles (non-redundant) |
|---|---|---|---|---|---|---|
| Vertebrates | 205 | 315 | 58 | 1 | 635 | 519 |
| Plants | 64 | 164 | 0 | 0 | 231 | 227 |
| Insects | 131 | 3 | 0 | 0 | 139 | 133 |
| Nematodes | 15 | 11 | 0 | 0 | 26 | 26 |
| Fungi | 177 | 1 | 1 | 2 | 177 | 176 |
| Urochordata | 1 | 0 | 0 | 0 | 1 | 1 |
| Total | 593 | 494 | 59 | 3 | 1210 | 1082 |
See Supplementary Text for more information.
Figure 1.Screenshot of a new TFFM introduced in the new dedicated layout. Refer to (2) for details on TFFMs and interpretation of the dedicated logos describing the dinucleotide dependencies captured by the models.
Figure 2.Overview of the JASPAR TF binding profile inference. (A) The user can input a TF protein sequence for which to look for a JASPAR TF binding profile. (B) Binding profiles potentially bound by the TF provided by the user are inferred.