| Literature DB >> 28035026 |
Ralf Eggeling1, Ivo Grosse2,3, Jan Grau2.
Abstract
Summary: Recent studies have shown that the traditional position weight matrix model is often insufficient for modeling transcription factor binding sites, as intra-motif dependencies play a significant role for an accurate description of binding motifs. Here, we present the Java application InMoDe, a collection of tools for learning, leveraging and visualizing such dependencies of putative higher order. The distinguishing feature of InMoDe is a robust model selection from a class of parsimonious models, taking into account dependencies only if justified by the data while choosing for simplicity otherwise. Availability and Implementation: InMoDe is implemented in Java and is available as command line application, as application with a graphical user-interface, and as an integration into Galaxy on the project website at http://www.jstacs.de/index.php/InMoDe . Contact: ralf.eggeling@cs.helsinki.fi.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28035026 PMCID: PMC5408807 DOI: 10.1093/bioinformatics/btw689
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Exemplary conditional sequence logo as generated by VisualisationApp (A) and screenshot of the graphical user interface (B). The examples use the E2F1 dataset from JASPAR (Sandelin ). The top row of nucleotide stacks in the CSL corresponds to a classical sequence logo (displayed in top-right box of the GUI), whereas the bottom row shows conditional nucleotide probabilities given the context represented by the PCT (middle). A motif refinement by first-order dependencies is achieved at position 2, 3 and 8, higher-order dependencies do occur once at position 10, and yet the model allows to choose for simplicity where nothing else is justified by the data, such as at position 4–7 and 11