Literature DB >> 28591841

RSAT matrix-clustering: dynamic exploration and redundancy reduction of transcription factor binding motif collections.

Jaime Abraham Castro-Mondragon1, Sébastien Jaeger2, Denis Thieffry3, Morgane Thomas-Chollier3, Jacques van Helden1.   

Abstract

Transcription factor (TF) databases contain multitudes of binding motifs (TFBMs) from various sources, from which non-redundant collections are derived by manual curation. The advent of high-throughput methods stimulated the production of novel collections with increasing numbers of motifs. Meta-databases, built by merging these collections, contain redundant versions, because available tools are not suited to automatically identify and explore biologically relevant clusters among thousands of motifs. Motif discovery from genome-scale data sets (e.g. ChIP-seq) also produces redundant motifs, hampering the interpretation of results. We present matrix-clustering, a versatile tool that clusters similar TFBMs into multiple trees, and automatically creates non-redundant TFBM collections. A feature unique to matrix-clustering is its dynamic visualisation of aligned TFBMs, and its capability to simultaneously treat multiple collections from various sources. We demonstrate that matrix-clustering considerably simplifies the interpretation of combined results from multiple motif discovery tools, and highlights biologically relevant variations of similar motifs. We also ran a large-scale application to cluster ∼11 000 motifs from 24 entire databases, showing that matrix-clustering correctly groups motifs belonging to the same TF families, and drastically reduced motif redundancy. matrix-clustering is integrated within the RSAT suite (http://rsat.eu/), accessible through a user-friendly web interface or command-line for its integration in pipelines.
© The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Substances:

Year:  2017        PMID: 28591841      PMCID: PMC5737723          DOI: 10.1093/nar/gkx314

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  65 in total

Review 1.  DNA binding sites: representation and discovery.

Authors:  G D Stormo
Journal:  Bioinformatics       Date:  2000-01       Impact factor: 6.937

2.  Natural similarity measures between position frequency matrices with an application to clustering.

Authors:  Utz J Pape; Sven Rahmann; Martin Vingron
Journal:  Bioinformatics       Date:  2008-01-02       Impact factor: 6.937

3.  C2H2 zinc finger proteins greatly expand the human regulatory lexicon.

Authors:  Hamed S Najafabadi; Sanie Mnaimneh; Frank W Schmitges; Michael Garton; Kathy N Lam; Ally Yang; Mihai Albu; Matthew T Weirauch; Ernest Radovani; Philip M Kim; Jack Greenblatt; Brendan J Frey; Timothy R Hughes
Journal:  Nat Biotechnol       Date:  2015-02-18       Impact factor: 54.908

4.  Information content of binding sites on nucleotide sequences.

Authors:  T D Schneider; G D Stormo; L Gold; A Ehrenfeucht
Journal:  J Mol Biol       Date:  1986-04-05       Impact factor: 5.469

5.  TRANSFAC: transcriptional regulation, from patterns to profiles.

Authors:  V Matys; E Fricke; R Geffers; E Gössling; M Haubrock; R Hehl; K Hornischer; D Karas; A E Kel; O V Kel-Margoulis; D-U Kloos; S Land; B Lewicki-Potapov; H Michael; R Münch; I Reuter; S Rotert; H Saxel; M Scheer; S Thiele; E Wingender
Journal:  Nucleic Acids Res       Date:  2003-01-01       Impact factor: 16.971

6.  FlyFactorSurvey: a database of Drosophila transcription factor binding specificities determined using the bacterial one-hybrid system.

Authors:  Lihua Julie Zhu; Ryan G Christensen; Majid Kazemian; Christopher J Hull; Metewo Selase Enuameh; Matthew D Basciotta; Jessie A Brasefield; Cong Zhu; Yuna Asriyan; David S Lapointe; Saurabh Sinha; Scot A Wolfe; Michael H Brodsky
Journal:  Nucleic Acids Res       Date:  2010-11-19       Impact factor: 16.971

7.  RSAT peak-motifs: motif analysis in full-size ChIP-seq datasets.

Authors:  Morgane Thomas-Chollier; Carl Herrmann; Matthieu Defrance; Olivier Sand; Denis Thieffry; Jacques van Helden
Journal:  Nucleic Acids Res       Date:  2011-12-08       Impact factor: 16.971

8.  UniPROBE, update 2015: new tools and content for the online database of protein-binding microarray data on protein-DNA interactions.

Authors:  Maxwell A Hume; Luis A Barrera; Stephen S Gisselbrecht; Martha L Bulyk
Journal:  Nucleic Acids Res       Date:  2014-11-05       Impact factor: 16.971

9.  Predicting the human epigenome from DNA motifs.

Authors:  John W Whitaker; Zhao Chen; Wei Wang
Journal:  Nat Methods       Date:  2014-09-21       Impact factor: 28.547

10.  Selective influence of Sox2 on POU transcription factor binding in embryonic and neural stem cells.

Authors:  Tapan Kumar Mistri; Arun George Devasia; Lee Thean Chu; Wei Ping Ng; Florian Halbritter; Douglas Colby; Ben Martynoga; Simon R Tomlinson; Ian Chambers; Paul Robson; Thorsten Wohland
Journal:  EMBO Rep       Date:  2015-08-11       Impact factor: 8.807

View more
  45 in total

1.  Genome-Wide Transcriptional Analysis Reveals Novel AhR Targets That Regulate Dendritic Cell Function during Influenza A Virus Infection.

Authors:  Anthony M Franchini; Jason R Myers; Guang-Bi Jin; David M Shepherd; B Paige Lawrence
Journal:  Immunohorizons       Date:  2019-06-17

2.  Conserved DNA sequence features underlie pervasive RNA polymerase pausing.

Authors:  Martyna Gajos; Olga Jasnovidova; Alena van Bömmel; Susanne Freier; Martin Vingron; Andreas Mayer
Journal:  Nucleic Acids Res       Date:  2021-05-07       Impact factor: 16.971

3.  A bipartite transcription factor module controlling expression in the bundle sheath of Arabidopsis thaliana.

Authors:  Patrick J Dickinson; Jana Kneřová; Marek Szecówka; Sean R Stevenson; Steven J Burgess; Hugh Mulvey; Anne-Maarit Bågman; Allison Gaudinier; Siobhan M Brady; Julian M Hibberd
Journal:  Nat Plants       Date:  2020-11-23       Impact factor: 15.793

Review 4.  Organizing combinatorial transcription factor recruitment at cis-regulatory modules.

Authors:  Julie Dubois-Chevalier; Parisa Mazrooei; Mathieu Lupien; Bart Staels; Philippe Lefebvre; Jérôme Eeckhoute
Journal:  Transcription       Date:  2017-11-28

5.  A human cell atlas of fetal chromatin accessibility.

Authors:  Silvia Domcke; Andrew J Hill; Riza M Daza; Junyue Cao; Diana R O'Day; Hannah A Pliner; Kimberly A Aldinger; Dmitry Pokholok; Fan Zhang; Jennifer H Milbank; Michael A Zager; Ian A Glass; Frank J Steemers; Dan Doherty; Cole Trapnell; Darren A Cusanovich; Jay Shendure
Journal:  Science       Date:  2020-11-13       Impact factor: 47.728

6.  Enhanced Maps of Transcription Factor Binding Sites Improve Regulatory Networks Learned from Accessible Chromatin Data.

Authors:  Shubhada R Kulkarni; D Marc Jones; Klaas Vandepoele
Journal:  Plant Physiol       Date:  2019-07-25       Impact factor: 8.340

7.  The Master Regulator Protein BAZ2B Can Reprogram Human Hematopoietic Lineage-Committed Progenitors into a Multipotent State.

Authors:  Karthik Arumugam; William Shin; Valentina Schiavone; Lukas Vlahos; Xiaochuan Tu; Davide Carnevali; Jordan Kesner; Evan O Paull; Neus Romo; Prem Subramaniam; Jeremy Worley; Xiangtian Tan; Andrea Califano; Maria Pia Cosma
Journal:  Cell Rep       Date:  2020-12-08       Impact factor: 9.423

8.  Single nucleus multi-omics identifies human cortical cell regulatory genome diversity.

Authors:  Chongyuan Luo; Hanqing Liu; Fangming Xie; Ethan J Armand; Kimberly Siletti; Trygve E Bakken; Rongxin Fang; Wayne I Doyle; Tim Stuart; Rebecca D Hodge; Lijuan Hu; Bang-An Wang; Zhuzhu Zhang; Sebastian Preissl; Dong-Sung Lee; Jingtian Zhou; Sheng-Yong Niu; Rosa Castanon; Anna Bartlett; Angeline Rivkin; Xinxin Wang; Jacinta Lucero; Joseph R Nery; David A Davis; Deborah C Mash; Rahul Satija; Jesse R Dixon; Sten Linnarsson; Ed Lein; M Margarita Behrens; Bing Ren; Eran A Mukamel; Joseph R Ecker
Journal:  Cell Genom       Date:  2022-03-09

9.  Genetic effects on liver chromatin accessibility identify disease regulatory variants.

Authors:  Kevin W Currin; Michael R Erdos; Narisu Narisu; Vivek Rai; Swarooparani Vadlamudi; Hannah J Perrin; Jacqueline R Idol; Tingfen Yan; Ricardo D'Oliveira Albanus; K Alaine Broadaway; Amy S Etheridge; Lori L Bonnycastle; Peter Orchard; John P Didion; Amarjit S Chaudhry; Federico Innocenti; Erin G Schuetz; Laura J Scott; Stephen C J Parker; Francis S Collins; Karen L Mohlke
Journal:  Am J Hum Genet       Date:  2021-05-25       Impact factor: 11.025

10.  Dynamic landscape of protein occupancy across the Escherichia coli chromosome.

Authors:  Peter L Freddolino; Haley M Amemiya; Thomas J Goss; Saeed Tavazoie
Journal:  PLoS Biol       Date:  2021-06-25       Impact factor: 8.029

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.