Literature DB >> 27899623

Combining transcription factor binding affinities with open-chromatin data for accurate gene expression prediction.

Florian Schmidt1,2, Nina Gasparoni3, Gilles Gasparoni3, Kathrin Gianmoena4, Cristina Cadenas4, Julia K Polansky5, Peter Ebert2,6, Karl Nordström3, Matthias Barann7, Anupam Sinha7, Sebastian Fröhler8, Jieyi Xiong8, Azim Dehghani Amirabad1,2,6, Fatemeh Behjati Ardakani1,2, Barbara Hutter9, Gideon Zipprich10, Bärbel Felder10, Jürgen Eils10, Benedikt Brors9, Wei Chen8, Jan G Hengstler4, Alf Hamann6, Thomas Lengauer2, Philip Rosenstiel7, Jörn Walter3, Marcel H Schulz11,2.   

Abstract

The binding and contribution of transcription factors (TF) to cell specific gene expression is often deduced from open-chromatin measurements to avoid costly TF ChIP-seq assays. Thus, it is important to develop computational methods for accurate TF binding prediction in open-chromatin regions (OCRs). Here, we report a novel segmentation-based method, TEPIC, to predict TF binding by combining sets of OCRs with position weight matrices. TEPIC can be applied to various open-chromatin data, e.g. DNaseI-seq and NOMe-seq. Additionally, Histone-Marks (HMs) can be used to identify candidate TF binding sites. TEPIC computes TF affinities and uses open-chromatin/HM signal intensity as quantitative measures of TF binding strength. Using machine learning, we find low affinity binding sites to improve our ability to explain gene expression variability compared to the standard presence/absence classification of binding sites. Further, we show that both footprints and peaks capture essential TF binding events and lead to a good prediction performance. In our application, gene-based scores computed by TEPIC with one open-chromatin assay nearly reach the quality of several TF ChIP-seq data sets. Finally, these scores correctly predict known transcriptional regulators as illustrated by the application to novel DNaseI-seq and NOMe-seq data for primary human hepatocytes and CD4+ T-cells, respectively.
© The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Substances:

Year:  2016        PMID: 27899623      PMCID: PMC5224477          DOI: 10.1093/nar/gkw1061

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  68 in total

1.  Epigenetic priors for identifying active transcription factor binding sites.

Authors:  Gabriel Cuellar-Partida; Fabian A Buske; Robert C McLeay; Tom Whitington; William Stafford Noble; Timothy L Bailey
Journal:  Bioinformatics       Date:  2011-11-08       Impact factor: 6.937

2.  High-resolution genome-wide in vivo footprinting of diverse transcription factors in human cells.

Authors:  Alan P Boyle; Lingyun Song; Bum-Kyu Lee; Darin London; Damian Keefe; Ewan Birney; Vishwanath R Iyer; Gregory E Crawford; Terrence S Furey
Journal:  Genome Res       Date:  2010-11-24       Impact factor: 9.043

3.  Fast gapped-read alignment with Bowtie 2.

Authors:  Ben Langmead; Steven L Salzberg
Journal:  Nat Methods       Date:  2012-03-04       Impact factor: 28.547

4.  Regularization Paths for Generalized Linear Models via Coordinate Descent.

Authors:  Jerome Friedman; Trevor Hastie; Rob Tibshirani
Journal:  J Stat Softw       Date:  2010       Impact factor: 6.440

5.  A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping.

Authors:  Suhas S P Rao; Miriam H Huntley; Neva C Durand; Elena K Stamenova; Ivan D Bochkov; James T Robinson; Adrian L Sanborn; Ido Machol; Arina D Omer; Eric S Lander; Erez Lieberman Aiden
Journal:  Cell       Date:  2014-12-11       Impact factor: 41.582

6.  BLUEPRINT to decode the epigenetic signature written in blood.

Authors:  David Adams; Lucia Altucci; Stylianos E Antonarakis; Juan Ballesteros; Stephan Beck; Adrian Bird; Christoph Bock; Bernhard Boehm; Elias Campo; Andrea Caricasole; Fredrik Dahl; Emmanouil T Dermitzakis; Tariq Enver; Manel Esteller; Xavier Estivill; Anne Ferguson-Smith; Jude Fitzgibbon; Paul Flicek; Claudia Giehl; Thomas Graf; Frank Grosveld; Roderic Guigo; Ivo Gut; Kristian Helin; Jonas Jarvius; Ralf Küppers; Hans Lehrach; Thomas Lengauer; Åke Lernmark; David Leslie; Markus Loeffler; Elizabeth Macintyre; Antonello Mai; Joost H A Martens; Saverio Minucci; Willem H Ouwehand; Pier Giuseppe Pelicci; Hèléne Pendeville; Bo Porse; Vardhman Rakyan; Wolf Reik; Martin Schrappe; Dirk Schübeler; Martin Seifert; Reiner Siebert; David Simmons; Nicole Soranzo; Salvatore Spicuglia; Michael Stratton; Hendrik G Stunnenberg; Amos Tanay; David Torrents; Alfonso Valencia; Edo Vellenga; Martin Vingron; Jörn Walter; Spike Willcocks
Journal:  Nat Biotechnol       Date:  2012-03-07       Impact factor: 54.908

7.  Refined DNase-seq protocol and data analysis reveals intrinsic bias in transcription factor footprint identification.

Authors:  Housheng Hansen He; Clifford A Meyer; Sheng'en Shawn Hu; Mei-Wei Chen; Chongzhi Zang; Yin Liu; Prakash K Rao; Teng Fei; Han Xu; Henry Long; X Shirley Liu; Myles Brown
Journal:  Nat Methods       Date:  2013-12-08       Impact factor: 28.547

8.  An expansive human regulatory lexicon encoded in transcription factor footprints.

Authors:  Shane Neph; Jeff Vierstra; Andrew B Stergachis; Alex P Reynolds; Eric Haugen; Benjamin Vernot; Robert E Thurman; Sam John; Richard Sandstrom; Audra K Johnson; Matthew T Maurano; Richard Humbert; Eric Rynes; Hao Wang; Shinny Vong; Kristen Lee; Daniel Bates; Morgan Diegel; Vaughn Roach; Douglas Dunn; Jun Neri; Anthony Schafer; R Scott Hansen; Tanya Kutyavin; Erika Giste; Molly Weaver; Theresa Canfield; Peter Sabo; Miaohua Zhang; Gayathri Balasundaram; Rachel Byron; Michael J MacCoss; Joshua M Akey; M A Bender; Mark Groudine; Rajinder Kaul; John A Stamatoyannopoulos
Journal:  Nature       Date:  2012-09-06       Impact factor: 49.962

9.  Integrative analysis of 111 reference human epigenomes.

Authors:  Anshul Kundaje; Wouter Meuleman; Jason Ernst; Misha Bilenky; Angela Yen; Alireza Heravi-Moussavi; Pouya Kheradpour; Zhizhuo Zhang; Jianrong Wang; Michael J Ziller; Viren Amin; John W Whitaker; Matthew D Schultz; Lucas D Ward; Abhishek Sarkar; Gerald Quon; Richard S Sandstrom; Matthew L Eaton; Yi-Chieh Wu; Andreas R Pfenning; Xinchen Wang; Melina Claussnitzer; Yaping Liu; Cristian Coarfa; R Alan Harris; Noam Shoresh; Charles B Epstein; Elizabeta Gjoneska; Danny Leung; Wei Xie; R David Hawkins; Ryan Lister; Chibo Hong; Philippe Gascard; Andrew J Mungall; Richard Moore; Eric Chuah; Angela Tam; Theresa K Canfield; R Scott Hansen; Rajinder Kaul; Peter J Sabo; Mukul S Bansal; Annaick Carles; Jesse R Dixon; Kai-How Farh; Soheil Feizi; Rosa Karlic; Ah-Ram Kim; Ashwinikumar Kulkarni; Daofeng Li; Rebecca Lowdon; GiNell Elliott; Tim R Mercer; Shane J Neph; Vitor Onuchic; Paz Polak; Nisha Rajagopal; Pradipta Ray; Richard C Sallari; Kyle T Siebenthall; Nicholas A Sinnott-Armstrong; Michael Stevens; Robert E Thurman; Jie Wu; Bo Zhang; Xin Zhou; Arthur E Beaudet; Laurie A Boyer; Philip L De Jager; Peggy J Farnham; Susan J Fisher; David Haussler; Steven J M Jones; Wei Li; Marco A Marra; Michael T McManus; Shamil Sunyaev; James A Thomson; Thea D Tlsty; Li-Huei Tsai; Wei Wang; Robert A Waterland; Michael Q Zhang; Lisa H Chadwick; Bradley E Bernstein; Joseph F Costello; Joseph R Ecker; Martin Hirst; Alexander Meissner; Aleksandar Milosavljevic; Bing Ren; John A Stamatoyannopoulos; Ting Wang; Manolis Kellis
Journal:  Nature       Date:  2015-02-19       Impact factor: 69.504

10.  JASPAR 2016: a major expansion and update of the open-access database of transcription factor binding profiles.

Authors:  Anthony Mathelier; Oriol Fornes; David J Arenillas; Chih-Yu Chen; Grégoire Denay; Jessica Lee; Wenqiang Shi; Casper Shyr; Ge Tan; Rebecca Worsley-Hunt; Allen W Zhang; François Parcy; Boris Lenhard; Albin Sandelin; Wyeth W Wasserman
Journal:  Nucleic Acids Res       Date:  2015-11-03       Impact factor: 16.971

View more
  33 in total

Review 1.  Assessment of stem cell differentiation based on genome-wide expression profiles.

Authors:  Patricio Godoy; Wolfgang Schmidt-Heck; Birte Hellwig; Patrick Nell; David Feuerborn; Jörg Rahnenführer; Kathrin Kattler; Jörn Walter; Nils Blüthgen; Jan G Hengstler
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2018-07-05       Impact factor: 6.237

2.  HOCOMOCO: towards a complete collection of transcription factor binding models for human and mouse via large-scale ChIP-Seq analysis.

Authors:  Ivan V Kulakovskiy; Ilya E Vorontsov; Ivan S Yevshin; Ruslan N Sharipov; Alla D Fedorova; Eugene I Rumynskiy; Yulia A Medvedeva; Arturo Magana-Mora; Vladimir B Bajic; Dmitry A Papatsenko; Fedor A Kolpakov; Vsevolod J Makeev
Journal:  Nucleic Acids Res       Date:  2018-01-04       Impact factor: 16.971

3.  Temporal enhancer profiling of parallel lineages identifies AHR and GLIS1 as regulators of mesenchymal multipotency.

Authors:  Deborah Gérard; Florian Schmidt; Aurélien Ginolhac; Martine Schmitz; Rashi Halder; Peter Ebert; Marcel H Schulz; Thomas Sauter; Lasse Sinkkonen
Journal:  Nucleic Acids Res       Date:  2019-02-20       Impact factor: 16.971

4.  Neuronal brain-region-specific DNA methylation and chromatin accessibility are associated with neuropsychiatric trait heritability.

Authors:  Lindsay F Rizzardi; Peter F Hickey; Varenka Rodriguez DiBlasi; Rakel Tryggvadóttir; Colin M Callahan; Adrian Idrizi; Kasper D Hansen; Andrew P Feinberg
Journal:  Nat Neurosci       Date:  2019-01-14       Impact factor: 24.884

5.  Prediction of single-cell gene expression for transcription factor analysis.

Authors:  Fatemeh Behjati Ardakani; Kathrin Kattler; Tobias Heinen; Florian Schmidt; David Feuerborn; Gilles Gasparoni; Konstantin Lepikhov; Patrick Nell; Jan Hengstler; Jörn Walter; Marcel H Schulz
Journal:  Gigascience       Date:  2020-10-30       Impact factor: 6.524

6.  Functional effects of variation in transcription factor binding highlight long-range gene regulation by epromoters.

Authors:  Joanna Mitchelmore; Nastasiya F Grinberg; Chris Wallace; Mikhail Spivakov
Journal:  Nucleic Acids Res       Date:  2020-04-06       Impact factor: 16.971

7.  Unique and assay specific features of NOMe-, ATAC- and DNase I-seq data.

Authors:  Karl J V Nordström; Florian Schmidt; Nina Gasparoni; Abdulrahman Salhab; Gilles Gasparoni; Kathrin Kattler; Fabian Müller; Peter Ebert; Ivan G Costa; Nico Pfeifer; Thomas Lengauer; Marcel H Schulz; Jörn Walter
Journal:  Nucleic Acids Res       Date:  2019-11-18       Impact factor: 16.971

8.  Determinants of correlated expression of transcription factors and their target genes.

Authors:  Adam B Zaborowski; Dirk Walther
Journal:  Nucleic Acids Res       Date:  2020-11-18       Impact factor: 16.971

Review 9.  Statistical and integrative system-level analysis of DNA methylation data.

Authors:  Andrew E Teschendorff; Caroline L Relton
Journal:  Nat Rev Genet       Date:  2017-11-13       Impact factor: 53.242

10.  Chromatin loop anchors predict transcript and exon usage.

Authors:  Yu Zhang; Yichao Cai; Xavier Roca; Chee Keong Kwoh; Melissa Jane Fullwood
Journal:  Brief Bioinform       Date:  2021-11-05       Impact factor: 11.622

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.