Literature DB >> 27993786

DeFCoM: analysis and modeling of transcription factor binding sites using a motif-centric genomic footprinter.

Bryan Quach1,2,3, Terrence S Furey2,3.   

Abstract

Motivation: Identifying the locations of transcription factor binding sites is critical for understanding how gene transcription is regulated across different cell types and conditions. Chromatin accessibility experiments such as DNaseI sequencing (DNase-seq) and Assay for Transposase Accessible Chromatin sequencing (ATAC-seq) produce genome-wide data that include distinct 'footprint' patterns at binding sites. Nearly all existing computational methods to detect footprints from these data assume that footprint signals are highly homogeneous across footprint sites. Additionally, a comprehensive and systematic comparison of footprinting methods for specifically identifying which motif sites for a specific factor are bound has not been performed.
Results: Using DNase-seq data from the ENCODE project, we show that a large degree of previously uncharacterized site-to-site variability exists in footprint signal across motif sites for a transcription factor. To model this heterogeneity in the data, we introduce a novel, supervised learning footprinter called Detecting Footprints Containing Motifs (DeFCoM). We compare DeFCoM to nine existing methods using evaluation sets from four human cell-lines and eighteen transcription factors and show that DeFCoM outperforms current methods in determining bound and unbound motif sites. We also analyze the impact of several biological and technical factors on the quality of footprint predictions to highlight important considerations when conducting footprint analyses and assessing the performance of footprint prediction methods. Finally, we show that DeFCoM can detect footprints using ATAC-seq data with similar accuracy as when using DNase-seq data. Availability and Implementation: Python code available at https://bitbucket.org/bryancquach/defcom. Contact: bquach@email.unc.edu or tsfurey@email.unc.edu. Supplementary information: Supplementary data are available at Bioinformatics online.
© The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com

Entities:  

Mesh:

Substances:

Year:  2017        PMID: 27993786      PMCID: PMC6075477          DOI: 10.1093/bioinformatics/btw740

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  27 in total

1.  Epigenetic priors for identifying active transcription factor binding sites.

Authors:  Gabriel Cuellar-Partida; Fabian A Buske; Robert C McLeay; Tom Whitington; William Stafford Noble; Timothy L Bailey
Journal:  Bioinformatics       Date:  2011-11-08       Impact factor: 6.937

2.  High-resolution genome-wide in vivo footprinting of diverse transcription factors in human cells.

Authors:  Alan P Boyle; Lingyun Song; Bum-Kyu Lee; Darin London; Damian Keefe; Ewan Birney; Vishwanath R Iyer; Gregory E Crawford; Terrence S Furey
Journal:  Genome Res       Date:  2010-11-24       Impact factor: 9.043

3.  ROCR: visualizing classifier performance in R.

Authors:  Tobias Sing; Oliver Sander; Niko Beerenwinkel; Thomas Lengauer
Journal:  Bioinformatics       Date:  2005-08-11       Impact factor: 6.937

4.  Altering the chromatin landscape for nucleotide excision repair.

Authors:  Ronita Nag; Michael J Smerdon
Journal:  Mutat Res       Date:  2009-01-09       Impact factor: 2.433

5.  F-Seq: a feature density estimator for high-throughput sequence tags.

Authors:  Alan P Boyle; Justin Guinney; Gregory E Crawford; Terrence S Furey
Journal:  Bioinformatics       Date:  2008-09-10       Impact factor: 6.937

6.  DNase footprint signatures are dictated by factor dynamics and DNA sequence.

Authors:  Myong-Hee Sung; Michael J Guertin; Songjoon Baek; Gordon L Hager
Journal:  Mol Cell       Date:  2014-09-18       Impact factor: 17.970

7.  Genome-wide mapping of in vivo protein-DNA interactions.

Authors:  David S Johnson; Ali Mortazavi; Richard M Myers; Barbara Wold
Journal:  Science       Date:  2007-05-31       Impact factor: 47.728

8.  Dynamic reprogramming of chromatin accessibility during Drosophila embryo development.

Authors:  Sean Thomas; Xiao-Yong Li; Peter J Sabo; Richard Sandstrom; Robert E Thurman; Theresa K Canfield; Erika Giste; William Fisher; Ann Hammonds; Susan E Celniker; Mark D Biggin; John A Stamatoyannopoulos
Journal:  Genome Biol       Date:  2011-05-11       Impact factor: 13.583

9.  Wellington: a novel method for the accurate identification of digital genomic footprints from DNase-seq data.

Authors:  Jason Piper; Markus C Elze; Pierre Cauchy; Peter N Cockerill; Constanze Bonifer; Sascha Ott
Journal:  Nucleic Acids Res       Date:  2013-09-25       Impact factor: 16.971

10.  An expansive human regulatory lexicon encoded in transcription factor footprints.

Authors:  Shane Neph; Jeff Vierstra; Andrew B Stergachis; Alex P Reynolds; Eric Haugen; Benjamin Vernot; Robert E Thurman; Sam John; Richard Sandstrom; Audra K Johnson; Matthew T Maurano; Richard Humbert; Eric Rynes; Hao Wang; Shinny Vong; Kristen Lee; Daniel Bates; Morgan Diegel; Vaughn Roach; Douglas Dunn; Jun Neri; Anthony Schafer; R Scott Hansen; Tanya Kutyavin; Erika Giste; Molly Weaver; Theresa Canfield; Peter Sabo; Miaohua Zhang; Gayathri Balasundaram; Rachel Byron; Michael J MacCoss; Joshua M Akey; M A Bender; Mark Groudine; Rajinder Kaul; John A Stamatoyannopoulos
Journal:  Nature       Date:  2012-09-06       Impact factor: 49.962

View more
  11 in total

Review 1.  Interrogating the Accessible Chromatin Landscape of Eukaryote Genomes Using ATAC-seq.

Authors:  Georgi K Marinov; Zohar Shipony
Journal:  Methods Mol Biol       Date:  2021

Review 2.  Genome-wide analysis of chromatin accessibility using ATAC-seq.

Authors:  Tanvi Shashikant; Charles A Ettensohn
Journal:  Methods Cell Biol       Date:  2018-12-21       Impact factor: 1.441

Review 3.  Sequence and chromatin determinants of transcription factor binding and the establishment of cell type-specific binding patterns.

Authors:  Divyanshi Srivastava; Shaun Mahony
Journal:  Biochim Biophys Acta Gene Regul Mech       Date:  2019-10-19       Impact factor: 4.490

4.  Epitome: predicting epigenetic events in novel cell types with multi-cell deep ensemble learning.

Authors:  Alyssa Kramer Morrow; John Weston Hughes; Jahnavi Singh; Anthony Douglas Joseph; Nir Yosef
Journal:  Nucleic Acids Res       Date:  2021-11-08       Impact factor: 16.971

Review 5.  Epigenetic Regulation of Endothelial Cell Lineages During Zebrafish Development-New Insights From Technical Advances.

Authors:  Virginia Panara; Rui Monteiro; Katarzyna Koltowska
Journal:  Front Cell Dev Biol       Date:  2022-05-09

6.  Analytical Approaches for ATAC-seq Data Analysis.

Authors:  Jason P Smith; Nathan C Sheffield
Journal:  Curr Protoc Hum Genet       Date:  2020-06

7.  TRACE: transcription factor footprinting using chromatin accessibility data and DNA sequence.

Authors:  Ningxin Ouyang; Alan P Boyle
Journal:  Genome Res       Date:  2020-07-06       Impact factor: 9.043

8.  XL-DNase-seq: improved footprinting of dynamic transcription factors.

Authors:  Kyu-Seon Oh; Jisu Ha; Songjoon Baek; Myong-Hee Sung
Journal:  Epigenetics Chromatin       Date:  2019-06-04       Impact factor: 4.954

Review 9.  From reads to insight: a hitchhiker's guide to ATAC-seq data analysis.

Authors:  Feng Yan; David R Powell; David J Curtis; Nicholas C Wong
Journal:  Genome Biol       Date:  2020-02-03       Impact factor: 13.583

10.  Evidence of widespread, independent sequence signature for transcription factor cobinding.

Authors:  Manqi Zhou; Hongyang Li; Xueqing Wang; Yuanfang Guan
Journal:  Genome Res       Date:  2020-12-10       Impact factor: 9.043

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.