Literature DB >> 26130577

A DNA shape-based regulatory score improves position-weight matrix-based recognition of transcription factor binding sites.

Jichen Yang1, Stephen A Ramsey2.   

Abstract

MOTIVATION: The position-weight matrix (PWM) is a useful representation of a transcription factor binding site (TFBS) sequence pattern because the PWM can be estimated from a small number of representative TFBS sequences. However, because the PWM probability model assumes independence between individual nucleotide positions, the PWMs for some TFs poorly discriminate binding sites from non-binding-sites that have similar sequence content. Since the local three-dimensional DNA structure ('shape') is a determinant of TF binding specificity and since DNA shape has a significant sequence-dependence, we combined DNA shape-derived features into a TF-generalized regulatory score and tested whether the score could improve PWM-based discrimination of TFBS from non-binding-sites.
RESULTS: We compared a traditional PWM model to a model that combines the PWM with a DNA shape feature-based regulatory potential score, for accuracy in detecting binding sites for 75 vertebrate transcription factors. The PWM+shape model was more accurate than the PWM-only model, for 45% of TFs tested, with no significant loss of accuracy for the remaining TFs.
AVAILABILITY AND IMPLEMENTATION: The shape-based model is available as an open-source R package at that is archived on the GitHub software repository at https://github.com/ramseylab/regshape/. CONTACT: stephen.ramsey@oregonstate.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 26130577      PMCID: PMC4838056          DOI: 10.1093/bioinformatics/btv391

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  33 in total

1.  A genomic code for nucleosome positioning.

Authors:  Eran Segal; Yvonne Fondufe-Mittendorf; Lingyi Chen; AnnChristine Thåström; Yair Field; Irene K Moore; Ji-Ping Z Wang; Jonathan Widom
Journal:  Nature       Date:  2006-07-19       Impact factor: 49.962

2.  Compact, universal DNA microarrays to comprehensively determine transcription-factor binding site specificities.

Authors:  Michael F Berger; Anthony A Philippakis; Aaron M Qureshi; Fangxue S He; Preston W Estep; Martha L Bulyk
Journal:  Nat Biotechnol       Date:  2006-09-24       Impact factor: 54.908

3.  Integrating genomic data to predict transcription factor binding.

Authors:  Dustin T Holloway; Mark Kon; Charles DeLisi
Journal:  Genome Inform       Date:  2005

4.  Cross-species de novo identification of cis-regulatory modules with GibbsModule: application to gene regulation in embryonic stem cells.

Authors:  Dan Xie; Jun Cai; Na-Yu Chia; Huck H Ng; Sheng Zhong
Journal:  Genome Res       Date:  2008-05-15       Impact factor: 9.043

Review 5.  ChIP-seq: advantages and challenges of a maturing technology.

Authors:  Peter J Park
Journal:  Nat Rev Genet       Date:  2009-09-08       Impact factor: 53.242

6.  PeakSeq enables systematic scoring of ChIP-seq experiments relative to controls.

Authors:  Joel Rozowsky; Ghia Euskirchen; Raymond K Auerbach; Zhengdong D Zhang; Theodore Gibson; Robert Bjornson; Nicholas Carriero; Michael Snyder; Mark B Gerstein
Journal:  Nat Biotechnol       Date:  2009-01-04       Impact factor: 54.908

7.  Genome-wide mapping of in vivo protein-DNA interactions.

Authors:  David S Johnson; Ali Mortazavi; Richard M Myers; Barbara Wold
Journal:  Science       Date:  2007-05-31       Impact factor: 47.728

8.  Stubb: a program for discovery and analysis of cis-regulatory modules.

Authors:  Saurabh Sinha; Yupu Liang; Eric Siggia
Journal:  Nucleic Acids Res       Date:  2006-07-01       Impact factor: 16.971

9.  Integration of genome and chromatin structure with gene expression profiles to predict c-MYC recognition site binding and function.

Authors:  Yili Chen; Thomas W Blackwell; Ji Chen; Jing Gao; Angel W Lee; David J States
Journal:  PLoS Comput Biol       Date:  2007-04-06       Impact factor: 4.475

10.  Probabilistic inference of transcription factor binding from multiple data sources.

Authors:  Harri Lähdesmäki; Alistair G Rust; Ilya Shmulevich
Journal:  PLoS One       Date:  2008-03-26       Impact factor: 3.240

View more
  8 in total

1.  Expanding the repertoire of DNA shape features for genome-scale studies of transcription factor binding.

Authors:  Jinsen Li; Jared M Sagendorf; Tsu-Pei Chiu; Marco Pasi; Alberto Perez; Remo Rohs
Journal:  Nucleic Acids Res       Date:  2017-12-15       Impact factor: 16.971

Review 2.  Sequence and chromatin determinants of transcription factor binding and the establishment of cell type-specific binding patterns.

Authors:  Divyanshi Srivastava; Shaun Mahony
Journal:  Biochim Biophys Acta Gene Regul Mech       Date:  2019-10-19       Impact factor: 4.490

3.  DNA Shape Features Improve Transcription Factor Binding Site Predictions In Vivo.

Authors:  Anthony Mathelier; Beibei Xin; Tsu-Pei Chiu; Lin Yang; Remo Rohs; Wyeth W Wasserman
Journal:  Cell Syst       Date:  2016-08-18       Impact factor: 10.304

4.  Quantitative modeling of gene expression using DNA shape features of binding sites.

Authors:  Pei-Chen Peng; Saurabh Sinha
Journal:  Nucleic Acids Res       Date:  2016-06-01       Impact factor: 16.971

5.  Predicting Variation of DNA Shape Preferences in Protein-DNA Interaction in Cancer Cells with a New Biophysical Model.

Authors:  Kirill Batmanov; Junbai Wang
Journal:  Genes (Basel)       Date:  2017-09-18       Impact factor: 4.096

6.  Landscape of transcriptional deregulation in lung cancer.

Authors:  Shu Zhang; Mingfa Li; Hongbin Ji; Zhaoyuan Fang
Journal:  BMC Genomics       Date:  2018-06-05       Impact factor: 3.969

7.  A unified approach for quantifying and interpreting DNA shape readout by transcription factors.

Authors:  H Tomas Rube; Chaitanya Rastogi; Judith F Kribelbauer; Harmen J Bussemaker
Journal:  Mol Syst Biol       Date:  2018-02-22       Impact factor: 11.429

8.  PWM2Vec: An Efficient Embedding Approach for Viral Host Specification from Coronavirus Spike Sequences.

Authors:  Sarwan Ali; Babatunde Bello; Prakash Chourasia; Ria Thazhe Punathil; Yijing Zhou; Murray Patterson
Journal:  Biology (Basel)       Date:  2022-03-09
  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.