Literature DB >> 31777934

Sharing DNA-binding information across structurally similar proteins enables accurate specificity determination.

Joshua L Wetzel1,2, Mona Singh1,2.   

Abstract

We are now in an era where protein-DNA interactions have been experimentally assayed for thousands of DNA-binding proteins. In order to infer DNA-binding specificities from these data, numerous sophisticated computational methods have been developed. These approaches typically infer DNA-binding specificities by considering interactions for each protein independently, ignoring related and potentially valuable interaction information across other proteins that bind DNA via the same structural domain. Here we introduce a framework for inferring DNA-binding specificities by considering protein-DNA interactions for entire groups of structurally similar proteins simultaneously. We devise both constrained optimization and label propagation algorithms for this task, each balancing observations at the individual protein level against dataset-wide consistency of interaction preferences. We test our approaches on two large, independent Cys2His2 zinc finger protein-DNA interaction datasets. We demonstrate that jointly inferring specificities within each dataset individually dramatically improves accuracy, leading to increased agreement both between these two datasets and with a fixed external standard. Overall, our results suggest that sharing protein-DNA interaction information across structurally similar proteins is a powerful means to enable accurate inference of DNA-binding specificities.
© The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Substances:

Year:  2020        PMID: 31777934      PMCID: PMC7028011          DOI: 10.1093/nar/gkz1087

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  49 in total

1.  Announcing the worldwide Protein Data Bank.

Authors:  Helen Berman; Kim Henrick; Haruki Nakamura
Journal:  Nat Struct Biol       Date:  2003-12

2.  Discriminative motif optimization based on perceptron training.

Authors:  Ronak Y Patel; Gary D Stormo
Journal:  Bioinformatics       Date:  2013-12-24       Impact factor: 6.937

3.  Analysis of homeodomain specificities allows the family-wide prediction of preferred recognition sites.

Authors:  Marcus B Noyes; Ryan G Christensen; Atsuya Wakabayashi; Gary D Stormo; Michael H Brodsky; Scot A Wolfe
Journal:  Cell       Date:  2008-06-27       Impact factor: 41.582

Review 4.  The Human Transcription Factors.

Authors:  Samuel A Lambert; Arttu Jolma; Laura F Campitelli; Pratyush K Das; Yimeng Yin; Mihai Albu; Xiaoting Chen; Jussi Taipale; Timothy R Hughes; Matthew T Weirauch
Journal:  Cell       Date:  2018-02-08       Impact factor: 41.582

5.  Position-based sequence weights.

Authors:  S Henikoff; J G Henikoff
Journal:  J Mol Biol       Date:  1994-11-04       Impact factor: 5.469

6.  Modeling protein-DNA binding via high-throughput in vitro technologies.

Authors:  Yaron Orenstein; Ron Shamir
Journal:  Brief Funct Genomics       Date:  2017-05-01       Impact factor: 4.241

7.  BioLiP: a semi-manually curated database for biologically relevant ligand-protein interactions.

Authors:  Jianyi Yang; Ambrish Roy; Yang Zhang
Journal:  Nucleic Acids Res       Date:  2012-10-18       Impact factor: 16.971

8.  OnTheFly: a database of Drosophila melanogaster transcription factors and their binding sites.

Authors:  Shula Shazman; Hunjoong Lee; Yakov Socol; Richard S Mann; Barry Honig
Journal:  Nucleic Acids Res       Date:  2013-11-22       Impact factor: 16.971

9.  Building accurate sequence-to-affinity models from high-throughput in vitro protein-DNA binding data using FeatureREDUCE.

Authors:  Todd R Riley; Allan Lazarovici; Richard S Mann; Harmen J Bussemaker
Journal:  Elife       Date:  2015-12-23       Impact factor: 8.140

10.  Alternative evolutionary histories in the sequence space of an ancient protein.

Authors:  Tyler N Starr; Lora K Picton; Joseph W Thornton
Journal:  Nature       Date:  2017-09-13       Impact factor: 49.962

View more
  1 in total

1.  Learning probabilistic protein-DNA recognition codes from DNA-binding specificities using structural mappings.

Authors:  Joshua L Wetzel; Kaiqian Zhang; Mona Singh
Journal:  Genome Res       Date:  2022-09-19       Impact factor: 9.438

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.