Literature DB >> 27045826

A Comparison Study for DNA Motif Modeling on Protein Binding Microarray.

Ka-Chun Wong, Yue Li, Chengbin Peng, Hau-San Wong.   

Abstract

Transcription factor binding sites (TFBSs) are relatively short (5-15 bp) and degenerate. Identifying them is a computationally challenging task. In particular, protein binding microarray (PBM) is a high-throughput platform that can measure the DNA binding preference of a protein in a comprehensive and unbiased manner; for instance, a typical PBM experiment can measure binding signal intensities of a protein to all possible DNA k-mers (k = 8∼10). Since proteins can often bind to DNA with different binding intensities, one of the major challenges is to build TFBS (also known as DNA motif) models which can fully capture the quantitative binding affinity data. To learn DNA motif models from the non-convex objective function landscape, several optimization methods are compared and applied to the PBM motif model building problem. In particular, representative methods from different optimization paradigms have been chosen for modeling performance comparison on hundreds of PBM datasets. The results suggest that the multimodal optimization methods are very effective for capturing the binding preference information from PBM data. In particular, we observe a general performance improvement if choosing di-nucleotide modeling over mono-nucleotide modeling. In addition, the models learned by the best-performing method are applied to two independent applications: PBM probe rotation testing and ChIP-Seq peak sequence prediction, demonstrating its biological applicability.

Entities:  

Mesh:

Substances:

Year:  2016        PMID: 27045826     DOI: 10.1109/TCBB.2015.2443782

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  4 in total

1.  HIV- 1 lentivirus tethering to the genome is associated with transcription factor binding sites found in genes that favour virus survival.

Authors:  Saqlain Suleman; Annette Payne; Johnathan Bowden; Sharmin Al Haque; Marco Zahn; Serena Fawaz; Mohammad S Khalifa; Susan Jobling; David Hay; Matteo Franco; Raffaele Fronza; Wei Wang; Olga Strobel-Freidekind; Annette Deichmann; Yasuhiro Takeuchi; Simon N Waddington; Irene Gil-Farina; Manfred Schmidt; Michael Themis
Journal:  Gene Ther       Date:  2022-05-05       Impact factor: 5.250

Review 2.  Navigating the pitfalls of applying machine learning in genomics.

Authors:  Sean Whalen; Jacob Schreiber; William S Noble; Katherine S Pollard
Journal:  Nat Rev Genet       Date:  2021-11-26       Impact factor: 53.242

3.  An improved bind-n-seq strategy to determine protein-DNA interactions validated using the bacterial transcriptional regulator YipR.

Authors:  Shi-Qi An; Miguel A Valvano; Yan-Hua Yu; Jeremy S Webb; Guillermo Lopez Campos
Journal:  BMC Microbiol       Date:  2020-01-02       Impact factor: 3.605

4.  iProDNA-CapsNet: identifying protein-DNA binding residues using capsule neural networks.

Authors:  Binh P Nguyen; Quang H Nguyen; Giang-Nam Doan-Ngoc; Thanh-Hoang Nguyen-Vo; Susanto Rahardja
Journal:  BMC Bioinformatics       Date:  2019-12-27       Impact factor: 3.169

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.