Literature DB >> 34607350

Assessing deep learning methods in cis-regulatory motif finding based on genomic sequencing data.

Shuangquan Zhang1, Anjun Ma2, Jing Zhao2, Dong Xu3, Qin Ma2, Yan Wang1,4.   

Abstract

Identifying cis-regulatory motifs from genomic sequencing data (e.g. ChIP-seq and CLIP-seq) is crucial in identifying transcription factor (TF) binding sites and inferring gene regulatory mechanisms for any organism. Since 2015, deep learning (DL) methods have been widely applied to identify TF binding sites and predict motif patterns, with the strengths of offering a scalable, flexible and unified computational approach for highly accurate predictions. As far as we know, 20 DL methods have been developed. However, without a clear and systematic assessment, users will struggle to choose the most appropriate tool for their specific studies. In this manuscript, we evaluated 20 DL methods for cis-regulatory motif prediction using 690 ENCODE ChIP-seq, 126 cancer ChIP-seq and 55 RNA CLIP-seq data. Four metrics were investigated, including the accuracy of motif finding, the performance of DNA/RNA sequence classification, algorithm scalability and tool usability. The assessment results demonstrated the high complementarity of the existing DL methods. It was determined that the most suitable model should primarily depend on the data size and type and the method's outputs.
© The Author(s) 2021. Published by Oxford University Press.

Entities:  

Keywords:  CLIP-seq; ChIP-seq; TF binding sites identification; deep learning method assessment; motif prediction

Mesh:

Substances:

Year:  2022        PMID: 34607350      PMCID: PMC8769700          DOI: 10.1093/bib/bbab374

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   13.994


  54 in total

1.  DeeperBind: Enhancing Prediction of Sequence Specificities of DNA Binding Proteins.

Authors:  Hamid Reza Hassanzadeh; May D Wang
Journal:  Proceedings (IEEE Int Conf Bioinformatics Biomed)       Date:  2017-01-19

2.  What are DNA sequence motifs?

Authors:  Patrik D'haeseleer
Journal:  Nat Biotechnol       Date:  2006-04       Impact factor: 54.908

3.  Motif discovery and transcription factor binding sites before and after the next-generation sequencing era.

Authors:  Federico Zambelli; Graziano Pesole; Giulio Pavesi
Journal:  Brief Bioinform       Date:  2012-04-19       Impact factor: 11.622

4.  COP1, the negative regulator of ETV1, influences prognosis in triple-negative breast cancer.

Authors:  Mao Ouyang; Hua Wang; Jieyi Ma; Weiming Lü; Jie Li; Chen Yao; Guangqi Chang; Jiong Bi; Shenming Wang; Wenjian Wang
Journal:  BMC Cancer       Date:  2015-03-15       Impact factor: 4.430

5.  DanQ: a hybrid convolutional and recurrent deep neural network for quantifying the function of DNA sequences.

Authors:  Daniel Quang; Xiaohui Xie
Journal:  Nucleic Acids Res       Date:  2016-04-15       Impact factor: 16.971

6.  Deep-RBPPred: Predicting RNA binding proteins in the proteome scale based on deep learning.

Authors:  Jinfang Zheng; Xiaoli Zhang; Xunyi Zhao; Xiaoxue Tong; Xu Hong; Juan Xie; Shiyong Liu
Journal:  Sci Rep       Date:  2018-10-15       Impact factor: 4.379

7.  Modeling in-vivo protein-DNA binding by combining multiple-instance learning with a hybrid deep neural network.

Authors:  Qinhu Zhang; Zhen Shen; De-Shuang Huang
Journal:  Sci Rep       Date:  2019-06-11       Impact factor: 4.379

8.  JASPAR 2020: update of the open-access database of transcription factor binding profiles.

Authors:  Oriol Fornes; Jaime A Castro-Mondragon; Aziz Khan; Robin van der Lee; Xi Zhang; Phillip A Richmond; Bhavi P Modi; Solenne Correard; Marius Gheorghe; Damir Baranašić; Walter Santana-Garcia; Ge Tan; Jeanne Chèneby; Benoit Ballester; François Parcy; Albin Sandelin; Boris Lenhard; Wyeth W Wasserman; Anthony Mathelier
Journal:  Nucleic Acids Res       Date:  2020-01-08       Impact factor: 16.971

9.  Combining Pareto-optimal clusters using supervised learning for identifying co-expressed genes.

Authors:  Ujjwal Maulik; Anirban Mukhopadhyay; Sanghamitra Bandyopadhyay
Journal:  BMC Bioinformatics       Date:  2009-01-20       Impact factor: 3.169

10.  DeFine: deep convolutional neural networks accurately quantify intensities of transcription factor-DNA binding and facilitate evaluation of functional non-coding variants.

Authors:  Meng Wang; Cheng Tai; Weinan E; Liping Wei
Journal:  Nucleic Acids Res       Date:  2018-06-20       Impact factor: 16.971

View more
  3 in total

1.  DESSO-DB: A web database for sequence and shape motif analyses and identification.

Authors:  Xiaoying Wang; Cankun Wang; Lang Li; Qin Ma; Anjun Ma; Bingqiang Liu
Journal:  Comput Struct Biotechnol J       Date:  2022-06-16       Impact factor: 6.155

2.  Predicting miRNA-disease associations based on multi-view information fusion.

Authors:  Xuping Xie; Yan Wang; Nan Sheng; Shuangquan Zhang; Yangkun Cao; Yuan Fu
Journal:  Front Genet       Date:  2022-09-27       Impact factor: 4.772

3.  MMGraph: a multiple motif predictor based on graph neural network and coexisting probability for ATAC-seq data.

Authors:  Shuangquan Zhang; Lili Yang; Xiaotian Wu; Nan Sheng; Yuan Fu; Anjun Ma; Yan Wang
Journal:  Bioinformatics       Date:  2022-08-23       Impact factor: 6.931

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.