Assessing deep learning methods in cis-regulatory motif finding based on genomic sequencing data.

Shuangquan Zhang¹, Anjun Ma², Jing Zhao², Dong Xu³, Qin Ma², Yan Wang¹,⁴.

Abstract

Identifying cis-regulatory motifs from genomic sequencing data (e.g. ChIP-seq and CLIP-seq) is crucial for identifying transcription factor (TF) binding sites and inferring gene regulatory mechanisms in any organism. Since 2015, deep learning (DL) methods have been widely applied to identify TF binding sites and predict motif patterns, offering a scalable, flexible and unified computational approach for highly accurate predictions. As far as we know, 20 DL methods have been developed. However, without a clear and systematic assessment, users will struggle to choose the most appropriate tool for their specific studies. In this manuscript, we evaluated 20 DL methods for cis-regulatory motif prediction using 690 ENCODE ChIP-seq, 126 cancer ChIP-seq and 55 RNA CLIP-seq datasets. Four metrics were investigated: the accuracy of motif finding, the performance of DNA/RNA sequence classification, algorithm scalability and tool usability. The assessment results demonstrated the high complementarity of the existing DL methods. We conclude that the choice of the most suitable model should depend primarily on the data size and type and on the method's outputs.

Entities: Chemical

Keywords: CLIP-seq; ChIP-seq; TF binding sites identification; deep learning method assessment; motif prediction

Substances:
Transcription Factors

Year: 2022; PMID: 34607350; PMCID: PMC8769700; DOI: 10.1093/bib/bbab374

Source DB: PubMed; Journal: Brief Bioinform; ISSN: 1467-5463; Impact factor: 13.994

