Literature DB >> 33232622

Deep Learning of Sequence Patterns for CCCTC-Binding Factor-Mediated Chromatin Loop Formation.

Shuzhen Kuang1,2, Liangjiang Wang1.   

Abstract

The three-dimensional (3D) organization of the human genome is of crucial importance for gene regulation, and the CCCTC-binding factor (CTCF) plays an important role in chromatin interactions. However, it is still unclear what sequence patterns in addition to CTCF motif pairs determine chromatin loop formation. To discover the underlying sequence patterns, we have developed a deep learning model, called DeepCTCFLoop, to predict whether a chromatin loop can be formed between a pair of convergent or tandem CTCF motifs using only the DNA sequences of the motifs and their flanking regions. Our results suggest that DeepCTCFLoop can accurately distinguish the CTCF motif pairs forming chromatin loops from the ones not forming loops. It significantly outperforms CTCF-MP, a machine learning model based on word2vec and boosted trees, when using DNA sequences only. Furthermore, we show that DNA motifs binding to several transcription factors, including ZNF384, ZNF263, ASCL1, SP1, and ZEB1, may constitute the complex sequence patterns for CTCF-mediated chromatin loop formation. DeepCTCFLoop has also been applied to disease-associated sequence variants to identify candidates that may disrupt chromatin loop formation. Therefore, our results provide useful information for understanding the mechanism of 3D genome organization and may also help annotate and prioritize the noncoding sequence variants associated with human diseases.

Entities:  

Keywords:  3D genome; CTCF; chromatin loops; deep learning; sequence motifs

Year:  2020        PMID: 33232622     DOI: 10.1089/cmb.2020.0225

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  3 in total

1.  InsuLock: A Weakly Supervised Learning Approach for Accurate Insulator Prediction, and Variant Impact Quantification.

Authors:  Shushrruth Sai Srinivasan; Yanwen Gong; Siwei Xu; Ahyeon Hwang; Min Xu; Matthew J Girgenti; Jing Zhang
Journal:  Genes (Basel)       Date:  2022-03-30       Impact factor: 4.141

2.  Rational, Unbiased Selection of Reference Genes for Pluripotent Stem Cell-Derived Cardiomyocytes.

Authors:  Aaron D Simmons; Sean P Palecek
Journal:  Tissue Eng Part C Methods       Date:  2021-05       Impact factor: 3.056

3.  Shaping modern human skull through epigenetic, transcriptional and post-transcriptional regulation of the RUNX2 master bone gene.

Authors:  Lorena Di Pietro; Marta Barba; Daniela Palacios; Federica Tiberio; Chiara Prampolini; Mirko Baranzini; Ornella Parolini; Alessandro Arcovito; Wanda Lattanzi
Journal:  Sci Rep       Date:  2021-10-29       Impact factor: 4.379

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.