| Literature DB >> 33866359 |
Yunlong Wang1, Yaqi Liu1, Qian Xu1, Yao Xu1, Kai Cao1, Nan Deng1, Ruimin Wang1, Xueying Zhang1, Ruiqin Zheng1, Guoliang Li2, Yaping Fang2.
Abstract
Topologically associated domains (TADs) are one of the important higher order chromatin structures with various sizes in the eukaryotic genomes. TAD boundaries, as the flanking regions between adjacent domains, can restrict the interactions of regulatory elements, including enhancers and promoters, and are generally dynamic and variable in different cells. However, the influence of sequence and epigenetic profile-based features in the identification of TAD boundaries is largely unknown. In this work, we proposed a method called pTADS (prediction of TAD boundary and strength), to predict TAD boundaries and boundary strength across multiple cell lines with DNA sequence and epigenetic profile information. The performance was assessed in seven cell lines and three TAD calling methods. The results demonstrate that the TAD boundary can be well predicted by the selected shared features across multiple cell lines. Especially, the model can be transferable to predict the TAD boundary from one cell line to other cell lines. The boundary strength can be characterized by boundary score with good performance. The predicted TAD boundary and TAD boundary strength are further confirmed by three Hi-C contact matrix-based methods across multiple cell lines. The codes and datasets are available at https://github.com/chrom3DEpi/pTADS.Keywords: TAD boundary; boundary score; boundary strength; epigenetic profile; machine learning
Year: 2021 PMID: 33866359 DOI: 10.1093/bib/bbab139
Source DB: PubMed Journal: Brief Bioinform ISSN: 1467-5463 Impact factor: 11.622