Literature DB >> 35488276

A systematic evaluation of Hi-C data enhancement methods for enhancing PLAC-seq and HiChIP data.

Le Huang1, Yuchen Yang2, Gang Li3, Minzhi Jiang4, Jia Wen5, Armen Abnousi6, Jonathan D Rosen7, Ming Hu6, Yun Li5,7,8.   

Abstract

The three-dimensional organization of chromatin plays a critical role in gene regulation. Recently developed technologies, such as HiChIP and proximity ligation-assisted ChIP-Seq (PLAC-seq) (hereafter referred to as HP for brevity), can measure chromosome spatial organization by interrogating chromatin interactions mediated by a protein of interest. While offering cost-efficiency over genome-wide unbiased high-throughput chromosome conformation capture (Hi-C) data, HP data remain sparse at kilobase (Kb) resolution with the current sequencing depth in the order of 108 reads per sample. Deep learning models, including HiCPlus, HiCNN, HiCNN2, DeepHiC and Variationally Encoded Hi-C Loss Enhancer (VEHiCLE), have been developed to enhance the sequencing depth of Hi-C data, but their performance on HP data has not been benchmarked. Here, we performed a comprehensive evaluation of HP data sequencing depth enhancement using models developed for Hi-C data. Specifically, we analyzed various HP data, including Smc1a HiChIP data of the human lymphoblastoid cell line GM12878, H3K4me3 PLAC-seq data of four human neural cell types as well as of mouse embryonic stem cells (mESC), and mESC CCCTC-binding factor (CTCF) PLAC-seq data. Our evaluations lead to the following three findings: (i) most models developed for Hi-C data achieve reasonable performance when applied to HP data (e.g. with Pearson correlation ranging 0.76-0.95 for pairs of loci within 300 Kb), and the enhanced datasets lead to improved statistical power for detecting long-range chromatin interactions, (ii) models trained on HP data outperform those trained on Hi-C data and (iii) most models are transferable across cell types. Our results provide a general guideline for HP data enhancement using existing methods designed for Hi-C data.
© The Author(s) 2022. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  Hi-C; HiChIP; PLAC-seq; deep learning; enhancement; evaluation

Mesh:

Substances:

Year:  2022        PMID: 35488276      PMCID: PMC9116213          DOI: 10.1093/bib/bbac145

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   13.994


  25 in total

1.  Mapping of long-range chromatin interactions by proximity ligation-assisted ChIP-seq.

Authors:  Rongxin Fang; Miao Yu; Guoqiang Li; Sora Chee; Tristin Liu; Anthony D Schmitt; Bing Ren
Journal:  Cell Res       Date:  2016-11-25       Impact factor: 25.617

Review 2.  Gene regulation in the 3D genome.

Authors:  Yun Li; Ming Hu; Yin Shen
Journal:  Hum Mol Genet       Date:  2018-08-01       Impact factor: 6.150

3.  On Brownian Distance Covariance and High Dimensional Data.

Authors:  Michael R Kosorok
Journal:  Ann Appl Stat       Date:  2009-01-01       Impact factor: 2.083

4.  HiChIP: efficient and sensitive analysis of protein-directed genome architecture.

Authors:  Maxwell R Mumbach; Adam J Rubin; Ryan A Flynn; Chao Dai; Paul A Khavari; William J Greenleaf; Howard Y Chang
Journal:  Nat Methods       Date:  2016-09-19       Impact factor: 28.547

5.  Exploring long-range genome interactions using the WashU Epigenome Browser.

Authors:  Xin Zhou; Rebecca F Lowdon; Daofeng Li; Heather A Lawson; Pamela A F Madden; Joseph F Costello; Ting Wang
Journal:  Nat Methods       Date:  2013-05       Impact factor: 28.547

6.  Comprehensive mapping of long-range interactions reveals folding principles of the human genome.

Authors:  Erez Lieberman-Aiden; Nynke L van Berkum; Louise Williams; Maxim Imakaev; Tobias Ragoczy; Agnes Telling; Ido Amit; Bryan R Lajoie; Peter J Sabo; Michael O Dorschner; Richard Sandstrom; Bradley Bernstein; M A Bender; Mark Groudine; Andreas Gnirke; John Stamatoyannopoulos; Leonid A Mirny; Eric S Lander; Job Dekker
Journal:  Science       Date:  2009-10-09       Impact factor: 47.728

7.  The pluripotent regulatory circuitry connecting promoters to their long-range interacting elements.

Authors:  Stefan Schoenfelder; Mayra Furlan-Magaril; Borbala Mifsud; Filipe Tavares-Cadete; Robert Sugar; Biola-Maria Javierre; Takashi Nagano; Yulia Katsman; Moorthy Sakthidevi; Steven W Wingett; Emilia Dimitrova; Andrew Dimond; Lucas B Edelman; Sarah Elderkin; Kristina Tabbada; Elodie Darbo; Simon Andrews; Bram Herman; Andy Higgs; Emily LeProust; Cameron S Osborne; Jennifer A Mitchell; Nicholas M Luscombe; Peter Fraser
Journal:  Genome Res       Date:  2015-03-09       Impact factor: 9.043

8.  HiC-spector: a matrix library for spectral and reproducibility analysis of Hi-C contact maps.

Authors:  Koon-Kiu Yan; Galip Gürkan Yardimci; Chengfei Yan; William S Noble; Mark Gerstein
Journal:  Bioinformatics       Date:  2017-07-15       Impact factor: 6.937

9.  Enhancing Hi-C data resolution with deep convolutional neural network HiCPlus.

Authors:  Yan Zhang; Lin An; Jie Xu; Bo Zhang; W Jim Zheng; Ming Hu; Jijun Tang; Feng Yue
Journal:  Nat Commun       Date:  2018-02-21       Impact factor: 14.919

10.  DeepHiC: A generative adversarial network for enhancing Hi-C data resolution.

Authors:  Hao Hong; Shuai Jiang; Hao Li; Guifang Du; Yu Sun; Huan Tao; Cheng Quan; Chenghui Zhao; Ruijiang Li; Wanying Li; Xiaoyao Yin; Yangchen Huang; Cheng Li; Hebing Chen; Xiaochen Bo
Journal:  PLoS Comput Biol       Date:  2020-02-21       Impact factor: 4.475

View more
  1 in total

Review 1.  Understanding the function of regulatory DNA interactions in the interpretation of non-coding GWAS variants.

Authors:  Wujuan Zhong; Weifang Liu; Jiawen Chen; Quan Sun; Ming Hu; Yun Li
Journal:  Front Cell Dev Biol       Date:  2022-08-19
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.