Literature DB >> 30702424

Generative modeling of multi-mapping reads with mHi-C advances analysis of Hi-C studies.

Ye Zheng1, Ferhat Ay2,3, Sunduz Keles1,4.   

Abstract

Current Hi-C analysis approaches are unable to account for reads that align to multiple locations, and hence underestimate biological signal from repetitive regions of genomes. We developed and validated mHi-C, a multi-read mapping strategy to probabilistically allocate Hi-C multi-reads. mHi-C exhibited superior performance over utilizing only uni-reads and heuristic approaches aimed at rescuing multi-reads on benchmarks. Specifically, mHi-C increased the sequencing depth by an average of 20% resulting in higher reproducibility of contact matrices and detected interactions across biological replicates. The impact of the multi-reads on the detection of significant interactions is influenced marginally by the relative contribution of multi-reads to the sequencing depth compared to uni-reads, cis-to-trans ratio of contacts, and the broad data quality as reflected by the proportion of mappable reads of datasets. Computational experiments highlighted that in Hi-C studies with short read lengths, mHi-C rescued multi-reads can emulate the effect of longer reads. mHi-C also revealed biologically supported bona fide promoter-enhancer interactions and topologically associating domains involving repetitive genomic regions, thereby unlocking a previously masked portion of the genome for conformation capture studies.
© 2019, Zheng et al.

Entities:  

Keywords:  Hi-C; chromosome chromatin capture; computational biology; human; mouse; multi-reads; probabilistic modeling; systems biology

Mesh:

Substances:

Year:  2019        PMID: 30702424      PMCID: PMC6450682          DOI: 10.7554/eLife.38070

Source DB:  PubMed          Journal:  Elife        ISSN: 2050-084X            Impact factor:   8.140


  56 in total

Review 1.  Topology of mammalian developmental enhancers and their regulatory landscapes.

Authors:  Wouter de Laat; Denis Duboule
Journal:  Nature       Date:  2013-10-24       Impact factor: 49.962

Review 2.  Repetitive DNA and next-generation sequencing: computational challenges and solutions.

Authors:  Todd J Treangen; Steven L Salzberg
Journal:  Nat Rev Genet       Date:  2011-11-29       Impact factor: 53.242

3.  GeneCards Version 3: the human gene integrator.

Authors:  Marilyn Safran; Irina Dalah; Justin Alexander; Naomi Rosen; Tsippi Iny Stein; Michael Shmoish; Noam Nativ; Iris Bahir; Tirza Doniger; Hagit Krug; Alexandra Sirota-Madi; Tsviya Olender; Yaron Golan; Gil Stelzer; Arye Harel; Doron Lancet
Journal:  Database (Oxford)       Date:  2010-08-05       Impact factor: 3.451

4.  The BET Protein BRD2 Cooperates with CTCF to Enforce Transcriptional and Architectural Boundaries.

Authors:  Sarah C Hsu; Thomas G Gilgenast; Caroline R Bartman; Christopher R Edwards; Aaron J Stonestrom; Peng Huang; Daniel J Emerson; Perry Evans; Michael T Werner; Cheryl A Keller; Belinda Giardine; Ross C Hardison; Arjun Raj; Jennifer E Phillips-Cremins; Gerd A Blobel
Journal:  Mol Cell       Date:  2017-04-06       Impact factor: 17.970

5.  Mapping long-range promoter contacts in human cells with high-resolution capture Hi-C.

Authors:  Borbala Mifsud; Filipe Tavares-Cadete; Alice N Young; Robert Sugar; Stefan Schoenfelder; Lauren Ferreira; Steven W Wingett; Simon Andrews; William Grey; Philip A Ewels; Bram Herman; Scott Happe; Andy Higgs; Emily LeProust; George A Follows; Peter Fraser; Nicholas M Luscombe; Cameron S Osborne
Journal:  Nat Genet       Date:  2015-05-04       Impact factor: 38.330

Review 6.  Three-dimensional genome architecture: players and mechanisms.

Authors:  Ana Pombo; Niall Dillon
Journal:  Nat Rev Mol Cell Biol       Date:  2015-03-11       Impact factor: 94.444

7.  Three-dimensional modeling of the P. falciparum genome during the erythrocytic cycle reveals a strong connection between genome architecture and gene expression.

Authors:  Ferhat Ay; Evelien M Bunnik; Nelle Varoquaux; Sebastiaan M Bol; Jacques Prudhomme; Jean-Philippe Vert; William Stafford Noble; Karine G Le Roch
Journal:  Genome Res       Date:  2014-03-26       Impact factor: 9.043

8.  HiCRep: assessing the reproducibility of Hi-C data using a stratum-adjusted correlation coefficient.

Authors:  Tao Yang; Feipeng Zhang; Galip Gürkan Yardımcı; Fan Song; Ross C Hardison; William Stafford Noble; Feng Yue; Qunhua Li
Journal:  Genome Res       Date:  2017-08-30       Impact factor: 9.043

9.  The UCSC Genome Browser database: 2017 update.

Authors:  Cath Tyner; Galt P Barber; Jonathan Casper; Hiram Clawson; Mark Diekhans; Christopher Eisenhart; Clayton M Fischer; David Gibson; Jairo Navarro Gonzalez; Luvina Guruvadoo; Maximilian Haeussler; Steve Heitner; Angie S Hinrichs; Donna Karolchik; Brian T Lee; Christopher M Lee; Parisa Nejad; Brian J Raney; Kate R Rosenbloom; Matthew L Speir; Chris Villarreal; John Vivian; Ann S Zweig; David Haussler; Robert M Kuhn; W James Kent
Journal:  Nucleic Acids Res       Date:  2016-11-29       Impact factor: 16.971

10.  Generative modeling of multi-mapping reads with mHi-C advances analysis of Hi-C studies.

Authors:  Ye Zheng; Ferhat Ay; Sunduz Keles
Journal:  Elife       Date:  2019-01-31       Impact factor: 8.140

View more
  11 in total

1.  FreeHi-C spike-in simulations for benchmarking differential chromatin interaction detection.

Authors:  Ye Zheng; Peigen Zhou; Sündüz Keleş
Journal:  Methods       Date:  2020-07-12       Impact factor: 3.608

2.  Chromatin conformation capture (Hi-C) sequencing of patient-derived xenografts: analysis guidelines.

Authors:  Mikhail G Dozmorov; Katarzyna M Tyc; Nathan C Sheffield; David C Boyd; Amy L Olex; Jason Reed; J Chuck Harrell
Journal:  Gigascience       Date:  2021-04-21       Impact factor: 6.524

Review 3.  Mobile genomics: tools and techniques for tackling transposons.

Authors:  Kathryn O'Neill; David Brocks; Molly Gale Hammell
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2020-02-10       Impact factor: 6.237

4.  Spatial integration of transcription and splicing in a dedicated compartment sustains monogenic antigen expression in African trypanosomes.

Authors:  Joana Faria; Vanessa Luzak; Laura S M Müller; Benedikt G Brink; Sebastian Hutchinson; Lucy Glover; David Horn; T Nicolai Siegel
Journal:  Nat Microbiol       Date:  2021-01-11       Impact factor: 17.745

Review 5.  Probably Correct: Rescuing Repeats with Short and Long Reads.

Authors:  Monika Cechova
Journal:  Genes (Basel)       Date:  2020-12-31       Impact factor: 4.096

6.  Diverse Molecular Mechanisms Contribute to Differential Expression of Human Duplicated Genes.

Authors:  Colin J Shew; Paulina Carmona-Mora; Daniela C Soto; Mira Mastoras; Elizabeth Roberts; Joseph Rosas; Dhriti Jagannathan; Gulhan Kaya; Henriette O'Geen; Megan Y Dennis
Journal:  Mol Biol Evol       Date:  2021-07-29       Impact factor: 16.240

7.  Comparison of Capture Hi-C Analytical Pipelines.

Authors:  Dina Aljogol; I Richard Thompson; Cameron S Osborne; Borbala Mifsud
Journal:  Front Genet       Date:  2022-01-28       Impact factor: 4.599

8.  Generative modeling of multi-mapping reads with mHi-C advances analysis of Hi-C studies.

Authors:  Ye Zheng; Ferhat Ay; Sunduz Keles
Journal:  Elife       Date:  2019-01-31       Impact factor: 8.140

9.  FreeHi-C simulates high-fidelity Hi-C data for benchmarking and data augmentation.

Authors:  Ye Zheng; Sündüz Keleş
Journal:  Nat Methods       Date:  2019-11-11       Impact factor: 28.547

10.  Dynamic evolution of great ape Y chromosomes.

Authors:  Monika Cechova; Rahulsimham Vegesna; Marta Tomaszkiewicz; Robert S Harris; Di Chen; Samarth Rangavittal; Paul Medvedev; Kateryna D Makova
Journal:  Proc Natl Acad Sci U S A       Date:  2020-10-05       Impact factor: 11.205

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.