Axel Poulet1, Ben Li1, Tristan Dubos2, Juan Carlos Rivera-Mulia3, David M Gilbert3, Zhaohui S Qin1. 1. Department of Biostatistics and Bioinformatics, Rollins School of Public Health, Emory University, Atlanta, GA, USA. 2. Laboratoire de Génétique, CHRU-Nancy, Nancy, France. 3. Department of Biological Science, Center for Genomics and Personalized Medicine, Florida State University, Tallahassee, FL, USA.
Abstract
MOTIVATION: The replication timing (RT) program has been linked to many key biological processes including cell fate commitment, 3D chromatin organization and transcription regulation. Significant technology progress now allows to characterize the RT program in the entire human genome in a high-throughput and high-resolution fashion. These experiments suggest that RT changes dynamically during development in coordination with gene activity. Since RT is such a fundamental biological process, we believe that an effective quantitative profile of the local RT program from a diverse set of cell types in various developmental stages and lineages can provide crucial biological insights for a genomic locus. RESULTS: In this study, we explored recurrent and spatially coherent combinatorial profiles from 42 RT programs collected from multiple lineages at diverse differentiation states. We found that a Hidden Markov Model with 15 hidden states provide a good model to describe these genome-wide RT profiling data. Each of the hidden state represents a unique combination of RT profiles across different cell types which we refer to as 'RT states'. To understand the biological properties of these RT states, we inspected their relationship with chromatin states, gene expression, functional annotation and 3D chromosomal organization. We found that the newly defined RT states possess interesting genome-wide functional properties that add complementary information to the existing annotation of the human genome. AVAILABILITY AND IMPLEMENTATION: R scripts for inferring HMM models and Perl scripts for further analysis are available https://github.com/PouletAxel/script_HMM_Replication_timing. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
MOTIVATION: The replication timing (RT) program has been linked to many key biological processes including cell fate commitment, 3D chromatin organization and transcription regulation. Significant technology progress now allows to characterize the RT program in the entire human genome in a high-throughput and high-resolution fashion. These experiments suggest that RT changes dynamically during development in coordination with gene activity. Since RT is such a fundamental biological process, we believe that an effective quantitative profile of the local RT program from a diverse set of cell types in various developmental stages and lineages can provide crucial biological insights for a genomic locus. RESULTS: In this study, we explored recurrent and spatially coherent combinatorial profiles from 42 RT programs collected from multiple lineages at diverse differentiation states. We found that a Hidden Markov Model with 15 hidden states provide a good model to describe these genome-wide RT profiling data. Each of the hidden state represents a unique combination of RT profiles across different cell types which we refer to as 'RT states'. To understand the biological properties of these RT states, we inspected their relationship with chromatin states, gene expression, functional annotation and 3D chromosomal organization. We found that the newly defined RT states possess interesting genome-wide functional properties that add complementary information to the existing annotation of the human genome. AVAILABILITY AND IMPLEMENTATION: R scripts for inferring HMM models and Perl scripts for further analysis are available https://github.com/PouletAxel/script_HMM_Replication_timing. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Authors: Lars Guelen; Ludo Pagie; Emilie Brasset; Wouter Meuleman; Marius B Faza; Wendy Talhout; Bert H Eussen; Annelies de Klein; Lodewyk Wessels; Wouter de Laat; Bas van Steensel Journal: Nature Date: 2008-05-07 Impact factor: 49.962
Authors: Amnon Koren; Paz Polak; James Nemesh; Jacob J Michaelson; Jonathan Sebat; Shamil R Sunyaev; Steven A McCarroll Journal: Am J Hum Genet Date: 2012-11-21 Impact factor: 11.025
Authors: Alan P Boyle; Eurie L Hong; Manoj Hariharan; Yong Cheng; Marc A Schaub; Maya Kasowski; Konrad J Karczewski; Julie Park; Benjamin C Hitz; Shuai Weng; J Michael Cherry; Michael Snyder Journal: Genome Res Date: 2012-09 Impact factor: 9.043
Authors: Jason Ernst; Pouya Kheradpour; Tarjei S Mikkelsen; Noam Shoresh; Lucas D Ward; Charles B Epstein; Xiaolan Zhang; Li Wang; Robbyn Issner; Michael Coyne; Manching Ku; Timothy Durham; Manolis Kellis; Bradley E Bernstein Journal: Nature Date: 2011-03-23 Impact factor: 49.962
Authors: Ming Hu; Ke Deng; Zhaohui Qin; Jesse Dixon; Siddarth Selvaraj; Jennifer Fang; Bing Ren; Jun S Liu Journal: PLoS Comput Biol Date: 2013-01-31 Impact factor: 4.475