Luca Pinello1,2,3, Rick Farouni1,2, Guo-Cheng Yuan4,5. 1. Department of Molecular Pathology, Massachusetts General Hospital, Boston, MA, USA. 2. Harvard Medical School, Boston, MA, USA. 3. Broad Institute of MIT and Harvard, Cambridge, MA, USA. 4. Dana-Farber Cancer Institute, Boston, MA, USA. 5. Harvard T.H. Chan School of Public Health, Boston, MA, USA.
Abstract
Motivation: With the increasing amount of genomic and epigenomic data in the public domain, a pressing challenge is to integrate these data to investigate the role of epigenetic mechanisms in regulating gene expression and maintenance of cell-identity. To this end, we have implemented a computational pipeline to systematically study epigenetic variability and uncover regulatory DNA sequences. Results: Haystack is a bioinformatics pipeline to identify hotspots of epigenetic variability across different cell-types, cell-type specific cis-regulatory elements, and associated transcription factors. Haystack is generally applicable to any epigenetic mark and provides an important tool to investigate the mechanisms underlying epigenetic switches during development. This software is accompanied by a set of precomputed tracks, which may be used as a valuable resource for functional annotation of the human genome. Availability and implementation: The Haystack pipeline is implemented as an open-source, multiplatform, Python package called haystack_bio freely available at https://github.com/pinellolab/haystack_bio. Contact: lpinello@mgh.harvard.edu or gcyuan@jimmy.harvard.edu. Supplementary information: Supplementary data are available at Bioinformatics online.
Motivation: With the increasing amount of genomic and epigenomic data in the public domain, a pressing challenge is to integrate these data to investigate the role of epigenetic mechanisms in regulating gene expression and maintenance of cell-identity. To this end, we have implemented a computational pipeline to systematically study epigenetic variability and uncover regulatory DNA sequences. Results: Haystack is a bioinformatics pipeline to identify hotspots of epigenetic variability across different cell-types, cell-type specific cis-regulatory elements, and associated transcription factors. Haystack is generally applicable to any epigenetic mark and provides an important tool to investigate the mechanisms underlying epigenetic switches during development. This software is accompanied by a set of precomputed tracks, which may be used as a valuable resource for functional annotation of the human genome. Availability and implementation: The Haystack pipeline is implemented as an open-source, multiplatform, Python package called haystack_bio freely available at https://github.com/pinellolab/haystack_bio. Contact: lpinello@mgh.harvard.edu or gcyuan@jimmy.harvard.edu. Supplementary information: Supplementary data are available at Bioinformatics online.
Authors: Michael M Hoffman; Orion J Buske; Jie Wang; Zhiping Weng; Jeff A Bilmes; William Stafford Noble Journal: Nat Methods Date: 2012-03-18 Impact factor: 28.547
Authors: Sven Heinz; Christopher Benner; Nathanael Spann; Eric Bertolino; Yin C Lin; Peter Laslo; Jason X Cheng; Cornelis Murre; Harinder Singh; Christopher K Glass Journal: Mol Cell Date: 2010-05-28 Impact factor: 17.970
Authors: Bradley E Bernstein; John A Stamatoyannopoulos; Joseph F Costello; Bing Ren; Aleksandar Milosavljevic; Alexander Meissner; Manolis Kellis; Marco A Marra; Arthur L Beaudet; Joseph R Ecker; Peggy J Farnham; Martin Hirst; Eric S Lander; Tarjei S Mikkelsen; James A Thomson Journal: Nat Biotechnol Date: 2010-10 Impact factor: 54.908
Authors: Anthony Mathelier; Oriol Fornes; David J Arenillas; Chih-Yu Chen; Grégoire Denay; Jessica Lee; Wenqiang Shi; Casper Shyr; Ge Tan; Rebecca Worsley-Hunt; Allen W Zhang; François Parcy; Boris Lenhard; Albin Sandelin; Wyeth W Wasserman Journal: Nucleic Acids Res Date: 2015-11-03 Impact factor: 16.971
Authors: Lindsay F Rizzardi; Peter F Hickey; Varenka Rodriguez DiBlasi; Rakel Tryggvadóttir; Colin M Callahan; Adrian Idrizi; Kasper D Hansen; Andrew P Feinberg Journal: Nat Neurosci Date: 2019-01-14 Impact factor: 24.884
Authors: Lei Chen; Shirley Luo; Abigail Dupre; Roshan P Vasoya; Aditya Parthasarathy; Rohit Aita; Raj Malhotra; Joseph Hur; Natalie H Toke; Eric Chiles; Min Yang; Weihuan Cao; Juan Flores; Christopher E Ellison; Nan Gao; Amrik Sahota; Xiaoyang Su; Edward M Bonder; Michael P Verzi Journal: Nat Commun Date: 2021-05-17 Impact factor: 14.919
Authors: Namit Kumar; Yu-Hwai Tsai; Lei Chen; Anbo Zhou; Kushal K Banerjee; Madhurima Saxena; Sha Huang; Natalie H Toke; Jinchuan Xing; Ramesh A Shivdasani; Jason R Spence; Michael P Verzi Journal: Development Date: 2019-03-01 Impact factor: 6.862
Authors: Lei Chen; Natalie H Toke; Shirley Luo; Roshan P Vasoya; Rohit Aita; Aditya Parthasarathy; Yu-Hwai Tsai; Jason R Spence; Michael P Verzi Journal: Development Date: 2019-08-06 Impact factor: 6.862
Authors: Lei Chen; Natalie H Toke; Shirley Luo; Roshan P Vasoya; Robert L Fullem; Aditya Parthasarathy; Ansu O Perekatt; Michael P Verzi Journal: Nat Genet Date: 2019-04-15 Impact factor: 41.307
Authors: Lei Chen; Weihuan Cao; Rohit Aita; Dennis Aldea; Juan Flores; Nan Gao; Edward M Bonder; Christopher E Ellison; Michael P Verzi Journal: Cell Rep Date: 2021-01-26 Impact factor: 9.423