Literature DB >> 33665151

High-performance method for identification of super enhancers from ChIP-Seq data with configurable cloud virtual machines.

Natalia N Orlova1, Olga V Bogatova1, Alexey V Orlov1,2.   

Abstract

A universal method for rapid identifying super-enhancers which are large domains of multiple closely-spaced enhancers is proposed. The method applies configurable cloud virtual machines (cVMs) and the rank-ordering of super-enhancers (ROSE) algorithm. To identify super-enhancers a сVM-based analysis of the ChIP-seq binding patterns of the active enhancer-associated mark is employed. The use of the proposed method is described step-by-step: configuration of cVM; ChIP-seq data alignment; peak calling; ROSE algorithm; interpretation of the results on a client machine. The method was validated for the search of super-enhancers using the H3K27ac mark in the sample datasets of a cell line (human MCF-7), mouse tissue (heart), and human tissue (adrenal gland). The total analysis cycle time of raw ChIP-seq data ranges from 15 to 48 min, depending on the number of initial short reads. Depending on the data processing step and availability of multi-threading, a cVM can be scaled up to a multi-CPU configuration with large amount of RAM. An important feature of the method is that it can run on a client machine that has low-performance with virtually any OS. The proposed method allows for simultaneous and independent processing of different sample datasets on multiple clones of a single cVM.•Cloud VMs were used for rapid processing of ChIP-seq data to identify super-enhancers.•The method can use a low-performance computer with virtually any OS on it.•It can be scaled up for parallel processing of individual sample datasets on their own VMs for rapid high-throughput processing.
© 2020 The Authors. Published by Elsevier B.V.

Entities:  

Keywords:  Chromatin immunoprecipitation followed by sequencing; Epigenomics; H3K27ac; Next generation sequencing; Stitched enhancers

Year:  2020        PMID: 33665151      PMCID: PMC7897706          DOI: 10.1016/j.mex.2020.101165

Source DB:  PubMed          Journal:  MethodsX        ISSN: 2215-0161


  22 in total

1.  Chromatin stretch enhancer states drive cell-specific gene regulation and harbor human disease risk variants.

Authors:  Stephen C J Parker; Michael L Stitzel; D Leland Taylor; Jose Miguel Orozco; Michael R Erdos; Jennifer A Akiyama; Kelly Lammerts van Bueren; Peter S Chines; Narisu Narisu; Brian L Black; Axel Visel; Len A Pennacchio; Francis S Collins
Journal:  Proc Natl Acad Sci U S A       Date:  2013-10-14       Impact factor: 11.205

2.  Super-enhancers are transcriptionally more active and cell type-specific than stretch enhancers.

Authors:  Aziz Khan; Anthony Mathelier; Xuegong Zhang
Journal:  Epigenetics       Date:  2018-10-11       Impact factor: 4.528

3.  Super-enhancers delineate disease-associated regulatory nodes in T cells.

Authors:  Golnaz Vahedi; Yuka Kanno; Yasuko Furumoto; Kan Jiang; Stephen C J Parker; Michael R Erdos; Sean R Davis; Rahul Roychoudhuri; Nicholas P Restifo; Massimo Gadina; Zhonghui Tang; Yijun Ruan; Francis S Collins; Vittorio Sartorelli; John J O'Shea
Journal:  Nature       Date:  2015-02-16       Impact factor: 49.962

4.  Master transcription factors and mediator establish super-enhancers at key cell identity genes.

Authors:  Warren A Whyte; David A Orlando; Denes Hnisz; Brian J Abraham; Charles Y Lin; Michael H Kagey; Peter B Rahl; Tong Ihn Lee; Richard A Young
Journal:  Cell       Date:  2013-04-11       Impact factor: 41.582

5.  Selective inhibition of tumor oncogenes by disruption of super-enhancers.

Authors:  Jakob Lovén; Heather A Hoke; Charles Y Lin; Ashley Lau; David A Orlando; Christopher R Vakoc; James E Bradner; Tong Ihn Lee; Richard A Young
Journal:  Cell       Date:  2013-04-11       Impact factor: 41.582

6.  The Sequence Alignment/Map format and SAMtools.

Authors:  Heng Li; Bob Handsaker; Alec Wysoker; Tim Fennell; Jue Ruan; Nils Homer; Gabor Marth; Goncalo Abecasis; Richard Durbin
Journal:  Bioinformatics       Date:  2009-06-08       Impact factor: 6.937

7.  Genetic predisposition to neuroblastoma mediated by a LMO1 super-enhancer polymorphism.

Authors:  Derek A Oldridge; Andrew C Wood; Nina Weichert-Leahey; Ian Crimmins; Robyn Sussman; Cynthia Winter; Lee D McDaniel; Maura Diamond; Lori S Hart; Shizhen Zhu; Adam D Durbin; Brian J Abraham; Lars Anders; Lifeng Tian; Shile Zhang; Jun S Wei; Javed Khan; Kelli Bramlett; Nazneen Rahman; Mario Capasso; Achille Iolascon; Daniela S Gerhard; Jaime M Guidry Auvil; Richard A Young; Hakon Hakonarson; Sharon J Diskin; A Thomas Look; John M Maris
Journal:  Nature       Date:  2015-11-11       Impact factor: 49.962

Review 8.  Data Lakes, Clouds, and Commons: A Review of Platforms for Analyzing and Sharing Genomic Data.

Authors:  Robert L Grossman
Journal:  Trends Genet       Date:  2019-01-25       Impact factor: 11.639

9.  Super-enhancers in transcriptional regulation and genome organization.

Authors:  Xi Wang; Murray J Cairns; Jian Yan
Journal:  Nucleic Acids Res       Date:  2019-12-16       Impact factor: 16.971

10.  Development of immunoassays using interferometric real-time registration of their kinetics.

Authors:  A V Orlov; A G Burenin; V O Shipunova; A A Lizunova; B G Gorshkov; P I Nikitin
Journal:  Acta Naturae       Date:  2014-01       Impact factor: 1.845

View more
  3 in total

1.  Sharing data, sharing methods, sharing science.

Authors:  Sergio Pantano
Journal:  MethodsX       Date:  2021-12-14

2.  SEAseq: a portable and cloud-based chromatin occupancy analysis suite.

Authors:  Modupeore O Adetunji; Brian J Abraham
Journal:  BMC Bioinformatics       Date:  2022-02-23       Impact factor: 3.169

3.  A multimodal integrative approach to model transcriptional addiction of thyroid cancer on RUNX2.

Authors:  Emanuele Vitale; Elisabetta Sauta; Mila Gugnoni; Federica Torricelli; Veronica Manicardi; Alessia Ciarrocchi
Journal:  Cancer Commun (Lond)       Date:  2022-04-22
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.