Literature DB >> 29028265

StereoGene: rapid estimation of genome-wide correlation of continuous or interval feature data.

Elena D Stavrovskaya1,2, Tejasvi Niranjan3, Elana J Fertig3, Sarah J Wheelan3, Alexander V Favorov3,4,5, Andrey A Mironov1,2.   

Abstract

MOTIVATION: Genomics features with similar genome-wide distributions are generally hypothesized to be functionally related, for example, colocalization of histones and transcription start sites indicate chromatin regulation of transcription factor activity. Therefore, statistical algorithms to perform spatial, genome-wide correlation among genomic features are required.
RESULTS: Here, we propose a method, StereoGene, that rapidly estimates genome-wide correlation among pairs of genomic features. These features may represent high-throughput data mapped to reference genome or sets of genomic annotations in that reference genome. StereoGene enables correlation of continuous data directly, avoiding the data binarization and subsequent data loss. Correlations are computed among neighboring genomic positions using kernel correlation. Representing the correlation as a function of the genome position, StereoGene outputs the local correlation track as part of the analysis. StereoGene also accounts for confounders such as input DNA by partial correlation. We apply our method to numerous comparisons of ChIP-Seq datasets from the Human Epigenome Atlas and FANTOM CAGE to demonstrate its wide applicability. We observe the changes in the correlation between epigenomic features across developmental trajectories of several tissue types consistent with known biology and find a novel spatial correlation of CAGE clusters with donor splice sites and with poly(A) sites. These analyses provide examples for the broad applicability of StereoGene for regulatory genomics.
AVAILABILITY AND IMPLEMENTATION: The StereoGene C ++ source code, program documentation, Galaxy integration scripts and examples are available from the project homepage http://stereogene.bioinf.fbb.msu.ru/. CONTACT: favorov@sensi.org. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com

Entities:  

Mesh:

Year:  2017        PMID: 29028265      PMCID: PMC5860031          DOI: 10.1093/bioinformatics/btx379

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  32 in total

1.  An RNA processing activity that debranches RNA lariats.

Authors:  B Ruskin; M R Green
Journal:  Science       Date:  1985-07-12       Impact factor: 47.728

Review 2.  Chromatin and epigenetic regulation of pre-mRNA processing.

Authors:  Seth J Brown; Peter Stoilov; Yi Xing
Journal:  Hum Mol Genet       Date:  2012-08-29       Impact factor: 6.150

3.  reChIP-seq reveals widespread bivalency of H3K4me3 and H3K27me3 in CD4(+) memory T cells.

Authors:  Sarah Kinkley; Johannes Helmuth; Julia K Polansky; Ilona Dunkel; Gilles Gasparoni; Sebastian Fröhler; Wei Chen; Jörn Walter; Alf Hamann; Ho-Ryun Chung
Journal:  Nat Commun       Date:  2016-08-17       Impact factor: 14.919

4.  The NIH Roadmap Epigenomics Mapping Consortium.

Authors:  Bradley E Bernstein; John A Stamatoyannopoulos; Joseph F Costello; Bing Ren; Aleksandar Milosavljevic; Alexander Meissner; Manolis Kellis; Marco A Marra; Arthur L Beaudet; Joseph R Ecker; Peggy J Farnham; Martin Hirst; Eric S Lander; Tarjei S Mikkelsen; James A Thomson
Journal:  Nat Biotechnol       Date:  2010-10       Impact factor: 54.908

Review 5.  Chromatin modifiers and remodellers: regulators of cellular differentiation.

Authors:  Taiping Chen; Sharon Y R Dent
Journal:  Nat Rev Genet       Date:  2013-12-24       Impact factor: 53.242

6.  The Genomic HyperBrowser: inferential genomics at the sequence level.

Authors:  Geir K Sandve; Sveinung Gundersen; Halfdan Rydbeck; Ingrid K Glad; Lars Holden; Marit Holden; Knut Liestøl; Trevor Clancy; Egil Ferkingstad; Morten Johansen; Vegard Nygaard; Eivind Tøstesen; Arnoldo Frigessi; Eivind Hovig
Journal:  Genome Biol       Date:  2010-12-23       Impact factor: 13.583

7.  QDMR: a quantitative method for identification of differentially methylated regions by entropy.

Authors:  Yan Zhang; Hongbo Liu; Jie Lv; Xue Xiao; Jiang Zhu; Xiaojuan Liu; Jianzhong Su; Xia Li; Qiong Wu; Fang Wang; Ying Cui
Journal:  Nucleic Acids Res       Date:  2011-02-08       Impact factor: 16.971

8.  Uncovering correlated variability in epigenomic datasets using the Karhunen-Loeve transform.

Authors:  Pedro Madrigal; Paweł Krajewski
Journal:  BioData Min       Date:  2015-07-01       Impact factor: 2.522

9.  Global quantitative modeling of chromatin factor interactions.

Authors:  Jian Zhou; Olga G Troyanskaya
Journal:  PLoS Comput Biol       Date:  2014-03-27       Impact factor: 4.475

10.  GAT: a simulation framework for testing the association of genomic intervals.

Authors:  Andreas Heger; Caleb Webber; Martin Goodson; Chris P Ponting; Gerton Lunter
Journal:  Bioinformatics       Date:  2013-06-18       Impact factor: 6.937

View more
  9 in total

1.  Analytical Approaches for ATAC-seq Data Analysis.

Authors:  Jason P Smith; Nathan C Sheffield
Journal:  Curr Protoc Hum Genet       Date:  2020-06

2.  Endogenous oxidized DNA bases and APE1 regulate the formation of G-quadruplex structures in the genome.

Authors:  Shrabasti Roychoudhury; Suravi Pramanik; Hannah L Harris; Mason Tarpley; Aniruddha Sarkar; Gaelle Spagnol; Paul L Sorgen; Dipanjan Chowdhury; Vimla Band; David Klinkebiel; Kishor K Bhakat
Journal:  Proc Natl Acad Sci U S A       Date:  2020-05-13       Impact factor: 11.205

3.  SAMMY-seq reveals early alteration of heterochromatin and deregulation of bivalent genes in Hutchinson-Gilford Progeria Syndrome.

Authors:  Endre Sebestyén; Fabrizia Marullo; Federica Lucini; Cristiano Petrini; Andrea Bianchi; Sara Valsoni; Ilaria Olivieri; Laura Antonelli; Francesco Gregoretti; Gennaro Oliva; Francesco Ferrari; Chiara Lanzuolo
Journal:  Nat Commun       Date:  2020-12-08       Impact factor: 14.919

4.  Spatial correlation statistics enable transcriptome-wide characterization of RNA structure binding.

Authors:  Veronica F Busa; Alexander V Favorov; Elana J Fertig; Anthony K L Leung
Journal:  Cell Rep Methods       Date:  2021-10-01

5.  Cogito: automated and generic comparison of annotated genomic intervals.

Authors:  Annika Bürger; Martin Dugas
Journal:  BMC Bioinformatics       Date:  2022-08-04       Impact factor: 3.307

6.  Epigenetic regulation of gene expression in cancer: techniques, resources and analysis.

Authors:  Luciane T Kagohara; Genevieve L Stein-O'Brien; Dylan Kelley; Emily Flam; Heather C Wick; Ludmila V Danilova; Hariharan Easwaran; Alexander V Favorov; Jiang Qian; Daria A Gaykalova; Elana J Fertig
Journal:  Brief Funct Genomics       Date:  2018-01-01       Impact factor: 4.241

7.  Coloc-stats: a unified web interface to perform colocalization analysis of genomic features.

Authors:  Boris Simovski; Chakravarthi Kanduri; Sveinung Gundersen; Dmytro Titov; Diana Domanska; Christoph Bock; Lara Bossini-Castillo; Maria Chikina; Alexander Favorov; Ryan M Layer; Andrey A Mironov; Aaron R Quinlan; Nathan C Sheffield; Gosia Trynka; Geir K Sandve
Journal:  Nucleic Acids Res       Date:  2018-07-02       Impact factor: 16.971

8.  Colocalization analyses of genomic elements: approaches, recommendations and challenges.

Authors:  Chakravarthi Kanduri; Christoph Bock; Sveinung Gundersen; Eivind Hovig; Geir Kjetil Sandve
Journal:  Bioinformatics       Date:  2019-05-01       Impact factor: 6.937

9.  Fragments of rDNA Genes Scattered over the Human Genome Are Targets of Small RNAs.

Authors:  Nickolai A Tchurikov; Elena S Klushevskaya; Ildar R Alembekov; Anastasiia S Bukreeva; Antonina N Kretova; Vladimir R Chechetkin; Galina I Kravatskaya; Yuri V Kravatsky
Journal:  Int J Mol Sci       Date:  2022-03-10       Impact factor: 5.923

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.