| Literature DB >> 27993776 |
Pedro Madrigal1,2.
Abstract
Summary: Computational evaluation of variability across DNA or RNA sequencing datasets is a crucial step in genomic science, as it allows both to evaluate reproducibility of biological or technical replicates, and to compare different datasets to identify their potential correlations. Here we present fCCAC, an application of functional canonical correlation analysis to assess covariance of nucleic acid sequencing datasets such as chromatin immunoprecipitation followed by deep sequencing (ChIP-seq). We show how this method differs from other measures of correlation, and exemplify how it can reveal shared covariance between histone modifications and DNA binding proteins, such as the relationship between the H3K4me3 chromatin mark and its epigenetic writers and readers. Availability and Implementation: An R/Bioconductor package is available at http://bioconductor.org/packages/fCCAC/ . Contact: pmb59@cam.ac.uk. Supplementary information: Supplementary data are available at Bioinformatics online.Entities:
Mesh:
Substances:
Year: 2017 PMID: 27993776 PMCID: PMC5408813 DOI: 10.1093/bioinformatics/btw724
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.(A) Squared canonical correlations for H3K4me3 (Rep1) and 59 protein–DNA binding datasets (DPY30 and 58 ENCODE TFs). (B) First 5 and last 2 ranked interactions according to their percentage over maximum F. The dashed line indicates perfect covariance (Color version of this figure is available at Bioinformatics online.)