| Literature DB >> 26112292 |
Owen J Marshall1, Andrea H Brand1.
Abstract
UNLABELLED: DamID is a powerful technique for identifying regions of the genome bound by a DNA-binding (or DNA-associated) protein. Currently, no method exists for automatically processing next-generation sequencing DamID (DamID-seq) data, and the use of DamID-seq datasets with normalization based on read-counts alone can lead to high background and the loss of bound signal. DamID-seq thus presents novel challenges in terms of normalization and background minimization. We describe here damidseq_pipeline, a software pipeline that performs automatic normalization and background reduction on multiple DamID-seq FASTQ datasets.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26112292 PMCID: PMC4595905 DOI: 10.1093/bioinformatics/btv386
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Results of the damidseq_pipeline. (A) The gene eyeless (ey) (highlighted) is expressed in D. melanogaster laval neural stem cells (Southall ) and previously published microarray DamID in these cells (i) shows RNA polymerase II occupancy (Southall ). (B) Performing DamID-seq in the same cell type illustrates the high correlation between Dam-Pol II (i) and Dam alone (ii) in terms of RPM (read counts/million mapped reads). Taking the ratio of the two RPM-normalized datasets fails to show significant RNA pol II occupancy at ey (iii); however, processing via the damidseq_pipeline software successfully recovers the RNA pol II occupancy profile while minimizing background (iv). See Supplementary Methods for experimental details