| Literature DB >> 33253153 |
Azusa Tanaka1,2, Yasuhiro Ishitsuka3,4, Hiroki Ohta3,5, Akihiro Fujimoto1, Jun-Ichirou Yasunaga2,6, Masao Matsuoka2,6.
Abstract
The huge amount of data acquired by high-throughput sequencing requires data reduction for effective analysis. Here we give a clustering algorithm for genome-wide open chromatin data using a new data reduction method. This method regards the genome as a string of 1s and 0s based on a set of peaks and calculates the Hamming distances between the strings. This algorithm with the systematically optimized set of peaks enables us to quantitatively evaluate differences between samples of hematopoietic cells and classify cell types, potentially leading to a better understanding of leukemia pathogenesis.Entities:
Mesh:
Substances:
Year: 2020 PMID: 33253153 PMCID: PMC7728210 DOI: 10.1371/journal.pcbi.1008422
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.475