Mingzhou Song1,2, Hua Zhong1. 1. Department of Computer Science. 2. Molecular Biology Graduate Program, New Mexico State University, Las Cruces, NM 88003, USA.
Abstract
MOTIVATION: Chromosomal patterning of gene expression in cancer can arise from aneuploidy, genome disorganization or abnormal DNA methylation. To map such patterns, we introduce a weighted univariate clustering algorithm to guarantee linear runtime, optimality and reproducibility. RESULTS: We present the chromosome clustering method, establish its optimality and runtime and evaluate its performance. It uses dynamic programming enhanced with an algorithm to reduce search-space in-place to decrease runtime overhead. Using the method, we delineated outstanding genomic zones in 17 human cancer types. We identified strong continuity in dysregulation polarity-dominance by either up- or downregulated genes in a zone-along chromosomes in all cancer types. Significantly polarized dysregulation zones specific to cancer types are found, offering potential diagnostic biomarkers. Unreported previously, a total of 109 loci with conserved dysregulation polarity across cancer types give insights into pan-cancer mechanisms. Efficient chromosomal clustering opens a window to characterize molecular patterns in cancer genome and beyond. AVAILABILITY AND IMPLEMENTATION: Weighted univariate clustering algorithms are implemented within the R package 'Ckmeans.1d.dp' (4.0.0 or above), freely available at https://cran.r-project.org/package=Ckmeans.1d.dp. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
MOTIVATION: Chromosomal patterning of gene expression in cancer can arise from aneuploidy, genome disorganization or abnormal DNA methylation. To map such patterns, we introduce a weighted univariate clustering algorithm to guarantee linear runtime, optimality and reproducibility. RESULTS: We present the chromosome clustering method, establish its optimality and runtime and evaluate its performance. It uses dynamic programming enhanced with an algorithm to reduce search-space in-place to decrease runtime overhead. Using the method, we delineated outstanding genomic zones in 17 humancancer types. We identified strong continuity in dysregulation polarity-dominance by either up- or downregulated genes in a zone-along chromosomes in all cancer types. Significantly polarized dysregulation zones specific to cancer types are found, offering potential diagnostic biomarkers. Unreported previously, a total of 109 loci with conserved dysregulation polarity across cancer types give insights into pan-cancer mechanisms. Efficient chromosomal clustering opens a window to characterize molecular patterns in cancer genome and beyond. AVAILABILITY AND IMPLEMENTATION: Weighted univariate clustering algorithms are implemented within the R package 'Ckmeans.1d.dp' (4.0.0 or above), freely available at https://cran.r-project.org/package=Ckmeans.1d.dp. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Authors: Wibke Schwarzer; Nezar Abdennur; Anton Goloborodko; Aleksandra Pekowska; Geoffrey Fudenberg; Yann Loe-Mie; Nuno A Fonseca; Wolfgang Huber; Christian H Haering; Leonid Mirny; Francois Spitz Journal: Nature Date: 2017-09-27 Impact factor: 49.962
Authors: William A Flavahan; Yotam Drier; Brian B Liau; Shawn M Gillespie; Andrew S Venteicher; Anat O Stemmer-Rachamimov; Mario L Suvà; Bradley E Bernstein Journal: Nature Date: 2015-12-23 Impact factor: 49.962
Authors: Joseph S Levy; Caleb I Fassett; John W Holt; Reid Parsons; Will Cipolli; Timothy A Goudge; Michelle Tebolt; Lily Kuentz; Jessica Johnson; Fairuz Ishraque; Bronson Cvijanovich; Ian Armstrong Journal: Proc Natl Acad Sci U S A Date: 2021-01-26 Impact factor: 12.779