Eva Budinska1, Eva Gelnarova, Michael G Schimek. 1. Institute of Biostatistics and Analyses, Masaryk University, Kamenice 126/3, 625 00 Brno, Czech Republic. budinska@iba.muni.cz
Abstract
MOTIVATION: Genome analysis has become one of the most important tools for understanding the complex process of cancerogenesis. With increasing resolution of CGH arrays, the demand for computationally efficient algorithms arises, which are effective in the detection of aberrations even in very noisy data. RESULTS: We developed a rather simple, non-parametric technique of high computational efficiency for CGH array analysis that adopts a median absolute deviation concept for breakpoint detection, comprising median smoothing for pre-processing. The resulting algorithm has the potential to outperform any single smoothing approach as well as several recently proposed segmentation techniques. We show its performance through the application of simulated and real datasets in comparison to three other methods for array CGH analysis. IMPLEMENTATION: Our approach is implemented in the R-language and environment for statistical computing (version 2.6.1 for Windows, R-project, 2007). The code is available at: http://www.iba.muni.cz/~budinska/msmad.html. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
MOTIVATION: Genome analysis has become one of the most important tools for understanding the complex process of cancerogenesis. With increasing resolution of CGH arrays, the demand for computationally efficient algorithms arises, which are effective in the detection of aberrations even in very noisy data. RESULTS: We developed a rather simple, non-parametric technique of high computational efficiency for CGH array analysis that adopts a median absolute deviation concept for breakpoint detection, comprising median smoothing for pre-processing. The resulting algorithm has the potential to outperform any single smoothing approach as well as several recently proposed segmentation techniques. We show its performance through the application of simulated and real datasets in comparison to three other methods for array CGH analysis. IMPLEMENTATION: Our approach is implemented in the R-language and environment for statistical computing (version 2.6.1 for Windows, R-project, 2007). The code is available at: http://www.iba.muni.cz/~budinska/msmad.html. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Authors: Nicole Golob-Schwarzl; Kira Bettermann; Anita Kuldeep Mehta; Sonja M Kessler; Julia Unterluggauer; Stefanie Krassnig; Kensuke Kojima; Xintong Chen; Yujin Hoshida; Nabeel M Bardeesy; Heimo Müller; Vendula Svendova; Michael G Schimek; Clemens Diwoky; Alexandra Lipfert; Vineet Mahajan; Cornelia Stumptner; Andrea Thüringer; Leopold F Fröhlich; Tatjana Stojakovic; K P R Nilsson; Thomas Kolbe; Thomas Rülicke; Thomas M Magin; Pavel Strnad; Alexandra K Kiemer; Richard Moriggl; Johannes Haybaeck Journal: Transl Oncol Date: 2018-11-12 Impact factor: 4.243
Authors: Leighton Pritchard; Hui Liu; Clare Booth; Emma Douglas; Patrice François; Jacques Schrenzel; Peter E Hedley; Paul R J Birch; Ian K Toth Journal: PLoS Comput Biol Date: 2009-08-21 Impact factor: 4.475
Authors: Kira Bettermann; Anita Kuldeep Mehta; Eva M Hofer; Christina Wohlrab; Nicole Golob-Schwarzl; Vendula Svendova; Michael G Schimek; Cornelia Stumptner; Andrea Thüringer; Michael R Speicher; Carolin Lackner; Kurt Zatloukal; Helmut Denk; Johannes Haybaeck Journal: Oncotarget Date: 2016-11-08