Roman Hornung1, David Causeur2, Christoph Bernau3, Anne-Laure Boulesteix1. 1. Department of Medical Informatics, Biometry and Epidemiology, University of Munich, Munich, Germany. 2. Applied Mathematics Department, Agrocampus Ouest, Rennes, France. 3. Leibniz Supercomputing Center, Garching, Germany.
Abstract
Motivation: To date most medical tests derived by applying classification methods to high-dimensional molecular data are hardly used in clinical practice. This is partly because the prediction error resulting when applying them to external data is usually much higher than internal error as evaluated through within-study validation procedures. We suggest the use of addon normalization and addon batch effect removal techniques in this context to reduce systematic differences between external data and the original dataset with the aim to improve prediction performance. Results: We evaluate the impact of addon normalization and seven batch effect removal methods on cross-study prediction performance for several common classifiers using a large collection of microarray gene expression datasets, showing that some of these techniques reduce prediction error. Availability and Implementation: All investigated addon methods are implemented in our R package bapred. Contact: hornung@ibe.med.uni-muenchen.de. Supplementary information: Supplementary data are available at Bioinformatics online.
Motivation: To date most medical tests derived by applying classification methods to high-dimensional molecular data are hardly used in clinical practice. This is partly because the prediction error resulting when applying them to external data is usually much higher than internal error as evaluated through within-study validation procedures. We suggest the use of addon normalization and addon batch effect removal techniques in this context to reduce systematic differences between external data and the original dataset with the aim to improve prediction performance. Results: We evaluate the impact of addon normalization and seven batch effect removal methods on cross-study prediction performance for several common classifiers using a large collection of microarray gene expression datasets, showing that some of these techniques reduce prediction error. Availability and Implementation: All investigated addon methods are implemented in our R package bapred. Contact: hornung@ibe.med.uni-muenchen.de. Supplementary information: Supplementary data are available at Bioinformatics online.
Authors: Maria B Rabaglino; Alan O'Doherty; Jan Bojsen-Møller Secher; Patrick Lonergan; Poul Hyttel; Trudee Fair; Haja N Kadarmideen Journal: PLoS One Date: 2021-05-24 Impact factor: 3.240
Authors: Stefanie Warnat-Herresthal; Konstantinos Perrakis; Bernd Taschler; Matthias Becker; Kevin Baßler; Marc Beyer; Patrick Günther; Jonas Schulte-Schrepping; Lea Seep; Kathrin Klee; Thomas Ulas; Torsten Haferlach; Sach Mukherjee; Joachim L Schultze Journal: iScience Date: 2019-12-18
Authors: Adrian Schwarzer; Steven R Talbot; Anton Selich; Michael Morgan; Juliane W Schott; Oliver Dittrich-Breiholz; Antonella L Bastone; Bettina Weigel; Teng Cheong Ha; Violetta Dziadek; Rik Gijsbers; Adrian J Thrasher; Frank J T Staal; Hubert B Gaspar; Ute Modlich; Axel Schambach; Michael Rothe Journal: Mol Ther Date: 2021-06-24 Impact factor: 11.454
Authors: Daniel Samaga; Roman Hornung; Herbert Braselmann; Julia Hess; Horst Zitzelsberger; Claus Belka; Anne-Laure Boulesteix; Kristian Unger Journal: Radiat Oncol Date: 2020-05-14 Impact factor: 3.481