Andrew E Teschendorff1, Martin Widschwendter. 1. Statistical Genomics Group, Paul O'Gorman Building, UCL Cancer Institute, University College London, 72 Huntley Street, London, UK. a.teschendorff@ucl.ac.uk
Abstract
MOTIVATION: The standard paradigm in omic disciplines has been to identify biologically relevant biomarkers using statistics that reflect differences in mean levels of a molecular quantity such as mRNA expression or DNA methylation. Recently, however, it has been proposed that differential epigenetic variability may mark genes that contribute to the risk of complex genetic diseases like cancer and that identification of risk and early detection markers may therefore benefit from statistics based on differential variability. RESULTS: Using four genome-wide DNA methylation datasets totalling 311 epithelial samples and encompassing all stages of cervical carcinogenesis, we here formally demonstrate that differential variability, as a criterion for selecting DNA methylation features, can identify cancer risk markers more reliably than statistics based on differences in mean methylation. We show that differential variability selects features with heterogeneous outlier methylation profiles and that these play a key role in the early stages of carcinogenesis. Moreover, differentially variable features identified in precursor non-invasive lesions exhibit significantly increased enrichment for developmental genes compared with differentially methylated sites. Conversely, differential variability does not add predictive value in cancer studies profiling invasive tumours or whole-blood tissue. Finally, we incorporate the differential variability feature selection step into a novel adaptive index prediction algorithm called EVORA (epigenetic variable outliers for risk prediction analysis), and demonstrate that EVORA compares favourably to powerful prediction algorithms based on differential methylation statistics. CONCLUSIONS: Statistics based on differential variability improve the detection of cancer risk markers in the context of DNA methylation studies profiling epithelial preinvasive neoplasias. We present a novel algorithm (EVORA) which could be used for prediction and diagnosis of precursor epithelial cancer lesions. AVAILABILITY: R-scripts implementing EVORA are available from CRAN (www.r-project.org).
MOTIVATION: The standard paradigm in omic disciplines has been to identify biologically relevant biomarkers using statistics that reflect differences in mean levels of a molecular quantity such as mRNA expression or DNA methylation. Recently, however, it has been proposed that differential epigenetic variability may mark genes that contribute to the risk of complex genetic diseases like cancer and that identification of risk and early detection markers may therefore benefit from statistics based on differential variability. RESULTS: Using four genome-wide DNA methylation datasets totalling 311 epithelial samples and encompassing all stages of cervical carcinogenesis, we here formally demonstrate that differential variability, as a criterion for selecting DNA methylation features, can identify cancer risk markers more reliably than statistics based on differences in mean methylation. We show that differential variability selects features with heterogeneous outlier methylation profiles and that these play a key role in the early stages of carcinogenesis. Moreover, differentially variable features identified in precursor non-invasive lesions exhibit significantly increased enrichment for developmental genes compared with differentially methylated sites. Conversely, differential variability does not add predictive value in cancer studies profiling invasive tumours or whole-blood tissue. Finally, we incorporate the differential variability feature selection step into a novel adaptive index prediction algorithm called EVORA (epigenetic variable outliers for risk prediction analysis), and demonstrate that EVORA compares favourably to powerful prediction algorithms based on differential methylation statistics. CONCLUSIONS: Statistics based on differential variability improve the detection of cancer risk markers in the context of DNA methylation studies profiling epithelial preinvasive neoplasias. We present a novel algorithm (EVORA) which could be used for prediction and diagnosis of precursor epithelial cancer lesions. AVAILABILITY: R-scripts implementing EVORA are available from CRAN (www.r-project.org).
Authors: Karin B Michels; Alexandra M Binder; Sarah Dedeurwaerder; Charles B Epstein; John M Greally; Ivo Gut; E Andres Houseman; Benedetta Izzi; Karl T Kelsey; Alexander Meissner; Aleksandar Milosavljevic; Kimberly D Siegmund; Christoph Bock; Rafael A Irizarry Journal: Nat Methods Date: 2013-10 Impact factor: 28.547
Authors: Martin Widschwendter; Allison Jones; Iona Evans; Daniel Reisel; Joakim Dillner; Karin Sundström; Ewout W Steyerberg; Yvonne Vergouwe; Odette Wegwarth; Felix G Rebitschek; Uwe Siebert; Gaby Sroczynski; Inez D de Beaufort; Ineke Bolt; David Cibula; Michal Zikan; Line Bjørge; Nicoletta Colombo; Nadia Harbeck; Frank Dudbridge; Anne-Marie Tasse; Bartha M Knoppers; Yann Joly; Andrew E Teschendorff; Nora Pashayan Journal: Nat Rev Clin Oncol Date: 2018-02-27 Impact factor: 66.675