Literature DB >> 24953305

Classification of breast cancer subtypes by combining gene expression and DNA methylation data.

Markus List1, Anne-Christin Hauschild2, Qihua Tan3, Torben A Kruse4, Jan Mollenhauer4, Jan Baumbach5, Richa Batra5.   

Abstract

Selecting the most promising treatment strategy for breast cancer crucially depends on determining the correct subtype. In recent years, gene expression profiling has been investigated as an alternative to histochemical methods. Since databases like TCGA provide easy and unrestricted access to gene expression data for hundreds of patients, the challenge is to extract a minimal optimal set of genes with good prognostic properties from a large bulk of genes making a moderate contribution to classification. Several studies have successfully applied machine learning algorithms to solve this so-called gene selection problem. However, more diverse data from other OMICS technologies are available, including methylation. We hypothesize that combining methylation and gene expression data could already lead to a largely improved classification model, since the resulting model will reflect differences not only on the transcriptomic, but also on an epigenetic level. We compared so-called random forest derived classification models based on gene expression and methylation data alone, to a model based on the combined features and to a model based on the gold standard PAM50. We obtained bootstrap errors of 10-20% and classification error of 1-50%, depending on breast cancer subtype and model. The gene expression model was clearly superior to the methylation model, which was also reflected in the combined model, which mainly selected features from gene expression data. However, the methylation model was able to identify unique features not considered as relevant by the gene expression model, which might provide deeper insights into breast cancer subtype differentiation on an epigenetic level.

Entities:  

Mesh:

Year:  2014        PMID: 24953305     DOI: 10.2390/biecoll-jib-2014-236

Source DB:  PubMed          Journal:  J Integr Bioinform        ISSN: 1613-4516


  15 in total

1.  A comparative study of multi-omics integration tools for cancer driver gene identification and tumour subtyping.

Authors:  Anita Sathyanarayanan; Rohit Gupta; Erik W Thompson; Dale R Nyholt; Denis C Bauer; Shivashankar H Nagaraj
Journal:  Brief Bioinform       Date:  2020-12-01       Impact factor: 11.622

2.  Analysis of the interplay between methylation and expression reveals its potential role in cancer aetiology.

Authors:  Bugra Ozer; Ugur Sezerman
Journal:  Funct Integr Genomics       Date:  2016-11-07       Impact factor: 3.410

3.  Molecular Subtyping of Cancer Based on Distinguishing Co-Expression Modules and Machine Learning.

Authors:  Peishuo Sun; Ying Wu; Chaoyi Yin; Hongyang Jiang; Ying Xu; Huiyan Sun
Journal:  Front Genet       Date:  2022-05-02       Impact factor: 4.772

4.  Circulating cell-free DNA in breast cancer: size profiling, levels, and methylation patterns lead to prognostic and predictive classifiers.

Authors:  Maria Panagopoulou; Makrina Karaglani; Ioanna Balgkouranidou; Eirini Biziota; Triantafillia Koukaki; Evaggelos Karamitrousis; Evangelia Nena; Ioannis Tsamardinos; George Kolios; Evi Lianidou; Stylianos Kakolyris; Ekaterini Chatzaki
Journal:  Oncogene       Date:  2019-01-14       Impact factor: 9.867

Review 5.  Machine Learning in Epigenomics: Insights into Cancer Biology and Medicine.

Authors:  Emre Arslan; Jonathan Schulz; Kunal Rai
Journal:  Biochim Biophys Acta Rev Cancer       Date:  2021-07-07       Impact factor: 10.680

6.  Classification of breast cancer patients using somatic mutation profiles and machine learning approaches.

Authors:  Suleyman Vural; Xiaosheng Wang; Chittibabu Guda
Journal:  BMC Syst Biol       Date:  2016-08-26

7.  NCG 5.0: updates of a manually curated repository of cancer genes and associated properties from cancer mutational screenings.

Authors:  Omer An; Giovanni M Dall'Olio; Thanos P Mourikis; Francesca D Ciccarelli
Journal:  Nucleic Acids Res       Date:  2015-10-29       Impact factor: 16.971

8.  De novo pathway-based biomarker identification.

Authors:  Nicolas Alcaraz; Markus List; Richa Batra; Fabio Vandin; Henrik J Ditzel; Jan Baumbach
Journal:  Nucleic Acids Res       Date:  2017-09-19       Impact factor: 16.971

9.  Machine learning analysis of TCGA cancer data.

Authors:  Jose Liñares-Blanco; Alejandro Pazos; Carlos Fernandez-Lozano
Journal:  PeerJ Comput Sci       Date:  2021-07-12

10.  A novel analysis strategy for integrating methylation and expression data reveals core pathways for thyroid cancer aetiology.

Authors:  Bugra Ozer; Osman Uğur Sezerman
Journal:  BMC Genomics       Date:  2015-12-09       Impact factor: 3.969

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.