Literature DB >> 33673506

Random Forest Modelling of High-Dimensional Mixed-Type Data for Breast Cancer Classification.

Jelmar Quist1,2,3, Lawson Taylor1,2, Johan Staaf4, Anita Grigoriadis1,2,3.   

Abstract

Advances in high-throughput technologies encourage the generation of large amounts of multiomics data to investigate complex diseases, including breast cancer. Given that the aetiologies of such diseases extend beyond a single biological entity, and that essential biological information can be carried by all data regardless of data type, integrative analyses are needed to identify clinically relevant patterns. To facilitate such analyses, we present a permutation-based framework for random forest methods which simultaneously allows the unbiased integration of mixed-type data and assessment of relative feature importance. Through simulation studies and machine learning datasets, the performance of the approach was evaluated. The results showed minimal multicollinearity and limited overfitting. To further assess the performance, the permutation-based framework was applied to high-dimensional mixed-type data from two independent breast cancer cohorts. Reproducibility and robustness of our approach was demonstrated by the concordance in relative feature importance between the cohorts, along with consistencies in clustering profiles. One of the identified clusters was shown to be prognostic for clinical outcome after standard-of-care adjuvant chemotherapy and outperformed current intrinsic molecular breast cancer classifications.

Entities:  

Keywords:  DNA damage repair; breast cancer; integrative analysis; machine learning; random forest

Year:  2021        PMID: 33673506      PMCID: PMC7956671          DOI: 10.3390/cancers13050991

Source DB:  PubMed          Journal:  Cancers (Basel)        ISSN: 2072-6694            Impact factor:   6.575


  38 in total

1.  Prediction of late distant recurrence after 5 years of endocrine treatment: a combined analysis of patients from the Austrian breast and colorectal cancer study group 8 and arimidex, tamoxifen alone or in combination randomized trials using the PAM50 risk of recurrence score.

Authors:  Ivana Sestak; Jack Cuzick; Mitch Dowsett; Elena Lopez-Knowles; Martin Filipits; Peter Dubsky; John Wayne Cowens; Sean Ferree; Carl Schaper; Christian Fesl; Michael Gnant
Journal:  J Clin Oncol       Date:  2014-10-20       Impact factor: 44.544

2.  Alpelisib for PIK3CA-Mutated, Hormone Receptor-Positive Advanced Breast Cancer.

Authors:  Fabrice André; Eva Ciruelos; Gabor Rubovszky; Mario Campone; Sibylle Loibl; Hope S Rugo; Hiroji Iwata; Pierfranco Conte; Ingrid A Mayer; Bella Kaufman; Toshinari Yamashita; Yen-Shen Lu; Kenichi Inoue; Masato Takahashi; Zsuzsanna Pápai; Anne-Sophie Longin; David Mills; Celine Wilke; Samit Hirawat; Dejan Juric
Journal:  N Engl J Med       Date:  2019-05-16       Impact factor: 91.245

Review 3.  PARP inhibitors: Synthetic lethality in the clinic.

Authors:  Christopher J Lord; Alan Ashworth
Journal:  Science       Date:  2017-03-16       Impact factor: 47.728

4.  TBCRC 031: Randomized Phase II Study of Neoadjuvant Cisplatin Versus Doxorubicin-Cyclophosphamide in Germline BRCA Carriers With HER2-Negative Breast Cancer (the INFORM trial).

Authors:  Nadine Tung; Banu Arun; Michele R Hacker; Erin Hofstatter; Deborah L Toppmeyer; Steven J Isakoff; Virginia Borges; Robert D Legare; Claudine Isaacs; Antonio C Wolff; Paul Kelly Marcom; Erica L Mayer; Paulina B Lange; Andrew J Goss; Colby Jenkins; Ian E Krop; Eric P Winer; Stuart J Schnitt; Judy E Garber
Journal:  J Clin Oncol       Date:  2020-02-25       Impact factor: 44.544

5.  Tracking evolution of aromatase inhibitor resistance with circulating tumour DNA analysis in metastatic breast cancer.

Authors:  C Fribbens; I Garcia Murillas; M Beaney; S Hrebien; B O'Leary; L Kilburn; K Howarth; M Epstein; E Green; N Rosenfeld; A Ring; S Johnston; N Turner
Journal:  Ann Oncol       Date:  2018-01-01       Impact factor: 32.976

6.  Machine Learning methods for Quantitative Radiomic Biomarkers.

Authors:  Chintan Parmar; Patrick Grossmann; Johan Bussink; Philippe Lambin; Hugo J W L Aerts
Journal:  Sci Rep       Date:  2015-08-17       Impact factor: 4.379

7.  On-treatment biomarkers can improve prediction of response to neoadjuvant chemotherapy in breast cancer.

Authors:  Richard J Bownes; Arran K Turnbull; Carlos Martinez-Perez; David A Cameron; Andrew H Sims; Olga Oikonomidou
Journal:  Breast Cancer Res       Date:  2019-06-14       Impact factor: 6.466

8.  Random forest-based modelling to detect biomarkers for prostate cancer progression.

Authors:  Reka Toth; Heiko Schiffmann; Claudia Hube-Magg; Franziska Büscheck; Doris Höflmayer; Sören Weidemann; Patrick Lebok; Christoph Fraune; Sarah Minner; Thorsten Schlomm; Guido Sauter; Christoph Plass; Yassen Assenov; Ronald Simon; Jan Meiners; Clarissa Gerhäuser
Journal:  Clin Epigenetics       Date:  2019-10-22       Impact factor: 6.551

9.  Whole-genome sequencing of triple-negative breast cancers in a population-based clinical study.

Authors:  Johan Staaf; Dominik Glodzik; Åke Borg; Serena Nik-Zainal; Ana Bosch; Johan Vallon-Christersson; Christel Reuterswärd; Jari Häkkinen; Andrea Degasperi; Tauanne Dias Amarante; Lao H Saal; Cecilia Hegardt; Hilary Stobart; Anna Ehinger; Christer Larsson; Lisa Rydén; Niklas Loman; Martin Malmberg; Anders Kvist; Hans Ehrencrona; Helen R Davies
Journal:  Nat Med       Date:  2019-09-30       Impact factor: 53.440

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.