Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Development of an absolute assignment predictor for triple-negative breast cancer subtyping using machine learning approaches.

Literature DB >> 33316552

Development of an absolute assignment predictor for triple-negative breast cancer subtyping using machine learning approaches.

Fadoua Ben Azzouz¹, Bertrand Michel², Hamza Lasla¹, Wilfried Gouraud¹, Anne-Flore François³, Fabien Girka³, Théo Lecointre³, Catherine Guérin-Charbonnel¹, Philippe P Juin⁴, Mario Campone⁵, Pascal Jézéquel⁶.

Abstract

Triple-negative breast cancer (TNBC) heterogeneity represents one of the main obstacles to precision medicine for this disease. Recent concordant transcriptomics studies have shown that TNBC could be divided into at least three subtypes with potential therapeutic implications. Although a few studies have been conducted to predict TNBC subtype using transcriptomics data, the subtyping was partially sensitive and limited by batch effect and dependence on a given dataset, which may penalize the switch to routine diagnostic testing. Therefore, we sought to build an absolute predictor (i.e., intra-patient diagnosis) based on machine learning algorithms with a limited number of probes. To that end, we started by introducing probe binary comparison for each patient (indicators). We based the predictive analysis on this transformed data. Probe selection was first involved combining both filter and wrapper methods for variable selection using cross-validation. We tested three prediction models (random forest, gradient boosting [GB], and extreme gradient boosting) using this optimal subset of indicators as inputs. Nested cross-validation consistently allowed us to choose the best model. The results showed that the fifty selected indicators highlighted the biological characteristics associated with each TNBC subtype. The GB based on this subset of indicators performs better than other models.

Entities: Chemical Disease Species

Keywords: Machine learning; Prediction models; TNBC subtype; Transcriptomics data; Variable selection

Mesh：

Year: 2020 PMID： 33316552 DOI： 10.1016/j.compbiomed.2020.104171

Source DB: PubMed Journal: Comput Biol Med ISSN： 0010-4825 Impact factor: 4.589

Keyword Cloud
Cited

1 in total

1. The Challenges of Implementing Comprehensive Clinical Data Warehouses in Hospitals.

Authors: François Bocquet; Mario Campone; Marc Cuggia
Journal: Int J Environ Res Public Health Date: 2022-06-16 Impact factor: 4.614

1 in total