| Literature DB >> 32429832 |
Hersh D Ravkin1, Ofer Givton1, David B Geffen2, Eitan Rubin3.
Abstract
BACKGROUND: Compared to the many uses of DNA-level testing in clinical oncology, development of RNA-based diagnostics has been more limited. An exception to this trend is the growing use of mRNA-based methods in early-stage breast cancer. Although DNA and mRNA are used together in breast cancer research, the distinct contribution of mRNA beyond that of DNA in clinical challenges has not yet been directly assessed. We hypothesize that mRNA harbors prognostically useful information independently of genomic variation. To validate this, we use both genomic mutations and gene expression to predict five-year breast cancer recurrence in an integrated test model. This is accomplished first by comparing the feature importance of DNA and mRNA features in a model trained on both, and second, by evaluating the difference in performance of models trained on DNA and mRNA data separately.Entities:
Keywords: Breast cancer recurrence; Data science; Gene expression; Genomics; Machine learning; Machine learning explainability; Oncology
Year: 2020 PMID: 32429832 PMCID: PMC7236449 DOI: 10.1186/s12859-020-3512-z
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1ROC Plot showing the performance of the different models, each trained on different subsets of the full data
Fig. 2SHAP plot of the 10 most prognostic features. This graph shows SHAP values averaged across all the folds when the model was trained on mRNA and mutation data together. Each point on the graph represents a sample from the validation data. The color of each point represents the actual value of that feature. Greater absolute value on the x-axis indicates higher impact on prediction