| Literature DB >> 35083135 |
Adrián Mosquera Orgueira1,2,3, Miguel Cid López1,2,3, Andrés Peleteiro Raíndo1,2,3, Aitor Abuín Blanco1,2, Jose Ángel Díaz Arias1,2, Marta Sonia González Pérez1,2, Beatriz Antelo Rodríguez1,2,3, Laura Bao Pérez1,2,3, Roi Ferreiro Ferro1,2, Carlos Aliste Santos1, Manuel Mateo Pérez Encinas1,2,3, Máximo Francisco Fraga Rodríguez1,2,3, Claudio Cerchione4, Pablo Mozas5, José Luis Bello López1,2,3.
Abstract
Follicular Lymphoma (FL) has a 10-year mortality rate of 20%, and this is mostly related to lymphoma progression and transformation to higher grades. In the era of personalized medicine it has become increasingly important to provide patients with an optimal prediction about their expected outcomes. The objective of this work was to apply machine learning (ML) tools on gene expression data in order to create individualized predictions about survival in patients with FL. Using data from two different studies, we were able to create a model which achieved good prediction accuracies in both cohorts (c-indexes of 0.793 and 0.662 in the training and test sets). Integration of this model with m7-FLIPI and age rendered high prediction accuracies in the test set (cox c-index 0.79), and a simplified approach identified 4 groups with remarkably different outcomes in terms of survival. Importantly, one of the groups comprised 27.35% of patients and had a median survival of 4.64 years. In summary, we have created a gene expression-based individualized predictor of overall survival in FL that can improve the predictions of the m7-FLIPI score.Entities:
Keywords: follicular; gene expression; lymphoma; machine learning; survival
Year: 2022 PMID: 35083135 PMCID: PMC8784530 DOI: 10.3389/fonc.2021.705010
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 6.244
Baseline characteristics of the two cohorts. Adopted from Pastore et al. (9) and Dave et al. (13).
| GSE16131 Training set | GSE66166 Test set | |
|---|---|---|
|
| 35.5% | 55.0% |
|
| 41.9% | 55.0% |
|
| 9.4% | 15.0% |
|
| 22.9% | 21.0% |
|
| 0.0% | 0.0% |
|
| 6.6 years | 6.7 years |
Figure 1Predicted individual survival curves according to the most accurate random forest model (see text). (A) Out-of-bag survival curves predicted for patients within the training cohort (discontinuous black lines). The thick red line represents overall ensemble survival and the thick green line indicates the Nelson-Aalen estimator. (B) Individual survival curves predicted for patients within the test cohort (discontinuous black lines). The thick red line represents overall ensemble survival. Time scale is in years. (C) Representation of out-of-bag CRPS over time. Red line is the overall CRPS. Additionally, stratified CRPS by quarters of predicted ensemble mortality are provided. Vertical lines above the x axis represent death events.
C-indexes of the predictions created by IAC-FL at different time points in the training and test sets.
| Training cohort | Test Cohort | |
|---|---|---|
|
| 73,57 | 62,62 |
|
| 74,49 | 63,1 |
|
| 75,63 | 64,65 |
|
| 76,34 | 65,13 |
|
| 77,15 | 65,35 |
|
| 77,64 | 65,76 |
|
| 78,15 | 65,87 |
|
| 78,42 | 66,27 |
|
| 78,6 | 66,31 |
|
| 78,82 | 66,24 |
|
| 79,16 | 66,35 |
|
| 79,3 | 66,42 |
|
| 79,57 | 66,42 |
|
| 79,71 | 66,05 |
|
| 79,71 | 66,05 |
|
| 79,35 | 66,16 |
Figure 2Simplified representation of IAC-FL predictions. (A, B) FL patient outcome according to the predicted 5-year survival medains created by IAC-FL in both the training and test sets. (C, D) Survival outcomes according to the predicted 5-year survival terciles created by IAC-FL in the training and test sets, respectively.
Figure 3Survival outcomes of patients in the test set according to m7-FLIPI and IAC-FL predictions at maximum follow-up. Patients are stratified according to the medians of both variables.