| Literature DB >> 31907039 |
Sung Mo Ryu1, Sung Wook Seo2, Sun-Ho Lee3.
Abstract
BACKGROUND: We used the Surveillance, Epidemiology, and End Results (SEER) database to develop and validate deep survival neural network machine learning (ML) algorithms to predict survival following a spino-pelvic chondrosarcoma diagnosis.Entities:
Keywords: Artificial intelligence; Chondrosarcoma; Neural network; Prediction; Survival
Year: 2020 PMID: 31907039 PMCID: PMC6945432 DOI: 10.1186/s12911-019-1008-4
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Fig. 1Enrolled Study Population and Pipeline of Data Analysis. The training data were validated using 5-fold cross validation
Fig. 2The architecture of the basic learning unit of the RED_SNN model. (a) The network architecture of the basic unit was composed of 8 layers, including two long short term memory (LSTM) layers. The input layer was comprised of 28 nodes that represented 26 input features and 2 latent survival features. The output layer was composed of 2 nodes implementing linear function, representing time and event. Since the two target nodes have different characteristics, we did not use the softmax function. The other layers were composed of fully-connected nodes implementing a rectified linear unit function. (b) The validation data (n = 169) were inputted to the pre-trained network. The number of nodes was gradually reduced across the hidden layers. The output time and event were compared to the true target values
Algorithm of risk estimate distance survival neural network
X_initial is observed variables of each patients at the first visit. Targets include time and event. Time target (T) is a continuous variable representing follow-up months
m is the time window or the interval time, which is a tunable hyper-parameter (In this study, the interval time was 10 month)
ti is the last observed time during the time interval i
Survival target (E) is a binary value representing event (alive: 0, death: 1). Ei is a binary event during the time interval i. The neural network is trained with the targets Y[Ei, ti] recurrently at each time point i using cosine distance as a loss function. (훂 is a hyperparameter representing weight of death.) For example, E1 = 0 at the first time window could be E2 = 0, 1, or censored at the second window, thus the neural network should adjust their parameter to the following targets. After their serial training, the network learned to perceive severity of the cancer patient
Patient demographics, tumor characteristics, and treatment modality
| Characteristic | ||
|---|---|---|
| Age at diagnosis, yrs | Mean ± SD | 51.70 ± 18.81 |
| Median (range) | 52.00 (8–93) | |
| Age at diagnosis stratified, n (%) | 0–29 | |
| 30–59 | ||
| > 59 | ||
| Sex, n (%) | Male | 675 (62.0) |
| Female | 413 (38.0) | |
| Race, n (%) | White | 938 (86.2) |
| Black | 79 (7.3) | |
| Other (Asian) | 59 (5.4) | |
| Unknown | 12 (1.1) | |
| Primary site involved, n (%) | Vertebral column | 218 (20) |
| Pelvic bones, sacrum | 870 (80) | |
| Histology type | Chondrosarcoma, NOS | 948 (87.1) |
| Juxtacortical chondrosarcoma | 7 (0.6) | |
| Myxoid chondrosarcoma | 60 (5.5) | |
| Mesenchymal chondrosarcoma | 23 (2.1) | |
| Clear cell chondrosarcoma | 6 (0.6) | |
| Dedifferentiated chondrosarcoma | 44 (4.0) | |
| Grade | I | 300 |
| II | 397 | |
| III | 121 | |
| IV | 63 | |
| Unknown | 207 (19.0) | |
| Stage of the tumor | Localized | 411 (37.8) |
| Regional | 450 (41.4) | |
| Distant | 147 (13.5) | |
| Unknown | 80 (7.4) | |
| Surgery performed, n (%) | Yes | 818 (75.2) |
| No | 249 (22.9) | |
| Unknown | 21 (1.9) | |
| Radiation therapy (%) | Yes | 228 (21.0) |
| No | 848 (77.9) | |
| Unknown | 12 (1.1) | |
| Chemotherapy (%) | Yes | 139 (12.8) |
| No | 949 (87.2) | |
| Cause of death (%) | Tumor related | 333 (30.6) |
| Other | 223 (20.5) | |
| Censored | 532 (48.9) |
NOS not otherwise specified
Surgical and adjunctive treatment in 1088 patients with spinal and pelvic chondrosarcoma
| Surgery (n) | Radiation (n) | Chemotherapy | Number of patients |
|---|---|---|---|
| Yes (818) | No (673) | No | 622 |
| Yes | 51 | ||
| Yes (135) | No | 106 | |
| Yes | 29 | ||
| Unknown (10) | No | 9 | |
| Yes | 1 | ||
| No (249) | No (159) | No | 128 |
| Yes | 31 | ||
| Yes (88) | No | 64 | |
| Yes | 24 | ||
| Unknown (2) | No | 2 | |
| Unknown (21) | No (16) | No | 14 |
| Yes | 2 | ||
| Yes (5) | No | 4 | |
| Yes | 1 |
Fig. 3Optimization and validation of RED_SNN model using 5-fold cross validation. a ROC curves to evaluate the prediction accuracy of the RED_SNN model. The model was serially trained to learn patient’s survival status within a 10-month time interval, until 62 months from the initial observation. ROC curves predicting survival at each time interval were evaluated with validation sets. b The mean AUC’s of the survival prediction at each time point. The average AUC of 5-fold cross validation was 0.84
Fig. 4Performance evaluation of RED_SNN using test data set. a ROC curves to evaluate the prediction with test data set. The test data were analyzed by pre-trained RED_SNN and its output (expected survival probability) was compared to the real survival of the test set at each time point. b Calibration curves of RED_SNN model to predict the survival rate of the test set. The test data were analyzed using pre-trained RED_SNN, and the test patients were equally divided into seven subgroups according to the model’s predicted survival probability at five years
Fig. 5Kaplan Meier curves of subgroups according to SEER stage vs our model expected survival probability. a SEER stage identified three prognostic subgroups in Kaplan Meier survival analysis. b RED_SNN identified five prognostic subgroups in Kaplan Meier survival analysis