Literature DB >> 28871984

Developing and Validating a Survival Prediction Model for NSCLC Patients Through Distributed Learning Across 3 Countries.

Arthur Jochems¹, Timo M Deist², Issam El Naqa³, Marc Kessler³, Chuck Mayo³, Jackson Reeves³, Shruti Jolly³, Martha Matuszak³, Randall Ten Haken³, Johan van Soest², Cary Oberije², Corinne Faivre-Finn⁴, Gareth Price⁴, Dirk de Ruysscher², Philippe Lambin², Andre Dekker².

Abstract

PURPOSE: Tools for survival prediction for non-small cell lung cancer (NSCLC) patients treated with chemoradiation or radiation therapy are of limited quality. In this work, we developed a predictive model of survival at 2 years. The model is based on a large volume of historical patient data and serves as a proof of concept to demonstrate the distributed learning approach. METHODS AND MATERIALS: Clinical data from 698 lung cancer patients, treated with curative intent with chemoradiation or radiation therapy alone, were collected and stored at 2 different cancer institutes (559 patients at Maastro clinic (Netherlands) and 139 at Michigan university [United States]). The model was further validated on 196 patients originating from The Christie (United Kingdon). A Bayesian network model was adapted for distributed learning (the animation can be viewed at https://www.youtube.com/watch?v=ZDJFOxpwqEA). Two-year posttreatment survival was chosen as the endpoint. The Maastro clinic cohort data are publicly available at https://www.cancerdata.org/publication/developing-and-validating-survival-prediction-model-nsclc-patients-through-distributed, and the developed models can be found at www.predictcancer.org.
RESULTS: Variables included in the final model were T and N category, age, performance status, and total tumor dose. The model has an area under the curve (AUC) of 0.66 on the external validation set and an AUC of 0.62 on a 5-fold cross validation. A model based on the T and N category performed with an AUC of 0.47 on the validation set, significantly worse than our model (P<.001). Learning the model in a centralized or distributed fashion yields a minor difference on the probabilities of the conditional probability tables (0.6%); the discriminative performance of the models on the validation set is similar (P=.26).
CONCLUSIONS: Distributed learning from federated databases allows learning of predictive models on data originating from multiple institutions while avoiding many of the data-sharing barriers. We believe that distributed learning is the future of sharing data in health care.

Entities: Chemical Disease Gene Species

Mesh：

Year: 2017 PMID： 28871984 PMCID： PMC5575360 DOI： 10.1016/j.ijrobp.2017.04.021

Source DB: PubMed Journal: Int J Radiat Oncol Biol Phys ISSN： 0360-3016 Impact factor: 7.038

Tools for survival prediction for non-small cell lung cancer patients treated with chemoradiation or radiation therapy are of limited quality. The gold standard (TNM staging) was originally developed for patients undergoing surgery. We developed a predictive model for survival in non-small cell lung cancer patients following chemoradiation or radiation therapy. The model was trained on a large volume of patients from multiple institutes by use of the distributed learning approach. The model outperforms the gold standard (TNM staging).

Introduction

Learning from large volumes of patient data can greatly increase our capacity to generate and test hypotheses about health care (1). To capture and use the knowledge contained in large volumes of patient data, predictive models are essential for clinical decision making 1, 2, 3, 4, 5, 6, 7, 8. Predictive models can be trained on large volumes of data—from patients who have been treated in the past—to make predictions about survival, disease control, and side effects of treatment for a patient who has yet to be treated 9, 10. Radiation therapy (RT), alone or in combination with chemotherapy, is a common choice of treatment in non-small cell lung cancer (NSCLC) patients whose tumors are inoperable because of metastases to mediastinal lymph node stations and/or because of patients' physical condition (11). The TNM staging system for survival risk stratification of this group of patients is, however, inaccurate (12). The TNM classification has initially been made to look at operability rather than prognosis after chemoradiation therapy (CRT). Other prognostic factors have been identified, such as performance status 13, 14, 15, weight loss 13, 14, presence of comorbidity (15), chemotherapy use in combination with RT 13, 16, radiation dose 13, 17, tumor size 12, 18, 19, 20, 21, and image features, the so-called radiomics approach 22, 23, 24, 25, 26. For other factors such as age and sex, the results have been inconclusive (14). Currently, the TNM staging system is the gold standard for risk stratification. Studies have indicated the TNM system performs poorly for patients receiving CRT or RT, creating an increasing need for more reliable prediction models 27, 28. In this study, we conducted a level III and IV modeling effort on historical data, as a proof of concept, conforming with the TRIPOD (Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis) statement, in which validation is performed at an independent site that has not seen the data (29). Our hypothesis is 2-fold: Learning from different centers without moving the data is possible, and by learning from a large volume of patient data, we can develop a better model for 2-year survival prediction that is more robust than the existing prognostic tools.

Methods and Materials

Data

Clinical data from 698 lung cancer patients, treated with curative intent with CRT or RT alone, were collected and stored at 2 different medical institutes (559 patients at Maastro clinic [Netherlands] and 139 at Michigan University [United States]). The model was validated on 196 patients originating from The Christie (United Kingdon). None of the patients received stereotactic body RT, and all patients had inoperable stage I through IIIB NSCLC. Patients were treated for their primary lung tumor and did not receive a diagnosis of another tumor in the 5 years before treatment. The patient details are shown in Table 1. Two-year survival, taken from the start of RT, was used as the outcome of this study. The Maastro clinic cohort data are publicly available at https://www.cancerdata.org/publication/developing-and-validating-survival-prediction-model-nsclc-patients-through-distributed.

Table 1

Overview of patient characteristics per hospital

	Maastro clinic (n=559)	Michigan University (n=139)	The Christie (n=196)
Age
Mean, y	68	66	66
SD, y	10	10	10
Missing, n	0 (0%)	0 (0%)	2 (1%)
Sex, n
Male	370 (62%)	107 (77%)	89 (45%)
Female	189 (32%)	32 (23%)	117 (60%)
Missing	0 (0%)	0 (0%)	1 (1%)
ECOG performance status, n
0	102 (17%)	16 (12%)	40 (20%)
1	301 (50%)	100 (72%)	103 (53%)
2	11 (2%)	21 (15%)	45 (23%)
3	21 (4%)	1 (1%)	4 (2%)
4	4 (1%)	0 (0%)	0 (0%)
Missing	120 (20%)	1 (1%)	4 (2%)
T stage, n
0	83 (14%)	25 (18%)	1 (1%)
1	154 (26%)	33 (24%)	13 (7%)
2	89 (15%)	40 (29%)	54 (28%)
3	198 (33%)	40 (29%)	51 (26%)
4	0 (0%)	0 (0%)	70 (36%)
Missing	35 (6%)	1 (1%)	7 (4%)
N stage, n
0	150 (25%)	33 (24%)	51 (26%)
1	32 (5%)	17 (12%)	14 (7%)
2	214 (36%)	58 (42%)	86 (44%)
3	136 (23%)	31 (22%)	40 (20%)
Missing	27 (5%)	0 (0%)	5 (3%)
M stage, n
0	505 (84%)	139 (100%)	191 (97%)
1	0 (0%)	0 (0%)	0 (0%)
Missing	54 (9%)	0 (0%)	5 (3%)
Chemotherapy timing, n
No chemotherapy	119 (20%)	25 (18%)	73 (37%)
Sequential	53 (9%)	0 (0%)	62 (32%)
Concurrent	279 (47%)	114 (82%)	60 (31%)
Missing	108 (18%)	0 (0%)	0 (0%)
Stage group, n
IA	44 (8%)	9 (6%)	7 (4%)
IB	26 (5%)	8 (6%)	12 (6%)
IIA	18 (3%)	0 (0%)	0 (0%)
IIB	53 (9%)	16 (11%)	14 (7%)
IIIA	350 (63%)	105 (76%)	153 (78%)
IIIB	0 (0%)	0 (0%)	0 (0%)
Missing	68 (12%)	1 (1%)	10 (5%)
2-y survival, n
No	339 (57%)	72 (52%)	151 (77%)
Yes	220 (37%)	67 (48%)	45 (23%)
Missing	0 (0%)	0 (0%)	0 (0%)

Abbreviation: ECOG = Eastern Cooperative Oncology Group.

Overview of patient characteristics per hospital Abbreviation: ECOG = Eastern Cooperative Oncology Group.

Maastro clinic cohort

The institute 1 data were collected under an institutional review board (IRB)–approved and registered clinical trial (NCT01949259). Patients were treated between 2007 and 2014. Three different protocol types were administered to the patients in this study: One hundred eighty-nine NSCLC patients were treated according to the new protocol for sequential CRT, which was introduced in August 2005 30, 31, 32. The individualized radiation dose ranged from 54.0 to 79.2 Gy, delivered in fractions of 1.8 Gy, twice daily, until the mean lung dose or maximum dose to the spinal cord was reached. The minimum interval between the fractions was 8 hours. Two hundred eighty-three NSCLC patients received concurrent CRT. A radiation dose of 45 Gy was delivered in fractions of 1.5 Gy, twice daily. The minimum interval between the fractions was 8 to 10 hours. This treatment was followed by an individualized dose ranging from 8 to 24 Gy, delivered in fractions of 2.0 Gy, once daily, until the normal tissue dose constraints were reached. Cisplatin and etoposide were given concurrently on days 2, 9, 23, and 30. Thirty-six NSCLC patients received accelerated high-dose conformal RT: 66 Gy in 24 fractions (2.75 Gy/fraction). Some of these patients received chemotherapy. The concurrent chemotherapy used consisted of intravenous administration of 80 mg/m2 of cisplatin and 100 mg/m2 of etoposide on days 1 through 3. The first cycle was administered before RT, and cycles 2 and 3 were given during RT. In total, 3 cycles of chemotherapy were given (33). For patients receiving sequential CRT, 3 courses of etoposide (100 mg/m2 on days 1 and 8) and cisplatin (75 mg/m2 on day 2) were given (34). The remaining 51 patients received a treatment regimen tailored specifically to the patients.

Michigan University cohort

The Michigan University data were collected from prospective protocols under IRB approval (UMCC 2006.040 and UMCC 2007.123). All patients were treated with curative intent between May 2007 and July 2014. The first study treated patients to standard doses (60-66 Gy) with once-daily fractions of 2 Gy. The second study was a dose-escalation study intensifying doses to persistent positron emission tomography-avid target volumes during treatment with 2.1 to 2.85 Gy/fraction up to a total dose of 85.5 Gy in 30 fractions.

The Christie cohort

One external validation set was used in this study. The Christie cohort consisted of 196 anonymized lung cancer patients with stage I through IIIB NSCLC. The study was conducted under IRB approval. All patients were treated with curative intent between December 2008 and May 2013. Two different protocols were used for treating patients in this dataset. One hundred twenty-one NSCLC patients received 55 Gy in 20 daily fractions (2.75 Gy/fraction), either without chemotherapy or with sequential chemotherapy. Seventy-three NSCLC patients received 60 to 66 Gy in 30 to 33 daily fractions (2 Gy/fraction) with concurrent chemotherapy. The remaining 2 patients received a treatment regimen tailored specifically to the patients.

Bayesian network

A Bayesian network model was developed to predict survival at 2 years after RT start. The model used T category, N category, age, total tumor dose (defined as the prescribed dose to a reference point according to the International Commission on Radiation Units & Measurements), and World Health Organization performance status to make predictions. (The target dose definition for patients who received a boost to [positron emission tomography-avid] tumor subvolumes can be found in the publication of van Elmpt et al (35).) The network structure of this model was prespecified by experts and is a balance between the desire for causal links and the desire for a limit on the number of incoming links to the survival endpoint. It is shown in Figure 1. A further elaboration on the method used to determine the network structure is presented in Appendix E1 (available online at www.redjournal.org).

Fig. 1

Bayesian network structures. The blue nodes represent outcome. (A) Network structure based on expert opinion. (B) Network structure built using algorithmic approach. (C) Network structure adapted from Jayasurya et al (9). Abbreviations: Chemo = chemotherapy; GTV = gross tumor volume. The model's performance is expressed as the area under the curve (AUC) of the receiver operating characteristic (ROC) curve. The maximum value of the AUC is 1.0, indicating a perfect prediction model. A value of 0.5 indicates that patients are only correctly classified in 50% of the cases, that is, as good as chance. A Bayesian network is a probabilistic graphical model that represents a set of variables and their dependencies in a directed acyclic graph (DAG). Within the DAG, variables are depicted as nodes and statistical dependencies are represented as directed edges. We have determined the DAG dependencies based on expert knowledge. The conditional probability tables (CPTs) associated with each variable have been computed by use of a maximum likelihood technique based on the expectation maximization algorithm (36). All continuous variables underwent discretization into either 2 or 3 bins by use of a method described by Kuschner et al (37). The developed models can be found at www.predictcancer.org.

Distributed learning

Distributed learning is defined by learning from data from multiple hospitals without the data leaving the hospitals. To realize distributed learning for this study, we have used a formerly adapted method 38, 39.

Statistical analysis

The Bayesian network model was programmed in Java (Oracle, Redwood Shores, California) by use of the JSMILE (Structural Modeling, Inference, and Learning Engine) framework developed by the Dynamic Systems Laboratory of Pittsburg University (40) and made freely available for academic purposes by BayesFusion (Pittsburgh, PA; http://www.bayesfusion.com/). Analysis of ROC curves was performed in R (version 3.0.0; R Foundation for Statistical Computing, Vienna, Austria) by use of the pROC package (41). Comparison of ROC curves and computation of confidence intervals (CIs) of AUC values were performed with the method described by DeLong et al (42). Kaplan-Meier curves were made in R using the survival package (43). The log-rank test with a ρ value of 0 was used to evaluate differences in Kaplan-Meier curves (44). CI estimation of Kaplan-Meier curves was performed with the method described by Dorey and Korn (45). Tests for significant differences between groups were performed with the Wilcoxon rank sum test 46, 47.

Comparison with previous models

Imputation of missing variables was done by taking the mean for the corresponding variable in the training set. This mean was used for imputation in the training data and validation sets. As Bayesian networks rely on Bayesian inference to do imputation, imputation by taking the mean was omitted for validating and training these models. The number of positive lymph node stations (PLNSs) was used as a variable in the older models, which is unavailable in our current dataset. Nodal stage was used to impute the number of PLNSs. In subsequent analyses of the data, we partitioned the validation set into a young patients' cohort and old patients' cohort. We set the cutoff point at 67 years, as this most closely partitioned the data into 2 sets of equal size. To investigate the performance of the TNM staging variables alone, a logistic regression was performed with these variables. A Bayesian network model previously developed by Jayasurya et al (9) was compared with the model in this study. PLNS was imputed by nodal stage. The nodal stage node was removed from the structure because it is obsolete as a result of the imputation of PLNS. The resulting network structure is shown in Figure 1C.

Results

The Bayesian network model was learned in a distributed setting on the Maastro clinic and Michigan University cohorts. The CPTs of the learned model are shown in Tables E1-E6 (available online at www.redjournal.org). The AUC of the model was 0.62 (95% CI, 0.57-0.66) on a 5-fold cross validation. To investigate the effectivity of the distributed learning methodology used, we compared the CPTs of a model learned on the training data of institutes 1 and 2 in a centralized manner versus the CPTs of the model learned on the same datasets in a distributed manner. The average difference in probability per CPT was 0.6%. The results of using both models on the validation set of The Christie are shown in Figure 2. The discriminative performance of both models on the validation set was similar (P=.26).

Fig. 2

Receiver operating characteristic curves for the Bayesian network model learned using centralized learning and distributed learning. Validation was done on the institute 3 cohort. Abbreviation: AUC = area under curve. Splitting the The Christie cohort into 2 subgroups, according to the mean of the predicted survival probability by the model for all patients in the training cohort, resulted in the identification of a group with a high chance of survival and a group with a low chance of survival. The 2-year survival rate was 33% (95% CI, 25%-42%) for the high–survival chance group and 0% for the low–survival chance group (Fig. 3). A log-rank test on these curves indicated that they were significantly different (P<.01).

Fig. 3

Kaplan-Meier curves for risk group stratification for model. A log-rank test indicated that these curves were significantly different (P<.01).

Kaplan-Meier curves for risk group stratification for model. A log-rank test indicated that these curves were significantly different (P<.01). We conducted an external validation on the The Christie cohort (n=196). The AUC of the model was 0.66 (95% CI, 0.57-0.74). The support vector machine (SVM) model developed by Oberije et al was also validated on this cohort. The AUC of the SVM model developed by Oberije et al was 0.59 (95% CI, 0.49-0.68). Comparison of the 2 ROC curves indicated they were not significantly different (P=.19). The 2 ROC curves are shown in Figure 4. Performance of a model based on the T and N category alone was compared with the Bayesian network. The T and N model performed with an AUC of 0.47 (95% CI, 0.37-0.57) on the external validation set, significantly lower than the Bayesian network model (P<.001). The T and N model performed with an AUC of 0.53 on a 5-fold cross validation on the training set. A model that used stage groups, rather than T and N category separately, performed with an AUC of 0.56 on the validation set (95% CI, 0.49-0.63) and an AUC of 0.54 on a 5-fold cross validation on the training set.

Fig. 4

Receiver operating characteristic curves of models compared in this study. The expert Bayesian network structure is shown in Figure 1A. The Bayesian network structure of Jayasurya et al (9) is shown in Figure 1C. The Patch Condition (PC) algorithm using the Bayesian network structure is shown in Figure 1B. Validation was done on the institute 3 cohort. Abbreviations: AUC = area under curve; SVM = support vector machine. To investigate the effect of using different network structures, we have compared the performance of the model using the network structure presented in the work by Jayasurya et al (9). In addition, we have learned a network structure on the data using an algorithmic approach (48). Network structures are shown in Figure 1B and C. Results are shown in Figure 4. Neither network structure equaled the performance of the network structure defined by experts (P<.05 for Jayasurya et al, P<.001 for the algorithmic approach. To further evaluate all models tested in this study, we present their performances in terms of discrimination (expressed as AUC), calibration (expressed as the coefficient of determination, r2), and actual chance (Brier score) in Tables E7 and E8 (available online at www.redjournal.org). Comparison of the ROC curves for the Bayesian network model on the 5-fold cross validation and external validation indicated that they were not significantly different (P=.33).

Discussion

In this study, we used a distributed learning approach for Bayesian networks to develop a model using data from 3 hospitals in 3 countries without the need for any data to leave the individual hospital. Previous work exists on survival prediction for NSCLC patients receiving CRT 9, 12, 49. Oberije et al found that sex, performance status, forced expiratory volume in 1 second, number of PLNSs, and gross tumor volume were relevant predictive factors and used these in their model (12). Their model was based on a smaller cohort of patients (n=377), and we found that some of our observations are in contrast to theirs. Age was included in our model as it was significantly correlated with 2-year survival (P<.01). Oberije et al found that there was no significant trend between survival and age (12). Sex was found to correlate significantly with survival in the cohort of Oberije et al. In our study, sex did not significantly correlate with outcome (P=.07). Contradictory results regarding both sex and age have been reported previously 50, 51. Dose was included as a parameter in our model. A clinical trial showed that higher dose decreased overall survival (52). A meta-analysis suggested there may be an effect of hyperfractionated and/or accelerated non-concurrent CRT (53). As most of the patients included in the model were treated with the highest dose achievable without exceeding normal tissue constraints, it is possible that RT dose in our model was a surrogate for low volumetric disease burden. To provide conclusive evidence of whether total tumor dose affects overall survival for specific patient groups, large multicenter studies are required.

Model comparison

We have compared our model with an SVM model that was developed previously (49). Although the Bayesian network did not perform significantly better on the validation set as a whole, we observed a significant improvement in survival prediction on the older patients of the validation cohort (Fig. E1; available online at www.redjournal.org). This could be because of the larger volumes of patient data on which we have based our model. As more data are used, a larger variety of patients are included in the model. This in turn makes the model more robust. Variables and trends previously not significant may turn out to be of high value when larger sample sizes are used, as was the case in this study for the age variable. Other weakly significant variables, such as sex, may ultimately be found to be irrelevant. Another advantage of our model over the previously developed model is that it handles missing values better, as has been shown in previous work (9). Furthermore, the T and N model performs poorly on the validation set, whereas both the SVM model of Oberije et al and the current model perform above the chance level, indicating added value of this work (12). Our model outperformed the previous model of Jayasurya et al (9) on this validation set. One possible explanation for this finding is that the network structure used by Jayasurya et al has too many arcs pointing directly to the outcome node. This in turn means that the CPT for this node becomes very large. Such a large CPT will be undersampled, and therefore the model may be subject to overfitting. Every node pointing toward the outcome will interact with all the other variables pointing to the outcome. This is analogous to introducing interaction terms in conventional regression models. For all arrows pointing from the outcome to a variable, independence is assumed among these variables. These are the only kinds of connections observed in naive Bayes classifiers (NBCs). A weakness of NBCs is that they assume independence among variables. An advantage of NBCs is that they require fewer data, because of this independence assumption. In the Bayesian network presented in this study, we used a partially NBC structure and combined it with 2 arcs pointing directly toward the outcome. We thereby avoided overfitting while still enabling enough complexity to allow the model to perform with high discriminative power.

Limitations

This study has a number of limitations. First, a Bayesian network structure has been chosen based on expert opinion. An alternative option is to take an algorithmic approach and let the data determine the best Bayesian network structure. Previous work by Sesen et al (54) has shown that structure learning algorithms outperform structures selected by experts. However, applying an algorithmic approach proved ineffective in this work (Fig. E2; available online at www.redjournal.org). This conflicting finding may be because of the difference in datasets and variables used. Further research is necessary to identify the optimal structure given a certain modeling scenario. Another weakness of this study is related to the comparison of our model with the SVM (49). The SVM model developed previously used the number of PLNSs to make predictions (12). This variable was unavailable in our dataset, making a fair comparison more difficult. An additional weakness of our study is that the model was largely trained on historical patients. Such models have less added value to clinical practice today as RT practice is subject to continual innovation. Phase III clinical trials provide high-grade evidence; however, they have a downside, as such studies take a relatively long time to complete. Distributed learning can fill the temporal gap by learning from large volumes of patient data from different hospitals as soon as the data are entered into the electronic health record system. A model trained on historical patients, as was used in this study, may serve to generate hypotheses for future experiments. In addition, once the system is in place, distributed learning can be repeated to include more recent patients and thus update the model using the latest practice insights with the ultimate goal to realize a rapid learning health care system. Although distributed learning is a major stride towards easier data sharing and learning models on larger volumes of data, some hurdles remain. It takes several months to set up the infrastructure in an institution. Furthermore, retrieving the data from the electronic medical records is still challenging, as data are often scattered across many databases and applications. Once the data are retrieved and the infrastructure is set up, however, distributed learning of any number of models can be done.

Future work

Our future vision is summarized in an animation at https://www.youtube.com/watch?v=ZDJFOxpwqEA. As our methodology permits learning from features of all kinds, we intend to include variables that could potentially have higher predictive value, such as radiomics and genomics features, to learn more sophisticated models in a distributed manner 22, 23. We intend to include these models in customized patient decision aids and use them for patient stratification in clinical trials 3, 7, 55.

48 in total

Review 1. American Society of Clinical Oncology treatment of unresectable non-small-cell lung cancer guideline: update 2003.

Authors: David G Pfister; David H Johnson; Christopher G Azzoli; William Sause; Thomas J Smith; Sherman Baker; Jemi Olak; Diane Stover; John R Strawn; Andrew T Turrisi; Mark R Somerfield
Journal: J Clin Oncol Date: 2003-12-22 Impact factor: 44.544

2. Prognostic factors for survival in stage III non-small-cell lung cancer treated with definitive radiation therapy: impact of tumor volume.

Authors: Kiyoshi Basaki; Yoshinao Abe; Masahiko Aoki; Hidehiro Kondo; Yoshiomi Hatayama; Shigeyuki Nakaji
Journal: Int J Radiat Oncol Biol Phys Date: 2005-10-13 Impact factor: 7.038

Review 3. Chemotherapy for non-small cell lung cancer.

Authors:
Journal: Cochrane Database Syst Rev Date: 2000

4. Recursive partitioning analysis of 1999 Radiation Therapy Oncology Group (RTOG) patients with locally-advanced non-small-cell lung cancer (LA-NSCLC): identification of five groups with different survival.

Authors: M Werner-Wasik; C Scott; J D Cox; W T Sause; R W Byhardt; S Asbell; A Russell; R Komaki; J S Lee
Journal: Int J Radiat Oncol Biol Phys Date: 2000-12-01 Impact factor: 7.038

5. Gross tumor volume, critical prognostic factor in patients treated with three-dimensional conformal radiation therapy for non-small-cell lung carcinoma.

Authors: Jeffrey D Bradley; Nantaken Ieumwananonthachai; James A Purdy; Todd H Wasserman; Mary Ann Lockett; Mary V Graham; Carlos A Perez
Journal: Int J Radiat Oncol Biol Phys Date: 2002-01-01 Impact factor: 7.038

6. Comorbidity and Karnofksy performance score are independent prognostic factors in stage III non-small-cell lung cancer: an institutional analysis of patients treated on four RTOG studies. Radiation Therapy Oncology Group.

Authors: Selim Firat; Roger W Byhardt; Elizabeth Gore
Journal: Int J Radiat Oncol Biol Phys Date: 2002-10-01 Impact factor: 7.038

Review 7. Prognostic factors in non-small cell lung cancer: a decade of progress.

Authors: Michael D Brundage; Diane Davies; William J Mackillop
Journal: Chest Date: 2002-09 Impact factor: 9.410

8. The relevance of the revised version of the international lung cancer staging system for non-small-cell lung cancer treated with radiotherapy.

Authors: H Langendijk; J de Jong; R Wanders; B Slotman
Journal: Clin Lung Cancer Date: 2001-08 Impact factor: 4.785

Review 9. Prognostic factors in non-small cell lung cancer.

Authors: Merrill J Solan; Maria Werner-Wasik
Journal: Semin Surg Oncol Date: 2003

10. Survival is better predicted with a new classification of stage III unresectable non-small cell lung carcinoma treated by chemotherapy and radiotherapy.

Authors: T Berghmans; J J Lafitte; J Thiriaux; P VanHoutte; J Lecomte; A Efremidis; G Koumakis; V Giner; M Richez; J L Corhay; P Wackenier; P Lothaire; P Mommen; V Ninane; J P Sculier
Journal: Lung Cancer Date: 2004-09 Impact factor: 5.705

28 in total

Review 1. Artificial intelligence in radiation oncology.

Authors: Elizabeth Huynh; Ahmed Hosny; Christian Guthier; Danielle S Bitterman; Steven F Petit; Daphne A Haas-Kogan; Benjamin Kann; Hugo J W L Aerts; Raymond H Mak
Journal: Nat Rev Clin Oncol Date: 2020-08-25 Impact factor: 66.675

Review 2. Radiomics: from qualitative to quantitative imaging.

Authors: William Rogers; Sithin Thulasi Seetha; Turkey A G Refaee; Relinde I Y Lieverse; Renée W Y Granzier; Abdalla Ibrahim; Simon A Keek; Sebastian Sanduleanu; Sergey P Primakov; Manon P L Beuque; Damiënne Marcus; Alexander M A van der Wiel; Fadila Zerka; Cary J G Oberije; Janita E van Timmeren; Henry C Woodruff; Philippe Lambin
Journal: Br J Radiol Date: 2020-02-26 Impact factor: 3.039

3. Prospects and challenges for clinical decision support in the era of big data.

Authors: Issam El Naqa; Michael R Kosorok; Judy Jin; Michelle Mierzwa; Randall K Ten Haken
Journal: JCO Clin Cancer Inform Date: 2018-11-09

4. An artificial intelligence framework integrating longitudinal electronic health records with real-world data enables continuous pan-cancer prognostication.

Authors: Olivier Morin; Martin Vallières; Steve Braunstein; Jorge Barrios Ginart; Taman Upadhaya; Henry C Woodruff; Alex Zwanenburg; Avishek Chatterjee; Javier E Villanueva-Meyer; Gilmer Valdes; William Chen; Julian C Hong; Sue S Yom; Timothy D Solberg; Steffen Löck; Jan Seuntjens; Catherine Park; Philippe Lambin
Journal: Nat Cancer Date: 2021-07-22

Review 5. Towards the Interpretability of Machine Learning Predictions for Medical Applications Targeting Personalised Therapies: A Cancer Case Survey.

Authors: Antonio Jesús Banegas-Luna; Jorge Peña-García; Adrian Iftene; Fiorella Guadagni; Patrizia Ferroni; Noemi Scarpato; Fabio Massimo Zanzotto; Andrés Bueno-Crespo; Horacio Pérez-Sánchez
Journal: Int J Mol Sci Date: 2021-04-22 Impact factor: 5.923

6. Predicting 2-year survival in stage I-III non-small cell lung cancer: the development and validation of a scoring system from an Australian cohort.

Authors: Natalie Si-Yi Lee; Jesmin Shafiq; Matthew Field; Caroline Fiddler; Suganthy Varadarajan; Senthilkumar Gandhidasan; Eric Hau; Shalini Kavita Vinod
Journal: Radiat Oncol Date: 2022-04-13 Impact factor: 3.481

7. Decision analytic modeling for the economic analysis of proton radiotherapy for non-small cell lung cancer.

Authors: Wade P Smith; Patrick J Richard; Jing Zeng; Smith Apisarnthanarax; Ramesh Rengan; Mark H Phillips
Journal: Transl Lung Cancer Res Date: 2018-04

Review 8. Requirements and reliability of AI in the medical context.

Authors: Yoganand Balagurunathan; Ross Mitchell; Issam El Naqa
Journal: Phys Med Date: 2021-03-13 Impact factor: 2.685

9. A situational awareness Bayesian network approach for accurate and credible personalized adaptive radiotherapy outcomes prediction in lung cancer patients.

Authors: Yi Luo; Shruti Jolly; David Palma; Theodore S Lawrence; Huan-Hsin Tseng; Gilmer Valdes; Daniel McShan; Randall K Ten Haken; Issam Ei Naqa
Journal: Phys Med Date: 2021-06-04 Impact factor: 3.119

Review 10. Stereotactic Radiosurgery in the Management of Patients With Brain Metastases of Non-Small Cell Lung Cancer: Indications, Decision Tools and Future Directions.

Authors: Dianne Hartgerink; Britt van der Heijden; Dirk De Ruysscher; Alida Postma; Linda Ackermans; Ann Hoeben; Monique Anten; Philippe Lambin; Karin Terhaag; Arthur Jochems; Andre Dekker; Janna Schoenmaekers; Lizza Hendriks; Jaap Zindler
Journal: Front Oncol Date: 2018-05-09 Impact factor: 6.244