Literature DB >> 26942424

Prediction of Hematopoietic Stem Cell Transplantation Related Mortality- Lessons Learned from the In-Silico Approach: A European Society for Blood and Marrow Transplantation Acute Leukemia Working Party Data Mining Study.

Roni Shouval1,2,3, Myriam Labopin4,5,6,7, Ron Unger3, Sebastian Giebel8, Fabio Ciceri9, Christoph Schmid10, Jordi Esteve11, Frederic Baron12, Norbert Claude Gorin4, Bipin Savani13, Avichai Shimoni1, Mohamad Mohty4,5,6,7, Arnon Nagler1,4.   

Abstract

Models for prediction of allogeneic hematopoietic stem transplantation (HSCT) related mortality partially account for transplant risk. Improving predictive accuracy requires understating of prediction limiting factors, such as the statistical methodology used, number and quality of features collected, or simply the population size. Using an in-silico approach (i.e., iterative computerized simulations), based on machine learning (ML) algorithms, we set out to analyze these factors. A cohort of 25,923 adult acute leukemia patients from the European Society for Blood and Marrow Transplantation (EBMT) registry was analyzed. Predictive objective was non-relapse mortality (NRM) 100 days following HSCT. Thousands of prediction models were developed under varying conditions: increasing sample size, specific subpopulations and an increasing number of variables, which were selected and ranked by separate feature selection algorithms. Depending on the algorithm, predictive performance plateaued on a population size of 6,611-8,814 patients, reaching a maximal area under the receiver operator characteristic curve (AUC) of 0.67. AUCs' of models developed on specific subpopulation ranged from 0.59 to 0.67 for patients in second complete remission and receiving reduced intensity conditioning, respectively. Only 3-5 variables were necessary to achieve near maximal AUCs. The top 3 ranking variables, shared by all algorithms were disease stage, donor type, and conditioning regimen. Our findings empirically demonstrate that with regards to NRM prediction, few variables "carry the weight" and that traditional HSCT data has been "worn out". "Breaking through" the predictive boundaries will likely require additional types of inputs.

Entities:  

Mesh:

Year:  2016        PMID: 26942424      PMCID: PMC4778768          DOI: 10.1371/journal.pone.0150637

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Allogeneic hematopoietic stem transplantation (HSCT) is a potentially curative procedure for selected patients with hematological malignancies. Transplant associated morbidity and mortality remains substantial, making the decision of whom, how and when to transplant, of great importance [1]. The European Group for Blood and Marrow Transplantation (EBMT) score, initially developed for prediction of allogeneic HSCT outcomes in chronic myeloid leukemia, and later validated for other diagnoses, has pioneered the field of prognostic modeling in HSCT [2, 3]. Since its release, almost two decades ago, additional scores have also been developed. These have been validated, but do not fully account for transplantation risk in acute leukemia [4-9]. Performance limiting factors of HSCT prediction models might be attributed to inherent procedural uncertainty, the statistical methodology used, or the number and quality of features collected. Using an in-silico approach (i.e., iterative computerized simulations), based on machine learning (ML) algorithms, we set out to explore these factors in order to improve future acute leukemia HSCT outcome prediction models. ML is a field in artificial intelligence. The underlying paradigm does not start with a pre-defined model; rather it lets the data create the model by detecting underlying patterns. Thus, this approach avoids pre-assumptions regarding model types and variable interactions, and may offer additional knowledge, which has eluded detection by standard statistical methods. ML algorithms, have been applied in various "big data" scenarios such as financial markets, complex physical systems, marketing, advertising, robotics, meteorology, biology and more. They are tools in the data mining approach for knowledge discovery in large datasets [10, 11]. Recently, we have developed the EBMT- Alternating Decision Tree (ADT) ML based prediction model for mortality at 100 days following allogeneic HSCT in acute leukemia [9, 12]. Hence, demonstrating feasibility of the data mining approach in HSCT.

Methods

Study population

This was a retrospective, data mining, supervised learning study, based on data reported to the Acute Leukemia Working Party (ALWP) registry of the EBMT. The EBMT is a voluntary group of more than 500 centers, required to report all consecutive HSCT and follow-ups annually in a standardized manner. The study was approved by the ALWP board. Written informed consent was given by participants for their clinical records to be used in EBMT retrospective studies. Inclusion criteria encompassed first allogeneic transplants from HLA matched sibling and unrelated donors (> = 8/10), performed from 2005 to 2013, using peripheral blood stem cells or bone marrow as cell source, on adults (age > = 18 years) diagnosed with de-novo acute leukemia. Haploidentical and cord blood transplants were not included. A total of 26,266 patients from 326 European centers were initially analyzed. Patients lost from follow-up before day 100 post HSCT were discarded from analysis (n = 343, 1.3%). Twenty two variables describing recipient, donor, and procedural characteristics were considered.Variables were defined according to EBMT criteria (Table 1 and Appendix A in S1 File) [13].
Table 1

Patient Characteristics.

Interquartile range (IQR), Body mass index (BMI), Recipient (R), Donor (D), Cytomegalovirus (CMV), Acute lymphoblastic leukemia (ALL), Total body irradiation (TBI), Graft versus host disease (GVHD), Antithymocyte globulin (ATG), Peripheral blood (PB), Bone marrow (BM)

ValueNMissing, n
Median year (IQR)2009 (2007–2011)259230
Median recipient age (IQR)45 (33–56)259230
Median BMI (IQR)24 (22–27)935016573
Median days between diagnosis and HSCT (IQR)191 (138–363)259149
Median donor's age (IQR)38.8 (29–48)1002715896
Recipient gender2587251
Male14228 (55.0%)
Female11644 (45.0%)
Recipient CMV serostatus228553068
-7788 (34.1%)
+15067 (65.9%)
Karnofsky at transplant243691554
> = 8022966 (94.2%)
<801403 (5.8%)
Comorbidity score merged246923454
0403 (16.3%)
1747 (30.3%)
2457 (18.5%)
3419 (17.0%)
> = 4443 (17.9%)
Diagnosis259230
AML18610 (71.8%)
ALL7313 (28.2%)
Cytogenetics risk1343012493
Standard10080 (75.1%)
Poor3350 (24.9%)
Disease stage259230
CR116201 (62.5%)
CR24909 (18.9%)
Advanced4813 (18.6%)
Previous autograft259230
-25235 (97.3%)
+688 (2.7%)
Donor gender25357566
Male15712 (62.0%)
Female9645 (38.0%)
Donor CMV serostatus227263197
-10927 (48.1%)
+11799 (51.9%)
D-R sex combination25318605
Male D to male R9153 (36.2%)
Female D to female R4863 (19.2%)
Male D to female R6528 (25.8%)
Female D to male R4774 (18.9%)
D-R CMV serostatus combination223953528
D-CMV–/R-CMV–5572 (24.9%)
D-CMV+/R-CMV–or D-CMV–/R-CMV+8917 (39.8%)
D-CMV+/R-CMV+7906 (35.3%)
Donor type259230
HLA matched unrelated donor13585 (52.4%)
HLA identical sibling12338 (47.6%)
HLA match degree909016833
10/106519 (71.7%)
9/102068 (22.8%)
<9/10503 (5.5%)
Source of stem cells259230
BM4109 (15.9%)
PB or BM+PB21814 (84.1%)
Conditioning25420503
MAC16836 (66.2%)
RIC8584 (33.8%)
TBI25742181
No15042 (58.4%)
Yes10700 (41.6%)
GVHD prevention232282695
Ex-vivo T cell depletion800 (3.4%)
In-vivo T cell depletion9825 (42.3%)
No T cell depletion12603 (54.3%)
Relapse at day 100259230
-23384 (90.2%)
+2539 (9.8%)
Non relapse related mortality at day 100259230
-23536 (90.8%)
+2387 (9.2%)
Overall mortality at day 100259230
-22643 (87.3%)
+3280 (12.7%)

Patient Characteristics.

Interquartile range (IQR), Body mass index (BMI), Recipient (R), Donor (D), Cytomegalovirus (CMV), Acute lymphoblastic leukemia (ALL), Total body irradiation (TBI), Graft versus host disease (GVHD), Antithymocyte globulin (ATG), Peripheral blood (PB), Bone marrow (BM)

Study objectives

Study objectives included development of multiple prediction models for NRM 100 days post allogeneic HSCT, while estimating effects of the algorithm type, population size, specific subpopulations and number of variable incorporated, on the models' predictive performance. Day 100 NRM was defined as death without previous relapse/progression before day 100.

Study design

Prediction models for day 100 NRM were developed using six ML algorithms (WEKA v. 3-7-11, New-Zealand). Through an in-silico approach, algorithms were iteratively exposed to an increasing population size, varying sub-populations, or an increasing number of ranked variables, selected by a separate feature selection algorithm (Fig 1). For each iteration, a prediction model was trained and tested through 10 fold cross-validation. This process was repeated 5 times, each time randomly sampling the experimental dataset (see below). Performance was evaluated according to the area under receiver operator characteristic curve (AUC) [14, 15].
Fig 1

In-silico predictive modeling- experimental design.

The original dataset was randomly split into an optimization and experimental datasets. The former was used for tuning of machine learning algorithms and feature selection. A. Several experiments were run on the experimental dataset, testing the effects of population size, specific subpopulations and number of variables included on predictive performance. B. A detailed explanation of the increasing population size experiment displayed in panel A. Patients were randomly sampled from the experimental dataset, creating samples with an expending size, which were later introduced to six machine learning algorithms. For each sample a prediction model for day 100 NRM was developed, and performance was measured through the area under the receiver operating curve (AUC). Models were trained and tested with 10 fold cross validation. The sampling process was repeated 5 times. C. For estimation of variable importance (ranked variables experiment in panel A) and the number of variables necessary for optimal prediction of day 100 NRM, we ran a feature selection algorithm on the optimization set. Variables were ranked according to their predictive contribution to each algorithm. The next step involved serial introduction of the variables, according to their importance to six machine learning algorithms which were applied on the experimental dataset. In each iteration a prediction model for day 100 NRM was trained and test with 10 fold cross validation. For instance in the first iteration the top ranking variable was introduced, in the second the top 2 variables and so on until all 23 variables were used. Performance was estimated according to the AUC. Machine learning (ML), Algorithm (Alg).

In-silico predictive modeling- experimental design.

The original dataset was randomly split into an optimization and experimental datasets. The former was used for tuning of machine learning algorithms and feature selection. A. Several experiments were run on the experimental dataset, testing the effects of population size, specific subpopulations and number of variables included on predictive performance. B. A detailed explanation of the increasing population size experiment displayed in panel A. Patients were randomly sampled from the experimental dataset, creating samples with an expending size, which were later introduced to six machine learning algorithms. For each sample a prediction model for day 100 NRM was developed, and performance was measured through the area under the receiver operating curve (AUC). Models were trained and tested with 10 fold cross validation. The sampling process was repeated 5 times. C. For estimation of variable importance (ranked variables experiment in panel A) and the number of variables necessary for optimal prediction of day 100 NRM, we ran a feature selection algorithm on the optimization set. Variables were ranked according to their predictive contribution to each algorithm. The next step involved serial introduction of the variables, according to their importance to six machine learning algorithms which were applied on the experimental dataset. In each iteration a prediction model for day 100 NRM was trained and test with 10 fold cross validation. For instance in the first iteration the top ranking variable was introduced, in the second the top 2 variables and so on until all 23 variables were used. Performance was estimated according to the AUC. Machine learning (ML), Algorithm (Alg). Tuning of the algorithms parameters (Table A in S1 File) and the feature selection process, explained below, were conducted on an optimization dataset (n = 3,888, 15%), whereas the development of the various models of day 100 NRM prediction were done on the experimental dataset (n = 22,035, 85%). Samples were randomly allocated to each dataset from the original dataset.

Machine learning Algorithms

Six popular, supervised classification ML algorithms were selected (Appendix B in S1 File). Naïve bayse (NB), alternating decision trees (ADT) and logistic regression (LR) produce models with interpretable structures, whereas multilayer perceptron (MLP), random forest (RF) and AdaBoost are "black box" models, where the function connecting the predictor variables with response is opaque to the user [16-22].

Feature selection

Feature selection is the process of ranking variables and identifying irrelevant and redundant information. The reduction of dimensionality presents a number of benefits, such as enabling algorithms to operate faster and more effectively, improving classification accuracy, improving data visualization, and enhancing understanding of the derived classification models [23]. Using a classifier based feature selection algorithm, which was applied on the optimization dataset for each of the 6 previously described ML classification algorithms, variables were ranked according to their importance for prediction of day 100 NRM (Appendix C in S1 File).

Results

Patient characteristics

Characteristics of 25,923 analyzed patients are listed in Table 1. The majority had Acute Myeloid Leukemia (AML) (71.8%), were in first complete remission (CR1) (62.5%) and received myeloablative conditioning (MAC) (66.2%). Grafts from matched sibling donors were used in 47.6% of patients. Graft source was mainly peripheral blood (84.1%). NRM and overall mortality prevalence at day 100, were 9.2% (n = 2,387) and 12.7% (n = 3,280) respectively. Whereas 9.8% (n = 2,539) of patients relapsed before 100 days. They were consequently considered as no NRM at day100. The parameter optimization and experimental datasets were similar in terms of baseline characteristics (Table B in S1 File).

Sample size effect on prediction

Day 100 NRM prediction models were developed with 6 ML algorithms on an expanding patient population (110–22,035 patients) sampled from the experimental dataset. When models were developed on all available patients, AUCs ranged from 0.64 for the MLP algorithm to 0.67 for the LR and AdaBoost algorithms (Fig 2 and Table C in S1 File). Depending on the algorithm, predictive performance plateaued on a sample size of 6,611–8,814 patients. Samples consisting of 551 patients or less, demonstrated poor performance with AUCs ranging from 0.56–0.59.
Fig 2

Predictive performance of day 100 NRM prediction models with increasing sample size.

A gradually increasing sample from the experimental dataset was introduced to 6 machine learning algorithms. Prediction models were developed for each incremental step and their discriminative performance is plotted on the Y axis. Alternating decision tree (ADT), Logistic regression (LR), Multilayer perceptron (MLP), Naïve base (NB), Random forest (RF).

Predictive performance of day 100 NRM prediction models with increasing sample size.

A gradually increasing sample from the experimental dataset was introduced to 6 machine learning algorithms. Prediction models were developed for each incremental step and their discriminative performance is plotted on the Y axis. Alternating decision tree (ADT), Logistic regression (LR), Multilayer perceptron (MLP), Naïve base (NB), Random forest (RF).

Sub-population effect on prediction

Prediction models were developed for specific subsets of patients, and performance was compared with models developed on the whole population (Table 2). Performance was significantly lower for the various disease stage subpopulations (e.g., for CR2 AUC ranged from 0.55–0.59) and for patients transplanted from HLA matched unrelated donors. Prediction models developed on the remaining subpopulations had similar performance to models developed using all available patients.
Table 2

Predictive performance of day 100 NRM prediction models on varying subpopulations.

AdaBoostADTLRMLPNBRFAverage performance
Sample SizeAUCSTDVAUCSTDVAUCSTDVAUCSTDVAUCSTDVAUCSTDVAUCSTDV
Full dataset220350.670.020.660.020.670.020.630.010.650.020.660.020.660.01
Age<45108200.660.030.650.030.660.030.640.020.650.030.660.030.650.01
Age> = 45112150.660.030.650.030.660.030.640.020.650.030.650.030.650.01
ALL62140.650.040.640.040.660.040.640.020.640.030.650.030.650.01
AML158210.670.020.660.030.670.020.650.020.650.020.660.020.660.01
CR1137870.630.03*0.610.03*0.640.03*0.630.020.610.03*0.620.03*0.620.01
CR241650.580.05*0.550.04*0.590.05*0.590.03*0.580.05*0.580.05*0.580.01
Advanced40830.620.04*0.610.04v0.610.04v0.590.03v0.60.03*0.610.04*0.610.01
MAC147540.660.020.650.020.660.020.630.020.650.020.660.020.650.01
RIC77030.670.030.660.030.670.030.660.020.650.030.660.030.660.01
MRD104580.650.030.640.030.660.030.650.030.650.030.650.030.650.01
MUD115770.640.02*0.630.03*0.640.03*0.620.020.620.03*0.630.02*0.630.01

* p-value <0.05 (t-test), Performance of reach model was compared with the performance of the model developed on the full experimental dataset, with the designated algorithm. Standard deviation (STDV), Alternating decision tree (ADT), Logistic regression (LR), Multilayer perceptron (MLP), Naïve base (NB), Random forest (RF), HLA matched related donor (MRD), HLA matched unrelated donor (MUD).

* p-value <0.05 (t-test), Performance of reach model was compared with the performance of the model developed on the full experimental dataset, with the designated algorithm. Standard deviation (STDV), Alternating decision tree (ADT), Logistic regression (LR), Multilayer perceptron (MLP), Naïve base (NB), Random forest (RF), HLA matched related donor (MRD), HLA matched unrelated donor (MUD).

Variable importance

When the feature selection process was applied on the optimization set, disease stage, donor type and conditioning were consistently the 3 top ranking variables across all day 100 NRM prediction models (Fig 3). The mean variable rankings of time from diagnosis to transplant, recipient age, and diagnosis were 4–6, respectively. However, standard deviation was considerably high, as their importance varied between algorithms.
Fig 3

Mean variable ranking of day 100 NRM prediction models.

Variable importance were extracted using a feature selection algorithm for 6 machine learning prediction models of day 100 NRM. The circle marks the mean ranking of each variable and the bars describe 2 standard deviations. Disease stage (Ds_Stage); Time from transplant to diagnosis (Time_dx); Diagnosis (Dx); Body mass index (BMI); Donor (D); Recipient (R); Previos autograft (2nd_tx); # of HLA mismatches (#HLA_miss); Graft versus host disease prophylaxis (GVHD_prox); Total body irradiation (TBI);

Mean variable ranking of day 100 NRM prediction models.

Variable importance were extracted using a feature selection algorithm for 6 machine learning prediction models of day 100 NRM. The circle marks the mean ranking of each variable and the bars describe 2 standard deviations. Disease stage (Ds_Stage); Time from transplant to diagnosis (Time_dx); Diagnosis (Dx); Body mass index (BMI); Donor (D); Recipient (R); Previos autograft (2nd_tx); # of HLA mismatches (#HLA_miss); Graft versus host disease prophylaxis (GVHD_prox); Total body irradiation (TBI); To assess the relationship between models' performance and the number of variables incorporated into them, the ranked variables were serially introduced to the 6 ML algorithms. The algorithms were applied on the experimental dataset. Starting from the top ranking variables, gradually adding variables with lower ranking, prediction models for day 100 NRM were iteratively constructed (Fig 1). The maximal predictive performance ranged from 0.65–0.67, with LR and MLP achieving their optimal AUC with only 6 variables (Fig 4). When introduced with the 3 top ranking variables all models achieved an AUC of 0.64.
Fig 4

Predictive performance of day 100 NRM prediction models on a cumulative ranked variable list.

Alternating decision tree (ADT); Logistic regression (LR); Multilayer perceptron (MLP); Naïve base (NB); Random forest (RF);

Predictive performance of day 100 NRM prediction models on a cumulative ranked variable list.

Alternating decision tree (ADT); Logistic regression (LR); Multilayer perceptron (MLP); Naïve base (NB); Random forest (RF);

Discussion

Eligibility of patients with acute leukemia for allogeneic HSCT is based on a risk benefit-assessment of the relapse risk versus NRM risk [24]. Risk scores for transplant associated mortality have been developed based on retrospective registry data. A large HSCT registry was explored, while automatizing the prediction model development processes, creating thousands of models, depending on the questions asked. We show that for day 100 NRM prediction various models, developed on the basis of 6 popular ML algorithms, reach approximately the same performance. With data commonly collected, saturation of predictive performance requires very few variables, but large datasets. The nature of association between predictors and response, the data’s quality and dimensionality (i.e., number of variables analyzed), and the number of events per outcome, all affect the sample size necessary for generation of a robust and generalizable prediction model [15]. Hence, predetermination of the sample size is a matter of empirical testing, rather than a standardized calculation. Using repetitive computerized simulations, we demonstrate that with marginal differences between algorithms, approximately 6,000 patients were needed to achieve maximal or near-maximal predictive performance. Defining a strict cutoff for modeling studies would be erroneous, as the data’s features differ between cohorts. However, a rather solid assumption based on the presented results on “real world” data, would be the need to include thousands of patients when aiming to develop and validate similar modeling problems. Iterative development of prediction models for specific subpopulation has drawn attention to the different disease stage groups. Performance was lowest for the CR2 group, with an AUC ranging from 0.53–0.58. Low performance, but to lesser extent, was also noted for the other disease stage groups. Disease stage is highly predictive of day 100 NRM. Thus, it is not surprising that when disease stage was excluded from the pool of variables considered for prediction, performance declined, as it is highly informative. Prospects for cure are higher for patients in CR1 compared to other disease stages. Hence, estimation of NRM risk is of special interest in this group, as non-transplant alternatives exist [25-27]. Versluis et al., have addressed such a population receiving reduced intensity conditioning. When looked upon separately, the Hematopoietic Cell Transplantation-Specific Comorbidity Index (HCT-CI) and EBMT score were not predictive of NRM, corroborating the challenge we encountered. A new score, based on integrated feature of the comorbidity index and EBMT score, was constructed achieving an AUC of 0.68 [8]. It should be noted that most algorithms reached an AUC of 0.65 using only 3–5 variables. Adding more variables led to a modest improvement, which translates to marginal clinical significance. The top 3 ranking variables, shared by all algorithms were disease stage, donor type, and conditioning. Transplanters will not be surprised by these determinants, which have been validated repeatedly [28, 29]. Predictive weight attributed to other features varied considerably between models, leading at best to modest increment in predictive accuracy. Traditional HSCT prognostic studies, rely on a collection of variables similar to the one presented. Thus, effective prediction of individualized NRM is unlikely to substantially improve. Incorporation of the HCT-CI score holds promise. However, even when applied separately or in combination with other features, the comorbidity index reaches a maximal AUC of 0.7 [4, 7, 8, 30–32]. In other words, contemporary prognostic models are suitable for risk stratification rather than outcome prediction. The discovery of additional prognostic markers, the incorporation of electronic medical records to routine clinical use, and the addition of biological and genetic data to information gathered on leukemia patients, offers great opportunities for model improvement [33, 34]. Mortality following transplantation is likely the result of a complex network of interactions and non-linear associations. Hence, the Occam's razor concept, where the simplest solution is the best solution, might not hold for prediction of transplantation outcomes. Exploiting the abundance of data now available on transplant patients, could potentially improve prediction models’ applicability. Novel modeling techniques such as ML [35, 36], enabling non-parsimonious incorporation of a high number of variables, are warranted. These methods could potentially improve accuracy, but interpretability might be lost. The EBMT-ADT prediction model marked the entrance of the data mining methodology into HSCT prognostic research [9, 22]. The aim of the ADT study was development of a prediction model for overall mortality at 100 days following allogeneic HSCT in acute leukemia patients. Though using a data mining methodology, the perspective of the current study was not prediction per-se, but rather an analysis of the predictive modeling process and its boundaries, while focusing on NRM at day 100 as the objective. Thousands of prediction models, with varying algorithms, were developed and evaluated in order to discover elements that could improve future models. The in-silico experimental system allowed us to dissect the conditions under which the models were developed and the corresponding performance. Thus, providing methodological and clinical insights regarding sample size, modeling technique, and variable importance. The study carries several limitations. First, it is a retrospective analysis susceptible to data selection and measurement biases. However, the registry analyzed reflects real world data, hence conveying contemporary practice. Second, a few variables suffered from a large amount of missing values. That being said, ML algorithms allow prediction of the outcome of interest without strong assumptions regarding the distribution and missingness, In addition, we show that when discarding variables with more than 15% missing values, prediction does not improve (Table D in S1 File). Third, we focus on short term data- day 100 NRM, rather than long term mortality. We believe that the high rate of day 100 NRM (9.2%) makes it a valid objective. Moreover, prediction of long term outcomes might be expected to give lower performance, as more parameters come into play. Hence, the concepts presented should be applicable to modeling distant outcomes. Fourth, we relate to prediction of day 100 NRM as a simple classification task, disregarding the time to event effect. However, given the large sample size, disregarding censored data (1.3%) is unlikely to have impact on performance.

Conclusion

The in-silico approach is a novel experimental system utilizing machine learning algorithms, for empirical estimation of prediction boundaries in HSCT. Several clinical and methodological lessons have been learned by the suggested approach. Large registry studies, involving thousands of patients are necessary for development of robust prediction models, as performance of different algorithms converged when sampling more than 6,000 patients. In addition, an exhaustive search for variable importance, reveal that few variables "carry the weight" with regard to predictive influence. Potential bias of the presented approach include: data quality issues and selection of a short term rather than a long term outcome. Overall, it appears that when using traditional HSCT data, a point of predictive saturation has been reached. Improving performance will likely require additional types of input like genetic, clonal and biologic factors. Appendix A in S1 File: Variables’ Definitions. Appendix B in S1 File: Machine Learning Algorithms. Appendix C in S1 File: Feature Selection. Table A in S1 File: Algorithms' parameters. Table B in S1 File: Comparison between variables in the optimization and experimental datasets. Table C in S1 File: Predictive performance of day 100 NRM prediction models with increasing sample size. Table D in S1 File: Predictive performance of day 100 NRM prediction models discarding variables with prevalent missing values. (DOCX) Click here for additional data file.
  24 in total

1.  Identifying important risk factors for survival in patient with systolic heart failure using random survival forests.

Authors:  Eileen Hsich; Eiran Z Gorodeski; Eugene H Blackstone; Hemant Ishwaran; Michael S Lauer
Journal:  Circ Cardiovasc Qual Outcomes       Date:  2010-11-23

Review 2.  What are artificial neural networks?

Authors:  Anders Krogh
Journal:  Nat Biotechnol       Date:  2008-02       Impact factor: 54.908

3.  The randomized registry trial--the next disruptive technology in clinical research?

Authors:  Michael S Lauer; Ralph B D'Agostino
Journal:  N Engl J Med       Date:  2013-08-31       Impact factor: 91.245

4.  Combination of the Hematopoietic Cell Transplantation Comorbidity Index and the European Group for Blood and Marrow Transplantation score allows a better stratification of high-risk patients undergoing reduced-toxicity allogeneic hematopoietic cell transplantation.

Authors:  Pere Barba; Rodrigo Martino; José Antonio Pérez-Simón; Francesc Fernández-Avilés; Nerea Castillo; José Luis Piñana; Lucía López-Anglada; Montserrat Rovira; Francesc Bosch; Enric Carreras; Lucía López Corral; Jorge Sierra; David Valcárcel
Journal:  Biol Blood Marrow Transplant       Date:  2013-10-17       Impact factor: 5.742

Review 5.  Application of machine learning algorithms for clinical predictive modeling: a data-mining approach in SCT.

Authors:  R Shouval; O Bondi; H Mishan; A Shimoni; R Unger; A Nagler
Journal:  Bone Marrow Transplant       Date:  2013-10-07       Impact factor: 5.483

6.  Validation of the Hematopoietic Cell Transplantation-Specific Comorbidity Index: a prospective, multicenter GITMO study.

Authors:  Roberto Raimondi; Alberto Tosetto; Rosi Oneto; Riccardo Cavazzina; Francesco Rodeghiero; Andrea Bacigalupo; Renato Fanin; Alessandro Rambaldi; Alberto Bosi
Journal:  Blood       Date:  2012-06-27       Impact factor: 22.113

7.  Does the hematopoietic cell transplantation specific comorbidity index (HCT-CI) predict transplantation outcomes? A prospective multicenter validation study of the Kanto Study Group for Cell Therapy.

Authors:  Aya Nakaya; Takehiko Mori; Masatsugu Tanaka; Naoto Tomita; Chiaki Nakaseko; Shingo Yano; Shin Fujisawa; Hisashi Sakamaki; Nobuyuki Aotsuka; Akira Yokota; Yoshinobu Kanda; Toru Sakura; Yasuhito Nanya; Takayuki Saitoh; Heiwa Kanamori; Satoshi Takahashi; Shinichiro Okamoto
Journal:  Biol Blood Marrow Transplant       Date:  2014-07-15       Impact factor: 5.742

8.  Risk score for outcome after allogeneic hematopoietic stem cell transplantation: a retrospective analysis.

Authors:  Alois Gratwohl; Martin Stern; Ronald Brand; Jane Apperley; Helen Baldomero; Theo de Witte; Giorgio Dini; Vanderson Rocha; Jakob Passweg; Anna Sureda; André Tichelli; Dietger Niederwieser
Journal:  Cancer       Date:  2009-10-15       Impact factor: 6.860

Review 9.  Allogeneic stem cell transplantation for acute myeloid leukemia in first complete remission: systematic review and meta-analysis of prospective clinical trials.

Authors:  John Koreth; Richard Schlenk; Kenneth J Kopecky; Sumihisa Honda; Jorge Sierra; Benjamin J Djulbegovic; Martha Wadleigh; Daniel J DeAngelo; Richard M Stone; Hisashi Sakamaki; Frederick R Appelbaum; Hartmut Döhner; Joseph H Antin; Robert J Soiffer; Corey Cutler
Journal:  JAMA       Date:  2009-06-10       Impact factor: 56.272

Review 10.  The European LeukemiaNet AML Working Party consensus statement on allogeneic HSCT for patients with AML in remission: an integrated-risk adapted approach.

Authors:  Jan J Cornelissen; Alois Gratwohl; Richard F Schlenk; Jorge Sierra; Martin Bornhäuser; Gunnar Juliusson; Zdenek Råcil; Jacob M Rowe; Nigel Russell; Mohamad Mohty; Bob Löwenberg; Gerard Socié; Dietger Niederwieser; Gert J Ossenkoppele
Journal:  Nat Rev Clin Oncol       Date:  2012-09-04       Impact factor: 66.675

View more
  12 in total

1.  Prognostic Factors for Mortality among Day +100 Survivors after Allogeneic Hematopoietic Cell Transplantation.

Authors:  Sagar S Patel; Lisa A Rybicki; Donna Corrigan; Brian Bolwell; Robert Dean; Hien Liu; Aaron T Gerds; Rabi Hanna; Brian Hill; Deepa Jagadeesh; Matt Kalaycio; Brad Pohlman; Ronald Sobecks; Navneet S Majhail; Betty K Hamilton
Journal:  Biol Blood Marrow Transplant       Date:  2018-01-31       Impact factor: 5.742

2.  Using a machine learning algorithm to predict acute graft-versus-host disease following allogeneic transplantation.

Authors:  Yasuyuki Arai; Tadakazu Kondo; Kyoko Fuse; Yasuhiko Shibasaki; Masayoshi Masuko; Junichi Sugita; Takanori Teshima; Naoyuki Uchida; Takahiro Fukuda; Kazuhiko Kakihana; Yukiyasu Ozawa; Tetsuya Eto; Masatsugu Tanaka; Kazuhiro Ikegame; Takehiko Mori; Koji Iwato; Tatsuo Ichinohe; Yoshinobu Kanda; Yoshiko Atsuta
Journal:  Blood Adv       Date:  2019-11-26

Review 3.  Acute myeloid leukemia and artificial intelligence, algorithms and new scores.

Authors:  Nathan Radakovich; Matthew Cortese; Aziz Nazha
Journal:  Best Pract Res Clin Haematol       Date:  2020-06-07       Impact factor: 3.020

Review 4.  Application of machine learning in the management of acute myeloid leukemia: current practice and future prospects.

Authors:  Jan-Niklas Eckardt; Martin Bornhäuser; Karsten Wendt; Jan Moritz Middeke
Journal:  Blood Adv       Date:  2020-12-08

Review 5.  From patient centered risk factors to comprehensive prognostic models: a suggested framework for outcome prediction in umbilical cord blood transplantation.

Authors:  Roni Shouval; Arnon Nagler
Journal:  Stem Cell Investig       Date:  2017-05-24

6.  Multidimensional Analysis Integrating Human T-Cell Signatures in Lymphatic Tissues with Sex of Humanized Mice for Prediction of Responses after Dendritic Cell Immunization.

Authors:  Valery Volk; Andreas I Reppas; Philippe A Robert; Loukia M Spineli; Bala Sai Sundarasetty; Sebastian J Theobald; Andreas Schneider; Laura Gerasch; Candida Deves Roth; Stephan Klöss; Ulrike Koehl; Constantin von Kaisenberg; Constanca Figueiredo; Haralampos Hatzikirou; Michael Meyer-Hermann; Renata Stripecke
Journal:  Front Immunol       Date:  2017-12-08       Impact factor: 7.561

Review 7.  Artificial Intelligence Approaches in Hematopoietic Cell Transplantation: A Review of the Current Status and Future Directions

Authors:  Ibrahim N. Muhsen; Tusneem ElHassan; Shahrukh K Hashmi
Journal:  Turk J Haematol       Date:  2018-06-08       Impact factor: 1.831

8.  Patient-based prediction algorithm of relapse after allo-HSCT for acute Leukemia and its usefulness in the decision-making process using a machine learning approach.

Authors:  Kyoko Fuse; Shun Uemura; Suguru Tamura; Tatsuya Suwabe; Takayuki Katagiri; Tomoyuki Tanaka; Takashi Ushiki; Yasuhiko Shibasaki; Naoko Sato; Toshio Yano; Takashi Kuroha; Shigeo Hashimoto; Tatsuo Furukawa; Miwako Narita; Hirohito Sone; Masayoshi Masuko
Journal:  Cancer Med       Date:  2019-07-15       Impact factor: 4.452

9.  Translational Applications of Artificial Intelligence and Machine Learning for Diagnostic Pathology in Lymphoid Neoplasms: A Comprehensive and Evolutive Analysis.

Authors:  Julia Moran-Sanchez; Antonio Santisteban-Espejo; Miguel Angel Martin-Piedra; Jose Perez-Requena; Marcial Garcia-Rojo
Journal:  Biomolecules       Date:  2021-05-25

Review 10.  A Systematic Review of Machine Learning Techniques in Hematopoietic Stem Cell Transplantation (HSCT).

Authors:  Vibhuti Gupta; Thomas M Braun; Mosharaf Chowdhury; Muneesh Tewari; Sung Won Choi
Journal:  Sensors (Basel)       Date:  2020-10-27       Impact factor: 3.576

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.