| Literature DB >> 33286642 |
Abstract
In this research, we develop ordinal decision-tree-based ensemble approaches in which an objective-based information gain measure is used to select the classifying attributes. We demonstrate the applicability of the approaches using AdaBoost and random forest algorithms for the task of classifying the regional daily growth factor of the spread of an epidemic based on a variety of explanatory factors. In such an application, some of the potential classification errors could have critical consequences. The classification tool will enable the spread of the epidemic to be tracked and controlled by yielding insights regarding the relationship between local containment measures and the daily growth factor. In order to benefit maximally from a variety of ordinal and non-ordinal algorithms, we also propose an ensemble majority voting approach to combine different algorithms into one model, thereby leveraging the strengths of each algorithm. We perform experiments in which the task is to classify the daily COVID-19 growth rate factor based on environmental factors and containment measures for 19 regions of Italy. We demonstrate that the ordinal algorithms outperform their non-ordinal counterparts with improvements in the range of 6-25% for a variety of common performance indices. The majority voting approach that combines ordinal and non-ordinal models yields a further improvement of between 3% and 10%.
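The objective-based information gain measure mentioned in the abstract selects splitting attributes using an entropy in which classes contribute according to objective-derived weights. The record does not reproduce the paper's exact definition, so the per-class weighting below (`class_weights`) is an illustrative assumption, a minimal sketch rather than the published formula:

```python
import numpy as np

def objective_based_entropy(labels, class_weights):
    """Entropy in which each class's contribution is scaled by an
    objective-derived weight (hypothetical form for illustration)."""
    classes, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()                    # empirical class probabilities
    w = np.array([class_weights[c] for c in classes])
    return float(-np.sum(w * p * np.log2(p)))

def obig(parent, splits, class_weights):
    """Objective-based information gain: parent entropy minus the
    size-weighted entropy of the child partitions."""
    n = len(parent)
    weighted_children = sum(len(s) / n * objective_based_entropy(s, class_weights)
                            for s in splits)
    return objective_based_entropy(parent, class_weights) - weighted_children
```

With uniform weights this reduces to the classical information gain; non-uniform weights let the split criterion penalize impurity more heavily in classes whose misclassification is costly.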
Keywords: AdaBoost; COVID-19; decision trees; ensemble algorithms; epidemic; information gain; objective-based entropy; ordinal classification; random forest
Year: 2020 PMID: 33286642 PMCID: PMC7517475 DOI: 10.3390/e22080871
Source DB: PubMed Journal: Entropy (Basel) ISSN: 1099-4300 Impact factor: 2.524
Figure 1. The meta-code of the proposed ordinal random forest classifier based on the objective-based information gain (OBIG) measure.
Figure 2. The meta-code of the proposed ordinal AdaBoost classifier based on the OBIG measure.
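Figures 1 and 2 present the authors' meta-code, which is not reproduced in this record. As a rough sketch of the overall shape of such an ensemble, the class below builds a bootstrap forest whose votes are combined by the median class index, an ordinal-aware aggregation. It uses scikit-learn's standard Gini-based trees, since the OBIG splitting criterion is not available there, so this is an approximation rather than the authors' algorithm:

```python
import numpy as np
from sklearn.tree import DecisionTreeClassifier

class OrdinalVotingForest:
    """Bootstrap ensemble of decision trees whose votes are aggregated
    by the median class index. Illustrative sketch only: the paper's
    trees split on the OBIG measure, not on Gini impurity."""

    def __init__(self, n_trees=100, random_state=0):
        self.n_trees = n_trees
        self.rng = np.random.default_rng(random_state)
        self.trees = []

    def fit(self, X, y):
        X, y = np.asarray(X), np.asarray(y)
        self.trees = []
        for _ in range(self.n_trees):
            idx = self.rng.integers(0, len(X), len(X))   # bootstrap sample
            tree = DecisionTreeClassifier(
                max_features="sqrt",
                random_state=int(self.rng.integers(1 << 30)))
            tree.fit(X[idx], y[idx])
            self.trees.append(tree)
        return self

    def predict(self, X):
        votes = np.stack([t.predict(X) for t in self.trees])  # (n_trees, n_samples)
        # Median of the ordinal class indices, rounded back to a valid level
        return np.median(votes, axis=0).round().astype(int)
```

Taking the median rather than the mode respects the ordering of the levels: when votes are spread across adjacent growth-factor levels, the prediction lands between them instead of jumping to whichever extreme happens to be most frequent.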
Performance measures of CART models for the classification of the daily growth factor.
| Model | Performance Measures for Classification | | | Performance Measures for Ordinal Classification | |
|---|---|---|---|---|---|
| CART | 0.361 | 0.379 | 0.493 | 1.537 | −0.079 |
| Ordinal CART–OBE(…) | 0.391 | 0.389 | — | 1.684 | −0.011 |
| Ordinal CART–OBE(…) | 0.366 | 0.358 | 0.518 | — | 0.068 |
| Ordinal CART–OBE(…) | 0.385 | 0.389 | 0.526 | 1.274 | — |
| Ordinal CART–OBE(…) | 0.409 | 0.442 | 0.535 | 1.316 | 0.016 |
Figure 3. Comparison of the AUC values (y-axis) for the two best ordinal CART models vs. non-ordinal CART as a function of the growth factor level (x-axis).
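A per-level AUC comparison of the kind plotted in Figures 3 and 4 can be obtained with a one-vs-rest reduction. The helper below assumes a classifier that outputs one probability column per growth-factor level; the paper's exact evaluation protocol is not given in this record:

```python
import numpy as np
from sklearn.metrics import roc_auc_score

def per_level_auc(y_true, proba, levels):
    """One-vs-rest AUC for each ordinal level.

    proba[:, k] holds the predicted probability of levels[k]."""
    y_true = np.asarray(y_true)
    aucs = {}
    for k, level in enumerate(levels):
        y_bin = (y_true == level).astype(int)
        # AUC is undefined when a level never occurs (or always occurs)
        aucs[level] = (roc_auc_score(y_bin, proba[:, k])
                       if 0 < y_bin.sum() < len(y_bin) else None)
    return aucs
```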
Performance measures for the classification of the daily growth factor using ordinal and non-ordinal AdaBoost models.
| Model | Performance Measures for Classification | | | Performance Measures for Ordinal Classification | |
|---|---|---|---|---|---|
| AdaBoost | 0.380 | 0.411 | 0.507 | 1.253 | 0.006 |
| Ordinal AdaBoost–OBE(…) | 0.377 | 0.381 | 0.504 | 1.56 | −0.093 |
| Ordinal AdaBoost–OBE(…) | — | — | — | — | — |
| Ordinal AdaBoost–OBE(…) | 0.405 | 0.484 | 0.538 | 1.242 | 0.072 |
| Ordinal AdaBoost–OBE(…) | 0.447 | 0.474 | 0.563 | 1.221 | 0.122 |
Performance measures for the classification of the daily growth factor using ordinal and non-ordinal random forest models.
| Model | Performance Measures for Classification | | | Performance Measures for Ordinal Classification | |
|---|---|---|---|---|---|
| Random forest (RF) | 0.405 | 0.400 | 0.540 | 1.421 | 0.035 |
| Ordinal RF–OBE(…) | — | — | — | — | — |
| Ordinal RF–OBE(…) | 0.407 | 0.453 | 0.559 | 1.400 | 0.126 |
| Ordinal RF–OBE(…) | 0.425 | — | 0.557 | 1.211 | 0.131 |
| Ordinal RF–OBE(…) | 0.437 | 0.442 | 0.555 | 1.411 | 0.026 |
Figure 4. Comparison of the AUC values (y-axis) for the best ordinal AdaBoost and random forest classifiers vs. their conventional counterparts as a function of the growth factor level (x-axis).
Paired t-test results for the significance of the difference in the predictions of the ordinal classifiers and their non-ordinal counterparts (ordinal CART vs. CART; ordinal AdaBoost vs. AdaBoost; ordinal random forest vs. random forest).
| Paired t-test (p-value) | Ordinal CART–OBE | Ordinal AdaBoost–OBE(…) | Ordinal RF–OBE(…) |
|---|---|---|---|
| Non-ordinal counterpart | 0.14 | 0.0057 | 0.0015 |
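A paired t-test of this kind can be reproduced with `scipy.stats.ttest_rel`. The per-instance error vectors below are made-up illustrative numbers, not the paper's data:

```python
import numpy as np
from scipy import stats

# Hypothetical per-instance absolute errors for an ordinal model and its
# non-ordinal counterpart on the same test instances (illustrative only).
err_ordinal    = np.array([1, 0, 2, 1, 0, 1, 1, 0, 2, 1])
err_nonordinal = np.array([2, 1, 2, 1, 1, 2, 1, 1, 2, 2])

# Paired test: the two error vectors are matched instance by instance
t_stat, p_value = stats.ttest_rel(err_ordinal, err_nonordinal)
```

A small p-value, as in the AdaBoost and random forest columns of the table, indicates that the ordinal model's per-instance errors differ significantly from those of its non-ordinal counterpart.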
Figure 5. Error distribution of ordinal CART and non-ordinal CART.
Figure 6. Error distribution of ordinal AdaBoost and non-ordinal AdaBoost.
Figure 7. Error distribution of ordinal random forest and non-ordinal random forest.
Performance measures of the best ordinal classifiers in comparison to eight popular non-ordinal classifiers.
| Model | Performance Measures for Classification | | | Performance Measures for Ordinal Classification | |
|---|---|---|---|---|---|
| Naïve Bayes | 0.246 | 0.305 | 0.478 | — | −0.065 |
| Logistic regression | 0.453 | 0.505 | 0.560 | 1.189 | 0.060 |
| Gradient boosting | 0.347 | 0.356 | 0.481 | 1.611 | −0.117 |
| XGBoost | 0.378 | 0.389 | 0.506 | 1.558 | −0.077 |
| K-nearest neighbor | 0.433 | 0.453 | 0.543 | 1.305 | 0.013 |
| AdaBoost | 0.380 | 0.411 | 0.507 | 1.253 | 0.006 |
| Random forest | 0.405 | 0.400 | 0.540 | 1.421 | 0.035 |
| CART | 0.361 | 0.379 | 0.493 | 1.537 | −0.079 |
| Ordinal CART–OBE(…) | 0.409 | 0.442 | 0.535 | 1.316 | 0.016 |
| Ordinal AdaBoost–OBE(…) | — | — | — | — | — |
| Ordinal RF–OBE(…) | 0.439 | 0.484 | — | 1.147 | — |
Comparison of global performance measures for the majority voting ensemble approach and the best individual classifiers (non-ordinal and ordinal).
| Performance Measure | Majority Voting Model Based on Ordinal and Non-Ordinal Classifiers | Best Non-Ordinal Classifier: Logistic Regression | Best Ordinal Classifier: Ordinal AdaBoost Based on OBE |
|---|---|---|---|
| — | — | 0.453 | 0.475 |
| — | — | 0.505 | 0.526 |
| — | — | 0.560 | 0.578 |
| — | — | 1.189 | 1.137 |
| — | — | 0.060 | 0.016 |