Ling-Yang He1, Hui Li1, Jian-Wu Bi1, Jing-Jing Yang2, Qing Zhou3. 1. College of Tourism and Service Management, Nankai University, Tianjin 300350, China. 2. Macao Institute for Tourism Studies, Colina de Mong-Há, Macau 999078, China. 3. School of Management, Hangzhou Dianzi University, Hangzhou 310018, China.
Abstract
This paper proposes a new foresight approach to estimate the impact of public health emergencies on hotel demand. The forecasting-based influence evaluation consists of four modules: decomposing hotel demand before an emergency, matching each decomposed component to a forecasting model, combining the predictions as the expected demand after the emergency, and estimating the impact by comparing actual demand against that predicted. The method is applied to analyze the impact of COVID-19 on Macao's hotel industry. The empirical results show that: 1) the new approach accurately estimates COVID-19's impact on hotel demand; 2) the seasonal and industry development components contribute significantly to the estimate of expected demand; 3) COVID-19's impact is heterogeneous across hotel services.
This paper proposes a new foresight approach to estimate the impact of public health emergencies on hotel demand. The forecasting-based influence evaluation consists of four modules: decomposing hotel demand before an emergency, matching each decomposed component to a forecasting model, combining the predictions as the expected demand after the emergency, and estimating the impact by comparing actual demand against that predicted. The method is applied to analyze the impact of COVID-19 on Macao's hotel industry. The empirical results show that: 1) the new approach accurately estimates COVID-19's impact on hotel demand; 2) the seasonal and industry development components contribute significantly to the estimate of expected demand; 3) COVID-19's impact is heterogeneous across hotel services.
A public health emergency can bring tourism to a standstill. For example, due to the restrictive measures on population mobility after the outbreak of COVID-19, tourism - an industry largely characterized by the congregation of people and essentially only ever an offline form of consumption - has been seriously affected, with a sharp decline in the demand (Sharma, Shin, Santa-María, & Nicolau, 2021). The highly contagious nature of COVID-19 has made tourists more cautious about travel. As a related sector of the tourism industry, hotels also suffer from huge operating losses owing to this pandemic. For example, early in the current pandemic, Macao's hotel occupancy rate in February 2020 fell to below 15%, and the actual gross domestic product of Macao as a whole plunged about 50% year-on-year in the first quarter of 2020 (Liu, Wang, McCartney, & Wong, 2021). As Jin, Qu, and Bao (2019) pointed out, investigating the effect of crisis events helps to understand the impact and post-event recovery in a mixed political, social, and economic context. Hence, there is an urgent need to be able to produce accurate estimates of the impact of a public health emergency on hotel demand and how quickly it will recover after a pandemic.Qualitative and quantitative studies have examined the impact of public health emergencies on tourism and hospitality. The qualitative analysis focuses on interviewing individuals, which is suitable for impact analysis from a micro perspective (Bajrami et al., 2021). To evaluate the macro impact of public health emergencies, quantitative analysis is introduced, such as intervention analysis (Crespí-Cladera, Martín-Oliver, & Pascual-Fuster, 2021; Kuo, Chen, Tseng, Ju, & Huang, 2008; Polemis, 2021) and the event study approach (Chen, Jang, & Kim, 2007; Sharma & Nicolau, 2020). However, an appropriate control group is difficult to find for intervention analysis. Typically, in the context of public health emergencies, such as SARS and COVID-19, few regions can act as a suitable control group for intervention analysis, e.g. differences-in-differences (Polemis, 2021) and regression discontinuity (Deng, Hu, & Ma, 2019). Similarly, the event study approach is difficult to apply because one of its basic assumptions, regarding the effectiveness of the market (Peterson, 1989), is violated in the context of public health emergencies. Both supply and demand experience a substantial decline in this circumstance, and the cumulative abnormal return in an event study may not truly reflect the impact of the emergency. Thus, a new method particularly suited to estimate the impact of an emergency needs to be investigated, meeting the following characteristics of 1) providing a macro perspective, 2) no need to find a suitable control group, 3) not violating market assumptions, and 4) obtain more accurate estimation results.To meet the above four conditions, we propose a new forecasting-based influence evaluation method, involving adaptive and hybrid forecasting on the expected hotel demand in a virtual world without public health emergencies (the control group) and comparing expected demand against the real demand. It is necessary to restrain the virtual world without a pandemic from being too complex, and so some factors relevant to hotel demand, such as online search data, weather data, and holiday data, are not considered here. Additionally, as what we predict is the development trend of hotel demand without pandemics, the judgmental inputs are also not included, although they can reflect the levels of severity in terms of a pandemic's influence (Zhang, Song, Wen, & Liu, 2021). However, factors that can reflect the future economic status are valuable for the forecasting task (Jaipuria, Parida, & Ray, 2021). Therefore, an economic indicator is added into the model. The hotel demand before a public health emergency is decomposed into independent components with specific meanings and different frequencies via complete ensemble empirical mode decomposition (CEEMD). Then, because these components have different degrees of nonlinearity, each is matched with a suitable forecasting model. The model library contains four types of models popular in hotel demand forecasting: a statistical model, called the seasonal autoregressive integrated moving average with exogenous variables (SARIMAX); a deep learning model based on long short-term memory (LSTM); and two machine learning models, namely artificial neural network (ANN) and support vector regression (SVR). The expected hotel demand during an emergency is estimated by combining the predictions of different components. Ultimately, the impact of a public health emergency on hotel demand is evaluated by comparing the difference between actual demand and expected demand.The new method is applied to analyze the impact of COVID-19 on Macao's hotel industry to demonstrate its effectiveness. There are five principal empirical findings. 1) The new forecasting-based influence evaluation approach accurately estimates COVID-19's impact on hotel demand, and the proposed adaptive hybrid forecaster inside it performs better than a series of benchmarks. 2) The seasonal component and development trend make significant contributions to the estimation of hotel demand, respectively contributing more than 10% and 25%. 3) In the absence of COVID-19, the virtual demand for three- and five-star hotels grows, while that for two- and four-star hotels declines. 4) COVID-19's impact is heterogeneous for hotels with different star ratings: the impact on two-star hotels is the slightest, while that on five-star hotels is the most significant, followed by four-star hotels. 5) The recovery speed of the demand for hotels with different star ratings is also heterogeneous: five-star hotels recover the most rapidly, while two- and four-star hotels recover relatively slowly. Five-star hotels provide diverse services, including accommodation, entertainment, catering, among others, with marketing promotion strategies. This type of service is preferred by travelers after the pandemic. Accordingly, hotel managers should increase the diversity of their services to achieve a high standard in order to be more robust during a public health emergency.The rest of this paper is organised as follows: The next section presents related works on evaluating the impact of public health emergencies on tourism and hospitality and those mainstream forecasting methods. The following section sets out the methodology of the study, and briefly describes the modelling strategies for estimating the impact of a public health emergency with the proposed adaptive hybrid forecaster. Then, the impact of COVID-19 on Macao's hotel industry is used as a case study to investigate the effectiveness of the new forecasting approach. Finally, the contributions and implications are presented in the conclusion section.
Literature review
Impact analysis of crisis events on tourism and hospitality
Many works have investigated the impact of public health emergencies, especially the current COVID-19, on tourism and hospitality, qualitatively or quantitatively; they have utilized interviews (Chen, Demir, García-Gómez, & Zaremba, 2020; Kaushal & Srivastava, 2021; Shapoval et al., 2021), questionnaires (Bajrami et al., 2021; Chadee, Ren, & Tang, 2021; Foroudi, Tabaghdehi, & Marvi, 2021; Kim, Kim, & Lee, 2021; Sobaih, Elshaer, Hasanein, & Abdelaziz, 2021; Tu, Li, & Wang, 2021), and laboratory experiments (Bresciani, Ferraris, Santoro, Premazzi, & Viglia, 2021; Li, Yao, & Chen, 2021). Interviews are particularly useful for the construction and development of relevant theory and are better suited to small samples. However, the findings based on interviews apply only to a particular group, and it is not wise to generalize them. Questionnaires are a conventional and widely used quantitative research method that is better suited to larger samples than interviews. Nevertheless, this method can only measure variables within an established model, and so focus on the impact of relatively few variables, from a particular perspective, such as the employees (Bajrami et al., 2021). As for the laboratory experiment, its great advantage lies in the ability to control the influence of exogenous variables in order to verify causal relationships between variables. However, findings from laboratory experiments also lack generalizability to some extent. As Bresciani et al. (2021) point out, while the results of their study with laboratory experiment are robust to hotels in different European locations, the circumstance that US hotels are rather different from European hotels might affect tourists' perceptions.The generalizability of micro findings produced by using the above methods is limited. Therefore, to explore the macro impact of public health emergencies on tourism and hospitality, some researchers have chosen to use intervention analysis (Crespí-Cladera et al., 2021; Kuo et al., 2008; Polemis, 2021) and the event study approach (Chen et al., 2007; Sharma & Nicolau, 2020). Intervention analysis is regression-based and needs to meet strict assumptions (Box & Tiao, 1975). In such studies, the public health emergency is generally regarded as a dummy variable, which is introduced in a regression model to test whether the emergency causes changes in the development of tourism and hospitality. The impact is measured by the coefficient on the dummy variable (Zhang, Yu, Wang, & Lai, 2009). For example, Crespí-Cladera et al. (2021) used logit regression to identify which hospitality firms in Spain faced financial distress due to the COVID-19 disaster, and found that strong financial ability characterized those that survived. Looking at earlier pandemics and drawing on single-firm as well as panel data, Kuo et al. (2008) adopted the autoregressive moving average model and dynamic panel model to estimate the overall impact of SARS and avian flu on Asian countries. However, many regions will be affected by any given public health emergency, which leaves few comparable regions to act as a suitable control group in the commonly used forms of intervention analysis, such as differences-in-differences (Polemis, 2021) and regression discontinuity (Deng et al., 2019). Thus, the impact of public health emergencies evaluated by them may have systematic errors.The event study approach is a standard analytical approach in economics and management (Mackinlay, 1997). It can assess the impact of unforeseen events, such as public health emergencies, based on the cumulative abnormal return, which is defined as the sum of differences between the actual return and the estimated return, that is, the measure of abnormality caused by events. For example, Chen et al. (2007)) explored the impact of SARS on the stock value of hotels with an event study. The findings indicated that the earnings and share prices of seven listed hotels fell sharply during the SARS outbreak. Sharma and Nicolau (2020) assessed the impact of COVID-19 on the travel industry based on a market-based valuation method and found four sub-industries - hotels, airlines, cruise lines, and car rentals - had experienced a substantial fall in valuation. However, the foundation of event studies - the assumption of the effectiveness of the market - is violated in the context of a public health emergency (Peterson, 1989). The cumulative abnormal return calculated by an event study may not truly reflect the impact of that public health emergency, and the estimation results may be detrimentally affected.
Forecasting methods related to tourism and hospitality
There are also works that analyze the impact of unforeseen events on tourism and hospitality from the perspective of forecasting. For example, Qiu, Wu, Dropsy, Petit, and Ohe (2021) provided an assessment concerning the future development of tourism under the uncertainty surrounding COVID-19 based on a two-stage three scenario forecast framework. Wu, Hu, and Chen (2022) predicted the latest hotel occupancy rates in the context of the COVID-19 pandemic based on the mixed data sampling models (MIDAS). Zhang et al. (2021) combined econometric and judgmental indicators into their model to forecast the possible paths to tourism recovery in Hong Kong. As for the forecasting method, related works can be divided into single forecasting and combined forecasting. Single forecasting mainly includes three types. The first is regression analysis based on statistics, and the representatives are VAR (Wu, Cao, Wen, & Song, 2021) and ARMA family, i.e., ARIMA (Jin et al., 2019), SARIMA (Qiu et al., 2021), ARIMAX (Hu, Qiu, Wu, & Song, 2021), and SARIMAX (Song, Qiu, & Park, 2019; Wu, Song, & Shen, 2017). They are of better adaptability to linear data. The second refers to methods based on machine learning, such as SVR (Chen, Liang, Hong, & Gu, 2015) and ANN (Claveria, Monte, & Torra, 2015). They can deal with forecasting tasks with nonlinear demand data to some extent. The third is those methods based on deep learning. Wherein, commonly used models, e.g., LSTM (Bi, Liu, & Li, 2020) and CNN (Bi, Li, & Fan, 2021), can process demand data with strong nonlinearity. Deep learning methods show a universally well performance on a relatively large sample volume. As for combined forecasting, it integrated the advantages of multiple single models and can avoid the influence of excessive forecasting error of the single model (Song et al., 2019). Presently, mainstream combinations are serial and parallel. The serial forecast the original sequence firstly; then forecast the error; finally combines these two parts. The parallel decomposes the original sequence firstly; then predict each component separately; finally integrates the prediction results of all models. The integrated model is more stable and can generally achieve more accurate predictions than single models (Athanasopoulos, Song, & Sun, 2018). Thus, we construct our model based on the combination way.The ideas and methods of the above-mentioned forecasting works have important guiding values for our research, which convey a basic assumption on reasonable forecasting, i.e., the law of data generation is stable in a virtual world without the interference of emergencies. On the basis of this assumption, a new foresight approach is proposed by using a new adaptive hybrid forecaster to estimate the impact of public health emergencies on hotel demand. We combine statistical regression analysis and mainstream methods in artificial intelligence; then match each of them with the most suitable sequence to increase their adaptability; finally, confirm the practicability of this method by the empirical study on Macao's hotel demand under the surrounding of COVID-19.
Methodology
In this section, we propose a new forecasting-based influence evaluation method. Fig. 1
is a schematic diagram that briefly describes our methodology. Taking the COVID-19 as an example, we first divide the data before the occurrence of COVID-19 (Time < T
2) into three groups: training set (OT
0), validation set (T
0
T
1), and test set (T
1
T
2). We use the training set to train a model; use the validation set to optimize its parameters; and then make predictions on the test set. In the first sub-graph, the purple dotted line represents the prediction result of the model on the test set, which reflects its performance. If the predicted sequence is very close to the original sequence, the corresponding model will be selected. In the second sub-graph, we use the data corresponding to T
1
T
2 to update the parameters of the selected model and then predict the future development trend of hotel demand without COVID-19. Similarly, the dotted line denotes the future development trend of hotel demand without COVID-19, while the blue refers to the real demand after COVID-19.
Fig. 1
A schematic diagram for the methodology.
A schematic diagram for the methodology.Finally, the third sub-graph illustrates how to estimate the impact of COVID-19 on hotel demand, during a period, based on the predictive result. Wherein I
1 represents the impact of COVID-19 between T
2 and T
3; I
1 + I
2 represents the impact between T
2 and T
4; I
1 + I
2 + I
3 represents the impact between T
2 and T
5. Our methodology can also be used to assess the overall impact of COVID-19 if the point when COVID-19 is over comes. In the third sub-graph, the green dotted line indicates the future end of COVID-19. When that point comes, we can get the overall impact of COVID-19, that is, I
1 + I
2 + I
3 + I
4. The proposed method is based on a new adaptive hybrid forecaster (AHF), which features complete ensemble empirical mode decomposition and a model library that includes SARIMAX, LSTM, ANN, and SVR, and a difference-comparing modular. The methodology has four stages: a) decomposing hotel demand before the public health emergency into a series of components, b) matching each decomposed component with a suitable forecasting model, c) verifying the performance of models, and d) evaluating the impact of a public health emergency on hotel demand. The framework of the methodology is illustrated in Fig. 2
.
Fig. 2
The framework for evaluating the impact of public health emergencies on hotel demand.
The framework for evaluating the impact of public health emergencies on hotel demand.
Decomposing hotel demand before a public health emergency
The obvious nonlinear and seasonal characteristics affect the forecasting of hotel demand. Thus, it is helpful to decompose demand data in advance and then construct forecasting models by incorporating the decomposed components. Previous studies have used wavelet analysis to decompose demand data (Balli, Shahzad, & Uddin, 2018). Wavelet analysis lacks adaptability, and its performance relies on the selection of the wavelets. Compared with wavelet analysis, empirical mode decomposition (EMD) has good adaptability and is more suitable for the decomposition of nonlinear, nonstationary, and complex sequences (Xie, Qian, & Wang, 2020; Tuo & Zhang, 2020). However, a drawback named mode mixing exists for EMD: a decomposed component may comprise signals with different frequencies, or signals with the same frequency are in different components. Thus, we choose complementary ensemble empirical mode decomposition (CEEMD), an improved version of EMD, to decompose hotel demand as it can cope with mode mixing by adding white Gaussian noise to hotel demand before decomposing it (Yeh, Shieh, & Huang, 2010). We do not consider the role of factors such as online search data and weather data because these factors need to be forecast in advance. The forecasting errors would inevitably accumulate and eventually lead to a greater evaluation error. But, we introduce an exogenous factor - the month-on-month growth rate of forecasting value on the consumer price index - because it can reflect and judge future economic and hotel development in a virtual world without public health emergencies. According to Wong and Song (2006), the factor is an important macroeconomic variable that describes the hospitality stock indices. The detailed steps for this stage are as follows:
where n − 1 refers to the number of intrinsic mode functions; c
+ and c
− are the j-th intrinsic mode function decomposed respectively by DB
+(t) and DB
−(t); and r
+(t) and r
−(t) are the residues.Step 1: Add white noise ε
(t) (i = 1, 2, ...,
m) to hotel demand before a public health emergency, denoted DB(t), to form two sets of sequences:Step 2: Decompose DB
+(t) and DB
−(t) into intrinsic mode functions (IMFs) and residues (RES) by using the EMD method. The process is set out in detail by Zhang, Lai, & Wang (2008).Step 3: Generate the final intrinsic mode functions based on Eqs. (3), (4):Here, C
1(t),C
2(t), …,C
(t), and Res(t) are n components identified by using CEEMD. They are all independent and of specific meanings and different frequencies.Step 4: Analyze the features of each component, such as its average period, the correlation between each component and hotel demand, and the contribution of each component to hotel demand.
Matching each decomposed component with a suitable forecasting model
The decomposed components of hotel demand based on CEEMD have different degrees of nonlinearity and no forecasting model can deal with all of them effectively. To solve this, we built a library that contains four mainstream models: SARIMAX, LSTM, ANN, and SVR. The training principles and processes of these four models are set out in detail by, respectively, Wu et al. (2017), Bi et al. (2020), Palmer, Montano, and Sesé (2006), and Sun, Wei, Tsui, and Wang (2019). Owing to the limitations on space, they are not included here. These four models are different from each other, which helps to improve the stability of the hybrid forecasting of hotel demand. Based on the library, we match each component with the most appropriate model. Specifically, the components of hotel demand and the exogenous factor are divided into three groups: the training set, the validation set, and the test set. Then, samples from the training set are used to train the models and those from the validation set are used to optimize models' parameters. The steps for Stage 2 are as follows.Step 1: Divide hotel demand, its decomposed components, and exogenous factor (EF(t)) into training sets, validation sets, and test sets, as described in Table 1
.
Table 1
Description of variables.
Hotel demand
Decomposed components of hotel demand
Exogenous factor
Before a public health emergency (1 ≤ t ≤ tTe)
Training sets(1 ≤ t ≤ tTr)
DBTr(t)
C1Tr(t)
C2Tr(t)
...
Cn−1Tr(t)
ResTr(t)
EFTr(t)
Validation sets(tTr < t ≤ tVa)
DBVa(t)
C1Va(t)
C2Va(t)
...
Cn−1Va(t)
ResVa(t)
EFVa(t)
Test sets(tVa < t ≤ tTe)
DBTe(t)
C1Te(t)
C2Te(t)
...
Cn−1Te(t)
ResTe(t)
EFTe(t)
After a public health emergency(t > tTe)
DA(t)
–
–
...
–
–
EF(t)
Description of variables.Step 2: Randomly choose a combination of parameters, denoted as PC
1, and train four models based on {C
1
(t),
EF
(t)}, …, {C
(t),
EF
(t)}, and {Res
(t),
EF
(t)}.Step 3: Make predictions on hotel demand components C
1
(t), …, C
(t), and Res
(t) by using four trained models. As there are four models and n components, a matrix PC
1_F
containing 4 × n prediction results can be formed:Step 4: Analyze the forecasting performance. The root means square error (Eq. (12)) is used as the performance index, and the performance matrix, PC
1_P
, is:Step 5: Choose a new combination of parameters and repeat Steps 3 and 4 until all performance matrices (PC
1_P
, PC
2_P
, …,PC
_P
) are obtained. They are corresponding to different combinations of parameters (PC
1, PC
2, …,PC
).Step 6: Match each component of hotel demand with a suitable model. For the k-th component, the suitable model with the optimal parameters is corresponding to the minimum value in the k-th column of all performance matrices.
Verifying the performance of models for forecasting hotel demand before an emergency
The forecasting performance needs to be evaluated in advance. Thus, in this stage, we use those models selected in Stage 2 to make predictions on the test sets of all components, and then add them to obtain the prediction on the test set of hotel demand. Here, three indexes are adopted to evaluate the performance of our adaptive hybrid forecaster when forecasting hotel demand before an emergency: the mean absolute error (MAE), root mean square error (RMSE), and mean absolute percentage error (MAPE). We also take the following six models as benchmarks: exponential smoothing (ETS), SARIMA, SARIMAX, LSTM, ANN, and SVR. The information about ETS can refer to Chen, Li, Wu, and Shen (2019). In this stage, the training set and validation set of hotel demand are used to train these benchmarks and adjust their parameters, respectively. Then, the trained benchmarks with the optimal parameters are employed to predict the test set of hotel demand. The steps for Stage 3 are as follows:
where H is the number of samples; and x
and denote the h-th actual and predicted value respectively.Step 1: Train six models based on DB
(t), with a randomly given parameter combination, and then make predictions on the validation set DB
(t). Here, the matrix F
contains six prediction results:Step 2: Analyze the forecasting performance. The root means square error (Eq. (12)) is used as the performance index and the performance matrix P
is:Step 3: Change a parameter combination for the six models and repeat Steps 1 and 2 until the optimal parameter combination is discovered.Step 4: Make predictions of on DB
(t) by using the six benchmarks with optimal parameter combination.Step 5: Make predictions on components C
1
(t),C
2
(t), …,C
(t), and Res
(t) by using the corresponding models with optimal parameters selected in Stage 2 and add them to obtain the prediction on DB
(t).Step 6: Verify the performance of all models in forecasting hotel demand. Three performance indexes are used here, which are represented by Eqs. (12–14):
Analyzing the impact of a public health emergency
This stage investigates the impact of a public health emergency on hotel demand. To achieve a more accurate evaluation, the sequences before a public health emergency are used to train the adaptive hybrid forecaster to update its parameters. Then, the virtual demand for hotel without the public health emergency is predicted. The impact is evaluated by comparing the real demand after the emergency (the observed group) with the expected demand (the control group), and the degree of the impact is evaluated by the average value. Here, Fig. 1 is used again to illustrate how to calculate the impact and its degree. Assume that T
2 represents the month when COVID-19 occurred. T
3, T
4, and T
5 represent 1, 2, and 3 months after T
2, respectively. denotes the purple dotted line, and DA(t) denotes the green solid line. Then, the impacts for 3 months after the occurrence of COVID-19 are respectively: , , and . Its degree for 3 months after the occurrence of COVID-19 are I
1, (I
1 + I
2)/2, and (I
1 + I
2 + I
3)/3. The detailed steps for this stage are as follows.
where s takes a value of 1, 4, or 12 for yearly, quarterly, and monthly data, respectively.
where N is the number of recovery rates. Relevant statistical tests are also introduced as an additional description in this step.Step 1: Combine the training set and validation set of each component in Stage 2 to retrain the selected model of this component, and then use the test set in Stage 2 to update their optimal parameters.Step 2: Make a prediction of the virtual hotel demand after an emergency (the control group). The prediction result can be regarded as demand without emergencies.Step 3: Analyze the impact of an emergency. The impact of an emergency, I(t), can be described as the difference between the real demand DA(t) and the forecast demand . That is:Step 4: Analyze the degree of the impact, which is considered as the average value of I(t). Then, relevant statistical tests are introduced as an additional description for this step.Step 5: Analyze the recovery rate after the emergency. The recovery rate, RE(t), is calculated based on the following growth rate:Step 6: Analyze the average recovery rate after the emergency. The average recovery rate, ARE, is calculated as:
Experimental study
The COVID-19 pandemic has led to operating losses for hotels. As a city known for its tourism and service industries, Macao has been badly affected by this public health emergency. In this section, the new method is applied in the analysis of the impact of the current pandemic on hotel demand in Macao.
Data on Macao's hotel demand
The information on demand for Macao's hotels with ratings from two to five stars can be retrieved from the official website of Macao tourism department. As for the exogenous factor, we use the month-on-month growth rate of forecasting value on the consumer price index (CPI). The index, according to Wong and Song (2006), is an important macroeconomic variable that describes the hospitality stock indices, so its forecasting value reflects the future economic development in a virtual world without public health emergencies. The data on the index can be found in the iFinD database (http://www.51ifind.cn/). The sample set of this work comprises the monthly data from June 2012 to March 2021 (106 monthly values in total) and the hotel demand in Macao are divided according to its star ratings (two to five stars), resulting in 424 monthly values. Fig. 3
depicts the exogenous factors across the study period, and Fig. 4
plots the hotel demand separately for the four sets of star ratings. After the outbreak of COVID-19, demand dropped sharply in 2020. To find the most accurate estimation function for expected demand without the pandemic, for the separate samples of hotels with each star rating, samples before 2020 (i.e., from 1 to 91) are used as the training set, validation set, and test set. The lengths of the validation and test sets are both set to 12 months. Specifically, samples before 2018 are used as the training set (i.e., from 1 to 67); samples from January 2018 to December 2018 (i.e., from 68 to 79) are the validation set; and samples from January 2019 to December 2019 (i.e., from 80 to 91) are the test set.
Fig. 3
The exogenous factors (forecasting value on the consumer price index) from June 2012 to March 2021.
Fig. 4
The monthly demand for hotels with different star ratings, June 2012 to March 2021.
The exogenous factors (forecasting value on the consumer price index) from June 2012 to March 2021.The monthly demand for hotels with different star ratings, June 2012 to March 2021.
Processes of decomposition of hotel demand before the emergency
Fig. 5 demonstrates the components derived by CEEMD from the demand separately for hotels with different star ratings. The parameters for CEEMD are set in a basic way: the number of repetitions is set to 100, and the standard deviation of white noise in each repetition is set to 0.2. As depicted in Fig. 5, IMFs indicate the different frequency components in hotel demand, and the RES is the component that slowly varies around the long-term average, which is essentially the average trend (Zhang et al., 2009). To analyze the characteristics and meanings of these components, we adopted the following measures: 1) the average period of each component of hotel demand is calculated, which is defined as the value derived by dividing the total number of points by the number of peaks for each component (Zhang, Lai, & Wang, 2008); 2) the correlation between each component of hotel demand and the original demand data was measured using the Pearson correlation coefficient (Xu, Tang, He, & Man, 2016); 3) the contribution of each component to general hotel demand is further analyzed. Considering that the components of hotel demand in a category are independent of each other (Zhang et al., 2008), we use the percentage of variance to explain the contribution. Table 2
shows the results and we can draw the following findings.
Fig. 5
The decomposition results of demand for hotels with different star ratings, June 2012 to March 2021.
Table 2
The measurement results for the components of demand for hotels with different star ratings.
Two-star hotels
Three-star hotels
Mean period(month)
Pearsoncorrelation
Std
Std as % of(ΣIMFs + RES)
Mean period(month)
Pearsoncorrelation
Std
Std as % of(ΣIMFs + RES)
IMF1
3.68
0.51
2020.90
28.31%
2.56
0.17
4610.98
11.59%
IMF2
8.36
0.43
1524.60
21.61%
5.75
0.16
2588.88
6.51%
IMF3
15.33
0.63
1552.38
21.75%
13.14
0.02
2359.53
5.93%
RES
–
0.66
2022.45
28.33%
–
0.97
30,214.97
75.97%
Sum
–
–
–
100.00%
–
–
–
100.00%
Note: the mean period of RES is not provided as it reflects the long-term development trend of hotel demand.
The decomposition results of demand for hotels with different star ratings, June 2012 to March 2021.The measurement results for the components of demand for hotels with different star ratings.Note: the mean period of RES is not provided as it reflects the long-term development trend of hotel demand.
The mean periodicity of components
For all hotels with a given star rating, the average period from IMF1 to IMF3 gradually increases. IMF1 and IMF2 are close to 3-month and 6-month periods, respectively. Thus, they can be regarded as seasonal and semi-annual periodic components, respectively. The average period of IMF3 is more than one year and can be regarded as the yearly pattern. According to Rosselló and Sansó (2017) and Li, Ge, Liu, and Zheng (2020), the longest seasonal periodic in tourism demand seems to be the annual, so here we get three IMFs. As for RES, it is the demand component that slowly varies around the long-term average and can be considered as reflecting the development trend more generally. In the short term, the hotel demand has obvious seasonality. In the long run, the hotel industry is affected by the economic situation and the changing preferences of tourists. As RES indicates, the demand for three-star and five-star hotels has increased continuously over recent years, whereas demand for four-star hotels fell after an initial rise and the demand for two-star hotels experienced an overall decline.
The contributions of the different components of hotel demand
The RES component has a high contribution to hotel demand (reaching more than 25%), which means that the development trend plays a key role in hotel demand. For all hotels with a given star rating, the component IMF1 is found to make a higher contribution than the corresponding low-frequency components, i.e., IMF2 and IMF3, whose contribution to the original hotel demand reaches more than 10%. This finding is different from the findings for other industries, where the low-frequency component generally contains more information than the high-frequency component (Zhang et al., 2008). Here, IMF1 (the seasonal periodic component) contains more information than IMF2 (the semi-annual periodic component) and IMF3 (the yearly periodic component). In summary, the components of seasonal periodicity and the general development trend mainly determine demand within the hotel industry.
Matching each component of hotel demand with a forecasting model
Since the various components of hotel demand have varying degrees of nonlinearity, it is wise to select the most suitable forecasting model for each component from a model library. The library contains SARIMAX, ANN, SVR, and LSTM. Before selecting models, the parameters need to be set. For these four models, the key parameters (those with the greatest impact on their performance) are mainly considered, while the remaining parameters are set in a default manner.In this experiment, there are six parameters for SARIMAX: the order for differences (d), the order for auto-regressive process (p), the order for moving average process (q), the order for seasonal differences (D), the order for seasonal auto-regressive process (P), and the order for seasonal moving average process (Q). There are two parameters for LSTM: time steps and number of units. There are two parameters for ANN: size of input layers and size of hidden layers. Finally, there are two parameters for SVR: kernel types and number of features. To obtain suitable parameters, the training set is used to train the forecasting models, and the validation set is used to select the optimal parameter combinations. In other words, models with different parameter combinations are used to predict samples in the validation set, and then the parameter combination corresponding to the minimum RMSE is selected as the optimal parameter. As for the optimization method, the parameters of SARIMAX are determined by statistical testing, while the parameters of the other three models are determined by the exhaustive grid-search technique (Cho, 2003). The ranges of the corresponding parameters of the four models are shown in Table 3
. Considering that the training processes for LSTM and ANN involve a certain degree of randomness, different prediction results may be obtained for LSTM and ANN with the same parameters. Therefore, we run the LSTM and ANN five times for each parameter and take the average of the five RMSEs as the evaluation index.
(time steps∈{1, 2, …, 10}; number of units∈{5, 10, …, 50}).
ANN
(size of input layers∈{1, 2, …, 10}; size of hidden layers∈{2, 4, …, 20}).
SVR
(kernel types∈{1 = ‘gaussian’, 2 = ‘linear’, 3 = ‘polynomial’}; number of features∈{1, 2, …, 10}).
The range of parameters for the four models.Here, RES for two-star hotels is taken as an example to illustrate the procedure for parameter selection. Fig. 6
presents the performance of LSTM with different parameter combinations on RES for two-star hotels. In Fig. 6, each small square corresponds to a combination of parameters, and the number in the square represents the average value of five RMSEs. For the RES of two-star hotels, LSTM can obtain higher prediction accuracy when the time step is 1 and 2, which means that the development trend for two-star hotels is more short-term dependence. When the number of units and time steps are respectively 40 and 2, LSTM achieves the best performance on the validation set (the average of five RMSEs is only 249.2). Using the same method, we obtained the optimal parameter combinations of the four models corresponding to all components of demand for hotels with different stars. Given that RES needs to be differentiated multiple times to be stable, we take the natural logarithm of RES when training the SARIMAX model and then restore the prediction results to reduce data loss. The results are presented in Table 4
.
Fig. 6
The performance of LSTM on the RES of two-star hotels under different parameter combinations.
Table 4
The models with optimal parameter combinations corresponding to the total 16 components of hotels demand with the four different star ratings, January 2018 to December 2018.
SARIMAX
LSTM
ANN
SVR
Selected model
Optimal parameters
RMSE
Optimal parameters
RMSE
Optimal parameters
RMSE
Optimal parameters
RMSE
Two-starhotels
IMF1
(2,0,1)(0,0,0)
2641.80
(1,20)
2303.05
(2,6)
2247.14
(1,8)
2292.26
ANN
IMF2
(2,0,2)(0,0,0)
681.51
(1,50)
535.21
(2,2)
712.94
(2,2)
710.37
LSTM
IMF3
(4,0,1)(0,0,0)
443.17
(8,5)
1511.45
(7,2)
757.27
(1,4)
1487.15
SARIMAX
RES
(0,2,0)(0,0,0)
57.68
(2,40)
174.73
(4,4)
77.88
(3,1)
736.56
SARIMAX
Three-starhotels
IMF1
(0,0,5)(0,0,0)
5313.05
(5,35)
4215.18
(3,2)
4343.20
(3,8)
3586.26
SVR
IMF2
(2,0,2)(0,0,0)
1649.16
(3,20)
2523.85
(9,14)
1638.57
(1,3)
1274.82
SVR
IMF3
(4,0,1)(0,0,0)
1464.84
(4,45)
442.40
(4,4)
558.00
(1,6)
515.87
LSTM
RES
(0,2,0)(0,0,0)
1290.61
(2,40)
590.49
(5,4)
1332.78
(2,1)
3520.99
LSTM
Four-starhotels
IMF1
(5,0,0)(0,0,0)
11,690.59
(3,35)
10,202.66
(1,4)
10,076.08
(1,1)
10,172.70
ANN
IMF2
(4,0,1)(0,0,0)
4787.16
(10,5)
4112.00
(9,6)
3123.74
(3,9)
4084.35
ANN
IMF3
(4,0,1)(0,0,0)
1813.10
(9,30)
1433.95
(1,10)
2608.28
(1,2)
868.54
SVR
RES
(0,2,0)(0,0,0)
1588.01
(4,35)
833.79
(1,8)
1228.61
(2,3)
2652.20
LSTM
Five-starhotels
IMF1
(3,0,1)(0,0,0)
16,861.49
(2,40)
15,939.98
(2,4)
16,364.05
(3,5)
15,844.90
SVR
IMF2
(4,0,1)(0,0,0)
8017.17
(2,10)
5994.77
(3,2)
6243.48
(1,5)
6664.49
LSTM
IMF3
(5,0,0)(0,0,0)
7202.25
(5,5)
6343.40
(4,8)
5514.25
(2,1)
5849.88
ANN
RES
(0,2,0)(0,0,0)
847.53
(1,45)
5420.72
(1,2)
7211.79
(2,10)
9551.09
SARIMAX
The performance of LSTM on the RES of two-star hotels under different parameter combinations.The models with optimal parameter combinations corresponding to the total 16 components of hotels demand with the four different star ratings, January 2018 to December 2018.As shown in Table 4, no model is suitable for predicting all components of hotel demand. These four models have different performances on the 16 components (the bold means the best performance of all models for each component). LSTM performs best overall and is selected five times. ANN and SVR rank second overall and are selected four times respectively. SARIMAX have average performance, only being selected three times. LSTM, especially the one with many layers, has good fitting capabilities. However, because the structure of Macao's monthly hotel demand is not overly intricate, a shallow LSTM (here, the number of layers is 1) can also achieve better performance on the training set. Furthermore, the complex generation process of demand data can also be simplified by decomposition, which also reduces the sample volume requirement of LSTM. Machine learning models, represented by ANN and SVR, are sensitive to sample size and can also adapt to nonlinear data. Thus, their performance is not as good as that of LSTM. SARIMAX has a low requirement on sample size, but it cannot fit nonlinear data well. The adaptive hybrid forecaster is expected to provide more accurate predictions than single models as it can take full advantage of each of them.
The performance of the adaptive hybrid forecaster in estimating demand before the emergency
This section presents the performance of the adaptive hybrid forecaster in forecasting hotel demand before COVID-19. In Table 4, each component of hotel demand selects the most suitable forecasting model from the model library with the optimal parameter combination. To explore the performance of the adaptive hybrid forecaster, we make predictions on the test set of the 16 components using the models and parameter combinations given in Table 4, and then integrate the results to obtain the final forecasts of demand for hotels with different star ratings. We also make predictions by using the same samples with the six benchmark models. The advantages of the adaptive hybrid forecaster are illustrated by comparing its forecasting results with those of benchmarks. Their optimal parameters are also determined with the exhaustive grid-search technique.Fig. 7 depicts the performance of all models on the test set of demand for hotels with different star ratings before COVID-19. Compared with the benchmarks, the adaptive hybrid forecaster more accurately follows the trend and makes better predictions. Taking two-star hotels as an example, the adaptive hybrid forecaster not only tracks the development trend in the demand data but also captures the irregular fluctuations, at least to a certain extent. In contrast, the benchmarks cannot follow well the general demand trends and overestimate demand for two-star hotels. Furthermore, the adaptive hybrid forecaster seems to find a balance between volatility and trend. Owing to the sharp fluctuations in the hotel demand data, the adaptive hybrid forecaster places more emphasis on following trends to reduce the error of evaluation as much as possible, which is an advantage. To further compare the performance of the models, three indicators based on Eqs. (12), (13), (14) are calculated and presented in Table 5 (the bold means the best performance and the underlined represents the second-best). The adaptive hybrid forecaster performs better than the benchmarks. For two-, three-, four-, and five-star hotels, the MAPE values of the adaptive hybrid forecaster are lower than those of the best benchmark model, with gaps of 7.03%, 0.74%, 0.19%, and 1.15%, respectively. Therefore, according to the results presented in Table 5, it can be considered that the adaptive hybrid forecaster proposed in this paper accurately evaluates demand before the emergency for hotels with different star ratings.
Fig. 7
The performance of all models on the test set of hotel demand before the emergency.
Table 5
The performance of all models on the test set of hotel demand with different star ratings before the emergency.
Models
The demand for two-star hotels
The demand for three-star hotels
MAE
RMSE
MAPE (%)
MAE
RMSE
MAPE (%)
ETS
3839.09
5012.77
21.84
7514.29
8688.63
5.65
SARIMA
5226.25
6071.21
29.71
8530.12
9507.09
6.38
SARIMAX
4622.41
5344.73
26.06
8736.02
9688.27
6.53
SVR
4707.14
6321.21
27.38
8438.84
9293.34
6.27
ANN
4385.18
5765.56
25.11
18,304.60
23,752.26
12.91
LSTM
5218.48
6187.18
29.31
12,235.43
14,943.72
9.33
AHF
2830.77
3355.92
14.81
6725.56
7702.35
4.91
The performance of all models on the test set of hotel demand before the emergency.The performance of all models on the test set of hotel demand with different star ratings before the emergency.
The impact of COVID-19 on hotel demand, differentiated by star ratings
In this section, the adaptive hybrid forecaster is used to evaluate the impact of COVID-19 on the demand for hotels with different star ratings. To explore the impact of the current pandemic, we use the hotel demand before the public health emergency to train and adjust the parameters used in the adaptive hybrid forecaster, and then forecast the expected hotel demand for the virtual case where COVID-19 did not emerge. Accordingly, we have also updated the parameter combinations of the models selected in Table 5 to obtain more accurate predictions. Specifically, we use the samples from months 1 to 79 to retrain the selected models, and use the samples from months 80 to 91 to update their parameters. The impact of the pandemic is then taken to be the difference between the actual demand and that forecast for the same period but without the emergence of COVID-19.Fig. 8 describes the actual and expected demand for hotels with different star ratings. Fig. 9
depicts the impact of the pandemic on the four categories of hotels. Firstly, without the pandemic, the trend in future demand for two- and four-star hotels declines, while the demand for three- and five-star hotels increases. Secondly, the impact of COVID-19 is heterogeneous for hotels with different star ratings: its impact on two-star hotels is relatively low, while that on five-star hotels is the most serious, followed by four-star hotels and three-star hotels. In other words, the impact of the current public health emergency accords with the following sequence: five-star hotels > four-star hotels > three-star hotels > two-star hotels. We also confirm the robustness of the finding based on statistical tests. Considering that the impacts of COVID-19 on hotels with different star ratings may be relevant, we choose Friedman's test to verify this finding. Friedman's test can make full use of the information contained in relevant samples (Pahnehkolaei, Alfi, & Machado, 2021). Fig. 11 presents the results of Friedman's test, in which rank represents the degree of impact. It can be seen that the results obtained by Friedman's test confirm the findings.
Fig. 8
Hotel demand and the expected demand without COVID-19 from using the foresight method.
Fig. 9
The impact of the COVID-19 pandemic on hotel demand.
Fig. 11
Friedman's test for the impact of the outbreak of COVID-19 on hotel demand.
Hotel demand and the expected demand without COVID-19 from using the foresight method.The impact of the COVID-19 pandemic on hotel demand.We also analyze the speed of recovery in hotel demand after the emergency. In Fig. 10
, the solid line reflects the rate recovery of hotel demand and the dotted line shows its average value. Since there was a gap between the emergence of COVID-19 and its impact on the hotel industry, the recovery rate corresponding to January 2020 in Fig. 10 is regarded as noise and hence is discarded. As depicted in Fig. 10, the recovery in hotel demand after the pandemic has obvious volatility. This is caused by the seasonal characteristics of the hotel industry, as the components with seasonal periodicity are important for hotel demand, as shown in Table 2. The speed of recovery in the demand for hotels with different star ratings is distinct: five- and three-star hotels have the fastest recovery, while two- and four-star hotels recover relatively slowly. Again, this finding is confirmed by Friedman's test (Fig. 12).
Fig. 10
The recovery rate of hotel demand after the COVID-19 pandemic.
Fig. 12
Friedman's test for the recovery rate of hotel demand after the outbreak of COVID-19.
The recovery rate of hotel demand after the COVID-19 pandemic.Friedman's test for the impact of the outbreak of COVID-19 on hotel demand.Friedman's test for the recovery rate of hotel demand after the outbreak of COVID-19.
Conclusions
To evaluate the macro impact of a public health emergency on hotel demand with an effective control group and without violating the assumption of market effectiveness, we propose a new forecasting-based influence evaluation method, and apply it in the analysis of the impact of COVID-19 on Macao's hotel industry. The main findings are as follows. 1) We find that the new approach performs well in evaluating the impact of the public health emergency on hotel demand, and the new adaptive hybrid forecaster inside it outperforms a series of benchmark models in estimating hotel demand. Either the most affected hotel category (the hotel star in the present study) or that recovering the most rapidly can be identified with the proposed new method. 2) Before the onset of the public health emergency, Macao's hotel industry was mostly influenced by the general development trend and the seasonal periodicity characteristics of hotel demand; these respectively contributed more than 25% and 10% to overall hotel demand. 3) In the virtual world without the pandemic, the expected demand for three- and five-star hotels grows rapidly, while that for two- and four-star hotels declines. 4) In the real word, the impact of a pandemic on hotel industry exhibits heterogeneity across the four sets of hotels with different star ratings. The impact on five-star hotels is the most significant, followed by that on four-star hotels, while that on two-star hotels was the least. The speed of recovery in hotel demand also shows heterogeneity across the services. Five-star hotels recover the most rapidly, while two- and four-star hotels recover relatively slowly. Since the outbreak of COVID-19, travelers have preferred hotels with diverse high-quality services.Our work makes three key contributions. First, it supplements the current research on the macro impact of a public health emergency on tourism and hospitality by providing a new foresight perspective. The new approach not only overcomes the lack of generalizability of micro qualitative analysis but also lessens the constraints which conventional quantitative analysis requires, such as the need for the similarity between experimental group and control group. When estimating the expected demand in the virtual world without the public health emergency, the new adaptive hybrid forecaster proposed is good at estimating the general development trend component and the seasonal periodicity component in hotel demand before the public health emergency. Thus, a virtual control group can be generated, which also requires no violation of the assumption of market effectiveness. In order to find which category of hotels is most affected by the pandemic and which one will recover the most rapidly after the pandemic, a multiple impact analysis is used to compare the difference between the real hotel demand and the expected demand. The second contribution is that the new approach provided a real-time monitor or evaluation for the impact of a public health emergency on hotel demand. The method can dynamically analyze the impact of a public health emergency in real-time, thereby helping the government implement new policies or adjust the previous ones. Hotel managers can also look for ways to survive or improve hotel performance in view of this dynamic impact. The third contribution is that the new approach is validated in analyzing the impact of the COVID-19 pandemic on hotel demand in Macao. We find the two significant factors of general development trend and seasonal periodicity impact hotel demand before the pandemic. These two factors are processed well by the adaptive hybrid forecaster to generate a good estimation of expected hotel demand in the virtual world without the public health emergency.The management implications are as follows. 1) For tourism and hospitality, managers and regulators can refer to the new method to identify the most affected product, such as different hotel star categories, and find the most rapidly recovered one. None of the existing methods is as effective in this. 2) Identification of the fastest-growing product before the public health emergency is a reference to estimate the most rapidly recovered product after the event. In Macao's hotel industry, five-star hotels would be an important product to promote tourism and hospitality development after an emergency. 3) To survive better during and after a public health emergency, hotels should establish diverse services with an effective mechanism of health protection. Diverse service promotion with health safety as a focus will attract tourists because travelers visiting Macao mainly prefer the diverse services of five-star hotels.There are also some limitations in this research. First of all, we focus on the impact of a public health emergency on overall hotel demand, and ignore the impact on each component of demand. Subsequent work could analyze the impact of emergencies on each component of hotel demand before the emergency. Then, specialized studies focusing on error analysis with other methods would be useful to verify the superiority of the proposed adaptive hybrid forecaster. Finally, our method currently can only evaluate the impact of an emergency on hotel demand from its appearance to the present as the actual demand is a parameter for assessment. On the one hand, the virus can mutate. Once a new virus is discovered, the effects of COVID-19 may continue. On the other hand, new infections are still being discovered and appear in somewhat random places. As a result, forecasting when COVID-19 will end, or the point at which hotel demand fully recovers, relies on identification of these key factors. In future work, we will consider how to effectively predict the time when hotel demand totally recovery and forecast the overall impact of an emergency on hotel demand before the end of the emergency.
CRediT authorship contribution statement
Ling-Yang He contributed via conceptualization of the original idea, theoretical development; experimental design, algorithm design and coding, data collection, results analysis, and finished the original draft. Hui Li conceptualized the original idea, applied for research funding, and completed the following tasks: results analysis, discussion of implications and study significance, and improvement of the original draft. Jian-Wu Bi contributed by discussing the design and the conceptualization, and improving the analysis and presentation. Jing-Jing Yang contributed on data collection, and improving the theorical discussion and presentation. Qing Zhou applied for research funding and improved the presentation.
Declaration of competing interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.