Literature DB >> 32288334

Forecasting tourist arrivals by using the adaptive network-based fuzzy inference system.

Miao-Sheng Chen1, Li-Chih Ying2, Mei-Chiu Pan1.   

Abstract

Since accurate forecasting of tourist arrivals is very important for planning for potential tourism demand and improving the tourism infrastructure, various tourist arrivals forecasting methods have been developed. The purpose of this study is to apply the adaptive network-based fuzzy inference system (ANFIS) model to forecast the tourist arrivals to Taiwan and demonstrate the forecasting performance of this model. Based on the mean absolute percentage errors and statistical results, we can see that the ANFIS model has better forecasting performance than the fuzzy time series model, grey forecasting model and Markov residual modified model. Thus, the ANFIS model is a promising alternative for forecasting the tourist arrivals. We also use the ANFIS model to forecast the monthly tourist arrivals to Taiwan from Japan, Hong Kong and Macao, and the United States.
Copyright © 2009 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  Adaptive network-based fuzzy inference system; Tourist arrivals

Year:  2009        PMID: 32288334      PMCID: PMC7126321          DOI: 10.1016/j.eswa.2009.06.032

Source DB:  PubMed          Journal:  Expert Syst Appl        ISSN: 0957-4174            Impact factor:   6.954


Introduction

The tourism industry has grown rapidly over the past few decades. Due to the perishable nature of the tourism industry, the need for accurate forecasts is crucial. Both the public non-profit government sectors and the private profit companies are interested in finding an accurate forecasting technique to make operational, tactical and strategic decisions. Companies can, using the results from this forecasting technique, plan for potential tourism demand successfully and invest in tourism related facilities and equipments sufficiently, and government sectors can play a significant role in maintaining and improving tourism infrastructure. Therefore, various forecasting methods have been developed. They include exponential smoothing (Cho, 2003), ARIMA (Cho, 2003, Chu, 1998, Goh and Law, 2002, Lim and McAleer, 2002), vector autoregressive (Song and Witt, 2006, Wong et al., 2006), neural networks (Chen and Wang, 2007, Cho, 2003, Law, 2000, Law and Au, 1999), fuzzy time series (Wang, 2004, Wang and Hsu, 2008), grey model (Hsu and Wen, 1998, Wang, 2004), econometric (Hiemstra and Wong, 2002, Smeral et al., 1992, Song and Witt, 2000, Witt and Martin, 1987), regression-based model (Chan, 1993, Crouch et al., 1992, Kulendran and Witt, 2001) and genetic algorithm (Chen and Wang, 2007, Hernández-López and Cáceres-Hernández, 2007, Hernández-López, 2004, Hurley et al., 1998). But we can not find any paper adopting an adaptive network-based fuzzy inference system, referred to as ANFIS (Jang, 1993), to forecast tourist arrivals. So, the purpose of this paper is to fill this gap, and we also try to compare the results with those of other models and use the ANFIS model to forecast the monthly tourist arrivals to Taiwan from the top three markets. In this paper, the data used were from the Tourism Bureau of Republic of China (ROC), and, for comparison, the annual tourist arrivals to Taiwan from the three markets: Hong Kong, the United States and Germany from 1989 to 2003 were considered. But, according to the numbers of tourist arrivals to Taiwan, we apply the ANFIS to forecast the monthly tourist arrivals to Taiwan from the top three markets: Japan, Hong Kong and Macao, and the United States. The rest of this paper is organized as follows: Section 2 introduces the architecture and the hybrid learning algorithm of an ANFIS with a simple illustration. Section 3 compares the forecasting accuracy for the different models. Section 4 presents the application of the ANFIS to forecast the monthly tourist arrivals to Taiwan from Japan, Hong Kong and Macao, and the United States. The last section contains some concluding remarks.

Adaptive network-based fuzzy inference system

An ANFIS (Jang, 1993) can help us find the mapping relation between the input and output data through hybrid learning to determine the optimal distribution of membership functions. Five layers are used to construct this inference system. Each layer contains several nodes described by the node function. Adaptive nodes, denoted by squares, represent the parameter sets that are adjustable in these nodes, whereas fixed nodes, denoted by circles, represent the parameter sets that are fixed in the system. The output data from the nodes in the previous layers will be the input in the present layer. To illustrate the procedures of an ANFIS, for simplicity, we consider only two inputs and one output in this system. The framework of ANFIS is shown in Fig. 1 , and the node function in each layer is described below.
Fig. 1

The framework of an ANFIS.

Every node in this layer is an adaptive node with node function as: where is the input of the node, is the linguistic label, is the membership function, usually adopting bell shape with maximum and minimum equal to 1 and 0, respectively, as follows:orwhere is the parameter set. As the values of these parameters change, the bell shaped functions vary accordingly. The parameters in this layer are named premise parameters. Every node in this layer is a fixed node, marked by a circle and labeled , with the node function to be multiplied by input signals to serve as output signalThe output signal represents the firing strength of a rule. Every node in this layer is a fixed node, marked by a circle and labeled N, with the node function to normalize the firing strength by calculating the ratio of the ith node firing strength to the sum of all rules’ firing strength. Every node in this layer is an adaptive node, marked by a square, with node functionwhere and are the fuzzy if-then rules as follows:and where is the parameters set, referred to as the consequent parameters. if x is and y is then if x is and y is then Every node in this layer is a fixed node, marked by a circle and labeled , with node function to compute the overall output by The framework of an ANFIS. As mentioned above, an ANFIS is a multilayer feedforward network in which each node performs a node function on incoming signals as well as a set of parameters belonging to this node. Suppose that the given training data set has n entries. We define the overall error measure bywhere is the error measure for the ith entry of the given training data set, is the desired output of the ith entry and is the output of the ANFIS using the ith entry. From the architecture of the ANFIS, we know that if the premise parameters are fixed, the output of the whole system will be a linear combination of the consequent parameters as follows:Let matricesThen, Eq. (10) can be expressed in matrix form aswhere is an unknown matrix, whose elements come from the consequent parameters set. The least squares estimator(LSE) is given byThe hybrid learning algorithm of the ANFIS combines the gradient method with the least squares method to update the parameters in an adaptive network. Each epoch of this hybrid learning procedure is composed of a forward pass and a backward pass. In the forward pass, after every input vector is given, we calculate the corresponding node output until the matrices B and in Eq. (11) are obtained. The parameters in the consequent parameters set are identified by using Eq. (13); then, we can compute the error measure by Eq. (9). In the backward pass, we have to calculate the error rate for the ith entry of the training data set and for each node output O. If is a parameter of the premise parameters set, by the chain rule, the derivative of the overall error measure E with respect to iswhere V is the set of nodes whose outputs depend on , and is a node output belonging to V. Then, the updated formula for the premise parameter by the gradient method is given byin which is a learning rate. The computation of the data for the ANFIS was conducted using the software Matlab. The ANFIS training algorithms, including the gradient method and the least squares method, were embedded in the software of Matlab’s fuzzy inference toolbox. The main computation procedure includes four steps. The first step is the data input. The input of the event data includes the input data and output data in the form of data array. The second step is generating fuzzy inference system. The third step is using ANFIS training function in the toolbox for the training of the input data. The training of the data will be performed automatically in the system and an array of training error will be obtained. After training, an ANFIS model with forecasting function will be obtained for output forecasting at the last step.

Comparisons of forecasting accuracy for various models

In this section, for comparison with the results proposed by Wang (2004), we employed the annual tourist arrivals to Taiwan from the three markets: Hong Kong, the United States and Germany, shown in Table 1 , from 1989 to 2003 as our research data. The data are divided into two data sets: the training data set( from 1989 to 2000) and the testing data set( from 2001 to 2003).
Table 1

The tourist arrivals to Taiwan from the three markets (from 1989 to 2003).

YearHong KongThe United StatesGermany
198921180422059425002
199019354422491524320
199118176524037525798
199219352325914528969
199321395326911028644
199424177528671331334
199524674729013832944
199626258528990033914
199725966430363434660
199827990530840735343
199931981431780134190
200036130835953334829
200139255234880833716
200245655437747033979
200332317827285828577
The tourist arrivals to Taiwan from the three markets (from 1989 to 2003). For the purposes of comparisons of the forecasting performances among various models, the absolute percentage error (APE) and the mean absolute percentage error (MAPE), given by Eqs. (16), (17) respectively, are used as the indexes of forecasting accuracy. MAPE is a relative measurement and is easy to interpret. MAPE is also independent of scale, reliable and valid (Law & Au, 1999). The smaller the values of MAPE, the closer were the forecasted values to the actual values. where and represent the actual and forecasted values of the ith data, respectively, and N is the number of data. Wang (2004) proposed fuzzy time series, grey forecasting model GM(1, 1) and Markov residual modified model to forecast the tourist arrivals to Taiwan from Hong Kong, the United States and Germany and found that the fuzzy time series is suitable for the tourism demand forecasting of Hong Kong arrivals, the GM(1, 1) model is appropriate for the tourism demand forecasting of Hong Kong and the United States arrivals, and Markov residual modified model is the best for Germany tourism demand forecasting. This study used the ANFIS models to forecast the tourist arrivals to Taiwan from the three markets and compared the results with those of Wang (2004) with the help of software of Matlab’s fuzzy inference toolbox. The input data are the annual tourist arrivals to Taiwan from the three markets: Hong Kong, the United States, and Germany from 1989 to 2000, shown in Table 1. Bell shaped functions were chosen for the membership function expressed in Eq. (3). Fig. 2 shows the fuzzy rule architecture of the ANFIS with two inputs, one output and a total of nine fuzzy rules. The initial value of step size for the training by the ANFIS was set to 0.011.
Fig. 2

Fuzzy rule architecture of the ANFIS. System ANFIS: two inputs, one output, and nine rules.

Fuzzy rule architecture of the ANFIS. System ANFIS: two inputs, one output, and nine rules. According to the settings above, the training process was performed using Matlab 7.0. The root mean square errors became steady after running 348, 210, and 840 epochs of training on data from Hong Kong, the United States, and Germany, respectively. The final convergence values were 2594.5373, 4234.712, and 334.8358 for Hong Kong, the United States, and Germany respectively. Fig. 3, Fig. 4 showed the initial and final membership functions of the input data of the three markets respectively. After the training process, the trained ANFIS system could be used for forecasting. The forecasting results and corresponding MAPE values of the various forecasting models for the different markets are listed in Table 2 .
Fig. 3

Initial bell shaped membership functions of the three markets.

Fig. 4

Final membership functions of the three markets.

Table 2

Forecasting results of the tourist arrivals to Taiwan from Hong Kong, the United States and Germany by various forecasting models.

YearHong Kong
The United States
Germany
ANFISGM (1, 1)MarkovFuzzy (w = 5)ANFISGM (1, 1)MarkovFuzzy (w = 3)ANFISGM (1, 1)MarkovFuzzy (w = 6)
19892118042118042205942205942500225002
19901822931939482303492237332592124580
1991181708195121182970239782239144246140258022679325433
1992193875208850196183259901248275255673289652769429073
1993213598223546210340268507257754265577286452862630024
1994241738239276253044289648267595275868283557313332958931007
1995247251256113241759286570277812286560301160329423058432022
1996261944274134259170258367293808288420297970298285339263161333071
199725982429342427782227420529490429943230921329174734745326773415534919
199828003131407129780526448431132631086430052131808134506337763527535665
199931975533617031921330512533034132273331179631025434155349133339336348
200036131432592537750435183434937533505634662233224834899360873454633860



MAPE (%)0.09696.90942.97683.98621.49573.08942.27253.54390.29924.37201.77132.6881
Initial bell shaped membership functions of the three markets. Final membership functions of the three markets. Forecasting results of the tourist arrivals to Taiwan from Hong Kong, the United States and Germany by various forecasting models. For comparison, we only used the forecasting data from 1989 to 2000, of the various forecasting models. Table 2 shows that, for each market, in tourist arrivals forecasting, the ANFIS model has smaller MAPE values than the fuzzy time series, grey forecasting model GM(1, 1) and Markov residual modified model; in other words, the forecasting performance of the ANFIS model is better than that of the other models. Because each forecasting value of the various forecasting models corresponds to its actual value, these two values are matched. The Kolmogorov–Smimov (K–S) test is used to test the hypothesis that the difference D of the matched data for various forecasting models is normally distributed. If we do not reject the hypothesis, we could establish a 95% confidence interval estimate of the mean of the difference of the matched data. Table 3 shows the statistical results of the p-values of the K–S test (p), the mean of the difference of matched data , and its upper bound (U), lower bound (L), and width (W) of the 95% confidence interval of for the different markets.
Table 3

Statistical results of the difference of the matched data for various forecasting models and for different markets.

MarketStatisticsActual to GM (1, 1)Actual to MarkovActual to FuzzyActual to ANFIS
Hong Kongp0.8660.4200.9210.673
D¯−10686.67−4597.925852.200.10
U−1980.04968.8121076.16234.27
L−19393.30−10164.64−9371.76−234.07
W17413.2611133.4530447.92468.34



The United Statesp0.8210.9690.8870.930
D¯6019.752499.832970.5759.65
U12108.196678.4915920.054726.54
L−68.69−1678.83−9978.91−4607.24
W12176.888357.3225898.969333.78



Germanyp0.7940.9530.8960.497
D¯556.00197.17−442.501.50
U1491.99594.051608.3620.3792
L−379.99−199.72−2493.36−17.3792
W1871.98793.774101.7237.7584
Statistical results of the difference of the matched data for various forecasting models and for different markets. Table 3 shows that all p-values of the K–S test are larger than 0.05, so we can not reject the hypothesis that the differences of matched data for the various forecasting models are normally distributed. From the results of the mean of difference and the 95% confidence interval estimate of the mean difference of matched data, we can see that the values of of the ANFIS model are very close to zero and the widths of the 95% confidence interval of of the ANFIS model are the smallest among all the forecasting models except Markov residual modified model for the United States market; so, the ANFIS model seems to be more accurate than all the other models. As mentioned above, based on the values of MAPE and confidence interval width from 1989 to 2000, the ANFIS model seems to be the most accurate one. It means that the ANFIS model is suitable for the tourist arrivals forecasting. Therefore, we use the ANFIS model to forecast tourist arrivals to Taiwan from the three markets: Hong Kong, the United States and Germany from 2001 to 2003, and compute the APE values of the three markets to present the forecasting accuracy of the ANFIS model. Table 4 shows that all the APE values of Hong Kong, the United States and Germany by the ANFIS model are below 2% in 2001, less than those by the GM (1, 1), Markov and fuzzy models, forecasted by Wang (2004). The reason why the APE values of the ANFIS model do not become so small in 2003, is probably due to the outlier data that happened during the Severe Acute Respiratory Syndrome (SARS).
Table 4

Forecasting results of the tourist arrivals to Taiwan from the three markets by the ANFIS model from 2001 to 2003.

ModelYear
2001
20022003
GM (1, 1) (APE%)Markov (APE%)Fuzzy (APE%)ANFIS (APE%)ANFIS (APE%)ANFIS (APE%)
Hong Kong385144(1.887)366713(6.582)400128(1.93)393232(0.1732)456295(0.0567)434692(34.505)
The United States347848(0.275)360079(3.231)386580(10.83)349153(0.0989)377789(0.0845)369814(35.534)
Germany37301(10.634)35739(6.000)35834(6.28)34361(1.9130)34206(0.6681)34201(19.680)
Forecasting results of the tourist arrivals to Taiwan from the three markets by the ANFIS model from 2001 to 2003.

Forecasting of the monthly tourist arrivals from the top three markets

According to the comparative results shown in the previous section, it could be concluded that the ANFIS model is superior to the others. Thus, the ANFIS model is a promising alternative for forecasting the tourist arrivals. In this section, we apply the ANFIS model to forecast the monthly tourist arrivals to Taiwan from the top three markets: Japan, Hong Kong and Macao, and the United States according to the volume of tourist arrivals from January 2006 to April 2008, shown in Table 5 .
Table 5

Forecasting results of the tourist arrivals to Taiwan from Japan, Hong Kong and Macao, and the United States by ANFIS models.

Year and monthJapan
Hong Kong and Macao
The United States
ActualANFISActualANFISActualANFIS
2006
January855233241431248
February995063412425738
March10945910764434443345543365533530
April844258551942018420903258433570
May908869212135820364153270233004
June916769178539995411204049239507
July810298018838126435383601935888
August987259652744668440702955029681
September10243810370932095306812623126396
October1034659986328029277913434134201
November11454711500228383285823476634005
December998109997941769417263747638221



2007
January10156310230623879236582771227761
February847369798535289352912889228594
March12059912065036283364343604436071
April890218958149732489473219932405
May907849030439057389113155132219
June921279222849526489683898240250
July811168363742788435883635136681
August9779510100149586498882997030055
September10158410264637729379142710027035
October994199476836549371113549536601
November10687510852938047413703366833666
December10076110110452972468944000139561



2008
January983929675630088301433009230267
February923949337150024501062758427749
March10652010686756303564263835037678
April821368253743224432453147831531



MAPE (%)1.822362.165961.10851
Forecasting results of the tourist arrivals to Taiwan from Japan, Hong Kong and Macao, and the United States by ANFIS models. Bell shaped functions were chosen for the membership function defined in Eq. (3). The fuzzy rule architecture is the same as Fig. 2, and the initial value of step size for the training was set to 0.011. The root mean square errors became steady after running 9,247,690, 1,723,549, and 3,161,265 epochs of training on data from Japan, Hong Kong and Macao, and the United States respectively. The final convergence values were 3095.9057, 1792.0489, and 535.1169 for Japan, Hong Kong and Macao, and the United States respectively. Fig. 5, Fig. 6 showed the initial and final membership functions of the input data respectively. After the training process, the trained ANFIS system could be used for forecasting. The forecasting results and corresponding MAPE values of the various forecasting models for the top three markets are listed in Table 5. From Table 5 and Fig. 7 , we can find that seasonal fluctuations occurred in the six time series, and the actual values and the ANFIS forecasting values for tourist arrivals to Taiwan from the top three markets are very close.
Fig. 5

Initial bell shaped membership functions of the top three markets.

Fig. 6

Final membership functions of the top three markets.

Fig. 7

Actual values and the ANFIS forecasting values for monthly tourist arrivals to Taiwan from top three markets.

Initial bell shaped membership functions of the top three markets. Final membership functions of the top three markets. Actual values and the ANFIS forecasting values for monthly tourist arrivals to Taiwan from top three markets.

Conclusion

Accurate forecasting of tourist arrivals is helpful for planning for potential tourism demand to invest in tourism related facilities and equipments and improve tourism infrastructure. This study adopted the ANFIS model to forecast the tourist arrivals to Taiwan, and compared the MAPE and APE values and statistical results of the ANFIS with those of other models. The empirical results, obtained in Section 3, by the ANFIS model yield more accurate tourist arrivals forecasting than that of the other models. Therefore, we conclude that the ANFIS model is a valid and promising alternative for forecasting tourist arrivals. From the results of Section 4, we can see that, owing to seasonal variation, the monthly tourist arrivals forecasting has larger MAPE values than the annual data and the root mean square errors become steady until a great amount of epochs.
  1 in total

1.  The prediction of fluctuation in the order-driven financial market.

Authors:  Fabin Shi; Xiao-Qian Sun; Jinhua Gao; Zidong Wang; Hua-Wei Shen; Xue-Qi Cheng
Journal:  PLoS One       Date:  2021-11-18       Impact factor: 3.240

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.