Academia.eduAcademia.edu

A new forecasting approach for the hospitality industry

2015, International Journal of Contemporary Hospitality Management

https://doi.org/10.1108/IJCHM-06-2014-0286

Purpose – This study aims to apply a new forecasting approach to improve predictions in the hospitality industry. To do so, the authors developed a multivariate setting that allows the incorporation of the cross-correlations in the evolution of tourist arrivals from visitor markets to a specific destination in neural network models. Design/methodology/approach – This multiple-input-multiple-output approach allows the generation of predictions for all visitor markets simultaneously. Official data of tourist arrivals to Catalonia (Spain) from 2001 to 2012 were used to generate forecasts for one, three and six months ahead with three different networks. Findings – The study revealed that multivariate architectures that take into account the connections between different markets may improve the predictive performance of neural networks. Additionally, the authors developed a new forecasting accuracy measure and found that radial basis function networks outperform the rest of the models. Research limitations/implications – This research contributes to the hospitality literature by developing an innovative framework to improve the forecasting performance of artificial intelligence techniques and by providing a new forecasting accuracy measure. Practical implications – The proposed forecasting approach may prove very useful for planning purposes, helping managers to anticipate the evolution of variables related to the daily activity of the industry. Originality/value – A multivariate neural network framework has been developed to improve forecasting accuracy, providing professionals with an innovative and practical forecasting approach.

Institut de Recerca en Economia Aplicada Regional i Pública Research Institute of Applied Economics Grup de Recerca Anàlisi Quantitativa Regional Regional Quantitative Analysis Research Group Document de Treball 2014/17, 21 pàg. Working Paper 2014/17, 21 pag. Document de Treball 2014/10 21 pàg. Working Paper 2014/10, 21 pag. “A multivariate neural network approach to tourism demand forecasting” Oscar Claveria, Enric Monte and Salvador Torra Working Paper 2014/17, pàg. 2 Working Paper 2014/10, pag. 2 Research Institute of Applied Economics Regional Quantitative Analysis Research Group WEBSITE: www.ub-irea.com • CONTACT: [email protected] WEBSITE: www.ub.edu/aqr/ • CONTACT: [email protected] Universitat de Barcelona Av. Diagonal, 690 • 08034 Barcelona The Research Institute of Applied Economics (IREA) in Barcelona was founded in 2005, as a research institute in applied economics. Three consolidated research groups make up the institute: AQR, RISK and GiM, and a large number of members are involved in the Institute. IREA focuses on four priority lines of investigation: (i) the quantitative study of regional and urban economic activity and analysis of regional and local economic policies, (ii) study of public economic activity in markets, particularly in the fields of empirical evaluation of privatization, the regulation and competition in the markets of public services using state of industrial economy, (iii) risk analysis in finance and insurance, and (iv) the development of micro and macro econometrics applied for the analysis of economic activity, particularly for quantitative evaluation of public policies. IREA Working Papers often represent preliminary work and are circulated to encourage discussion. Citation of such a paper should account for its provisional character. For that reason, IREA Working Papers may not be reproduced or distributed without the written consent of the author. A revised version may be available directly from the author. Any opinions expressed here are those of the author(s) and not those of IREA. Research published in this series may include views on policy, but the institute itself takes no institutional policy positions. 2 Working Paper 2014/17, pàg. 3 Working Paper 2014/10, pag. 3 Research Institute of Applied Economics Regional Quantitative Analysis Research Group Abstract This study compares the performance of different Artificial Neural Networks models for tourist demand forecasting in a multiple-output framework. We test the forecasting accuracy of three different types of architectures: a multi-layer perceptron network, a radial basis function network and an Elman neural network. We use official statistical data of inbound international tourism demand to Catalonia (Spain) from 2001 to 2012. By means of cointegration analysis we find that growth rates of tourist arrivals from all different countries share a common stochastic trend, which leads us to apply a multivariate out-of-sample forecasting comparison. When comparing the forecasting accuracy of the different techniques for each visitor market and for different forecasting horizons, we find that radial basis function models outperform multi-layer perceptron and Elman networks. We repeat the experiment assuming different topologies regarding the number of lags used for concatenation so as to evaluate the effect of the memory on the forecasting results, and we find no significant differences when additional lags are incorporated. These results reveal the suitability of hybrid models such as radial basis functions that combine supervised and unsupervised learning for economic forecasting with seasonal data. JEL classification: L83; C53; C45; R11 Keywords: forecasting; tourism demand; cointegration; multiple-output; artificial neural networks Oscar Claveria. AQR Research Group-IREA, Universitat de Barcelona, Av. Diagonal 690, 08034 Barcelona, Spain. E-mail: [email protected] Enric Monte. Department of Signal Theory and Communications, Polytechnic University of Catalunya. E-mail: [email protected] Salvador Torra. Riskcenter-IREA, Universitat de Barcelona, Av. Diagonal 690, 08034 Barcelona, Spain. E-mail: [email protected] 3 Working Paper 2014/17, pàg. 4 Working Paper 2014/10, pag. 4 Research Institute of Applied Economics Regional Quantitative Analysis Research Group I. Introduction The availability of more advanced forecasting techniques and the requirement for more accurate forecasts of tourism demand at the destination level has led to a growing interest in tourism demand forecasting over the past decades. Despite there is no consensus on the most appropriate approach to forecast tourism demand (Kim and Schwartz, 2013; Song and Li, 2008), it is generally believed that the nonlinear methods outperform the linear methods in modelling economic behaviour (Cang, 2013). As stated by Granger and Terasvirta (1993), real world systems are often nonlinear, so that their responses are not proportional to changes in the inputs. During the 80s, several nonlinear models time series models were developed. See De Gooijer and Kumar (1992) for a review of this field. These nonlinear models are still limited in that an explicit relationship for the data series has to be assumed with little knowledge of the underlying data generating process. Since there are too many possible nonlinear patterns, the specification of a nonlinear model to a particular data set becomes a difficult task. The suitability of artificial intelligence techniques to handle nonlinear behaviour explain why Artificial Neural Networks (ANNs) have become an essential tool for economic forecasting. ANNs can be regarded as one of the multivariate nonlinear nonparametric statistical methods. As data characteristics are associated with forecast accuracy (Kim and Schwartz, 2013), nonlinear data-driven approaches such as ANNs represent a flexible tool for forecasting, allowing for nonlinear modelling without a priori knowledge about the relationships between input and output variables. The introduction of the backpropagation algorithm fostered the use of ANNs for forecasting (Santín et al., 2004; Binner et al., 2005; Vlastakis et al., 2008; Madden and Tan, 2008; Lin et al., 2011; Choudhary and Haider, 2012; Teixeira and Fernandes, 2012). Zhang et al.(1998) review the literature comparing ANNs with statistical models in time series forecasting. Many different ANN models have been developed since the 1980s. ANNs can be classified into two major types of architectures depending on the connecting patterns of the different layers: feedforward networks, where the information runs only in one direction, and recurrent networks, in which there are feedback connections from outer layers of neurons to lower layers of neurons. Feedforward networks were the first ANNs devised. The most widely used feed-forward topology in time series forecasting is the multi-layer perceptron (MLP) network. MLP networks have been widely used for tourism demand forecasting (Pattie and Snyder, 1996; Uysal and El Roubi, 1999; 4 Working Paper 2014/17, pàg. 5 Working Paper 2014/10, pag. 5 Research Institute of Applied Economics Regional Quantitative Analysis Research Group Law, 1998, 2000, 2001; Law and Au, 1999, Burger et al., 2001; Tsaur et al., 2002; Kon and Turner, 2005; Palmer et al., 2006; Claveria and Torra, 2014). A class of multi-layer feed-forward architecture with two layers of processing is the radial basis function (RBF) network (Broomhead and Lowe, 1988). RBF networks have the advantage of not suffering from local minima in the same way as MLP networks, which explains their increasing use in many fields. Cang (2013) has recently compared the forecast accuracy of RBF networks to that of MLP and Support Vector Machine (SVM) networks. Recurrent networks are models with bidirectional data flow: they propagate data linearly from input to output but also allow for a temporal feedback from the outer layers to the lower layers. This feature is specially suitable for time series modelling. There are many recurrent architectures. A special case of recurrent network is the Elman network (Elman, 1990). Whilst MLP networks are increasingly used with forecasting purposes, Elman neural networks have been scarcely used in tourism demand forecasting. Cho (2003) used the Elman architecture to predict the number of arrivals from different countries to Hong Kong. Regarding their learning strategy, ANNs can also be classified into two major types of architectures: supervised and unsupervised learning networks. In supervised learning networks, weights are adjusted to approximate the output to a target value for each pattern of entry. SVMs and MLP networks are examples of supervised learning models. In non-supervised learning networks, the subjacent structure of data patterns is explored so as to organize such patterns according to their correlations. Kohonen self-organizing maps (SOM) are the most used non-supervised models. Some ANNs combine both learning methods, so part of the weights are determined by a supervised process while the rest are determined by unsupervised learning. This is known as hybrid learning. An example of hybrid model is the RBF network. In spite of the increasing interest in machine learning methods for time series forecasting, very few studies compare the accuracy of different ANN architectures for tourism demand forecasting. This study focuses on the implementation of three different ANNs (MLP, RBF and Elman) so as to evaluate how different ways of handling information affect forecast accuracy. We use a multipleoutput approach to predict international tourism demand in order to compare the forecasting performance of the three different architectures. The motivation for applying a multiple-output framework is twofold. On the one hand, there are no studies analyzing the forecasting performance of multiple-output ANNs. On the other hand, a multivariate approach is especially suited when the 5 Working Paper 2014/17, pàg. 6 Working Paper 2014/10, pag. 6 Research Institute of Applied Economics Regional Quantitative Analysis Research Group evolution of tourist arrivals from all the different countries of origin share a common stochastic trend. The fact that tourism data is characterised by strong seasonal patterns and volatility, make it a particularly interesting field in which to apply different types of NN architectures. International tourism is one of the fastest growing industries and accounts for almost 10% of total international trade (Eilat and Einav, 2004). Balaguer and Cantavella-Jordá (2002) showed the important role of tourism in the Spanish long-run economic development. Catalonia is a region of Spain and one of the world’s major tourist destinations. Tourism represents 12% of Catalonian GDP and provides employment for 15% of the working population. These figures show to what extent accurate forecasts of tourism volume play a major role in tourism planning at the destination level. We use official statistical data of tourist arrivals from all countries of origin to Catalonia over the period 2001 to 2012. By means of the Johansen test we find correlated accelerations between the different markets, which lead us to apply a multiple-output approach to obtain forecasts of tourism demand for different forecast horizons (1, 3 and 6 months). To assess the effect of expanding the memory on forecast accuracy, we repeat the experiment assuming different topologies with respect to the number of lags used for concatenation. Finally, we compute several measures of forecast accuracy and the Diebold-Mariano test for significant differences between each two competing series. The structure of the paper is as follows. Section II briefly describes the different neural networks architectures used in the analysis. Section III analyses the data set. In Section IV results of the forecasting competition are discussed. Finally, concluding remarks are given in Section V. II. Methodology Neural networks are flexible structures capable of learning sequentially from observed data. This feature makes ANNs specially suitable for time series forecasting. As opposed to traditional approaches to time series prediction, the specification of ANN models does not depend on a previous set on assumptions. Nevertheless, obtaining a reliable neural model involves selecting a large number of parameters experimentally: determining the number of input nodes, hidden layers, hidden nodes and output nodes, the activation function, the training algorithm, the training and the test samples, as well as the performance measures for cross-validation (Zhang et al.,1998). This range of different choices allows to chose the optimal topology of the ANN, while the weights of 6 Working Paper 2014/17, pàg. 7 Working Paper 2014/10, pag. 7 Research Institute of Applied Economics Regional Quantitative Analysis Research Group the model are estimated by gradient search. A complete summary on ANNs modelling issues can be found in Bishop (1995) and Haykin (1999). Therefore, each network is suited to a combination of a learning paradigm (supervised and nonsupervised learning), a learning rule related to the gradient cost function (Boltzmann, Hebbian, etc.) and a learning algorithm (forward-propagation, back-propagation, self-organization, etc.). The different learning paradigms represent alternative approaches to the treatment of information. In this study we focus on three ANN architectures (MLP, RBF and Elman), each of which deals with data in a different manner. Multi-layer perceptron neural network MLP networks consist of multiple layers of computational units interconnected in a feed-forward way. MLP networks are supervised neural networks that use as a building block a simple perceptron model. The topology consists of layers of parallel perceptrons, with connections between layers that include optimal connections. The number of neurons in the hidden layer determines the MLP network’s capacity to approximate a given function. In order to solve the problem of overfitting, the number of neurons was estimated by cross-validation. In this work we used the MLP specification suggested by Bishop (1995) with a single hidden layer and an optimum number of neurons derived from a range between 5 and 25: yt ^x q · § p ȕ0  6 ȕ j g¨ ijij x t i  ij0 j ¸ ¸ ¨ j 1 ¹ ©i 1 t i ^ ijij , i ^ȕj, j ¦ 1, x t 1 , x t  2 ,  , x t  p c , i 1,  , p 1,  , p, j 1,  , q ` (1) ` ` 1,  , q Where y t is the output vector of the MLP at time t ; g is the nonlinear function of the neurons in the hidden layer; x t i is the input value at time t  i where i stands for the memory (the number of lags that are used to introduce the context of the actual observation.); q is the number of neurons in the hidden layer; ijij are the weights of neuron j connecting the input with the hidden layer; and ȕ j are the weights connecting the output of the neuron j at the hidden layer with the output neuron. 7 Working Paper 2014/17, pàg. 8 Working Paper 2014/10, pag. 8 Research Institute of Applied Economics Regional Quantitative Analysis Research Group Note that the output y t in our study is the estimate of the value of the time series at time t  1 , while the input vector to the neural network will have a dimensionality of p  1 . We considered a MLP p; q architecture that represented the possible nonlinear relationship between the input vector x t i and the output vector y t . The parameters of the network ( ijij and ȕ j ) were estimated by means of the Levenberg-Marquardt algorithm, which is a quasi Newton algorithm. The training was done by iteratively estimating the value of the parameters by local improvements of the cost function. To avoid the possibility that the search for the optimum value of the parameters finishes in a local minimum, we used a multi-starting technique that initializes the neural network several times for different initial random values returning the best result. Radial basis function neural network RBF networks consist of a linear combination of radial basis functions such as kernels centred at a set of centroids with a given spread that controls the volume of the input space represented by a neuron (Bishop, 1995). RBF networks typically include three layers: an input layer; a hidden layer and an output layer. The hidden layer consists of a set of neurons, each of them computing a symmetric radial function. The output layer also consists of a set of neurons, one for each given output, linearly combining the outputs of the hidden layer. The output of the network is a scalar function of the output vector of the hidden layer. The equations that describe the input/output relationship of the RBF are: q yt ȕ 0  6 ȕ j g j x t i g j x t i ^x t i ^ȕj, j j 1 § ¨ ¨ exp¨  ¨ ¨ ¨ © p ¦x t i ȝj j 1 2ı 2j 2 · ¸ ¸ ¸ ¸ ¸ ¸ ¹ 1, x t 1 , x t  2 ,  , x t  p c , i 1,  , p 1,  , q (2) ` ` Where y t is the output vector of the RBF at time t ; ȕ j are the weights connecting the output of the neuron j at the hidden layer with the output neuron; q is the number of neurons in the hidden layer; g j is the activation function, which usually has a Gaussian shape; x t i is the input value at 8 Working Paper 2014/17, pàg. 9 Working Paper 2014/10, pag. 9 Research Institute of Applied Economics Regional Quantitative Analysis Research Group time t  i where i stands for the memory (the number of lags that are used to introduce the context of the actual observation); ȝ j is the centroid vector for neuron j ; and the spread ı j is a scalar that measures the width over the input space of the Gaussian function and it can be defined as the area of influence of neuron j in the space of the inputs. Note that the output y t in our study is the estimate of the value of the time series at time t  1 , while the input vector to the neural network will have a dimensionality of p  1 . In order to assure a correct performance, before the training phase the number of centroids and the spread of each centroid have to be selected. The spread ı j is a hyper parameter selected before determining the topology of the network, and it was determined by cross-validation on the training database. The training was done by adding the centroids iteratively with the spread fixed. Then a regularized linear regression was estimated to compute the connections between the hidden and the output layer. Finally, the performance of the network was computed on the validation data set. This process was repeated until the performance on the validation database ceased to decrease. Elman neural network An Elman network is a special architecture of the class of recurrent neural networks, and it was first proposed by Elman (1990). The architecture is also based on a three-layer network but with the addition of a set of context units that allow feedback on the internal activation of the network. There are connections from the hidden layer to these context units fixed with a weight of one. At each time step, the input is propagated in a standard feed-forward fashion, and then a back-propagation type of learning rule is applied. The output of the network is a scalar function of the output vector of the hidden layer: q yt z j ,t ^x ȕ 0  6 ȕ j z j ,t j 1 · § p g¨ ijij x t i  ij0 j  į ij z j ,t 1 ¸ ¸ ¨ ¹ ©i 1 t i ^ ijij , i ^ȕj, j ^ įij , i ¦ 1, x t 1 , x t  2 ,  , x t  p c , i 1,  , p 1,  , p, j 1,  , q ` (3) ` ` 1,  , q 1,  , p, j 1,  , q ` 9 Working Paper 2014/17, pàg. 10 Working Paper 2014/10, pag. 10 Research Institute of Applied Economics Regional Quantitative Analysis Research Group Where y t is the output vector of the Elman network at time t ; z j ,t is the output of the hidden layer neuron j at the moment t ; g is the nonlinear function of the neurons in the hidden layer; x t i is the input value at time t  i where i stands for the memory (the number of lags that are used to introduce the context of the actual observation); ijij are the weights of neuron j connecting the input with the hidden layer; q is the number of neurons in the hidden layer; ȕ j are the weights of neuron j that link the hidden layer with the output; and į ij are the weights that correspond to the output layer and connect the activation at moment t . Note that the output y t in our study is the estimate of the value of the time series at time t  1 , while the input vector to the neural network will have a dimensionality of p  1 . The training of the network was done by back-propagation through time, which is a generalization of back-propagation for feed-forward networks. The parameters of the Elman neural network were estimated by minimizing an error cost function, which takes into account the whole time series. In order to minimize total error, gradient descent was used to change each weight in proportion to its derivative with respect to the error. A major problem with gradient descent for standard recurrent architectures is that error gradients vanish exponentially quickly with the size of the time lag. Recurrent neural networks cannot be easily trained for large numbers of neuron units and may behave chaotically. III. Data In this study we made use of tourism data. We used the number of tourist arrivals (first destination) provided by the Institute of Tourism Studies (IET) disaggregated by each visitor market over the period 2001:01 to 2012:07. The first four visitor markets (France, the United Kingdom, Belgium and the Netherlands and Germany) account for more than half of the total number of tourist arrivals to Catalonia, although Russia and the Northern countries are the ones experiencing the highest growth in tourist arrivals. First, we tested the unit root hypothesis. In Table 1 we present the results of the augmented Dickey-Fuller (ADF), the Phillips–Perron (PP) and the Kwiatkowski-Phillips-Schmidt-Shin (KPSS) tests. While the ADF and the PP statistics test the null hypothesis of a unit root in xt , the KPSS statistic tests the null hypothesis of stationarity. As it can be seen in Table 1, in most countries we 10 Working Paper 2014/17, pàg. 11 Working Paper 2014/10, pag. 11 Research Institute of Applied Economics Regional Quantitative Analysis Research Group cannot reject the null hypothesis of a unit root at the 5% level. Similar results are obtained for the KPSS test, where the null hypothesis of stationarity is rejected in most cases. When the tests were applied to the first difference of individual time series, the null of non-stationarity is strongly rejected in most cases. In the case of the KPSS test, we cannot reject the null hypothesis of stationarity at the 5% level in any country. These results imply that differencing is required in most cases and prove the importance of deseasonalizing and detrending tourism demand data (Zhang and Qi, 2005). In order to eliminate both linear trends as well as seasonality we used the first differences of the natural log of tourist arrivals. Table 1. Unit root tests on the trend-cycle series of tourist arrivals Test for I(0) Test for I(1) Country ADF PP KPSS ADF PP KPSS France Test for I(2) ADF PP KPSS -2.19 -3.41 0.32 -3.32 -2.48 0.15 -5.17 -3.53 0.04 -1.71 -2.28 0.35 -2.72 -2.88 0.15 -18.98 -2.37 0.06 -3.53 -2.55 0.21 -2.56 -3.42 0.10 -8.36 -4.46 0.02 Germany -2.28 -3.61 0.23 -3.36 -3.70 0.15 -9.07 -4.35 0.05 Italy -0.78 -0.99 0.33 -3.96 -2.46 0.08 -5.45 -3.24 0.24 -1.29 -2.40 0.33 -7.16 -4.06 0.03 -6.90 -2.32 0.02 -3.26 -2.16 0.17 -3.86 -3.61 0.07 -11.36 -2.50 0.03 Switzerland -1.80 -2.99 0.16 -7.11 -4.10 0.07 -6.65 -4.41 0.06 Russia 0.25 0.82 0.30 -5.01 -3.70 0.09 -8.31 -4.07 0.02 Other countries -2.04 -1.96 0.20 -4.56 -4.23 0.06 -9.84 -2.50 0.02 Total -2.14 -1.76 0.30 -2.99 -2.91 0.14 -12.47 -2.34 0.05 United Kingdom Belgium and the NL US and Japan Northern countries Notes: Estimation period 2002:01-2012:07. Tests for unit roots. Intercept included in test equation. Critical values for I(0) and I(1): ADF – Augmented Dickey and Fuller (1979) test, the 5% critical value is -2.88; KPSS – Kwiatkowski et al (1992) test, the 5% critical value is 0.46. Critical values for I(2): ADF – Augmented Dickey and Fuller (1979) test, the 5% critical value is -3.44; PP – Phillips and Perron (1988) test, the 5% critical value is 3.44; KPSS – Kwiatkowski et al (1992) test, the 5% critical value is 0.15. Given the common patterns displayed by most countries, we tested for cointegration using Johansen’s (1988, 1991) trace tests (Lee, 2011; Dritsakis, 2004). Trace tests test the null hypothesis of r cointegrating vectors against the alternative hypothesis of n cointegrating vectors. In Table 2 we present the results of five different unrestricted cointegration rank trace tests. It can be seen that 11 Working Paper 2014/17, pàg. 12 Working Paper 2014/10, pag. 12 Research Institute of Applied Economics Regional Quantitative Analysis Research Group we can only reject the null hypothesis of nine cointegrating vectors with two of the tests. The fact that the evolution of tourist arrivals is multicointegrated has led us to apply a multiple-output neural network approach to obtain forecasts of tourism demand. Table 2. Unrestricted Cointegration Rank Tests Type of test Hypothesized number of Assume no deterministic trend in data No intercept in CE CE(s) No test VAR H0 : r Intercept in CE No trend in VAR Allow for quadratic deterministic trend in data Intercept and trend in CE Linear trend in VAR Allow for linear deterministic trend in data Intercept in CE No intercept in VAR Intercept in CE Test VAR 0 856.6229 969.8334 946.8238 1085.223 1012.763 H0 : r d1 642.9016 741.4399 719.5322 857.7293 785.4048 H0 : r d 2 489.0577 586.3624 566.5598 676.3885 604.4294 H0 : r d 3 358.9547 452.6527 432.9569 541.7908 471.6636 H0 : r d 4 267.2172 344.7378 327.2923 412.1319 342.0272 H0 : r d 5 186.4016 256.9106 240.6405 314.3369 245.5905 H0 : r d 6 118.7815 176.0951 162.8499 227.6863 160.0873 H0 : r d 7 59.45009 110.2719 97.56685 149.9044 92.67206 H0 : r d 8 20.81093 56.72323 47.79385 85.37519 38.75788 H0 : r d 9 0.041106* (0.8681) 18.08417 10.98843 35.64879 0.944397* (0.3311) Notes: Estimation period 2002:01-2012:07. * Denotes rejection of the hypothesis at the 0.05 level. ** MacKinnon-Haug-Michelis (1999) p-values. p-values in parentheses when different from zero. IV. Results In this section we implemented a multiple-output approach to predict arrivals to Catalonia from the different visitor countries. Since growth rates of tourist arrivals from all the different countries of origin share a common stochastic trend, we applied a multivariate forecasting framework. While a single-output approach requires to implement the experiment for each visitor market, the multipleoutput approach allows to simultaneously obtain forecasts for all countries. We compared the forecasting performance of three different multiple-output ANN architectures: multi-layer perceptron, radial basis function and Elman recursive neural networks. 12 Working Paper 2014/17, pàg. 13 Working Paper 2014/10, pag. 13 Research Institute of Applied Economics Regional Quantitative Analysis Research Group Following Bishop (1995) and Ripley (1996), we divided the collected data into three sets: training, validation and test sets. This division is done in order to asses the performance of the network on unseen data. The assessment is undertaken during the training process by means of the validation set, which is used in order to determine the epocs, the topology of the network and, in the case of the RBF the spread. The initial size of the training set was determined to cover a five-year span in order to accurately train the networks and to capture the different behaviour of the time series in relation to the economic cycle. After each forecast, a retraining was done by increasing the size of the set by one period and sliding the validation set by another period. This iterative process is repeated until the test set consisted of the last sample of the time series. Based on these considerations, the first sixty monthly observations (from January 2001 to January 2006) were selected as the initial training set, the next thirty-six (from January 2007 to January 2009) as the validation set and the last 20% as the test set. Note that the sets consist of consecutive subsamples, and the resulting validation and test sets at the beginning of the experiment correspond to different phases of the economic cycle. All neural networks were implemented using Matlab™ and its Neural Networks toolbox. To make the system robust to local minima, we applied the multistartings technique, which consists on repeating each training phase several times. In our case, the multistartings factor was three. The selection criterion for the topology and the parameters was the performance on the validation set. The Elman networks’ parameters and topology had to be optimized taking into account that it could yield an unstable solution such as divergent training due to the fact that during the training the weights of the feedback loop could give rise to an unstable network. Using as a criterion the performance on the validation set, the results that are presented correspond to the selection of the best topology, the best spread in the case of the RBF neural networks, and the best training strategy in the case of the Elman neural networks. Forecasts for 1,3 and 6 months ahead were computed in a recursive way. To summarise this information, two measures of forecast accuracy were computed: the Root Mean Squared Error (RMSE) and the Mean Absolute Error (MAE) (Tables 3 and 4). We repeated the experiment assuming different topologies regarding the memory values, which refer to the number of past months included in the context of the input, ranging from one to three months. Therefore, when the memory is zero, the forecast is done using only the current value of the time series, without any additional temporal context. 13 Working Paper 2014/17, pàg. 14 Working Paper 2014/10, pag. 14 Research Institute of Applied Economics Regional Quantitative Analysis Research Group Table 3. RMSE (2010:04-2012:02) France 1 month 3 months 6 months United Kingdom 1 month 3 months 6 months Belgium and the NL 1 month 3 months 6 months Germany 1 month 3 months 6 months Italy 1 month 3 months 6 months US and Japan 1 month 3 months 6 months Northern countries 1 month 3 months 6 months Switzerland 1 month 3 months 6 months Russia 1 month 3 months 6 months Other countries 1 month 3 months 6 months Total 1 month 3 months 6 months Notes: Memory (0) – no additional lags MLP RBF Elman 0.21 0.09 0.59 0.23 0.09 0.47 0.16 0.08 0.29 Memory (3) – 3 additional lags MLP RBF Elman 0.28 0.09 0.41 0.23 0.09 0.46 0.33 0.09 0.47 0.16 0.31 0.22 0.16 0.16 0.15 0.46 0.46 0.40 0.35 0.35 0.46 0.16 0.16 0.15 0.50 0.41 0.54 0.21 0.13 0.23 0.12 0.11 0.12 0.39 0.34 0.28 0.28 0.24 0.38 0.12 0.12 0.12 0.34 0.34 0.48 0.19 0.27 0.23 0.18 0.18 0.18 0.43 0.43 0.37 0.22 0.28 0.32 0.18 0.18 0.18 0.59 0.46 0.56 0.32 0.37 0.29 0.08 0.09 0.09 0.63 0.43 0.43 0.43 0.33 0.52 0.09 0.09 0.09 0.44 0.49 0.57 0.18 0.32 0.21 0.13 0.13 0.13 0.55 0.28 0.28 0.30 0.30 0.49 0.13 0.13 0.13 0.35 0.39 0.59 0.29 0.33 0.16 0.19 0.17 0.18 0.55 0.34 0.23 0.37 0.41 0.31 0.17 0.18 0.18 0.41 0.34 0.39 0.28 0.40 0.65 0.19 0.19 0.20 0.67 0.70 0.45 0.34 0.42 0.45 0.19 0.19 0.18 0.49 0.55 0.49 0.38 0.86 0.90 0.34 0.38 0.39 1.00 1.04 0.92 0.72 0.76 0.90 0.36 0.37 0.36 1.01 0.86 1.07 0.18 0.28 0.23 0.08 0.08 0.08 0.36 0.27 0.21 0.26 0.25 0.21 0.08 0.08 0.08 0.36 0.31 0.35 0.11 0.20 0.15 0.05 0.05 0.04* 0.33 0.17 0.23 0.14 0.15 0.19 0.05 0.04* 0.05 0.30 0.31 0.28 * Best model. 14 Working Paper 2014/17, pàg. 15 Working Paper 2014/10, pag. 15 Research Institute of Applied Economics Regional Quantitative Analysis Research Group Table 4. MAE (2010:04-2012:02) France 1 month 3 months 6 months United Kingdom 1 month 3 months 6 months Belgium and the NL 1 month 3 months 6 months Germany 1 month 3 months 6 months Italy 1 month 3 months 6 months US and Japan 1 month 3 months 6 months Northern countries 1 month 3 months 6 months Switzerland 1 month 3 months 6 months Russia 1 month 3 months 6 months Other countries 1 month 3 months 6 months Total 1 month 3 months 6 months Notes: Memory (0) – no additional lags MLP RBF Elman 0.15 0.08 0.48 0.15 0.07 0.35 0.12 0.07 0.24 Memory (3) – 3 additional lags MLP RBF Elman 0.23 0.08 0.34 0.18 0.07 0.36 0.25 0.07 0.41 0.12 0.20 0.19 0.14 0.13 0.14 0.37 0.32 0.33 0.27 0.26 0.38 0.13 0.14 0.13 0.41 0.33 0.43 0.17 0.11 0.18 0.11 0.10 0.10 0.30 0.26 0.23 0.21 0.19 0.28 0.10 0.10 0.11 0.24 0.29 0.37 0.15 0.19 0.18 0.14 0.14 0.14 0.32 0.36 0.26 0.17 0.23 0.28 0.14 0.14 0.14 0.45 0.36 0.43 0.23 0.25 0.22 0.06 0.07 0.07 0.48 0.30 0.31 0.31 0.26 0.42 0.07 0.06 0.06 0.36 0.40 0.50 0.14 0.23 0.17 0.11 0.11 0.11 0.44 0.22 0.21 0.25 0.24 0.34 0.11 0.11 0.11 0.28 0.29 0.45 0.20 0.26 0.13 0.16 0.15 0.15 0.44 0.29 0.19 0.29 0.33 0.25 0.14 0.15 0.15 0.32 0.27 0.32 0.23 0.32 0.38 0.16 0.16 0.16 0.53 0.51 0.36 0.29 0.32 0.34 0.16 0.16 0.15 0.37 0.42 0.39 0.31 0.62 0.64 0.30 0.35 0.35 0.80 0.81 0.70 0.57 0.66 0.81 0.33 0.34 0.32 0.86 0.69 0.82 0.14 0.16 0.17 0.07 0.06 0.06 0.26 0.20 0.18 0.19 0.19 0.17 0.07 0.06 0.06 0.28 0.24 0.26 0.09 0.12 0.12 0.04 0.04 0.03* 0.24 0.12 0.18 0.11 0.11 0.14 0.04 0.03* 0.04 0.22 0.25 0.22 * Best model. 15 Working Paper 2014/17, pàg. 16 Working Paper 2014/10, pag. 16 Research Institute of Applied Economics Regional Quantitative Analysis Research Group Table 5. Diebold-Mariano loss-differential test statistic for predictive accuracy Memory (0) versus Memory (3) MLP RBF Elman France 1 month -2.68* 0.47 2.42* 3 months -1.11 0.62 -0.22 6 months United Kingdom 1 month 3 months 6 months Belgium and the Netherlands -2.74* -0.65 -2.92* -3.98* -1.19 -3.75* 0.32 -0.37 1.07 -0.70 -0.17 -1.35 1 month -0.85 0.53 0.75 3 months 6 months Germany -2.63* -1.47 -2.72* -2.19* -0.73 -2.24* 1 month 3 months -0.95 -0.92 0.12 -0.47 -1.36 0.04 6 months Italy -1.75 -0.40 -2.24* 1 month 3 months 6 months US and Japan -1.29 -0.07 -3.09* -0.55 1.84 1.31 1.44 -1.38 -2.08* 1 month 3 months -1.93 -0.09 -0.22 -1.43 2.08 -0.97 6 months Northern countries -1.39 -0.64 -3.62* 1 month 3 months 6 months Switzerland -1.30 -1.17 -2.25* 1.98 -1.95 -0.92 1.27 0.38 -2.54* 1 month -1.48 0.08 1.25 3 months 0.06 -0.52 0.95 6 months Russia 0.36 3.01* -0.38 1 month 3 months -2.66* -0.29 -0.66 1.64 -0.38 0.82 6 months Other countries -1.16 3.41* -0.75 1 month 3 months -1.75 -0.41 -0.07 -0.97 -0.51 -0.66 6 months Total -0.10 -0.24 -1.24 1 month -0.78 0.46 0.25 3 months 6 months 0.20 -0.75 0.62 0.53 -3.55* -0.78 Notes: Diebold-Mariano test statistic with NW estimator. Null hypothesis: the difference between the two competing series is non-significant. A negative sign of the statistic implies that the second model has bigger forecasting errors. * Significant at the 5% level. 16 Working Paper 2014/17, pàg. 17 Working Paper 2014/10, pag. 17 Research Institute of Applied Economics Regional Quantitative Analysis Research Group We also use the Diebold-Mariano test for significant differences between each two competing series for each forecast horizons in order to assess the effect of different memory values on the forecasts (Table 5). When analysing the forecast accuracy, MLP and RBF networks show lower RMSE and MAE values than Elman networks. RBF networks display the lowest RMSE and MAE values in most countries both when the memory is zero and when is set to three. When the forecasts are obtained incorporating additional lags of the time series, the forecasting performance of RBF networks significantly improves in Switzerland and Russia for 6 months ahead. The lowest RMSE and MAE values are obtained with the RBF network for total tourist arrivals, for 3 months ahead when the memory is zero, and for 6 months ahead when using a memory of three lags. When testing for significant differences between each two competing series (Table 5), we find that in most cases, as the number of previous months used for concatenation increases, the forecasting performance of the different networks shows no significant improvement. This result can in part be explained by the pre-processing (detrending) of the original time series and the crosscorrelations accounted for in the multiple-output approach. V. Conclusion The main objective of the study is to evaluate the forecasting performance of three artificial neural networks models: the multi-layer perceptron neural network, the radial basis function neural network and the Elman recursive neural network. The seasonal patterns and the volatility that characterizes tourism data constitute an enabling field in which to compare the forecast accuracy of different neural network architectures that treat information in a different way. We use official statistical data of inbound international tourism demand to Catalonia. By means of the Johansen test we find that the evolution of arrivals from all countries of origin are multicointegrated. Since all markets share a stochastic trend, we apply a multivariate approach to obtain forecasts of tourism demand for all different countries and different forecast horizons. When comparing the forecasting accuracy of the different techniques, we find that radial basis function neural networks outperform both multi-layer perceptron and Elman neural networks. This result shows that hybrid models, which combine supervised and non-supervised learning, are more indicated for economic forecasting with seasonal data than models using supervised learning alone. 17 Working Paper 2014/17, pàg. 18 Working Paper 2014/10, pag. 18 Research Institute of Applied Economics Regional Quantitative Analysis Research Group Our results also suggest that when using dynamic or recurrent neural networks with forecasting purposes scaling issues may arise, which can give rise to divergence in the learning algorithm. In order to evaluate the effect of the memory on the forecasting results, we repeated the experiment assuming different topologies regarding the number of lags used for concatenation. No significant differences are found when additional lags are incorporated in the feature vector, especially in the case of multi-layer perceptron neural networks. The explanation for this result is that the increase in the weight matrix is not compensated by the more complex specification and leads to overparametrization. The fact that increasing the dimensionality of the input does not have a significant effect on forecast accuracy is indicative that the pre-processing of the raw data conditions the forecasting results. Summarizing, the forecasting out-of-sample comparison shows the suitability of applying hybrid models such as radial basis function neural networks to economic forecasting with seasonal time series. The study also reveals that the implementation of multiple-output architectures, taking into account the connections between the different time series, improves the forecasting performance of practical neural network forecasting. A question to be considered in further research is whether these results apply to different data pre-processing methods. References Balaguer, J. and Cantavella-Jordá, M. (2002) Tourism as a long-run economic growth factor: the Spanish case, Applied Economics, 34, 877-884. Bishop, C. M. (1995) Neural networks for pattern recognition, Oxford University Press, Oxford. Binner, J. M., Bissoondeeal, R. K., Elger, T., Gazely, A. M. and Mullineux, A. W. (2005) A comparison of linear forecasting models and neural networks: an application to Euro inflation and Euro Divisia, Applied Economics, 37, 655-680. Broomhead, D. S. and Lowe, D. (1988) Multi-variable functional interpolation and adaptive networks, Complex Syst., 2, 321-355. Burger, C., Dohnal, M., Kathrada, M. and Law, R. (2001) A practitioners guide to time-series methods for tourism demand forecasting: A case study for Durban, South Africa, Tourism Management, 22, 403-409. Cang, S. (2013) A comparative analysis of three types of tourism demand forecasting models: individual, linear combination and non-linear combination, International Journal of Tourism Research, 15. Cho, V. (2003) A comparison of three different approaches to tourist arrival forecasting, Tourism Management, 24, 323-330. 18 Working Paper 2014/17, pàg. 19 Working Paper 2014/10, pag. 19 Research Institute of Applied Economics Regional Quantitative Analysis Research Group Choudhary, M. A. and Haider, A. (2012) Neural network models for inflation forecasting: an appraisal, Applied Economics, 44, 2631-2635. Claveria, O. and Torra, S. (2014) Forecasting tourism demand to Catalonia: Neural networks vs. time series models, Economic Modelling, 36, 220-228. De Gooijer, J. G. and Kumar, K. (1992) Some recent developments in non-linear time series modelling, testing and forecasting, International Journal of Forecasting, 8, 135-156. Dickey, D. A. and Fuller, W. A. (1979) Distribution of the estimators for autoregressive time series with a unit root, Journal of American Statistical Association, 74, 427-431. Dritsakis, N. (2004) Cointegration analysis of German and British tourism demand for Greece, Tourism Management, 25, 111-119. Diebold, F. X. and Mariano, R. (1995) Comparing predictive accuracy, Journal of Business and Economic Statistics, 13, 253-263. Eilat, Y. and Einav, L. (2004) Determinants of international tourism: a three dimensional panel data analysis, Applied Economics, 36, 1315-1327. Elman, J. L. (1990) Finding structure in time, Cognitive Science, 14, 179-211. Granger, C. W. J. and Terasvirta, T. (1993) Modelling nonlinear economic relationships, Oxford University Press, Oxford. Haykin, S. (1999) Neural networks. A comprehensive foundation, Prentice Hall, New Jersey. Johansen, S. (1988) Statistical analysis of co-integration vectors, Journal of Economic Dynamics and Control, 12, 231-254. Johansen, S. (1991) Estimation and hypothesis testing of co-integration vectors in Gaussian vector autoregressive models, Econometrica, 59, 1551-1580. Kim, D. and Shwartz, Y. (2013) The accuracy of tourism forecasting and data characteristics: a metaanalytical approach, Journal of Hospitality Marketing amd Management, 22, 349-374. Kwiatkowski, D., Phillips, P. C. B., Schmidt, P. and Shin, Y. (1992) Testing the null hypothesis of stationarity against the alternative of a unit root, Journal of Econometrics, 54, 159-178. Kon, S. C. and Turner, W. L. (2005) Neural network forecasting of tourism demand, Tourism Economics, 11, 301-328. Law, R. (1998) Room occupancy rate forecasting: A neural network approach, International Journal of Contemporary Hospitality Management, 10, 234-239. Law, R. (2000) Back-propagation learning in improving the accuracy of neural network-based tourism demand forecasting, Tourism Management, 21, 331-340. Law, R. (2001) The impact of the Asian financial crisis on Japanese demand for travel to Hong Kong: a study of various forecasting techniques, Journal of Travel & Tourism Marketing, 10, 47-66. 19 Working Paper 2014/17, pàg. 20 Working Paper 2014/10, pag. 20 Research Institute of Applied Economics Regional Quantitative Analysis Research Group Law, R. and Au, N. (1999) A neural network model to forecast Japanese demand for travel to Hong Kong, Tourism Management, 20, 89-97. Lee, K. N. (2011) Forecasting long-haul tourism demand for Hong Kong using error correction models, Applied Economics, 43, 527-549. Lin, C. J., Chen, H. F. and Lee, T. S. (2011) Forecasting tourism demand using time series, artificial neural networks and multivariate adaptive regression splines: Evidence from Taiwan, International Journal of Business Administration, 2, 14-24. MacKinnon, J. G., Haug, A. and Michelis, L. (1999) Numerical Distribution Functions of Likelihood Ratio Tests for Cointegration, Journal of Applied Econometrics, 14, 563-577. Madden, G. and Tan, J. (2008) Forecasting international bandwidth capacity using linear and ANN methods, Applied Economics, 40, 1775-1787. Palmer, A., Montaño, J. J. and Sesé, A. (2006) Designing an artificial neural network for forecasting tourism time-series, Tourism Management, 27, 781-790. Pattie, D. C. and Snyder, J. (1996) Using a neural network to forecast visitor behavior, Annals of Tourism Research, 23, 151-164. Phillips, P. C. B. and Perron, P. (1988) Testing for a unit root in time series regression, Biometrika, 75, 335346. Ripley, B. D. (1996) Pattern recognition and neural networks, Cambridge University Press, Cambridge. Santín, D., Delgado, F. J. and Valiño, A. (2004) The measurement of technical efficiency: a neural network approach, Applied Economics, 36, 627-635. Song, H. and Li, G. (2008) Tourism demand modelling and forecasting – a review of recent research, Tourism Management, 29, 203-220. Teixeira, J. P. and Fernandes, P. O. (2012) Tourism Time Series Forecast – Different ANN Architectures with Time Index Input, Procedia Technology, 5, 445-454. Tsaur, S. H., Chiu, Y. C. and Huang, C. H. (2002) Determinants of guest loyalty to international tourist hotels: a neural network approach, Tourism Management, 23, 397-405. Uysal, M. and El Roubi, M. S. (1999) Artificial neural Networks versus multiple regression in tourism demand analysis, Journal of Travel Research, 38, 111-118. Vlastakis, N., Dotsis, G. and Markellos, R. (2008) Nonlinear modelling of European football scores using support vector machines, Applied Economics, 40, 111-118. Zhang, G. and Qi, M. (2005) Neural network forecasting for seasonal and trend time series, European Journal of Operational Research, 160, 501-514. Zhang, G., Putuwo, B. E. and Hu, M. Y. (1998) Forecasting with artificial neural networks: the state of the art, International Journal of Forecasting, 14, 35-62. 20  5HVHDUFK,QVWLWXWHRI$SSOLHG(FRQRPLFV 5HJLRQDO4XDQWLWDWLYH$QDO\VLV5HVHDUFK*URXS  :RUNLQJ3DSHUSjJ :RUNLQJ3DSHUSDJ 32