Academia.eduAcademia.edu

Robust Tests for Convergence Clubs

SSRN Electronic Journal

In many applications common in testing for convergence the number of cross-sectional units is large and the number of time periods are few. In these situations asymptotic tests based on an omnibus null hypothesis are characterised by a number of problems. In this paper we propose a multiple pairwise comparisons method based on an a recursive bootstrap to test for convergence with no prior information on the composition of convergence clubs. Monte Carlo simulations suggest that our bootstrap-based test performs well to correctly identify convergence clubs when compared with other similar tests that rely on asymptotic arguments. Across a potentially large number of regions, using both crosscountry and regional data for the European Union we find that the size distortion which afflicts standard tests and results in a bias towards finnding less convergence, is ameliorated when we utilise our bootstrap test.

Faculty of Economics Cambridge Working Papers in Economics Cambridge Working Papers in Economics: 1873 ROBUST TESTS FOR CONVERGENCE CLUBS Luisa Corrado Thanasis Stengos Melvyn Weeks M. Ege Yazgan 21 December 2018 In many applications common in testing for convergence the number of cross-sectional units is large and the number of time periods are few. In these situations asymptotic tests based on an omnibus null hypothesis are characterised by a number of problems. In this paper we propose a multiple pairwise comparisons method based on an a recursive bootstrap to test for convergence with no prior information on the composition of convergence clubs. Monte Carlo simulations suggest that our bootstrap-based test performs well to correctly identify convergence clubs when compared with other similar tests that rely on asymptotic arguments. Across a potentially large number of regions, using both cross-country and regional data for the European Union we find that the size distortion which afflicts standard tests and results in a bias towards finnding less convergence, is ameliorated when we utilise our bootstrap test. Robust Tests for Convergence Clubs∗ Luisa Corrado† University of Rome Tor Vergata and University of Cambridge Thanasis Stengos University of Guelph Melvyn Weeks Faculty of Economics and Clare College, University of Cambridge M. Ege Yazgan Istanbul Bilgi University December 20, 2018 Abstract In many applications common in testing for convergence the number of cross-sectional units is large and the number of time periods are few. In these situations asymptotic tests based on an omnibus null hypothesis are characterised by a number of problems. In this paper we propose a multiple pairwise comparisons method based on an a recursive bootstrap to test for convergence with no prior information on the composition of convergence clubs. Monte Carlo simulations suggest that our bootstrap-based test performs well to correctly identify convergence clubs when compared with other similar tests that rely on asymptotic arguments. Across a potentially large number of regions, using both cross-country and regional data for the European Union we find that the size distortion which afflicts standard tests and results in a bias towards finding less convergence, is ameliorated when we utilise our bootstrap test. Keywords: Multivariate stationarity, bootstrap tests, regional convergence. JEL Classifications: C51, R11, R15. ∗ We are grateful to Andrew Harvey, Hashem Pesaran, Jonathan Temple, Mike Wickens, Gernot Dopplehoffer, Vasco Carvalho for their comments on the work. † Luisa Corrado gratefully acknowledges the Marie-Curie Intra European fellowship 039326. [email protected] 1 1 INTRODUCTION 1 Introduction The extent to which countries and or regions are similar across one or more dimensions is a question that has long been of interest to economists and policymakers. Within the European Union the ECB targets a single Euro Area inflation rate, and in this respect the degree to which there exists convergence in regional per capita incomes and output is of critical relevance to European regional development policies (Boldrin and Canova (2001)). Moreover, one of the core components of the European cohesion policy has been to reduce the disparities between income levels of different regions and in particular the backwardness of the least favoured regions; this objective has, in general, been manifest as the promotion of convergence between eu regions.1 In this context it is evident that the correct detection of the extent of convergence within a regional economy is paramount given that policy usually tries to achieve regional convergence by reducing the gap between the richest and the poorest regions. In this respect a test of convergence which exhibits bias, for example being oversized in small samples, will mislead, and in this instance imply less convergence suggesting the need for more policy initiatives than may actually be required. Economists have conceptualised the notion of similarity using formal definitions of convergence based upon growth theory. Standard neoclassical growth models (Solow (1956) and Swann (1956)) founded upon the key tenets of diminishing returns to capital and labour and perfect diffusion of technological change, dictate that countries will converge to the same level of per capita income (output) in the long run, independent of initial conditions. The New Growth theory (see, for example, Romer (1986); Lucas (1988); Grossman and Helpman (1994); Barro and Sala-i-Martin (1997)) allows for increasing returns to accumulable factors such as human capital in order to determine the (endogenous) long-run growth rate.2 The emergence over the past decade of New Economic Geography3 models of industrial location and agglomeration, has resulted in the identification of other forces which generate increasing returns, two notable examples being the relationship between location and transportation costs (Louveaux et al., 1982) and the effect of regional externalities 1 See Article 158 of the Treaty establishing the European Community. Other variants of the New Growth theories predict the emergence of multiple locally-stable steady-state equilibria instead of the unique globally-stable equilibrium of the neoclassical growth model as a result of differences in human and physical capital per worker across countries (Basu and Weil (1998)), their state of financial development (Acemoglu and Zilibotti (1997)) or other externalities caused by complementarity in innovation (Ciccone and Matsuyama (1996)). 3 In the ‘new economic geography’ models the sources of increasing returns are associated with Marshallian-type external localisation economies (such as access to specialised local labour inputs, local market access and size effects, local knowledge spillovers, and the like). These models provide a rich set of possible long run regional growth patterns that depend, among other things, on the relative importance of transport costs and localisation economies (Fujita, et al. 1999; Fujita and Thisse 2002). 2 2 1 INTRODUCTION (Cheshire and Hay (1989)). To the extent that the process of growth is different across regions in the sense that there are different long-run steady-states, the standard neoclassical growth model is not valid. In this context traditional approaches to test for convergence are hard to justify, difficult to interpret, and often difficult to implement. For example, a rejection of the omnibus null of convergence across a groups of regions provides increasingly less information as the number of regions increases and where prior knowledge over both the number and composition of convergence clubs is minimal. Moreover, the justification of constructing such a large intersection null hypothesis is often questionable at the outset. Faced with the emergence of larger panels, with an attendant increase in cross-sectional heterogeneity, there has been a number of significant developments in testing. For example, the use of a heterogeneous alternative hypothesis partially alleviates the problem of testing over a large group of potentially heterogenous regions (see, for example, Im et al., 2003). In a further progression away from the testing of general omnibus hypotheses, Pesaran (2007) conducts pairwise tests for region pairs, with inference focussed on the proportion of output gaps that are stationary. One drawback of this approach is that limited inference can be made as to the significance of individual gaps, or indeed whether a group of output comparisons form a convergence club. An approach which allows for an endogenous determination of the number of clubs using a sequence of pairwise stationarity tests has been developed by Hobijn and Franses (2000). In extending this approach Corrado, Martin and Weeks (2005) developed a testing strategy that facilitates both the endogenous identification of the number and composition of regional clusters (or ‘clubs’), and the interpretation of the clubs by comparing observed clusters with a number of hypothesized regional groupings based on different theories of regional growth. However, given that the time series are relatively short, there are potential problems in basing inference on asymptotic results for stationarity tests. Reliance on large T asymptotics is likely to impart a size distortion, biasing the results towards finding less convergence than actually exists. To circumvent this problem we propose in this paper a recursive bootstrap test for stationarity which is designed to detect multiple convergence clubs without prespecification of group membership. Monte Carlo simulations suggest that the proposed bootstrap based method performs quite well in identifying club membership when compared with the Hobijn and Franses (2000) approach that is based on asymptotic arguments. We implement our bootstrap recursive test of convergence using the original cross-country dataset used by Hobijn and Franses (2000) and the European regional data used in Corrado, Martin and Weeks (2005). We then compare the asymptotic and bootstrap generated cluster outcomes. Our results show that by 3 2 TESTS FOR CONVERGENCE resolving the size distortion which afflicts the asymptotic test we find considerably more evidence of convergence in both the applications considered. The paper is structured as follows. Section two reviews existing tests for convergence clubs, and in section three we present the bootstrap version of the test. In section four we propose a Monte Carlo experiment to compare the properties of the asymptotic and bootstrap tests. Section five describes the data and applies the proposed tests to two real word datasets. In section six we discuss our findings and conclusions. 2 Tests for Convergence In this section we briefly discuss a number of significant developments in tests designed to detect convergence and identify clubs which are able to address a number of questions such as whether a particular pair of countries have converged, or whether a group of regions or countries form a convergence club. We briefly discuss the different approaches to detect convergence tracking a gradual progression away from multivariate time series and panel data tests based on an omnibus null, towards sequential tests and tests that are founded upon multiple pairwise comparisons.4 Our focus here is to identify endogenously clubs using multivariate tests for stationarity. However, given that the time series are relatively short, we show that there are potential problems in basing inference on asymptotic results for stationarity tests. To circumvent this problem we bootstrap the stationarity test and assess the effect of the size distortion on the cluster outcomes using two different applications based on country and regional level data. The use of multivariate time-series to test for convergence was initiated by the seminal papers of Bernard and Durlauf (1995, 1996). Given a set ̥ of N economies, a multi country definition of relative convergence asks whether the long-run forecast of all output differences with respect to a benchmark economy, (denoted with the subscript 1) tend to a country-specific constant as the forecasting horizon tends to infinity.5 We may then write lim E(y(i1),t+s | It ) = µ1i s→∞ ∀i 6= 1, (1) 4 Corrado and Weeks (2011) provide a more detailed overview. A necessary condition for regions i and j to converge, either absolutely or relatively, is that the two series must be cointegrated with cointegrating vector [1, −1]. However, if output difference are trend stationary, this implies that the two series are co-trended as well as cointegrated. Hence a stronger condition for convergence is that output differences cannot contain unit roots or time trends (Pesaran (2007)). 5 4 2 TESTS FOR CONVERGENCE where y(i1),t+s = yit+s − y1t+s and µ1i is a finite constant.6 There exist a number of problems with multivariate time series tests. First, the testing procedure is sensitive to the choice of the benchmark country. Second, in keeping with the problems of omnibus tests, in the event of rejecting the non-convergence null we have no information as to which series are I(0) and I(1), nor the composition of any convergence groups. Third, given the system properties of the test, a dimensionality constraint means that it can handle only a small number of economies simultaneously. Panel unit root procedures have also been adopted to test for convergence by considering the stationary properties of output deviations with respect to a benchmark economy (Fleissig and Strauss, 2001; Evans, 1998; Carlino and Mills, 1993). First, the so called ‘first-generation’ panel unit-root tests,7 maintain that errors are independent across cross-sectional units which imparts a size distortion. To overcome this problem a ‘second generation’ of panel unit root tests have been developed which allows for different forms of cross-sectional dependence.8 However, as pointed out by Breitung and Pesaran (2008), panel data unit root tests poses similar problems in that as N becomes large the likelihood of rejecting the omnibus null increases with no information on the exact form of the rejection. The problem of identifying the mix of I(0) and I(1) series whilst still utilising the attendant power from a panel by exploiting coefficient homogeneity under the null, has been addressed by the sequential test proposed by Kapetanios (2003).9 Specifically, Kapetanios employs a sequence of unit root tests of panels of decreasing size to separate stationary and nonstationary series,10 facilitating an endogenous identification of the number and identity of stationary series. Although a positive development there are a number of limitations. Critically the utility of this approach depends on the use of a panel framework to add power in a situation where most series are stationary but very persistent. In addition, the method only permits the classification of the N series into two groups whereas there may be many more groups. As a consequence it is not possible to address a number of questions that may be of interest: such as whether a particular pair of countries have converged, or whether a group of regions or countries form a convergence club. When applied to output deviations, an additional problem with the Kapetanois 6 We consider this as a more reasonable definition of convergence in the sense that it allows the process of convergence to stop within a neighborhood of zero mean stationarity (absolute convergence) and is consistent with the existence of increasing costs of convergence. 7 See, for example, Maddala and Wu, 1999; Im et al., 2003; Levin, Lin and Chu, 2002. 8 For example, Taylor and Sarno (1998) adopt a multivariate approach and estimate a system of N − 1 ADF equations using Feasible GLS to account for contemporaneous correlations among the disturbances. Other notable example of second generation of panel unit root tests with cross-sectional dependence include Pesaran (2007) and Moon and Perron (2007). 9 See also Flores et al. (1999) and Breuer et al. (1999). 10 This method is referred to as the Sequential Panel Selection Method (SPSM). 5 2.1 Sequential Pairwise Tests 2 TESTS FOR CONVERGENCE test is that the testing procedure is still sensitive to the choice of the benchmark country. One approach which avoids the pitfalls of the choice of a benchmark country and, more generally, the dimensionality problem that afflicts the application of omnibus tests, is to conduct separate tests of either stationarity and/or nonstationarity. By considering a particular multi-country definition of convergence, Pesaran (2007) adopts a pairwise approach to test for unit-roots and stationarity properties of all N (N − 1)/2 possible output pairs {yit − yjt }. The definition of convergence that is adopted is that the N countries converge if Pr(∩i=1,...,N,j=i+1,...,N |yit+s − yjt+s | < c|It ) > π, (2) for all horizons, s = 1, ..., ∞, and c a positive constant. π ≥ 0 denotes a tolerance probability which denotes the proportion that one would expect to converge by chance. In testing the significance of the proportion of output gaps that indicate convergence, the dimensionality constraint that affects the application of system-wide multivariate tests of stationarity is circumvented. However, although the pairwise tests of convergence proposed by Pesaran (2007) is less restrictive than the omnibus tests proposed by Bernard and Durlauf (1995), the subsequent inference is limited in that it does not allow inference on which pairs of regions have converged, or the number and composition of convergence clubs. Below we examine various testing strategies for club convergence and in particular the sequential testing procedure proposed by Hobijn and Franses (2000). 2.1 Sequential Pairwise Tests Despite the use of multivariate time series and panel data methodologies to test for convergence, there has been relatively few attempts to utilize this approach to systematically identify convergence clubs (see Durlauf et al. 2005). The existing early methods were generally focused on the convergence of various a-priori defined homogeneous country groups which were assumed to share the same initial conditions. Baumol (1986) for example grouped countries with respect to political regimes (OECD membership, command economies and middle income countries), Chatterji (1992) allowed for clustering based on initial income per capita levels and tested convergence cross-sectionally, while Durlauf and Johnson (1995) grouped countries using a regression tree method based on different variables such as initial income levels and literacy rates that determined the different ”nodes” of the regression tree. Similarly, Tan (2009), utilizes a regression tree approach which again utilises 6 2.1 Sequential Pairwise Tests 2 TESTS FOR CONVERGENCE exogenous information in the form of conditioning variables. An alternative approach to the cross-sectional notion of β− convergence was introduced by Bernard and Durlauf (1995, 1996) based on a time series framework that makes use of unit root and cointegration analysis (see Durlauf et al. (2005) for a comprehensive literature review for convergence hypothesis). There are many other studies where the identification of convergence clubs has been achieved exogenously through testing in conjunction with a pre-classification of clubs using parametric techniques.11 For example, Weeks and Yao (2003) adopt this approach when assessing the degree of convergence across coastal and interior provinces in China over the period 1953-1997. As Maasoumi and Wang (2008) note, the principal problem with pre-classification is that as the number of regions increases such a strategy is not robust to the existence of other convergence clubs within each sub-group. A different approach is advanced using the notion of σ−convergence by Phillips and Sul (2007) who developed an algorithm based on a log-t regression approach that clusters countries with a common unobserved factor in their variance. In general it is straightforward to test whether two regions form a single group. We could simply construct a single output (income) deviation and test for stationarity or a unit root. In the context of differentiating between the stationarity properties of multiple series (or output deviations), the contribution by Kapetanios (2003) has provided new techniques that utilise the power of an omnibus null in conjunction with a sequential test that allows greater inference under the alternative hypothesis. However, for a large set of regions, ̥, locating partitions that are consistent with a particular configuration of convergence clubs generates difficulties given that the number of combinations is large and related, that we have little prior information.12 Hobijn and Franses (2000) propose an empirical procedure that endogenously locates groups of similar countries (convergence clubs) utilising a sequence of stationarity tests. Cluster or club convergence in this context implies that regional per capita income differences between the members of a given cluster converge to zero (in the case of absolute convergence) or to some finite, cluster specific non-zero constant (in the case of relative convergence). Below we illustrate the method. The Hobijn and Franses (2000) test represents a multivariate extension of the Kwiatkowski et al.(1992) test (hereafter KPSS test). We introduce the test by first 11 See Quah (1997) for an example of non-parametric techniques to locate convergence clubs. Harvey and Bernstein (2003), utilize non-parametric panel-data methods focussing on the evolution of temporal level contrasts for pairs of economies, identifying the number and composition of clusters. Beylunioğlu et al (2018) also propose a method based on the maximum clique method from graph theory that relies on Augmented Dickey Fuller (adf) unit root testing. However, as it will be explained below, the maximum clique method is a “top down” method that leads to a different definition of clubs and is not directly comparable to the “bottom up” Hobijn and Franses’ approach. 12 7 2.1 Sequential Pairwise Tests 2 TESTS FOR CONVERGENCE denoting yt = {yit } as the N × 1 vector of log per capita income and write yt as yt = α + βt + D t X vs + εt , (3) s=1 where α = {αi } is a N ×1 vector of constants, β = {βi } is a N ×1 vector of coefficients for the deterministic trend t, and vs = {vl,s }, l = 1, ..., m represents a m × 1 vector of first differences of the m stochastic trends in yt , m ∈ (0, ..., N ). D = {Di,l } denotes a N × m matrix with element Di,l denoting the parameter for the lth stochastic trend. εt = {εi,t } is a N × 1 vector of stochastic components. In considering the difference in log per capita income for regions i and j we write y(ij),t = α(ij) + β(ij) t + m X D(ij),l X t vl,s s=1 l=1  + ε(ij),t , (4) where (4) admits two different convergence concepts: absolute and relative. The restrictions implied by the null of relative convergence are β(ij) = 0 ∀i 6= j ∈ ̥ and D(ij),l = Dil − Djl = 0 ∀l = 1, ..., m, with the latter restriction indicating that the stochastic trends in log per capita income are cointegrated with cointegrating vector [1 − 1]. The additional parameter restrictions for the null hypothesis of absolute convergence are that α(ij) = 0 ∀i 6= j ∈ ̥. Both asymptotic absolute and relative convergence imply that the cross sectional variance of log per capita income converges to a finite level. t P Denoting the partial sum process St = y(ij),s , the test statistic for zero mean s=1 stationarity is given by13 Denoting ht = y(ij),t − 1 T 13 T P τb0 = T −2 T X ′ St [b σ 2 ]−1 St . (6) t=1 y(ij),t and the partial sum process as S̄t = t=1 t P hs the test s=1 For the stationary null, Hobijn and Franses (2000) utilise the Kwiatkowski et al. (1992) test. The KPSS test is operationalised by regressing the pairwise difference in per capita income y(ij),t against an intercept and a time trend giving residuals εb(ij),t = y(ij),t − α b(ij) − βb(ij) t. and σ b2 = (5) T L T X 1 X 1X 2 εb(ij),t + 2 ω(k, L) εb(ij),t εb(ij),t−k , T t=1 T k=1 t=k+1 represents the consistent Newey-West estimator of the long-run variance. ω(k, L) = 1 − k/(1 + L), k = 1, ..., L is the Bartlett kernel, where L denotes the bandwidth. 8 2.1 Sequential Pairwise Tests 2 TESTS FOR CONVERGENCE statistic for level stationarity is given by τbµ = T −2 T X ′ S̄t [b σ 2 ]−1 S̄t . (7) t=1 Examining (4), we note that in the case of two regions, and focussing on a test of relative convergence with restrictions D(ij),l = 0 ∀l = 1, ..., m and β(ij) = 0, it is obviously straightforward to test whether two regions form part of a single group. However, for a large number of regions locating the partitions over ̥ that are consistent with a particular configuration of convergence clubs is infeasible both because the number of combinations is large and related, that we have little prior information on the form of D and the likely combination of zeros restrictions over the differences β(ij) and α(ij) . The alternative testing strategy proposed by Hobijn and Franses (2000) forms groups from the bottom up using a clustering methodology to determine, endogenously, the most likely combination of restrictions, and as a consequence, the most likely set of convergence clubs. The cluster algorithm is based on the hierarchical farthest neighbour method due to Murtagh (1985). We illustrate the sequential test using the set of regions ̥ = {1, 2, 3, 4}. (i) We first initialise singleton clusters K(i) for each region i = 1, ..., 4. The null hypothesis of level stationarity is tested for all N (N −1)/2 = 6 region pairs. We b s=1 = {p(ij) }, where p(ij) = Pr(b collect p-values in the vector p τ(ij),µ < c(ij) |It ), τb(ij),µ denotes the test statistic and c(ij) the critical value. s = 1 denotes the first iteration. b s=1 , indicating the Clusters are formed on the basis of the max p-value in p pair of regions which are most likely to converge. If, for example, p(1,2) = {max{b ps=1 } > pmin } then regions 1,2 are the first pair of regions to form a i,j∈̥ 14 ′ club. We denote the first cluster as K(1 ) = {1, 2} and discard the singleton ′ cluster 2, which is now part of the two-region cluster K(1 ). ′ ′ (ii) In the second iteration (s = 2) we define the set of regions as ̥ = (1 , 3, 4). We form pairwise output differences between the N − 2 remaining singleton clusters and the two-region cluster K(1′ ). Once again we collect the p-values b s=2 . Letting p(r,v) = {max{b in the vector p ps=2 } > pmin }, then if, for i,j∈̥ example, p(r,v) = p(1′ 3) , the singleton cluster K(3) joins cluster K(1′ ) forming a ′′ three-region cluster K(1 ) = {1, 2, 3}. 14 The choice of pmin has a direct effect on the cluster size. Since the stationarity test is known to be oversized in small samples, this bias will generate inference towards finding less convergence. 9 3 A BOOTSTRAP TEST ′′ (iv) In this example we find a three-region cluster K(1 ) and a singleton cluster K(4), so the procedure stops. The principle difference between this sequential testing strategy and the SPSM approach of Kapetanios is that the SPSM test is designed to endogenously classify stationary and nonstationary series. This is achieved by sequentially reducing the size of the omnibus null by removing series with the most evidence against the unit root null, classifying these series as stationary. The stopping point is when the unit root null does not reject, such that all the remaining regions are declared nonstationary. In contrast the Hobijn and Franses method seeks to endogenously allocate N series to J ≤ N convergence clubs. This is achieved by only classifying regions that provide, at each recursion and conditional on exceeding pmin , the most evidence for convergence. Although the sequential multivariate stationarity test is consistent in that for large T the tests will reveal the true underlying convergence clubs, the principle shortcoming is that the test statistic is known to be oversized in small samples (Caner and Kilian, 2001). When testing for convergence using yearly data T is likely to be small, and as a result inference is likely to be biased in the direction of finding less convergence. Similar size distortions also emerge when the series are stationary but highly persistent: in this case the partial sum of residuals which are used to derive the KPSS test resemble those under the alternative in the limit. Below we outline a bootstrap approach which circumvents the pitfalls of inference based upon asymptotic arguments since it is able to generate independent bootstrap resamples using a parametric model which is conditional on the sample size and the dependence structure of the dataset. In section 5 we utilise this test to investigate the extent of convergence in two different applications: (i) the cross-country dataset originally adopted by Hobijn and Franses (2000); (ii) the European regional dataset used by Corrado, Martin and Weeks (2005). 3 A Bootstrap Test To derive the parametric model with which to create independent bootstrap samples under the stationarity null, following Kuo and Tsong (2005) and Leybourne and McCabe (1994), we exploit the equivalence in second order moments between an unobserved component model and a parametric ARIMA model (Harvey (1989)) for the differenced data. In demonstrating this equivalence we note that (4) may be rewritten in structural form as a function of a deterministic component (α(ij) + β(ij) t), 10 3 A BOOTSTRAP TEST a random walk (rt ) and a stationary error (ε(ij),t ): y(ij),t = α(ij) + β(ij) t + m X D(ij),l rl,t + ε(ij),t (8) l=1 rl,t = rl,t−1 + vl,t , (9) P where rl,t = ts=1 vl,s represents the l-th stochastic trend for regions i and j with r0 , the fixed initial value, set to zero. We also assume that ε(ij),t is a stationary P∞ error process ε(ij),t = s=0 ψ(ij),s u(ij),t−s = Ψ(L)u(ij),t where ψ(ij),0 = 1, u(ij),t ∼  P s 15 Under these assumptions ε(ij),t has i.i.d(0, σu2(ij) ) and Ψ(L) = 1 + ∞ s=1 ψ(ij),s L . an infinite order autoregressive representation ε(ij),t = ∞ X λ(ij),s ε(ij),t−s + u(ij),t , (10) s=1  P s where Λ(L) = Ψ(L)−1 = 1 + ∞ s=1 λ(ij),s L . Given (8), since ε(ij),t is a stationary process, the necessary condition for convergence of regions i, j is that the variance of the random walk error (σv2 ) is zero. Focussing on a test for relative convergence, below we describe the nature of the recursive multivariate stationarity test using critical values generated from the empirical distribution of the test statistic constructed using bootstrap sampling. In generating a bootstrap test for relative convergence we focus on relative convergence where β(ij) = 0, which rules out the presence of a deterministic trend.16 The idea is to estimate the null finite sample distribution of the KPSS test statistics by exploiting the equivalence between the unobservable component model and the parametric ARIMA model. Harvey (1989) demonstrates that the components from the structural model (8) can be combined to give a reduced form ARIMA(0,1,1) model. In particular, assuming independence of ε(ij),t and vt , (8) becomes a local component model which, after time differencing, can be expressed as the MA model ∆y(ij),t = (1 − θL)η(ij),t where η(ij),t ∼ i.i.d(0, ση2(ij) ) and ση2(ij) = σε2(ij) /θ. The reduced form parameter θ is derived by equating the autocovariances of first differences at lag one in the structural and reduced forms. This gives the following relationship between the parameters of the components model (8) and the P∞ ε(ij),t is assumed to be invertible s=0 s ψ(ij),s < ∞. 16 For the test of absolute convergence the restrictions are β(ij) = α(ij) = 0. 15 11 3 A BOOTSTRAP TEST ARIMA(0,1,1) model: where q = σv2 σε2  1  σv2 +2− θ= 2  σε2(ij) σv2 σε2(ij) + σ2 4 2v σε(ij) !1/2    . (11) is the signal to noise ratio. Under the stationarity null, namely (ij) that regions i, j are converging, the variance of the random walk component (σv2 ) is zero, which in turn implies that θ = 1 in the ARIMA representation. Therefore by imposing a moving average unit root in the ARIMA representation one can use the parametric model for sampling instead of the ”unobservable” component model. Our bootstrap sampling scheme is based on the following procedure. First, for each region pair i, j and contemporaneous difference y(ij),t = yi,t − yj,t , we fit an ARMA(p, 1) model to the differenced series ∆y(ij),t = y(ij),t − y(ij),t−1 , namely ∆y(ij),t = p X φ(ij),k ∆y(ij),t−k + η(ij),t − θη(ij),t−1 , (12) k=1 The MA component in (12) follows from the reparametrisation of the structural component model to reproduce the stationarity properties of the data in the ARMA representation. The AR(p) component17 represents an approximation to the assumed infinite-order moving average errors to capture the dependence structure in the data. By imposing a moving average unit root in the sampling procedure we can then construct the bootstrap distribution of the test statistic for level stationarity defined in (7). The accuracy of the bootstrap test relative to the asymptotic approximations hinges on the bootstrap sample being drawn independently. Given the presence of a known dependence structure, in this case a stationary ARMA(p, 1) model, we utilise the Recursive Bootstrap.18 To achieve independent re-sampling from (12) we estimate r φ̂(ij),k and η̂(ij),t , and we draw a bootstrap sample {η̄(ij),t }Tt=1 from the distribution of PT 1 b(ij),t . centered19 residuals {η̄(ij),t }Tt=1 , where η̄(ij),t = ηb(ij),t − T −1 t=2 η r Given the bootstrapped residuals, {η̄(ij),t }Tt=1 , the rth bootstrap sample for the 17 p denotes optimal lag length, chosen using the AIC criterion. See Horowitz (2001) on the merits of the recursive bootstrap for linear models, and Maddala and Li (1997) and Efron and Tibshirani (1986) for specific examples. 19 Centering the residuals reduces the downward bias of autoregression coefficients in small samples (Horowitz, 2001). 18 12 4 A MONTE CARLO STUDY r data {∆y(ij),t }Tt=1 is generated based on the recursive relation20 r ∆y(ij),t = p X k=1 r r r φb(ij),k ∆y(ij),t−k + η̄(ij),t − η̄(ij),t−1 . (13) We then recover the level of the series (where the level denotes the contemporaneous regional difference) directly from (13) r y(ij),t = r y(ij),t−1 + p X k=1 r Defining hrt = y(ij),t − 1 T T P t=1 r r r φb(ij),k ∆y(ij),t−k + η(ij),t − η(ij),t−1 . (14) r y(ij),t , then for rth bootstrap sample, and the i, j region r pair, a test statistic for relative convergence, τb(ij),µ , is given by where S̄tr = t P s=1 r τb(ij),µ =T −2 T X r,′ S̄t [b σ r,2 ]−1 S̄tr , (15) t=1 hrs . For each region pair we draw R bootstrap samples and construct B the empirical distribution of the test statistic under the null, which we denote τ(ij),µ . B Bootstrap critical values C(ij),µ can then be recovered at the required significance levels and we can implement the algorithm described in section 3 utilising a vector of bootstrapped empirical p-values, p̂B . 4 A Monte Carlo Study In this section we compare the performance of the bootstrap version of the kpss test, cw henceforth, with the original hf test based on the asymptotic version of the multivariate kpss testing procedure. To the best of our knowledge there is no other comparable Monte Carlo study in the literature that evaluates clustering methods in the same context as we do here. hf is a method that relies on a “bottom up” algorithm that clusters groups one by one. To determine whether a set of countries is convergent, hf applies a multivariate stationarity test to panels comprised of consecutive pairwise difference series set elements and confirms convergence if the null hypothesis of stationarity 20 r r r r r r Initial values, ∆y(ij),t−1 = y(ij),t−1 − y(ij),t−2 = ... = ∆y(ij),t−p = y(ij),t−p − y(ij),t−p−1 are set to zero. 13 4.1 Monte Carlo Structure 4 A MONTE CARLO STUDY of the panel is not rejected using the kpss test. For example, if we want to test the convergence of countries 1,2,3 and 4, a panel consisting of y 12 , y 13 , y 23 , y 14 , y 24 and y 34 is subjected to the kpss test, where for example y 12 ,y 23 , and y 34 denote the difference in log per capita between countries 1 and 2, 2 and 3 and 3 and 4 respectively. If stationarity cannot be rejected the panel is then augmented with series other than 1, 2, 3 and 4, each added separately. If then for each of these additional panels the stationarity null is rejected, then these four countries are said to be convergent. 4.1 Monte Carlo Structure In this subsection, we will discuss the data generating processes that is used in our Monte Carlo study. We generated a number of datasets to conduct the evaluation of the clustering methods cw and hf that we compare. We will examine the performance of these methods to determine success rates in detecting club membership for various parameter configurations including the number of countries, club size, time span and number of clubs. The analysis is carried out for two separate cases. In the first case we analyze single club data, while in the second case we include multiple clubs. Below we present the data generating processes and evaluation procedures employed in this study. A similar design was used by Beylunioğlu et al (2018) in assessing the properties of the maximum clique method, an alternative clustering mechanism that relies on Augmented Dickey Fuller (adf) unit root testing. However, the maximum clique method is a “top down” method that leads to a different definition of clubs and is not considered in the present comparison.21 Data Generating Processes The simulation assumes that the log gdp series for region i is given by yit = αi + di rt + ǫit , (16) where ǫit ∼ I(0) is the error term and rt is the common factor which affects all countries the same way (such as technology). If we assume non-stationarity of the factor, a pair of countries can only be convergent if the country specific constants, di , 21 Another method developed by Phillips and Sul (2007) stands out by means of not requiring a priori classification of countries. However, we exclude this method for the reason that it is based on the notion of σ convergence. The method depends on the definition of convergence by means of reduction of variance over time and thus convergence of series to a steady state. Therefore, it is not appropriate to compare this method with HF and the CW method developed in this study as both of the latter deal with convergence of the mean (function) of the series. 14 4.1 Monte Carlo Structure 4 A MONTE CARLO STUDY which measure the impact of common factor are equal. In other words, for the pair i and j, if i = dj , rt is canceled out and yit − yjt becomes αi − αj + ǫit − ǫjt . In this case, since the error terms are assumed to be stationary, we have αi − αj + ǫit − ǫjt ∼ I(0) and the pair i and j would be convergent by definition. Likewise, for any subset of countries having equal di , all pairwise difference series in that subset would be stationary and hence these countries would constitute a convergence club. Finally, the constants, αi are country specific and are generated once for all data sets. The non-stationarity of rt is modeled using the arima process rt = rt−1 + vt , vt = ρv vt−1 + et , et ∼ iid N (0, 1 − ρ2v ), (17) where we allow ρv = {0.2, 0.6} as separate cases. In addition we also allow the error term of the log gdp series in equation (16) to have serial dependence, following the specification ǫit = ρi ǫi,t−1 + vit , vit ∼ iid N (0, σv2i (1 − ρ2i )), (18) where we assume that the error terms vit are i.i.d. distributed Normal random variables. The autoregressive coefficient ρi and σv2i are country specific and invariant among the single and multiple clubs datasets. We generated the coefficients to have the following property. 2 σvi ∼ iid U [0.5, 1.5], ρi ∼ iid U [0.2, 0.6] To generate a dataset containing a single club the coefficients of the m convergent countries are assumed to be di = dj = 1. For the remaining (N − m) countries, di is generated randomly as di ∼ iid Xm2 . Similarly, we also generate country specific constants as αi ∼ iid Xm2 . For multiple clubs, in order to assess successful detection in club membership we want to make sure that there are some non-convergent countries present in the data that do not belong to any club. In that case, the value m of club size, when the number of clubs (k) and the number of countries (N ) are given, is chosen in such a way as to allow for at least a pair of non-convergent countries to be present in order to evaluate successful converging behaviour. For a given k and N , the clubs sizes m’s are randomly drawn from a Poisson distribution with a rate of λ = N/k. For each N , random draws are repeated k times.22 The simulations are repeated 2000 times using different combinations of T = 22 Obviously we did not allow the sum of m to exceed N , if this happens we redraw the last club size. 15 4.1 Monte Carlo Structure 4 A MONTE CARLO STUDY {50, 100} time intervals, N = {10, 20, 30} count of countries, m = {3, 5, 7, 10} number of club members for the single club case. In the multiple club case we considered T = {50, 100}, N = 10, m = {3, 5} and k = {2, 3} number of clubs.23 4.1.1 Testing and Evaluating Procedures To evaluate convergence we utilise evaluation tools from the literature on forecasting. The first one is the Kupiers Score (ks), while the second one is the Pesaran and Timmermann (1992) (pt) test statistic commonly used in the forecasting times series literature for the evaluation of sign forecasts. It is worth noting that sign forecasts are used for predicting whether an underlying series would increase relative to a benchmark such as, for example, a zero return threshold. This test is cast in terms of a binary process where success is the increase relative to the chosen benchmark. In our case we take a “success” as the correct detection of a country’s membership in a club. In the context of forecasting, this is equivalent to success in forecasting the sign of a time series. Since granting membership into a club or denying it can occur randomly,24 ks takes the correct forecasts and false alarms into account separately. ks is defined as H − F where OI II , and F = . H= II + IO OI + OO I (O) are binary indicators indicating whether the country under investigation is a member (not a member) of a given club. In considering the pairs of letters, the first letter indicates whether the country is found to be a member in the Monte Carlo experiment, while the second letter denotes its actual membership state (i.e. whether the country is actually in the club or not). II then indicates that a country as a member of the club is correctly identified; OO denotes that a country is correctly identified as not a member of the club. Furthermore, IO indicates that a country is detected to be a member of the club, while actually it is not (false detection). OI refers to the reverse case where the country is misclassified as being outside, even though it is a member of the club (false alarm). The ratio H captures the rate of “correct hits” in detecting club membership, whereas F denotes the “false alarm” rate, that is the rate of false exclusions. As in the case of sign prediction in the forecasting literature, success can be the outcome of a pure chance probability event of 0.5. Hence, to test the statistical 23 The computational burden for larger values of m, k and N proved to be be quite high at this point. 24 This is similar to expecting an unbiased coin to come up heads with 50% probability. 16 4.2 Simulation Results 4 A MONTE CARLO STUDY significance of ks, we will employ the following pt statistic PT = c∗ Pb − P ∼ N (0, 1). c∗ )] 21 [Vb (Pb) − Vb (P Pb refers to the proportion of correct predictions (correct detections of countries as c∗ denotes being a member or non member) over all predictions (N countries), and P the proportion of correct detections under the hypothesis that the detections and actual occurrences are independent (where success is a random event of probability c∗ ) stand for the variances of Pb and P c∗ respectively. 0.5). Vb (Pb) and Vb (P In simulations involving multiple clubs, it is not possible to use either the ks or the pt statistic given that the success/failure classification is no longer binary - as in the case of the single club case. In the multiple club case there are more than two distinct cases for the actual membership state: the country can be either a member of the correct club, belong to the “wrong” club, or not be a member of any club. To confront this problem, in the case of multiple clubs we utilise a much stricter criterion by counting the successful cases in our simulations in which all countries are detected correctly. We do not evaluate success as a binary outcome, country by country as in the case of a single club in each replication, but we only count as success having all countries satisfying the convergence condition. This is a much stricter criterion given that success depends on the overall results in each replication in which all countries are detected correctly. 4.2 Simulation Results Below we discuss the findings of the simulations based on the data generating processes of club formation. The comparison involves the bootstrap version of the kpss test (henceforth cw) proposed in this paper, and the original hf test based on the asymptotic version of the multivariate kpss test. 4.2.1 Single Club Results The results are presented in Table 1 for 0.05 and 0.10 significance levels.25 The total number of countries N are set at (N = 10, 20, 30); there are two choices of time span (T = 50, 100) that mimic the real data time span availability; and two choices of the persistence parameter (ρv = {0.2, 0.6}). It is expected that as the number of 25 We also have the results for the 0.01 significance level but to conserve space we do not report them. They are available from the authors on request. 17 4.2 Simulation Results 4 A MONTE CARLO STUDY countries N and club size m increase, the likelihood of an incorrect classification will also increase, but the opposite will be the case for an increase of the time span T for given N and m. As seen in Table 1, cw outperforms hf in all categories. For example, with m = 3, N = 10, T = 50 and ρ = 0.2 (configuration A), and significance levels 0.05 and 0.10, the ks results (the “correct hit” ratio net of “false alarms”) for cw are, respectively, 0.87 and 0.89. The comparable numbers for hf are 0.63 and 0.66. Similarly, for the cases with m = 10, N = 30, T = 100 and ρ = 0.6 (configuration B), cw with 0.77 and 0.76 outperforms the hf method - 0.60 and 0.60. The results are in line with our prior expectations that larger N and m values would result in lower success rates. However, in all cases the cw test does better. The pt statistics26 for configuration A yield values 1.91 and 2.02 for hf and 2.51 and 2.49 for cw; for configuration B the hf values are 2.86 and 2.97; with 3.37 and 3.48 for cw. Note that the rejections of the null hypothesis of random success outcomes are higher with the pt test for cw in all cases. The results clearly demonstrate that the cw, bootstrap kpss test offers a significant improvement over the hf procedure. 4.2.2 Multiple Club Results The results for the multiple club case are presented in Table 2. The multiple clubs cases involve classifications with k = 2 and 3 and N = 10 for T = {50, 100} and ρv = {0.2, 0.6}. The club sizes associated with each club are listed in the second column of Table 2 for each k. For example, the entry 4, 4 for club size m refers to two clubs of equal size 4, for k = 2. In the case of k = 3, m enters as 3, 3, 2, that is two clubs of size 3 and one club of size 2. The cw test outperforms hf in the majority of cases. For example, with N = 10, k = 2, T = 100 and ρv = 0.6, cw detects 54%, 45% and 37% correct classifications at the 0.01, 0.05 and 0.10 significance levels; hf does that with frequency 44%, 38% and 34.80% respectively. For the case when k = 2 and T = 50, hf does slightly better than cw, but when k = 3 the performance of hf deteriorates rapidly. In that case when N = 10, k = 3, T = 100 and ρv = 0.2, cw detects 27%, 25% and 22% correct classifications, while the comparable results for hf, respectively, are 10.40%, 8.20% and 5.60%. Since we have adopted a much stricter criterion where success is defined as all countries detected correctly, we do expect lower rates of correct detection than was the case for single clubs. In all cases, we see an improvement for cw when the number of clubs increases even when T is relatively small, but not for hf. 26 The PT statistic follows an asymptotic standard normal variate. 18 5 APPLICATIONS Overall, the multiple club results suggest that in terms of accuracy the cw does better in detecting the presence of clubs or clusters of countries. This gives us confidence that applying the above method to real data can provide us with useful insights about how countries over time collect themselves into different club formations of similar characteristics as far as economic activity is concerned. 5 Applications As shown in the Monte Carlo study a problem with the asymptotic test is that it does not permit reliable inference with only 30 years of data. Using an asymptotic test of the null of stationarity (convergence) tends to distort club membership detection due to size distortions which are ameliorated when we implement the bootstrap. That is, the test is oversized resulting in a tendency to reject the null hypothesis of convergence. In this section we assess the extent to which a size distortion affects our inference on the degree of convergence using two real-world datasets. We first compare the results of the asymptotic and bootstrap tests using the cross-country dataset originally adopted by Hobijn and Franses (2000). We then utilise data gathered at a finer geographical and sectoral scale by making the same comparison based upon the European regional dataset used by Corrado, Martin and Weeks (2005). We expect to resolve the size distortion which afflicts the asymptotic test and to find more evidence of convergence than what originally acknowledged by Hobijn and Franses (2000). 5.1 Cross-Country Convergence In this section we compare the asymptotic and the bootstrap results using the Hobijn and Franses (2000) dataset for the period 1960-1989 which comprises 112 countries from the Penn World Table listed in Table 3. Focussing upon log per capita gdp, the results based upon the asymptotic test have a striking feature, namely a very large number of convergence clubs. In particular, Hobijn and Franses (2000) find 63 asymptotically perfect convergence clubs and 42 asymptotically relative27 convergence clubs.28 As Table 4 shows, in the case of perfect convergence the lack of convergence is manifest in 29 singletons and 22 two-country clusters. A similar result can be observed in the case of relative convergence where Hobijn and Franses (2000) find a large number of two and three-country clusters. The lack of convergence is also 27 Note that perfect convergence implies convergence to identical log real GDP per capita levels. Relative convergence implies convergence to constant relative real GDP per capita levels. 28 These are the results using pmin = 0.01 and L = 2 as presented in Tables BII and BIII of the Hobijn and Franses (2000) paper. 19 5.1 Cross-Country Convergence 5 APPLICATIONS evident in the fact there are no clusters of size six or more for asymptotic perfect convergence and only one club of size six for relative convergence. We therefore implement the bootstrap version of the test on the same dataset and find a significant increases in the extent of convergence. Specifically, in the perfect convergence case and relative to the asymptotic results, we observe a 57% reduction in the number of convergence clubs (from 63 to 27); for relative convergence the reduction is 38% (from 42 to 26). In other words, there is evidence towards finding more convergence. Looking at the change in the distribution of cluster sizes for perfect convergence, we observe a dramatic reduction (by 96%) in the number of singletons (from 29 to 1) and by 81% in the number of two-country clusters (from 22 to 4). Commensurate with this finding, we note that countries are now clustering at a larger scale with two clubs having up to seven countries and with a substantial increase in the number of clusters containing five and six countries. A similar increase in the degree of convergence can be observed in the case of relative convergence. Tables 5 and 6 report the asymptotic and bootstrap cluster composition for relative convergence. A number of noteworthy observations can be made. We confirm the findings of Hobijn and Franses that convergence is more widespread among low income economies, and in particular Sub-Saharan Africa. Similarly we find that in general low income countries do not converge to high income. The two exceptions to this found in the asymptotic results, namely Kenya and Ecuador forming clubs alongside Australia and Denmark and Canada, are not found in the bootstrap results. In contrast to Hobijn and Franses we do find a significantly higher degree of convergence, both amongst low income and high income countries. The results based on the asymptotic test indicate very little convergence for the richer economies with all groups of size two. The bootstrap test locates a greater degree of convergence, for example, cluster 11 (Germany, Denmark, France, Luxemburg and New Zealand) and cluster 17 (Belgium, Great Britain, Netherlands and Norway). In the next section we apply the asymptotic and bootstrap version of the test to the European regional dataset originally used by Corrado, Martin and Weeks (2005). A critical difference with respect to the analysis undertaken at the aggregate country level, is that we allow for the possibility that convergence is more prevalent at a sector-specific level, and in addition consider a smaller geographical unit. Much of the theory of convergence highlights the potential role of technology spillovers as one of the possible drivers of convergence. As a result, in what follows we move away from an aggregate analysis to considering how convergence differs across agricultural, manufacturing and service sectors. 20 5.2 European Regional Convergence 5.2 5 APPLICATIONS European Regional Convergence In the following sections we examine the extent of regional convergence within the eu. Regional convergence – or what the European Commission calls ‘regional cohesion’ – is a primary policy objective, and is seen as vital to the success of key policy objectives, such as the single market, monetary union, eu competitiveness, and enlargement (European Commission, 2004). As a result, the theory of and evidence on long-run trends in regional per capita incomes and output are of critical relevance to the eu regional convergence and regional policy debate (Boldrin and Canova, 2001). Indeed, according to Fujita et al. (1999), the implications of increasing economic integration for the eu regions has been one of the factors behind the development of the ‘new economic geography’ models of regional growth. To date, however, very few of these models have been tested empirically on eu evidence. In response to the policy and research questions outlined above our empirical analysis will be framed around the identification of regional convergence clubs in the eu. To identify regional convergence clusters we use the method introduced by Hobijn and Franses (2000) which allows for the endogenous identification of the number and membership of regional convergence clusters (or ‘clubs’) and compare the results of the bootstrap and asymptotic versions of the test to assess the differences in terms of number, size and composition of the resultant clusters. 5.2.1 Data The so-called Nomenclature of Statistical Territorial Units (nuts) subdivides the economic territory of the 15 countries of the European Union using three regional and two local levels. The three regional levels are: nuts3, consisting of 1031 regions; nuts2, consisting of 206 regions; and nuts1 consisting of 77 regions. nuts0 represents the delineation at the national level and comprises France, Italy, Spain, uk, Ireland, Austria, Netherlands, Belgium, Luxemburg, Sweden, Norway, Portugal, Greece, Finland, Denmark and West Germany. We are aware of the problems that surround the choice of which spatial units to use.29 For example, many of the regional units used by eurostat have net inflows of commuters and in addition, these regions also tend to be those with the highest per capita income. Boldrin and Canova (2001) criticize the European Commission for utilizing inappropriate regional units. Whereas nuts1, nuts2 and nuts3 regions are neither uniformly large or sufficiently heterogeneous such that a finding of income divergence across regions cannot unequivocally be taken as evidence for the existence of an endogenous 29 Chesire and Magrini (2000) provide a useful discussion of these issues, focussing on the importance of centering the analysis on regions that are self-contained in labour market terms. 21 5.2 European Regional Convergence 5 APPLICATIONS cumulative growth processes. In fact, the smaller the geographical scale, the more incomplete and fragmented is the statistical information we can get. Although we do not wish to detract from the importance of these matters, in this study our primary focus is a comparison of two different tests for regional convergence for which the unit of analysis is the same. In conducting our analysis we choose to focus on nuts1 regions, achieving a compromise between the availability of reliable data at a regional level which is sufficiently homogeneous, and the need to move beyond national borders. The complete list of nuts1 regions30 used in this study is given in Table 7. We use regional data on Gross Value Added31 per worker for the period 1975 to 1999 for the agriculture, manufacturing and services sectors. Although data are available for more recent years, we focus on this particular time frame to facilitate a comparison with the results of Corrado, Martin and Weeks (2005). The service sector has been further sub-divided into market and non-market services: market services comprise distribution, retail, banking, and consultancy; non-market services comprise education, health and social work, defence and other government services. 5.2.2 Results In this section we present the main results of our analysis. Given the large number of eu regions in Figures 1 and 2 we first present the results for the asymptotic and bootstrap test of convergence in mapped rather than tabular form. Table 8 summarises this information in terms of the number and size of the convergence clubs and group characteristics, such as average per-capita income. 5.2.3 Graphing Convergence Clusters In Figures 1 and 2 clusters which contain the largest number of member regions are indicated with a darker shade on each map. Regions which belong to two-region clusters or do not cluster with any other region have no shading. In the key to the maps, the first number indicates the cluster size and the second letter denotes the cluster identifier. In Figure 1 maps a) and b) ( c) and d)) present the asymptotic and 30 For Portugal, Luxemburg and Ireland, data are only available at the nuts0 level. For Norway we have no data at the nuts1 level. Time series data for the sample period considered are not available for East Germany, which is therefore excluded from the analysis. 31 gva has the comparative advantage with respect to gdp per capita of being the direct outcome of various factors that determine regional competitiveness. Regional data on gva per-capita at the nuts1 level for agriculture, manufacturing, market and non-market services, have been kindly supplied by Cambridge Econometrics, and are taken from their European Regional Database. All series have been converted to constant 1985 prices (ecu) using the purchasing power parity exchange rate. 22 5.2 European Regional Convergence 5 APPLICATIONS bootstrap generated outcomes for agriculture (manufacturing). The relative pattern of convergence corroborates with our prior expectations, namely that the bootstrap test is obviously rejecting the stationary null with a lower frequency and thereby locating more evidence for convergence. In Figure 2 we find a similar pattern for market and non-market services. In Table 8 we present the frequency distribution of the cluster size for both bootstrap and asymptotic tests and for each32 economic sector. Row totals provide an indication of the degree of convergence for each economic sector. Column totals provide information on the number of convergence clubs across sectors by cluster size. The asymptotic results are displayed in panel I and the bootstrap results are displayed in panel II. Overall, we observe a common pattern, namely a shift in the probability distribution towards a fewer number of clusters of larger size, and a commensurate increase in the extent of regional convergence. The total number of clusters for the asymptotic tests is 81, which falls by 32% to 55 clusters for the bootstrap test. This pattern is repeated for all sectors. Comparing column totals across the two tests is also informative since it gives the total number of clusters by cluster size, also shown in Figure 3. For the asymptotic test, more than 80% of the probability mass is distributed in clusters of size 4 or less, with approximately 10% of clusters of size 6 or more. In contrast, for the bootstrap test, approximately 50% of the clusters have a cluster size of 4 or less, with approximately 40% of clusters of size 6 or more. Examining the results for each sector, for agriculture the size of the largest cluster generated by bootstrap critical values increases from seven to ten regions, with a commensurate decrease in the number of clusters of size 5 or less. Similarly for the manufacturing sector we observe an increase in the size of the largest cluster from six to nine regions and a decrease in the number of clusters of size 4 or less. In the market-service sector there is a reduction in the size of the largest cluster from nine to eight and for non-market services there is no change in the size of the largest cluster, but a substantial increase in clustering at the medium scale. In both service sectors there is a decrease in the number of clusters of size 4 or less. Cluster Composition In establishing whether the composition of the clusters (i.e. the constituent regions) is changing between the two tests, we first collect the asymptotic (A) generated cluster outcomes in a N × N matrix MA = {mA ij }; element A mij equals to 1 if regions i and j belong to the same cluster and zero otherwise. MB = {mB ij } denotes the same for the bootstrap (B) generated cluster outcomes. The correlation parameter between the asymptotic, MA , and the bootstrap cluster 32 In order to directly compare the bootstrap and asymptotic results in Corrado et al. (2005) we set pmin to be equal to 0.01 and the bandwidth L = 2. The number of bootstrap samples is set at 200. 23 5.2 European Regional Convergence 5 APPLICATIONS pattern, MB , is then given by  1/2 N P N P A   mB ij × mij   i=1 j6=i   ζl =   ! ! 1/2 1/2   P N P N N P N P   B A mij mij i=1 j6=i , (19) i=1 j6=i where l indexes the set {Agriculture, Manufacturing, Market Services, Non-Market Services}. The results are reported in panel III of Table 8. With correlation coefficients ranging between 50% for manufacturing and 67% for agriculture, we note further evidence of a significant difference in the composition of the clusters generated by the asymptotic and bootstrap tests.33 Mean Income In order to assess the properties of each cluster we compute mean log per-capita income,34 ȳg for each test. The top panel of Figure 4 shows that the asymptotic test generates a distribution with a large number of small clubs while in the bootstrap test there are a fewer number of clusters of larger size. A visual impression of the oversized property of the asymptotic test of convergence is also evident in the distribution of the cluster mean of log per-capita income and in a relatively higher right kurtosis of this distribution, as presented in the lower panel of Figure 4. In this case an overrejection of the convergent null generates a distribution with a large number of small clubs characterised by a higher mean log per-capita income which results in a widening of the gap between the poorest and the richest clusters. In examining the comparable bootstrap distribution we observe a marked decrease in right kurtosis and a commensurate narrowing of the gap between the richest and the poorest cluster. Summary statistics are provided in the last three columns of panels I and II of Table 8. Note that for the bootstrap distribution the reduction in the gap between the richest and the poorest clusters is evident in a lower standard deviation of mean cluster per-capita income (from 15.2 to 5.4). The narrowing of the gap between the richest and poorest cluster translates into an increase in mean log per-capita income of the poorest cluster, ȳmin , by around 24% (from 9.4 to 11.7) and a decrease in mean log per-capita income of the richest cluster, ȳmax , by almost 50% (from 103 to 62.6). These results demonstrate the importance of the correct identification of convergence clubs. Given that many policy instruments are designed 33 The method used in this paper to locate convergence clubs bypasses the particular problem of exactly how to utilize conditioning information in the model specification. Corrado and Weeks (2011) provide further information on how to interpret the results by confronting the resulting cluster composition, for both the asymptotic and the bootstrap tests, with a set of hypothetical clusters based on different theories and models of regional growth and development. 34 Mean income is the cluster mean of log per-capita GVA. 24 6 CONCLUSIONS to reduce the gap between the richest and the poorest regions, basing inference and policy decisions on the results of the asymptotic test would indicate the need for a stronger action than is actually needed when looking at the bootstrap test outcomes. 6 Conclusions This study represents an extension of the multivariate test of stationarity which allows for endogenous identification of the number and composition of regional convergence clusters using sequential pairwise tests for stationarity. The main drawback of this approach is the short time-horizon which affects the size of the test. Our proposed bootstrap based extension to the sequential pairwise multivariate tests for stationarity performs well in Monte Carlo simulations in identifying and detecting correctly cluster membership when compared with the asymptotic version of the Hobijn and Franses (2000) approach. Based upon Monte Carlo evidence comparing the performance of cw with hf varying the number of countries, data span, club size and degree of persistence, indicate that detection rates of club membership (net of misclassifications) improve considerably when we implement the bootstrap. In operationalizing a bootstrap test of multivariate stationarity our results confirm the oversized property of the asymptotic test, and reveal a significantly greater degree of convergence. This evidence is gathered using both cross-country and regional data for the European Union for a number of industrial sectors. Our results show that by resolving the size distortion which afflicts the asymptotic test we find considerably more evidence of convergence in both the applications considered. 25 References [1] Acemoglu, D. and Zilibotti, F. (1997). ‘Was Prometheus Unbounded by Chance? Risk, Diversification and Growth’, Journal of Political Economy, 105(7), pp. 710-751. [2] Barro, R. J. and Sala-i-Martin, X. (1997). ‘Technological Diffusion, Convergence and Growth’, Journal of Economic Growth, 2(1), pp. 1-26. [3] Basu, S. and Weil, D.N. (1998). ‘Appropriate Technology and Growth’, Quarterly Journal of Economics, 113, pp. 1025-1054. [4] Baumol, W.J. (1986). ‘Productivity Growth, Convergence and Welfare: What are the Long-Run Data Show’, The American Economic Review, 76, pp. 1072-1085. [5] Bernard, A. B. and Durlauf, S. N. (1995). ‘Convergence in International Output’, Journal of Applied Econometrics, 10(2), pp. 97-108. [6] Bernard, A. B. and Durlauf, S. N. (1996).‘Interpreting Tests of the Convergence Hypothesis’, Journal of Econometrics, 71(1-2), pp. 161-73. [7] Boldrin, M. and Canova, F. (2001). ‘Inequality and Convergence: Reconsidering European Regional Policies’, Economic Policy, 16(32), pp. 207-53. [8] Breuer, Boucher J. , McNown, R., and Wallace, M. (1999). ‘Series-specific Tests for a Unit Root in a Panel Setting with an Application to Real Exchange Rates’, Discussion Paper No. 99-9, University of Colorado, June. [9] Breitung, J. and M.H. Pesaran (2008). ‘Unit Roots and Cointegration in Panels’, by L. Matyas, and P. Sevestre (eds), The Econometrics of Panel Data (Third Edition), Ch. 9, pp. 279-322. [10] Beylunioğlu, F., Stengos, T. and Yazgan, E. (2018), ‘Detecting Convergence Clubs’, Macroeconomic Dynamics, forthcoming, doi:10.1017/s1365100518000391 [11] Caner, M. and Kilian, L. (2001), ‘Size Distortions of Tests of the Null Hypothesis of Stationarity: Evidence and Implications for the PPP Debate’, Journal of International Money and Finance, 20, pp. 639-657. [12] Carlino, G. A. and Mills, L. (1993). ‘Convergence and US States: A Time Series Analysis’, Journal of Monetary Economics, 32, pp. 335-46. 26 [13] Chatterji, M. (1992). ‘Convergence Clubs and Endogenous Growth’, Oxford Review of Economic Policy, 8(4), pp. 57-69. [14] Cheshire, P. C. and Magrini, S. (2000). ‘Endogenous Processes in European Regional Growth: Convergence and Policy’, Growth and Change, 31, pp. 455-479. [15] Ciccone, A. and Matsuyama, K. (1996). ‘Start-up Costs and Pecuniary Externalities as Barriers to Economic Development’, Journal of Development Econonomics, 49, pp. 33-59. [16] Corrado, L., Martin, R. and Weeks, M. (2005). ‘Identifying and Interpreting Regional Convergence Clusters across Europe’, The Economic Journal, 115(502), pp. C133-C160. [17] Corrado, L. and Weeks, M. (2011). ‘Tests of Convergence Clubs’, Cambridge Working Paper Series CWPE1031. [18] Durlauf, S. N. and Johnson, P. (1995). ‘Multiple Regimes and Cross-Country Growth Behavior’, Journal of Applied Econometrics, 10(4), pp. 365-84. [19] Durlauf, S. N., Johnson, P. A., and Temple, J. R. W. (2005). ‘Growth Econometrics’. In P. Aghion and S. N. Durlauf (eds.) Handbook of Economic Growth, Vol. 1A, North-Holland: Amsterdam, pp. 555-677. [20] Efron, B. and Tibshirani, R. (1986). ‘Bootstrap Methods for Standard Errors, Confidence Intervals, and Other Measures of Statistical Accuracy’, Statistical Science,1(1), pp. 54-75. [21] Evans, P. (1998). ‘Using Panel Data to Evaluate Growth Theories,’ International Economic Review, 39 (1998), pp. 295-306. [22] Fleissig, A. and Strauss, J. (2001). ‘Panel Unit-Root Tests of OECD Stochastic Convergence’, Review of International Economics, 9(1), pp. 153-162. [23] Flores, R., P. Jorion, P.Y. Preumont and Szarfarz, A. (1999). ‘Multivariate Unit Root Tests of the PPP Hypothesis’, Journal of Empirical Finance, 6, pp. 335–353. [24] Fujita, M., Krugman, P. and Venables, A. J. (1999). The Spatial Economy: Cities, Regions and International Trade, Cambridge, MA: MIT Press. [25] Fujita, M. and Thisse, J. F. (2002). Economics of Agglomeration. Cities, Industrial Location, and Regional Growth, Cambridge: Cambridge University Press. 27 [26] Grossman, G. M. and Helpman, E. (1994). ‘Endogenous Innovation in the Theory of Growth’, Journal of Economic Perspectives, Vol. 8(1), pp. 23-44. [27] Harvey, A. C. (1989). Forecasting, Structural Time Series Models and the Kalman Filter, Cambridge University Press. [28] Harvey, A. and J. Bernstein, (2003). ‘Measurement and Testing of Inequality from Time Series of Deciles with an Application to U.S. Wages’, The Review of Economics and Statistics, MIT Press, 85(1), pp. 141-152. [29] Hobijn, B. and Franses, P. H. (2000). ‘Asymptotically Perfect and Relative Convergence of Productivity’, Journal of Applied Econometrics, 15(1), pp. 59-81. [30] Horowitz, L. (2001). ‘The Bootstrap’, in J. J. Heckman and E. E. Leamer, eds., Handbook of Econometrics, Vol. 5, pp. 3159-3228, Amsterdam, Elsevier. [31] Im, K.S., Pesaran, M. H. and Shin, Y. (2003). ‘Testing for Unit Roots in Heterogeneous Panels’, Journal of Econometrics, 115(1), pp. 53-74. [32] Kapetanios, G. (2003). ‘Determining the Stationarity Properties of Individual Series in Panel Dataset’, Queen Mary University Department of Economics Working Paper No. 495. [33] Kuo, B. and Tsong, C. (2005). ‘Bootstrap Inference for Stationarity’, Helsinki Center of Economic Research Discussion Paper No 50. [34] Kwiatkowski, D., Phillips, P., Schmidt, P. and Shin, Y. (1992). ‘Testing the Null Hypothesis of Stationarity against the Alternative of a Unit Root. How Sure are we that Economic Time Series have a Unit Root?,’ Journal of Econometrics, 54(1-3), pp. 159-78. [35] Levin, A., C. Lin, and C.J. Chu (2002), ‘Unit Root Tests in Panel Data: Asymptotic and Finite-sample Properties,’ Journal of Econometrics, 108, 1-24. [36] Leybourne, S. and McCabe, B. (1994). ‘A Consistent Test for a Unit Root’, Journal of Business and Economic Statistics, 12, pp. 157-166. [37] Louveaux, F., Thisse, J. F. and Beguin, H. (1982). ‘Location Theory and Transportation Costs’, Regional Science and Urban Economics, 12(4), pp. 529-45. [38] Lucas, R. (1988). ‘On the Mechanics of Economic Development’, Journal of Monetary Economics, 22(1), pp. 3-42. 28 [39] Maasoumi, E. and Wang, L. (2008), ‘Economic Reform, Growth and Convergence in China’, The Econometrics Journal, 11(1), pp. 128-154. [40] Maddala, G.S. and Li, H. (1997). ‘Bootstrapping Cointegrating Regressions’, Journal of Econometrics, 80, pp. 297-318. [41] Maddala, G.S. and Wu, S. (1999). ‘A Comparative Study of Unit Root Tests with Panel Data and a New Simple Test’, Oxford Bulletin of Economics and Statistics, 61, pp. 631–652. [42] Moon, H.R. and Perron, B. (2007), ‘Efficient Estimation of the SUR Cointegration Regression Model and Testing for Purchasing Power Parity’, Econometric Reviews, 23, pp. 293-323. [43] Murtagh, F. (1985). Multidimensional Clustering Algorithms, Physica-Verlag, Wurzburg. [44] Pesaran, M. and A. Timmermann (1992). ‘A simple nonparametric test of predictive performance’, Journal of Business and Economic Statistics 10, 461-465. [45] Pesaran, M. H. (2007). ‘A Pairwise Approach to Testing for Output and Growth Convergence’, Journal of Econometrics, Elsevier, 138(1), pp. 312-355. [46] Phillips, P. and Sul, D. (2007) ‘Transition modeling and econometric convergence tests’ Econometrica, 75(6), 1771-1855. [47] Quah, D.T. (1997). ‘Empirics for Growth and Distribution: Stratification, Polarization and Convergence Clubs’, Journal of Economic Growth, Vol. 2, pp. 27–59. [48] Romer, P. (1986). ‘Increasing Returns and Long-Run Growth’, Journal of Political Economy, 94(5), pp. 1002-37. [49] Solow, R. (1956). ‘A Contribution to the Theory of Economic Growth’, Quarterly Journal of Economics, 70(1), pp. 65–94. [50] Swan, T.W. (1956), ‘Economic Growth and Capital Accumulation’, Economic Record, 32(63), pp. 334–361. [51] Tan, C. M. (2009). ‘No One True Path: Uncovering the Interplay between Geography, Institutions, and Fractionalization in Economic Development’, Journal of Applied Econometrics, forthcoming. 29 [52] Taylor, M. P. and Sarno, L. (1998). ‘The Behavior of Real Exchange Rates during the Post-Bretton Woods Period,’ Journal of International Economics, 46(2), pp. 281-312. [53] Weeks, M. and Yao Y. J. (2003). ‘Provincial Conditional Income Convergence in China, 1953–1997: A Panel Data Approach,’ Econometric Reviews, 22(1), pp. 59-77. 30 Table 1: Single Clubs Results DataType N m ρ T 0.2 3 10 0.6 0.2 5 0.6 0.2 3 0.6 0.2 5 20 0.6 0.2 7 0.6 0.2 10 0.6 0.2 3 0.6 0.2 5 30 0.6 0.2 7 0.6 0.2 10 0.6 50 100 50 100 50 100 50 100 50 100 50 100 50 100 50 100 50 100 50 100 50 100 50 100 50 100 50 100 50 100 50 100 50 100 50 100 50 100 50 100 KS 0.01 0.60 0.73 0.69 0.73 0.61 0.69 0.67 0.71 0.47 0.60 0.58 0.65 0.60 0.67 0.63 0.71 0.61 0.64 0.63 0.68 0.52 0.61 0.56 0.65 0.54 0.59 0.60 0.65 0.54 0.59 0.56 0.64 0.48 0.60 0.52 0.63 0.43 0.53 0.49 0.61 HF 0.05 0.63 0.78 0.71 0.81 0.61 0.76 0.67 0.76 0.47 0.58 0.60 0.69 0.58 0.71 0.63 0.70 0.60 0.66 0.61 0.69 0.48 0.64 0.50 0.62 0.49 0.62 0.56 0.67 0.52 0.61 0.54 0.67 0.46 0.57 0.53 0.64 0.41 0.52 0.42 0.60 0.10 0.66 0.83 0.74 0.82 0.64 0.73 0.67 0.77 0.57 0.63 0.64 0.73 0.58 0.71 0.66 0.71 0.58 0.66 0.62 0.68 0.47 0.64 0.52 0.62 0.56 0.65 0.61 0.72 0.50 0.63 0.57 0.66 0.44 0.57 0.54 0.62 0.41 0.53 0.43 0.60 0.01 0.85 0.76 0.87 0.77 0.87 0.79 0.89 0.80 0.66 0.54 0.72 0.62 0.87 0.80 0.92 0.81 0.90 0.88 0.91 0.88 0.73 0.81 0.76 0.82 0.76 0.72 0.80 0.73 0.81 0.76 0.85 0.79 0.77 0.82 0.84 0.85 0.67 0.74 0.72 0.78 PT CW 0.05 0.87 0.79 0.88 0.81 0.88 0.82 0.89 0.83 0.74 0.61 0.81 0.72 0.88 0.82 0.91 0.82 0.87 0.88 0.88 0.88 0.72 0.79 0.74 0.80 0.80 0.76 0.84 0.76 0.82 0.77 0.85 0.79 0.76 0.82 0.82 0.85 0.67 0.73 0.71 0.77 31 0.10 0.89 0.81 0.91 0.81 0.87 0.82 0.88 0.84 0.73 0.65 0.81 0.72 0.87 0.83 0.90 0.84 0.86 0.88 0.86 0.89 0.70 0.79 0.74 0.79 0.78 0.77 0.81 0.77 0.78 0.76 0.85 0.80 0.73 0.81 0.80 0.85 0.65 0.73 0.69 0.76 0.01 1.67 1.96 2.04 2.02 1.68 2.10 2.15 2.21 1.60 2.39 1.82 2.64 2.74 2.92 2.34 3.05 2.78 3.03 2.13 3.22 2.42 3.09 2.24 3.08 1.76 2.11 2.01 2.13 1.93 2.22 2.13 2.29 1.67 2.06 2.09 2.22 2.46 2.68 2.63 2.85 HF 0.05 1.91 2.16 2.17 2.42 2.00 2.18 2.25 2.40 2.18 2.46 1.98 2.83 2.88 2.99 2.56 3.39 2.80 2.90 2.67 3.08 2.48 3.06 2.18 3.09 1.85 2.33 2.13 2.43 1.93 2.40 2.13 2.40 1.71 2.09 2.21 2.49 2.44 2.94 2.67 2.86 0.10 2.02 2.34 2.40 2.52 1.95 2.31 2.21 2.40 2.43 2.71 2.35 2.91 2.94 3.04 2.55 3.17 2.74 3.09 2.68 3.29 2.37 3.01 2.28 3.02 1.97 2.56 2.23 2.55 2.03 2.31 2.14 2.43 2.16 2.29 2.41 2.75 2.49 3.00 2.89 2.97 0.01 2.49 2.26 2.51 2.29 2.68 2.62 2.75 2.62 2.87 2.09 3.14 2.13 3.84 3.40 3.91 3.48 3.98 4.03 4.06 3.91 3.28 3.83 3.36 3.77 2.56 2.21 2.62 2.25 2.77 2.55 2.83 2.57 2.62 1.94 2.94 2.24 3.76 3.24 3.98 3.31 CW 0.05 2.51 2.28 2.58 2.33 2.73 2.68 2.76 2.63 3.08 2.39 3.46 2.57 3.86 3.56 3.89 3.37 3.79 4.01 3.90 3.85 3.25 3.70 3.33 3.62 2.65 2.31 2.69 2.38 2.77 2.62 2.83 2.66 2.98 2.20 3.32 2.62 3.82 3.34 3.98 3.37 0.10 2.72 2.49 2.64 2.29 2.69 2.69 2.73 2.69 3.53 2.98 3.51 3.21 3.82 3.75 3.72 3.62 3.79 4.11 3.83 3.94 3.24 3.64 3.36 3.61 2.71 2.38 2.77 2.37 2.75 2.62 2.79 2.67 2.97 2.35 3.32 2.65 3.80 3.40 3.94 3.48 Table 2: Multiple Club Results N Data Type m k ρ 0.2 4,4 2 10 0.6 0.2 3,3,2 3 0.6 T 50 100 50 100 50 100 50 100 0.01 34.20% 51.80% 38.80% 44.00% 1.20% 10.40% 13.00% 3.20% HF 0.05 28.00% 39.00% 26.60% 38.00% 1.00% 8.20% 14.00% 2.60% 32 0.1 17.00% 33.00% 19.00% 34.80% 1.00% 5.60% 10.00% 1.00% 0.01 28.00% 61.20% 35.00% 54.00% 15.60% 27.00% 13.00% 20.00% CW 0.05 22.00% 45.00% 24.00% 45.00% 15.60% 25.00% 15.60% 17.80% 0.1 14.40% 40.00% 17.00% 37.00% 14.40% 22.00% 14.40% 15.00% Table 3: List of Countries (PWT) AGO ARG AUS AUT BDI BEL BEN BGD BOL BRA BRB BUR BWA CAF CAN CHE CHL CIV CMR COG COL CPV CRI CSK CYP DEU DNK DOM DZA ECU EGY ESP FIN FJI FRA GAB GBR GHA Country Angola Argentina Australia Austria Burundi Belgium Benin Bangladesh Bolivia Brazil Barbados Myanmar Botswana Central African Rep. Canada Switzerland Chile Ivory Coast Cameroon Congo Colombia Cape Verde Is. Costa Rica Czechoslovakia Cyprus West Germany Denmark Dominican Rep. Algeria Ecuador Egypt Spain Finland Fiji France Gabon United Kingdom Ghana GIN GMB GNB GRC GTM GUY HKG HND HTI HVO IDN IND IRL IRN ISL ISR ITA JAM JOR JPN KEN KOR LKA LSO LUX MAR MDG MEX MLI MLT MOZ MRT MUS MWI MYS NAM NER NGA Country Guinea Gambia Guinea Bissau Greece Guatemala Guyana Hong Kong Honduras Haiti Burkina Faso Indonesia India Ireland Iran Iceland Israel Italy Jamaica Jordan Japan Kenya Korea Sri Lanka Lesotho Luxembourg Morocco Madagascar Mexico Mali Malta Mozambique Mauritania Mauritius Malawi Malaysia Namibia Niger Nigeria 33 NLD NOR NZL PAK PAN PER PHL PNG PRI PRT PRY RWA SEN SGP SLV SOM SUR SWE SWZ SYC SYR TCD TGO THA TTO TUN TUR UGA URY USA VEN YUG ZAF ZAR ZMB ZWE Country Netherlands Norway New Zealand Pakistan Panama Peru Phillipines Papua N. Guinea Puerto Rico Portugal Paraguay Rwanda Senegal Singapore El Salvador Somalia Suriname Sweden Swaziland Seychelles Syria Tcad Togo Thailand Trinidad/Tobago Tunisia Turkey Uganda Uruguay United States Venezuela Former Yugoslavia South Africa Zaire Zambia Zimbabwe Table 4: Joint Frequency Distribution (PWT) I: Asymptotic Number of Clusters Cluster size Perfect Relative 1 2 3 4 5 6 7 29 2 22 21 9 12 3 4 0 2 0 1 0 0 Total Clusters 63 42 II: Bootstrap Number of Clusters Cluster size 1 2 3 4 5 6 7 Perfect Relative 1 1 4 3 4 4 8 7 3 3 5 6 2 2 34 Total Clusters 27 26 Table 5: Asymptotic: Relative Convergence (PWT) No 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 22 24 26 28 30 32 34 36 38 40 41 42 Countries AUS DNK KEN LUX MUS AUT ESP ISR ITA PRI CAN CSK ECU GRC IRL BDI HVO MLI MWI BGD BUR HND NZL CAF IND NER UGA GUY JOR SLV SYC AGO GHA HTI BEN GIN VEN BOL LKA PNG BRB IDN THA CIV COG MAR CMR CRI NGA CPV GNB RWA FIN ISL TTO FJI NAM PER IRN PRT YUG MRT PAK SOM MYS SWZ TUR Clusters with two countries ARG GMB 21 BEL NOR BRA SUR 23 BWA MLT CHE USA 25 CHL GAB COL JAM 27 CYP SGP DEU FRA 29 DOM SWE DZA GTM 31 EGY ZWE GBR NLD 33 HKG KOR LSO TGO 35 MDG ZMB MEX URY 37 MOZ SEN PAN SYR 39 PRY TUN TCD ZAR Two separate countries JPN PHL Listed according to size. ZAF pmin = 0.01 and l = 2. 35 Table 6: Bootstrap: Relative Convergence (PWT) No 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 Countries COL JAM MYS NAM PAN AGO BGD CMR GHA MRT CHE CHL CSK GRC MEX GUY JOR PNG SLV SYC CRI GAB IRN MUS SUR CIV COG LKA MAR PHL AUT FIN ISL ITA JPN BDI BUR GIN HVO MLI LSO RWA TCD TGO ZAR CPV GMB GNB NER UGA DEU DNK FRA LUX NZL CYP PRT SGP YUG DZA ECU GTM SWZ BOL DOM KOR PRY BRA FJI MLT PER CAF IND KEN NGA BEL GBR NLD NOR BEN MDG ZMB ZWE AUS CAN SWE BRB HKG IRL BWA MOZ SEN ESP ISR PRI Clusters with two countries HTI IDN EGY HND ARG URY One separate country VEN SYR PAK USA TUN ZAF THA TTO MWI Listed according to size. pmin = 0.01 and l = 2. The number of bootstrap samples is set at 200. 36 TUR SOM Table 7: NUTS1 code Code Country AT AT1 AT2 AT3 BE BE1 Austria BE2 BE3 DE DE1 DE2 DE3 DE5 DE6 DE7 DE9 DEA DEB DEC DEG DK ES ES3 ES4 ES5 ES6 ES7 F1 FR FR1 FR2 FR3 FR4 FR5 FR6 FR7 FR8 GR GR1 GR2 GR3 GR4 Ostosterreich Sudosterreich Westosterreich Belgium Region Bruxelles-Capital-Brussels Hoofdstedelijke Gewest Vlaams Gewest Region Wallonne Germany Baden-Wurttemberg Bayern Berlin Bremen Hamburg Hessen Niedersachsen Nordrhein-Westfalen Rheinland-Pfalz Saarland Thuringen Code Country IE Ireland IT IT1 IT2 IT3 IT4 IT5 IT6 IT7 IT8 IT9 ITA ITB LU Italy NL NL1 NL2 NL3 NL4 PT PT1 SE Denmark Spain Comunidad de Madrid Centro Este Sur Canarias UK UKC UKD UKE Finland UKF UKG UKH UK1 UKJ UKK UKL UKM France Ile de France Bassin-Parisien Nord Pas de Calais Est Ouest Sud-Ouest Centre-Est Mediterranee Greece Voreia Ellada Kentriki Ellada Attiki Nisia Aigaiou, Kriti 37 Nord Ovest Lombardia Nord Est Emilia-Romagna Centro Lazio Abruzzo-Molise Campania Sud Sicilia Sardegna Luxembourg Netherlands Noord-Nederland Oost-Nederland West-Nederland Zuid-Nederland Portugal Continente Sweden United Kingdom North East North West Yorkshire and Humber East Midland West Midlands East of England London South East South West Wales Scotland Table 8: Joint Frequency Distribution I: Asymptotic Number of Clusters Cluster size 1 2 3 4 Agriculture Manufacturing Market Service Non-market Service 0 0 1 1 3 7 9 6 7 9 3 7 2 4 4 1 6 0 2 1 Total Clusters 2 25 26 14 5 6 Summary Statistics 7 8 9 10 1 1 0 0 0 1 0 0 0 0 0 1 0 1 0 1 1 1 0 0 6 3 3 1 1 0 7 8 9 10 Total Clusters 18 22 21 20 81 σȳ 15.2 ȳ min 9.4 ȳ max 103 σȳ 5.4 ȳ min 11.7 ȳ max 62.6 II: Bootstrap Number of Clusters Cluster size 1 2 3 4 Agriculture Manufacturing Market Services Non-market Services 0 0 0 0 3 2 1 1 1 5 3 3 1 1 1 2 2 2 2 3 Total Clusters 0 7 12 6 5 8 6 0 1 3 1 1 3 0 1 1 0 4 1 1 0 0 3 0 2 0 0 10 3 8 2 2 12 15 14 14 55 III Correlation Between Asymptotic and Bootstrap Cluster Outcomes Agriculture Manufacturing Market Services Non-Market Services NB: σȳ denotes the standard deviation of cluster means. 0.672 0.509 0.557 0.591 of cluster means. ȳ min and ȳ max denote the Min and Max 38 (a) Relative Convergence Asymptotic Results in Agriculture: (b) Relative Convergence Bootstrap Results (c) Relative Convergence in Manufacturing: Asymptotic Results in Agriculture: (d) Relative Convergence in Manufacturing: Bootstrap Results Figure 1: Asymptotic and Bootstrap Results for Agriculture and Manufacturing 39 (a) Relative Convergence in Market Services: Asymptotic Results (b) Relative Convergence in Market Services: Bootstrap Results (c) Relative Convergence in Services: Asymptotic Results (d) Relative Convergence Services: Bootstrap Results Non-Market in Non-Market Figure 2: Asymptotic and Bootstrap Results for Non-Market and Market Services 40 Figure 3: The Distribution of Cluster Size. Figure 4: The distribution of average log per-capita GVA by cluster: All sectors. Skewness (Asymptotic ) 1.29 (Bootstrap) 0.27 Kurtosis (Asymptotic ) 6.82 (Bootstrap) 2.20 41