Academia.eduAcademia.edu

Extending the Lee–Carter model: a three-way decomposition

2011, Scandinavian Actuarial Journal

In this paper, we focus on a Multidimensional Data Analysis approach to the Lee-Carter (LC) model of mortality trends. In particular, we extend the bilinear LC model and specify a new model based on a three-way structure, which incorporates a further component in the decomposition of the log-mortality rates. A multi-way component analysis is performed using the Tucker 3 model. The suggested methodology allows us to obtain combined estimates for the three modes: i) time, ii) agegroups and iii) different populations. From the results obtained by the Tucker 3 decomposition, we can jointly compare, in both a numerical and graphical way, the relationships among all three modes and obtain a time series component as a leading indicator of the mortality trend for a group of populations. Further, we carry out a correlation analysis of the estimated trends in order to assess the reliability of the results of the three-way decomposition. The model's goodness of fit is assessed using an analysis of the residuals. Finally, we discuss how the synthesised mortality index can be used to build concise projected life tables for a group of populations. An application which compares ten European countries is used to illustrate the approach and provide a deeper insight into the model and its implementation.

City Research Online City, University of London Institutional Repository Citation: Russolillo, M., Giordano, G. and Haberman, S. (2011). Extending the Lee Carter Model: a Three-way Decomposition. Scandinavian Actuarial Journal, 2011(2), pp. 96-117. doi: 10.1080/03461231003611933 This is the accepted version of the paper. This version of the publication may differ from the final published version. Permanent repository link: http://openaccess.city.ac.uk/4034/ Link to published version: http://dx.doi.org/10.1080/03461231003611933 Copyright and reuse: City Research Online aims to make research outputs of City, University of London available to a wider audience. Copyright and Moral Rights remain with the author(s) and/or copyright holders. URLs from City Research Online may be freely distributed and linked to. City Research Online: http://openaccess.city.ac.uk/ [email protected] Extending the Lee Carter Model: a Three-Way Decomposition Maria Russolillo1, Giuseppe Giordano Department of Statistics and Economics University of Salerno, Italy [email protected]; [email protected] & Steven Haberman Faculty of Actuarial Science and Insurance, Cass Business School, City University, London, UK [email protected] Corresponding Author: Maria Russolillo, Dip. Scienze Economiche e Statistiche, Università di Salerno, 84084 – Fisciano (Salerno), Italy. Tel. +39 089 963457 Fax. +39 089 962049. E-mail: [email protected] 1 1 Maria Russolillo, Giuseppe Giordano and Steven Haberman Extending the Lee Carter Model: a Three-Way Decomposition Scandinavian Actuarial Journal Abstract In this paper, we focus on a Multidimensional Data Analysis approach to the Lee-Carter (LC) model of mortality trends . In particular, we extend the bilinear LC model and specify a new model based on a three-way structure, which incorporates a further component in the decomposition of the log-mortality rates. A multi-way component analysis is performed using the Tucker 3 model. The suggested methodology allows us to obtain combined estimates for the three modes: i) time, ii) agegroups and iii) different populations. From the results obtained by the Tucker 3 decomposition, we can jointly compare, in both a numerical and graphical way, the relationships among all three modes and obtain a time series component as a leading indicator of the mortality trend for a group of populations. Further, we carry out a correlation analysis of the estimated trends in order to assess the reliability of the results of the three-way decomposition. The model’s goodness of fit is assessed using an analysis of the residuals. Finally, we discuss how the synthesised mortality index can be used to build concise projected life tables for a group of populations. An application which compares ten European countries is used to illustrate the approach and provide a deeper insight into the model and its implementation. Keywords: Biplot, Mortality Forecasting, Singular Value Decomposition, Tucker3 model. JEL Classification: C49, G20. 1. Introduction and Basic Notation The Lee-Carter model (Lee and Carter, 1992) represents an extrapolative method based on a multiplicative two-way model which is used with time series analysis in forecasting mortality. Tabeau et al. (2001) have highlighted the statistical features of this method as an association model for a two-way data matrix. The model fitting can be addressed, from both an algebraic and a statistical point of view, by means of the Singular Value Decomposition (SVD – Eckart and Young, 1936). The model used for mortality data is2:   ln yij   j  i  j   ij i  1, , I ; j  1, , J (1) which represents the logarithm of death rates y ij measured over I years and J age-classes, as the sum of an age-specific component  j , that is independent of time and another component that is the product of a time-varying parameter  i , reflecting the trend in the level of mortality, and an age-specific component  j , that represents how, rapidly or slowly, mortality at each age varies when the general level of mortality changes. The  ij component denotes the error term, which is assumed to be homoscedastic and normally distributed. 2 The notation used here is different from the original proposed by Lee and Carter for sake of comparability with the terminology used in the three-way data analysis literature (see Kiers, 2000). 2 The LC model (1) is usually expressed by considering the mean centred log-mortality rates: (see for example: Lee and Carter, 1992; Renshaw and Haberman, 2003a):   ~ yij  ln yij   j  i  j   ij i  1, , I ; j  1, , J (2) The values of j are computed by averaging the log-mortality rates across the I years, for each age-class j = 1, ..., J. ~ The dependent term ~yij can be arranged in the rectangular matrix Y (I×J), ̃ ( ̃ ̃ ̃ ̃ ̃ ̃ ̃ ̃ ̃ ̃ (3) ) The SVD of this matrix is stated as follows: ̃[ ] [ ] [ ] [ (4) ] ~ In the square brackets there is the size of each array. The scalar h is the rank of Y , S is a diagonal ~ matrix holding the positive singular values of Y . The matrices U and V hold, respectively, the left and right singular vectors ui and vj forming orthogonal bases for the I rows and the J columns of the ~ matrix Y . The estimates of  i and  j , in equation (2), are obtained as: ˆi  ui1; ˆ j  1v'1 j (5) and they minimize: ~yij   i  j 2 i (6) j The aim of this contribution is to extend the two-way model introduced by Lee and Carter to include the case of a further criterion, so that mortality data can be shaped in a three-dimensional array. When mortality data are derived from a more extensive dataset, there can be auxiliary variables of interest which can be used as aggregating criteria, for example: gender, countries, ethnic groups, causes of death. In the literature, there are several papers which have considered the modelling and forecasting of population mortality using the LC framework for a number of European Countries such as Belgium (Brouhns et al., 2002), Austria (Carter and Prkawetz, 2001), England and Wales (Renshaw and Haberman, 2003a,b), and Spain (Guillen and Vidiella-i-Anguera, 2005). It is noteworthy, however, that none of these papers take into account more than one country simultaneously. Whilst the demographic and actuarial literature has emphasised mainly the analytical role of SVD as an estimation method, in a previous work, Giordano et al. (2008), have exploited the descriptive features of the SVD in the light of Explorative Data Analysis (EDA, Tukey, 1977). In particular, they show how the Principal Component Analysis (PCA, Hotelling, 1933) can be applied 3 to mortality rates, providing a useful synthesis and enhancing the model interpretation. A recent paper that has used a PCA extension to the lee-Carter model is Hatzopoulos and Haberman (2009). In this paper, a new specification of the LC demographic model is presented. We introduce a three-way model structure which looks at the mortality data aggregated according to three criteria: time, age-class and country. Specifically, we refer to the Tucker 3 decomposition method (Kroonenberg, 1983). The proposed approach allows us to simplify the data structure and to obtain a rank reduced representation. In the proposed model, the resulting time series component  i no longer concerns a single country, but provides an aggregate estimate of a set of countries. It could be used as a benchmark for the individual time series of each country and to build concise projected life tables for the set of countries under consideration. The paper is organised as follows. Section 2 presents the three-way extension of the Lee Carter model. Section 3 contains an application both to a two-way and a three-way dataset. The three-way dataset involves mortality rates divided by age-group for ten European countries in the period 1950-2000. Section 4 provides some concluding remarks. 2 . Three-way LC Model Frequently, the crude mortality data can be disaggregated according to several criteria or according to different socio-demographic attributes, for example: gender, countries, ethnic groups, regions, causes of death and so on. We consider the case of a three-way structure. It would be interesting to assess whether such data can be described as combinations of a reduced number of latent factors. Since the data are classified according to three different criteria, we could analyse them by carrying out as many two-way analyses as the number of elements of the third criterion. Furthermore, we would like to analyse the variability across the third criterion. Thus, we introduce a new specification of the LC model, which considers the new effect as follows:   ~ yijl  ln yijl   jl   i  j  l   ijl i  1, , I ; j  1, , J ; l  1, , L (7) where , and have the same interpretation as before (see eq. 1 and 2), while l (l = 1,...,L) represents the term associated to the generic element of the third criterion (which, henceforth, we refer to as representing different countries). We note that all of the parameters share information common to the three ways, that is the estimates are influenced both by time and country effects. The left hand term can be arranged in a three way array, for example: (Time × Age-group × Country). The dependent term can be expressed as a deviation from the mean effects computed either for one or two criteria. The singular value decomposition needs to be reformulated to take into account the new data structure. In order to address this decomposition issue, several algorithms leading to different statistical methods (i.e. Multiple Factorial Analysis, Generalised Canonical Analysis, PARAFAC model, Tucker’s Methods, etc.) have been proposed in the literature (Carroll 1968, Escofier and Pagès 1990, Kiers 2000 and Kroonenberg 1983). The natural extension of the SVD for the threeway data structure is the Tucker 3 model (Tucker, 1966). This method, which is also known as Three-Mode Principal Component Analysis, can be seen as a generalisation of SVD. In the two-way SVD, the singular values are arranged in a diagonal matrix (see eq. 4). In the Tucker 3 model these values form a three-way array - the core array - (see fig.1). However, the relationships with the singular values of the various two-way analysis are not so immediate. In Fig. 1 is shown the three-way decomposition according to the Tucker 3 model. The three dimensional array ̃ is decomposed in a three-way core array  and in three matrices (K, G, B). 4 G(L×Q) K(I×N) = + (N×P×,Q) ̃( ) (P×J) E(I×J×L) Fig. 1 – The three-way model Referring back to equation (8), the matrix K (I×N) holds the generic element κin; the matrix B (J×P) has generic element jp; and the matrix G (L×Q) has generic entry lq. The K, G and B are column-wise ortho-normal matrices and their columns hold the components associated to each mode. The Tucker 3 model factorises the three-way array ̃ such that3: (̃ ) ∑ ∑ ∑ n=1,...,N; p =1,...P; q=1,...,Q (8) The left hand side of equation (8) holds the log-mortality rates expressed as deviations from the average of each age-group j. The terms N, P and Q denote the numbers of elements in each of the three modes, respectively years, age-groups, and countries; I, J, K are the numbers of components selected in the approximation. The coefficients jp, ,lq, are the entries of the component matrices for the first, second and third mode. The npq are the elements of the three-way core matrix. By setting n, p, q equal to 1, we obtain the estimates for the model stated in (7) assuming the remaining components as residual term. The patterns of the SVD residuals can be analysed to confirm whether further components may contain useful information in terms of goodness of fit. By retaining further components, we may also obtain two-dimensional maps displaying the relationships among the three modes in the array ̃. As described in detail by Kiers (2000), several possibilities exist for graphical presentation of the model: i) to plot the entities of one single mode, ii) to plot combinations of two modes and superimpose the entities of the third mode, or iii) to plot entities of two modes jointly, for each of the entities of the third mode (the so called “joint plots”, see Kroonenberg, 1983, p.164-165). Different tools are available to interpret the results of a three-way decomposition (see Tucker, 1966; Kroonenberg, 1983 for a thorough discussion). The squared singular values in the core matrix  represent the amount of variation accounted for by each component in each mode. These can be summed in order to provide the global variability. When divided by the total variability, the core elements indicate the proportional fit of a component. Moreover, it can be shown (Kroonenberg, 1983) that the total sum of squares can be split into a fitted part and a residual part for each mode. This can be useful, for instance, for indicating how well the model fits the data. Further insight into the interpretation phase is provided both by the output components and by the core matrix elements as shown by the example in the following section. 3 The algorithms for estimating the model in eq. (7) are described in Kroonenberg & De Leeuw (1980). 5 3. The Procedure at Work: Exploring Mortality Rates in Ten European Countries An in-depth illustration of our procedure at work is shown by using the mortality rates from 1950 to 2000 provided by the Human Mortality Database4 for ten different European countries: Austria (Aus), Belgium (Bel), Denmark (Den), Finland (Fin), France (Fra), Italy (Ita), Netherland (Net), Spain (Spa), Sweden (Swe), United Kingdom (UK). This choice is motivated by the requirement of dealing with a homogeneous data-set (western Europe). The available data for ten countries show no missing values in the considered period. The data are arranged according to the notation given in section 2. The three-way data array holds the mean centred log-mortality rates for each country. The rows represent the 51 years, the columns are the 21 age-groups: “0”, “1-4”, “5-9”,…, “95-99”. The third mode is given by the ten countries. The data can be interpreted as follows: for each country, year and age group, a positive value indicates a mortality rate greater than the average value of the correspondent age-group. Notice that data are centred across observations in the first mode (years). Thus, we can compare the results for the individual countries, given by a two-way analysis, with the three-way synthesis. In the next two subsections, firstly, the PCA is carried out for the two-way Italian mortality dataset. Afterwards, we apply the Tucker3 method to the extended LC model for our threedimensional array. In this case, we show several features of the proposed methodology and provide some interpretative comments on the relationships between the age-groups and the different years and countries. The PCA of each two-way dataset allows the estimation and visualisation of the mortality indices (  i ) and the age-specific component (  j ) for the different countries. The comparison is made easier by the graphical display of the first two components. As an illustration, we firstly analyse the Italian dataset (total population). This analysis, just applied to one country, is useful to show the peculiar interpretation of the PCA to the LC model (Giordano et al, 2008). Then, the extension to the three-way analysis provides a unique mortality index as a synthesis of the mortality trends in the ten countries. The principal results are reported in the following subsections. 3.1 Two-Way Data Analysis: Italian Data We carried out the Principal Component Analysis on the variance-covariance matrix obtained for the Italian mortality data. The first eigenvalue accounts for a large amount of inertia (95.74%) showing the high correlation among the age-groups. We also consider the second component which accounts for the 3.13% of the variability. Thus, the first two components in the factorial plan explain 98.88% of the total variation (see Table 1). Table 1 – The first three eigenvalues: Italian data. Number 1 2 3 Eigenvalue 2.9055 0.0951 0.0094 Percentage 95.74 3.13 0.31 Cumulated Percentage 95.74 98.88 99.19 The relationships among the age-group variables are showed in Table 2. For each age-group we determine the factor loading. On the first axis, the largest contributions are given by the age groups “0”, “1-4”, “5-9”, “10-14” and “15-19”. As an indication of the significance of the first component in the LC model, we observe how the factor loadings on the second axis are much smaller than the first axis. A thorough description of the aforementioned relationship can be observed in the biplot representation (see Fig. 2), which has been obtained by means of the software SPAD®. In this figure, each year is represented as a point, whereas the age-groups are represented as vectors. The factorial map, provided by the first two components, shows how the first component represents the 4 Data Source: Human Mortality Database. University of California, Berkeley (USA), and Max Planck Institute for Demographic Research (Germany). Available at www.mortality.org or www.humanmortality.de (data downloaded on 2007). 6 common temporal trend for all age-groups. Indeed, the year-points lie from right to left, ordered from the year 1950 to the year 2000, showing that all age-groups share highly correlated trends. The second component characterises a diversification of the mortality indices due to the effect of specific age-groups. In particular, on the upper side are located the oldest age groups. On the lower side, we find the central age-groups 30-34, 35-39. Thus, for the Italian data, the choice of the first principal component as a composite indicator of all of the age-groups seems to be well supported by the empirical observations. Table 2 – Factor loadings for the first two components: Italian data. Age group 0 1-4 5-9 10-14 15-19 20-24 25-29 30-34 35-39 40-44 45-49 50-54 55-59 60-64 65-69 70-74 75-79 80-84 85-89 90-94 95-99 Loadings Axis 1 Axis 2 0.82 0.07 0.91 -0.12 0.60 -0.01 0.46 0.01 0.25 0.00 0.23 -0.06 0.25 -0.12 0.25 -0.13 0.27 -0.05 0.27 0.03 0.26 0.06 0.24 0.08 0.21 0.08 0.20 0.07 0.20 0.06 0.21 0.06 0.22 0.06 0.21 0.05 0.18 0.05 0.14 0.04 0.11 0.03 Fig.2 – The first two components of the PCA decomposition: Italian data 7 3.2. Three-Way Data Analysis: European Data The three-way data analysis has been carried out by means of the Matlab® toolbox Tucker3 (written by Henk A.L. Kiers, April 2001). We find that the model with the selection of (2×2×2) components gives a good fit, explaining 84.75% of the total variation. In Table 3, we present the rotated Core Matrix values. The value -31.16 indicates the higher contribution made by the first SVD components for the three modes. In Table 4, the explained variation of each core entry is analysed. Table 3 – The core matrix elements Age 1 × Country 1 Age 2 × Country 1 Age 1 × Country 2 Age 2 × Country 2 Years1 16.71 -.001 -0.031 -0.29 Years2 -.00 3.48 0.47 -2.13 Table 4 – Core Component Analysis: % of importance Age 1 × Country 1 Age 2 × Country 1 Age 1 × Country 2 Age 2 × Country 2 Years1 79.90 0.00 0.00 0.02 Years2 0.00 3.47 0.06 1.30 The first two components are plotted in Fig. 3, respectively, for i,n=1,2 (mode 1) and  j,p=1,2 mode 2). 0.5 Comp. 2 Comp. 1 0.4 0.3 0.2 0.1 0 -0.1 -0.2 -0.3 1950 1952 1954 1956 1958 1960 1962 1964 1966 1968 1970 1972 1974 1976 1978 1980 1982 1984 1986 1988 1990 1992 1994 1996 1998 2000 Mode1: Years 0.6 Comp.1 Comp.2 0.4 0.2 0 -0.2 -0.4 -0.6 -0.8 0 1-4 5-9 10-14 15-19 20-24 25-29 30-34 35-39 40-44 45-49 50-54 55-59 60-64 65-69 70-74 75-79 80-84 85-89 90-94 95-99 Mode2: Age groups Fig. 3 - Plots of the components for years (mode 1) and age-groups (mode 2). 8 In the top part of Fig. 3, we note the linear trend of the first component which couldn be used in a subsequent forecasting phase. In the bottom part of Fig. 3, we notice that the first component shows a regular pattern for the age-groups going from 15-19 to 80-84. For this component, we note the different contributions given by the younger age-groups. After exploring the component scores, it is interesting to analyse in more detail the relationships between the trend (years) and age profiles. Usually, this is performed conditionally on each component of the third mode. In Fig.4, we show the joint plot of years and age-groups conditional on the first component of country. In this representation, the age-group components should be thought of as vector-points (loading scores) and the years play the role of “statistical units”. By considering each age-group as variable, we may project along its direction all of the years-points and analyse the particular trend for that age-group. 0.6 25-29 0.5 30-34 1950 0.4 20-24 1951 0.3 1-4 15-19 1957 40-441988 1956 10-14 1987 1958 1959 1961 1960 1986 1964 45-49 1962 1984 1963 1980 1983 1979 1982 1972 1985 1965 1978 1981 1966 1970 1973 85-89 1977 1967 1971 50-54 1969 1968 1974 19751976 80-84 75-79 55-59 70-74 65-69 60-64 0 -0.1 -0.3 -0.6 1995 1994 1996 1991 1992 1993 1990 1997 1989 1953 1954 1955 0.1 -0.2 35-39 1952 0.2 5-9 0 -0.5 -0.4 -0.3 -0.2 -0.1 0 0.1 2000 1998 1999 95-99 90-94 0.2 0.3 Fig.4 - The joint plot of the three-way mortality data: years and age-groups plotted conditionally to the first component of country. In Fig. 4, we have plotted the age-groups as a vector-variable. We note that this interpretation is still valid because of the preliminary choice of centring the log-mortality rates across the years component for each age-group. A different pre-treatment would have given different results and hence different interpretations. The age-groups ranging from “0” to “10-14” seem to explain the greater part of the variability along the horizontal axis. Along the vertical axis, we note a division between the agegroups ranging from 20 to 40 years old and the oldest age-groups from 45 to 94 years old. Let us notice that this result is similar to the two-way analysis shown for the Italian data in Fig.2, but with an axis rotation. 9 0.6 1950 1951 0.2 Nl 1952 1953 1954 1957 1958 1959 1956 1964 1960 1984 1986 1979 1961 1962 1978 1980 1983 1966 1970 1972 1981 1963 1985 1973 1975 1967 1971 1982 1974 1977 1965 1968 1969 1976 -0.2 Dn 1995 1991 1992 1994 1996 1990 1993 1997 2000 1989 1998 1988 1999 1987 1955 0 Au Sv 0.4 Fr Be Fi Uk It -0.4 -0.6 -0.8 -0.3 Sp -0.2 -0.1 0 0.1 0.2 0.3 0.4 0.5 Fig.5 - The joint plot of the three-way mortality data: years and country plotted conditionally to the first component of age-group. Similarly, Fig.5 shows the bi-plot of years and country conditional on the first component of the age-group mode. We notice that the ten countries show highly correlated patterns. However the estimated trends for Belgium, Finland, France, Italy and UK seem to be better represented on the first axis of the bi-plot. The second axis separates Spain (on the bottom side) from Austria, Sweden, the Netherlands and Denmark (on the top side). Finally, we show a comparison among the ten European Countries by plotting the ( ) estimated for each single country. In Fig.6, the ten trends are compared with the joint estimated trend obtained through the three-way decomposition (shown in Fig. 3). 10 Fig. 6 – The ten i trends and the three-way synthesis (bold line) From Fig. 6, it is evident that there is a strong trend similarity among the ten series. This enforces the idea that the synthesised (shown in bold) is a good approximation for the whole series, as confirmed by the correlation matrix in Table 5. Table 5 – Correlation matrix among the ten estimates i and the three-way synthesis (Countries) Countries Au Be Dn Fi Fr It Nl Sp Sv Uk Countries 1.000 0.993 0.996 0.991 0.995 0.998 0.996 0.994 0.985 0.995 0.995 Au Be Dn Fi Fr It Nl Sp Sv 1.000 0.997 0.983 0.980 0.994 0.985 0.989 0.962 0.992 0.994 1.000 0.985 0.985 0.996 0.988 0.993 0.968 0.994 0.997 1.000 0.986 0.987 0.987 0.985 0.966 0.990 0.988 1.000 0.988 0.992 0.989 0.985 0.988 0.986 1.000 0.992 0.991 0.980 0.994 0.994 1.000 0.990 0.985 0.989 0.986 1.000 0.968 0.991 0.992 1.000 0.970 0.966 1.000 0.995 Uk 1.000 3.2.1 Goodness of Fit Analysis As regards the assessment of goodness of fit, the sole use of the explained variation is not a satisfactory diagnostic indicator but, as noted by Brouhns et al (2002) and by Renshaw and Haberman (2003a), it has been widely used in the demographic literature for the explorative purpose for choosing the number of relevant components (Tuljapurkar et al., 2000). An useful tool is the boxplot of the residuals, for each age group, against Years and Country. For illustrative purposes, in Fig. 7, we plot the three-way residual patterns for the full set of countries. The boxplots show the residual patterns of (51x10) observations for each age-groups. We notice that, for each age-group and each year, the standardised residuals show symmetric and regular patterns, confirming that we have a good fit for the three-way LC method. 11 3 2 Values 1 0 -1 -2 -3 1 2 3 4 5 6 7 8 9 10 11 12 Age Class 13 14 15 16 17 18 19 20 21 Fig. 7 – Box-plots of the Tucker3 model residuals for each age group from 0, 1-4,..., to 95-99. Each box represents the variability of 510 observations (51 years x 10 countries). 3.2.2 Discussion on forecasting by the Three-Way Model The advantages of the Lee-Carter model are essentially twofold: 1. it is a simple model which is combined with time-series analysis for describing the secular changes in mortality, it has been successfully used for forecasting, and the results compete favourably with those from other extrapolative procedures; 2. it allows the discussion of mortality uncertainty through the uncertainty of the time-varying variable ki (see, for example, Renshaw and Haberman, 2009). The attractive and convenient linear feature of the time-varying variable ki is purely statistical and varies for different countries and different periods; the reasons behind the statistical features of the parameters vary too. Therefore, care would be needed when the model is used for explanatory purposes. In this subsection, we provide indications around the way that the Three-Way Model extension might be of practical use in terms of the traditional Lee-Carter model application, with the intention of extending this topic in further studies. As with the traditional LC modelling approach, we could apply a time series analysis to the estimated ki and produce projected life tables. In our case, we would be interested both in each country and in highlighting the opportunity to use the synthesised ki, derived by the three-way model, in order to generate aggregated life tables. An advantage of computing aggregated life tables would be that such tables would take into account the moderating effect of being based on a synthesised ki. This effect could be used, for example, for presenting projected life expectancy at birth for the ten countries and for Europe, in aggregate. In future research, we aim to compute period and cohort based life expectancies from forecast mortality rates for each single country by using the particular time series ki of that country. In this paper, our aim is to extrapolate the threeway LC model in order to obtain a synthesised ki that could be used for forecasting. In the actuarial literature, it is well known that mortality improvements have a significant effect on the pricing of and reserving for life annuities. More generally, mortality trends affect any insurance cover that provides some kind of “living benefits”. In order to protect the insurance company from the effect of mortality improvements, actuaries have to employ life tables that include a forecast of the future trends of mortality rates (the so-called projected tables). For this 12 reason, by applying a time series approach to the synthesised ki derived by our procedure, the idea could be to generate a projected aggregate life table for the ten countries of Europe and a set of projected life tables for the individual countries. These could then be used to extrapolate life expectancies and annuity values aggregated for all ten European countries, as well as individually. We would expect that the life expectancies and annuity values forecast under the synthesised ki are intermediate between the ones generated by using the ki series related to each single country, reflecting the different model structure that generates the respective time series ki. This possibility remains to be verified. 4. Concluding Remarks In this paper, we deal with the traditional formulation of the LC model and extend it by introducing a third component in the model specification. We introduce a three-way structure which looks at the mortality data aggregated according to three criteria: time, age-class and country. In order to deal with the three-way structure, we refer to the Tucker 3 decomposition method. This factorial decomposition is interpreted in the light of a three-way data analysis and allows us to simplify the data structure and obtain a rank reduced representation. Through the results derived by the Tucker3 decomposition we have compared, in a numerical and graphical way, the relationships among all three modes and extrapolated a time series component as a leading indicator of the mortality for the whole group of countries. By means of our procedure, we have compared the results of the individual countries, on the basis of a two-way analysis, with the three-way synthesis. The extension to the three-way analysis provides a unique mortality index  i which no longer concerns a single country, but provides an aggregate estimate for a set of countries. In the discussion, we suggest how the model proposed in this paper may be used to deal with forecasting. The resulting time series component could be used as a benchmark for the individual time series of each country and to build concise projected life tables for the whole group of countries. We acknowledge that we have returned to the original version of the LC model and extended the SVD formulation. More recent developments of the model have dealt with other aspects – for example, the introduction of the more natural Poisson error structure (Brouhns et al, 2002; Renshaw and Haberman, 2003a, 2003b). We believe that the exploratory tools of multivariate data analysis, which we have proposed, give a new perspective to the demographic analysis supporting the analytical results with a geometrical interpretation and a graphical representation. Acknowledgements The authors thank an anonymous referee whose suggestions improved the original manuscript. 13 Appendix. The Matlab Procedure. In the following is reported the Matlab output to reproduce the three-way analysis and the related plot shown in this paper. It makes use of the Matlab source code written by Henk Kiers in 2001 as reported in his web page http://www.ppsw.rug.nl/~kiers/ where the whole toolbox can be downloaded. The Tucker3.m must run provided that a Data Matrix X is shaped as specified in the note 2 below. Here is reported the interactive parameters choice in bold “< >” symbols. ********************************************************************************************* WELCOME to the interactive TUCKER3 analysis program (written by Henk A.L. Kiers, April 2001) The program works fully interactively. Simply answer the questions, or press the Enter key for the default option. NOTES: 1. The program does not handle missing data: you should make sure that all missing values are replaced by sensible values 2. The data should be available in the matrix X, which is set up as follows: - rows correspond to A-mode entities - columns correspond to combinations of B- and C-mode entities, with B-mode entities nested within C-mode entities For example, a 2 x 2 x 3 array is given as follows: x_1_1_1 x_1_2_1 x_1_1_2 x_1_2_2 x_1_1_3 x_1_2_3 x_2_1_1 x_2_2_1 x_2_1_2 x_2_2_2 x_2_1_3 x_2_2_3 3. The program may change matrix X; keep a copy of X ! 4. Tucker3.m works in the workspace, and makes (and hence overwrites) many matrices; you are advised to BACKUP the contents of your workspace before analysis with Tucker3.m 5. You can specify labels for A- B- and C- mode entities (in laba, labb, labc); otherwise standard labels will be made by the program, as laba, labb, labc. 6. All matrix output is tab-delimited, and can hence easily be converted to tables, after copying to Word 7. Faster analysis (for advanced users) can be carried out using splithalft3.m Note: If you are interested in doing a botstrap analysis, it is assumed that the A-mode is the mode from which you want to resampling If this is not the case, first reorder your data START of program Specify the number of A-mode entities: <51> Specify the number of B-mode entities: <21> Specify the number of C-mode entities: <10> To see anova results, specify "1": <1> Total ssq = 1060.338983 SS_gm = 0.000000 ( 0.00) SS_a = 673.790430 ( 63.54) SS_b = 0.000000 ( 0.00) SS_c = 0.000000 ( 0.00) SS_ab = 289.212959 ( 27.28) SS_ac = 37.252262 ( 3.51) SS_bc = 0.000000 ( 0.00) SS_abc = 60.083333 ( 5.67) How do you want to CENTER your array ? 1= across A-mode 14 2= across B-mode 3= across C-mode 12= across A-mode and across B-mode 13= across A-mode and across C-mode 23= across B-mode and across C-mode Specify your choice: <12> Data have been centered across A-mode and across B-mode How do you want to NORMALIZE your array ? 1= within A-mode 2= within B-mode 3= within C-mode Specify your choice: <enter> NOTE: The preprocessed data are now available in Xprep Xprep can be used for analyses outside the Tucker3.m program To see PCA of MEAN, specify "1": <1> Over which mode do you want to compute means? Enter 1,2 or 3 (for A,B, or C)? <3> A 51 by 21 matrix for modes A x B will be computed Which mode will serve as "variables mode"? Enter 1 (=A) or 2 (=B): <2> B will contain loadings; A will contain scores If you want to standardize "variables", type "1": <enter> Eigenvalues and cumulative percentages (Scree plot in Figure) 26.89 92.98 1.19 97.09 0.48 98.73 0.12 99.14 0.09 99.45 0.05 99.63 0.04 99.75 0.02 99.84 0.01 99.88 0.01 99.91 0.01 99.94 0.01 99.95 0.00 99.97 0.00 99.98 0.00 99.98 0.00 99.99 0.00 99.99 0.00 100.00 0.00 100.00 0.00 100.00 0.00 100.00 How many components do you want to maintain for rotation? <2> A1, B1 and/or C1 contain solution based on varimax rotation of loadings To study/plot the output you can now temporarily leave the interactive program to get back, type "return". You are now about to do a Tucker3 analysis. You have to specify the dimensionalities to use. (see Timmerman & Kiers, 2000, Kiers & der Kinderen, 2003) If you want to do the PCASUPs, specify "1": <return> You can now do a TUCKER3 ANALYSIS 15 How many A-mode components do you want to use? <2> How many B-mode components do you want to use? <2> How many C-mode components do you want to use? <2> Tucker3 function value at start is 56.49327452 Final Tucker3 function value after iteration 7 is 53.26369179 Tucker3 analysis with 2x2x2 components gave a fit of 84.75 % PLOTTING No special plotting routines are incorporated, but you can use any scatter-plotting routine you like (e.g., scatter, etc.,or "plotje.m") If you want to make plots of entities of one mode, you can use: matrices Aplot, Bplot, or Cplot, and use labels in laba, labb, labc. If you want to make plots of entities of two modes, with the third projected in it as axes, you can use: (BAplot,C),(ACplot,B), or (CBplot,A) It is sometimes useful to RENORMALIZE components, e.g., such that the influence of the different A-mode components on the core is equalized This will usually lead to correlated components in that mode, and to a more simple structure for the components in the OTHER modes, If you want to renormalize components, specify "1": <enter> Find useful simple structure rotation of core and components You can now carry out SIMPLE STRUCTURE rotations with varying weights; if desired, you can specify a range of different weights for each mode. To specify a range of values you type, e.g., 1:4 (then the values 1,2,3 and 4 are chosen), or 1:5:25 (then the values 1,6,11,16 and 21 are chosen), or [1 2 5 10 100] (then the values 1,2,5,10 and 100 are chosen). You can also specify a single weight (e.g., just 1). Analyses will be carried out with all combinations of relative weights. Specify (range of) relative weight(s) for A: <enter> Specify (range of) relative weight(s) for B: <enter> Specify (range of) relative weight(s) for C: <enter> Simple structure rotation with relative weights 0 0 0 Results from 5 random starts Run no. 1 f= Run no. 2 f= Run no. 3 f= Run no. 4 f= Run no. 5 f= 5.289 5.289 5.289 5.289 5.289 (core: (core: (core: (core: (core: 5.289; A: 5.289; A: 5.289; A: 5.289; A: 5.289; A: 0.000; B: 0.000; B: 0.000; B: 0.000; B: 0.000; B: 0.000; C: 0.000; C: 0.000; C: 0.000; C: 0.000; C: Three-way orthomax function value for best solution is 0.000), 0.000), 0.000), 0.000), 0.000), 4 iters 4 iters 4 iters 4 iters 3 iters 5.2891 Varimax values of core and AS, BT and CU (all based on "natural weights") Core 5.289 A B 1.553 C 2.878 0.773 16 These simplicity values are based on "natural" weights and therefore comparable across matrices. When multiplied by the relative weights, they give the contribution to the overall simplicity value (they are I^2/p, J^2/q or K^2/r, resp., times the sum of the variances of squared values) relative weights A B 0.00 0.00 C 0.00 simplicity values core A B 5.29 1.55 2.88 C 0.77 Give SIMPLE STRUCTURE rotated solution(s) in detail Use last simple structure rotated solution ? type "return" to specify a different simple structure rotation, specify "1" <return> Backnormalize A,B,C, and H Rotated Solution for A, B and C is in AS, BT and CU Rotated core in K Rotated B (BT) -0.21 0.23 -0.10 0.02 0.07 0.33 0.50 0.47 0.23 0.03 -0.08 -0.15 -0.19 -0.19 -0.19 -0.19 -0.18 -0.16 -0.13 -0.09 -0.02 -0.58 -0.55 -0.36 -0.18 0.08 0.09 0.06 0.07 0.08 0.10 0.10 0.09 0.09 0.09 0.08 0.07 0.07 0.09 0.13 0.18 0.22 Rotated C (CU) 0.43 -0.01 0.29 -0.07 -0.04 -0.22 0.29 -0.63 0.42 -0.16 0.35 0.29 0.27 0.33 0.28 0.40 0.27 0.42 0.28 0.22 Rotated core 17 B1xC1 B2xC1 -2.13 -0.29 B1xC2 0.47 -0.03 3.48 -0.01 B2xC2 0.00 16.71 You can keep present as final solution, or study a different one If you want to study one more solution, specify "1" : <return> You can now manually PERMUTE and REFLECT columns/rows of solution If you want to reflect/permute columns/rows, specify "1": <1> Give vector for reflection of columns of A (e.g., [1 -1 -1 1 ..]) : <return> Give vector with new order of columns of A (e.g., [3 1 4 2 ..]): < [2 1]> Give vector for reflection of columns of B (e.g., [1 -1 -1 1 ..]) : <enter> Give vector with new order of columns of B (e.g., [3 1 4 2 ..]): < [2 1]> If you want to further reflect/permute columns/rows, specify "1": <enter> Rotated B (BT) B1 B2 B3 B4 B5 B6 B7 B8 B9 B10 B11 B12 B13 B14 B15 B16 B17 B18 B19 B20 B21 -0.58 -0.55 -0.36 -0.18 0.08 0.09 0.06 0.07 0.08 0.10 0.10 0.09 0.09 0.09 0.08 0.07 0.07 0.09 0.13 0.18 0.22 -0.21 0.23 -0.10 0.02 0.07 0.33 0.50 0.47 0.23 0.03 -0.08 -0.15 -0.19 -0.19 -0.19 -0.19 -0.18 -0.16 -0.13 -0.09 -0.02 Rotated C (CU) C1 C2 C3 C4 C5 C6 C7 C8 0.35 0.29 0.27 0.33 0.28 0.40 0.27 0.42 0.43 -0.01 0.29 -0.07 -0.04 -0.22 0.29 -0.63 18 C9 C10 0.28 0.22 0.42 -0.16 Rotated core B1xC1 B2xC1 16.71 0.00 B1xC2 -0.01 3.48 B2xC2 -0.03 0.47 -0.29 -2.13 You can produce a JOINT PLOT now In joint plots, one mode is fixed, while the entities of the other two are plotted If you want to produce a joint plot for the current solution, specify "1": <1> Which mode do you want to keep fixed ("1", "2" or "3")? <3> For which component do you want to see the joint plot? <1> The plot displays 100.00 % of the information represented by the component of your choice Do you want to produce another joint plot? If so, type "1": <1> Which mode do you want to keep fixed ("1", "2" or "3")? <2> For which component do you want to see the joint plot? <1> The plot displays 100.00 % of the information represented by the component of your choice If you want to carry out a BOOTSTRAP procedure for computing confidence intervals for the current solution, specify "1": <enter> If you want to carry out a STABILITY CHECK on current or different solution, specify "1": <enter> Press any key to conclude analysis with FITPARTITIONING Contribution to fit (in %s) for all combinations of components These contributions can simply be added to get aggregate contributions of in case all components are orthogonal If this is not the case, aggregate fit contributions are given separately, below B1xC1 79.90 0.00 0.00 3.47 B2xC1 0.00 0.06 B1xC2 B2xC2 0.02 1.30 Relative fit of B-mode entities (in %) B1 B2 B3 B4 B5 B6 B7 B8 B9 B10 B11 B12 B13 B14 B15 B16 95.5 94.5 83.0 69.8 22.5 49.7 66.2 67.2 62.6 64.4 69.6 69.7 73.4 73.0 73.6 71.8 19 B17 B18 B19 B20 B21 70.7 72.5 76.8 82.2 83.4 Relative fit of C-mode entities (in %) C1 C2 C3 C4 C5 C6 C7 C8 C9 C10 87.8 85.0 67.6 83.4 81.3 92.8 81.2 91.7 78.9 81.8 NOTE: Residuals on "Res", can be used for further analyses References Andersson, C. A., Bro, R., 2000., The N-way Toolbox for MATLAB, Chemometrics & Intelligent Laboratory Systems. 52 (1):1-4. http://www.models.life.ku.dk/source/nwaytoolbox/ Booth, H., Maindonald, J., Smith, S. 2002. Applying Lee-Carter under conditions of varying mortality decline. Population Studies 56: 325-336. Brouhns, N., Denuit, M., Vermunt, J., 2002. A Poisson log-bilinear regression approach to the construction of projected lifetables. Insurance: Mathematics and Economics 31 (3): 373-393. Carroll, J.D., 1968. Generalization of canonical correlation analysis to three or more sets of variables. Proceedings of the 76th Annual Convention of the American Psychological Association 3: 227-228. Carter, L., Prkawetz, A., 2001. Examining structural shifts in mortality using the Lee-Carter method. Mpidr wp 2001-2007. Center for Demography and Ecology Information, University of Wisconsin-Madison. Eckart, C., Young, G., 1936. The approximation of one matrix by another of lower rank. Psychometrika 1: 211-218. Escofier, B., Pagès, J., 1990. Multiple factor analysis. Computational Statistics and Data Analysis 18: 121-140. Gabriel, K. R., 1971. The biplot-graphic display of matrices with application to principal component analysis. Biometrika 58: 453-467. Giordano, G., Russolillo, M., Haberman, S., 2008. Comparing Mortality Trends via Lee Carter Method in the Framework of Multidimensional Data Analysis. Mathematical and Statistical Methods in Insurance and Finance, 131-138. Edited by C. Perna & M. Sibillo. Milan: Springer. 20 Girosi, F., King G., 2005. A Reassessment of the LC Mortality Forecasting Method. Online at http://gking.harvard.edu/files/abs/lc-abs.shtml. Guillen, M., Vidiella-i-Anguera, A., 2005. Forecasting Spanish natural life expectancy. Risk Analysis 25 (5): 1161-1170. Hatzopoulos, P. Haberman, S., 2009. A parameterized approach to modelling and forecasting mortality. Insurance: Mathematics and Economics 44: 103-123 Hotelling, H., 1933. Analysis of a complex of statistical variables into principal components. J. Educ. Psychol. 24: 417-441, 498-520 Hyndman, R.J., Ullah, Md.S., 2007. Robust forecasting of mortality and fertility rates: a functional data approach. Computational Statistics and Data Analysis 51: 4942-4956. Kiers, H.A.L., 2000. Displaying results from three-way methods. Journal of Chemometrics 14: 151170. Kiers, H.A.L., van Mechelen, I., 2001. Three-way component analysis: Principles and illustrative application. Psychological Methods 6: 84-110. Kroonenberg, P.M., 1983. Three mode Principal Component Analysis. Theory and Applications. Leiden: DSWO Press. Kroonenberg, P.M., de Leeuw, 1980. Principal Component Analysis of three mode data by means of alternating least squares algorithms. Psychometrika 45: 69-97. Lee, R.D., Carter, L.R., 1992. Modelling and Forecasting U.S. Mortality. Journal of the American Statistical Association 87: 659-671. Li, N., Lee, R.D., 2005. Coherent mortality forecasts for a group of populations: an extension of the Lee-Carter method. Demography 42: 575-594. Renshaw, A., Haberman, S., 2003a. Lee-Carter mortality forecasting: a parallel generalized linear modelling approach for England and Wales mortality projections. Applied Statistics 52: 1, 119-137. Renshaw, A., Haberman, S., 2003b. On the forecasting of mortality reduction factors. Insurance: Mathematics and Economics 32: 379-401. Renshaw, A., Haberman, S., 2009. On age-period-cohort mortality rate parametric projections. Insurance: Mathematics and Economics 45: 255-270. Tabeau, E.; van den Berg Jeths, A.; Heathcote, C. eds. 2001. Forecasting mortality in developed countries. Dordrecht: Kluwer Academic. Tucker, L.R. 1966. Some mathematical notes on three-mode factor analysis. Psychometrika 31: 279–311. Tukey, J. W., 1977. Exploratory Data Analysis. Addison-Wesley. 21 Wilmoth, J.R., Valkonen, T. 2001. A parametric representation of mortality differentials over age and time. Paper presented at EAPS Seminar, “Trends in Differentials in Morbidity and Mortality: Analysis and Explanation.” Pontignano, Italy. 22