How Large is the Retirement Consumption Drop in Italy?∗
Erich Battistin
University of Padova and IFS
Agar Brugiavini
University of Venice and IFS
Enrico Rettore
University of Padova
Guglielmo Weber
University of Padova, CEPR and IFS
first draft September 2005
this version 27th February 2006
Abstract
In this paper we investigate the size of the consumption drop at retirement in Italy. We use
micro data on food, non-durable and total household spending covering the period 1991-2002,
and evaluate the change in consumption that accompanies retirement by exploiting the exogenous
variability in pension eligibility to correct for the endogenous nature of the retirement decision.
We take a regression discontinuity design approach, and make the identifying assumption that
consumption would be the same around the threshold for pension eligibility, if individuals would
not retire. We check in our data that a non-negligible fraction of individuals retire as soon as they
become eligible, and estimate at 9.2% the part of the consumption drop that is associated with
retirement induced by eligibility.
Keywords: Consumption, Regression Discontinuity Design, Retirement
JEL Classification: D9; E2
Paper presented at the RTN meeting on the Economics of Ageing, Venice, October 2005. We thank our discussant,
Mike Hurd, and conference participants for their helpful comments. We are grateful to Alberto Fossaluzza for excellent
research assistance, and to Lisa Callegaro for explanations on the codes that calculate pension eligibility for SHIW
respondents.
∗
1
Introduction
In most developed countries, consumption accounts for over two-thirds of GDP. In these countries a
rising fraction of the population is past retirement age. The way consumers respond to retirement
and the way they spend in their old age is thus a topic of great interest in the analysis of aggregate
economic fluctuations and in the economic policy debate.
The standard model to analyse the consumption-saving choice by the household sector has been
Modigliani’s life cycle model. The model has been extended to cover uncertainty, leisure choice and
a bequest motive (Deaton, 1992, Browning and Lusardi, 1996), but its key prediction can still be
described as follows: consumers form intertemporal plans aimed at smoothing their standard of living
(or marginal utility of consumption) over their life-cycle.
However, recent micro evidence has emphasized that there is a one-off drop in consumption at
the time of retirement that may be hard to reconcile with life-time optimizing behavior. This is
documented for the UK (Banks, Blundell and Tanner, 1998), for the US (Bernheim, Skinner and
Weinberg, 2001), and for Italy (Miniaci, Monfardini and Weber, 2005, MMW in what follows) and is
known as the retirement consumption puzzle (or retirement savings puzzle).
The literature mentions as possible reasons for this drop changes in preferences due to increased
non-market time, unexpectedly low pensions or liquidity problems as well as myopic or perhaps timeinconsistent behavior (see Hamermesh, 1984, Hurd and Rohwedder, 2003, and Aguiar and Hurst,
2004).The Italian case is of particular interest because one can rule out explanations related to lack
of resources. In fact, MMW document that actual and expected replacement rates were indeed
close to each other independently of the type of job previously held by the newly retired. Also,
liquidity problems are unlikely to play a role: Italian employees receive a large lump-sum payment
upon retirement (technically, a severance pay worth three times the gross annual salary). If cash
considerations matter, we would expect a surge in consumption at retirement rather than a drop.
These two facts suggest that consumption falls at retirement cannot be attributed to unexpected
income drops or liquidity problems.
The evidence provided in MMW is based on SFB diary-level data covering the 1985-96 period. The
SFB (Survey on Family Budgets, run by ISTAT) contains excellent information on expenditures, but
no information on past work activities. Thus MMW estimates the fall in non-durable consumption at
retirement, and emphasizes the role that changes in leisure have on both the composition and the level
of consumption, but cannot identify what part of the consumption fall at retirement is as planned
and what other part is instead due to the realization of bad health outcomes or other shocks to the
consumer’s environment that affect the retirement decision.
In this paper we investigate the size of the consumption drop associated to retirement in Italy by
exploiting the exogenous variability in eligibility to identify the causal effect in a regression discontinuity design framework. To this end, we use a different micro data set, the Bank of Italy Survey
on Household Income and Wealth (SHIW), covering the 1991-2002 period, that has recall records on
food, non-durable and total household spending (Battistin, Miniaci and Weber, 2003, compare consumption data across SFB and SHIW). We evaluate the change in consumption caused by retirement
by exploiting the exogenous variability in pension eligibility to correct for the endogenous nature of
the retirement decision: our identifying assumption is that consumption would be the same around
the threshold for pension eligibility, if no individual retired.
The remainder of this paper is organized as follows. Section 2 presents how we deal with the
endogeneity problem arising from self-selection of individuals into retirement. Section 3 deals with
data-related issues, in particular with the definition of the pension eligibility. In Section 4 we show
that pension eligibility is a variable that is measured with error, and we discuss the implications of
this on our estimates. Section 5 presents our results. The implications of our findings for reconciling
the retirement savings puzzle are presented in Section 6, while Section 7 concludes.
2
2
2.1
Identification
The regression discontinuity idea
This section presents the basic features of regression discontinuity analysis following the discussion in
Hahn et al. (2001), to which the interested reader is referred for further details. The relationship with
the literature on programme evaluation is established by comparing the retirement decision to that of
being exposed to a treatment.
Following the notation of the potential outcome approach to causal inference, let (Y1 , Y0 ) be the
two potential outcomes one would experience by retiring and not retiring, respectively. In the context
of this paper, Y1 and Y0 represent the household consumption expenditures corresponding to the head
being retired and not being retired, respectively. The causal effect of retirement on expenditures is
then defined as the difference between these outcomes, β = Y1 − Y0 , which is not observable at the
household level since being retired reveals Y1 but conceals Y0 . Accordingly, though not observable β
represents the change in consumption expenditures corresponding to a change in the retirement status
of the household head, which is our quantity of interest.
Let R be the binary variable denoting the treatment status, with R = 1 for retired heads and R = 0
otherwise. A discontinuity design (Thistlethwaite and Campbell, 1960) arises when the treatment
status R depends on an observable variable S and there exist a known point in the support of S where
the probability of being treated changes discontinuously. Formally, if s̄ is the discontinuity point, then
a regression discontinuity is defined if
P r{R = 1|s̄+ } 6= P r{R = 1|s̄− }.
(1)
Here and in the following s̄+ and s̄− refer to those individuals marginally above and below s̄, respectively. Throughout this paper, S is assumed to be continuous on the real line.
In the context of this paper, the expression in (1) implies that the probability of retirement varies
discontinuously with an observable variable S. Throughout our analysis, S will denote the distance
from the first time the household head becomes eligible for retirement. To fix ideas, let the eligibility
status be established according to the deterministic rule 1l(S ≥ 0). That is, individuals are eligible for
retirement if and only if they present a value of the variable S above the threshold s̄ = 0. Of course,
such a variable can take on negative values (if individuals are not yet eligible for retirement) as well
as positive values (if individuals, regardless of their retirement status, already are).
The potential of using eligibility rules to overcome the selection problems arising in the study of
causal effects has been already pointed out by several papers in the literature (see, amongst others,
Battistin and Rettore, 2005). In the context of this paper, pension eligibility does not necessarily imply
that individuals are actually retired; on the other hand, individuals not eligible can not be retired,
thus inducing a discontinuity in the probability of retirement around the threshold for eligibility. As
we will show in what follows, such a discontinuity can help solve the endogeneity problem arising from
the analysis of the retirement status.
Following Trochim (1984), the distinction between sharp and fuzzy designs depends on the size of
the discontinuity. The former design occurs when the probability of participating conditional on S
steps from zero to one as S crosses the threshold s̄. That is, the treatment status deterministically
depends on whether individuals’ values of S are above s̄
R = 1l(S ≥ s̄).
(2)
For example, a sharp design would correspond to the hypothetical situation in which retirement is
mandatory conditional on eligibility: is such an extreme case, we would have that S ≥ 0 implies R = 1
with probability one.
A fuzzy design occurs when the size of the discontinuity at s̄ is smaller than one. For example, a
fuzzy design can be thought as an instance in which R is a deterministic function of S for all subjects
3
but this function is different across individuals (see Hahn et al., 2001). It therefore follows that the
retirement decision fits a fuzzy regression discontinuity conditional on S, as not all eligible individuals
are actually retired and all ineligibles are in fact not retired.
As the decision to retire is entirely up to eligible individuals, it is crucial to discuss how the endogeneity problem arising from the retirement status can be accounted for in the context of discontinuity
designs. We will consider the case of a sharp discontinuity first, and discuss the fuzzy case further
below in this section. Let
Y = Y0 + R(s)β
be the observed outcome as it results from taking or not taking part into the programme: it follows
from the last expression that Y ≡ Y1 or Y ≡ Y0 depending on the retirement status of individuals
(R = 1 and R = 0, respectively). The dependence of the retirement status R on S is stressed by writing
R(s). The difference of mean outcomes for individuals marginally above and below the threshold s̄
E{Y |s̄+ } − E{Y |s̄− }
(3)
E{Y0 |s̄+ } − E{Y0 |s̄− } + E{R(s)β|s̄+ } − E{R(s)β|s̄− },
(4)
can be written as
which simplifies to
E{Y0 |s̄+ } − E{Y0 |s̄− } + E{β|s̄+ }
because of (2). The following condition is then sufficient for the mean impact of the treatment at s̄+
to be identified with a sharp discontinuity.
Condition 1. The mean value of Y0 conditional on S is a continuous function of S at s̄.
Accordingly, Condition 1 requires that in the counterfactual world of no retirement, no discontinuity would take place at the threshold for selection. In other words, this implies that the consumption
profile under the no retirement alternative is smooth enough as S crosses s̄. Intuitively, in order to
give a causal interpretation to discontinuities of Y around the threshold for eligibility s̄ = 0, it has to
be the case that in the absence of the treatment no discontinuity would be observed in the outcome
Y around s̄ = 0. If this condition holds, we can write
E{β|s̄+ } ≡ E{Y |s̄+ } − E{Y |s̄− },
so that the difference in expected consumption expenditures above and below the threshold for eligibility identifies the causal effect of retirement on consumption.1
When the treatment status is not the result of a sharp assignment, the discontinuity in the probability to retire around the threshold is smaller than one. According to the current literature, such a
discontinuity in the probability of retirement defines a fuzzy design. It follows that the mean impact
at s̄ cannot be identified by simply comparing the mean outcome for marginal retired to the mean
outcome for marginal non-retired households. In general, additional conditions are required to recover
meaningful causal parameters from (3), thus losing much of the simplicity of the design. Hahn et al.
1
It is worth stressing again that to meaningfully define marginal units (with respect to s̄) the selection variable S has
to be continuous. Estimation of the right-hand side (left-hand side) of (3) makes use of data only in a neighborhood on
the right (left) side of the discontinuity point. Unless one is willing to make some parametric assumptions about the
regression curve away from s̄, only data local to the discontinuity point help to estimate the jump. Asymptotically the
neighborhood needs to shrink as with usual non-parametric estimation, implying a non-standard asymptotic theory for
the resulting estimator of the mean impact (see Hahn et al., 2001).
4
(2001) as well as many other authors in the literature point out that assumptions can be made to
recover causal effects for a particular group of individuals around the threshold s̄. Such assumptions
qualify S as an instrumental variable for R around s̄, so that a LATE (Angrist and Imbens, 1994)
parameter can be estimated for the group of compliers. Heckman et al. (1999) emphasize this point
by saying that much of the simplicity of the design is lost moving from a sharp design to a fuzzy
design.
2.2
Discontinuities and pension eligibility
In the context of this paper, however, self selection of households into retirement fits the partially
fuzzy design described by Battistin and Rettore (2005).2 As a result of the eligibility rule and of
self-selection, the probability of retirement for those heads scoring a value of S below the threshold s̄
is zero by definition, since they are not eligible for retirement. The probability of retirement for those
scoring above s̄ is smaller than one because retirement is not mandatory. As a result, the probability of
retirement is discontinuous at the threshold for eligibility and the size of the discontinuity is less than
one (i.e. according to the terminology introduced in the previous section, a fuzzy design is defined).
As pointed out by Battistin and Rettore (2005), despite the fuzziness of this design the existence of a
sharp eligibility rule can help recover much of the simplicity of the design.
To recover the regularity conditions required for identification consider again the difference in (3).
Since participation is precluded to marginally ineligibles (R(s̄− ) = 0), the expression in (4) can be
written as
E{Y0 |s̄+ } − E{Y0 |s̄− } + E{R(s)β|s̄+ }.
If Condition 1 holds, by using the law of iterated expectations and by noting that E{R(s)β|R =
0, s̄+ } = 0 the previous expression equals
E{R(s)β|s̄+ } = E{β|R = 1, s̄+ }P r{R = 1|s̄+ },
so that the mean impact on participants in a right-neighborhood of s̄ is identified by
E{β|R = 1, s̄+ } =
E{Y |s̄+ } − E{Y |s̄− }
.
E{R|s̄+ }
(5)
In other words, Condition 1 is sufficient for the effect of the treatment on the treated to be identified
locally at the threshold for eligibility s̄.
It turns out that, despite the prima facie fuzzy nature of this set-up, the LATE (Imbens and
Angrist, 1994) at the discontinuity point is identified under the same condition used to estimate the
average treatment effect in the sharp design. The result rests on the fact that the probability of
retirement on the left-hand side of s̄ is zero by design, and this simplifies the expression in (4) without
further assumptions on individuals’ behavior.3
It also follows that (5) can be estimated from an instrumental variable procedure, where eligibility
is used as an instrument for the actual status R conditional on S.
3
Data
3.1
The definition of pension eligibility
In our analysis, information on consumption expenditures and pension status is obtained from the
Bank of Italy Survey on Household Income and Wealth (SHIW in what follows). The aim of this
2
Heckman et al. (1999) as well as Van der Klaauw (2002) explicitly mentions the potential for using the dicontinuity
arising from the eligibility criteria for a social programme.
3
Results by Hahn et al. (2001) on non-parametric inference in regression discontinuity designs straightforwardly apply
to the estimation of (5).
5
Table 1: Composition of the SHIW sample
1991 1995 1998 2000 2002
Males
Worker 47%
44%
44%
45%
44%
Retired 17%
20%
19%
22%
25%
Females
Worker 24%
25%
26%
26%
27%
Retired 12%
14%
12%
13%
16%
section is to summarize how we derive the variable S that measures the time to (or from) pension
eligibility (see also Boeri and Brugiavini, 2005).
As we have shown in the last section, pension eligibility is a crucial variable in our analysis. This
is measured both on the basis of age and on the basis of seniority (accrued contributions years). The
SHIW sample can be used to compute eligibility: it is a large cross-sectional sample and covers several
years (1987 to 2002), though it has been run every two years. SHIW also has a small panel component
(rotating panel) that can be used to study actual transitions from work to retirement.
Table 1 provides a brief description of the SHIW data available for workers and retired individuals
(we neglected other conditions such as disability or being homemaker). It should be noted that a large
percentage of individuals who are currently active or have been active in the past are men. This result
is largely dominated by the labour market behaviour of older cohorts: women were characterized by
lower educational attainments and lower labour market participation. An indication of this is the fact
that over time there is a non negligible increase in the percentage of working women.
3.1.1
Working life and pension claims
A crucial feature of many pension systems is the design of pension benefits: in most European countries
this is the defined benefit (DB) variety and it is related to some average of lifetime earnings. An extreme
version of this is a final salary computation method: this was basically adopted in Italy until the 1992
pension reform as well as during the transitional phase which Italy is going through. In particular, until
1992 the pension benefit was based on the average of the last five years earnings, during the transitional
phase these became the last 10 years earnings. The 1995 Pension reform changes radically this system
as benefit should be computed according to a Notionally Defined Contribution (NDC) method. In the
latter case, pension benefits are automatically linked to an average of lifetime earnings, adjusted by
some actuarial coefficients.
In this paper we are particularly interested in eligibility conditions: until 1992-1993 they were
quite simple as people could retire at age 60 (55 women) in the private sector, or any age if they had
completed 35 years of contributions. The early retirement option was quite generous because it did
not attract any actuarial penalty and a large fraction of workers retired before the reforms through
this route. After the 1992 reform the normal retirement age were set at 65 for men and 60 for women
(to be reached gradually by the year 2001). Both age and seniority requirements for early retirement
grew over time, starting essentially in 1995 (according to the sector of employment) as shown in
Table 2 (rules prevailing after 1998 according to the Law 449/1997; these rules apply to white- collar
employees, they differ only slightly for blue-collar employees). There was enough flexibility offered by
these criteria which we can model explicitly in our data. One final problem to be mentioned is that,
with some interruptions, the Government has imposed a delay-window on retirees after they became
eligible, which, depending on the month of birth, could postpone retirement by 3 or even 6 months.
While we cannot observe the effects of these windows directly, we do not think they would introduce
too much noise in our data because we measure consumption and eligibility at yearly frequency.
6
Table 2: Retirement eligibility rules: age and years of contributions
Private Sector
Public Sector
Self-employed
age and
only
age and
only
age and
only
years
years
years
years
years
years
1996 54 and 35
36
53 and 35
36
57 and 35
40
1997 54 and 35
36
53 and 35
36
57 and 35
40
1998 54 and 35
36
53 and 35
36
57 and 35
40
1999 55 and 35
37
53 and 35
37
57 and 35
40
2000 55 and 35
37
54 and 35
37
57 and 35
40
2001 56 and 35
37
55 and 35
37
58 and 35
40
2002 57 and 35
37
55 and 35
37
58 and 35
40
2003 57 and 35
37
56 and 35
37
58 and 35
40
2004 57 and 35
38
57 and 35
38
58 and 35
40
2005 57 and 35
38
57 and 35
38
58 and 35
40
2006 57 and 35
39
57 and 35
39
58 and 35
40
2007 57 and 35
39
57 and 35
39
58 and 35
40
2008 57 and 35
40
57 and 35
40
58 and 35
40
3.1.2
The measurement of years of contributions
In some years contributions have been explicitly recorded in SHIW for workers (not for people currently
retired); however this question was not asked in 1993 and missing values are occasionally found also
in other years. Because the age of the respondent does not provide enough information to measure
eligibility, we have adopted a simple imputation method, by distinguishing retirees from workers. In
fact, we make use of retrospective information on (i) the age that the respondent reported as age
entering the labour market, and (ii) the self-reported age of retirement (if retired). The imputation
is carried out also taking account of whether the individual belongs to the panel component or not.
We can distinguish the following five cases. First, if the worker is interviewed only once in 1993,
then we have no choice than to use current age minus the self-reported age of entry in the labour
market. Second, if the worker belongs to the panel sub-sample of SHIW, such a measure of years
of contributions is corrected to account for the contributions recorded in successive years. Third, for
a retired person who is only observed in 1993, use is made of the retirement age minus entry in the
labour market. Fourth, if a retired person belongs to the panel, a coherency check is carried out.
Finally, if a retired person belongs to the panel and worked in a previous wave, contributions can be
recovered from the previous wave.
It is of course worth looking at the difference between the measure of years contributions as selfreported by individuals and the measure imputed by our procedure. This imputation could be a
noisy measure because the respondent can have a vague recollection (especially if far in the past)
of the events. However as Figure 1 shows, if we exclude the year 1993 there is basically complete
coincidence between the original years of contributions and the final sample (including imputations
based on potential experience). When we include the year 1993 there is a more pronounced heaping
at age 35 for men, while for women the distribution shows an equally disperse pattern (see Figure 2).
It follows that the variable which measures the distance from the first eligibility year (S) is computed by first establishing on the basis of recorded age and years of contributions the eligibility year
and taking the difference with the survey year. The distribution of this variable for males and females
is reported in Figure 3. Negative values of this variable imply that eligibility for retirement has not
yet been attained. Positive values measure the time from the first year of eligibility.
7
20
20
Contributive years − Male − Waves: 1995 to 2002
Percent
10
5
0
0
5
Percent
10
15
Imputed
15
Original
25
30
35
40
20
25
30
35
40
Contributive years − Female − Waves: 1995 to 2002
15
15
20
Percent
5
0
0
5
Percent
10
Imputed
10
Original
15
20
25
30
35
40
15
20
Figure 1: Contributive years - waves 1995 to 2002
8
25
30
35
40
25
25
Contributive years − Male
5
Percent
10
15
20
Imputed
0
0
5
Percent
10
15
20
Original
20
25
30
35
40
20
25
30
35
40
15
15
Contributive years − Female
Percent
5
0
0
5
Percent
10
Imputed
10
Original
15
20
25
30
35
40
15
20
Figure 2: Contributive years - waves 1993 to 2002
9
25
30
35
40
0
1
2
Percent
3 4 5
6
7
8
Male
−20
−15
−10
−5
0
5
10
15
20
25
30
35
40
0
1
2
Percent
3 4 5
6
7
8
Female
−20
−15
−10
−5
0
5
10
15
Figure 3: Distribution of S
10
20
25
30
35
40
Table 3: Percentage of retired individuals by eligibility status
eligibility
males
females
-5
0.0441426 0.0269058
-4
0.1073826 0.0681818
-3
0.0893855 0.0431655
-2
0.0677966 0.0854701
-1
0.1645161 0.2121212
0
0.2810345 0.4880952
1
0.5779377 0.7289157
2
0.5747663 0.7167630
3
0.6376812 0.7113402
4
0.7164179 0.7927461
5
0.6564885 0.6898148
3.1.3
Years of contributions and the retirement status
To conclude this section, in Table 3 we present the percentage of individuals self-reporting to be retired
by the values of the eligibility variable S. As already discussed above, since retirement can only be
entered conditional on eligibility, it has to be the case that no retired individuals are observed for
negative values of the eligibility variable. If this were the case, it could be symptomatic of errors in
the recording of the eligibility variable or, possibly, misreporting of the retirement status (or both).
Despite the sharp design implied by the eligibility rule, we find that a non-negligible proportion
of individuals whose imputed value of the eligibility variable is negative are in fact retired. Notably,
something like 16 percent of males and 21 percent of females marginally ineligible for retirement selfreport to be retired. These figures decrease as the time to the first eligibility year increases, though
proportions are still non-negligible when S = −5.
Sadly enough, it is somehow immediate to conclude that the identification strategy outlined in
Section 2 no longer applies. More precisely, estimators of the causal effect of retirement on consumption
based on discontinuities around S = 0 are in general biased for the parameter of interest. In the next
section, we will address this problem and propose and estimation strategy which - under certain
assumptions on the data generating process of the errors - provides consistent estimates of the causal
parameter.
3.2
Sample selection
In estimation we take all observations that are within a 10-year band from eligibility. However we
drop those that are exactly at eligibility, because the recall question on consumption could cover both
pre and post-retirement periods. We investigated different selection criteria: smaller bands (such as
5-years bands) are heavily affected by outliers in the age profile; with larger bands (15-year and more)
composition effects start playing an important role, because of mortality.
In our empirical exercise, we take the head of the household to be the man. We select couples and
single males, and do not use information on the wife’s employment position to classify a household as
retired or otherwise.
4
Measurement errors
Throughout this section we will allow for measurement error in the variable S, but we will maintain
the assumption that R in not mismeasured. For a detailed analysis of the impact of measurement
error in S for the identification of the causal effects of a binary treatment see Battistin and Chesher
11
(2004). For the impact of misclassification (and/or misreporting) of R see Battistin and Sianesi (2005).
Note, however, that in the context of this paper having measurement error in S implies that also the
eligibility status is potentially misrecorded, thus inducing a more complicated structure of the error.
The motivation of allowing for measurement error in the time-to(from)-eligibility variable S builds
on the evidence presented in Section 3.1, in that a non-negligible fraction of non-eligibles for pension
indeed self-report to be pensioners. Under the maintained assumption that the pension status is not
misreported, such evidence is consistent with the problem of having measurement error in S. It is
worth noting that the crucial assumption of correct classification of pensioners can be motivated by
the structure of the SHIW questionnaire, in that those individuals who self-report to receive pension
benefits are actually forced to answer a very detailed set of questions in this respect.
The results in this section can be summarized as follows. First, we show that the evidence provided
in Section 3.1 is not consistent with the hypothesis of having classical measurement error in S. A
more general model for measurement error is therefore needed. Second, we assume that individuals
whose observed value of the eligibility variable is τ are in fact a mixture of individuals whose true
value, S ∗ say, is S ∗ = τ and individuals whose reported value S = τ is affected by measurement error
Sobs = S ∗ Z + S(1 − Z),
where Z denotes a dummy taking value one for the exact reporters and S 6= S ∗ because of measurement
error. This is known as the contaminated sampling model discussed, amongst others, by Horowitz and
Manski (1995). Third, we show that if the mixture groups indexed by Z are not systematically
different with respect to (Y, R, S ∗ ), the sample analogues of the quantities in (5) would define a biased
estimator for the parameter of interest.4 Finally, we show that the sample analogue of the fuzzy
regression discontinuity estimand is in fact consistent for the parameter defined in (5). This results
will be heavily used in the estimation section below, as it implies that consistent estimates of the causal
effect of retirement on consumption can be recovered by a simple instrumental variable strategy where
the eligibility status is used to solve for the endogeneity of the retirement status (see Imbens and
Angrist, 1994, and Hanh et al., 2001).
4.1
A formal setup
From the definition of Sobs given above and by using the law of iterated expectations we have
E{R|Sobs = sobs } = E{R|S ∗ = sobs , Z = 1}E{Z|Sobs = sobs }
+ E{R|S = sobs , Z = 0}(1 − E{Z|Sobs = sobs })
= E{R|S ∗ = sobs , Z = 1}E{Z|Sobs = sobs }
+ (1 − E{Z|Sobs = sobs })
Z
E{R|S = sobs , S ∗ = τ, Z = 0}fS ∗ |S,Z (τ |sobs , 0)dτ.
Under the assumption Z⊥(Y, R, S ∗ , S) which states the irrelevance of Z for the sampling process, and
the assumption of non-differential measurement error, (Y, R)⊥S|S ∗ , the last expression becomes
E{R|Sobs = sobs } = E{R|S ∗ = sobs }E{Z|Sobs = sobs }
+ (1 − E{Z|Sobs = sobs })
Z +∞
E{R|S ∗ = τ }fS ∗ |S (τ |sobs )dτ,
s̄
E{R|S ∗
as
= τ } = 0 when τ < s̄. Note that, in general, the measurement error in S does not need
to be classical, though it has to be non-differential, i.e. it must contain no information on (Y, R) once
the true value S ∗ has been controlled for (see Bound et al., 2001).
4
In future research we aim at relaxing this assumption by allowing for no-zero correlation between Z and (Y, R, S ∗ ).
12
Under smoothness conditions of the distribution of (S, S ∗ ) around (s̄, s̄), it follows from the last
expression that
E{R|Sobs = s̄+ } − E{R|Sobs = s̄− } = E{R|S ∗ = s̄+ }E{Z|Sobs = s̄− },
(6)
implying that the discontinuity in the retirement probability observed around the threshold for eligibility understates the true discontinuity by means of the term E{Z|Sobs = s̄− }. It therefore follows
that the estimated discontinuity is downward biased for the true discontinuity. The bias term can
be estimated from the proportion of heads who self-report being retired though marginally ineligible
according to Sobs , P {R = 1|Sobs = s̄− }.
By applying a similar argument to the regression function of Y on Sobs we obtain
E{Y |Sobs = s̄+ } − E{Y |Sobs = s̄− } = (E{Y |S ∗ = s̄+ } − E{Y |S ∗ = s̄− })
E{Z|Sobs = s̄− },
(7)
which implies that the discontinuity in consumption expenditures estimated around Sobs = s̄ is still
downward biased for the true discontinuity.
4.2
Identification
In the notation of the previous section, the parameter in (5) can be written as
E{β|R = 1, S ∗ = s̄+ } =
E{Y |S ∗ = s̄+ } − E{Y |S ∗ = s̄− }
,
E{R|S ∗ = s̄+ }
which depends on the joint distribution of (Y, R, S ∗ ). Because of measurement error in Sobs , (6) and
(7) imply that the estimator constructed by using the empirical analogues of the quantities in the last
expression from raw data (Y, R, S) is not consistent for the parameter of interest.
However, it is immediate to see how the following estimator
E{Y |Sobs = s̄+ } − E{Y |Sobs = s̄− }
,
E{R|Sobs = s̄+ } − E{R|Sobs = s̄− }
(8)
is consistent for the causal effect of retirement on consumption under the assumptions made on the
measurement error in Sobs . As pointed out by Imbens and Angrist (1994), the latter expression can
be interpreted as an instrumental variable estimand, where the eligibility status is used to correct for
the endogeneity of R.
5
5.1
Empirical analysis
Estimation issues
The causal effects of retirement on consumption can be estimated along the lines of what discussed in
the last section. The sample analogue of (8) can be obtained by taking the ratio of the discontinuity
pictured in the top panel of Figure 4 to the discontinuity in the bottom panel of the same figure.
Though the effect of interest can be non-parametrically identified, the analysis presented in what
follows builds on a fully parametric approach, which - we think - represents a better framework to use
while communicating our results. As the sample analogue of (8) coincides with an instrumental variable
estimator where the endogenous variable R is instrumented with the eligibility status 1l(Sobs ≥ s̄) (see
Imbens and Angrist, 1994), the regressions presented in the next section will all take the following
form
2
Y = β0 + β1 R + β2 Sobs + β3 Sobs
+ ε,
where the first-stage regression is run on 1l(Sobs ≥ s̄).
13
9.7
total expenditure
9.75
9.8
9.85
9.9
extrapolated at S=0
−9
−7
−5
−3
−1
1
3
time to/from eligibility
5
7
9
5
7
9
0
proportion of retired heads
.2
.4
.6
.8
extrapolated at S=0
−9
−7
−5
−3
−1
1
3
time to/from eligibility
Figure 4: Non-parametric estimation of the causal effect of retirement on non-durable consumption
expenditures
14
5.2
Table 4: Estimation results
Non-durable expenditure
coefficient std. err.
t
Male retired
-0.09217
0.04111
-2.24
Sobs male
-0.00485
0.00145
-3.35
2 male
Sobs
-0.00026
0.00008
-2.96
p-value
0.039
0.004
0.009
Food expenditure
coefficient
Male retired
-0.06882
Sobs male
-0.00480
2 male
Sobs
-0.00024
p-value
0.197
0.023
0.062
std. err.
0.05108
0.00191
0.00001
t
-1.35
-2.51
-2.01
Results
The estimation procedure that we take can be described as follows. First, averages of household nondurable expenditure and proportions of retired male heads by values of Sobs (between −10 and 10) are
considered, that is the twenty observations corresponding to the top and the bottom panels of Figure
4, respectively. Second, we regress averages of household non-durable expenditure on proportions
of retired heads and a quadratic polynomial in Sobs , restricting the sample to “husband and wife”
and “single males” household only (the inclusion of additional regressors such as time dummies leads
to qualitatively similar results, so we have decided to report the simplest specification). Implicit in
this specification is the assumption that the leisure of the spouse is separable from that of the male
head. Retirement is instrumented with the eligibility status, yielding good results for the first stage.
Clustering is explicitly taken into account in estimation, by computing a weighted IV estimator with
weights reflecting the cell size.
Two sets of results are presented: for non-durable consumption and for food at home consumption.
Results for non-durable consumption are reported in the top panel of Table 4, suggesting a consumption
drop of around 9.2 percent, that is significantly different from zero at the 95 percent level. Results for
food at home are reported in the bottom panel, suggesting a smaller consumption drop (6.9 percent),
that is not significantly different from zero. We also estimated these equations using per-equivalent
adult consumption measures, with almost identical results.
6
Some back of the envelope stuff
We have found that consumption drops at retirement by a relatively large amount: 9.2%. The question
is whether this drop should be taken as evidence against life-time optimizing behaviour.
A number of papers have emphasized that a consumption drop at retirement is compatible with the
life-cycle model if leisure affects the marginal utility of consumption. Specific ways in which this nonseparability may come into play at retirement are that work-related expenditures are no longer needed
(Banks, Blundell and Tanner, 1998, emphasize this channel, but their estimate of the consumption
drop is much smaller, around 3%, two thirds of which is anticipated) or some goods and services may
start being produced at home (as argued in Aguiar and Hurst, 2004, and Hurd and Rohwedder, 2003,
who report anticipated drops in the 15 − 20% region).
A simple way to find out whether our estimated consumption drop is consistent with the life
cycle model is to perform some back-of-the-envelope calculations. Let us take the simplest possible
utility function, a power utility defined over a Cobb-Douglas composite good made of non-durable
15
consumption, C, and (male) leisure, l:
¢1−γ
Ctα lt1−α
Ut =
1−γ
¡
where γ > 0 is the reciprocal of the elasticity of intertemporal substitution (EIS) and α measures the
within period consumption share for periods when leisure is not at a corner (hence 0 < α < 1):
α=
pt Ct
pt Ct + wt lt
where pt is the price of the consumption good at time t and wt the nominal wage at time t.
Life-time optimization implies that households should keep the marginal utility of consumption,
Uc , constant:
Uc = αC α−1−αγ l(1−α)(1−γ)
This implies that consumption must react to changes in leisure according to the following relation:
C(1 − α)(1 − γ)
dC
|
.
=
dl Uc =U
l(1 − α + αγ)
We see that consumption should drop as a result of an increase in leisure if γ > 1, it should increase
otherwise. A great attraction of the Cobb-Douglas lies in its analytical tractability: the formula above
can be used to derive an elasticity as a function of just two preference parameters, α and γ.
We can estimate α in our data as follows. We take a sample of prime-age workers and assume
that l is defined as (T − h), where T is the maximum number of hours a person could possibly work
(16 a day, say) and h is hours of work. In the case where h = 8, for instance, l = 8, and wl = wh =
earnings. The average propensity to consume out of earnings is roughly 80%, so α = 0.44.No estimate
for the elasticity of intertemporal substitution exists in Italy (to our knowledge), but studies on micro
data run in other countries suggest that the elasticity should lie in the 0.5 − 0.8 range. The real
business cycle literature as well as recent studies on aggregate consumption and asset returns (Bansal
and Yaron, 2004) emphasize the elasticity should be in excess of unity. We shall therefore consider a
range for γ between 0.5 and 1.5.
We have estimated the average consumption drop associated to retirement to be 9.2%, with a
standard error of 4.1%. In our calculations of the α parameter, retirement doubles leisure. Thus the
estimated elasticity is indeed −0.092.
A 0.092 consumption drop is consistent with utility maximization if γ = 1.1775 (EIS = 0.85).
If we consider a one-standard error range around the point estimate, we find that a 0.133 drop is
consistent with γ = 1.265 (EIS = 0.79) and a 0.051 drop is consistent with γ = 1.095 (EIS = 0.91).
In this context, a zero drop obtains if γ = 1 (EIS =1), whereas consumption should increase at
retirement for smaller values of γ.
This simple, one-good case hides the fact that the impact of retirement on total (non-durable)
consumption is ambiguous, because some goods may be leisure substitutes and some other leisure
complements. MMW show examples of both, notably food out and transport as substitutes, food at
home and heating fuel as complements.
We do not have detailed expenditure information in our main data set, SHIW, but can access
diary-level data on consumer spending for the year 2002. This large data set, collected by the Italian
statistical office (ISTAT), contains records of current employment, household composition, size of the
main residence and a few other household characteristics, but no information on years of contributions,
or past employment histories. Thus years to and from pension eligibility are not known, and our
identification strategy cannot be applied.
What we can do is to compare two groups of households, those whose head’s age lies between
50 and 54 and 65 to 69. Heads of household in the former group are mostly employed (81.8% are
16
Table 5: Consumption drop in SFB diary data
unadjusted
p-score adjusted
Category
difference
s.e.
difference
s.e.
Non-durable
-510.35
22.70
-241.02
29.74
Food at home
-89.70
6.74
-5.96
8.90
Meals out
-44.37
2.95
-35.68
3.46
Alcohol
-5.24
0.96
-2.75
1.13
Tobacco
-13.35
0.95
-7.98
1.26
Clothing
-99.09
7.26
-58.05
9.28
Personal services
-7.96
2.73
-5.29
3.01
Transport
-153.67
7.56
-76.04
9.91
Heating
-19.75
2.56
-8.73
3.19
Phones
-15.95
0.96
-8.99
1.14
Housing services
-7.82
4.34
-2.55
4.47
Other
-53.43
4.02
-28.99
4.93
employed, 9.6% are retired, the others are either unemployed or out of the labour force), in the latter
they are mostly retired (82.7% are retired, 8.0% are employed, all the others are out of the labour
force).
In Table 5 we report the difference between average spending of the older group and average
spending of the younger group, and its standard error. The first column lists the various commodities
considered, the second and third columns present a straight comparison, whereas the fourth and fifth
columns refer to a comparison that corrects for composition effects in terms of region of residence,
number of equivalent adults and size of the main residence.5 This correction is meant to remove that
part of difference that can be attributed to age-related changes in household composition as well as
(to some extent) cohort effects, under the assumption that the size of the main residence for a given
household size correlates with life-time wealth.
The first row of numbers tells us that non-durable consumption falls by 510 euros a month (−31.1%)
between the early fifties and late sixties. However, once composition effects are taken into account, this
drop is reduced to 241 euros a month (−15.6%). This is larger than our estimate of the consumption
drop at retirement, suggesting that age and composition effects play a role that is not fully accounted
for in our adjustment procedure. One possible interpretation of our estimates is that almost 60% of
the overall consumption drop over this period of the life cycle is due to retirement, the remaining 40%
reflects changes in preferences due to poorer health and other unobservable age-related characteristics.
If we look at the adjusted drops, we see that the largest items are meals out, clothing and transport,
that account together for 169.77 euros, that is for over two thirds of the overall drop. At least two of
these items, clothing and transport, are typically considered work-related expenses, whereas the drop
in meals out is consistent with the home production hypothesis. In fact, these is a switch from meals
out to food at home in relative terms: food at home stays constant once composition effects are taken
into account (and increases its budget share considerably), whereas meals out are reduced by 35.68
euros - around 41% in relative terms! - and so does its budget share.
The conclusion that we draw from this exercise is that the our estimated retirement consumption
drop could well be due entirely to a reduction of work-related expenses and a substitution away from
market goods to home-production of food.
5
The adjustment was made so to make the distribution of these characteristics for the younger and the older groups
be equal to the distribution of households whose head is aged between 59 and 64.
17
7
Conclusions
In this paper we have investigated the size of the consumption drop due to retirement in Italy. We have
used micro data covering the 1991-2002 period on food, non-durable and total household spending,
and evaluated the change in consumption that accompanies retirement by exploiting the exogenous
variability in pension eligibility to correct for the endogenous nature of the retirement decision. We
have taken a regression discontinuity design approach, and made the identifying assumption that
consumption would be the same around the threshold for pension eligibility, if the individual could
not retire. We have shown that a non-negligible fraction of individuals retire as soon as they become
eligible, and estimated the part of the consumption drop that is associated with retirement induced
by eligibility. Given that pension eligibility is a variable that is measured with error, we have also
evaluated the impact of measurement error on our estimates.
Our key result is that non-durable consumption drops by 9.2% because of (male) retirement. We
have shown that such fall is compatible with a constant marginal utility of consumption if preferences
are non-separable between consumption and leisure. In particular, if preferences are Cobb-Douglas in
consumption and male leisure, we have shown that we would require the elasticity of intertemporal
substitution to be between 0.8 and 0.9, in line with recent micro evidence for the US and the UK.
We have also produced evidence that this drop can be mostly accounted for by falls in goods that are
typicaly considered to be work-related expenses or leisure substitutes (clothing, transport, meals out).
References
[1] Aguiar Mark and Erik Hurst (2004), ”Consumption vs. Expenditure”, NBER WP 10307
[2] Angrist, Joshua D., and Guido W. Imbens (1994), ”Identification and Estimation of Local Average
Treatment Effects”, Econometrica, 62(2), 467-476
[3] Banks, James, Richard Blundell and Sarah Tanner (1998) ”Is There a Retirement-Savings Puzzle?”, American Economic Review, 88(4), 769-788
[4] Battistin, Erich and Andrew Chesher (2004), ”The Impact of Measurement Error on Evaluation
Methods based on Strong Ignorability”, mimeo, Institute for Fiscal Studies, London
[5] Battistin, Erich, Raffaele Miniaci and Guglielmo Weber (2003) ”What Do We Learn from Recall
Consumption Data?”, Journal of Human Resources, 38(2), 354-385
[6] Battistin, Erich and Enrico Rettore (2005), ”Ineligibles and Eligible Non-Participants as a Double
Comparison Group in Regression Discontinuity Designs”, Journal of Econometrics, forthcoming
[7] Battistin, Erich and Barbara Sianesi (2005), ”Misreported Schooling and Returns to Education:
Evidence from the UK”, mimeo, Institute for Fiscal Studies, London
[8] Bernheim, B. Douglas, Jonathan Skinner and Steven Weinberg (2001) ”What Accounts for the
Variation in Retirement Wealth Among U.S. Households?”, American Economic Review, 91(4),
832-857
[9] Boeri, Tito and Agar Brugiavini (2005), ”Pension Reforms and Women at Work”, paper presented
at IZA conference on The Older Worker in the Labor Market: Work Time and Work Incentives,
Lisbon.
[10] Bound, John, Brown, Charlie and Nancy A. Mathiowetz, N. (2001), “Measurement error in survey
data”, in J.J. Heckman and E. Leamer (eds.), Handbook of Econometrics. Vol. 5, Amsterdam:
North-Holland, 3705-3843
18
[11] Browning, Martin and Anna Maria Lusardi (1996) ”Household Saving: Micro Theories and Micro
Facts”, Journal of Economic Literature, 34(4), 1797-1855
[12] Deaton, Angus (1992) Understanding Consumption, Oxford: Oxford University Press
[13] Hahn, Jinyong, Petra Todd. and Wilbert van der Klaauw(2001), ”Identification and Estimation
of Treatment Effects with a Regression-Discontinuity Design”, Econometrica, 69(1), 201-209
[14] Hamermesh, Daniel S. (1984) ”Consumption During Retirement: The Missing Link in the LifeCycle”, Review of Economics and Statistics, 66(1), 1-7.
[15] Heckman, James J., Robert Lalonde and Jeffrey Smith (1999), ”The Economics and Econometrics
of Active Labor Market Programs”, Handbook of Labor Economics, Volume 3, Ashenfelter, A.
and Card, D. (eds.), Amsterdam: Elsevier Science
[16] Horowitz, Joel L. and Charles F. Manski (1995), ”Identification and Robustness with Contaminated and Corrupted Data”, Econometrica, 63(2), 281-302
[17] Hurd, Michael and Susann Rohwedder (2003), ”The Retirement-Consumption Puzzle: Anticipated and Actual Declines in Spending at Retirement”, NBER WP 9586
[18] Johnson, Norman L., Samuel Kotz and N. Balakrishnan (1994), Continuous Univariate Distributions, New York: Wiley
[19] van der Klaauw, Wilbert (2002), ”Estimating the Effect of Financial Aid Offers on College Enrollment: A Regression-Discontinuity Approach”, International Economic Review, 43(4), 1249-1287
[20] Miniaci, Raffaele, Chiara Monfardini and Guglielmo Weber (2005) ”Is There a Retirement Consumption Puzzle in Italy?”, mimeo, University of Padua
[21] Thistlethwaite, Donald L. and Donald T. Campbell (1960), ”Regression discontinuity analysis: an
alternative to the ex post facto experiment”, Journal of Educational Psychology, 51(6), 309-317
[22] Trochim, William M. K. (1984), Research Design for Program Evaluation: the RegressionDiscontinuity Approach, Beverly Hills: Sage Publications
19