Comparison of Stochastic Claims Reserving Models in Insurance
László Martinek
1
∗1
, Miklós Arató
†1
and Miklós Mályusz1
Institute of Mathematics, Eötvös Loránd University, Budapest, Hungary
arXiv:1501.06155v1 [stat.ME] 25 Jan 2015
January 27, 2015
Table of Contents
1 Introduction
2
2 Distributional Models
3
2.1
Log-normal Model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
3
2.2
Negative Binomial Model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
3
2.3
Poisson Model
4
2.4
Over-dispersed Poisson Model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
5
2.5
Gamma Model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
5
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
3 Stochastic Claims Reserving
6
3.1
Parametric models . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
6
3.2
Bootstrap methods with over-dispersed Poisson and gamma distributions . . . . . . . . .
6
3.3
Semi-stochastic methods . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
6
4 Probabilistic Forecasts
7
4.1
Introductory Example . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
7
4.2
PIT, Empirical Coverage, Average Width . . . . . . . . . . . . . . . . . . . . . . . . . . .
8
4.3
Continuous Ranked Probability Score and Energy Score . . . . . . . . . . . . . . . . . . .
9
4.4
Mean square error of prediction (MSEP) . . . . . . . . . . . . . . . . . . . . . . . . . . . .
11
5 Simulation and Results
12
5.1
Preliminary Remarks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
12
5.2
Monte Carlo Type Method . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
12
5.3
Results on a Gamma distributed example . . . . . . . . . . . . . . . . . . . . . . . . . . .
13
5.4
Public payments data example . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
15
6 Conclusions
16
*
Abstract
The appropriate estimation of incurred but not reported (IBNR) reserves is traditionally one of
the most important task of actuaries working in casualty and property insurance. As certain claims
∗
[email protected]
• Room D-3-309, Pázmány Péter sétány 1/C, 1117 Budapest, Hungary
• Room D-3-311, Pázmány Péter sétány 1/C, 1117 Budapest, Hungary
†
[email protected]
1
are reported many years after their occurrence, the amount and appropriateness of the reserves has
a strong effect on the results of the institution. In recent years, stochastic reserving methods had
become increasingly widespread. The topic has a wide actuarial literature, describing development
models and evaluation techniques. We only mention the summarizing article England and Verrall
[2002] and book Wüthrich and Merz [2008].
The cardinal aim of our present work is the comparison of appropriateness of several stochastic
estimation methods, supposing different distributional development models. We view stochastic reserving as a stochastic forecast, so using the comparison techniques developed for stochastic forecasts,
namely scores, is natural, see Gneiting and Raftery [2007], for instance. We expect that in some cases
this attitude will be more informative than the classical mean square error of prediction measure.
Keywords: run-off triangles, stochastic claims reserving, scores, Monte Carlo simulation
1
Introduction
The appropriate estimation of outstanding claims (incurred but not reported (IBNR) claims and reported
but not settled (RBNS) claims) is crucial to preserve the solvency of insurance institutions, especially
in casualty and property insurance. A general assumption is that claims related to policyholders, which
occur in a given year, are reported to the insurance company in the subsequent years, sometimes many
years later. Often, the payout is delayed as well. Thus, reserves have to be made to cover these arising
obligations.
Data are usually represented by so called run-off triangles. These are n × m matrices, where element
Xi,j (i, j = 1, 2, . . .) represents the claim amount incurred in year i, and paid with a delay of j − 1 years,
or may also indicate aggregate values. In our research work we suppose that n = m, i.e., restrict data
to be squared in that sense, for simplicity reasons. Although, thereby generality will not be violated,
methods can be similarly used for arbitrary n and m positive integers.
Elements for indices i + j > n + 1 are unknown, have to be predicted. If Xi,j denotes an incremental
P
P
Xi,j . We focus on the so
Xi,j −
value in the triangle, then the outstanding claims are
1≤i,j,≤n
i+j≤n+1
called ultimate claim value (U C), which is the sum of observed payments (upper triangle) and outstanding
claims (lower triangle). Remark that throughout the entire article, Xi,j will denote incremental, and Ci,j
will denote aggregate claim values, i.e., Ci,j = Xi,1 + . . . + Xi,j .
The topic has a wide actuarial literature, describing development models and evaluation techniques.
See Taylor [2000]; Mack [1994]; Verrall [1994], for instance. After the quintessential works of these authors,
stochastic claims reserving methods are becoming more and more common. These methods not only
estimate the expected value of the outstanding claims, but also examine its stochastic behavior. An
extensive description of stochastic reserving can be found in the summarizing article England and Verrall
[2002] and book Wüthrich and Merz [2008].
Most of the stochastic reserving methods estimate the distribution of the amount of outstanding
claims, so they can be considered as probabilistic forecasts. There are numerous methods for the comparison of probabilistic forecasts, and we chose to concentrate on the probabilistic scores, see Gneiting and Raftery
[2007]. The cardinal aim of our present work is to demonstrate that using these comparisons is an adequate way of comparing stochastic reserving methods. Our hope is that in some cases this attitude
will be more informative than the classical mean square error of prediction measure or quantile values.
We also will pay attention to the sharpness of predictions. The goal of our work is the comparison of
appropriateness of several stochastic estimation methods, supposing different distributional development
models. We used several run-off triangles from the actuarial literature. It is important to note that the
results we present are not fully comprehensive, as it would far exceed the constraints of this article. For
example, we only used paid run-off triangles, thus we could not make comparisons with the MCMC model
2
described in Merz and Wüthrich [2010]. Due to lack of capacity, we did not calculate with the bootstrap
methods in Björkwall et al. [2009] and Pinheiro et al. [2003], and we left out the MCMC method from
Márkus and Gyarmati-Szabó [2007].
Of the scores, we used only the CRPS and the Energy score, since in most cases a Monte Carlo
type evaluation is needed, as the explicit description of the distribution of the ultimate claim value is
not possible or too difficult. For instance, this is the case if this value is the sum of random variables
derived from log-normal distribution. We applied our comparison method for an itemized (claims and
payouts) dataset from an insurance company, where both the upper and the lower triangles were known.
We created 2000 scenarios with random draw and replacement from the claims, and chose the most
appropriate stochastic reserving methods based on these scenarios.
2
Distributional Models
In the following section we briefly expound the distributional models used in our research.
2.1
Log-normal Model
The following calculation is based on [Wüthrich and Merz, 2008, p. 168.]. Briefly, in the Log-normal
model, triangular elements Ci,j (i, j ∈ Z+ , i + j ≤ n + 1) indicate cumulative claims. The so called
Fi,j =
σj2
Ci,j
Ci,j−1
development factors are log-normally distributed random variables, with µj log-scale and
shape parameters, and let Ci,0 be 1. Here we only remark that the parameter estimation is given by
the solution of Equation 2.1, 2.2, except σˆn := 0. For further details see Wüthrich and Merz [2008].
n−j+1
X
Ci,j
1
log
µˆj =
n − j + 1 i=1
Ci,j−1
σˆj2 =
2
n−j+1
X Ci,j
1
log
− µˆj
n − j i=1
Ci,j−1
(2.1)
j ∈ {1, . . . , n}
(2.2)
j ∈ {1, . . . , n − 1}
Obviously, the distributions of Ci,n values – in other words the ultimate payments for accident year
n
n
P
P
σk2 . Unfortunately, the
µk , and shape parameter
i – are also log-normal with log-scale parameter
distribution of U C =
variables.
2.2
n
P
k=1
k=1
Ci,n cannot be expressed explicitely, since this is the sum of n log-normal
i=1
Negative Binomial Model
The construction of negative binomial development model below is based on [Wüthrich and Merz, 2008,
p. 183.]. Rows
are assumed
tobe independent, and in each row, the distribution of Xij increment value
βj
is Poisson Θi,j−1 · βj−1 − 1 . Here, Θi,j−1 ∼ Γ(ci,j−1 , 1) random variable under the assumption that
{ci,j = Ci,j }, thus, the model is recursive. On the other hand, for j ∈ {1, . . . , n}, βj =
partial sums of γ = (γ1 , . . . , γn ) payout pattern. Let fj :=
βj
βj−1
j
P
γk are the
k=1
(j = 1, . . . , n−1), and suppose that fj > 1,
i.e., γk > 0 for all k, which means an increasing payout pattern.
It can
be shown that the distribution
1
of increment Xij – conditionally on Ci,j−1 – is NegBinom Ci,j−1 , fj−1 (j = 2, . . . , n). Remark that the
r
NegBinom(r, p) distribution is defined as P (ξ = k) = r+k−1
p (1 − p)k (k = 0, 1, . . .).
k
Parameter estimation is based on maximum likelihood estimator as follows. Let γ denote the unknown
parameters, X the upper triangle, moreover, suppose that the first column is fixed. The likelihood function
3
is
L(γ, X) =
n−1
Y
n−1
Y
P (rowi ) =
n−1
Y
P (Xi,2 = ki,2 , . . . , Xi,n−i+1 = ki,n−i+1 ) =
i=1
i=1
P (Xi,n−i+1 = ki,n−i+1 |Xi,n−i = ki,n−i , . . . , Xi,2 = ki,2 , Xi,1 = ki,1 ) · . . .
i=1
. . . · P (Xi,2 = ki,2 |Xi,1 = ki,1 ) =
Ci,n−i + ki,n−i+1 − 1 Ci,n−i
pn−i (1 − pn−i )ki,n−i+1 · . . .
ki,n−i+1
Ci,1 + ki,2 − 1 Ci,1
...·
p1 (1 − p1 )ki,2 =
ki,2
n−1
Y Ci,j + ki,j+1 − 1 C
Y n−j
pj i,j (1 − pj )ki,j+1 =
k
i,j+1
j=1 i=1
n−1
Y
i=1
const ·
n−1
Y
n−j
P
n−j
P
Ci,j
i=1
pj
(1 − pj ) i=1
ki,j+1
−→ max .
p
j=1
Let l(γ, X) be log L(γ, X), i.e., the loglikelihood:
l(γ, X) = const +
n−1
X
X n−j
Ci,j log pj +
n−1
X
X n−j
ki,j+1 log(1 − pj ).
j=1 i=1
j=1 i=1
n−j
P
∂
∂pj l
Thus we get the following system of equations:
=
n−j
P
ki,j+1 > 0. This gives the estimator
Suppose that ∀j
n−j
P
Ci,j
i=1
pj
−
ki,j+1
i=1
1−pj
= 0, j ∈ {1, . . . , n − 1}.
i=1
p̂j =
n−j
P
Ci,j
i=1
n−j
P
,
Ci,j+1
i=1
which relate to the chain-ladder development factors. Note that γ1 = p1 p2 . . . pn−1 , γi = (1−pi−1 )pi . . . pn−1
(i = 2, . . . , n − 1), γn = 1 − pn−1 .
At last, note that in England and Verrall [2002] another approach of the Negative Binomial Model
can be found. According to that paper, a dispersion parameter is included, and increments are so called
over-dispersed negative binomially distributed random variables with mean (fj − 1) · Ci,j−1 and variance
φfj (fj − 1) · Ci,j−1 , respectively.
2.3
Poisson Model
The Poisson model is available in [Wüthrich and Merz, 2008, p. 25.], for instance. Briefly, suppose that
the increments are independent Xij ∼ P oisson(µi γj ) variables, i, j = 1, . . . , n. Now suppose that the
upper triangle - as a condition, denoted by D - is given. On the one hand, the estimation of γ1 , . . . , γn
payout pattern values works exactly the same way as in the Negative Binomial case. On the other hand,
µ̂i =
n−i+1
P
Xik
k=1
n−i+1
P
.
γ̂k
k=1
Since one of our aims is to generate ultimate claim values, assuming that the real parameters are γ̂i and
n
P
Ci,n−i+1 ) and Y lower
µ̂i , one random ultimate claim variable is the sum of upper triangle values (or
i=1
n
n
P
P
Ci,n−i+1 + Y .
triangle, where Y ∼ Poisson
µ̂i (γ̂n−i+2 + . . . + γ̂n ) , i.e., U C =
i=1
i=2
4
2.4
Over-dispersed Poisson Model
A slightly more general model rather applied in practice is the Poissonmodel
with a dispersion parameter.
µi γj
φ
We have the following distributional assumptions. If Yij ∼ P oisson
random variable, let Xij :=
φYij . Thus E(Xij ) = µi γj and V ar(Xij ) = φµi γj , and Xij is called over-dispersed Poisson random
variable.
For µ and γ we used the regular chain-ladder method, which provides unbiased estimation. On the
other hand, evaluation of φ requires the determination of Pearson residuals, for instance. For a detailed de-
scription see [Wüthrich and Merz, 2008, p. 218.], England and Verrall [2006]. Briefly, in the over-dispersed
Poisson case, Pearson residuals are defined as
P
r̂ij
=
Thus
xij − µ̂i γ̂j
xij − m̂ij
p
= p
.
m̂ij
µ̂i γ̂j
φ̂P =
P
i+j≤n+1
P
r̂ij
N −p
where N denotes the number of observations, i.e., N =
2
,
n(n+1)
,
2
and p is the number of predicted unknown
parameters, i.e., p = 2n − 1. Remark that this method provides a biased estimator for φ (and also for µ
parameters).
The distribution of U C, similarly to the Poisson model is U C =
n
P
µ̂i (γ̂n−i+2 + . . . + γ̂n )
P oisson φ1
n
P
Ci,n−i+1 + Y , where Y ∼ φ ·
i=1
i=2
Remark that another parameterization of the model is as follows. Let α1 , . . . , αn ; β1 , . . . , βn ; c; φ be
parameters. Incremental
claims
are defined as
α +β +c
i
j
Xij ∼ φ·P oisson e φ
. Thus E(Xij ) = eαi +βj +c and V ar(Xij ) = φeαi +βj +c . Our aim is to estimate
α and β parameters using maximum likelihood method under the usual constraint that α1 = β1 = 0.
2.5
Gamma Model
On the one hand, the model described in [England and Verrall, 2002, 3.3] is the following. Increments are
assumed to be Gamma distributed random variables, i.e., Xij ∼ Γ(α, β), with expected value E(Xij ) =
mij and variance V ar(Xij ) = φm2ij . Thus parameters are α =
1
φ
and β =
1
φmij .
On the other hand, in [Wüthrich and Merz, 2008, 5.2.4] Xij increments are deterministic sums of
rij
P
(k)
(k)
Xij , where Xij ∼ Γ(ν, mνij ).
(rij ) independent Gamma random variables. In other words, Xij =
k=1
Since the rate parameters are equal, the distribution of Xij is Γ(νrij , mνij ). Here, E(Xij ) = rij mij and
V ar(Xij ) =
rij
2
ν mij .
If rij = 1 ∀i, j, the above mentioned two models are identical, and ν =
1
φ.
The only difference
is that rij ’s can be arbitrary integers, thus the second model is more flexible. Note that for handling
estimation methods, it is not satisfying to know the upper triangle, but the triangle of rij numbers too.
Model Assumptions 5.19 in Wüthrich and Merz [2008] state that there exist µ1 , . . . , µn and γ1 , . . . , γn
n
P
γi = 1, such that E(Xij ) = rij mij = µi γj . The estimates µ̂i and
parameters under the constraint that
i=1
γ̂j are averages of the observations weighted by numbers rij . For more details see Model Assumptions
5.19. in the book.
On the one hand, µ̂ and γ̂ parameters are the solution of the following system of equations:
µ̂i =
n+1−i
P
j=1
Xij
γ̂j
n+1−i
, γ̂j =
5
n+1−j
P
i=1
Xij
µ̂i
n+1−j
.
(Note that for technical reasons, in our simulations we used simple chain-ladder method instead of
solving the above mentioned system of equations.) On the other hand, the estimation of parameter ν
using Pearson residuals is the following. For i + j ≤ n + 1 let
P
r̂ij
=
thus
φ̂P =
3
3.1
xij − µ̂i γ̂j
,
µ̂i γ̂j
P
P
r̂ij
i+j≤n+1
n(n+1)
− (2n
2
2
− 1)
.
Stochastic Claims Reserving
Parametric models
We are using the parametric models introduced previously. We estimate the parameters from the upper
triangle, and approximate the conditional distribution of the lower triangle by determining the conditional
distribution with the estimated parameters. Naturally, this approach might have drawbacks, for example
the prediction intervals will not be precise - to see the problem, we could just think of how to make a
prediction interval for the (n + 1)th member of an i.i.d. sample of normal distribution from the first n
members. As the conditional distribution of the lower triangle is difficult, and in some cases, impossible
to calculate analitically, we estimate the distribution with Monte Carlo method, generating 5000 lower
triangles.
3.2
Bootstrap methods with over-dispersed Poisson and gamma distributions
The detailed description of these models can be found in England and Verrall [2002], Appendix 3. These
models are also included in the ChainLadder package for R. The heart of these methods is bootstrapping
q
adj
n
P
· ri,j
of the upper triangle. From the resampled
the adjusted Pearson residuals ri,j
= 1/2·n(n+1)−2n+1
residuals, we create a new upper triangle, fit the standard chain-ladder to it, obtain the new expected
values m
e i,j and variances φm
e i,j for the elements of the lower triangle, simulate the lower triangle, and
store the ultimate claim amount. We do the bootstrapping 5000 times, and use the 5000 stored ultimate
claim amounts as the predictive distribution.
3.3
Semi-stochastic methods
We use the models presented in Faluközy et al. [2007]. Consider αj (i) =
be independent discrete uniform random variables with P (αj = αj (i)) =
Ci,j+1
Ci,j , and let αj , j = 1, ..., n
1
n−j . The model assumption is
that for each element of the lower triangle, the Ci,j+1 = Ci,j · αj equation holds. It is easy to see that
n−j
1 P Ci,j+1
E (αj ) = n−j
Ci,j . The first method (denoted as Uniform) is to simulate a sufficiently large number
i=1
(in our case, 5000) of lower triangles, take the arousing 5000 ultimate claims, and use this empirical
distribution as the predicted distribution. The second method (denoted as Unifnorm) is based on the
assumption that the ultimate claim amount is nearly a normal random variable, and the task is to
calculate the mean and variance.
The mean is the expected value of the ultimate claim amount, which is
!
n−1
n
Q
P
E (αk ) .
Cn−j+1,j
j=1
k=j
!!
n−1
n
n−1
Q
Q 2
P
2
Cn−j+1,j
E αk −
.
The variance of the ultimate claim is
E αk
j=1
k=j
6
k=j
actuary
predictive distribution
ideal
LN (µ, 1)
long-term
LN (0, 2)
ordinary
1
2 LN (µ, 1)
intern
+ 21 LN (µ + δ, 1), where δ = ±1 with probability 12 , 12
4µ + 1 µ ≥ 0
LN (−|µ|, σ 2 ), where −|µ| + σ 2 = µ + 21 , i.e. σ 2 =
1
µ<0
Table 4.1: Distributions regarding certain actuaries in Example 1 (Log-normal). ξ ∼ LN (µ, 1), where
µ ∼ N (0, 1).
actuary
predictive distribution
ideal
Poisson(x · λ)
long-term
1
)
NegBin(1.5, 1+x·0.5
ordinary
1
2 Poisson(x
intern
NegBin(2x · λ, 32 )
· λ) + 21 Poisson(x · λ · δ), where δ = 1 ±
1
10
with probability 12 , 12
Table 4.2: Distributions regarding certain actuaries in Example 2 (Poisson). η ∼Poisson(x · λ), where
λ ∼ Γ(1.5, 0.5) and x = 1000.
4
Probabilistic Forecasts
4.1
Introductory Example
As we mentioned in the previous section, stochastic reserving is essentially the prediction of the distribution of the future payments. The systematic analysis of these distributions begun with the work of Dawid
[1984]. Most applications of probabilistic forecast theory is used in meteorology. Gneiting et al. [2007]
give an overwiev of the most important methods for comparing the forecasts. To present these methods,
we give 2 simple examples, the first of which is similar to the one in the aforementioned article.
In Example 1, suppose that an automobile insurance company already has paid 1 million euros for the
accidents occured in year 2012. Let ξ be the total amount paid for accidents of 2012 till the end of 2013.
Assume that ξ is of log-normal distribution with log-scale and shape parameters µ and 1, respectively,
where µ is supposed to be a standard normal random variable.
In the integer valued Example 2, the number of liability insurance claims for damages occured and
reported in 2012 was 1000. Let η be the number of claims for damages regarding 2012, that are reported
in 2013. Assume that the distribution of η is Poisson with parameter 1000 · λ, where lambda is a gamma
distributed random variable with shape parameter 1.5 and scale parameter 0.5.
We compare the performance of 4 imaginary actuaries predicting the distributions in the above examples. The ideal actuary knows all the relevant circumstances, which means the knowledge of the exact
value of µ in Example 1, and λ in Example 2, respectively. The long-term actuary does not intend to
get involved in the actual information, and simply estimates the unconditional distribution. The ordinary actuary attempts to estimate the parameters, but the estimation has some possible error. At last,
the intern actuary finds the expected value, but does not care about the distribution itself. We give an
overview of the distributions and the predicted distributions in Table 4.1 and Table 4.2. Year 2013 was
simulated 10, 000 times and the different probabilistic forecasts were compared according to them.
7
PIT histograms for Example 1 (sample size=10000)
ideal
long−term
ordinary
intern
2000
1500
1000
frequencies
500
0
2000
1500
1000
500
0
0.0
0.2
0.4
0.6
0.8
1.0
0.0
0.2
0.4
0.6
0.8
1.0
value
Figure 4.1: PIT histograms for Example 1 (Log-normal example)
4.2
PIT, Empirical Coverage, Average Width
In the evaluation of the predictions, we have (Fi , xi ) pairs, where Fi denotes the predicted distribution
in the ith case, and xi the realization, respectively. Dawid [1984] suggested using the Probability Integral
Transform (PIT), and the description available in Gneiting et al. [2007] is also worth seeing. The main
idea is that substituting a random variable with continuous probability function into its own distribution
function, will yield a uniformly distributed variable on the (0,1) interval. The distribution of PIT values
has to be uniform on interval (0, 1), otherwise the estimation is biased. Remark that the uniform rank
histogram property is a necessary but not sufficient criterion for ideal forecasts, as illustrated by an example in Hamill [2001]. For a fixed upper triangle - as a condition - we calculate the predicted distribution
function of the ultimate claim, and substitute the real ultimate claim into it.
We check the prediction by plotting the empirical CDF of the corresponding PIT values and comparing
it to the identity function, see P-P plots below, or by plotting the histogram of the PIT values and
checking for uniformity. On the one hand, U-shaped histograms indicate underdispersed, excessively
light-tailed predictive distributions. On the other hand, ∩-shaped histograms hint at overdispersion, too
heavy predictive distribution in tails, and skewed histograms occur when central tendencies are biased.
In high count data cases this method can be used very well. In low count data cases, randomized PIT,
or non-randomized uniform version of the PIT histogram is the more appropriate choice, see Czado et al.
[2009]. PIT histograms regarding the two example can be seen on Figure 4.1 and Figure 4.2, where
dashed lines represent uniform distributions. These figures provide additional examples for inaccurate
probabilistic forecasts with appropriate PIT histograms, since shapes regarding long-term and ordinary
predictions deviate not more considerable from uniform distribution, as in the ideal case. Nevertheless,
the intern case shows inappropriateness immediately.
We also intend to define P-P plots for random variables ξ and predictive distributions F̂ . Namely, let
sp := sup{z : F̂ (z) ≤ p}, thus the P-P plot function ([0, 1] → [0, 1]) is as follows: p 7→ P (ξ < sp ). Remark
z
that integrating the PIT function results the P-P plot, i.e., there is a bijection between the two concepts.
It also follows obviously that a slanted-S shaped P-P plot corresponds to a ∩-shaped PIT, for instance.
8
PIT histograms for Example 2 (sample size=10000)
ideal
long−term
ordinary
intern
600
500
400
300
200
frequencies
100
0
600
500
400
300
200
100
0
0.0
0.2
0.4
0.6
0.8
1.0
0.0
0.2
0.4
0.6
0.8
1.0
value
Figure 4.2: PIT histograms for Example 2 (Poisson example)
An example for run-off triangles can be seen on Figure 5.2. Using the notations of our simulations, for a
N
P
real number p ∈ (0, 1), the function value is N1
χ{ξj <Qu(η·j ,p)} , where Qu(η·j , p) is the p-quantile value
j=1
of empirical distribution defined by η1j , . . . , ηMj .
At last, two linked measures will be considered for each probabilistic forecast. The idea derives from
Baran et al. [2012], and an example calculation can be seen on Table 5.2. Coverage in % related to a central
quanprediction interval α · 100% is the proportion of observations between the lower and upper 1−α
2
N
P
tiles. In accordance with our terminology, it can be written as N1
χ{Qu(η·j ,(1−α)/2)<ξj <Qu(η·j ,(1+α)/2)} .
j=1
The average width for a central prediction interval is the difference of lower and upper
1−α
2
quantiles
expressed in payments, in expectation. It can be interpreted as the sharpness of the prediction. If the
performances of two stochastic claims reserving techniques are relatively close to each other, measured
in scores,
width is the better. Formally we calculated the following way:
the one with the narrower
N
P
1
η ∗(1+α)·M − η ∗(1−α)·M
. Even though these could be derived from the PIT, in some applications
N
j=1
2
,j
2
,j
it is important to know the exact values, thus a separate calculation is advised. See Table 4.3 and Table
4.4 regarding Example 1 and Example 2, respectively. It can be seen again, looking at the mentioned
tables that inappropriate distributions may provide good results. Namely, the coverage of the long-term
and ordinary actuary, and the average width of the intern actuary is acceptable as well. However, in our
examples only the ideal gives suitable values for both measures.
4.3
Continuous Ranked Probability Score and Energy Score
Probability scores are an increasingly widespread technique to measure the quality of the predicted
distributions. A score is a function of the predictive distribution and a realization of the real value. The
predictive distribution can be represented by its CDF, empirical CDF or PDF, depending on which score
is used. Forecast models are then ranked by comparing the average score of the predictive distributions
from each model.
The most highlighted measure of goodness of fit in this article is the so called Continuous Ranked
9
Coverage (%)
Average width
66%
90%
66%
90%
ideal
67.0
90.5
1.93
3.29
long-term
67.3
90.5
2.74
4.65
ordinary
67.2
90.4
5.01
11.64
intern
47.3
74.2
2.87
4.88
Actuary
Table 4.3: Coverage and Average width in Example 1 (Log-normal)
Actuary
Coverage (%)
Average width
66%
90%
66%
90%
ideal
66.2
89.5
48.91
83.16
long-term
65.5
89.5
1052.00
1868.00
ordinary
66.4
89.5
98.24
140.62
intern
76.2
95.2
59.89
101.84
Table 4.4: Coverage and Average width in Example 2 (Poisson)
Probability Score or CRPS, which is devoted to handle the case of prediction of distribution functions. The
main reason of giving preference to the CRPS over other several score concepts is the general applicability,
i.e., it can be applied to predictions (functions) regardless of absolute continuity or discrete behaviour.
A detailed description can be found in [Gneiting and Raftery, 2007, p. 366.], for instance. Briefly, let F
denote the predictive distribution function, and c one single observation. Here
CRP S(F, c) = −
Z∞
(F (y) − 1{y≥c} )2 dy.
−∞
(See Table 5.1 and Figure 5.3.) A generalization of CRPS is the so called Energy Score, see the
aforementioned Gneiting and Raftery [2007], for instance. The definition of one dimensional energy score
is
1
EF |X − X ′ |β − EF |X − c|β ,
2
where X, X ′ are i.i.d. variables from distribution F , and β ∈ (0, 2). It can be shown that for β = 1 we
ES(F, c) =
get the CRPS back. We note that CRPS can be evaluated directly, or in case of β 6= 1, it is viable to
approximate the energy score by sampling from the empirical distribution, which is typically a much less
rapid way, because of the necessary sample size to be drawn to reach a satisfying accuracy. (During our
simulations, the magnitude of difference in time was approximately 102 , although even so not significant.)
Table 4.5 presents the score results in expectation, regarding the two examples (Log-normal and
Poisson). For instance, consider the Log-normal example in case of the ideal actuary. Given {µ = m}
R∞
2
and {ξ = x}, the score value is in the form 0 Φ(log y − m) − 1{y≥x} dy. It turns out that the score
measure provided proper ranking, although, the difference between ideal and intern values regarding the
second example is not of distinction.
10
ideal
long-term
ordinary
intern
Example 1
-0.48
-1.86
-0.63
-1.83
Example 2
-14.49
-327.62
-32.62
-14.64
Table 4.5: CRPS results in Example 1 and Example 2
4.4
Mean square error of prediction (MSEP)
The mean square error of prediction (MSEP) of predictor (point estimation) X̂ for ultimate claim X,
conditionally on the σ-algebra F is defined as
2
msepX|F (X̂) = E
X − X̂ |F = V ar(X|F ) + (X̂ − E(X|F ))2 .
See Definition 3.1 in Wüthrich and Merz [2008], for instance. The unconditional MSEP is
2
msepX (X̂) = E
X − X̂
= E(V ar(X|F )) + E((X̂ − E(X|F ))2 ).
In the detailed examples the above defined errors can be calculated right away.
For instance, actuaries according to Example 2 have mean square error of predictions as follows.
msepη (ideal) = msepη (intern) = E(msepη|λ (xλ)) =
= E(V ar(η|λ)) + E((xλ − E(η|λ))2 ) = E(xλ) = x · α · β
α 1−
msepη (long-term) = msepη (E(η)) = V ar(η) =
1
1+x·β
2
1
1+x·β
= x · α · β · (1 + x · β)
1
msepη (ordinary) = E(msepη|λ ( · xλ · (1 + δ))) =
2
1
= E(V ar(η|λ)) + E(( · xλ · (1 + δ) − E(η|λ))2 ) =
2
2
x
x · (1 + α) · β
2
=x·α·β+
· E(λ ) = x · α · β 1 +
400
400
where α = 1.5, β = 0.5 denote the shape and scale parameters of the gamma distributions. Results in
conjunction with the two simple example are shown on Table 4.6. The tables show that in the long-term
case, the prediction is inaccurate even though the PIT histogram and the prediction interval coverages
suggest that the fit is good.
ideal
long-term
ordinary
intern
Example 1
34.5
47.2
37.2
34.5
Example 2
750.0
375750.0
3093.8
750.0
Table 4.6: Mean Square Error of Prediction in Example 1 and Example 2
When we work with run off triangles, the definiton is the following.
msepUC|Dj (UˆC j ) = V ar(U C|Dj ) + (UˆC j − E(U C|Dj ))2
j = 1, . . . , N,
for each real triangle. Let xj be the real ultimate claims (j = 1, . . . , N ) and yij the generated ultimate
claims (i = 1, . . . , M , j = 1, . . . , N ). On the other hand, let zij be an ultimate claim generated with the
11
real parameters and according to the real development distribution, conditionally on upper triangle Dj .
Thus, in our calculations
msepUC|Dj (UˆC j ) ≈
M
P
2
(zij − E(U C|Dj ))
i=1
M −1
P
M
y
i=1 ij
+
M −
M
P
zij
i=1
M
2
.
Remark that if we calculate the msep values supposing the knowledge of real parameters instead of
estimation, we get the pure V ar(U C|Dj ) back, since yij = zij .
On the other hand, Table 5.1 shows MSEP values regarding the run-off triangle example detailed in
Section 5.
5
Simulation and Results
5.1
Preliminary Remarks
Our calculations have been implemented in R, and beyond a self made program code, we used the
ChainLadder package. The documentation of the latter can be found on webpage
http://cran.r-project.org/web/packages/ChainLadder/ChainLadder.pdf. The former is available
on url http://onrequest with a short user manual enclosed. Remark that the detailed simulation results
for several parameter sets are also to find on this page. Moreover, the completion of tables and figures
have been made using packages xtable and ggplot2.
In this section, a Monte Carlo type method will be introduced, followed by simulated examples, consisting of corresponding goodness of fit values described in Section 4. Parameters of the example come
from the run-off triangle RAA, which is an accumulated claims triangle from the Automatic Facultative
business in General Liability, originally published in Historical Loss Development, Reinsurance Association of America (RAA), 1991, and was also used as an example in England and Verrall [2002] and Mack
[1994]. It is crucial to emphasize that this well known triangle plays no role in the present article apart
from providing parameters. In other words, during our research work we fitted several distributional models only to get parameter values from real data, but without studying goodness of fits, which is another
objective. Due to lack of space, only a special example is presented, but additional simulation results are
available on the aforementioned webpage. Just to mention some claims data from literature, these are related to ABC (a workers’ compensation portfolio of a large company, first used in Barnett and Zehnwirth
[2000], which has an in-depth analysis of the data array as well), GenIns (a general claims data triangle
from Taylor and Ashe [1983], and was also used in Mack [1993]) and M3IR5 (a simulated triangle, from
Zehnwirth [1994]).
5.2
Monte Carlo Type Method
This subsection is devoted to the detailed description of our Monte Carlo type method (MC method) for
comparison of various claims reserving methods in case of different distributional background of run-off
triangles. In other words, our goal is to establish a ranking among the different stochastic claims reserving
techniques, if the real development property of claims payments for accident years is in accordance with
certain models, as described in Section Distributional Models, for instance. This comparison now is not
based on mean square error of prediction or quantile values, but on scoring rules. Nevertheless, the
results and plots containing the former values are also included in the article, which allows us to compare
them with the unconventional score results in theory of insurance. Remark that the important role in
n
P
Ci,n payments for accident years
our approach is played by ultimate claim values, i.e., the aggregate
i=1
12
1, . . . , n. Although, if someone is more interested in focusing on solely the next year payments, i.e., in
n
P
Xn−j+2,j , the Monte Carlo type method can be interpreted easily without any special alteration. This
j=2
latter reserving value is suggested by Solvency II.
Suppose that the development distribution model of run-off triangle and corresponding parameters
are given. As a first step, we generate N run-off triangles independently, and besides, for each the ultimate
claim values U C1 , U C2 , . . . , U CN . Let D1 , D2 , . . . , DN denote these upper triangles, as conditions to be
used later.
In the second step, for every generated upper triangle, we calculate the predicted distribution of the
ultimate claims, using the methods described in Distributional Models. As we mentioned there, we determine these predicted distributions via Monte Carlo method. Specifically, in the parametric models
case, the parameters for each generated upper triangle have to be evaluated. They will certainly differ
from each other, and from the real parameters as well. Assuming these parameter values for each condition Di , or upper triangle, in other words, M ultimate claim values have to be generated, denoted by
UˆC i,1 , UˆC i,2 , . . . , UˆC i,M , as stochastic predictors. Thus we get the predictive distributions. Following
that, we prepare the forecasts using the 2 bootstrap, the uniform and the uniform normal methods.
At last, in the third step, for each pair U Ci and tuple (UˆC i,1 , UˆC i,2 , . . . , UˆC i,M ) scores, PITs, msep
and quantile values, and additionally the confidence intervals (coverage and average width) have to be
calculated according to Section Probabilistic Forecasts. Generally, a predictive distribution function Fi
derives from values UˆC i,1 , UˆC i,2 , . . . , UˆC i,M , and the corresponding c value is U Ci .
As an example, the results for regarding a Gamma Model example are shown on several figures below.
N is chosen to be 2000, and M to be 5000. Figure 5.1 contains the histograms of 2000 PIT values. On
the one hand, Figure 5.3 consists of boxplot representations of CRPS scores for each claims reserving
technique. On the other hand, the mean CRPS values can be found on Table 5.1. Similarly, Figure 5.4
represents the energy scores for β = 12 . Remember that the higher score values mean more appropriate
reserving methods. Remark that on boxplots, the ’reserving methods’ axis has to be interpreted as follows:
1 - Log-normal
2 - Negative Binomial
3 - Poisson
4 - Over-dispersed P.
5 - Gamma
6 - Uniform
7 - Unif. Normal
8 - Bootstrap Gamma
9 - Bootstrap Od. Pois.
10 - Ideal
We have not mentioned method ’Ideal’ so far, which differs from all other methods essentially. Namely, for
each distributional model, we assume that parameters are known, and use them in the above described
second step, instead of any estimation. The reason of the inclusion of these results is that theoretically
this means the best way of the prediction of claims reserves, in expectation. In other words, we used it
for reference purposes, recall the example of ideal actuary.
5.3
Results on a Gamma distributed example
We now present the results for a Gamma distributional model. These can be interpreted in a number of
different ways.
1. Which score or error number is the most consistent, and how do they correlate with each other?
Do the equipments applied in stochastic predictions choose the actual models better than regular
measures, such as mean square error of prediction, for instance?
2. Which non-parametric, distributional free methods predict the distributions properly? Do they
outperform prediction methods derived from parametric models?
3. How reliable and sharp are the prediction intervals?
13
Parameters are
(µ1 , . . . , µ10 ) = (21048, 17507, 23723, 29562, 25751, 18680, 15676, 22141, 19019, 18402),
(γ1 , . . . , γ10 ) = (0.112, 0.224, 0.209, 0.147, 0.119, 0.092, 0.037, 0.031, 0.016, 0.009),
and ν = 2.22. Based on the MSEP values on Table 5.1, the first thing that stand out is the poor fit of
the Log-normal model. The other 4 parameter estimating models are roughly the same. The bootstrap
methods are slightly worse, and the uniform and unifnorm methods are giving very poor results compared
to the others.
Res. Method
Log-normal
Negative Binomial
Poisson
Over-dispersed P.
CRPS (mean)
En. Sc. (mean)
MSEP (mean)
MSEP (median)
-17840
-77.47
6.257e+09
2.159e+09
-9878
-81.03
1.799e+08
9.856e+07
-10010
-84.23
1.799e+08
9.862e+07
-7547
-57.10
1.800e+08
9.911e+07
Gamma
-6911
-54.28
1.539e+08
9.109e+07
Uniform
-20980
-80.25
6.144e+09
4.716e+09
Unif. Normal
-50040
-145.90
6.139e+09
4.711e+09
-7856
-57.82
2.021e+08
9.880e+07
Bootstrap Gamma
Bootstrap Od. Pois.
-7865
-57.84
2.015e+08
1.005e+08
Ideal
-4074
-41.53
5.296e+07
5.295e+07
Table 5.1: Scores and Mean Square Errors in the Gamma Model example
If we take a look at the PIT histograms on Figure 5.1, we can see that the Negative Binomial
and Poisson distributional model produces very poor fits, as we might have expected. In other words,
these provide great examples for light-tailed predictions. The reason is that the model parameters imply
e.g. a Poisson variable with a very high expected value, which makes the standard deviation very low
compared to it. Hence the ultimate claim values will be in a very narrow range compared to the predicted
distributions, leading to PIT histograms, where all the occurrences are in a 10-20 percent wide probability
range of the predicted distributions. The other side of this attribute of the Poisson distribution is that
when we use a Poisson distribution as the predicted distribution, the difference between the smallest and
the largest value of the empirical predicted distribution will be very small, consequently the real ultimate
claim values will most of the time be either bigger or smaller than all 5000 values of the predicted
distribution. Thus, the corresponding PIT graphs will contain occurrences almost exclusively in the 5
percent and 95 percent probability levels. In essence, the PIT analysis strongly suggests against using the
Poisson distribution as the incremental claims of the IBNR claims in the current example, as even though
the MSEP values are acceptable, the low variance attribute leads to the Poisson distribution seemengly
being little upgrade over a simple point estimation of the expected value. Although, it is important to
remark that considering real data, occurrence of nearly Poisson distributed triangles for pay amounts is
not unprecedented. Looking at the ∩-shaped histograms deriving from the bootstrap methods, there is
an example for heavy-tailed predictors to be seen.
When examining the P-P plots, we expected to see the best results, i.e., a graph closest to the identity
function, when the real distribution is of the same type as the one in the MC method, and the calculations
proved just that, see Figure 5.2. Although, even if we get the Gamma model right, but do not know the
real parameters, the prediction is slightly overdispersed. We can conclude that in terms of the consistency
of quality of the P-P plots, the bootstrap methods provide quite acceptable results. The Uniform and
Uniform normal semi-stochastic methods, even though we expected them to perform almost as well as
14
Gamma Model
Log−normal
Negative Binomial
Poisson
Over−dispersed P.
Gamma
Uniform
Unif. Normal
Bootstrap Gamma
Bootstrap Od. Pois.
Ideal
800
600
400
200
frequencies
0
800
600
400
200
0
800
600
400
200
0
0.00.20.40.60.81.0 0.00.20.40.60.81.0 0.00.20.40.60.81.0 0.00.20.40.60.81.0
value
Figure 5.1: Histograms of PIT values in the Gamma distributed example
the bootstrap methods in the P-P plot test, are giving worse results. Regarding the Poisson distribution,
we can come to the same conclusions as we did in the PIT analysis: it provides too little variance to
effectively predict any other distribution, and predicting it with any other method, the variance of the
resulting distribution would be too big to be considered a good fit.
Regarding the stochastic methods in the example, the Gamma model is clearly the best in the score
metric, followed by Negative Binomial, and bootstrap methods. See Figure 5.3 and 5.4, which contain the
boxplot representations of CRPS and Energy Score values. Table 5.1 summarizes the mean score values.
On the other hand, the Uniform and Uniform normal methods give even worse results than if we were
using a wrong distribution in any of the 5 MC methods, so even though those 2 do not require us to
guess the underlying distribution, their results are worse then if we were making the wrong guess, and
then build a parameter approximating MC model based on that guess.
Table 5.2 presents the inaccuracy of the applied prediction intervals, moreover, the other run-off
triangles and models gave similar results in this sense. In cases where the coverage is said to be acceptable,
the sharpness is mostly inappropriate, i.e., intervals are wide. The coverage values for Log-normal model
are closer to the 66% and 90% values, then in the Gamma case, but average width results are higher.
5.4
Public payments data example
In case of the detailed dataset provided by an insurance company, we proceeded not exactly as in the
analysis of the known run-off triangles. Which means, each of the 2000 real run-off triangles were generated
the following way. We used sampling with replacement 43, 081 times from the set of 43, 081 accidents,
which determines a triangle. Moreover, it resulted in a 6 × 6 quadrangle also, and in the real ultimate
claim, respectively. Following this simulation, we did exactly as described before, i.e., determined the
predictive distributions for the 2000 triangles with several methods, and compared them to the real
ultimate claim values.
Figure 5.5 shows that in the light of PIT values, none of the methods based on distributional models
are recommended. Best histograms are resulted by the bootstrap estimation methods. Although uniform
15
Gamma Model
Log−normal
Negative Binomial
Poisson
Over−dispersed P.
Gamma
Uniform
Unif. Normal
Bootstrap Gamma
Bootstrap Od. Pois.
Ideal
1.0
0.8
0.6
0.4
0.2
0.0
quantiles
1.0
0.8
0.6
0.4
0.2
0.0
1.0
0.8
0.6
0.4
0.2
0.0
0.0 0.2 0.4 0.6 0.8 1.00.0 0.2 0.4 0.6 0.8 1.00.0 0.2 0.4 0.6 0.8 1.00.0 0.2 0.4 0.6 0.8 1.0
values
Figure 5.2: P-P Plots in the Gamma Model example
and uniform normal methods are not much worse, they underperform in the cases of extreme claim values.
It is also confirmed by Table 5.3, i.e., the proportion of prediction intervals of bootstrap methods are
close to the 66.67% and 90% values. Unfortunately, in exchange for the appropriate coverage percentages,
average width values are higher.
On the one hand, according to the CRPS values in Table 5.4, the uniform claims reserving method
has the highest score, which has not been performing well for other triangles. On the other hand, energy
scores are slightly better for the bootstrap methods. It has to be mentioned that the msep values are the
highest in the bootstrap cases, in other words, this measure implies a different ranking.
6
Conclusions
We conclude our article by trying to answer the 3 questions proposed in the previous section. The method
that gives the best results overall, and is the least sensitive to the underlying distribution is the bootstrap
gamma and bootstrap over-dispersed Poisson method. Had we chosen a different one for our paper, we
might have deduced that the 2 bootstrap methods are the same in strength. It also became clear that the
Uniform and Uniform normal methods are significantly worse than the bootstraps in every evaluation.
The 5 parameter approximating MC methods give good results when we guess the underlying distribution
correctly, and as the P-P plots and PIT values show, they can be much worse then the bootstraps when
applied to the wrong distribution. The PIT and P-P plots advised against the use of the Poisson model
for the RAA dataset. When we studied other datasets, we found that the Poisson model exhibits the
same limitations, however, the Log-normal model was just as good as the other distributions. In general,
scores provide a reasonable fit to the background distribution. Unfortunately, the methods used in this
paper – and also applied in insurance industry – result not reliable prediction intervals.
This negative result is not surprising, since a typical run-off triangle contains much less information
than the unknown parameters. To improve the predictions, Meyers [2007] proposed the usage of Bayesian
methods. In our opinion, other than these methods, it would also be worthwhile to try using individual
16
Gamma Model
−50000
methods
1 − Log−normal
score values
2 − Negative Binomial
3 − Poisson
4 − Over−dispersed P.
−100000
5 − Gamma
6 − Uniform
7 − Unif. Normal
8 − Bootstrap Gamma
9 − Bootstrap Od. Pois.
−150000
10 − Ideal
−200000
1
2
3
4
5
6
7
8
9
10
reserving methods
Figure 5.3: Boxplots of CRPS values in the Gamma Model example
contract and claim data in probabilistic forecasts.
In our work, we used the individual claim dataset of an insurance company as well, but not for
probabilistic forecasting, but to propose a technique for comparing stochastic methods. This technique is
not sufficiently mathematically established yet, but may be applied in practice. One can simulate many
scenarios from the past, and choose a model, which best fits the claims data of the insurance institution.
We hope our work helped to show that in stochastic claims reserving, both in theoretical and in
applied situations, it is worthwhile to test the quality of the different methods, and in multiple ways if
possible.
As it was mentioned before, our goal was not the construction of the best stochastic claims reserving
technique, but to propose an adequate methodology for comparisons in the future. We hope to take the
first step in this direction with our work.
17
Gamma Model, beta=0.5
−50
methods
1 − Log−normal
−100
score values
2 − Negative Binomial
3 − Poisson
4 − Over−dispersed P.
−150
5 − Gamma
6 − Uniform
7 − Unif. Normal
−200
8 − Bootstrap Gamma
9 − Bootstrap Od. Pois.
10 − Ideal
−250
−300
1
2
3
4
5
6
7
8
9
10
reserving methods
Figure 5.4: Boxplots of Energy Score values in the Gamma Model example
Reserving method
Log-normal
Coverage (%)
Average width
66.67% interval
90% interval
66.67% interval
90% interval
63.4
81.1
89035
245659
Negative Binomial
3.3
5.4
919
1562
Poisson
1.4
2.8
444
755
Over-dispersed P.
48.7
71.0
15497
26322
Gamma
50.4
73.6
15506
26533
Uniform
75.4
97.0
111512
422161
Unif. Normal
95.9
100.0
287986
489479
Bootstrap Gamma
86.6
98.4
39946
70131
Bootstrap Od. Pois.
86.6
98.5
39951
70112
Ideal
66.8
89.4
13967
23821
Table 5.2: Coverage and Average width in the Gamma Model example
18
Log−normal
Negative Binomial
Poisson
Over−dispersed P.
Gamma
Uniform
Unif. Normal
Bootstrap Gamma
Bootstrap Od. Pois.
1200
1000
800
600
400
200
0
frequencies
1200
1000
800
600
400
200
0
1200
1000
800
600
400
200
0
0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0
value
Figure 5.5: Histograms of PIT values in the Public Data case
Reserving method
Log-normal
Coverage (%)
Average width
66.67% interval
90% interval
66.67% interval
90% interval
19.2
31.9
1261338486
2170190302
0.0
0.0
573564
974878
Negative Binomial
Poisson
0.0
0.0
163897
278579
Over-dispersed P.
15.2
26.7
954272655
1621537701
Gamma
23.8
40.0
1395724439
2375100112
Uniform
56.2
72.5
3228640816
4738399784
Unif. Normal
38.8
59.9
1963981053
3337926040
Bootstrap Gamma
61.5
87.1
4490491606
8679547627
Bootstrap Od. Pois.
61.6
87.1
4492743160
8679552105
Table 5.3: Coverage and Average width in the Public Data case
Reserving Method
CRPS (mean)
En. Sc. (mean)
MSEP (mean)
MSEP (median)
-1.526e+09
-27780
5.420483e+18
3.699599e+18
Negative Binomial
-1.817e+09
-38850
5.348926e+18
3.671239e+18
Poisson
-1.817e+09
-38970
5.348924e+18
3.671239e+18
Over-dispersed P.
-1.579e+09
-28820
5.350275e+18
3.671087e+18
Gamma
-1.397e+09
-25880
4.803373e+18
3.239738e+18
Log-normal
Uniform
-1.227e+09
-23530
4.101087e+18
2.726758e+18
Unif. Normal
-1.231e+09
-23600
4.100103e+18
2.723995e+18
Bootstrap Gamma
-1.347e+09
-23470
7.204460e+19
5.300120e+18
Bootstrap Od. Pois.
-1.347e+09
-23470
4.082055e+19
5.308000e+18
Table 5.4: Scores and Mean Square Errors in the Public Data case
19
References
S. Baran, D. Nemoda, and A. Horányi. Probabilistic wind speed forecasting in Hungary. arXiv:1202.4442,
2012.
G. L. Barnett and B. Zehnwirth. Best estimates for reserves. In Proceedings of the Casualty Actuarial
Society LXXXVII, 2000.
S. Björkwall, O. Hössjer, and E. Ohlsson.
Non-parametric and parametric bootstrap tech-
niques for age-to-age development factor methods in stochastic claims reserving.
navian Actuarial Journal, 2009(4):306–331, 2009.
Scandi-
doi: 10.1080/03461230903239738.
URL
http://www.tandfonline.com/doi/abs/10.1080/03461230903239738.
C. Czado, T. Gneiting, and L. Held.
rics, 65(4):1254–1261, 2009.
Predictive model assessment for count data.
ISSN 1541-0420.
Biomet-
doi: 10.1111/j.1541-0420.2009.01191.x.
URL
http://dx.doi.org/10.1111/j.1541-0420.2009.01191.x.
A. P. Dawid. Present position and potential developments: Some personal views: Statistical theory: The
prequential approach. Journal of the Royal Statistical Society. Series A (General), 147(2):pp. 278–292,
1984. ISSN 00359238. URL http://www.jstor.org/stable/2981683.
P. D. England and R. J. Verrall. Stochastic claims reserving in general insurance. British Actuarial
Journal, 8(3):443–518, 2002.
P. D. England and R. J. Verrall. Predictive distributions of outstanding liabilities in general insurance.
Annals of Actuarial Science, 1(02):221–270, 8 2006. ISSN 1748-5002. doi: 10.1017/S1748499500000142.
URL http://journals.cambridge.org/article_S1748499500000142.
T. Faluközy, I. I. Vitéz, and M. Arató.
Stochastic models for claims reserving in insurance busi-
ness, chapter 13, pages 102–113. World Scientific, 2007. doi: 10.1142/9789812709691_0013. URL
http://www.worldscientific.com/doi/abs/10.1142/9789812709691_0013.
T. Gneiting and A. E. Raftery.
tion.
Strictly proper scoring rules, prediction, and estima-
Journal of the American Statistical Association, 102(477):359–378, Mar 2007.
URL
http://dx.doi.org/10.1198/016214506000001437.
T. Gneiting, F. Balabdaoui, and A. E. Raftery.
sharpness.
Probabilistic forecasts, calibration and
Journal of the Royal Statistical Society: Series B (Statistical Methodology),
69(2):243–268, 2007.
ISSN
1467-9868.
doi:
10.1111/j.1467-9868.2007.00587.x.
URL
http://dx.doi.org/10.1111/j.1467-9868.2007.00587.x.
T.
M.
Hamill.
casts.
Interpretation
Monthly
Weather
of
rank
Review,
histograms
129(3):550–560,
for
verifying
March
ensemble
2001.
foreURL
http://nldr.library.ucar.edu/repository/collections/AMS-PUBS-000-000-000-008.
T. Mack. Distribution-free calculation of the standard error of chain ladder reserve estimates. ASTIN
Bulletin – The Journal of the International Actuarial Association, 23(2):213–225, November 1993. URL
http://www.actuaries.org/LIBRARY/ASTIN/vol23no2/213.pdf.
T.
Mack.
Insurance:
Which
stochastic
Mathematics
and
model
is
Economics,
underlying
the
15(2-3):133–138,
chain
ladder
1994.
http://EconPapers.repec.org/RePEc:eee:insuma:v:15:y:1994:i:2-3:p:133-138.
20
method?
URL
L. Márkus and J. Gyarmati-Szabó. A Hierarchical Bayesian model to predict belatedly reported claims
in insurances, chapter 17, pages 137–144. World Scientific, 2007. doi: 10.1142/9789812709691_0017.
URL http://www.worldscientific.com/doi/abs/10.1142/9789812709691_0017.
M.
Merz
and
method.
M.
V.
Insurance:
Wüthrich.
Mathematics
Paid-incurred
and
Economics,
chain
46(3):568–579,
claims
reserving
2010.
URL
http://EconPapers.repec.org/RePEc:eee:insuma:v:46:y:2010:i:3:p:568-579.
G. Meyers. Thinking outside the triangle, paper presented to the 37th astin colloquium, florida, 2007.
P. J. R. Pinheiro, J. M. A. e Silva, and Maria de Lourdes Centeno.
Bootstrap methodology in
claim reserving. The Journal of Risk and Insurance, 70(4):701–714, 2003. ISSN 00224367. URL
http://www.jstor.org/stable/3519936.
G.
ing
C.
Taylor
claims.
and
F.
Journal
R.
Ashe.
of
Second
Econometrics,
moments
23(1):37–61,
of
estimates
September
of
outstand-
1983.
URL
http://ideas.repec.org/a/eee/econom/v23y1983i1p37-61.html.
G.C. Taylor. Loss Reserving: An Actuarial Perspective. Huebner International Series on Risk, Insurance and Economic Security Series. Springer-Verlag GmbH, 2000. ISBN 9780792385028. URL
http://books.google.hu/books?id=NmM-EH4h7kcC.
R. J. Verrall. A method for modelling varying run-off evolutions in claims reserving. ASTIN Bulletin
– The Journal of the International Actuarial Association, 24(2):325 – 332, November 1994. URL
http://www.casact.org/library/astin/vol24no2/325.pdf.
M. V. Wüthrich and M. Merz. Stochastic Claims Reserving Methods in Insurance. Wiley, 2008.
B. Zehnwirth. Probabilistic development factor models with applications to loss reserve variability, prediction intervals, and risk based capital. In Variability in Reserves Prize Program Papers, volume 2.
Casualty Actuarial Science Forum, 1994.
21
View publication stats