Box-Jenkins (Part 1)
Box-Jenkins (Part 1)
Box-Jenkins (Part 1)
MODELS
S T Q S 3 11 3 S TAT I S T I C A L M O D E L L I N G
2
• Three basic ARIMA models for a stationary time series :
– Autoregressive model of order ,
3
• In an ARIMA model, the random disturbance term is typically known as a “white
noise”.
• It is identically and independently distributed with a mean of 0 and a common variance
across all observations.
• We write
4
A FOUR-STEP ITERATIVE PROCEDURE
Forecasting
Diagnostics
Checking
Parameter
Estimation
Tentative Identification
• Stationarity Checking and
Differencing
• Model Identification
5
STEP ONE: TENTATIVE IDENFICATION
A) STATIONARITY CHECKING
6
• The white noise series satisfies the stationarity condition because
7
EXAMPLES –STATIONARY TS
8
EXAMPLES –STATIONARY TS
9
NON-STATIONARITY
• How to identify?
– Time plot
– The ACF of stationary data drops to zero relatively quickly
– The ACF of non-stationary data decreases slowly.
– Unit root test shows positive for structural breaks. Structural breaks refers to
sudden and unexpected change in the mean of the data (drop down, go up).
10
EXAMPLES – NON-STATIONARY TS
11
EXAMPLES – NON-STATIONARY TS
12
DIFFERENCING
• The number of times that the original series must be differenced in order to achieve
stationarity is called the order of integration, denoted by .
• In practice, it is almost never necessary to go beyond second difference, because real
data generally involve only first or second level non-stationarity.
13
14
15
DIFFERENCING
• Backward shift operator, .
• operating on , has the effect of shifting the data back one period.
• Two applications of B on shifts the data back two periods:
• The backward shift operator is convenient for describing the process of differencing.
• Therefore,
16
UNIT-ROOT TEST
• To determine more objectively whether differencing is required, we can use unit root
test.
• One of the test is Kwiatkowski-Phillips-Schmidt-Shin (KPSS) test.
Time series are stationary.
Time series are non-stationary.
• Small p-value suggests differencing is required.
17
EXAMPLE – GLOBAL TEMPERATURE DEVIATIONS
Time series are stationary.
Time series are non-stationary.
18
STATIONARITY SUMMARY
19
STEP ONE: TENTATIVE IDENFICATION
B) MODEL IDENTIFICATION
• When the time series are stationary, one may proceed to tentative identification of
models through visual inspection of both the sample autocorrelation (SAC) and partial
sample autocorrelation (PSAC) functions.
20
SAMPLE AUTOCORRELATION FUNCTION (SAC)
• For the stationary series , the SAC at lag is
• The statistics is
21
THE BEHAVIOUR OF SAC
1. The SAC can cut off. A spike at lag exists in the SAC if is statistically large. If
then is considered to be statistically large. The SAC cuts off after lag if there are no
spikes at lags greater than in the SAC.
22
2. The SAC is said to die down if this function does not cut off but rather decreases in a
‘steady’ fashion. The SAC can die down in
i. A damped exponential fashion
ii. A damped sine-wave fashion
iii. Or a combination of both i. and ii.
• The SAC can die down fairly quickly or extremely slowly.
• The time series values should be considered stationary if the SAC of the time series
values either cuts off fairly quickly or dies down fairly quickly.
• However, if the SAC of the time series values dies down extremely slowly, and at
lag 1 is close to 1, then the time series values should be considered non-stationary.
23
SAMPLE PARTIAL AUTOCORRELATION
FUNCTION (SPAC)
where
for .
24
• when ,
• when
and
25
• may intuitively be thought of as the sample autocorrelation of time series observations
separated by a lag k time units with the effects of the intervening observations
eliminated.
• This function plays an important role in data analysis aimed at identifying the extent of
the lag in an autoregressive model.
• The standard error of is
• The statistic is
26
• The behavior of SPAC similar to its of the SAC. The only difference is that is
considered to be statistically large if
for any .
27
SUMMARY ON THE BEHAVIOURS OF SAC AND
SPAC
28
29
30
31
COMMON DEPENDENCE STRUCTURE
Model SAC (ACF) SPAC (PACF)
Dies down in damped
MA(1) Cuts off after lag 1
exponential decay
Dies down
MA(2) Cuts off after lag 2 (damped exponential
decay & sine wave)
Dies down in damped
AR(1) Cuts off after lag 1
exponential decay
Dies down
AR(2) (damped exponential Cuts off after lag 2
decay & sine wave)
Dies down in damped Dies down in damped
ARMA(1,1)
exponential decay exponential decay
32
EXAMPLE - LUTEINIZING HORMONE IN BLOOD
SAMPLES
33
• Cuts at lag 1 • Cuts at lag 1
34
AR(1)
• Should we combine
both and ? ARMA(1,1)
• or ARIMA(1,0,
1)
MA(1)
35