1.1 Basic Time Series Decomposition PDF
1.1 Basic Time Series Decomposition PDF
1.1 Basic Time Series Decomposition PDF
, … , 𝑛 ~ 𝑁 𝜇, 𝜎
~ 𝑁 𝜇, 𝜎 /
𝑆 −
~ 𝜒𝑛−
𝜎
Properties of the Estimators
1. Unbiasedness
2. Consistency
Method of Moments Estimation
MOM Approach: Equate the distribution moments to
the observed moments
𝑛
𝑝 𝑝
Ε = 𝑖
𝑖=
Example: ,…, 𝑛 ~ 𝑁 𝜇, 𝜎
𝑛 𝑛
𝑝= ∶ Ε =𝜇 = 𝑖= 𝑖⇨ 𝜇 = 𝑖= 𝑖
𝑛 𝑛
𝑛
𝑛 𝑋𝑖 −𝑋
𝑝= ∶ Ε =𝜎 −𝜇 = 𝑖= 𝑖 ⇨𝜎 =
𝑖=
𝑛 𝑛
Maximum Likelihood Estimation
MLE Approach: Maximize the likelihood function of 𝜃 given the data.
𝑓 ,..., 𝑛; 𝜃 =𝑓 ;𝜃 ...𝑓 𝑛; 𝜃
Likelihood function
𝜃∶ ,..., 𝑛 =𝑓 ,..., 𝑛; 𝜃
Pr 𝜃 ∈ 𝜃 − , 𝜃 + = −𝛼
Formal Definition
A stochastic process is a collection of random variables 𝑡, 𝑡 ∈𝑇 ,
defined on a probability space Ω, 𝐹, 𝑃 .
Note: The term “time series” is also used to refer to the realization
of such a process (observed time series).
Example: Time Series
• Monthly sales of Australian red wine
• Monthly accidental deaths in the U.S.
• Daily Average Temperature from La Harpe station in Hancock
County, Illinois
• Daily stock price of IBM stock
• US monthly interest rates
• US yearly GDP
• 1-minute intraday S&P500 return
Time Series: Characteristics
• Trend: long-term increase or decrease in the data over time
• Seasonality: influenced by seasonal factors (e.g. quarter of the year,
month, or day of the week)
• Periodicity: exact repetition in regular pattern (seasonal series often
called periodic, although they do not exactly repeat themselves)
• Cyclical trend: data exhibit rises and falls that are not of a fixed
period
• Heteroskedasticity: varying variance with time
• Dependence: positive (successive observations are similar) or
negative (successive observations are dissimilar)
Example: GDP
Example: Daily IBM Stock Price
Example: S&P500 Intraday
Is Time Series Analysis Necessary?
Time Series ⇒ Dependence
• Data redundancy: number of degrees of freedom is smaller
than T (T is the number of observations)
• Data sampling: 𝑡 , 𝑡 = , . . , 𝑇 concentrated about a small part
of the probability space
Ignoring dependence leads to
• Inefficient estimates of regression parameters
• Poor predictions
• Standard errors unrealistically small (too narrow CI ⇒
improper inferences)
Time Series: Objectives
Description
• Plot the data and obtain simple descriptive measures of the
main properties of the series.
Explanation
• Find a model to describe the time dependence in data.
Forecasting
Given a finite sample from the series (observations), forecast
the next value or the next several values.
Control/Tuning
• After forecasting, adjust various control/tune parameters.
Time Series Analysis: Approaches
Time domain approach
Estimation Methods
1. Moving Average
2. Parametric Regression (Linear, Quadratic, etc.)
3. Non-Parametric Regression
Trend: Moving Average
Estimate the trend for with a width of the moving window d:
If the width is 𝑑 = 𝑞, use
𝑥𝑡− 𝑥𝑡+
𝑡 = + 𝑥𝑡− + + 𝑥𝑡− + + . . . +𝑥𝑡+ − + .
𝑑
If the width is 𝑑 = 𝑞 + , use
𝑡 = 𝑥𝑡+
𝑑
=−
The width selection reflects the bias-variance trade-off:
• If width large, then the trend is smooth (i.e. low variability)
• If width small, then the trend is not smooth (i.e. low bias)
Trend: Parametric Regression
Estimate the trend 𝑡 assuming a polynomial in t:
𝑡 =𝛽 +𝛽 +𝛽 + ⋯+ 𝛽
• Commonly use small order polynomial (p=1 or 2)
• Estimation approach: Fit a linear regression model where
the predicting variables are ( , ,…, )
• Which terms to keep? Use model selection to select
among the predicting variables. Cautious! Strong
correlation among the predicting variables.
Trend: Non- Parametric Regression
Estimate the trend 𝑡 with t in { , , … , 𝑛 }:
1. Kernel Regression
𝑡 = = 𝑛= 𝑡𝑖 where a weight function
depending on a kernel function.
2. Local Polynomial Regression
• An extension of the kernel regression and the polynomial
regression: fit a local polynomial within a width of a data point
3. Other Approaches
• Splines regression
• Wavelets
• Orthogonal basis function decomposition
Trend: Non- Parametric Regression
Which one to choose?
→ Local polynomial regression is preferred over kernel
regression since it overcomes boundary problems and its
performance is not dependent on the design of the time points
→ Other methods are to be selected depending on the level of
smoothness of the function to be estimated
→ For estimating the trend in time series, local polynomial or
splines regression will perform well in most cases
Data Example:
Temperature in Atlanta, Georgia
Data: Average monthly temperature records starting in 1879 until 2016.