IWB Chapter 10 - Inter-Relationships Between Variables
IWB Chapter 10 - Inter-Relationships Between Variables
IWB Chapter 10 - Inter-Relationships Between Variables
Outcome
The underpinning detail for this chapter in your Integrated Workbook can
be found in Chapter 10 of your Study Text
193
Chapter 10
Overview
Nature of Strength of
Big data
relationship relationship
Introduction
INTER-RELATIONSHIPS
BETWEEN VARIABLES
Correlation Regression
Pearson’s correlation
coefficient
Coefficient of
determination
Rank correlation
194
Inter-relationships between variables
Introduction
1.1 Definition
This chapter examines the strength and nature of the relationship between two sets
of figures.
195
Chapter 10
Big data
Big data allows businesses to expand their knowledge of their customers and
develop products/ services that are best suited to their needs.
Big data can help boost marketing activities, since it provides businesses with a
chance to analyse customer behaviour on multiple channels and understand
when the customer is most likely to buy products/ services.
196
Inter-relationships between variables
Correlation
This measure has the property of always lying in the range –1 to +1, where:
The coefficient of determination, r2, gives the proportion of changes in y that can
be explained by changes in x, assuming a linear relationship between x and y.
E.g. If r = +0.7, then r2 = 0.49 and we could state that 49% of the observed
changes in y can be explained by the changes in x but that 51% of the changes
must be due to other factors.
197
Chapter 10
Question 1
By calculating the correlation coefficient, you are demonstrating the
maths skill of following the order of precedence of operators, including
indices.
Evaluate Pearson’s correlation coefficient for the data on sales and advertising
spend in the table below.
r = 61,220/√[545 × 7,418,000]
r = 61,220/63,583 = 0.96
198
Inter-relationships between variables
199
Chapter 10
Question 2
Pearson's correlation coefficient
Evaluate Pearson’s correlation coefficient for the data below.
n = 6, ∑x = 21, ∑y = 482, ∑xy = 2,213, ∑x2 = 91, ∑y2 = 60,514
n ∑ xy – ∑ x ∑ y
r=
2 2
(n ∑ x2– ( ∑ x) )(n ∑ y2– ( ∑ y) )
r = (6 × 2,213 – 21 × 482)/√[(6 × 91 – 212) (6 × 60,514 – 4822)]
r = 3,156/√[105 × 130,760]
r = 3,156/3,705 = 0.85
Question 3
By calculating the coefficient of determination and explaining the result,
you are developing the skill of analysis.
Coefficient of determination
If the Pearson’s correlation coefficient is 0.96, calculate the coefficient of
determination and explain what the result means.
r2 = 0.962 = 0.92
92% of the variation in dependent y values can be explained by changes in the
independent x values
200
Inter-relationships between variables
Looks at the link between the ranking of a set of objects and one of their
properties
201
Chapter 10
Question 4
By calculating the correlation coefficient, you are demonstrating the
maths skill of following the order of precedence of operators, including
indices.
Calculate Spearman’s rank correlation coefficient for the following data, ranking
the best scores as 1 through to the worst as 5.
chemistry score maths score
Bob 43% 55%
Jim 61% 57%
Ash 57% 88%
Zak 76% 59%
Jen 85% 96%
2
6∑d
R=1–
n(n2 -1)
chemistry maths Chemistry Maths
score score rank rank d d2
Bob 43% 57% 5 5 0 0
Jim 61% 59% 3 4 –1 1
Ash 57% 88% 4 2 2 4
Zak 76% 71% 2 3 –1 1
Jen 85% 96% 1 1 0 0
––
∑d2 6
R = 1 – (6 × 6)/[5 × (25 – 1)]
R = 1 – 36/120 = 0.7
202
Inter-relationships between variables
Question 5
Spearman's rank correlation
R = 1 – 36/210 = 0.83
203
Chapter 10
Regression
4.2 Calculation
n ∑ xy – ∑ x ∑ y
b= 2
(n ∑ x2– ( ∑ x) )
a = y – bx
The actual relationship between the two sets of figures may not be linear – use
the coefficient of determination to investigate further.
We need to be careful using the regression line to make forecasts outside of the
range of the original data ("extrapolation").
204
Inter-relationships between variables
Question 6
Least-squares regression
Calculate the line of best fit formula using least-squares regression on the
following data.
Advertising spend ($000) Sales value ($000)
10 1,500
12 2,000
15 2,200
18 2,700
19 2,700
21 2,720
n=6
x y xy x2
10 1,500 15,000 100
12 2,000 24,000 144
15 2,200 33,000 225
18 2,700 48,600 324
19 2,700 51,300 361
21 2,720 57,120 441
––– ––––– –––––– ––––
95 13,820 229,020 1,595
∑ ∑ ∑
b=
∑ ∑
b = (6 × 229,020 – 95 × 13,820)/(6 × 1,595 – 952)
b = 61,220/545
b = 112.33
a = y – bx
a = 525.25
y = 525.25 + 112.33x
205
Chapter 10
Question 7
By calculating the line of best fit formula, you are demonstrating the
maths skill of following the order of precedence of operators, including
indices.
Least-squares regression
Evaluate the line of best fit formula from the following information:
∑ ∑ ∑
b=
∑ ∑
b = 3,156/105 = 30.06
a = y – bx
a = –24.88
y = –24.88 + 30.06x
Question 8
Regression equation
If the regression equation linking costs (y, in $) and productivity levels (x, in
units produced) is y = 55,000 + 25x, forecast the cost when 6,000 units are
produced.
206
Inter-relationships between variables
Question 9
By calculating the value of b, you are demonstrating the maths skill of
following the order of precedence of operators, including indices.
Regression equation
4,100 = 2,580 + b × 95
1,520 = b × 95
b = 1,520/95 = 16
207
Chapter 10
You should now be able to answers all the questions from chapter 10 of the
Study Text and questions 192 – 203 from the Exam Practice Kit.
208
Inter-relationships between variables
Answers
Question 1
n=6
x y xy x2 y2
10 1,500 15,000 100 2,250,000
12 2,000 24,000 144 4,000,000
15 2,200 33,000 225 4,840,000
18 2,700 48,600 324 7,290,000
19 2,700 51,300 361 7,290,000
21 2,720 57,120 441 7,398,400
––– –––––– –––––– ––––– –––––––––
95 13,820 229,020 1,595 33,068,400
n ∑ xy – ∑x∑y
r=
2 2
(n ∑ x2 – ( ∑ x) )(n ∑ y2 – ( ∑ y) )
r = (6 × 229,020 – 95 × 13,820)/√[(6 × 1,595 – 952) (6 × 33,068,400 – 13,8202)]
r = 61,220/√[545 × 7,418,000]
r = 61,220/63,583 = 0.96
209
Chapter 10
Question 2
n ∑ xy – ∑ x ∑ y
r=
2 2
(n ∑ x2– ( ∑ x) )(n ∑ y2– ( ∑ y) )
r = (6 × 2,213 – 21 × 482)/√[(6 × 91 – 212) (6 × 60,514 – 4822)]
r = 3,156/√[105 × 130,760]
r = 3,156/3,705 = 0.85
Question 3
r2 = 0.962 = 0.92
92% of the variation in dependent y values can be explained by changes in the
independent x values
210
Inter-relationships between variables
Question 4
2
6∑d
R=1–
n(n2 -1)
chemistry maths Chemistry Maths
score score rank rank d d2
Bob 43% 57% 5 5 0 0
Jim 61% 59% 3 4 –1 1
Ash 57% 88% 4 2 2 4
Zak 76% 71% 2 3 –1 1
Jen 85% 96% 1 1 0 0
––
∑d2 6
R = 1 – (6 × 6)/[5 × (25 – 1)]
R = 1 – 36/120 = 0.7
211
Chapter 10
Question 5
6∑d2
R=1–
n(n2 -1)
R = 1 – 36/210 = 0.83
212
Inter-relationships between variables
Question 6
n=6
x y xy x2
10 1,500 15,000 100
12 2,000 24,000 144
15 2,200 33,000 225
18 2,700 48,600 324
19 2,700 51,300 361
21 2,720 57,120 441
––– ––––– –––––– ––––
95 13,820 229,020 1,595
∑ ∑ ∑
b=
∑ ∑
b = (6 × 229,020 – 95 × 13,820)/(6 × 1,595 – 952)
b = 61,220/545
b = 112.33
a = y – bx
a = 525.25
y = 525.25 + 112.33x
213
Chapter 10
Question 7
∑ ∑ ∑
b=
∑ ∑
b = 3,156/105 = 30.06
a = y – bx
a = –24.88
y = –24.88 + 30.06x
Question 8
y = 55,000 + 25 × 6,000 = $205,000
Question 9
4,100 = 2,580 + b × 95
1,520 = b × 95
b = 1,520/95 = 16
214