Correlation Lecture Notes
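\[
r = \frac{n\sum XY - \sum X \sum Y}{\sqrt{\left[n\sum X^2 - \left(\sum X\right)^2\right]\left[n\sum Y^2 - \left(\sum Y\right)^2\right]}} = +0.87
\]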
Interpretation: There is a high positive correlation between sales and advertising expense. High sales are associated with high advertising expense.
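The raw-score computation of r can be sketched in Python as follows. The function name `pearson_r` and the two short lists are hypothetical illustrations; they are not the sales and advertising data behind the +0.87 result, which these notes do not reproduce.

```python
import math

def pearson_r(x, y):
    """Pearson r computed from raw-score sums (hypothetical helper)."""
    n = len(x)
    sum_x, sum_y = sum(x), sum(y)
    sum_xy = sum(a * b for a, b in zip(x, y))
    sum_x2 = sum(a * a for a in x)
    sum_y2 = sum(b * b for b in y)
    numerator = n * sum_xy - sum_x * sum_y
    denominator = math.sqrt(
        (n * sum_x2 - sum_x ** 2) * (n * sum_y2 - sum_y ** 2)
    )
    return numerator / denominator

# Made-up figures for illustration only.
advertising = [1, 2, 3, 4, 5]
sales = [2, 4, 5, 4, 5]
r = pearson_r(advertising, sales)
print(round(r, 2))
```

A value near +1 or -1 indicates a strong linear relationship; a value near 0 indicates a weak one.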
Spearman Rho
There is a moderate negative correlation between Philbert's and Karyll's rankings of the tourist destinations. Philbert's high ranks are associated with Karyll's low ranks.
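A minimal sketch of a rank correlation of this kind, assuming the usual no-ties formula rho = 1 - 6 * sum(d^2) / (n(n^2 - 1)), where d is the difference between the two ranks given to the same item. The rankings below are invented for illustration and are not the rankings the interpretation above is based on.

```python
def spearman_rho(rank_a, rank_b):
    """Spearman rho for two rankings of the same items (assumes no tied ranks)."""
    n = len(rank_a)
    sum_d2 = sum((a - b) ** 2 for a, b in zip(rank_a, rank_b))
    return 1 - (6 * sum_d2) / (n * (n ** 2 - 1))

# Hypothetical rankings of five destinations by two raters.
philbert = [1, 2, 3, 4, 5]
karyll = [3, 4, 5, 1, 2]
rho = spearman_rho(philbert, karyll)
print(rho)
```

With tied ranks this shortcut is no longer exact, and Pearson r computed on the ranks is the safer choice.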
Kendall’s Coefficient of Concordance
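\[
W = \frac{12\sum D^2}{m^2\, n\,(n^2 - 1)}
\]
m = number of sets of ranks (judges); n = number of objects being rated; D = sum of ranks of an object minus the average sum of ranks.

\[
\text{total sum of ranks} = \frac{m\,n(n+1)}{2} = \frac{5(10)(11)}{2} = 275
\]
\[
\text{average sum of ranks} = \frac{\text{total sum of ranks}}{n} = \frac{275}{10} = 27.5
\]

                        JUDGES
Project   VIOLA  RIZZA  MARIAN  XYLI  ZIAN   Sum of Ranks   D^2
A           2      1      2      3     4         12         (12 - 27.5)^2 = 240.25
B           1      3      1      2     2          9         342.25
C           3      4      4      1     3         15         156.25
D           5      5      5      5     1         21          42.25
E           4      2      6      7     6         25           6.25
F           7      8      3      4     7         29           2.25
G           6      6      8      6     5         31          12.25
H           8      7      7      8     9         39         132.25
I           9     10     10      9     8         46         342.25
J          10      9      9     10    10         48         420.25
Total                                            275        1696.50

\[
W = \frac{12\sum D^2}{m^2\, n\,(n^2 - 1)} = \frac{12(1696.50)}{5^2 (10)(10^2 - 1)} = \frac{20358}{24750} = 0.82
\]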
There is a high correlation (a high degree of agreement) among the rankings of the five judges.
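The computation of W for the five judges can be checked with a short script. This is a sketch that applies W = 12 * sum(D^2) / (m^2 * n * (n^2 - 1)) to the ranks in the table; the function name `kendalls_w` is a hypothetical helper, and no correction for tied ranks is applied.

```python
# Ranks from the table: one row per project, one entry per judge
# (Viola, Rizza, Marian, Xyli, Zian).
ranks = {
    "A": [2, 1, 2, 3, 4],
    "B": [1, 3, 1, 2, 2],
    "C": [3, 4, 4, 1, 3],
    "D": [5, 5, 5, 5, 1],
    "E": [4, 2, 6, 7, 6],
    "F": [7, 8, 3, 4, 7],
    "G": [6, 6, 8, 6, 5],
    "H": [8, 7, 7, 8, 9],
    "I": [9, 10, 10, 9, 8],
    "J": [10, 9, 9, 10, 10],
}

def kendalls_w(ranks_by_object):
    """Return (W, sum of D^2, average sum of ranks); no tie correction."""
    rank_sums = [sum(row) for row in ranks_by_object.values()]
    n = len(rank_sums)                               # objects being rated
    m = len(next(iter(ranks_by_object.values())))    # sets of ranks (judges)
    mean_rank_sum = sum(rank_sums) / n
    sum_d2 = sum((s - mean_rank_sum) ** 2 for s in rank_sums)
    w = 12 * sum_d2 / (m ** 2 * n * (n ** 2 - 1))
    return w, sum_d2, mean_rank_sum

w, sum_d2, mean_rank_sum = kendalls_w(ranks)
print(round(w, 2), sum_d2, mean_rank_sum)  # 0.82 1696.5 27.5
```

W ranges from 0 (no agreement among the judges) to 1 (complete agreement).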
Point Biserial Coefficient
-determines the relationship between a dichotomous (nominal) variable and a continuous (interval or ratio) variable.
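\[
r_{pb} = \frac{\sum f_w \sum f_p Y - \sum f_p \sum f_w Y}{\sqrt{\sum f_p \sum f_w \left[\sum f_t \sum f_t Y^2 - \left(\sum f_t Y\right)^2\right]}}
\]
where f_t = f_p + f_w is the total frequency at each score Y.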
The fp column holds the counts of positive responses in the nominal variable, the fw column holds the counts of negative responses in the nominal variable, and the Y column holds the responses (scores) in the continuous variable.
Sample. Determine correlation between the results of the interview (pass or fail) and the entrance exams of the applicants.
\[
r_{pb} = 0.64
\]
There is a moderate positive correlation between the applicants' entrance exam scores and their interview results. High scores in the entrance exam are associated with passing results in the interview.
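A sketch of the point-biserial computation from a frequency table, under the assumption that each score Y has a count of passers (fp) and a count of non-passers (fw). The function name and the frequency table are hypothetical; they are not the applicants' data behind the 0.64 result, which these notes do not reproduce.

```python
import math

def point_biserial(scores, f_pos, f_neg):
    """Point-biserial r from score values and pass/fail frequency columns."""
    n_pos, n_neg = sum(f_pos), sum(f_neg)
    n = n_pos + n_neg
    sum_fp_y = sum(f * y for f, y in zip(f_pos, scores))
    sum_fw_y = sum(f * y for f, y in zip(f_neg, scores))
    f_tot = [p + w for p, w in zip(f_pos, f_neg)]   # total frequency per score
    sum_y = sum(f * y for f, y in zip(f_tot, scores))
    sum_y2 = sum(f * y * y for f, y in zip(f_tot, scores))
    numerator = n_neg * sum_fp_y - n_pos * sum_fw_y
    denominator = math.sqrt(n_pos * n_neg * (n * sum_y2 - sum_y ** 2))
    return numerator / denominator

# Made-up frequency table: entrance exam score, passed interview, failed interview.
scores = [90, 85, 80, 75, 70]
f_pass = [3, 4, 2, 1, 0]
f_fail = [0, 1, 2, 3, 4]
r_pb = point_biserial(scores, f_pass, f_fail)
print(round(r_pb, 2))
```

The result is algebraically the same as Pearson r computed with the dichotomous variable coded 1 (pass) and 0 (fail).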
Partial correlation
-determines the relationship between two variables while holding constant (eliminating) the influence of a third variable.
The general formula (correlation between variables 1 and 2 with the effects of variable 3 partialed out):
\[
r_{12.3} = \frac{r_{12} - r_{13}\, r_{23}}{\sqrt{\left(1 - r_{13}^2\right)\left(1 - r_{23}^2\right)}}
\]
Correlation between variables 2 and 3 with the effects of variable 1 partialed out
\[
r_{23.1} = \frac{r_{23} - r_{12}\, r_{13}}{\sqrt{\left(1 - r_{12}^2\right)\left(1 - r_{13}^2\right)}}
\]
Correlation between variables 1 and 3 with the effects of variable 2 partialed out
\[
r_{13.2} = \frac{r_{13} - r_{12}\, r_{23}}{\sqrt{\left(1 - r_{12}^2\right)\left(1 - r_{23}^2\right)}}
\]
Sample. Suppose
What is the correlation between weight (2) and scores (3) with the effects of age (1) removed?
\[
r_{23.1} = \frac{r_{23} - r_{12}\, r_{13}}{\sqrt{\left(1 - r_{12}^2\right)\left(1 - r_{13}^2\right)}} = 0.04
\]
There is a positive yet negligible correlation between weight and scores with the influence of age removed (partialed out).
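A first-order partial correlation can be sketched as one small function that covers all three cases by changing the argument order. The function name `partial_r` is a hypothetical helper, and the three coefficients below are assumed values for illustration: the given correlations for this sample are not shown in these notes, and these were picked only because they give a result that rounds to the 0.04 quoted above.

```python
import math

def partial_r(r_ab, r_ac, r_bc):
    """Correlation between a and b with the effect of c partialed out."""
    return (r_ab - r_ac * r_bc) / math.sqrt((1 - r_ac ** 2) * (1 - r_bc ** 2))

# Assumed pairwise correlations (1 = age, 2 = weight, 3 = scores).
r12, r13, r23 = 0.80, 0.60, 0.50

r23_1 = partial_r(r23, r12, r13)  # weight vs scores, age removed
r12_3 = partial_r(r12, r13, r23)  # age vs weight, scores removed
r13_2 = partial_r(r13, r12, r23)  # age vs scores, weight removed
print(round(r23_1, 2))
```

The first argument is always the correlation of interest; the other two are the correlations of each variable with the one being held constant.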
Multiple Correlation
-determines the correlation between one variable and the combined effects of two or more other variables.
Formula for the correlation between variable 1 and the combined effects of variables 2 and 3
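\[
R_{1.23} = \sqrt{\frac{r_{12}^2 + r_{13}^2 - 2\, r_{12}\, r_{13}\, r_{23}}{1 - r_{23}^2}}
\]
\[
R_{3.12} = \sqrt{\frac{r_{13}^2 + r_{23}^2 - 2\, r_{12}\, r_{13}\, r_{23}}{1 - r_{12}^2}}
\]
\[
R_{2.13} = \sqrt{\frac{r_{12}^2 + r_{23}^2 - 2\, r_{12}\, r_{13}\, r_{23}}{1 - r_{13}^2}}
\]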
Suppose: 1 = age; 2 = weight; 3 = scores in the math test and
What is the correlation between age (1) and the combined effect of weight (2) and scores (3)?
\[
R_{1.23} = \sqrt{\frac{r_{12}^2 + r_{13}^2 - 2\, r_{12}\, r_{13}\, r_{23}}{1 - r_{23}^2}} = 0.83
\]
There exists a high correlation between age and the combined effects of weight and scores.
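The multiple correlation of one variable with two others can be sketched the same way. The function name `multiple_r` is a hypothetical helper, and the three coefficients are assumed values for illustration: the given correlations for this sample are not shown in these notes, and these were picked only because they give a result that rounds to the 0.83 quoted above.

```python
import math

def multiple_r(r_ab, r_ac, r_bc):
    """Multiple correlation of a with the combined effects of b and c."""
    return math.sqrt(
        (r_ab ** 2 + r_ac ** 2 - 2 * r_ab * r_ac * r_bc) / (1 - r_bc ** 2)
    )

# Assumed pairwise correlations (1 = age, 2 = weight, 3 = scores).
r12, r13, r23 = 0.80, 0.60, 0.50

R1_23 = multiple_r(r12, r13, r23)  # age vs weight and scores combined
R3_12 = multiple_r(r13, r23, r12)  # scores vs age and weight combined
R2_13 = multiple_r(r12, r23, r13)  # weight vs age and scores combined
print(round(R1_23, 2))
```

The first two arguments are the correlations of the target variable with each predictor; the third is the correlation between the two predictors.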