Math 100: Mathematics in The Modern World (MMW) Data Management

Download as pdf or txt
Download as pdf or txt
You are on page 1of 32

Math 100

Mathematics in the
Modern World (MMW)

Data Management

By:
Asst Prof Ronald F. Judan
ME-ECE/MaEd MAth
Objectives
a) Use a variety of statistical tools to process and
manage numerical data

b) Use the methods of linear regression and


correlations to predict the value of a variable given
certain conditions

d) Advocate the use of statistical data in making


important decisions
Data Management: Linear Regression
ESTIMATE the value of the dependent variable corresponding to a
given value of the independent variable.
Least Square Regression Line Equation
y = a + (b) (x)
where:
y - predicted score, dependent variable, outcome
a – the y-intercept
b – the slope of the line
x - independent variable (controlled-manipulated change)
explanatory

a= (y)(x2) – (x) (xy)


N (x2) – (x)2

b= N (XY) - (X) (Y)


N (X2) - (X)2
Example

a. Find:
•Regression equation
•Plot scatter diagram
•y if x = 0, 1, 3, 16

X 1 3 4 6 8 9 11 14
Y 1 2 4 4 5 7 8 9
Solution

X Y X2 XY
1 1 1 1
3 2 9 6
4 4 16 16
6 4 36 24
8 5 64 40
9 7 81 63
11 8 121 88
14 9 196 126
--- --- --- ---
56 40 524 364
Solution
a = (40)(524) – (56) (364)
8 (524) – (56)2
• = 0.545454…
• = 0.55

• b= 8 (364) - (56) (40)
• 8 (524) - (56)2
• = 0.636363…
• = 0.64
• Therefore:
• y = (0.55) + (0.64)(x)
Prediction
if x = 0 then y = (0.55) + (0.64)(0)
• = (0.55) + 0
• = 0.55
if x = 1 then y = (0.55) + (0.64)(1)
• = (0.55) + (0.64)
• = 1.19
if x = 3 then y = (0.55) + (0.64)(3)
• = (0.55) + (1.92)
• = 2.47
• if x = 16 then y = (0.55) + (0.64)(16)
• = (0.55) + (10.24)
• = 10.79
Example
X 1 3 4 6
Y 3 7 9 13

b. Find:
•Regression equation
•Plot scatter diagram
•y if x = 0, 1, 3, 16
Solution

X Y X2 XY
1 3 1 3
3 7 9 21
4 9 16 36
6 13 36 78
--- --- --- ---
14 32 62 138
Solution

• a = (32)(62) – (14) (138)


4 (62) – (14)2
• =1

• b = 4 (138) - (14) (32)
• 4 (62) - (14)2
• =2

• Therefore y = (1) + (2)(x)

Prediction
if x = 0 then y = (1) + (2)(0)
• = (1) + 0
• = 1
• if x = 1 then y = (1) + (2)(1)
• = (1) + (2)
• = 3
• if x = 3 then y = (1) + (2)(3)
• = (1) + (6)
• = 7
• if x = 16 then y = (1) + (2)(16)
• = (1) + (32)
• = 33
Correlation
• Degree that describe the relationship between two sets of
variables.
• Statistical tool used to measure the association of two or
more quantitative variables whether a relationship exists, If
so how close or how significant the relationship might be.
• The strength of a correlation is measured by the correlation
coefficient r.
• Another name for r is the Pearson product moment
correlation coefficient in honor of Karl Pearson who
developed it about 1900.
• A value of zero for r does not necessarily mean that there is
no correlation; there could be a nonlinear correlation.
Examples

• Is tardiness a factor of academic performance?


• Is age a factor in driving accidents?
• Is there a relationship between grades in Algebra and
grades in Calculus?
• Is the amount of money spent on food a factor of
obesity in children?
• Will you die younger if you smoke?
• Is height a factor for self esteem?
Scatter Plots

Maximum positive Strong positive Zero correlation


correlation (r = 1.0) correlation (r = 0.80) (r = 0)

Maximum negative Moderate negative Strong correlation


correlation (r = -1.0) correlation (r = -0.43) with outlier (r = 0.71)
Correlation
• According to Sevilla (1994), r may be interpreted as
follows:
✓ 0.00 to  0.19 No correlation exist.
✓ 0.20 to  0.39 Slight correlation exist.
✓ 0.40 to  0.59 Substantial correlation exist.
✓ 0.60 to  0.79 Significant correlation exist.
✓ 0.80 to  0.99 Very significant correlation exist.
✓ 1.00 Perfect correlation exist.
Pearson Product Moment
• The Pearson product moment correlation coefficient is one
of the most widely used correlation.
• It is a parametric test that assumes the sample data come
from a normally distributed population.
• It is applied when the level of measurement of our data is
interval or ratio.
Pearson Product Moment
• The correlation coefficient, r is computed using the
formula below:

n( xy) − ( x)( y)


r=
 2 2

n( x ) − ( x) n( y ) − ( y) 2 2

Where n is the number of data pairs and x & y are individual data.
• Please pay particular attention to xy and the product
(x)(y).
• It will be best to tabulate your calculation.
Pearson Product Moment
Ex. 1. Below are grades in Algebra and Calculus. Is there a
significant correlation between grades earned in Algebra
and grades earned in Calculus by the same set of students?
Algebra 85 79 83 88 76 81 87 77 90 95
Calculus 80 75 78 83 78 81 79 75 85 88

Agebra, x Calculus, y x2 y2 xy
85 80 7225 6400 6800
79 75 6241 5625 5925
83 78 6889 6084 6474
88 83 7744 6889 7304
76 78 5776 6084 5928
81 81 6561 6561 6561
87 79 7569 6241 6873
77 75 5929 5625 5775
90 85 8100 7225 7650
95 88 9025 7744 8360
841 802 71059 64478 67650
Pearson Product Moment
• From our table xy 67650
x 841
y 802
x2 71059
• Solving for r we have: y2 64478

10(67650) – (841)(802)
r=
√10(71059) – (841)2] [10(64478) – (802)2

• r = 0.8837
• Hence, there is a very significant correlation between grades in
Algebra and Calculus in the set of data that we have.
Height of 12 father and corresponding height of eldest son. Find
the correlation coefficient and interpret the result.
Note: We know that the variables Height of the Father and height
of the eldest son are related to the height of the father as a
factor that strongly influences the height of eldest sons.
Generally the taller the father the taller the son.
Given Data:
Father Height of Height of XY X2 Y2
Father in Eldest son in
(Inches) X (Inches) Y
1 65 68 4420 4225 4624
2 63 66 4158 3969 4356
3 67 68 4556 4489 4624
4 64 65 4160 4096 4225
5 68 69 4692 4624 4761
6 62 66 4092 3844 4356
7 70 68 4760 4900 4624
8 66 65 4290 4356 4225
9 68 71 4828 4624 5041
10 67 67 4489 4489 4489
11 69 68 4692 4761 4624
12 71 70 4970 5041 4900
800 811 54107 53418 54849
484
484
474,472
474,472

= 0.7027 = 0.7

Interpretation: There exists a significant correlation


between height of the father and the height of the eldest
son.
Example 3: Consider the intelligence test scores and mathematics
grades of freshmen in a certain University. The data given below:
Find: a) Pearson’s r
b) Regression Line Equation (Y =a+bx)
c) Interpret Results

Student Intelligence Test Math Grade X*Y X2 Y2


Scores
1 65 85
2 50 74
3 55 76
4 65 90
5 55 85
6 70 87
7 65 94
8 70 98
9 55 81
10 70 91
11 50 76
12 55 74
Summation
12(61685) − (725)(1011)
r=
12(44475) − (725)  12(85905) − (1011) 
2 2

Solution
X = 725 y = 1011 xy= 61685 x2 = 44475 y2 = 85905

= 0.86
Interpretation:
A very significant relationship between Intelligence Test
Scores and Math performance of freshmen. The significant
value of r = 0.86 indicates a marked relationship between
the two variables.
Activity
1. A study was conducted to determine the relationship existing
between the grade in English and the grade in Mathematics. A
random sample of 10 CTE students in UC were taken and the
following are the results of the sampling.
a) Formulate H0 and Ha
b) compute for the Pearson( r )
c) Find equation of regression line
d) Interpret and conclude results
Note:
Solve on a bond paper take a picture and file upload in canvas

Student 1 2 3 4 5 6 7 8 9 10
English 75 83 80 77 89 78 92 86 93 84
MATHEMATICS 78 87 78 76 92 81 89 89 91 84
•End
END

You might also like