Act 5.6. 1-5

Download as docx, pdf, or txt
Download as docx, pdf, or txt
You are on page 1of 3

Name: MIKI A.

RULOMA _ Course & Year: BS IN CIVIL ENGINEERING - 1A

ACTIVITY 5.6.1: Let’s Diagnose Your Knowledge


The following items talk about the data management that diagnosed your understanding related
to linear regression and correlation. Read carefully each item and determine whether each
statement is true or false.
FALSE 1. The dependent variable is the variable that is being described, predicted, or
controlled.
TRUE 2. A simple linear regression model is an equation that describes the
straight-line relationship between a dependent variable and an
independent variable.
TRUE 3. The residual is the difference between the observed value of the dependent
variable and the predicted value of the dependent variable.
FALSE 4. When using simple regression analysis, if there is a strong correlation
between the independent and dependent variable, then we can conclude
that an increase in the value of the independent variable causes an
increase in the value of the dependent variable.
TRUE 5. In a simple linear regression model, the correlation coefficient not only
indicates the strength of the relationship between independent and
dependent variable, but also shows whether the relationship is positive
or negative.
TRUE 6. If r = -1, then we can conclude that there is a perfect relationship between X
and Y.
TRUE 7. The slope of the simple linear regression equation represents the average
change in the value of the dependent variable per unit change in the
independent variable (X).
FALSE 8. The least squares simple linear regression line minimizes the sum of
the vertical deviations between the line and the data points.
FALSE 9. The notation ˆY refers to the average value of the dependent variable Y.
FALSE 10. A significant positive correlation between X and Y implies that changes in X
cause Y to change.
FALSE 11. The estimated simple linear regression equation minimizes the sum of the
squared deviations between each value of Y and the line.

ACTIVITY 5.6.2: linear regression and correlation data INQUIRY


The following items talk about the data management that diagnosed your understanding related
to linear regression and correlation. Read each statement, and answer the questions.

1. What two things should be done before performing the regression analysis and correlation?
 Construct a scatter plot then quantitatively describe the strength and direction of
the linear relationship using “r”.
2. What is meant by the line of best fit?
 Best fit means that the sum of the squares of the vertical distances from each
point to the line is at a minimum.
3. What is the relationship between the sign of the correlation coefficient and the sign of the
slope of the regression line?
 Both quantify the direction and strength of the relationship between two numeric
variables. When the correlation (r) is negative, the regression slope (b) will be
negative. When the correlation is positive, the regression slope will be positive.
4. How is the value of the correlation coefficient related to the accuracy of the predicted value
for a specific value of x?
 It measures the strength and direction of a linear relationship between two
variables.
5. When the value of r is not significant, what value should be used to predict y?
 If r is not significant OR if the scatter plot does not show a linear trend, the line
should not be used for prediction.

ACTIVITY 5.6.5: We solve!


In a study on speed and braking distance, researchers looked for a method to estimate how fast
a person was traveling before an accident by measuring the length of the skid marks. An area
that was focused on in the study was the distance required to completely stop a vehicle at
various speeds. Use the following data to answer the questions.

MPH Braking Distance (feet)


20 20
30 45
40 81
50 133
60 205
80 411

Assume MPH is going to be used to predict stopping distance.


1. Find the linear regression equation.

MPH (x) Braking Distance (y) xy x2 y2


20 20 400 400 400
30 45 1350 900 2025
40 81 3240 1600 6561
50 133 6650 2500 17689
60 205 12300 3600 42025
80 411 32880 6400 168921
∑ x=280 ∑ y=985 ∑ xy=56820 ∑ x 2=15400 ∑ y 2=237621
( ∑ y ) ( ∑ x 2 )−(∑ x)(∑ xy ) ( 895 ) (15400 )−(280)(56820) 13783000−15909600 −2126600
a= = = = =−151.9
2
n ( ∑ x )−(∑ x ) 2
( 6 ) (15400 )−(280)2 92400−78400 14000

n (∑ xy )−(∑ x ) ( ∑ y ) ( 6 ) ( 56820 )−(280)(895) 340920−250600 90320


b= = = = =6.45
2
n ( ∑ x ) −(∑ x ) 2
( 6 ) ( 15400 )−(280)2 92400−78400 14000

The Linear regression line: y ' =a+ bx=−151.9+6.45 x

2. What does the slope tell you about MPH and the braking distance? How about the y
intercept?
 The slope tells that for each additional mile per hour that the car is traveling, the
stopping distance will increase to an average of 6.45 feet. The y intercept is the
braking distance we would expect for a car traveling 0 mph,
3. Find the braking distance when MPH=45.
y ' =a+ bx=−151.9+6.45 ( 45 )=139.25
The braking distance for a car traveling 45 mph is a 139.25 feet.
4. Find the braking distance when MPH=100.
y ' =a+ bx=−151.9+6.45 ( 100 )=493.1
The braking distance for a car traveling 100 mph is a 493.1 feet.
5. Comment on predicting beyond the given data values.
 It is now not appropriate to make predictions of braking distance for speeds
outside of the given data values due to the fact we know nothing about the
relationship between the two variables outside of the vary of the data.

You might also like