Faculty of Science, Technology & Environment School of Computing, Information & Mathematical Sciences


ST130: Basic Statistics

Faculty of Science, Technology & Environment


School of Computing, Information & Mathematical Sciences

Final Examination
Semester II, 2018

Mode: Face to Face/ Online

Duration of Exam: 3 hours + 10 minutes

Reading Time: 10 minutes

Writing Time: 3 hours

Total marks: 100

INSTRUCTIONS:

1. This exam has FIVE (5) questions of 20 marks each and all are compulsory.
2. Write your answers in the answer booklet provided.
3. Start each question on a new page.
4. Show all necessary working. Partial marks will be awarded for partially correct answers.
5. There are SEVEN (7) pages in this exam paper (including this cover page).
6. This exam is worth 50% of the overall mark. The minimum exam mark is 40/100.
7. You may use a NON-PROGRAMMABLE calculator.
8. Eton Statistical Table and Formula Sheet are provided.
ST130 FINAL EXAMINATION SEMESTER II, 2018
Question 1 Start on a new page [10 + 10 = 20 marks]

(A) An environmental researcher is interested in the mean maximum temperature for a particular area in
the Pacific for June 2017. The maximum daily temperatures (°C) for the area in June 2017 are listed
below:

Date  Temp (°C)   Date  Temp (°C)   Date  Temp (°C)   Date  Temp (°C)   Date  Temp (°C)
1st   30.1        7th   28.7        13th  30.0        19th  30.1        25th  30.5
2nd   30.5        8th   29.0        14th  29.8        20th  30.5        26th  30.2
3rd   29.4        9th   29.8        15th  28.5        21st  30.4        27th  30.1
4th   29.3        10th  30.0        16th  28.6        22nd  30.5        28th  30.0
5th   29.0        11th  31.0        17th  28.9        23rd  31.0        29th  29.2
6th   28.5        12th  31.2        18th  29.0        24th  31.2        30th  29.8

(i) Should we consider this data set as a population or a sample? Explain your answer.

(ii) Data can be classified by its level of measurement as either nominal, ordinal, interval or ratio.
Which level of measurement can be used to categorize the temperature data above? Explain
your answer.

(iii) The environmental researcher would like to study a sample of size 10 from the data above, so
he uses simple random sampling, with random numbers (Ran#) generated by the fx82
calculator. He takes the last two digits of each random number to determine his sample. Which
temperatures will be chosen as his sample if the random numbers displayed, in order of
appearance from left to right, are as listed below?

0.251 0.418 0.075 0.717 0.678 0.815 0.629 0.347 0.707 0.185
0.255 0.413 0.801 0.617 0.985 0.091 0.905 0.91 0.1 0.423

(2 + 2 + 6 = 10 marks)

(B) The maximum daily temperatures (°C) in June 2017 for the area described in part (A) above
are listed below:

30.1 30.5 29.4 29.3 29 28.5 28.7 29 29.8 30 31 31.2 30 29.8 28.5
28.6 28.9 29 30.1 30.5 30.4 30.5 31 31.2 30.5 30.2 30.1 30 29.2 29.8

(i) Construct a frequency distribution table using the classes 28.5-29.0, 29.1-29.6, … , 30.9-31.4.
(ii) Draw the histogram for this distribution and state one piece of information about the data
that you can infer from the graph.
(4 + 6 = 10 marks)
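
As a hedged illustration of the tally required in part (B)(i), the sketch below counts the temperatures in each of the stated classes. The function name `frequency_table` and the choice to work in tenths of a degree (so that limits such as 29.0 are compared exactly) are assumptions made for this example, not part of the exam paper.

```python
# Maximum daily temperatures (°C) from part (B).
temps = [30.1, 30.5, 29.4, 29.3, 29, 28.5, 28.7, 29, 29.8, 30, 31, 31.2, 30, 29.8, 28.5,
         28.6, 28.9, 29, 30.1, 30.5, 30.4, 30.5, 31, 31.2, 30.5, 30.2, 30.1, 30, 29.2, 29.8]


def frequency_table(values, start_tenths=285, width_tenths=6, n_classes=5):
    """Count values per class (hypothetical helper).

    Class limits are handled as integers in tenths of a degree, so the
    first class 28.5-29.0 is 285..290, the next 291..296, and so on.
    """
    counts = [0] * n_classes
    for v in values:
        index = (round(v * 10) - start_tenths) // width_tenths
        counts[index] += 1
    table = []
    for i, count in enumerate(counts):
        lower = start_tenths + i * width_tenths
        upper = lower + width_tenths - 1
        table.append((lower / 10, upper / 10, count))
    return table


table = frequency_table(temps)
for lower, upper, count in table:
    print(f"{lower:.1f}-{upper:.1f}: {count}")
```

The class frequencies add up to the 30 days of June, which is a quick check on the tally before the histogram is drawn.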

Question 2 Start on a new page [10 + 10 = 20 marks]

(A) The following data give the number of text messages sent on 10 randomly selected days in 2013
by a student in USP:
32 37 41 42 44 44 47 47 47 60

(i) Calculate the percentile rank of 42 and interpret the result.


(ii) Are there any outliers in the data?
(3 + 7 = 10 marks)
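
A minimal sketch of the two calculations in part (A), using the percentile rank and IQR formulae from the formula sheet. The helper names are hypothetical, and the quartile convention (medians of the lower and upper halves of the sorted data) and the 1.5 × IQR outlier rule are assumptions; other textbooks define quartiles slightly differently.

```python
from statistics import median

messages = [32, 37, 41, 42, 44, 44, 47, 47, 47, 60]


def percentile_rank(values, x):
    """Percentile rank = (number of data values below x + 0.5) / n * 100."""
    below = sum(1 for v in values if v < x)
    return (below + 0.5) * 100 / len(values)


def iqr_outliers(values):
    """Return Q1, Q3 and the values outside Q1 - 1.5*IQR .. Q3 + 1.5*IQR.

    Assumption: Q1 and Q3 are the medians of the lower and upper halves.
    """
    data = sorted(values)
    half = len(data) // 2
    q1 = median(data[:half])
    q3 = median(data[-half:])
    iqr = q3 - q1
    low, high = q1 - 1.5 * iqr, q3 + 1.5 * iqr
    return q1, q3, [v for v in data if v < low or v > high]


rank = percentile_rank(messages, 42)
q1, q3, outliers = iqr_outliers(messages)
print(rank, q1, q3, outliers)
```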

(B) Consider an experiment of tossing a coin (C) and rolling a die (D) together at one time.
(i) Define sample space and give an example from the experiment above.
(ii) Are the events C and D dependent? Explain your answer.
(iii) Find the probability of getting a Head (H) on the coin and a SIX (6) on a die.
(2 + 3 + 5 = 10 marks)

Question 3 Start on a new page [10 + 6 + 4 = 20 marks]

(A) Answer the following:


(i) When an event is certain to occur, what is its probability?
(ii) When a meteorologist says that there is a 30% chance of showers, what type of probability is
the person using?
(iii) In how many ways can 3 cards be selected from a standard deck of 52 cards?

(iv) Suppose in a gambling card game, a casino will pay you $10 if you select an ace. If you fail to
select an ace, you are required to pay the casino $0.50. How much money do you expect to
win, if you play the game 20 times?
(1 + 1 + 2 + 6 = 10 marks)
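
The counting and expected-value questions in parts (iii) and (iv) can be sketched as follows. This assumes that selection in (iii) is without regard to order, and that each game in (iv) consists of drawing one card from a full standard deck containing four aces; the exam text does not state either point explicitly.

```python
from fractions import Fraction
from math import comb

# (iii) Unordered selections of 3 cards from 52.
ways = comb(52, 3)

# (iv) Expected winnings, using mu = sum of X * P(X).
p_ace = Fraction(4, 52)                      # assumed: one draw from a full deck
expected_per_game = 10 * p_ace - Fraction(1, 2) * (1 - p_ace)
expected_20_games = 20 * expected_per_game

print(ways)
print(float(expected_per_game), float(expected_20_games))
```

Exact fractions are used so that the expected value is not affected by rounding before it is multiplied by the number of games.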

(B) The mean time taken to design a house plan by 50 architects was found to be 23 hours. Assume
the population standard deviation to be 3.75 hours.
(i) What is the point estimate of μ?
(ii) Construct a 99% confidence interval for the population mean μ.
(iii) What is the margin of error of the estimate for μ in part (ii)?
(1 + 4 + 1 = 6 marks)
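
A hedged sketch of the interval in part (B), following formula 8 on the formula sheet. The function name is hypothetical, and the z value is computed with Python's `statistics.NormalDist` rather than read from the statistical table, so it is about 2.576 instead of the rounded table value.

```python
from math import sqrt
from statistics import NormalDist


def z_confidence_interval(mean, sigma, n, confidence):
    """Confidence interval for a mean when the population sigma is known."""
    alpha = 1 - confidence
    z = NormalDist().inv_cdf(1 - alpha / 2)   # z_{alpha/2}
    margin = z * sigma / sqrt(n)              # margin of error
    return mean - margin, mean + margin, margin


lower, upper, margin = z_confidence_interval(mean=23, sigma=3.75, n=50, confidence=0.99)
print(f"{lower:.3f} < mu < {upper:.3f}, margin of error = {margin:.3f}")
```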
(C) Briefly explain the meaning of each of the following terms.
(i) Two types of error.
(ii) Significance level.
(2 + 2 = 4 marks)

Question 4 Start on a new page [8 + 2 + 10 = 20 marks]

(A) A food company is planning to market a new type of frozen yogurt. However, before marketing this
yogurt, the company wants to find what percentage of the people like it. The company’s management
has decided that it will market this yogurt only if at least 35% of the people like it. The company’s
research department selected a random sample of 400 persons and asked them to taste this yogurt.
Of these 400 persons, 112 said they liked it. Testing at the 2.5% significance level, can you conclude
that the company should market this yogurt? To draw your conclusion, state the hypotheses, find the
critical value(s), label the acceptance and rejection regions, calculate the test value, decide
whether or not to reject the null hypothesis and summarize the results.
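
The test value in part (A) can be checked with the sketch below, which applies formula 9 from the formula sheet. The hypotheses H0: p ≥ 0.35 against H1: p < 0.35 (a left-tailed test) are an assumption about how the question is intended to be set up, and the function name is hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def one_proportion_z(successes, n, p0):
    """z = (p_hat - p) / sqrt(p * q / n), with p and q taken from H0."""
    p_hat = successes / n
    return (p_hat - p0) / sqrt(p0 * (1 - p0) / n)


z = one_proportion_z(successes=112, n=400, p0=0.35)
critical = NormalDist().inv_cdf(0.025)   # left-tail critical value at the 2.5% level
reject_h0 = z < critical

print(f"z = {z:.3f}, critical value = {critical:.3f}, reject H0: {reject_h0}")
```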

(B) What two things should be done before one performs a regression analysis?

(C) The following data represent trends in cigarette consumption (x) per capita and lung cancer
mortality rate (y) in a country.

Consumption, x      11.8  12.5  15.7  19.2  21.9  23.3
Mortality rate, y   10.4  16.5  22.9  26.6  33.8  42.8

(i) Test whether the correlation between consumption and mortality rate is significant at 5%
level of significance.
(ii) Find the regression equation for predicting mortality rate.
(6 + 4 = 10 marks)
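
A minimal sketch of the computations in part (C), using formulae 10 and 11 from the formula sheet. The expression for r is the standard computational form of the Pearson correlation coefficient, which the formula sheet does not list, and the function name is hypothetical. The critical value 2.776 is the usual two-tailed t-table entry for 4 degrees of freedom at the 5% level.

```python
from math import sqrt

x = [11.8, 12.5, 15.7, 19.2, 21.9, 23.3]   # consumption
y = [10.4, 16.5, 22.9, 26.6, 33.8, 42.8]   # mortality rate


def correlation_and_line(x, y):
    """Return r, the t test value, and the line coefficients a and b (y' = a + b x)."""
    n = len(x)
    sx, sy = sum(x), sum(y)
    sxx = sum(v * v for v in x)
    syy = sum(v * v for v in y)
    sxy = sum(u * v for u, v in zip(x, y))
    denom_x = n * sxx - sx ** 2
    r = (n * sxy - sx * sy) / sqrt(denom_x * (n * syy - sy ** 2))
    t = r * sqrt((n - 2) / (1 - r ** 2))    # d.f. = n - 2
    a = (sy * sxx - sx * sxy) / denom_x
    b = (n * sxy - sx * sy) / denom_x
    return r, t, a, b


r, t, a, b = correlation_and_line(x, y)
t_critical = 2.776                           # table value, d.f. = 4, two-tailed 5%
print(f"r = {r:.3f}, t = {t:.2f}, significant: {abs(t) > t_critical}")
print(f"y' = {a:.3f} + {b:.3f} x")
```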

Question 5 Start on a new page [2 + 8 + 2 + 8 = 20 marks]

(A) State two assumptions for the chi-square goodness-of-fit test.

(B) A drug company is interested in investigating whether the color of their packaging has any impact
on sales. To test this, they used five different colors (blue, green, orange, red, and white) for the
boxes of an over-the-counter pain reliever, instead of their traditional white box. The following table
shows the number of boxes of each color sold during the first month.

Box color               Blue  Green  Orange  Red  White
Number of boxes sold    310   292    280     216  296

Using the 1% significance level, test the null hypothesis that the number of boxes sold of each of
these five colors is the same. To draw your conclusion, state the hypotheses, find the critical value(s),
label the acceptance and rejection regions, calculate the test value and summarize the results.
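
The test value in part (B) can be sketched with formula 12 from the formula sheet. Under the null hypothesis of equal sales, every expected frequency is the total divided by the number of colors. The function name is hypothetical, and 13.277 is the usual chi-square table entry for 4 degrees of freedom at the 1% level.

```python
observed = {"Blue": 310, "Green": 292, "Orange": 280, "Red": 216, "White": 296}


def chi_square_equal_frequencies(counts):
    """Goodness-of-fit statistic when all categories are expected to be equal."""
    counts = list(counts)
    expected = sum(counts) / len(counts)
    chi2 = sum((o - expected) ** 2 / expected for o in counts)
    return chi2, len(counts) - 1             # test value and degrees of freedom


chi2, df = chi_square_equal_frequencies(observed.values())
critical = 13.277                             # table value, d.f. = 4, alpha = 0.01
print(f"chi-square = {chi2:.3f}, d.f. = {df}, reject H0: {chi2 > critical}")
```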

(C) Briefly explain when a one-way ANOVA procedure is used to make a test of hypothesis.

(D) The following ANOVA table, based on information obtained for three samples selected from three
independent populations that are normally distributed with equal variances, has a few missing
values.

Source of Variation   Sum of Squares   Degrees of Freedom   Mean Square   F
Between               ____             2                    19.2813       ____
Within (error)        89.3677          ____                 ____
Total                 ____             12

(i) In your answer booklet, copy this ANOVA table. Find the missing values and complete the
table.
(ii) Using the 1% significance level, what is your conclusion for the test with the null hypothesis that
the means of the three populations are all equal against alternative hypothesis that the means
of the three populations are not all equal?
(2 + 6 = 8 marks)
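
A hedged sketch of how the missing ANOVA entries in part (D) follow from the given ones. It assumes the given values are: between-samples degrees of freedom 2 and mean square 19.2813, within-samples sum of squares 89.3677, and total degrees of freedom 12. The critical value 7.56 is the usual F-table entry for (2, 10) degrees of freedom at the 1% level.

```python
# Given entries (assumed placement in the table, see the note above).
df_between = 2
ms_between = 19.2813
ss_within = 89.3677
df_total = 12

# Missing entries derived from the standard one-way ANOVA relationships.
df_within = df_total - df_between        # degrees of freedom add up
ss_between = ms_between * df_between     # MS = SS / d.f.
ms_within = ss_within / df_within
ss_total = ss_between + ss_within        # sums of squares add up
f_value = ms_between / ms_within

f_critical = 7.56                        # table value, d.f. = (2, 10), alpha = 0.01
print(f"SSB = {ss_between:.4f}, MSW = {ms_within:.4f}, SST = {ss_total:.4f}")
print(f"F = {f_value:.4f}, reject H0: {f_value > f_critical}")
```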
END OF EXAM


FORMULAE

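1. $c = \dfrac{n \cdot p}{100}$

2. $\text{Percentile Rank}(X) = \dfrac{(\text{number of data less than } X) + 0.5}{n} \times 100$

3. $\mathrm{IQR} = Q_3 - Q_1$

4. $P(A \cap B) = P(A) \cdot P(B)$, if $A$ and $B$ are independent.

5. $P(A \cap B) = P(A) \cdot P(B \mid A)$, if $A$ and $B$ are dependent.

6. $P(A \cup B) = P(A) + P(B) - P(A \cap B)$

7. $\mu = E(X) = \sum X \cdot P(X)$

8. $\bar{X} - z_{\alpha/2} \dfrac{\sigma}{\sqrt{n}} < \mu < \bar{X} + z_{\alpha/2} \dfrac{\sigma}{\sqrt{n}}$

9. $z = \dfrac{\hat{p} - p}{\sqrt{pq/n}}$

10. $t = r \sqrt{\dfrac{n - 2}{1 - r^2}}$, d.f. $= n - 2$

11. $a = \dfrac{\left(\sum y\right)\left(\sum x^2\right) - \left(\sum x\right)\left(\sum xy\right)}{n\left(\sum x^2\right) - \left(\sum x\right)^2}$ and $b = \dfrac{n\left(\sum xy\right) - \left(\sum x\right)\left(\sum y\right)}{n\left(\sum x^2\right) - \left(\sum x\right)^2}$

12. $\chi^2 = \sum \dfrac{(O - E)^2}{E}$, d.f. $=$ number of categories $- 1$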
