

STA1501/011/0/2024

Tutorial Letter 011/0/2024

Descriptive Statistics and Probability


STA1501

Year module

Department of Statistics

ASSIGNMENT 1 QUESTIONS

ASSIGNMENT 01
Unique Nr.: 390154
Opening date: 20 March 2024
Due date: 4 April 2024

Please study Chapters 1 to 5 in the prescribed textbook and the corresponding units in the
study guide before answering the following questions.

Remember that, in this module STA1501, to get examination admission you must have a year
mark of at least 40%. Your year mark is calculated as the average of the 6 assignments. Your
year mark will count for 30% and the examination will count for 70% of the final module mark.

The assignment consists of two parts:

• Part A: True/false questions. There are 10 questions for 20 marks.
• Part B: Written questions for 80 marks. There are 4 questions to answer by
showing the working steps.

PART A: 20 Marks

State whether the following statements are true or false.

QUESTION 1 [2 marks]

When you are looking at the modules for your degree at UNISA, the module STA1501 listed for
your degree represents a population.

QUESTION 2 [2 marks]

The technique you use to visualise your data on the qualitative variables is a bar chart.

QUESTION 3 [2 marks]
A population size in a particular area of Gauteng Province represents a continuous quantitative
variable.

QUESTION 4 [2 marks]

A rating of a newly elected politician (excellent, good, fair, poor) represents an ordinal
qualitative variable.


QUESTION 5 [2 marks]

A province in which a person lives represents a qualitative variable.

QUESTION 6 [2 marks]

In statistics there are several methods that can be used to analyze data. Descriptive
statistics methods enable you to organize and summarize the data.

QUESTION 7 [2 marks]

Consider the frequency table given below.

Class limits   5−9   10−14   15−19   Total
Frequency      51    36      33      120

The cumulative frequency table is

Class limits   5−9   10−14   15−19   Total
Frequency      51    87      120
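
A cumulative frequency is the running total of the class frequencies, so the last class should always reach the sample size. The following is a minimal sketch in Python (the variable names are illustrative and not part of the assignment) for checking a table of this kind:

```python
from itertools import accumulate

# Class limits and frequencies copied from the frequency table in Question 7.
classes = ["5-9", "10-14", "15-19"]
frequencies = [51, 36, 33]

# Running totals: each entry is the sum of all frequencies up to that class.
cumulative = list(accumulate(frequencies))

for label, cum in zip(classes, cumulative):
    print(f"{label:>6}: {cum}")
```

The final running total can be compared with the "Total" entry of the original table as a quick consistency check.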

QUESTION 8 [2 marks]

Given the regression equation line

ŷ = 3.68 + 4.20x,

the estimated value of y when x = 10 is equal to 42.
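
A statement like this can be checked by substituting the x value into the fitted line. The sketch below assumes nothing beyond the intercept and slope quoted in the question; the function name `predict` is hypothetical:

```python
def predict(x, intercept=3.68, slope=4.20):
    """Return the fitted value y-hat = intercept + slope * x."""
    return intercept + slope * x


# Evaluate the fitted line at the x value used in Question 8.
y_hat = predict(10)
print(round(y_hat, 2))
```

Note that the intercept contributes to the prediction as well as the slope term.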

QUESTION 9 [2 marks]

Consider the following data:

0 2 0 6 3 1 0 4 9 0 5 6 1 0 2

The median is equal to the second quartile.


QUESTION 10 [2 marks]

Consider the following summary statistics for a simple linear relationship:

n = 5,   Σxy = 70,   Σx = 15,   Σy = 20

The covariance of X and Y is 2.5.
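
One way to test this statement is the shortcut formula for the sample covariance, s_xy = (Σxy − (Σx)(Σy)/n) / (n − 1). The sketch below assumes the sample (n − 1) version is intended; a population covariance would divide by n instead. The function name is illustrative only:

```python
def sample_covariance(n, sum_xy, sum_x, sum_y):
    """Shortcut formula for the sample covariance (divides by n - 1)."""
    return (sum_xy - sum_x * sum_y / n) / (n - 1)


# Summary statistics from Question 10.
s_xy = sample_covariance(n=5, sum_xy=70, sum_x=15, sum_y=20)
print(s_xy)
```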

PART B: 80 Marks

QUESTION 1 [25 marks]

Consider the data as given below.

13 22 14 30 33 19
21 14 33 16 27 21
18 17 16 33 21 21

1. Construct a stem-and-leaf display. (4 marks)
2. Calculate the sample mean (round to 4 decimal places). (2 marks)
3. Calculate the median and the mode. (6 marks)
4. Describe the shape of the data using the measures of central location. (3 marks)
5. Determine the number of class intervals. (2 marks)
6. Determine the appropriate class interval width. (1 mark)
7. Calculate the value of the first, second and third quartile of the data (this is equivalent to
the 25th, 50th and 75th percentiles). (6 marks)
8. Calculate the interquartile range. (1 mark)
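
Hand calculations for several of these parts can be cross-checked with Python's standard `statistics` module. The sketch below is not a model answer: it assumes the quartiles are located at position (n + 1)p in the ordered data (the `"exclusive"` method), which is one common textbook convention, and other conventions give slightly different values.

```python
import statistics
from collections import defaultdict

# The 18 observations from Question 1.
data = [13, 22, 14, 30, 33, 19,
        21, 14, 33, 16, 27, 21,
        18, 17, 16, 33, 21, 21]

# Stem-and-leaf display: tens digit as the stem, units digit as the leaf.
stems = defaultdict(list)
for value in sorted(data):
    stems[value // 10].append(value % 10)
for stem in sorted(stems):
    print(stem, "|", " ".join(str(leaf) for leaf in stems[stem]))

mean = round(statistics.mean(data), 4)
median = statistics.median(data)
mode = statistics.mode(data)

# method="exclusive" places the p-th quantile at position (n + 1) * p
# and interpolates linearly between neighbouring ordered values.
q1, q2, q3 = statistics.quantiles(data, n=4, method="exclusive")
iqr = q3 - q1

print(mean, median, mode, q1, q2, q3, iqr)
```

Comparing the mean with the median and mode gives a first indication of the direction of skewness asked for in part 4.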


QUESTION 2 [19 marks]

Consider the following summary statistics of two variables (X = independent variable and Y =
dependent variable).

n = 12

Σxᵢ = 984        Σyᵢ = 912

Σxᵢ² = 81228     Σyᵢ² = 70726

Σxy = 74563

2.1 Calculate (round to 4 decimal places at each step):

1. The mean of X and the mean of Y. (2 marks)
2. The variance of X and the variance of Y. (4 marks)
3. The covariance of X and Y. (3 marks)
4. The coefficient of determination. (4 marks)
5. The coefficient of correlation. (4 marks)

2.2 Explain what the coefficient of:

1. determination tells you; and (1 mark)
2. correlation tells you. (1 mark)
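
The quantities in 2.1 can all be derived from the five sums given. The following sketch assumes the sample versions of the variance and covariance (dividing by n − 1) and does not round at intermediate steps, so hand answers rounded to 4 decimals at each step may differ in the last digit. All variable names are illustrative.

```python
import math

# Summary statistics from Question 2.
n = 12
sum_x, sum_y = 984, 912
sum_x2, sum_y2 = 81228, 70726
sum_xy = 74563

mean_x = sum_x / n
mean_y = sum_y / n

# Shortcut formulas for the sample variance and covariance.
var_x = (sum_x2 - sum_x ** 2 / n) / (n - 1)
var_y = (sum_y2 - sum_y ** 2 / n) / (n - 1)
cov_xy = (sum_xy - sum_x * sum_y / n) / (n - 1)

# Coefficient of correlation and coefficient of determination.
r = cov_xy / math.sqrt(var_x * var_y)
r_squared = r ** 2

for name, value in [("mean_x", mean_x), ("mean_y", mean_y),
                    ("var_x", var_x), ("var_y", var_y),
                    ("cov_xy", cov_xy), ("r", r), ("r_squared", r_squared)]:
    print(f"{name:>10} = {value:.4f}")
```

The sign of the covariance carries over to the coefficient of correlation, while the coefficient of determination is always between 0 and 1.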

QUESTION 3 [17 marks]

A lecturer wants to determine whether the mark that a student receives is related to the
amount of time spent studying a module. To test the relationship between the marks (y) and
the study time (x), the following data were used:

X (minutes)   42   40   37   47   25   44   41
Y (%)         63   77   79   86   51   78   83

1. Construct a scatter diagram of the data. (5 marks)
2. Calculate the intercept (b₀) and the slope (b₁). [Round your answer to 4 decimal places
at each step]. (3 marks)
3. Write the regression model for prediction. (3 marks)
4. Calculate the standard deviation of X and the standard deviation of Y. (6 marks)
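
The least squares estimates can be cross-checked numerically. The sketch below assumes the usual formulas b₁ = s_xy / s_x² and b₀ = ȳ − b₁x̄ and uses sample standard deviations (n − 1 divisor); it keeps full precision, so values rounded at each step by hand may differ slightly. The scatter diagram is left to be drawn by hand or with a plotting tool.

```python
import statistics

# Study time in minutes (x) and mark in percent (y) from Question 3.
x = [42, 40, 37, 47, 25, 44, 41]
y = [63, 77, 79, 86, 51, 78, 83]

n = len(x)
mean_x = statistics.mean(x)
mean_y = statistics.mean(y)

# Sums of squares and cross-products about the means.
s_xx = sum((xi - mean_x) ** 2 for xi in x)
s_xy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))

# Least squares slope and intercept.
b1 = s_xy / s_xx
b0 = mean_y - b1 * mean_x

# Sample standard deviations (n - 1 divisor).
sd_x = statistics.stdev(x)
sd_y = statistics.stdev(y)

print(f"y-hat = {b0:.4f} + {b1:.4f} x")
print(f"sd_x = {sd_x:.4f}, sd_y = {sd_y:.4f}")
```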


QUESTION 4 [19 marks]

A bank manager in South Africa was asked the following question: Where do you expect the
South African economy to be 12 months from now? Each respondent chooses better, same or worse.
A sample of the responses to this question was recorded as given below.

same    same    same    better  worse
same    same    better  same    worse
same    better  same    better  same
worse   same    same    same    worse
same    same    same    better  same

1. Calculate the frequency distribution that summarizes the responses to the questionnaire.
(4 marks)
2. Calculate the relative frequency distribution. (3 marks)
3. Calculate the cumulative frequency distribution. (3 marks)
4. Construct a bar chart based on the frequency table. (5 marks)
5. What is the most common response to the questionnaire? (2 marks)
6. How would you characterize the economy in South Africa? (2 marks)
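
Tallying categorical responses by hand is error-prone, so a quick count can be used to verify the frequency table. The following sketch uses `collections.Counter`; the category order (better, same, worse) chosen for the cumulative column is an assumption, and the text bar chart is only a rough stand-in for a properly drawn one.

```python
from collections import Counter
from itertools import accumulate

# The 25 responses from Question 4, read row by row.
responses = """
same same same better worse
same same better same worse
same better same better same
worse same same same worse
same same same better same
""".split()

counts = Counter(responses)
total = len(responses)

# Assumed ordering of the categories for the cumulative column.
order = ["better", "same", "worse"]
frequencies = [counts[c] for c in order]
relative = [f / total for f in frequencies]
cumulative = list(accumulate(frequencies))

for c, f, rf, cf in zip(order, frequencies, relative, cumulative):
    # One '#' per response gives a simple horizontal bar chart.
    print(f"{c:<7} {f:>3} {rf:>5.2f} {cf:>3}  {'#' * f}")

most_common = counts.most_common(1)[0][0]
print("Most common response:", most_common)
```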

TOTAL: [100]
