Statistics Sample Questions

Download as pdf or txt
Download as pdf or txt
You are on page 1of 9

Statistics sample questions [27 marks]

1a. [2 marks]
Each athlete on a running team recorded the distance (𝑀 miles) they ran in 30 minutes.
The median distance is 4 miles and the interquartile range is 1.1 miles.
This information is shown in the following box-and-whisker plot.

Find the value of 𝑎.

Markscheme
valid approach (M1)
eg 𝑄3 − 𝑄1 , 𝑄3 − 1.1 , 4.5 − 𝑎 = 1.1
𝑎 = 3.4 A1 N2
[2 marks]
1b. [1 mark]
The distance in miles, 𝑀, can be converted to the distance in kilometres, 𝐾, using the
8
formula 𝐾 = 5 𝑀.
Write down the value of the median distance in kilometres (km).

Markscheme
32
(= 6.4) (km) A1 N1
5

[1 mark]
1c. [4 marks]
16
The variance of the distances run by the athletes is km2 .
9

The standard deviation of the distances is 𝑏 miles.


Find the value of 𝑏.

Markscheme
METHOD 1 (standard deviation first)
valid approach (M1)

16
eg standard deviation = √variance , √ 9
4
standard deviation= 3 (km) (A1)

valid approach to convert their standard deviation (M1)

4 5 16 8
eg × 8 , √9 = 5𝑀
3

20 5
(miles) (= 6) A1 N3
24

4
Note: If no working shown, award M1A1M0A0 for the value 3.
4
If working shown, and candidate’s final answer is 3, award M1A1M0A0.

METHOD 2 (variance first)


valid approach to convert variance (M1)
5 2 64 16 5 2
eg (8) , , × (8)
25 9
25
variance = 36 (A1)

valid approach (M1)

25 16 5 2
eg standard deviation = √variance , √36 , √ 9 × (8)

20 5
(miles) (= 6) A1 N3
24

[4 marks]
1d. [3 marks]
A total of 600 athletes from different teams compete in a 5km race. The times the 600
athletes took to run the 5km race are shown in the following cumulative frequency graph.

There were 400 athletes who took between 22 and 𝑚 minutes to complete the 5km race.
Find 𝑚.
Markscheme
correct frequency for 22 minutes (A1)
eg 20
adding their frequency (do not accept 22 + 400) (M1)
eg 20 + 400 , 420 athletes
𝑚 = 30 (minutes) A1 N3
[3 marks]
1e. [5 marks]
The first 150 athletes that completed the race won a prize.
Given that an athlete took between 22 and 𝑚 minutes to complete the 5km race, calculate
the probability that they won a prize.

Markscheme
27 (minutes) (A1)
correct working (A1)
150−20 13
eg 130 athletes between 22 and 27 minutes, P(22 < 𝑡 < 27) = ,
600 60

evidence of conditional probability or reduced sample space (M1)


P(22<𝑡<27) 150
eg P(𝐴𝐵) , P(𝑡 < 2722 < 𝑡 < 30) , ,
P(22<𝑡<𝑚) 400

correct working (A1)


130
150−20
eg 600
400 , 400
600

130 13 78000 390


(40 = 240000 = 1200 = 0.325) A1 N5
400

150
Note: If no other working is shown, award A0A0M1A0A0 for answer of 400.
3
Award N0 for answer of 8 with no other working shown.

[5 marks]
2a. [3 marks]
Lucy sells hot chocolate drinks at her snack bar and has noticed that she sells more hot
chocolates on cooler days. On six different days, she records the maximum
daily temperature, 𝑇, measured in degrees centigrade, and the number of hot chocolates
sold, 𝐻. The results are shown in the following table.

The relationship between 𝐻 and 𝑇 can be modelled by the regression line


with equation 𝐻 = 𝑎𝑇 + 𝑏.
Find the value of 𝑎 and of 𝑏.
Markscheme
valid approach (M1)
eg correct value for 𝑎 or 𝑏 (or for 𝑟 or 𝑟 2 = 0.962839 seen in (ii))
𝑎 = −9.84636, 𝑏 = 221.592
𝑎 = −9.85, 𝑏 = 222 A1A1 N3
[3 marks]
2b. [1 mark]
Write down the correlation coefficient.

Markscheme
−0.981244
𝑟 = −0.981 A1 N1
[1 mark]
2c. [2 marks]
Using the regression equation, estimate the number of hot chocolates that Lucy will sell on
a day when the maximum temperature is 12C.

Markscheme
correct substitution into their equation (A1)
eg −9.85 × 12 + 222
103.435 (103.8 from 3 sf)
103 (hot chocolates) A1 N2
[2 marks]
3a. [3 marks]
The number of messages, 𝑀, that six randomly selected teenagers sent during the month of
October is shown in the following table. The table also shows the time, 𝑇, that they spent
talking on their phone during the same month.

The relationship between the variables can be modelled by the regression equation 𝑀 =
𝑎𝑇 + 𝑏.
Write down the value of 𝑎 and of 𝑏.

Markscheme
evidence of set up (M1)
eg correct value for 𝑎 or 𝑏 (accept 𝑟 = 0.966856)
4.30161, 163.330
𝑎 = 4.30, 𝑏 = 163 (accept 𝑦 = 4.30𝑥 + 163) A1A1 N3
[3 marks]
3b. [3 marks]
Use your regression equation to predict the number of messages sent by a teenager that
spent 154 minutes talking on their phone in October.

Markscheme
valid approach (M1)
eg 4.30(154) + 163
eg 825.778 (825.2 from 3 sf values) (A1)
number of messages = 826 (must be an integer) A1 N3
[3 marks]

Printed for FECEN-COLEGIO ALBANIA


© International Baccalaureate Organization 2021
International Baccalaureate® - Baccalauréat International® - Bachillerato Internacional®

You might also like