Statistics Sample Questions
Statistics Sample Questions
Statistics Sample Questions
1a. [2 marks]
Each athlete on a running team recorded the distance (𝑀 miles) they ran in 30 minutes.
The median distance is 4 miles and the interquartile range is 1.1 miles.
This information is shown in the following box-and-whisker plot.
Markscheme
valid approach (M1)
eg 𝑄3 − 𝑄1 , 𝑄3 − 1.1 , 4.5 − 𝑎 = 1.1
𝑎 = 3.4 A1 N2
[2 marks]
1b. [1 mark]
The distance in miles, 𝑀, can be converted to the distance in kilometres, 𝐾, using the
8
formula 𝐾 = 5 𝑀.
Write down the value of the median distance in kilometres (km).
Markscheme
32
(= 6.4) (km) A1 N1
5
[1 mark]
1c. [4 marks]
16
The variance of the distances run by the athletes is km2 .
9
Markscheme
METHOD 1 (standard deviation first)
valid approach (M1)
16
eg standard deviation = √variance , √ 9
4
standard deviation= 3 (km) (A1)
4 5 16 8
eg × 8 , √9 = 5𝑀
3
20 5
(miles) (= 6) A1 N3
24
4
Note: If no working shown, award M1A1M0A0 for the value 3.
4
If working shown, and candidate’s final answer is 3, award M1A1M0A0.
25 16 5 2
eg standard deviation = √variance , √36 , √ 9 × (8)
20 5
(miles) (= 6) A1 N3
24
[4 marks]
1d. [3 marks]
A total of 600 athletes from different teams compete in a 5km race. The times the 600
athletes took to run the 5km race are shown in the following cumulative frequency graph.
There were 400 athletes who took between 22 and 𝑚 minutes to complete the 5km race.
Find 𝑚.
Markscheme
correct frequency for 22 minutes (A1)
eg 20
adding their frequency (do not accept 22 + 400) (M1)
eg 20 + 400 , 420 athletes
𝑚 = 30 (minutes) A1 N3
[3 marks]
1e. [5 marks]
The first 150 athletes that completed the race won a prize.
Given that an athlete took between 22 and 𝑚 minutes to complete the 5km race, calculate
the probability that they won a prize.
Markscheme
27 (minutes) (A1)
correct working (A1)
150−20 13
eg 130 athletes between 22 and 27 minutes, P(22 < 𝑡 < 27) = ,
600 60
150
Note: If no other working is shown, award A0A0M1A0A0 for answer of 400.
3
Award N0 for answer of 8 with no other working shown.
[5 marks]
2a. [3 marks]
Lucy sells hot chocolate drinks at her snack bar and has noticed that she sells more hot
chocolates on cooler days. On six different days, she records the maximum
daily temperature, 𝑇, measured in degrees centigrade, and the number of hot chocolates
sold, 𝐻. The results are shown in the following table.
Markscheme
−0.981244
𝑟 = −0.981 A1 N1
[1 mark]
2c. [2 marks]
Using the regression equation, estimate the number of hot chocolates that Lucy will sell on
a day when the maximum temperature is 12C.
Markscheme
correct substitution into their equation (A1)
eg −9.85 × 12 + 222
103.435 (103.8 from 3 sf)
103 (hot chocolates) A1 N2
[2 marks]
3a. [3 marks]
The number of messages, 𝑀, that six randomly selected teenagers sent during the month of
October is shown in the following table. The table also shows the time, 𝑇, that they spent
talking on their phone during the same month.
The relationship between the variables can be modelled by the regression equation 𝑀 =
𝑎𝑇 + 𝑏.
Write down the value of 𝑎 and of 𝑏.
Markscheme
evidence of set up (M1)
eg correct value for 𝑎 or 𝑏 (accept 𝑟 = 0.966856)
4.30161, 163.330
𝑎 = 4.30, 𝑏 = 163 (accept 𝑦 = 4.30𝑥 + 163) A1A1 N3
[3 marks]
3b. [3 marks]
Use your regression equation to predict the number of messages sent by a teenager that
spent 154 minutes talking on their phone in October.
Markscheme
valid approach (M1)
eg 4.30(154) + 163
eg 825.778 (825.2 from 3 sf values) (A1)
number of messages = 826 (must be an integer) A1 N3
[3 marks]