9 Unit6
9 Unit6
9 Unit6
Main Contents
6.1 Statistical data
6.2 Probability
Key Terms
Summary
Review Exercises
Mathematics Grade 9
INTRODUCTION
You have some knowledge about statistics and its basics, such as collection of data,
presentation of data, etc., from your primary grade mathematics. In this unit, we
formally define ‘statistics’ as a branch of applied mathematics and learn more about
collection, presentation and analysis of numerical data. The unit also reviews the
concept of probability, which was introduced in Grade 8 and teaches you more about
experimental and theoretical approaches to probability and helps you solve simple
problems based on these approaches.
Definition 6.1
Statistics is the science of collecting, organizing, presenting, analyzing and
interpreting data (quantitative information) in order to draw conclusions.
1 Collection of data
The first step of statistics is collection of data. This is the process of obtaining
measurements or counts. For example, measuring the heights of students in your class,
or counting the number of persons admitted to a certain hospital are examples of data
collection.
2 Organization of data
The second step of statistics refers to the organization of data. Collected data has to be
organized in a suitable form to understand the information gathered. The collected data
must be edited, classified and tabulated.
3 Presentation of data
The main purpose of data presentation is to facilitate statistical analysis. This can be
done by illustrating the data using graphs and diagrams like bar graph, histograms, pie-
charts, pictograms, frequency polygons, etc.
4 Analysis of data
In order to meet the desired purpose of investigation, data has to be analyzed. The
purpose of analyzing data is to highlight information useful for decision making.
5 Interpretation of data
Based on analyzed data, conclusions have to be drawn. This step usually involves
decision making about a large collection of objects (the population) based on
information gathered from a small collection of similar objects (the sample).
The decision making processes used by the managers of modern businesses and industry
is governed by statistical application. Statistical methods can be applied to any situation
where numerical information is gathered with the objective of making rational decisions
in the face of uncertainty.
251
Mathematics Grade 9
The following examples show us how statistics plays a major role in decision making in
different sectors.
Example 1 Information gathered about the incidence or prevalence of diseases in a
community provides useful information on changing trends in health
status, mortality, nutritional status or environmental hazards.
Example 2 Statistics is used to study existing conditions and the prevalence rate of
HIV/AIDS in order to design new programs or to study the merits of
different methods adopted to control HIV/AIDS. It assists in determining
the effectiveness of new medication and the importance of counselling.
Example 3 Demographic data about population size, its distribution by age and sex
and the rate of population growth, etc., all help policy makers in
determining future needs such as food, clothing, housing, education,
health facilities, water, electricity and transportation systems.
Example 4 Recording annual temperatures in a country provides the community with
timely warning of environmental hazards.
Example 5 Statistical data collected on customer services provides feedback that can
help to reform policies and systems.
In the absence of accurate and timely data, it is impossible to form suitable policies.
Statistics also plays a vital role in monitoring the proper implementation of programs
and policies.
In its ordinary usage, population refers to the number of people living in an area or
country. In statistics, however, population refers to the complete collection of
individuals, objects or measurements that have a common characteristic.
Gaining access to an entire group (or population) is often difficult, expensive and
sometimes destructive. Therefore, instead of examining the entire group, a researcher
examines a small part of the group, called a sample.
Data can be classified as either qualitative or quantitative. However, statistics deals
mainly with quantitative data.
Example 6 Data collected from the population of students in Ethiopia could be;
i Qualitative if the data is based on some characteristic whose values are not
numbers, such as their eye colour, sex, religion or nationality.
ii Quantitative if the data is numerical such as height, weight, age or scores in
tests.
A rule which gives a corresponding value to each member of a population is called a
population function.
252
Unit 6 Statistics and Probability
Example 7 Here is a table showing the approximate sizes of major lakes in Ethiopia.
TABLE 6.1: Size of major lakes in Ethiopia
Name of Lake Length (km) Width (km) Area (km2)
Abaya 60 20 1160
Abayata 17 15 205
Ashenge 5 4 20
Hawassa 16 9 229
Chamo 26 22 551
Hayk 7 5 35
Koka 20 15 205
Langano 18 16 230
Shalla 28 12 409
Tana 70 60 3600
Ziway 25 20 434
We can think of the set of the eleven lakes as the population and the functions
L: Length, W: width, A: area, etc as functions on this population.
Example 8 The following table shows the age of 10 students in a certain class.
TABLE 6.2: Age of students
253
Mathematics Grade 9
The students are members of the population and their age, A is the population function.
Statistics can be classified into two types: Descriptive statistics and inferential statistics.
Definition 6.2
Descriptive is a branch of statistics concerned with
statistics
summarizing and describing a large amount of data without drawing
any conclusion about a particular bit of data.
Definition 6.3
Inferential statistics is a branch of statistics concerned with interpreting
data and drawing conclusions.
1 Primary data
Data is said to be primary, if it is obtained first hand for the particular purpose on
which one is currently working. Primary data is original data, obtained personally from
primary sources by observation, interview or direct measurement.
Example 9 If you measure the heights of students in your class, this is primary data.
Example 10 The data gathered by the Ministry of Education about the number of
students enrolled in different universities of Ethiopia is primary data for
the Ministry itself. (If you were to use this data, it would be secondary
data for you.)
2 Secondary data
Data which has been collected previously (for similar or different purpose) is known as
secondary data. Secondary data refers to that data which is not originated by the
researcher himself/herself, but which he/she obtains from someone else’s records. Some
sources of secondary data are official publications, journals, newspapers, different
studies, national statistical abstracts, etc.
254
Unit 6 Statistics and Probability
ACTIVITY 6.1
1 Classify the employees in your school by household income.
2 Group the number of HIV/AIDS victims recorded in your
nearest health centre according to their age group.
3 Collect data on age, height and mathematics exam score of the students in your
class. Classify or tabulate the data collected.
Different people or organizations collect data for different reasons and the basis of
classification they use is also different accordingly. To see this, consider the following
examples.
Example 13 An economist in the Labour Department of a Regional Social Affairs
Bureau may classify the households in a certain locality by household
income as shown in the table below.
above 1550 10
Total 300
255
Mathematics Grade 9
Example 14 According to the 2007 Ethiopian Census, the Ethiopian Central Statistical
Agency (CSA) has classified the population by sex as follows.
TABLE 6.4: Population by sex (2007 Ethiopian census)
Region Male ( in 1000) Female ( in 1000) Both sexes ( in 1000)
Tigray 2124.8 2189.6 4314.4
Affar 786.3 624.7 1411.0
Amhara 8636.9 8577.2 17214.1
Oromiya 13676.2 13482.3 27158.5
Somale 2468.8 1970.4 4439.2
Benshangul 340.4 330.5 670.9
SNNP 7482.0 7560.5 15042.5
Gambela 159.7 147.2 306.9
Harari 92.3 91.1 183.4
Addis Ababa 1304.5 1433.7 2738.2
Dire Dawa 171.9 170.9 342.8
Total 37243.8 36577.4 73821.2
A statistical table is a systematic presentation or organization of numerical data in
columns and rows. Columns are vertical arrangements and rows are horizontal. The
main purpose of a statistical table is to allow the reader to quickly access relevant
information. A title and row and column headers are important.
Exercise 6.1
1 What are the steps used in doing a statistical study?
2 What do we mean by organizing or presenting data?
3 Explain each of the following statistical terms by giving examples.
a qualitative data b quantitative data c population
d population function e sample
4 Mention four uses of statistics.
5 What is descriptive statistics?
6 Describe in your own words the difference between a population and a sample.
7 Determine whether the following data is qualitative or quantitative.
a Gender b Temperature c Zip code
d Number of days e Religions f Occupations
g Ages h Colours i Nationality
256
Unit 6 Statistics and Probability
Definition 6.3
A quantity that we measure from observation is called a variate or
variable denoted by V. The distribution of a population function is the
function which associates with each variate of the population function a
corresponding frequency denoted by f.
Methods for organizing raw data include the drawing of tables or graphs, which allow
quick overview of the information collected.
Example 1 Suppose there are 10 students in a group whose scores in a mathematics
quiz were as follows:
13, 12, 14, 13, 12, 12, 13, 14, 15, 12
Organize the data in tabular form. What are the variates? Give the frequency of each
variate.
Solution: The data given above is raw data.
We may now tabulate the given data in the form given below.
Score (V) 12 13 14 15
Number of students (f) 4 3 2 1
The table given above is called the frequency distribution table. The scores are the
variate and the number of students getting a particular score is the frequency of the
variate.
257
Mathematics Grade 9
Definition 6.4
A frequency distribution is a tabular or graphical representation of a
data showing the frequency associated with each data value.
8, 9, 8, 7, 10, 9, 6, 4, 9, 8,
7, 8, 10, 9, 8, 6, 9, 7, 8, 8
Value(V) 4 5 6 7 8 9 10 Total
Frequency(f) 1 0 2 3 7 5 2 20
Definition 6.5
A histogram is a graphical representation of a frequency distribution in
which the variate (V) is plotted on the x-axis (horizontal axis) and the
frequency ( f ) is plotted on the y-axis (vertical axis).
258
Unit 6 Statistics and Probability
ACTIVITY 6.2
Consider the following data that shows the number of days 25
individuals participated in soil and water conservation tasks:
3 8 7 4 8
5 9 8 5 9
7 8 3 7 5
8 5 6 8 8
10 7 4 4 7
Construct a frequency distribution table and a histogram for the above data.
Example 3 The temperature in oC for the first 14 days of September in a certain
town were recorded as
4
Frequency
0
18 19 20 21 22 23 24 25 26 27 28
Temperature
Figure 6.1
259
Mathematics Grade 9
Example 4 The following histogram shows the daily income (in Birr) of 30
employees in a factory.
Number of Employees
5
4
3
2
1
0
84 85 86 87 88 89 90 91 92 93
Daily income
Figure 6.2
Exercise 6.2
1 Give two reasons why raw data should be summarized into a frequency distribution.
2 What is the difference between a frequency distribution table and a histogram?
3 The ages (to the nearest year) of 40 children in a certain village are as follows:
10 7 4 5 1 9 3 6 5 4
2 7 5 3 2 5 6 2 8 9
5 8 9 9 5 2 1 3 9 4
3 5 7 9 6 3 6 8 1 2
Prepare a frequency distribution table and a histogram for the given data.
260
Unit 6 Statistics and Probability
4 Collect the score the students in your class obtained in their mid–semester
mathematics exam and
a Prepare a frequency distribution table.
b Draw a histogram.
c What score is most frequent?
d What is the least score obtained?
5 A sample of 50 couples married for 10 years were asked how many children they
had. The result of the survey is as follows:
0 4 2 2 1 3 0 3 2 4
3 3 1 3 3 3 3 3 2 2
1 3 3 2 4 3 1 5 2 2
2 0 0 2 1 2 2 2 3 2
3 3 3 4 3 1 3 0 3 2
a Construct a frequency distribution.
b Construct a histogram.
c What percentage of couples have two children?
d What percentage of couples have at least two children?
6 Here are quintals of fertilizer distributed to 50 farmers.
20 24 22 19 20 10 18 24 10 15
21 20 20 19 20 10 14 22 10 18
18 15 14 18 20 15 14 22 14 20
15 14 15 20 21 10 20 20 15 24
10 10 15 22 14 21 20 14 15 10
a Construct a frequency distribution.
b Construct a histogram.
7 Suppose the following data represents the number of persons who took
counselling on HIV/AIDS on 40 consecutive days:
10 5 10 3 4 5 12 9 11 13
10 9 6 10 8 7 3 7 9 10
4 6 8 6 7 6 4 4 11 8
10 9 5 8 8 7 8 8 6 12
a Construct a frequency distribution table from the data.
b Construct a histogram.
c On what percent of days did more than 10 people take counselling?
261
Mathematics Grade 9
ACTIVITY 6.3
1 After completing a unit, a mathematics teacher gave a test
marked out of 10, and the scores of 22 students were as follows:
6, 5, 8, 10, 6, 7, 3, 9, 3, 2, 9, 6, 7, 2, 6, 5, 4, 8, 6, 4,8, 3
a Did the group do well in the test?
b Prepare a frequency distribution table from the given data.
c What is the average score of the group?
d How many students score above average?
e From the average obtained, can we say something about the performance of
the group?
f What relation can we see between the single value obtained in c and the
marks of the students? Can the single value summarize the data?
2 Record the height and age of each student in your class.
a What is the average height and age of the students?
b What is the middle value of height and age of the students?
c What value of height and age is most frequent (or has the highest
frequency)?
3 Suppose a student scored the following marks in five subjects:
85, 89, 78, 92, 91
a What is the average score of the student?
b What is the middle value of the score?
4 Considering the following data
20, 21, 21, 22, 23, 23, 25, 27, 27, 27, 29, 98, 98
a Find the mean, median and mode.
b Which measure of location does not give a clear indication of the centre of the
distribution?
5 Could you find the arithmetic mean of qualitative data? What about median and
mode?
262
Unit 6 Statistics and Probability
Definition 6.6
The arithmetic mean (or the mean) of a variable is the sum of all the data
values, divided by the total frequency (number of observations).
If x1, x2, x3,….., xn are the n observations of a variable, then the mean, x ,
is given by
x1 + x2 + x3 + ... + xn sum of values
Mean : x = =
n total number of values
Note: The mean of a population function can also be calculated from its frequency
distribution. So, if the values x1, x2, x3, …, xn occur f1, f2, f3, …, fn times,
respectively, then the mean ( x ) is given by
x1 f1 + x2 f 2 + ... + xn f n
Mean: x =
f1 + f 2 + .. + f n
Example 2 The following table shows the age of 14 students in a certain class:
Age in years (V) 12 13 16 18
Number of students (f) 3 4 2 5
Compute the mean age of the students.
12 × 3 + 13 × 4 + 16 × 2 + 18 × 5 36 + 52 + 32 + 90 210
Solution: x= = = = 15 years
3+ 4+ 2+5 14 14
Properties of the mean
ACTIVITY 6.4
There are five students in a group. Lemlem wants to know how much
money each student has and asked all the members of the group. She
found the following amounts:
Birr 6, Birr 9, Birr 8, Birr 4 and Birr 3.
a What is the mean of the amount of money within the group?
263
Mathematics Grade 9
b If Lemlem gives Birr 2 to each member of the group, what will be the new
mean?
c If the amount of money in the pocket of each member is multiplied by 3,
what will be the new mean?
d If you subtract the mean of the data obtained from each value, what will be
the sum of the differences obtained?
e Discuss, what you observed from your answers to a, b, c and d.
The above Activity should help you to observe different properties of the mean.
The difference between a single data value x and the mean is called the deviation from
the mean (or simply the deviation) and is given by ( x − x ) . A data point that is close to
the mean will have a small deviation, whereas data points far from the mean will have
large deviations as shown in the figure below.
x1 x x2
large small
deviation deviation
1 The sum of the deviations of individual observations from mean ( x ) is zero. That
is, let x1, x2, x3,…, xn be n observations with mean x . Then the sum of the
deviations of the observations from the mean is given by
( x1 − x ) + ( x2 − x ) + ( x3 − x ) + . . . + ( xn − x ) = 0
Proof:-
Since the mean of n observations x1, x2, x3, …, xn is given by x ,
x + x2 + x3 + ... + xn
x= 1 which shows x1 + x2 + x3 + ... + xn = nx
n
Now, ( x1 − x ) + ( x2 − x ) + ( x3 − x ) + ... + ( xn − x )
= ( x1 + x2 + x3 + ... + xn ) − ( x + x + x ... + x )
n times
= ( x1 + x2 + x3 + ... + xn ) − nx
= nx − nx = 0 as required.
Example 3 Let the ages of 5 children be 2, 3, 6, 9, 10. Then, the mean age
2 + 3 + 6 + 9 + 10 30
x= = =6
5 5
The sum of the deviations from the mean is:
(2 – 6) + (3 – 6) + (6 – 6) + (9 – 6) + (10 – 6) = –4 – 3 + 0 + 3 + 4 = 0
264
Unit 6 Statistics and Probability
2 If a constant k is added to (or subtracted from) each data value, then the new mean
is the sum (or the difference) of the old mean and the constant k.
Proof:- Let x be the mean of the data values x and k be the constant.
x1 + x2 + x3 + ... + xn
=x
n
Adding k to each data value, the new mean is then
( x1 +k ) + ( x2 + k ) + ( x3 + k ) + … + ( xn + k )
=
n
x1 + x2 + x3 +... + xn + k + k + k + ... + k
n
x + x2 + x3 + ... + xn nk
= 1 +
n n
x
= + k (the old mean plus k).
A similar proof can be done for the case when k is subtracted from each data value.
3 The mean of the sum or difference of two population functions (of equal numbers
of observations) is equal to the sum or difference of the means of the two
population functions.
Proof:-
x1 + x2 + x3 + ... + xn y + y2 + y3 + ... + yn
Let = x and 1 = y
n n
Then the mean of their sum,
( x1 + y1 ) + ( x2 + y2 ) + . . . + ( xn + yn )
Mean (x + y) =
n
( x1 + x2 + x3 + . . . +xn ) + ( y1 + y2 + y3 + . . . + yn )
=
n
( x + x + x + . . . + xn ) ( y + y 2 + y3 + . . . + y n )
= 1 2 3 + 1
n n
= x + y (the sum of the means)
265
Mathematics Grade 9
Proof:-
kx1 + kx2 + kx3 + . . . + kxn k ( x1 + x2 + x3 + . . . + xn )
= = kx
n n
Example 5 The mean of 8, 9, 6, 8, 4, is 7. If you multiply each of value by 5, you
will obtain 40, 45, 30, 40, 20. Then the new mean is 5 × 7 = 35
Note:
1 The mean is unique.
2 The mean is affected by extreme values.
2 The median
The following Activity will help you to revise what you learned in previous grades.
ACTIVITY 6.5
1 Find the median for each of the following sets of data.
a 5, 2, 9, 7, 3 b 12 , 8, 10, 14, 13, 9
2 What did you observe about the middle term when the number of observations is
odd or even?
A second measure of location of quantitative data is the median.
Definition 6.7
The median is the value that lies in the middle of the data when it is
arranged in ascending or descending order. So, half the data is below the
median and half the data is above the median.
f 2 3 2 4 2
Solution: There are 13 data values. So, the median is the 7th piece of data, which
is 3.
Note that the median of a set of data with values arranged in ascending or descending
order is:
i the middle value of the list if there is an odd number of values.
ii half of the sum of the two middle values if there is an even number of values.
3 The mode
The following activity should help you to recall what you have learnt about mode
previously.
ACTIVITY 6.6
1 Find the mode(s) of the following data
a 5, 7, 8, 7, 9, 11 b M, F, M, F, F
2 Can you find the mean and median for the above data?
3 Discuss your observation.
A third measure of location is the mode. The mode can be found for both quantitative
and qualitative data.
Definition 6.8
The value of the variable which occurs most frequently in a data set is
called the mode.
c 9, 8, 7, 10, 6, 8
267
Mathematics Grade 9
Solution:
a It has no mode because each value occurs only once.
b The values 10 and 12 both occur twice, while the others occur only once.
It has two modes and the data is a bimodal.
c 8 is the mode because it occurred twice (most frequently).
Example 9 Find the mean, median and mode of the following distribution of
temperatures in a certain town for one month.
Temperature in oC(V) 20 21 23 24 26 28
Number of days(f) 2 4 5 9 3 7
Exercise 6.3
1 a Find the mean, mode and median of the following data.
11, 9,14, 3, 11, 4, 10, 21, 8, 15, 350
b Which measure of location is preferable for this data?
268
Unit 6 Statistics and Probability
V –2 –3 0 1 2 3
f 3 2 3 6 5 1
3 Given the numbers 5, 6, 7, 10, 12, which number must be removed in order to
make the mean of the resulting values 7.5?
4 Given the numbers 10, 12, 9, 15, 8, what number could be included so that the
median is 11? (Explain)
5 Given 3, 4, x, 5, y,12. Find the values of x and y, if the mode of the data is 3 and
the mean is 6.
6 If the mean of a, b, c, d is k, then what is the mean of
a a + b, 2b , c + b, d + b? b ab, b2, cb, db?
7 Calculate the mean, median and mode of the following data;
Value 10 15 20 25 30 35 40
Frequency 15 10 50 4 10 8 3
1
a If the mean number of occupants is 2 , find x.
3
b If the mode is 2, find the largest possible value of x.
c If the median is 2, find the largest possible value of x.
9 A researcher tabulated the number of cases heard by 8 judges on a given day in a
court and found the following data:
Judges 1 2 3 4 5 6 7 8
Count of cases 6 3 1 2 0 5 5 4
a Find the mean, median and mode.
b The researcher reported that over half of the judges heard above “average”.
What does the researcher mean by the “average”?
269
Mathematics Grade 9
10 The following raw data represents the number of HIV/AIDS patients waiting for
counselling at 8:00 am on 40 consecutive Saturdays at a certain hospital.
11 6 5 8 11 6 3 7 4 6
5 4 13 14 9 11 13 8 10 9
10 9 6 5 10 7 8 7 8 3
8 7 8 9 6 10 11 8 8 4
a Draw a frequency distribution table.
b Calculate the mean, median and modal number of HIV/AIDS patients.
c Draw a histogram.
11 In a mathematics test the scores for boys were 6, 7, 8, 7, 5 and the scores for girls
were 6, 3, 9, 8, 2, 2, 5, 7,3
a Find the mean score for the boys.
b Find the mean score for the girls.
c Find the mean score for both the boys and girls.
d What do you conclude?
12 The mode of some data is 20. If each value in the data is increased by 2, what will
be the mode of the new data?
13 Find the mean, median and mode of the data represented by the histogram below.
3
Frequency
0
1 2 3 4 5 6 7
Value
Figure 6.3
14 An Agricultural Development Station sells seedlings of plant through the post. It
claims that the average height of the plants after one year’s growth will be 85 cm.
A sample of 24 of the plants were measured after one year with the following
results (in cm).
6 7 7 9 34 56 85 89 89 90 90 91
91 92 93 93 94 95 95 96 97 97 99 93
a Find the mean and the median height of the sample.
b Is the station’s claim about average height justified?
15 In order to receive a grade of A in her mathematics exam, Abeba needs a mean
score of 90 and above on 4 tests. So far Abeba had scored 80, 91 and 93 on 3 tests.
What is the lowest score that she must get in her last test in order to receive a
grade of A?
270
Unit 6 Statistics and Probability
1 Range
The simplest and the most crude measure of dispersion of quantitative data is the range.
Definition 6.9
The range R of a set of numerical data is the difference between the
highest and the lowest values. i.e.,
Range = Highest value – Lowest value
271
Mathematics Grade 9
Example 1 The ages of six students are 24, 20, 18, 13, 16, 15 years, respectively.
What is the range?
Solution: Range = highest value – lowest value = 24 – 13 = 11 years.
Example 2 Find the range of the distribution given in the table below.
V 2 8 9 13 15 18
f 3 4 2 1 5 4
2 Variance (σ2 )
Definition 6.10
Variance, denoted by (σ2), is defined as the mean of the squared deviations of
each value from the arithmetic mean.
ACTIVITY 6.7
Consider the following data set:
2, 3, 10, 6, 9
a Find the mean x .
b Find the deviation of each data value from the mean (x – x ).
c Square each of the deviations (x – x )2.
d Find the mean of these squared deviations and its principal square root.
The standard deviation is the most valuable and widely used measure of dispersion.
Definition 6.11
Standard deviation, denoted by σ, is defined as the positive square root of
the mean of the squared deviations of each value from the arithmetic
mean.
272
Unit 6 Statistics and Probability
x (x−x) (x−x)2.
3 −5 25
5 −3 9
8 0 0
11 3 9
13 5 25
Total 40 68
68
Variance (σ2) = = 13.6
5
V 2 3 5 6 8
f 3 4 4 5 4
273
Mathematics Grade 9
x f xf x−x ( x − x )2 f ( x − x )2
2 3 6 -3 9 27
3 4 12 -2 4 16
5 4 20 0 0 0
6 5 30 1 1 5
8 4 32 3 9 36
Total 20 100 0 84
84
Variance (σ 2 ) = = 4.2
20
Standard deviation (σ ) = 4.2 ≈ 2.05
Therefore, the population variance and standard deviation are 4.2 and 2.05 respectively.
274
Unit 6 Statistics and Probability
Proof:-
Consider x1, x2, . . ., xn whose mean is x and variance is σ2.
Multiplying each data value by c gives us a new mean of c x .
(cx1 − cx ) 2 + (cx2 − cx ) 2 + (cx3 − cx )2 + . . . + (cxn − cx ) 2
Then, new variance =
n
c [( x1 − x ) + ( x2 − x ) + ( x3 − x ) + . . . + ( xn − x ) 2 ]
2 2 2 2
=
n
= c × the old variance = c2σ2
2
Exercise 6.4
1 Find the range, variance and standard deviation of the following data.
4, 2, 3, 3, 2, 1, 4,3, 2, 6
2 Find the range, variance and standard deviation of the distribution in the table below.
V –1 –2 0 1 2
f 2 1 3 3 1
3 Find the range, variance and standard deviation from the histogram in the figure
below.
4
Frequency
0
1 2 3 4
Value
Figure 6.4
4 What is the value of y, if the standard deviation of 8, 8, 8, 8, y, 8 is 0?
5 If the variance of a, b, c, d is k, then what is
a the variance of a + c, b + c, 2c, d + c?
b the standard deviation of a + c, b + c, 2c, d + c?
c the variance of ac, bc, c2, dc?
d the standard deviation of ac, bc, c2, dc?
6 If a population function x has mean M(x) = 2 and M(x2) = 8, find its standard
deviation.
276
Unit 6 Statistics and Probability
6.2 PROBABILITY
"The true logic of this world is the calculus of probabilities". James Clerk Maxwell
H
HIISSTTO
ORRIIC
CAALL N
NO TEE::
OT
277
Mathematics Grade 9
Definition 6.12
An experiment is a trial by which an observation is obtained but whose
outcome cannot be predicted in advance.
Experimental probability
Probability determined using data collected from repeated experiments is called
experimental probability.
Example 1 The numbers 1 to 20 are each written on one of 20 identical cards. One
card is chosen at random.
a List the set of all possible outcomes.
b List the elements of the following events:
i The number is less than 5.
ii The number is greater than 15.
iii The number is greater than 21.
iv The number is divisible by 5.
v The number is a prime.
Solution:
a S = {1, 2, 3, …, 19, 20}
b i {1, 2, 3, 4}
ii {16, 17, 18, 19, 20}
iii { } or ∅, since no card has a number greater than 20.
iv {5, 10, 15, 20}
v {2, 3, 5, 7, 11, 13, 17, 19}
ACTIVITY 6.8
Arrange yourselves into groups of 5. Let each group perform the
following activities.
1 Take a coin, toss it 5 times, 10 times and 15 times, and record
your observations in the following table.
Total
Number of tosses
2 Throw a die 20 times. Record the observation in each trial and complete the
following table.
Number on the upper face of the die 1 2 3 4 5 6
Number of times it shows up
Definition 6.13
The possibility set (or sample space) for an experiment is the set of all
possible outcomes of the experiment.
Example 2
a Give the sample space for tossing a coin.
b What is the sample space for throwing a die?
Solution:
a When we toss a coin there are only two possible outcomes: Heads (H) or
Tails (T). So S = {H,T}.
b When we throw a die the score can be any of the six numbers 1, 2, 3, 4, 5,
6, so S = {1,2,3,4,5,6}.
279
Mathematics Grade 9
Definition 6.14
An event is a subset of the possibility set (sample space).
ACTIVITY 6.9
Suppose we toss a coin 1000 times and obtain 495 heads.
a How many times was the experiment performed?
b If our event is Heads, how many times does this event occur?
c What is the probability of Heads based on the result of this experiment?
Definition 6.15
If an experiment has n equally likely outcomes and if m of these
represent a particular event, then the probability of this event occurring
m
is .
n
280
Unit 6 Statistics and Probability
H HH
H
Start T HT
H TH
T
T TT
281
Mathematics Grade 9
Example 6 If we throw a die once, what is the probability that an even number will
show on the upper face of the die?
Solution:
S = {1, 2, 3,4, 5, 6}
E = {2, 4, 6}
n (E) 3 1
P(even) = = = .
n (S ) 6 2
ACTIVITY 6.10
We are going to investigate whether the theoretical probability of a
coin landing on Heads is backed up by experimental results.
a Toss a coin 10 times, 20 times, 30 times . . . .
b Keep a record of your results,
Number of throws Number of heads
10
20
30
c For each row in the table, what proportion of the number of throws landed as heads?
1
How do your answers compare with P (head) = ? (the theoretical probability)
2
282
Unit 6 Statistics and Probability
Definition 6.17
Let S be the possibility set of an experiment and each element of S be
equally likely to occur. Then the probability of the event E occurring,
denoted by P(E), is defined as:
number of elements in E n (E)
P(E) = =
number of elements in S n (S )
Example 7 A die is thrown once. What is the probability that the number appearing
will be
a 3? b a number less than 5?
Solution: There are six possible outcomes: {1, 2, 3, 4, 5, 6}. Hence n = 6.
a Only one of these outcomes is 3. Hence the probability that 3 will be on the
1
upper face of the die is .
6
b {1, 2, 3, 4} is the required set, which has four elements. Hence the
4 2
probability is = .
6 3
Example 8 A die and a coin are tossed together.
a Sketch a tree diagram showing the outcomes of this experiment.
b What is the probability of getting a head and an even number?
c What is the probability of getting a tail and an odd number? 1 2
3
Solution:
4
a The outcomes of this experiment are:
H 5
S = {H1, H2, H3, H4, H5, H6, T1, T2, T3, T4, T5,T6}.
Start 6
So, n (S) = 12
1
3 1 2
b E1 = {H2,H4,H6} . Hence P (E1) = = . T
12 4
3
3 1
c E2 = {T1, T3, T5}. Hence P (E2) = = . 4
12 4
6 5
Example 9 Use a tree diagram to list the sample space (possibility set) showing the
possible arrangement of boys and girls in a family with exactly three
children.
a What is the probability that all three children are boys?
b What is the probability that two children are boys and one is a girl?
283
Mathematics Grade 9
Exercise 6.5
1 Two dice are simultaneously thrown once. List the ways in which the following
events can occur.
a A= the same number is shown on each die.
b B = the sum of the numbers is 13.
c C = the product of the two numbers is 1.
d D = the quotient of the two numbers is 7.
284
Unit 6 Statistics and Probability
2 Three coins are tossed at the same time. Sketch a tree diagram for the outcomes of
this experiment. What is the possibility set?
3 A bag contains four red balls and three black balls. What is the possibility set for
colour, if 2 balls are chosen at random?
4 Toss a coin and keep a record of whether it lands on Heads or Tails. Do this at
least 20 times for each experiment and perform at least five experiments. Enter
your results in a table like the following.
Experiment Number of coin tosses Number of heads obtained
1
2
3
4
5
Total
a Do you feel that the two outcomes “head” and “tail” are equally likely?
b Do your experimental results support this feeling?
c What is the ratio of the number of heads to the number of tosses in each
experiment?
d What ratio do you have for the total number of heads to the total number of
tosses?
5 A fair die is rolled once. Calculate the probability of getting:
a an odd number b a score of 5
c a prime number d a score of 0
6 A number is selected at random from the set of whole numbers 1 to 20, both
inclusive. Find the probability that the number selected is:
a even c a multiple of 3 e the square of 2
b a multiple of 2 and 3 d even or odd f the square of 6
7 A bag contains five red balls, three black balls and four white balls. A ball is
drawn out of the bag at random. What is the probability that the ball drawn is:
a white? b red? c black?
8 A bag contains 100 identical cards on which the numbers 1 to 100 are marked. A
card is drawn randomly. Find the probability that the number on the card is:
a an even number b an odd number c a multiple of 7
d a multiple of 5 e a multiple of 3 f less than 76
g greater than 32 h a factor of 24
285
Mathematics Grade 9
Key Terms
analysis measure of central tendency range
arithmetic mean measure of dispersion raw data
average measure of location sample
classification median sample space
collection mode secondary data
descriptive statistics outcomes standard deviation
equally likely presentation statistical data
event population statistics
frequency population function tabulation
frequency distribution possibility set variable (or variate)
histogram primary data variance
interpretation probability
Summary
1 Statistics is the science of collecting, organizing, presenting, analysing and
interpreting data in order to draw conclusions.
2 A population is the complete collection of individuals, objects or measurements
that have a characteristic in common.
3 A small part (or a subset) of a population is called a sample.
4 If the categories of a classification are based on some attribute or characteristics
whose values are not numbers, then it is called qualitative classification.
5 If the characteristic of interest is numerical, then it is called quantitative
classification.
6 Descriptive statistics is a branch of statistics concerned with summarizing and
describing a large amount of data.
7 Data is said to be primary, if it is obtained first-hand for the particular purpose on
which one is currently working.
8 Data that has been previously collected for a similar or different purpose is called
secondary data.
286
Unit 6 Statistics and Probability
287
Mathematics Grade 9
25
20
Frequency
15
10
0
1 2 3 4 5 6 7
Value
Figure 6.5
a Prepare a frequency distribution table.
b What is the highest variable?
c What is the highest frequency?
d How many variates occur 5 times?
e Which variates have the minimum frequency?
4 Find the mean, median, mode, range, variance and standard deviation of the
population function whose distribution is given in the table below.
V 2 3 4 5 6
f 2 4 1 2 3
5 Find the mean, median, mode, range and variance from the histogram given
below.
288
Unit 6 Statistics and Probability
Frequency
4
0
1 2 3 4 5 6 7 8
Value
Figure 6.6
3 π
6 Why can the probability of an event not be 2 2 − π, ? or
2 2
7 An integer n, 1 ≤ n ≤ 144, is picked at random. What is the probability that n is
the square of an integer?
8 Given the following values of a population function:
5, 4, 7, 3, 6, 5, 3, 1, 5, 7, 5, 9.
Find the probability that a randomly chosen value from the data is
a a modal value;
b below the mean value;
c any of the numbers 1, 4, 6 or 9;
d an odd number greater than the mean value.
9 Two fair dice are rolled once. What is the probability that the difference of the
two numbers shown is 1?
10 Given below is the frequency distribution of a population function V.
V −10 −5 0 5 10
f 5 10 5 20 10
If an element is drawn randomly from the population find the probability that it is:
a non-negative; b non-zero;
c less than or equal to −5; d positive;
11 The median of x – 4, x, 2x and 2x + 12 is 9, where x is a positive integer. Find the
value of x.
12 The table below shows the number of students who scored marks 3, 4 or 5 in a
maths test.
Mark 3 4 5
Number of Students 3 x 4
13 In a class of boys and girls, the mean weight of 8 boys is 55 kg and the mean
weight of a group of girls is 48 kg. The mean weight of all the children is 50.8 kg.
How many girls are there?
14 There are 24 right-handed students in a class of 30. What is the probability that a
student chosen at random will be left-handed?
15 Suppose you write the days of the week on identical pieces of paper. You mix
them in a bowl and select one at a time. What is the probability that the day you
select will have the letter r in it?
16 A pair of dice are rolled. Find the probability that the sum of the numbers on the
upper faces is:
a 9; b greater than 9; c even; d not greater than 9 ;
e greater than 9 and even; f greater than 9 or even.
17 From the members of a farmers’ association 50 farmers cultivated wheat. An
agricultural expert wants to study the farmers’ yield in terms of quintals they
harvested per hectare and found the following
50 45 45 50 46 48 55 48 52 54
51 52 45 55 46 50 55 54 49 51
48 46 51 52 47 45 49 54 46 48
53 52 48 46 55 47 51 47 50 53
47 53 48 45 54 48 50 46 52 54
290