Statistics
Statistics
Statistics
Statistics
Collection of methods for planning experiments, obtaining data, and then
organizing, summarizing, presenting, analyzing, interpreting, and drawing
conclusions.
2 types of statistics: DESCRIPTIVE STATISTICS from INFERENTIAL STATISTICS.
Inferential Statistics
Inferential statistics are produced through complex
mathematical calculations that allow scientists to infer
trends about a larger population based on a study of a
sample taken from it. Scientists use inferential statistics to
examine the relationships between variables within a
sample and then make generalizations or predictions about
how those variables will relate to a larger population.
DESCRIPTIVE STATISTICS
In this branch of statistics, the goal is to describe.
Numerical measures are used to talk about features of a set of
data. There are number of items that belong in this portion of
statistics, such as:
The average, or measure of the center of a data set,
consisting of the mean, median and mode
The spread of a data set, which can be measured with
the range or standard deviation
Two Types of Data.
What is Ungrouped Data ?
When the data has not been placed in any categories and no
aggregation/summarization has taken placed on the data then it is known as
ungrouped data. Ungrouped data is also known as raw data.
What is Grouped Data?
When raw data have been grouped in different classes then it is said to be grouped
data.
Height Frequency (f)
Height of students:
131 – 140 1
(171,161,155,155,183,191,185,170,172, 141 – 150 7
177,183,190,139,149,150,150,152,158,1 151 – 160 5
59,174,178,179,190,170,143,165,167,18 161 – 170 9
7,169,182,163,149,174,174,177,181,170 171 – 180 9
,182,170,145,143): 181 – 190 10
total 41
This is the raw data/ ungrouped data
Mean = = 2.4
this means that the five households have an average of 2.4 children.
Median of Ungrouped Data
The median is the value at the middle of a distribution of data when those data are
organized from the lowest to the highest value.
Let’s suppose we have the following list of numbers: 5, 7, 10, 43, 2, 69, 31, 6, 22.
Find the median.
5, 6, 5, 7, 5, 10, 8
Mode = 5
5, 6, 5, 7, 5, 10, 8, 10, 10
Mode = 5 and10
Let’s suppose we have the following scores of students taking 10-item quiz: 5, 7,
10, 10, 5, 7 , 9, 4, 6, 10, 9, 7, 8, 9.
Find the mean, median and the mode.
Mean = = 7.57
Median = 4, 5, 5, 6, 7, 7, 7, 8, 9, 9, 9, 10, 10, 10 =
Mode = 7, 9, 10
How to Make
a Frequency
Distribution?
Alex measured the lengths of leaves on the oak tree
(to the nearest cm):
9,16,13,7,8,4,18,10,17,18,9,12,5,9,9,16,1,8,17,1,10,5,9
,11,15,6,14,9,1,12,5,16,4,16,8,15,14,17.
To get started, put the numbers in order, then find the
smallest and largest values in your data, and calculate the
range (range = largest - smallest)
In order the lengths are:
1, 1, 1, 4, 4, 5 , 5, 5, 6, 7, 8, 8, 8, 9, 9, 9, 9, 9, 9, 10, 10, 11, 12, 12, 13,
14, 14, 15, 15, 16, 16, 16, 16, 17, 17, 17, 18, 18
Now calculate an approximate group size, by dividing the range by how many
groups you would like. ( commonly it is ranging from 5 - 20 ) Always round up the
answer.
= 3.4
Then round that up it becomes 4
1, 1, 1, 4, 4, 5 , 5, 5, 6, 7, 8, 8, 8, 9, 9, 9, 9, 9, 9, 10, 10, 11, 12, 12, 13,
14, 14, 15, 15, 16, 16, 16, 16, 17, 17, 17, 18, 18
Group size = 4
Pick a starting value that is less than or equal to the smallest value.
Starting at 0 and with a group size of 4 we get: 0, 4, 8, 12, 16. Write
down the groups.
Include the end value of each group that must be less than the next
group
Length in cm The last group goes to 19 which is
0-3 greater than the largest value. That is
OK: the main thing is that it must
4-7 include the largest value which is 18.
8 – 11
12 – 15
16 – 19
Now tally the results to find the frequencies. And do a total
1, 1, 1, 4, 4, 5 , 5, 5, 6, 7, 8, 8, 8, 9, 9, 9, 9, 9, 9, 10, 10, 11, 12, 12, 13, 14, 14, 15, 15,
16, 16, 16, 16, 17, 17, 17, 18, 18
Length in cm Frequency
0-3 3
4-7 7
8 – 11 12
12 – 15 7
16 – 19 9
i=4 38
Mean of Grouped Data
;
f = frequency
x = the mid value/ class mark
n = number of entry/ data
𝑚𝑒𝑎𝑛 =
∑ 𝑓𝑥
𝑛
Length in Frequency x fx
cm
0-3 3 1.5 4.5
4-7 7 5.5 38.5
8 – 11 12 9.5 114
12 – 15 7 13.5 91
16 – 19 9 17.5 157.5
i=4 n =38 405.5
Median of Grouped Data
;
Median class =
= lower boundary of median class
cf< = less than cumulative frequency
frequency of the median class
i = interval / group size
Median class =
Median class =
= = 19
Length f x fx LB cf
in cm
0-3 3 1.5 4.5 0 3 =