CHAPITRE 02 Statistical Series With One Variable

Download as pdf or txt
Download as pdf or txt
You are on page 1of 19

Chapter 2: Statistical Series with One Variable

Introduction:

A statistical series with one variable is a list of numerical data all related to
a single characteristic or variable.

Throughout the rest of the chapter, we consider the following situation

𝑿𝑿 ∶ → 𝑥𝑥1 , 𝑥𝑥2 , … , 𝑥𝑥𝑛𝑛


We denote X as: "Ages of students in the 5th grade class

For example, if you collect the grades of a group of students in a course,


you have a statistical series with one variable (the variable being "grades")

X 𝒙𝒙𝟏𝟏 𝒙𝒙𝟐𝟐 𝒙𝒙𝟑𝟑 𝒙𝒙𝟒𝟒 𝒙𝒙𝟓𝟓 𝒙𝒙𝟔𝟔 𝒙𝒙𝟕𝟕 𝒙𝒙𝟖𝟖 𝒙𝒙𝟗𝟗


Ages of students in the
10 11 12 11 10 11 13 10 12
5th grade
The grades of a group of
12 14 13 15 17 10 08 11 11
students
Study of a Discrete Statistical Variable

A statistical characteristic becomes discrete when its possible values are


limited to a finite and reasonable number, such as the number of
children or the number of pieces, etc.

 Effective 𝒏𝒏𝒊𝒊
 Frequency 𝒇𝒇𝒊𝒊
 Percentage %
Definition of effective 𝒏𝒏𝒊𝒊

The effective represents the number of observations (individuals) or


elements in a dataset It is often denoted as 𝒏𝒏𝒊𝒊

Example of Application 01:

Data for student ages: 18, 20, 22, 19, 20, 21, 18, 19, 20, 23, 22, 21, 20, 19,
20, 22, 21, 24, 20, 19
𝑥𝑥𝑖𝑖 18 19 20 21 22 23 24
𝑛𝑛𝑖𝑖 (𝐸𝐸𝐸𝐸𝐸𝐸𝐸𝐸𝐸𝐸𝐸𝐸𝐸𝐸ve) 2 4 6 3 3 1 1

ni : The number of individuals who have the same 𝑥𝑥𝑖𝑖 , this is called the
partial effective of 𝑥𝑥𝑖𝑖 .
Example of Application 01:

Data for car colors : Black, Blue, Black, Green, Black, White, Blue,
Yellow, Black, White, Green, Yellow, Black, Blue

𝑥𝑥𝑖𝑖 Black Blue Green White Yellow

𝑛𝑛𝑖𝑖 5 3 2 2 2

Question: The effective of red-colored cars? Answer : It is 0.


Definition of the total effective

The total effective (N), in statistics, refers to the total number of


observations, individuals, or elements included in a dataset or a statistical
series.

To calculate the total effective (N), you need to add the total number of
observations or individuals in your dataset, regardless of the criteria or
characteristic you are considering.
Example of Application : Car colors

The total number of cars 𝑁𝑁 = 14

Example of Application :
Suppose you have a dataset representing the number of people in
different families. Here are the data.

family A B C D E
people 2 4 2 5 3

𝑁𝑁 = 16
Definition of frequency

In many statistical situations, datasets are typically large, that is to say,


they contain a lot of observations or individuals, and it is interesting to
calculate measures that summarize the series.
𝑛𝑛𝑖𝑖
For each value 𝑥𝑥𝑖𝑖 , we define it as follows : 𝑓𝑓𝑖𝑖 =
𝑁𝑁

𝒇𝒇𝒊𝒊 is called the partial frequency of 𝒙𝒙𝒊𝒊 . The frequency of a value is the ratio of the frequency of that value to the total
frequency.
Example of Application :
In our previous example (ages of students), we have 𝑁𝑁 = 20 :

𝑥𝑥𝑖𝑖 18 19 20 21 22 23 24
𝑛𝑛𝑖𝑖 (𝑬𝑬𝑬𝑬𝑬𝑬𝑬𝑬𝑬𝑬𝑬𝑬𝑬𝑬𝐯𝐯𝐯𝐯) 2 4 6 3 3 1 1

𝑓𝑓𝑖𝑖 Frequency 2� = 0,1 0,2 0,3 0,15 0,15 0,05 0,05


20
Percentage
Percentages make it easier to compare different categories, groups, or subsets
within the same population or study.

We can replace 𝑓𝑓𝑖𝑖 with 𝑓𝑓𝑖𝑖 × 100, which then represents a percentage.

𝑓𝑓𝑖𝑖 (%) 𝟏𝟏𝟏𝟏 20 30 15 15 5 5

𝑛𝑛𝑖𝑖 6
𝐹𝐹𝑟𝑟𝑟𝑟𝑟𝑟𝑟𝑟𝑟𝑟𝑟𝑟𝑟𝑟𝑦𝑦 % = × 100 ; 𝑓𝑓 % = × 100 = 30 %
𝑁𝑁 20
Example of Application :
You can compare the percentage of people from different age groups who
voted in an election.

Age group Total Number of (Participation rate)


(years) number participants Percentage (%)
300
18-30 500 300 𝑋𝑋 100 = 60 %
500
31-50 700 500 71,43 %
Plus de 50 800 600 75 %
Cumulative effective, Cumulative frequency

Cumulative effective
For each value 𝑥𝑥𝑖𝑖 , we define it as follows :
𝑁𝑁𝑖𝑖 = 𝑛𝑛1 + 𝑛𝑛2 + ⋯ + 𝑛𝑛𝑖𝑖
Cumulative frequency
For each value 𝑥𝑥𝑖𝑖 , we define it as follows :

𝐹𝐹𝑖𝑖 = 𝑓𝑓1 + 𝑓𝑓2 + ⋯ + 𝑓𝑓𝑖𝑖


Example of Application :
A survey conducted in a village focuses on the number of dependent children
per family. We denote X as the number of children per family, and the results
are given in this table:
𝑵𝑵𝒊𝒊 (cumulative
𝒙𝒙𝒊𝒊 𝒏𝒏𝒊𝒊 (Effective) 𝒇𝒇𝒊𝒊 (Frequency) 𝒇𝒇𝒊𝒊 (%) 𝑭𝑭𝒊𝒊 (Cumulative)
Effective)
18� 0,09 𝑋𝑋 100 = 9 0,09 9%
0 18 18 200 = 0,09

1 32 50 0,16 16 0,25 25 %

2 66 116 0,33 33 0,58 58 %

3 41 157 0,205 20,5 0,785 78,5 %

4 32 189 0,16 16 0,945 94,5 %

5 09 198 0,045 4,5 0,99 99 %

6 02 200 0,01 1 1 100 %


Graphical Representation of Statistical Series

We have learned that tables are an important tool for organizing and
presenting data from a statistical population. In the following paragraph,
we will see how to transform these tables into graphs, which will help us
visually summarize the data.
Distribution of a qualitative nature
Here are some types of charts for representing qualitative variables
Bar chart
𝑥𝑥𝑖𝑖 Coffee Tea hot chocolate Juice Water

𝑛𝑛𝑖𝑖 30 20 15 30 50
60
50
50

40
People

30 30
30
20
20 15

10

0
Café Thé Chocolat chaud Jus Eau

Beverage preferences
Pie chart or sector chart
Let's go back to the example of car colors

𝑥𝑥𝑖𝑖 Black Blue Green White Yellow

𝑛𝑛𝑖𝑖 5 3 2 2 2

2 The degree of a sector is determined using the rule of


three as follows: 𝑁𝑁 → 360°
5 𝑛𝑛𝑖𝑖 → 𝑑𝑑𝑖𝑖 (𝐷𝐷𝐷𝐷𝐷𝐷𝐷𝐷𝐷𝐷𝐷𝐷 𝑜𝑜𝑜𝑜 𝑡𝑡𝑡𝑡𝑡 𝑠𝑠𝑠𝑠𝑠𝑠𝑠𝑠𝑠𝑠𝑠𝑠𝑠𝑠 )
2

𝒏𝒏𝒊𝒊 𝑿𝑿 𝟑𝟑𝟑𝟑𝟑𝟑
𝒅𝒅𝒊𝒊 =
𝑵𝑵
2

3
Noire Bleu Vert Blanc Jaune
Distribution of a discrete quantitative nature
When examining a discrete quantitative variable, you have two types of
charts to represent it: the bar chart and the cumulative chart (see below).
sticks Chart
Let's go back to the example of children per family:
70 66
60

50
41
Effective

40
32 32
30

18
20
9
10
2
0
0 1 2 3 4 5 6

Numbre of children per family

You might also like