Module 1&2 - Statistics Q1W1

Download as doc, pdf, or txt
Download as doc, pdf, or txt
You are on page 1of 27

Module 1

Statistics

What this module is about

This module deals with the definition of statistics and terms used in the
study of statistics. It will also discuss the importance of the study of statistics,
summation rules, collection of statistical data, sampling techniques, and the
levels of measurement. As you go over the discussion and exercises, you will
appreciate more the importance of statistics in daily life. Enjoy learning this
module and go over the discussion and examples if you have not yet mastered a
concept.

What you are expected to learn

This module is designed for you to:

1. define statistics
2. give the importance and application of the study of statistics
3. Pose problems that can be solved by Statistics.
4. Differentiate a population from a sample
5. explain the different sampling techniques
6. Distinguished quantitative data from qualitative data and discrete variables
from continuous variable.
7. identify the levels of measurement
8. identify the methods in collecting the data.

How much do you know

1. It is the branch of mathematics concerned with the


techniques by which data are collected, organized, analyzed, and interpreted.
a. Information Technology b. Statistics
c. Trigonometry d. Geometry
2. This demands a higher order or critical judgment and
mathematical methods.
a. Descriptive c. Qualitative
b. Inferential d. Quantitative
3. These are units of information, often numeric,
recorded and used for the purpose of analysis.
a. data c. discrete
b. continuous d. survey
4. It is composed of numbers representing counts and
measurements.
a. Qualitative data c. continuous
b. Quantitative data d. discrete
5. It is a level of measurement where in data are arranged in some particular
order.
a. Nominal c. ordinal
b. Interval d. ratio

2
What you will do

Lesson 1
WHAT IS STATISTICS?

Statistics is a branch of Mathematics that deals with collecting, organizing,


presenting, analyzing, and interpreting data to come up with useful and
meaningful information.

I. TWO DIVISIONS OF STATISTICS


A. DESCRIPTIVE STATISTICS
It is concerned with summarizing and describing important features
of numerical data without attempting to infer. This method can either be
graphical or computational
B. INFERENTIAL STATISTICS
This demands a higher order or critical judgment and mathematical
methods. It aims to give information about a large group of data without
dealing with each and every element of these groups.

II. APPLICATION AND IMPORTANCE


Statistics is being used in most areas of our daily living. Like enrolment
yearly, weather forecast, effectiveness of drugs and medicines, business profit,
number of births, quality check of products, predictions on presidential and
senatorial elections, and trends in technology.

A. Uses of statistics

 It aids in decision making.


 It summarizes data for public use.
 It can give a precise description of data.
 It can predict the behavior of an individual.
 It can be used to test hypothesis.

B. It is essential tool in:

Education Psychology Heredity


Government Census Public opinion polling
Medicine Sociology Business and economics
III. Misused of Statistics
Example:

 I trust him 200% (It should be 100 %.)

 Crazy sale! Up to 120% (It should be 100 %.)

 A survey was conducted to know whether the company must allow cell
phones at work. They collected data by asking the respondents to send
their votes via text message. (The process of collecting information
does not cover all possible respondents. Some of the respondents don’t
have cell phones.)

Try this out!

Directions: Determine whether the following is a statistical question or not. Write


S if the question is a statistical question and N if it is non-statistical question.
____1. Do all the COVID patients in the clinic need to be vaccinated at once
for them to get totally well?
____2. How many pieces of vitamin C capsules could be taken by a COVID
patient in a day?
____3. . Who has a bigger house the Governor or the Mayor?
____4. What is the average grade in Mathematics Grade 7 students last
school year?
____5. What is the highest paying job in the Philippines today?

Lesson 2
Sampling Techniques

POPULATION - It is the complete set of individuals, objects, places, events and


reactions having some characteristics in common. In statistical investigation, it is defined
by naming its unique properties. It consists of numerical values associated with objects
or individuals.
SAMPLE – is a specific group that you will gather data from or a sub-collection of
elemets drawn from a population. It has the size lesser than the size of population.
CENSUS- is a collection of data in a population.

POPULATION SAMPLE
Alphabets Vowels
All countries in the world Countries in Asia
Students in Domingo Yu Chu NHS Students from Grade 9 of Domingo Yu Chu NHS
One cavan of rice One kilo of rice
Number of people in the Philippines Number of people in MIMAROPA
The method of drawing samples from a population is of very important. There are
several ways of doing this.
I. SAMPLING
1. Estimate the number of items in the population.
2. Assess resources such as time and money factors which are available to
pursue the research.
3. Determine the sample size needed in the study using the Slovin’s formula:
n= N
1 + Ne²
Where: n = sample size
N = population
e= margin of error
4. Pick the sample by using the appropriate sampling technique.
5. Prepare the questions to be asked in the interview or in the questionnaire.
A. SAMPLING TECHNIQUE
1. Probability Sampling – All members of the population have equal chances of
being chosen as part of the sample.
a. Random Sampling – Members of your sample are selected through
lottery.
b. Systematic Sampling – Members of your population are written in a list
systematically with corresponding numbers.
- the process of selecting the sample when units
are obtained by drawing every nth element of the
population.
nth = total number of elements in the population
n = desired sample size
nth = N
n

c. Stratified Sampling – Members of your population are grouped. You can


choose equal number of respondents in each group, or in proportion to the
number of elements in each group.

d. Cluster Sampling – Members of your population are grouped. Selections


of all respondents are in groups. You can choose all the respondents in
your selected groups.
2. Non-Probability Sampling- All members of the population do not have equal
chances of being chosen as part of the sample.

a. Convenience Sampling- Samples are selected because of their


immediate availability.
b. Purposive Sampling – Samples are determined by the researcher base
on the purpose of the study.
c. Quota Sampling – Samples are selected to achieve the needed number
of participants in the study.
d. Snowball Sampling – Samples are selected based on the
recommendation of other members in the sample.

Try this out

I. Direction: Identify which item in each column is the population and the
sample. Write P if the item is a population and S if the item is a sample.

1. ____ MIMAROPA P ____ Mindoro S


2. ____ Tablespoon of sugar S ____ Jar of sugar P
3. ____ STEM Students S ____ Academic track students P
4. ____ Juice in a pitcher P ____ juice in a glass S
5. ____ All manufactured cellular phones P ____ model units of cellular phone
S

II. Direction: Identify the probability sampling used in each item.


_________1. The online reseller writes all her loyal customers in a sheet of paper and
draws 20 customers. RANDOM SAMPLING
________ 2. The coordinator selects 3 students in each grade level. STRATIFIED

III. Identify the non- probability sampling used in each item.


________ 3. Posting an online survey and accepting only 300 responses. QUOTA
SAMPLING
________ 4. Asking 100 customers who are leaving the mall. CONVENIENCE
SAMPLING
________ 5. Accepting blood donations from persons with AB- blood type and asking
them if they can also refer friends whom they know with the same blood
type.SNOWBALL SAMPLING

Lesson 3
STATISTICAL DATA COLLECTION
DATA – are units of information usually numerical that are collected through
observation.
I. CLASSIFICATION OF DATA
A. Qualitative Data
 Categories that show classifications or attributes
 It is distinguished by some nonnumeric characteristics.
Examples of Qualitative Data
Gender, Marital Status, Grade Level, Senior High Track/Strand
B. Quantitative Data
 Numbers or values that represent counts or measures.
Examples of Qualitative Data
Weight, number of siblings, hours spent in studying, general average of
students
a. Types of Quantitative Data
1. CONTINUOUS VARIABLES-these are variables that can take the form
of decimals.
Examples:
The temperature of a cup of coffee served at a restaurant.
The average amount spent on electricity each July by a randomly
selected household in a certain state.
2. DISCRETE VARIABLES-these are variables that cannot take the form
of decimals.
Examples:
The number of boys in a randomly selected three-child family.
The number of vehicles owned by a randomly selected household
II. LEVELS OF MEASUREMENT
A. Nominal- In this level of measurement, we can use words, letters, and
alpha-numeric symbols.
Examples of Nominal Data
 Blood type, Zip code, Gender, Political Party
B. Ordinal – In this level of measurement, data are arranged in some
particular order.
Examples of Ordinal Data
 Socio-economic status, Level of education, Level of income, customer’s
level of satisfaction
C. Interval – It is like an ordinal level but with meaningful amount of
differences between data can be determined. Zero does not represent an
absence of something in an interval scale.
Examples of Ordinal Data
 Temperature
 IQ Test
D. Ratio – It is exactly the same as interval except that the zero on the scale
means does not exist.
Examples of Ratio Data
 Weight, area, speed and velocity
III. METHODS OF COLLECTING DATA
Data are units of information, often numeric, recorded and used for the
purpose of analysis. These data can be gathered using the tool called statistical
instruments in the following method:
A. Survey – is a process of gathering, sampling, analyzing and interpreting
data from a group of people.
These are Example of survey method:
a. Interview Method – is done when a person asks information from
another person.
- A direct method of gathering data because the
data came directly from the source.
b. Questionnaire method – is done using printed questions regarding a
certain matter.
- An indirect method of gathering data
because this makes use of written questions to be answered by the
respondent.
B. Observation – In observation, the person who gathers data is called
investigator while the person/object being observed is called the subject.
- Make use of the different human senses in gathering
information.
C. Experimentation – This is used by physicists and behavioural scientists in
collecting data.
- An experiment is a data collection method where you
as a researcher change some variables and observe their effect on other
variables.
- The variables that you manipulate are referred to
as independent while the variables that change as a result of manipulation
are dependent variables.
D. Registration- This refers to continuous, permanent, compulsory recording of
the occurrence of vital events together with certain identifying or descriptive
characteristics concerning them.
-It is a method of utilizing the existing data or fact or
information, which is kept systematized by the office concerned such as
registration of births, death, motor vehicles, and marriages and licenses
because these are being enforced by certain laws.
Try this out!
I. Direction: Classify each random variable as either discrete or continuous
1. The number of patrons arriving at a restaurant between 5:00p.m5:00p.m.
and 6:00pm.
2. The number of new cases of influenza in a particular county in a coming
month.
3. The air pressure of a tire on an automobile.
4. The amount of rain recorded at an airport one day.
5. The number of students who actually register for classes at a university next
semester.
II. Direction: Determine what type of data are the following; cluster them by
writing each one of them inside the circles below:

100 lbs. Male Female Civil 1st year 6 inches


100 degree Celsius 5 years old 45 Php 100

NOMINAL ORDINAL INTERVAL RATIO


III. Direction: Determine the specific data collection method fit for what is/are
being required by the following statements below:
_________1. How many parents or guardians of |grade 7 learners of this school year
2020-2021 are working in public or private agencies?
_________2. How many from the ushers and usherettes in the party are very welcoming
and smiling while the guests are approaching the reception hall?
_________3. How fast does the counter crew serve the ordered meal of the tree
consecutive costumers?
_________4. Is the stoplight in the city helpful to all concerned?
_________5. What is the average grade of the Grade 7 learners in Mathematics school
year 2022-2023?
Let’s Summarize

Statistics is a branch of Mathematics that deals with collecting, organizing,


presenting, analyzing, and interpreting data to come up with useful and meaningful
information.

Population is a defined group or aggregates of objects, animals, materials,


measurements, “things”, “events” or “happenings” of any kind.

Sample is defined as any subgroup of the population drawn by some appropriate


method from the population.

Sampling is the process of selecting the elements of a sample from the


population being studied. The methods of sampling include random sampling,
systematic sampling, stratified sampling, and, cluster sampling.

CLASSIFICATION OF DATA
Qualitative Data - Categories that show classifications or attributes
Quantitative Data - Numbers or values that represent counts or measures.
Types of Quantitative Data
CONTINUOUS VARIABLES-these are variables that can take the
form of decimals.
3. DISCRETE VARIABLES-these are variables that cannot take the form
of decimals.

LEVELS OF MEASUREMENT
a. Nominal b. Ordinal c. Interval d. Ratio

METHODS OF COLLECTING DATA


a. Survey b. Observation c. Experimentation d. Registration

Name: __________________________________ Date: _____________________


Grade and Section: _____________________ Score: ____________________

STATISTICS
Quiz 1.1

11
I. Direction: Identify what is being asked in each item. Choose the correct
answer from the words found inside the box.

Qualitative data Census Sample

Quantitative data Statistics Population

2. It deals with methods in the collection, gathering, presentation, analysis, and


interpretation of data.
3. It refers to the numerical information gathered about the samples.
4. A subset of a population or a collection of some elements in a population.

II. Determine whether the numbers obtained in the following variables are
discrete or continuous.
5. Books in a library
6. Weights of students
6. The number of staple wire in a stapler.

III. Determine what level of measurement are the following;


7. Zip Code
8. Speed and velocity
9. IQ Test

IV. Identify the probability sampling used in each item.


10. The teacher picks students in a stack of index cards on which their names are
written.
11. The Local Government Unit chooses respondents only from barangays that
are placed under hard lockdown.
12. A radio program staff member answers every 50th caller.

V. Classify what best method in collecting data is suitable for the following:
13. Enrollment of Grade 7 students
14. Student’s responses/feedback in a school program.
15. PHIVOLCS collects data from Taal Volcano.

Module 2
Statistics

12
13
Lesson 1
Collecting and Organizing Data in a Table

The study of statistics begins with the collection of data or measurements.


Data collected should be organized systematically for easier and faster
interpretation. They maybe presented in any of the following forms:
The textual form can be used if the data to be presented if few.
The tabular and graphical forms are used when more detailed information
about the data is to be presented.
A table is used when you want to present a data in a systematic and
organized manner so that reading and interpretation will be simpler and easier.
When a table is used, you must remember the following:
1. The title of the table.
2. Indicate the date of the survey.
3. Arrange the data systematically in columns. The columns must be properly
labeled.
4. Identify the source of the data.

Example 1:
Mahusay National High School
Enrolment, SY 2005-2006
Year Level Male Female
First 216 267
Second 197 216
Third 187 227
Fourth 176 215
Total 776 925

You will observe that the table above shows clearly the enrolment data in
Mahusay National High School for the school year 2005-2006.

Another type of tabular presentation is the frequency table also known as a


frequency distribution. It is an arrangement of the data that shows the frequency of
occurrence of different values of the variables.

A frequency table is constructed by listing the measurements from highest


to lowest, then making tally marks to record how often each number occurs. After
tallying, count the marks and record them in the proper column.

Example 2: The scores of 45 students on a 20-point Science quiz are as follows:

14
17 20 15 18 19 16 11 10 15 16
12 12 13 14 11 10 14 13 12 11
13 15 14 10 15 16 17 17 18 20
20 18 19 19 18 17 16 15 12 12
13 14 15 19 20

Prepare a frequency table for the set of data.

Solution: To prepare a frequency table for the given set of scores, the scores are
listed from highest to lowest, tally marks are made and counted. The counted tally
marks will then be recorded under the column frequency. Notice that every 5 th tally
crosses the first four tallies. This is done to make counting of marks easier
especially if the number of cases is rather big.

Frequenc
Score Tallies y
20  //// 4
19  //// 4
18  //// 4
17  //// 4
16  //// 4
15  //// / 6
14  //// 4
13  //// 4
12  //// 5
11  /// 3
10  /// 3
Total 45

Try this out

1. The school budget for Maintenance and Other Operating Expenses of a


certain school for Calendar Year 2004 is given below.

15
Amount (in
Expense Item Pesos)
Power 600 000
Water 95 000
Communication 60 000
Supplies 1 600 000
Repair 920 000
Others 100 000

a) How much is the total budget of the school for CY 2004?

b) Which expense item received the biggest allocation? What percent of the total
budget was allocated for it?

c) Which expense item received the least allocation? What percent of the
total budget was allocated for it?

1. a. Php 3, 375, 000


b. supplies – 47.4%
c. communication – 1.78%

2. The following shows the scores of 15 students in mathematics for the second
grading period. Prepare a frequency table given the data below.

87 90 89 92 94
88 90 91 88 87
90 94 92 91 90

2. Score Frequenc
y
94 2
92 2
91 2
90 4
89 1
88 2
87 2
Total 15

Lesson 2

Frequency Distribution Tables

16
If the number of measures in consideration is rather big, the presentation of
data is further simplified by grouping the measures into class intervals called a
frequency distribution.

A frequency distribution is a distribution of the total number of measures or


frequencies over arbitrarily defined categories or classes. The number of measures
falling under a class is called class frequency.

Example 1.

The frequency distribution below shows the scores obtained by 300 students
in an English test of 50 items.

Number of
Score Students
45-49 15
40-44 32
35-39 42
30-34 108
25-29 67
20-24 21
15-19 10
10-14 5
Total 300

In the example above, the symbol 45-49 and the other symbols which follow
up to 10-14 are called class intervals. The end numbers are called class limits. For
instance in the class interval 45-49, 45 is called the lower limit while 49 is called the
upper limit.

Each class interval has also a lower boundary and a higher boundary. For
the class interval 45-49, the lower boundary is 44.5 while the higher boundary is
49.5. Hence, for the class interval 45-49, 44.5 – 49.5 are called the class
boundaries.

The size of the class interval, also called class size is the difference between
the upper boundary and the lower boundary. Hence, the class size in the given
example is 5

A class interval has also a midpoint or a class mark. It is obtained by taking


half the sum of the lower and upper class limit. For instance, the midpoint of the
class interval 45-49 is or 47.

STEPS IN CONSTRUCTING FREQUENCY DISTRIBUTION

17
STEP 1. Determine the range.

STEP 2 : Determine the desired number of classes. The ideal number is


somewhere between 5 and 15.

STEP 3. Determine the approximate size or class width of class interval.

STEP 4. Construct a frequency table by making class intervals starting with


the lowest value in the lower limit of the first class interval, and then add
the computed class size to obtain the lower limit of the next class interval.

STEP 5. Write the obtained frequency from each class interval by counting
the tallied form.

STEP 6. Determine the class mark of each class interval

STEP 7. Determine the class boundaries or class limits by subtracting 0.5


from every lower limit and adding 0.5 from every upper limit.

Example 2:

STEP 1. Determine the range.

range = highest score- lowest score


= 90-51
= 39

STEP 2 : Determine the desired number of classes. The ideal number is


somewhere between 5 and 15.

18
Number of Classes = 2k > n

Where k = unknown number;


n= total number of correspondent.

Desired class interval interval


2k > n

2k > n 21> 50 2 > 50


2k > n 22> 50 4 > 50
2k > n 23> 50 8 > 50
C.I. = 8 2k > n 24> 50 16 > 50
2k > n 25> 50 32 > 50
2k > n 26> 50 64 > 50
2k > n 27> 50 128 > 50

STEP 3. Determine the approximate size or class width of class interval.

i = range / class interval


Class size = range/class interval
= 39 / 8
= 4.875
class size or i =5

STEP 4. Construct a frequency table by making class intervals starting with


the lowest value in the lower limit of the first class interval, and then add
the computed class size to obtain the lower limit of the next class interval.

Class Interval
86-90
81-85
76-80
71-75
66-70
61-65
56-60
51-55

STEP 5. Write the obtained frequency from each class interval by counting
the tallied form.

19
Class Interval f
86-90  //// ///
81-85  ////
76-80 //// //
71-75 //// ////
66-70 //// ////
61-65 ////
56-60 ///
51-55 ////

STEP 6. Determine the class mark of each class interval

x = (lower limit + upper limit)/2

STEP 7. Determine the class boundaries or class limits by subtracting 0.5


from every lower limit and adding 0.5 from every upper limit.

Try this out

20
1. A sample of fifty shoppers at a newly opened convenience store has
been randomly selected. The following data show the shoppers’ ages.
Determine the appropriate class interval to use then prepare a frequency
distribution for the data.

12 20 17 19 23 32 15 45 60 65
18 22 27 35 37 57 47 38 40 28
13 10 19 24 29 28 38 47 48 57
27 29 33 34 49 76 55 65 37 39
40 14 17 20 32 33 60 65 62 57

2. The following are the weights of randomly selected 1 st year students in


kilograms. Prepare a frequency distribution for this set of data.

37 35 40 42 36 57 38 44 60 45 52
64 38 39 40 42 50 56 45 43 38 39
50 41 42 56 57 54 55 60 35 38 40
40 42 53 47 48 39 50 35 37 39 39
50

Lesson 3

21
The Summation Process

The study of statistics involves the collection of data or measurement. Thus,


there is always a need to add several numbers. The Greek capital letter sigma, Σ is
used in the process. The symbol Σ, read as the sum of tells you to add certain
numerical values.

Example 1: Consider the scores obtained by 10 students in a 50-items


mathematics test.

Student No. Score


1 35
2 40
3 29
4 37
5 25
6 33
7 49
8 47
9 28
10 42

For convenience, variables will be used to present the data.

Let x = score obtained by each student


xi = different values or observations of x
xi is read as “x sub i ” where i is a subscript which indicates the position of
each value in the series.

In the given data, there are 10 observations denoted as x 1, x2, x3, x4, x5, x6,
x7, x8, x9, x10.

Hence, = x1+ x2+ x3+ x4+ x5+ x6+ x7+ x8+ x9+ x10.

The symbol is read as “the sum of 10 observations x1 to x10”.

To substitute the data:


= 35 + 40+ 29 + 37 + 25 + 33 + 49 + 47 + 28 + 42

= 365

For large observations, say 50, the summation will be expressed as:

22
= x1+ x2+ x3 + …..+x50.

In general, = x1+ x2+ x3 + …..+xn.

If all the given values of a variable are to be used in finding the sum, the
limits of the summation are usually omitted, as
=

Example 2: Given are the ages of the first 4 shoppers at a newly opened
convenience store in the neighborhood – 12, 24, 30, 45.

1. What will x represent in the information given?


2. What will the subscript i represent?
3. Write an expression for the sum.
4. What are the lower and upper limits of the expression?
5. Write the formula for the summation and find the sum of the given
information.

Answers:
1. x will represent the ages of the first 4 shoppers in the newly opened
convenience store.
2. I will represent the first 4 shoppers in the newly opened convenience store.
3. is the expression for the summation.

4. The lower limit is 1 and the upper limit is 4.


5. = x1 + x2 + x3 + x4
= 12 + 24 + 30 + 45
= 111

This time, consider 5 observations. If the sum of five observations is written


as:
= x1 + x2 + x3 + x4 + x5 ;

the sum of the squares of the five observations is represented as:


= x12+ x22+ x32+ x42+ x52;

the sum of the products of pairs of five observations is expressed as:


= a1x1+ a2x2+ a3x3+ a4x4+ a5x5

23
Example 3: Consider the first four multiples of 2: 2, 4, 6, 8. Use the corresponding
summation formula to find the following:

1. the sum of the first four multiples of 2


2. the sum of the squares of the first four multiples of 2
3. the sum of the products of pairs of values consisting of the first four counting
numbers and the first four multiples of 2.

Solutions:

1. = x1+ x2+ x3+ x4


=2+4+6+8
= 20

2. = x12+ x22+ x32+ x42

= 22 + 4 2 + 6 2 + 8 2
= 4 + 16 + 36 + 64
=120

3. = a1x1+ a2x2+ a3x3+ a4x4


= 1(2) + 2(4) + 3(6) + 4(8)
= 2 + 8 + 18 + 32
= 60

Example 4: Find 1. 2.

Solutions:

1. = 3 + 3 + 3 + 3 + 3 + 3 = 6(3) = 18

2. = (-3) + (-3) + (-3) + (-3) + (-3) + (-3) = 6(-3) = -18

Observe that in example 4, the summation of a constant c is the product of


the constant and the number of terms n in the summation, that is,

24
Try this out

I. Express each of the following as a sum:

25
1.

2.

3.

4.

26
II. Express the following sums in
summation notation:

5. x1+ x2+ x3 + …..+x20


6. x12+ x22+ x32+ x42+ x52+ x62+ x72+ x82 11. the sum of the squares of the first
7. a1x12+ a2x22+ a3x32+ a4x42 five positive even integers
8. 4x12+ 4x22+ 4x32+ 4x42 12. the sum of the products of the first
four counting numbers and the first
four
III. Use summation to find the
multiples of 3.
following:

9. the sum of the positive odd integers IV. Compute:


less than 20
10. the sum of the first ten positive 13.

even integers
14.

15.

You might also like