Introd 2
INTRODUCTION TO STATISTICS
WHAT IS STATISTICS
Statistics is the mathematical science that deals with the collection, presentation, analysis, and interpretation of data.
There are two main branches of statistics: descriptive statistics and inferential statistics.
WHO USES STATISTICS?
• Statistical techniques are used extensively by marketing, accounting, quality control, consumers, professional sports people, hospital administrators, educators, politicians, physicians, etc.
BRANCHES OF STATISTICS
DESCRIPTIVE STATISTICS
involves organizing, summarizing, and displaying data.
INFERENTIAL STATISTICS
involves using sample data to draw conclusions about a population.
• If the purpose of the study is to examine and explore information for its own intrinsic interest only, the study is descriptive.
• However, if the information is obtained from a sample of a population and the purpose of the study is to use that information to draw conclusions about the population, the study is inferential.
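The distinction can be sketched in a few lines of Python. The exam scores below are made-up data, and the variable names are only illustrative. The first part summarizes the scores for their own sake (descriptive); the second part treats the same scores as a sample and uses them to estimate something about a larger population (inferential).

```python
import statistics

# Hypothetical exam scores for 8 students (made-up data).
scores = [62, 75, 81, 58, 90, 77, 68, 85]

# Descriptive: organize and summarize the data we actually have.
summary = {
    "n": len(scores),
    "min": min(scores),
    "max": max(scores),
    "mean": statistics.mean(scores),
    "median": statistics.median(scores),
}
print("summary of these 8 scores:", summary)

# Inferential: assume the 8 students are a sample from a larger class and
# use the sample mean as an estimate of the (unknown) class mean.
estimated_class_mean = summary["mean"]
print("estimated mean score of the whole class:", estimated_class_mean)
```

The arithmetic is the same in both parts; what changes is the purpose. The estimate is only as good as the sample is representative of the class.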
Population                 Sample
The basketball team        The starting five
Music store customers      Customers who make a purchase
Parameter and Statistic
A parameter is a descriptive measure computed from an entire population of
data.
A statistic (or estimate) is a descriptive measure computed from a sample of data.
Examples
Parameter:
• Proportion of all students who attended the last home football game.
• Mean SAT score of entering freshmen.
Statistic:
• Mean height of a sample of NBA basketball players.
• Mean number of pepperoni slices on a 12” pizza from a sample of a certain brand of pepperoni pizzas.
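A minimal Python sketch of the same idea, assuming a simulated population of 500 heights (the numbers are generated, not real measurements). The mean of all 500 values plays the role of the parameter; the mean of a random sample of 30 plays the role of the statistic.

```python
import random
import statistics

random.seed(42)  # fixed seed so the sketch is reproducible

# Hypothetical population: heights (cm) of all 500 members of some group.
population = [random.gauss(170, 10) for _ in range(500)]

# Parameter: a descriptive measure computed from the entire population.
parameter = statistics.mean(population)

# Statistic (estimate): the same measure computed from a sample.
sample = random.sample(population, 30)
statistic = statistics.mean(sample)

print(f"parameter (population mean): {parameter:.1f}")
print(f"statistic (sample mean):     {statistic:.1f}")
```

In practice the parameter is usually unknown, because measuring the whole population is too costly; the statistic is what is actually computed.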
PRACTICE QUESTION 1
A scientist takes a big bucket of water from a lake and counts how many species of bacteria, bugs, and other creepy crawlies he finds in the bucket. Identify the population, the sample, the parameter, and the estimate in this situation.
Population: all the species that live in the lake.
Sample: the species that are in the bucket.
The parameter is the number of species in the lake.
The estimate is the number of species found in the bucket.
Practice Question II
A school takes a poll to find out what students want to eat at lunch. 70 students are randomly chosen to answer the poll questions.
The population is all the students at the school, and the parameter is the lunch preferences of the whole school.
The sample is the 70 students who were chosen, and the estimate is the lunch preferences of those 70 students.
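The poll can be simulated as a rough sketch. The school size (1200), the menu items, and the preferences themselves are all made-up assumptions; only the sample size of 70 comes from the question.

```python
import random
from collections import Counter

random.seed(7)  # fixed seed so the sketch is reproducible

# Hypothetical population: every student's lunch preference (made-up data).
menu = ["pizza", "salad", "sandwich"]
school = [random.choice(menu) for _ in range(1200)]

# Sample: 70 students chosen at random, without replacement.
polled = random.sample(school, 70)

# Estimate: share of each preference among the polled students.
counts = Counter(polled)
estimate = {item: counts[item] / len(polled) for item in menu}

# Parameter: the same shares over the whole school
# (known here only because the population is simulated).
totals = Counter(school)
parameter = {item: totals[item] / len(school) for item in menu}

for item in menu:
    print(f"{item}: estimate {estimate[item]:.2f}, parameter {parameter[item]:.2f}")
```

Because the 70 students are chosen at random, the estimated shares tend to be close to the true shares, though they will rarely match exactly.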
• The collection of information from the elements
of a population or a sample is called a survey.
Experimental Unit (Sampling Unit)
The first step in detailing the data collection protocol is to define the experimental unit. An experimental or sampling unit is the person or object that will be studied by the researcher.
Variable: A measurable characteristic that can vary from one individual or object to another is called a variable.
Example: height, age, number of siblings, marital status, eye color, etc.
Quantitative Variable: A quantitative variable is one which can assume a numerical value, for example, the balance in your checking account, minutes remaining in class, number of children in a family, height of a plant, weight of grains, number of students in a class, etc. Quantitative variables can be further divided into two types, depending upon the type of measurement possible: discrete variables and continuous variables.
Qualitative Variable: A qualitative variable, also known as a categorical variable, is one which is not capable of taking numerical measurements. For example, gender, religious affiliation, type of automobile owned, state of birth, eye color, general knowledge (poor, moderate, good), etc.
a) Continuous variable: A continuous variable is one that can take all possible values in an interval on the number line. For example, the pressure in a tire, the weight of a pork chop, the height of students in a class, atmospheric pressure, plant height, temperature.
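As an illustration, the variable types can be attached to one record for a single experimental unit. The record and the `variable_type` helper below are hypothetical, and the helper is only a rough heuristic: it guesses the type from how the value is stored (text, whole number, decimal), which fails when categories are stored as numeric codes.

```python
# Hypothetical record for one experimental unit (a student); values are made up.
student = {
    "eye_color": "brown",        # qualitative (categorical)
    "marital_status": "single",  # qualitative (categorical)
    "number_of_siblings": 2,     # quantitative, discrete (a count)
    "height_cm": 171.5,          # quantitative, continuous (a measurement)
}


def variable_type(value):
    """Rough heuristic: guess the variable type from the Python type."""
    if isinstance(value, (bool, str)):
        return "qualitative"
    if isinstance(value, int):
        return "quantitative (discrete)"
    if isinstance(value, float):
        return "quantitative (continuous)"
    raise TypeError(f"unsupported value: {value!r}")


for name, value in student.items():
    print(f"{name}: {variable_type(value)}")
```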
Measurement Scales:
I. Nominal scale: unordered categories. This includes measurements of categories such as gender, religion, sport, etc.
II. Ordinal scale: ordered categories. This includes measurements of categories such as size, behavior, etc.
Interval Scale: like the ordinal level, with the additional property that meaningful amounts of differences between data values can be determined. There is no natural zero point.
Ratio Scale: the interval level with an inherent zero starting point. Differences and ratios are meaningful for this level of measurement.
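The four scales can be summarized by which comparisons are meaningful at each level. The table and the `allows` helper below are an illustrative sketch, not a standard API. The temperature example is an added assumption: Celsius is treated as an interval scale (its zero is arbitrary) and Kelvin as a ratio scale (its zero is inherent).

```python
# Which comparisons are meaningful at each level of measurement.
SCALES = {
    "nominal":  {"order": False, "differences": False, "ratios": False},
    "ordinal":  {"order": True,  "differences": False, "ratios": False},
    "interval": {"order": True,  "differences": True,  "ratios": False},
    "ratio":    {"order": True,  "differences": True,  "ratios": True},
}


def allows(scale, comparison):
    """Return True if the comparison is meaningful on the given scale."""
    return SCALES[scale][comparison]


# Celsius is an interval scale: 0 degrees C does not mean "no temperature".
a_c, b_c = 10.0, 20.0
difference = b_c - a_c    # meaningful: 10 degrees warmer
naive_ratio = b_c / a_c   # computes to 2.0, but does NOT mean "twice as hot"

# Kelvin has an inherent zero, so ratios of Kelvin values are meaningful.
a_k, b_k = a_c + 273.15, b_c + 273.15
true_ratio = b_k / a_k    # roughly 1.035

print(allows("interval", "ratios"), allows("ratio", "ratios"))
print(difference, naive_ratio, round(true_ratio, 3))
```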
Nominal: unordered categories, e.g. male/female, smoker/non-smoker, alive/dead.
[Figure: types of data, divided into qualitative (categorical) and quantitative (numeric)]
Source of Data
Source of Data Collection:
PRIMARY DATA
• These are the data that are collected for the first time by an investigator for a specific purpose.
• Primary data are ‘pure’ in the sense that no statistical operations have been performed on them and they are original.
• An example of primary data is the Census of Pakistan.
SECONDARY DATA
• These are the data that are sourced from someplace that has originally collected them.
• This means that this kind of data has already been collected by some researchers or investigators in the past and is available either in published or unpublished form.
• This information is impure, as statistical operations may have been performed on it already.
• An example is information available on the Government of Pakistan's or the Department of Finance’s website, or in other repositories such as books, journals, etc.
METHODS FOR COLLECTION OF PRIMARY DATA
3. Questionnaire Method
5. Registrations
1. DIRECT PERSONAL INVESTIGATION
• IN THIS METHOD, THE INVESTIGATOR INTERVIEWS THE PERSONS CONCERNED OR OBSERVES FACTS PERSONALLY.
• THE INVESTIGATOR MAY GO TO LIVE WITH THE PEOPLE, MIX WITH THEM FREELY AND GATHER THE FACTS.
• IT IS SUITABLE ONLY IN LABORATORY EXPERIMENTS OR LOCALIZED INQUIRIES.
2. INDIRECT PERSONAL INVESTIGATION
INFORMATION.
• A QUESTIONNAIRE IS AN INSTRUMENT FOR RESEARCH, WHICH CONSISTS OF A LIST OF QUESTIONS, ALONG WITH THE CHOICE OF ANSWERS, PRINTED OR TYPED IN A SEQUENCE ON A FORM USED FOR ACQUIRING SPECIFIC INFORMATION FROM THE RESPONDENTS.
• IN GENERAL, QUESTIONNAIRES ARE DELIVERED TO THE PERSONS CONCERNED EITHER BY POST OR MAIL, REQUESTING THEM TO
• INFORMANTS ARE EXPECTED TO READ AND UNDERSTAND THE QUESTIONS AND REPLY IN THE SPACE PROVIDED IN THE QUESTIONNAIRE ITSELF.
• THE QUESTIONNAIRE IS PREPARED IN SUCH A WAY THAT IT TRANSLATES THE REQUIRED INFORMATION INTO A SERIES OF QUESTIONS,
• THIS METHOD IS AN ALTERNATIVE WAY TO COLLECT PRIMARY DATA FROM RURAL AREAS.
• A NUMBER OF ENUMERATORS ARE SELECTED AND TRAINED. THEY ARE PROVIDED WITH A STANDARDIZED QUESTIONNAIRE.
• THESE ENUMERATORS GO TO THE RESPONDENTS ALONG WITH THE QUESTIONNAIRE AND
5. REGISTRATION
1. OFFICIAL SOURCES, E.G. PUBLICATIONS OF FEDERAL BUREAU OF STATISTICS, MINISTRIES OF AGRICULTURE, FINANCE,
2. SEMI- OFFICIAL SOURCES, E.G. PUBLICATIONS OF STATE BANK OF PAKISTAN, CENTRAL COTTON COMMITTEE,
3. PRIVATE SOURCES, E.G. PUBLICATIONS OF TRADE ASSOCIATIONS, CHAMBERS OF COMMERCE AND INDUSTRY,
MARKET
COMMITTEES, ETC.