Sampling and Sample Size Determination
Sampling and Sample Size Determination
Sampling and Sample Size Determination
Size Determination
Terms
Sample
Population
Population element
Census
Cost
Speed
Accuracy
Destruction of test units
Steps
Target Population
(step 1)
Operational Definition
Sampling Frame
(step 2)
List of elements
Probability or
Nonprobability (step 3)
Probability Sample:
A sampling technique in which every
member of the population will have a
known, nonzero probability of being
selected
Non-Probability Sample:
Units of the sample are chosen on the
basis of personal judgment or
convenience
There are NO statistical techniques for
measuring random sampling error in a
non-probability sample. Therefore,
generalizability is never statistically
appropriate.
Classification of Sampling
Methods
Sampling
Methods
Probability
Samples
Systematic
Cluster
Nonprobability
Stratified
Simple
Random
Convenience
Judgment
Snowball
Quota
Probability Sampling
Methods
Sample Size
Probability of Selection = Population Size
Advantages
minimal knowledge of population needed
External validity high; internal validity
high; statistical estimation of error
Easy to analyze data
Disadvantages
High cost; low frequency of use
Requires sampling frame
Does not use researchers expertise
Larger risk of random error than stratified
Systematic Sampling
An initial starting point is selected by a
random process, and then every nth
number on the list is selected
n=sampling interval
The number of population elements
between the units selected for the
sample
Error: periodicity- the original list has a
systematic pattern
?? Is the list of elements randomized??
Advantages
Moderate cost; moderate usage
External validity high; internal validity
high; statistical estimation of error
Simple to draw sample; easy to verify
Disadvantages
Periodic ordering
Requires sampling frame
Stratified Sampling
Sub-samples are randomly drawn from
samples within different strata that are
more or less equal on some characteristic
Why?
Can reduce random error
More
Advantages
minimal knowledge of population needed
External validity high; internal validity
high; statistical estimation of error
Easy to analyze data
Disadvantages
High cost; low frequency of use
Requires sampling frame
Does not use researchers expertise
Larger risk of random error than stratified
Systematic Sampling
An initial starting point is selected by a
random process, and then every nth
number on the list is selected
n=sampling interval
The number of population elements
between the units selected for the
sample
Error: periodicity- the original list has a
systematic pattern
?? Is the list of elements randomized??
Advantages
Moderate cost; moderate usage
External validity high; internal validity
high; statistical estimation of error
Simple to draw sample; easy to verify
Disadvantages
Periodic ordering
Requires sampling frame
Stratified Sampling
Sub-samples are randomly drawn from
samples within different strata that are
more or less equal on some characteristic
Why?
Can reduce random error
More
How?
1.Identify variable(s) as an efficient
basis for stratification. Must be known
to be related to dependent variable.
Usually a categorical variable
2.Complete list of population elements
must be obtained
3.Use randomization to take a simple
random sample from each stratum
Disproportional
Stratified Sample:
Advantages
Assures representation of all groups in
sample population needed
Characteristics of each stratum can be
estimated and comparisons made
Reduces variability from systematic
Disadvantages
Requires accurate information on
proportions of each stratum
Stratified lists costly to prepare
Cluster Sampling
The primary sampling unit is not the
individual element, but a large cluster of
elements. Either the cluster is randomly
selected or the elements within are
randomly selected
Why? Frequently used when no list of
Multistage
area sample:
Advantages
Disadvantages
Classification of Sampling
Methods
Sampling
Methods
Probability
Samples
Systematic
Cluster
Nonprobability
Stratified
Simple
Random
Convenience
Judgment
Snowball
Quota
Non-Probability Sampling
Methods
Convenience Sample
The sampling procedure used to obtain
those units or people most conveniently
available
Why: speed and cost
External validity?
Internal validity
Is it ever justified?
Advantages
Very low cost
Extensively used/understood
No need for list of population elements
Disadvantages
Variability and bias cannot be measured
or controlled
Projecting data beyond sample not
justified.
Advantages
Moderate cost
Commonly used/understood
Sample will meet a specific objective
Disadvantages
Bias!
Projecting data beyond sample not
justified.
Quota Sample
Advantages
moderate cost
Very extensively used/understood
No need for list of population elements
Introduces some elements of
stratification
Disadvantages
Variability and bias cannot be measured
or controlled (classification of subjects0
Projecting data beyond sample not
justified.
Snowball sampling
Advantages
low cost
Useful in specific circumstances
Useful for locating rare populations
Disadvantages
Bias because sampling units not
independent
Projecting data beyond sample not
justified.
Variance or heterogeneity of
population
Previous studies? Industry expectations?
Pilot study?
Sequential sampling
Rule of thumb: the value of standard
deviation is expected to be 1/6 of the
range.
n = (ZS/E) 2
n = Z2 pq/ E2
n = pc (100 pc) Z2/ E2