SAMPLING and Sampling Distribution
SAMPLING and Sampling Distribution
SAMPLING and Sampling Distribution
Geographical area
Demographics
Usage/lifestyle
Awareness
Sampling: Meaning
Sampling Techniques
1. Convenience Sampling
2. Judgment Sampling
3. Quota Sampling
4. Snow-ball Sampling
Convenience Sampling
Convenience sampling attempts to obtain a
sample of convenient elements. Often,
respondents are selected because they happen to
be in the right place at the right time.
• Suppose that all possible samples of size n = 2 are taken from this
population
30
Sampling Distribution of Sample Mean
Population:
(54, 54) (55, 54) (59, 54) (63, 54)
54 55 59 63 64 68 69 70 (54, 55) (55, 55) (59, 55) (63, 55)
(54, 59) (55, 59) (59, 59) (63, 59)
All possible samples of n = 2: (54, 63) (55, 63) (59, 63) (63, 63)
(54, 64) (55, 64) (59, 64) (63, 64)
(54, 68) (55, 68) (59, 68) (63, 68)
• Then take the means of all of the samples (54, 69) (55, 69) (59, 69) (63, 69)
(54, 70) (55, 70) (59, 70) (63, 70)
(64, 54) (68, 54) (69, 54) (70, 54)
(64, 55) (68, 55) (69, 55) (70, 55)
31
Sampling Distribution of Sample Mean
Means of the samples:
54 54.5 56.5 58.5 59 61 61.5 62
54.5 55 57 59 59.5 61.5 62 62.5
56.5 57 59 61 61.5 63.5 64 64.5
58.5 59 61 63 63.5 65.5 66 66.5
59 59.5 61.5 63.5 64 66 66.5 67
61 61.5 63.5 65.5 66 68 68.5 69
61.5 62 64 66 66.5 68.5 69 69.5
62 62.5 64.5 66.5 67 69 69.5 70
32
Sampling Distribution of Sample Mean
33
Sampling Distribution of Sample Mean
34
Central limit theorem
When the random samples of observations
are drawn from a non-normal population
with finite mean μ and standard deviation
σ, and as the sample size n is increased,
the sampling distribution of sample mean x
is approximately normally distributed, with
mean and standard deviation as:
x
x n
x
x
z x
x
n
Central limit theorem
Central limit theorem says that, provided that the sample
size is sufficiently large, the sampling distribution of
sample mean X-bar has an approximately normal
distribution. Even if the variable of interest is not normally
distributed in the population.
No matter how a variable is distributed in the population,
the sampling distribution of the sample mean is always
approximately normal, as long as the sample size is large
enough.
As a guideline for ‘large enough’ a sample size of 30 or
larger is often used.
The central limit theorem is useful in statistical inference.
When the sample size is sufficiently large, estimations
such as ‘average’ or ‘proportion’ that are used to make
inferences about population parameters are expected to
have sampling distribution that is approximately normal.
Point Estimation
A sample statistic (such as x bar, s, or p ) that is
calculated using sample data to estimate most likely
value of the corresponding unknown population
parameter (such as μ, σ or p) is termed as point
estimator, and the numerical value of the estimator is
termed as point estimate.
For example, if we calculate that 10 per cent of the items
in a random sample taken from a day’s production are
defective, then the result ‘10 per cent’ is a point estimate
of the percentage of items in the whole lot that are
defective. Thus, until the next sample of items is not
drawn and examined, we may proceed on manufacturing
with the assumption that any day’s production contains
10 per cent defective items.
Drawback of Point Estimates
The draw back of a point estimate is that no
information is available regarding its reliability,
i.e. how close it is to its true population
parameter. In fact, the probability that a single
sample statistic actually equals the population
parameter is extremely small.
For this reason, point estimates are rarely used
alone to estimate population parameters. It is
better to offer a range of values within which
the population parameters are expected to fall
so that reliability (probability) of the estimate
can be measured. This is the purpose of
interval estimation
Interval Estimation
Interval Estimation is calculation of
the interval(range of values) within
which the population parameters
are expected to fall.
It is calculated from a sample
expected to include the
corresponding population
parameter.
It is calculated by with the help of
confidence interval and margin of
error.
Confidence Interval and Margin of
error
Margin of error: The value added or subtracted from a
point estimate in order to develop an interval estimate of
a population parameter.
Margin of error =
z *
n
Or = s
t *
n
Confidence interval: The interval within which the
population parameter is expected to lie.
Confidence interval= Point estimate ± Margin of error
Confidence limits: The boundaries (both upper and
lower) of a confidence interval.
Excel Formula
Margin of Error
= Confidence. Norm (alpha, standard
deviation,size)
Or
= Confidence.t (alpha, standard
deviation,size)
n= n0N/n0+(N-1)
Example