23 4 14 Prob Distribution

Download as ppt, pdf, or txt
Download as ppt, pdf, or txt
You are on page 1of 132

Statistics for Business and

Economics
Chapter
Random Variables &
Probability Distributions

Content
1. Two Types of Random Variables
2. Probability Distributions for Discrete
Random Variables
3. The Binomial Distribution
4. Poisson and Hypergeometric Distributions
5. Probability Distributions for Continuous
Random Variables
6. The Normal Distribution

Content (continued)
7. Descriptive Methods for Assessing
Normality
8. Approximating a Binomial Distribution with
a Normal Distribution
9. Uniform and Exponential Distributions
10. Sampling Distributions
11. The Sampling Distribution of a Sample
Mean and the Central Limit Theorem

Learning Objectives
1. Develop the notion of a random variable
2. Learn that numerical data are observed values of
either discrete or continuous random variables
3. Study two important types of random variables
and their probability models: the binomial and
normal model
4. Define a sampling distribution as the probability of
a sample statistic
5. Learn that the sampling distribution of x follows a
normal model
2011 Pearson Education, Inc

Thinking Challenge
Youre taking a 33 question multiple
choice test. Each question has 4
choices. Clueless on 1 question, you
decide to guess. Whats the chance
youll get it right?
If you guessed on all 33 questions, what
would be your grade? Would you pass?

Two Types of
Random Variables

Random Variable
A random variable is a variable that assumes
numerical values associated with the random
outcomes of an experiment, where one (and only
one) numerical value is assigned to each sample
point.

Discrete
Random Variable
Random variables that can assume a countable
number (finite or infinite) of values are called
discrete.

Discrete Random Variable


Examples
Experiment

Random
Variable

Possible
Values

Make 100 Sales Calls

# Sales

0, 1, 2, ..., 100

Inspect 70 Radios

# Defective

0, 1, 2, ..., 70

Answer 33 Questions

# Correct

0, 1, 2, ..., 33

Count Cars at Toll


Between 11:00 & 1:00

# Cars
Arriving

0, 1, 2, ...,

Continuous
Random Variable
Random variables that can assume
values corresponding to any of the
points contained in one or more
intervals (i.e., values that are infinite
and uncountable) are called
continuous.

Continuous Random
Variable Examples
Experiment

Random
Variable

Possible
Values

Weigh 100 People

Weight

45.1, 78, ...

Measure Part Life

Hours

900, 875.9, ...

Amount spent on food

$ amount

54.12, 42, ...

Measure Time
Between Arrivals

Inter-Arrival 0, 1.3, 2.78, ...


Time

Probability Distributions
for Discrete Random
Variables

Discrete
Probability Distribution
The probability distribution of a
discrete random variable is a
graph, table, or formula that specifies
the probability associated with each
possible value the random variable
can assume.

Requirements for the


Probability Distribution of a
Discrete Random Variable x
1. p(x) 0 for all values of x
p(x) = 1
where the summation of p(x) is over all possible
values of x.

Discrete Probability
Distribution Example
Experiment: Toss 2 coins. Count number of
tails.
Probability Distribution
Values, x Probabilities, p(x)
0

1/4 = .25

2/4 = .50

1/4 = .25

Visualizing Discrete
Probability Distributions
Listing

Table

{ (0, .25), (1, .50), (2, .25) }

# Tails

f(x)
Count

p(x)

0
1
2

1
2
1

.25
.50
.25

Graph
p(x)
.50
.25
.00

Formula
x
0

p (x ) =

n!
px(1 p)n x
x!(n x)!

Summary Measures
1. Expected Value (Mean of probability distribution)
Weighted average of all possible values
= E(x) = x p(x)
2. Variance
Weighted average of squared deviation about
mean
2 = E[(x (x p(x)
3.

Standard Deviation

Summary Measures
Calculation Table
x

p(x)

Total

x p(x)

xp(x)

(x

(x p(x)

(x p(x)

Thinking Challenge
You toss 2 coins. Youre interested in
the number of tails. What are the
expected value, variance, and
standard deviation of this random
variable, number of tails?

Expected Value & Variance


Solution
x

p(x)

x p(x)

.25

1.00

1.00

.25

.50

.50

.25

.50

1.00

1.00

.25

= 1.0

(x (x p(x)

Probability Rules for Discrete


Random Variables
Let x be a discrete random variable with probability
distribution p(x), mean , and standard deviation .
Then, depending on the shape of p(x), the following
probability statements can be made:
Chebyshevs Rule

Empirical Rule

P x x

.68

P x 2 x 2

34

.95

89

1.00

P x 3 x 3

The Binomial
Distribution

Binomial Distribution
Number of successes in a sample of n
observations (trials)
Number of reds in 15 spins of roulette wheel
Number of defective items in a batch of 5 items
Number correct on a 33 question exam
Number of customers who purchase out of 100
customers who enter store (each customer is
equally likely to pyrchase)

Binomial Probability
Characteristics of a Binomial Experiment
1. The experiment consists of n identical trials.
2. There are only two possible outcomes on each trial. We
will denote one outcome by S (for success) and the other
by F (for failure).
3. The probability of S remains the same from trial to trial.
This probability is denoted by p, and the probability of
F is denoted by q. Note that q = 1 p.
4. The trials are independent.
5. The binomial random variable x is the number of Ss in
n trials.

Binomial Probability
Distribution

n
n!
x n x
p( x) p q
p x (1 p) n x
x ! (n x)!
x
p(x) = Probability of x Successes
p = Probability of a Success on a single trial
q = 1p
n = Number of trials
x = Number of Successes in n trials
(x = 0, 1, 2, ..., n)
n x = Number of failures in n trials

Binomial Probability
Distribution Example
Experiment: Toss 1 coin 5 times in a row. Note
number of tails. Whats the probability of 3 tails?
n!
p( x)
p x (1 p) n x
x !(n x)!
5!
p(3)
.53 (1 .5)53
3!(5 3)!
.3125

Binomial Probability Table (Portion)


n=5

.01

0.50

.99

.951

.031

.000

.999

.188

.000

1.000

.500

.000

1.000

.812

.001

1.000

.969

.049

Cumulative Probabilities
p(x 3) p(x 2) = .812 .500 = .312

Binomial Distribution
Characteristics

n = 5 p = 0.1

Mean
E(x) np
Standard Deviation

npq

n = 5 p = 0.5

Binomial Distribution
Thinking Challenge
Youre a telemarketer selling service contracts for
Macys. Youve sold 20 in your last 100 calls (p
= .20). If you
call 12 people tonight, whats the probability of
A. No sales?
B. Exactly 2 sales?
C. At most 2 sales?
D. At least 2 sales?

Binomial Distribution Solution*


n = 12, p = .20
A. p(0) = .0687
B. p(2) = .2835
C. p(at most 2) = p(0) + p(1) + p(2)
= .0687 + .2062 + .2835
= .5584
D. p(at least 2)
= p(2) + p(3)...+ p(12)
= 1 [p(0) + p(1)]
= 1 .0687 .2062
= .7251

4.4
Other Discrete Distributions:
Poisson and Hypergeometric

Poisson Distribution
1. Number of events that occur in an interval
events per unit
Time, Length, Area, Space

2. Examples
Number of customers arriving in 20 minutes
Number of strikes per year in the U.S.
Number of defects per lot (group) of DVDs

Characteristics of a Poisson
Random Variable
1. Consists of counting number of times an event
occurs during a given unit of time or in a given
area or volume (any unit of measurement).
2. The probability that an event occurs in a given unit
of time, area, or volume is the same for all units.
3. The number of events that occur in one unit of
time, area, or volume is independent of the number
that occur in any other mutually exclusive unit.
4. The mean number of events in each unit is denoted
by

Poisson Probability
Distribution Function
x

p (x )

x!

(x = 0, 1, 2, 3, . . .)



p(x) = Probability of x given
= Mean (expected) number of events in unit
e = 2.71828 . . . (base of natural logarithm)
x = Number of events per unit

Poisson Probability
Distribution Function
= 0.5

Mean

E(x)
= 6
Standard Deviation

Poisson Distribution Example


Customers arrive at a rate of 72 per hour. What
is the probability of 4 customers arriving in 3
minutes?

Poisson Distribution Solution


72 Per Hr. = 1.2 Per Min. = 3.6 Per 3 Min. Interval

p ( x)

e
x

x!

3.6

p (4)

4!

-3.6

.1912

Poisson Probability Table (Portion)

Cumulative Probabilities
p(x 4) p(x 3) = .706 .515 = .191

Thinking Challenge
You work in Quality Assurance for an
investment firm. A clerk enters 75
words per minute with 6 errors per
hour. What is the probability of 0
errors in a 255-word bond
transaction?

Poisson Distribution Solution:


Finding *

75 words/min = (75 words/min)(60 min/hr)


= 4500 words/hr

6 errors/hr = 6 errors/4500 words


= .00133 errors/word

In a 255-word transaction (interval):


= (.00133 errors/word )(255 words)
= .34 errors/255-word transaction

Poisson Distribution Solution:


Finding p(0)*
p( x)

e
x

x!

.34

p(0)

0!

-.34

.7118

Characteristics of a
Hypergeometric
Random Variable
1. The experiment consists of randomly drawing n
elements without replacement from a set of N
elements, r of which are Ss (for success) and (N
r) of which are Fs (for failure).
2. The hypergeometric random variable x is the
number of Ss in the draw of n elements.

Hypergeometric Probability
Distribution Function
r
x
p x

nr

N
where . . .

N r
n x
N
n

[x = Maximum [0, n (N r), ,


Minimum (r, n)]

r N r n N n

N 2 N 1
2

Hypergeometric Probability
Distribution Function
N = Total number of elements
r = Number of Ss in the N elements
n = Number of elements drawn
x = Number of Ss drawn in the n
elements

Probability
Distributions for
Continuous Random
Variables

Continuous Probability
Density Function
The graphical form of the probability distribution for a
continuous random variable x is a smooth curve

Continuous Probability Density


Function
This curve, a function of x, is denoted by the symbol f(x)
and is variously called a probability density function
(pdf), a frequency function, or a probability
distribution.
The areas under a probability
distribution correspond to
probabilities for x. The area A
beneath the curve between two
points a and b is the probability
that x assumes a value between a and b.

The Normal
Distribution

Importance of
Normal Distribution
1. Describes many random processes or
continuous phenomena
2. Can be used to approximate discrete
probability distributions

Example: binomial

3. Basis for classical statistical inference

Normal Distribution
1. Bell-shaped &
symmetrical

f(x )

2. Mean, median, mode


are equal

x
Mean
Median
Mode

Probability Density Function


1
f (x)
e
2

1 x

2

where
= Mean of the normal random variable x
= Standard deviation
= 3.1415 . . .
e = 2.71828 . . .
P(x < a) is obtained from a table of normal
probabilities

Effect of Varying
Parameters ( & )

Normal Distribution
Probability
Probability is
area under
curve!

P(c x d)

f(x)

f (x)dx ?

Standard Normal Distribution


The standard normal distribution is a normal
distribution with = 0 and = 1. A random
variable with a standard normal distribution,
denoted by the symbol z, is called a standard
normal random variable.

The Standard Normal Table:


P(0 < z < 1.96)
Standardized Normal Probability
Table (Portion)
Z

.04

.05

.06

=1

1.8 .4671 .4678 .4686

.4750

1.9 .4738 .4744 .4750


2.0 .4793 .4798 .4803

= 0 1.96 z

2.1 .4838 .4842 .4846

Probabilities

Shaded area
exaggerated

The Standard Normal Table:


P(1.26 z 1.26)
Standardized Normal Distribution

=1
.3962

.3962

P(1.26 z 1.26)
= .3962 + .3962

1.26 1.26 z
=0
Shaded area exaggerated

= .7924

The Standard Normal Table:


P(z > 1.26)
Standardized Normal Distribution

=1
P(z > 1.26)

.5000

= .5000 .3962

.3962

1.26
=0

= .1038

The Standard Normal Table:


P(2.78 z 2.00)
Standardized Normal Distribution

=1
P(2.78 z 2.00)

.4973

= .4973 .4772

.4772

2.78 2.00

=0

Shaded area exaggerated

= .0201

The Standard Normal Table:


P(z > 2.13)
Standardized Normal Distribution

=1
.4834

P(z > 2.13)

.5000

= .4834 + .5000

2.13
=0
Shaded area exaggerated

= .9834

Non-standard Normal
Distribution
Normal distributions differ by
mean & standard deviation.

Each distribution would


require its own table.

f(x)

Thats an infinite
number of tables!

Property of Normal Distribution


If x is a normal random variable with mean and
standard deviation , then the random variable z,
defined by the formula
x
z

has a standard normal distribution. The value z describes


the number of standard deviations between x and .

Standardize the
Normal Distribution
x
z

Normal
Distribution

Standardized Normal
Distribution

= 1

= 0
One table!

Finding a Probability Corresponding to a


Normal Random Variable
1. Sketch normal distribution, indicate mean, and shade
the area corresponding to the probability you want.
2. Convert the boundaries of the shaded area from x
values to standard normal random variable z
x
z

Show the z values under corresponding x values.


3. Use Table IV in Appendix A to find the areas
corresponding to the z values. Use symmetry when
necessary.

Non-standard Normal = 5, = 10:


P(5 < x < 6.2)
Normal
Distribution

x 6.2 5
z

.12

10
Standardized Normal
Distribution

= 10

=1
.0478

= 5 6.2

Shaded area exaggerated

= 0 .12

Non-standard Normal = 5, = 10:


P(3.8 x 5)
Normal
Distribution

x 3.8 5
z

.12

10

Standardized Normal
Distribution

= 10

=1
.0478

3.8 = 5

Shaded area exaggerated

-.12 = 0

Non-standard Normal = 5, = 10:


P(2.9 x 7.1)

x 2.9 5
z

.21

10

x 7.1 5
z

.21

10

Normal
Distribution

Standardized Normal
Distribution

= 10

=1
.1664
.0832 .0832

2.9 5 7.1

x
Shaded area exaggerated

-.21 0 .21

Non-standard Normal = 5, = 10: P(x


8)
x 85
z

.30

10
Normal
Standardized Normal
Distribution

Distribution

= 10

=1
.5000
.1179

=5

x
Shaded area exaggerated

=0

.3821

.30 z

Non-standard Normal = 5, = 10:


P(7.1 X 8)
x 7.1 5
z

.21

10

Normal
Distribution

x 85
z

.30

10
Standardized Normal
Distribution

= 10

=1
.1179
.0832

=5

7.1 8

Shaded area exaggerated

=0

.21 .30

.0347

Normal Distribution
Thinking Challenge
You work in Quality Control for GE. Light bulb life
has a normal distribution with = 2000 hours and
= 200 hours. Whats the probability that a bulb will
last
A. between 2000 and 2400
hours?
B. less than 1470 hours?

Solution* P(2000 x 2400)


x 2400 2000
z

2.0

200
Normal
Distribution

Standardized Normal
Distribution

= 200

=1

.4772
= 2000 2400

=0

2.0

Solution* P(x 1470)


x 1470 2000
z

2.65

200
Normal
Distribution

Standardized Normal
Distribution

= 200

=1
.5000

.0040
1470

= 2000

.4960

2.65 = 0

Finding z-Values
for Known Probabilities
Standardized Normal
Probability Table (Portion)

What is Z, given
P(z) = .1217?
.1217

=1

.00

.01

0.2

0.0 .0000 .0040 .0080


0.1 .0398 .0438 .0478

=0
Shaded area
exaggerated

.31
?

0.2 .0793 .0832 .0871

0.3 .1179 .1217 .1255

Finding x Values
for Known Probabilities
Normal Distribution

Standardized Normal Distribution

= 10

=1
.1217

= 5 8.1
?

.1217

= 0 .31

x z 5 .3110
Shaded areas exaggerated

4.7
Descriptive Methods for
Assessing Normality

Determining Whether the Data


Are from an Approximately
Normal Distribution
1. Construct either a histogram or stem-and-leaf
display for the data and note the shape of the graph.
If the data are approximately normal, the shape of
the histogram or stem-and-leaf display will be
similar to the normal curve.

Determining Whether the


Data Are from an
Approximately Normal
Distribution
2. Compute the intervals x s, x 2s, and x 3s, and
determine the percentage of measurements falling
in each. If the data are approximately normal, the
percentages will be approximately equal to 68%,
95%, and 100%, respectively; from the Empirical
Rule (68%, 95%, 99.7%).

Determining Whether the Data Are from


an Approximately Normal Distribution
3. Find the interquartile range, IQR, and standard
deviation, s, for the sample, then calculate the ratio
IQR/s. If the data are approximately normal, then
IQR/s 1.3.

IQR Q3 Q1

s
s

4. Examine a normal
probability plot for the
data. If the data are
approximately normal,
the points will fall
(approximately) on a
straight line.

Expected zscore

Determining Whether the Data


Are from an Approximately
Normal Distribution

Observed value
2011 Pearson Education, Inc

Normal Probability Plot


A normal probability plot for a data
set is a scatterplot with the ranked data
values on one axis and their
corresponding expected z-scores from a
standard normal distribution on the other
axis. [Note: Computation of the expected
standard normal z-scores are beyond the
scope of this text. Therefore, we will rely
on available statistical software packages
to generate a normal probability plot.]

Approximating a
Binomial Distribution
with a Normal
Distribution

Normal Approximation of
Binomial Distribution
1. Useful because not all
binomial tables exist
2. Requires large sample
size
3. Gives approximate
probability only
4. Need correction for
continuity

n = 10 p = 0.50
p(x)
.3
.2
.1
.0
0

x
2

10

Why Probability, is Approximate


Probability Added
by Normal Curve

p(x)
.3
.2

Probability Lost by
Normal Curve

.1
.0

x
0

Binomial Probability:
Bar Height

10

Normal Probability: Area Under


Curve from 3.5 to 4.5

Correction for Continuity


1. A 1/2 unit adjustment to
discrete variable
2. Used when approximating
a discrete distribution
with a continuous
distribution
3. Improves accuracy
3.5
(4 .5)

4.5
(4 + .5)

Using a Normal Distribution to


Approximate Binomial Probabilities
1. Determine n and p for the binomial distribution,
then calculate the interval:

3 np 3 np 1 p
If interval lies in the range 0 to n, the normal
distribution will provide a reasonable
approximation to the probabilities of most
binomial events.

Using a Normal Distribution to


Approximate Binomial Probabilities
2. Express the binomial probability to be
approximated by the form
P x a orP x b P x a
For example,
P x 3 P x 2

P x 5 1 P x 4

P 7 x 10 P x 10 P x 6

Using a Normal Distribution to


Approximate Binomial Probabilities
3. For each value of interest a, the correction for
continuity is (a + .5), and the corresponding
standard normal z-value is
a .5

Using a Normal Distribution to


Approximate Binomial Probabilities
4. Sketch the approximating normal distribution and
shade the area corresponding to the event of
interest. Using Table IV and the z-value (step 3),
find the shaded area.
This is the approximate
probability of the
binomial event.

Normal Approximation Example


What is the normal approximation of p(x = 4)
given n = 10, and p = 0.5?
P(x)

.3
.2
.1
.0

x
0

4
6
3.5 4.5

10

Normal Approximation Solution


1. Calculate the interval:
np 3 np 1 p 10 0.5 3 10 0.5 1 0.5
5 4.74 0.26, 9.74

Interval lies in range 0 to 10, so normal


approximation can be used
2.

Express binomial probability in form:


P x 4 P x 4 P x 3

Normal Approximation Solution


3. Compute standard normal z values:
a .5 n p

n p 1 p
a .5 n p

n p 1 p

3.5 10 .5

10 .5 1 .5
4.5 10 .5

10 .5 1 .5

0.95

0.32

Normal Approximation Solution


4. Sketch the approximate normal distribution:

=0
=1

.3289
- .1255
.2034
.1255
.3289
-.95

-.32

Normal Approximation Solution


5. The exact probability from the binomial formula is
0.2051 (versus .2034)

.3
.2
.1
.0

p(x)

x
0

10

Other Continuous
Distributions:
Uniform and
Exponential

Uniform Distribution
Continuous random variables that appear to have
equally likely outcomes over their range of possible
values possess a uniform probability distribution.
Suppose the random
variable x can assume
values only in an
interval c x d.
Then the uniform
frequency function
has a rectangular
shape.

Probability Distribution for a


Uniform Random Variable x
Probability density function:

cd
Mean:
2

1
f (x)
dc

cxd

dc
Standard Deviation:
12

P a x b b a d c , c a b d

Uniform Distribution
Example
Youre production manager of a soft drink bottling
company. You believe that when a machine is set
to dispense 12 oz., it really dispenses between
11.5 and 12.5 oz. inclusive. Suppose the
amount dispensed has a uniform distribution.
What is the probability that less than 11.8 oz. is
dispensed?

Uniform Distribution Solution


1
1

d c 12.5 11.5
1
1.0
1

f(x)
1.0

11.5 11.8
P(11.5 x 11.8)

12.5

= (Base)/(Height)
= (11.8 11.5)/(1) = .30

Exponential Distribution
The length of time between emergency
arrivals at a hospital, the length of time
between breakdowns of manufacturing
equipment, and the length of time between
catastrophic events (e.g., a stockmarket
crash), are all continuous random
phenomena
we
want tobetween
describe
The length ofthat
time
ormight
the distance
probabilistically.
occurrences of random events like these
can often be described by the exponential
probability distribution. For this reason,
the exponential distribution is sometimes
called the waiting-time distribution.

Probability Distribution
for an Exponential Random
Variable x
Probability density function:
Mean:

Standard Deviation:

1 x
f (x) e

x 0

Finding the Area to the


Right of a Number a for an
Exponential Distribution

Exponential Distribution
Example
Suppose the length of time (in hours) between
emergency arrivals at a certain hospital is modeled as
an exponential distribution with = 2. What is the
probability that more than 5 hours pass without an
emergency arrival?
Mean: 2
Standard Deviation: 2

1
f (x) e

x 0

Exponential Distribution
Solution
Probability is the area A to
the right of a = 5.
A e

52

e2.5

From Table V:
A e2.5 .082085

Probability that more than 5


hours pass between emergency arrivals is about .08.

4.10

Sampling Distributions

Parameter & Statistic


A parameter is a numerical descriptive measure
of a population. Because it is based on all the
observations in the population, its value is
almost always unknown.
A sample statistic is a numerical descriptive
measure of a sample. It is calculated from the
observations in the sample.

Common Statistics & Parameters


Sample Statistic

Population Parameter

Mean

Standard
Deviation

Variance

s2

Binomial
Proportion

^
p

Sampling Distribution
The sampling distribution of a sample statistic
calculated from a sample of n measurements is
the probability distribution of the statistic.

Developing
Sampling Distributions
Suppose Theres a Population ...
Population size, N = 4
Random variable, x
Values of x: 1, 2, 3, 4
Uniform distribution

Population Characteristics
Summary Measure
N

xi
i1

2.5

Population Distribution
.3
.2
.1
.0

P(x)

All Possible Samples


of Size n = 2
16 Samples

16 Sample Means

1st 2nd Observation


Obs 1
2
3
4

1st 2nd Observation


Obs 1
2
3
4

1,1 1,2 1,3 1,4

1 1.0 1.5 2.0 2.5

2,1 2,2 2,3 2,4

2 1.5 2.0 2.5 3.0

3,1 3,2 3,3 3,4

3 2.0 2.5 3.0 3.5

4,1 4,2 4,3 4,4

4 2.5 3.0 3.5 4.0

Sample with replacement

Sampling Distribution
of All Sample Means
16 Sample Means
1st 2nd Observation
Obs 1
2
3
4

1 1.0 1.5 2.0 2.5


2 1.5 2.0 2.5 3.0
3 2.0 2.5 3.0 3.5
4 2.5 3.0 3.5 4.0

Sampling Distribution
of the Sample Mean
P(x)
.3
.2
.1
.0
1.0 1.5 2.0 2.5 3.0 3.5 4.0

Summary Measure of
All Sample Means
N

1.0 1.5 ... 4.0


X

2.5
N
16
i 1

Comparison
Population
.3
.2
.1
.0

Sampling Distribution

P(x)

2.5

P(x)
.3
.2
.1
.0
1.0 1.5 2.0 2.5 3.0 3.5 4.0

x 2.5

The Sampling Distribution


of a Sample Mean and the
Central Limit Theorem

Properties of the Sampling


Distribution of x
1. Mean of the sampling distribution equals mean of
sampled population*, that is,
x E x .
2. Standard deviation of the sampling distribution equals
Standarddeviationofsampledpopulation
Squarerootofsamplesize

.
That is, x
n

Standard Error of the Mean


The standard deviation x is often referred to
as the standard error of the mean.

Theorem 4.1
If a random sample of n observations is selected from
a population with a normal distribution, the sampling
distribution of x will be a normal distribution.

Sampling from
Normal Populations
Central Tendency
x

Population Distribution
= 10

Dispersion

x
n
Sampling with
replacement

= 50

Sampling Distribution
n=4
x = 5

n =16
x = 2.5
x- = 50

Standardizing the Sampling


Distribution of x
x x x
z

x
n

Sampling
Distribution

Standardized Normal
Distribution

= 1

=0

Thinking Challenge
Youre an operations analyst for
AT&T. Long-distance telephone calls
are normally distributed with = 8
min. and = 2 min. If you select
random samples of 25 calls, what
percentage of the sample means
would be between 7.8 & 8.2
minutes?

Sampling Distribution Solution*


x 7.8 8

.50

2
25
n
x 8.2 8
z

.50

2
Standardized Normal
25
n
Distribution
z

Sampling
Distribution

x = .4

=1
.3830
.1915 .1915

7.8 8 8.2 x

.50 0 .50

Sampling from
Non-Normal Populations
Central Tendency

Population Distribution
= 10

Dispersion

x
n
Sampling with
replacement

= 50

Sampling Distribution
n=4
x = 5

n =30
x = 1.8

x- = 50

Central Limit Theorem


Consider a random sample of n observations selected
from a population (any probability distribution) with
mean and standard deviation . Then, when n is
sufficiently large, the sampling distribution of x will be
approximately a normal distribution with mean x
and standard deviation x n . The larger the
sample size, the better will be the normal approximation
to the sampling distribution of x .

Central Limit Theorem


As sample
size gets
large
enough
(n 30) ...

x
n
sampling
distribution
becomes almost
normal.

Central Limit Theorem Example


The amount of soda in cans of a
particular brand has a mean of 12 oz
and a standard deviation of .2 oz. If
you select random samples of 50
cans, what percentage of the sample
means would be less than 11.95 oz?

Central Limit Theorem Solution*


x 11.95 12
z

1.77

.2
Sampling
Standardized Normal
n
50

Distribution

Distribution

x = .03

.0384

=1

.4616

11.95 12

1.77 0

Shaded area exaggerated

Key Ideas
Properties of Probability Distributions
Discrete Distributions
1. p(x) 0
2. p x 1
allx

Continuous Distributions
1. P(x = a) = 0
2. P(a < x < b) = area under curve between a and b

Key Ideas
Normal Approximation to Binomial
x is binomial (n, p)
P x a P z a .5

Key Ideas
Methods for Assessing Normality
1. Histogram

Key Ideas
Methods for Assessing Normality
2. Stem-and-leaf display
1

3389

245677

19

Key Ideas
Methods for Assessing Normality
3. (IQR)/S 1.3
4. Normal probability plot

Key Ideas
Generating the Sampling Distribution of x

You might also like