5
5
5
Introduction
Probability Concepts
Random Variables
Expectation and Moments
Moment Generating Functions
Binomial Distribution
Poisson Distribution
Geometric Distribution
Uniform Distribution
Exponential Distribution
Normal Distribution
Functions of Random Variable
1.1 INTRODUCTION
First of all let us go through some basic concepts in probability. Try to be thorough in the
basics so you may feel comfortable with the subject.
If an experiment is repeated under the same conditions , any number of times, it does not
give unique results but may result in any one of several outcomes. Thus an action which
can produce any result or outcome is called a RANDOM EXPERIMENT. When a
random experiment is performed each time it is called a TRIAL and the outcomes are
known as EVENTS or CASES.
Sample Space : The sample space is an exhaustive list of all the possible outcomes of an
experiment. Each possible result of such a study is represented by one and only one point
in the sample space, which is usually denoted by S.
Examples
Experiment Rolling a die once:
Sample space S = {1,2,3,4,5,6}
Experiment Tossing a coin:
Sample space S = {Heads,Tails}
Experiment Measuring the height (cms) of a girl on her first day at school:
Sample space S = the set of all possible real numbers
An event whose occurrence is inevitable when an experiment is performed is called as
Certain event or Sure event.
An event which can never occur when an experiment is performed is called an Impossible
event. Events may be simple or compound.
For example: In drawing cards from a pack of 52 cards, the chance of getting spade 5 is a
simple event and the chance of getting a king is compound event. Occurrence of getting
two 8 diamond cards is an impossible event.
Favorable Events: The number of cases favorable to an event in a trial is the number of
outcomes which entail the happening of the event.
Equally likely events: The outcomes are said to be equally likely if none of them is
expected to occur in preference to other. Thus two or more events are said to be equally
likely if each one of them has an equal chance of happening.
For example : when a coin is thrown, the head is as likely to turn up as tail. Hence H and
T are equally likely events.
Exhaustive events: Outcomes are said to be exhaustive when they include all possible
outcomes. For example, while rolling a die, the possible outcomes are 1,2,3,4,5 and 6 and
hence the exhaustive number of cases is 6.
Independent Events: Two events are independent if the occurrence of one of the events
gives us no information about whether or not the other event will occur; that is, the events
have no influence on each other.
For example, if a coin is thrown twice, the result of the second throw is no way affected
by the result of the first throw. Thus the events are independent events.
P() = 1 m/n
Note: If P(A) = 1 the event A is called a certain event and if P(A) = 0, the event is called
an impossible event.`
Examples
If a trial results in n cases and m of them are favorable to the happening of the event
then
Definition of probability :
Let S be a sample space associated with an experiment. To each event A, there is a real
number P(A) associated , called the probability of A satisfying the following axioms:
i) P(A) 0
ii) P(S) = 1
iii) If A1,A2,A3,-----are mutually exclusive events, then
P(A1 U A2 U A3 ---- U An) = P(A1) + P(A2) +-------- P(An)
Theorem : The probability of the impossible event is zero, i.e, if is the subset ( event )
containing no sample point , P ( ) = 0
Proof The certain event S and the impossible event are mutually exclusive
But S U = S
Thus P (S ) = P(S) + P( )
Thus P( ) = 0
Thus P (A U ) = P (S)
= 1 [ Axiom (ii)]
i.e. (PA) + P() = 1 [ Axiom (iii)]
Thus P( ) = 1 P (A)
Proof : A is the union of the mutually exclusive events AB and AB and b is the union of
the mutually exclusive events B and AB
Thus P(A) = P(AB) + P(AB ) [ Axiom (iii) ]
AB AB
B
A B
Thus P(A) + P(B) = [P(AB) + P(AB) + P(B) + P(AB)]
= P(A U B ) + P (A B)
Proof:
AB
The addition rule is a result used to determine the probability that event A or event B
occurs or both occur.
The result is often written as follows, using set notation:
P(A U B) = P(A) +P(B) P(AB)
where:
P(A) = probability that event A occurs
P(B) = probability that event B occurs
P(A U B) = probability that event A or event B occurs
P(AB ) = probability that event A and event B both occur
For mutually exclusive events, that is events which cannot occur together:
P(AB) = 0
For independent events, that is events which have no influence on each other:
P(AB) = P(A)P(B)
Example
Suppose we wish to find the probability of drawing either a king or a spade in a single
draw from a pack of 52 playing cards.
We define the events A = 'draw a king' and B = 'draw a spade'
Since there are 4 kings in the pack and 13 spades, but 1 card is both a king and a spade,
we have:
P(A U B) = P(A) +P(B) P(AB)
= 4/52 + 13/52 - 1/52 = 16/52
So, the probability of drawing either a king or a spade is 16/52 (= 4/13).
The multiplication rule is a result used to determine the probability that two events, A and
B, both occur.
The Conditional Probability of an event B, assuming that the event A has happened, is
denoted by P(A/B) and defined as
For independent events, that is events which have no influence on one another, the rule
simplifies to:
P(AB) = P(A)P(B) OR P(A and B ) = P(A).P(B)
That is, the probability of the joint events A and B is equal to the product of the
individual probabilities for the two events.
Proof : We have
P(A/B) = n(AB)
n(B)
= P(B) P(A/B)
= P(A) P(B/A)
Note:
Conditional probabilities P(B/A)and P(A/B) are defined if and only if P(A) 0 and
P(B) 0 respectively.
P(A/A) = 1
If A and B are independent, then P(A/B) = P(A) and P(B/A) = P(B).
If A and B are independent their compliments are also independent.
Theorem: If the events A and B are independent, the events and B (and similarly A
and B )are also independent.
Proof:
The events A B and B are mutually exclusive such that (A B) U ( B) = B
Based on the above discussed theory some problems are illustrated. Most of the examples
are dealing with real life experiments so that it will be interesting for you .
Solution: Since there are 14 numbers from 2 to 15 the total number of numbers is 14. Out
of 14 numbers 2,3,5,7,11,13 are prime numbers. Hence the number of favourable
numbers is 6. Hence the probability that the number chosen is prime is
No: of favorable cases = 6 = 3
Total no: of cases 14 7
Example 2: What is the chance that a) leap year selected at random will contain 53
Sundays b)a non-leap year selected at random will contain 53 Sundays?
Solution:
a) In a leap year, there are 366 days i.e, 52 weeks and 2 days, remaining 2 days can be
any two days of the week. The different possibilities are: Sunday and Monday, Monday
and Tuesday, Tuesday and Wednesday, Wednesday and Thursday, Thursday and Friday,
Friday and Saturday, Saturday and Sunday.
In order to have 53 Sundays, out of remaining two days one must be Sunday.
No: of favorable cases = 2
Total no: of cases = 7
Required Probability = 2/7
b) In a non leap year, there are 365 days i.e, 52 weeks and 1 day. The remaining 1 day
can be any day of the week.
Total no: of cases = 7
There will be 53 Sundays if the remaining one day is Sunday.
No: of favorable cases = 1
Required probability = 1/7.
Example 3: If we draw a card from a well shuffled pack of 52 cards, what is the
probability that the card is either an ace or a king?
Solution:
The events that the card drawn is either an ace or a king are mutually exclusive. Let the
event of the ace drawn be denoted by A and the king drawn be denoted by K. A pack has
4 aces and a kings.
Thus P(A) = 4/52 and P(K) = 4/52
The probability that the card is either ace or a king =
P(A or K) = P(A U K) = P(A) + P(K) = 4/52 + 4/52 = 2/13.
Example 4: A,B and C toss a coin. The one who get the head first wins. What are their
respective chances?
Solution: The probability that A may win in the first round = . He may win in the
second round, after all of them failed in the first round, with the probability = (. ).
= ()3, iin the third round with probability = ()6 ..and so on.
Now by addition theorem the chance of As success = + ()3 + ()6 + - - - - which
is a G P with first term and common ratio = ()2.
Therefore the above expression is = . . = 4/7.
1 - ()3
B may win the first round after As failure with probability = ()( )= , the second
round with probability = ()3 and so on.
Therefore Bs chances of success = + ()3 + - - - = . . = 2/7
1 - ()3
Similarly, the chance of winning the game for C = . 1/8 . = 1/7
1 - ()3
Example 6: A problem in mathematics is given to five students A1, A2, A3, A4 and A5.
Their chances of solving it are 1/6, 1/5, , 1/3, respectively. What is the probability
that the problem will be solved?
Solution:
The probability that A1 fails to solve the problem = 1 1/6 = 5/6.
The probability that A2 fails to solve the problem = 1 1/5 = 4/5.
The probability that A3 fails to solve the problem = 1 1/4 = 3/4.
The probability that A4 fails to solve the problem = 1 1/3 = 2/3.
The probability that A5 fails to solve the problem = 1 1/2 = 1/2.
The probability that the problem is not solved by all five students
=5 4 3 2 1 = 1
6 5 4 3 2 6
Therefore the probability that the problem will be solved 1 1 = 5
6 6
Example 7: A lot consists of 10 good articles, 4 with minor defects and 2 with majoe
defects. Two articles are chosen from the lot at random (without replacement). Find the
probability that i) both are good, ii)both have major defects, iii) at least 1 is good, iv) at
most 1 is good v)exactly 1 is good vi)neither has major defects and vii) neither is good?
Solution:
Since the articles are drawn without replacement, we can consider that both the articles
are drawn simultaneously.
i) P(both are good) = No : of ways of drawing 2 articles from good articles
Total no : of ways of drawing 2 articles.
= 10C2 = 3
16C2 8
ii) P(both have major defects) = No : of ways of drawing 2 articles with major defects
Total no : of ways of drawing 2 articles.
= 2C2 = 1
16C2 120
iii) P(atleast 1 is good) = P (exactly 1 is good and 1 is bad or both are good)
= 10C1 x 6C1 + 10C2 = 7
16C2 8
vi) P(neither has major defects) = P(both are non-major defective articles)
= 14C2 = 91
16C2 120
Example 8: A and B throw alternately with a pair of dice. A wins if he throws (sum of
numbers on top two faces) 6 before B throws 7, and B wins if he throws 7 before A
throws 6. If A begins, find his chance of winning.
Solution:
Example 9: A box contains 4 bad and 6 good tubes. Two are drawn out from the box at a
time. One of them is tested and found to be good. What is the probability that the other
one is also good?
Solution:
Let A= one of the tubes drawn is good and B = the other tube is good..
P(AB) = P(both tubes drawn are good)
= 6C2 = 1
10C2 3
With the condition that one is good, we have to find the conditional probability that the
other tube is also good is required. i.e P(B/A) is required.
P(B/A) = P(AB) = 1/3 = 5
P(A) 6/10 9
Example 10: The odds against the student X solving a problem in mathematics are 8:6
and odds in favour the student Y solving the same problem are 14:16 i) What is the
chance that the problem will be solved if they both try? ii) what is the probability that
both working independently will solve the problem? Iii) What is the probability that
neither solves the problem?
Solution:
Let A be the event that the student X solves the problem, and B be the evnt that the
student Y solves the problem. Then by data
P() = 8/14 and P(B) = 14/30.
Thus, P(A) = 1 - P() = 1 8/14 = 6/14
P( B ) = 1 P(B) = 1 14/30 = 16/30
i)Probability that the problem will be solved
= P( any one solves the problem)
= P(A or B)
= P(A) +P(B) P(AB) = P(A) +P(B) P(A)P(B)
= 6/14 + 14/30 (6/14 x 14/30)
ii) Probability of solving the problem if they work independently is P(A and B)
= P(AB) = P(A)P(B) = 6/14 x14/30
iii) Probability that both will not solve the problem is
= P( A and B ) = P(A)P(B) = 1/14 x 16/30
1.3.4 Theorem of total probability
If B1,B2, - - - Bn be a set of exhaustive and mutually exclusive events, and A is another
associated with Bi , then
n
P(A ) = P(Bi)P(A /Bi)
i=n
Solution:
Let E1, E2, E3 be the events that the bolts are manufactured by machines A, B,C
respectively.
Hint: given A turns out twice as many bolts as B and machines B and C produce equal
number of bolts.
P(A) = 2P(B) & P(B) = P(C)
But P(A) + P(B) + P(C) = 1
2P(B) + P(B) + P(B) = 1
4P(B) = 1
P(B) =
P(C) =
P(A) =
Therefore P(E1) = , P(E2) = , P(E3) =
Example 2: The contents of urns I, II, and III are as follows: 2 white, 3blacks and 4 red
balls; 3 white , 2 black and 2 red balls and 4 white, 1 black and 3 red balls. An urn is
chosen at random and two balls are drawn. They happen to be white and red. What is the
probability that they come from urns I, II, and III?
Solution:
Let E1, E2 and E3 denote the events that the urn I, II, and III be chosen respectively and
let A be the event that the two balls taken from the urn be white and red.
Example 3: An urn contains5 white and 3 green balls and another urn contains 3 white
and 7 green balls. Two balls are chosen at random from the first urn and put into the
second urn . Then a ball is drawn from the second urn. What is the probability that it is a
white ball?
Solution:
The two balls drawn from the first urn can be
i)both white which is denoted byE1
ii) both green which is denoted byE2
iii)one white and one green which is denoted by E3
P(E1) = 5C2 = 5
8C2 14
P(E2) = 3C2 = 3
8C2 28
P(E3) = 5C13C1 = 15
8C2 28
After the balls are transferred from the first urn to the second urn, the second urn will
contain
i) 5 white and 7 green balls
ii) 3 white and 9 green balls
iii)4 white and 8 green balls
Let A be the event of drawing a white ball from the second urn. Then
= 125 = 0.372
336
Example 4: A toy is rejected if the design is faulty or not. The probability that the design
is faulty is 0.1 and that the toy is rejected if the design is faulty 0.95 and otherwise 0.45.
If a toy is rejected, what is the probability that it is due to faulty design?
Solution:
Let D1, D2 denote the events that the design is faulty or not. Let A denote the event that
the toy is rejected.
Solution :
Example: A die is tossed until 6 appears. What is the probability that it must be tossed
more than 4 times.
Solution:
P(X = x) = (1/6) (5/6)x 1 x = 1,2,3, - - - -
P(x>4) = 1 P(x 4)
4
= 1 - (1/6) (5/6)x 1
x=1
= 1 (1/6)[1 + 5/6 + 25/36 + 125/216] = 0.48225
a) Tossing a coin.
b)Rolling a single 6-sided die.
c)Choosing a marble from a jar.
d)All of the above
2) Which of the following is an outcome?
3)Which of the following experiments does NOT have equally likely outcomes?
4)A number from 1 to 11 is chosen at random. What is the probability of choosing an odd
number?
a)
b)
c)
1. From a bag containing 3 red and 2 black balls, 2 balls are drawn at random.
Find the probability that they are of the same colour?
(Solution:2/5)
2. Event A and B are such that P(A + B) =3/4, P(AB) = and P() = 2/3 find
P(B).
(Solution:2/3)
3. In a random experiment P(A ) = 1/12, P(B) = 5/12 and P(B/A) = 1/15.
Find P (AUB)
(Solution: 89/180)
4. If P(A) = 0.5, P(B) = 0.3 and P(AB) = 0.15, find P(A/ B )
(Solution: 0.5)
5. Probability that India wins a cricket match against Australia is known to be 2/5.
If India and Australia play 3 test 3 matches what is the probability that i) India
will loose all the three matches ii) India will win all the tests iii) India will win
atmost one match.
(Solution: i) 27/125 ii) 98/125 iii) 8/125 iv) 81/125 )
6. Two weak students attempt to write a program. Their chances of writing the
program successfully is 1/8 and 1/12 and the chance of making a common error
is 1/10001.Find the chance that the program is correctly written.
(Solution: 0.9924)
7. A fair dice is rolled 5 times. Find the probability that 1 shows twice, 3 shows
twice and 6 shows once. (Solution: 0.0039)
9. Urn I has 2 white and 3 black balls, urn II contains 4 white and 1 black balls and
urn III contains 3 white and 4 black balls. An urn is selected at random and is
found to be white. Find the probability that urn I was selected.
(Solution: 14/57)
1.4 RANDOM VARIABLES
You have seen a number of examples of sample spaces associated with various
experiments. In some, the outcomes were numerical and in some others the outcomes
were non numerical.
For example in the experiment concerned with tossing a dice, we may have any one of
the following outcome as 1,2,3,4,5 and 6 which is numerical in nature. While the result of
a coin tossing experiment in which a coin is tossed once, we have the outcome and head
or tail which is non numerical in nature.
Random Variable Let E be an experiment and S a sample space associated with the
experiment. A function X assigning to each element s S, a real number X is called a
random variable.
The set of values which the random variable X takes is called the spectrum of the random
variable.
In the finite cases the list terminates. In the countably infinite cases, the list continues.
Let x1,x2 . be possible values of a discrete random variable X. Then P(xi ) is called the
probability function or probability mass function or point probability function of the
discrete random variable X if
i) P(xi) 0 or i=1,2
ii) P(xi) = 1
i
The collection of pairs ( xi, P(xi)), i=1,2. is called the probability distribution.
1.4.1.2 Continuous Random Variable A random variable X is called a continuous
random variable if X takes all its possible values of an interval (or) equivalently. If the
range space Rx of the random variable X is an interval or a collection of intervals, X is
called continuous random variable.
ii) f(x) dx = 1
Rx
Where Rx is the range of X
1.4.1.3 Properties
2) f(x) dx = 1 or f(x) dx = 1
Rx
b
3) P( a < x < b ) = f(x) dx
a
4) Probability at a particular point is zero. i.e it is impossible that a continuous random
a
variable assumes a specific value since, P(X = a) = P ( a X a ) = f(x) dx =0
a
This means that it is almost impossible that a continuous random variable assumes a
specific value. Hence
If X is discrete,
F(x) = Pj
j
xj x
1.4.2.1 Properties
2) F( - ) = 0 and F( ) = 1
Solution:
pj = + + = 1
j=1
Hence the total probability is 1
Example 2: Consider a random experiment of tossing three times a fair coin. Let X
denote the number of tails and Y denotes the number of consecutive tails.
Find
i)The probability distribution of X and Y
ii)the distribution function of x
iii)the probability distribution of X + Y and XY.
Solution:
The sample space of the random experiment is
Each element of S occurs with probability 1/8. The values of X, Y, X + Y and XY for
each outcome is tabulated.
X 0 1 1 2 1 2 2 3
Y 0 0 0 2 0 0 2 3
X+Y 0 1 1 4 1 2 4 6
XY 0 0 0 4 0 0 4 9
i)Probability distribution of X
Value of X, x 0 1 2 3
ii)Probability distribution of Y
Value of Y, y 0 2 3
Value of X+Y 0 1 2 4 6
iv)Probability distribution of X + Y
Value of XY 2 4 9
v) Probability distribution of XY
Solution:
Given f(x) = cx,2, 0<x<3
= 0, otherwise
By the property of pdf
f(x)dx = 1
-
3 3
cx,2dx = 1 => [cx3/3] = 1 => 9c =1 => c = 1/9.
0
0
The distribution function is given by
x
F(x) = P(Xx) = f(x)dx
-
x
When x0, F(x) = P(X0) = f(x)dx = 0 since f(x) is not defined for x<0.
-
0 x x x
When 0<x<3, F(x) = P(Xx) = f(x)dx + f(x)dx = x 2/9 dx = x3/27] = x3/27.
0
- 0 0
When x3
0 3 x 3 3
F(x) = P(Xx) = f(x)dx + f(x)dx + f(x)dx = x 2/9 dx = x3/27] = 1
0
- 0 3 0
F(x) = 0, x0
3
x /27, 0<x<3
1, x 3.
2 2 2
P(1<x<2) = f(x)dx = x2/9dx = 1/9(x3/3) = 7/27.
1
1 1
X : 0 1 2
p(x) : 3c2 4c 10c2 5c - 1
Find i)the value of c ii) P(0<X<2/X>0) and iii) the distribution function of X iv)the
largest value of X for which F(x) < and v) largest value of X for which F(x) >
Solution:
i)since p(x) = 1, 3c2 + 4c 10c2 + 5c -1 = 1
7c2 -9c + 2 = 0
c= 2/7, 1
The value c = 1makes some p(x) negative which is meaningless.
Therefore c = 2/7
ii) P(0<x<2/x>0)
We know P(A/B) = P(AB)
P(B)
P(0<x<2/x>0) = P(0<x<2 x>0) = 4c 10c2 = 4c 10c2 where c =2/7
P(x>0) P(x=1) + P(x=2) 4c 10c2 + 5c - 1
= 4c 10c2 = 8/7 - 40/49 = 16/37
10c2 + 9c 1 -40/9 +18/7 -1
v) Similarly form the above cdf it is clear that the largest value of X for which F(x) > is
x=1
Solution:
Let P(X =j) = pj
pj = +1/22 1/23 + - - - - - which is geometric series
j =1
= = 1
1-
The mean of X is defined as E(X) = j pj
j =1
Therefore E(X) = a +2a2 +3a3 + - - - - where a = 1/2
= a(1 + 2a + 3a2 + - - - - )
= a(1 - a)-2
= = 2.
()2
Where [E(X)]2 = j2 pj
j=1
[E(X)] = j2 aj , where a = 1/2
2
j =1
= [j(j + 1) j] aj = j(j + 1) aj - jaj
j=1 j =1 j =1
= a x 2(1-a)3 a x (1-a)-2
= 2a - a (where a =1/2)
3
(1-a) (1-a)2
= 8-2 = 6
= P(X = 2 )+P(X = 4) + P( X= 6) +- - - - - -
= P(X = 5 )+P(X = 6) + P( X= 7) +- - - - - -
= ()5
1-
= 1/16
= P(X = 3 )+P(X = 6) + P( X= 9) +- - - - - -
= 1/8
1 1/8
= 1/7
Example 6: The diameter of an electric cable X is a continuous RV with pdf
f(x) = kx(1 -x), 0X1. Find i)the value of k, ii)cdf of X, iii) the value of a such that
P(X<a) = 2P(X>a) and iv) P(X/1/3<X<2/3)
Solution
i) If f(x) is pdf f(x)dx = 1
1 1 1
kx(1 -x)dx = 1 => k(x x2)dx = 1 => k[x2/2 - x3/3] = 1
0
0 0
When x1 0 1 x 1 1
F(x) = P(Xx) = f(x)dx + f(x)dx + f(x)dx = k(x x2) dx = k[x2/2 - x3/3 ]
0
- 0 1 0
= 1
Therefore cdf is given by
F(x) = 0, x0
3x2 2x3, 0x1
1, x 1.
a 1
=> f(x)dx = 2 f(x)dx
0 a
a 1
=> k(x x2)dx = 2 k(x x2)dx
0 a
a 1
2 3 2 3
=> k[x /2 - x /3 ] = 2 k[x /2 - x /3 ]
0 a
iv) P(0X1/2 1/3 < X < 2/3) = P(1 /3 < X < 1/2)
P(1 /3 < X < 2/3) P(1 /3 < X < 2/3) 0 1/3 2/3 1
1/2 1/2
k(x x2) dx [x2/2 - x3/3 ] 1/8 1/24 - 1/18 + 1/81
1/3
1/3 = =
2/3
k(x x2) dx [x2/2 - x3/3 ] 4/18 - 8/81 -1/18 +1/81
1/3
1/3
= (13/24) / (13/162) =
Example 7: Suppose that the life of a certain radio tube (in hours) is a continuous R V
with pdf
Solution:
The distribution function is given by P(Xx)
When x 100
F(x) = P(Xx) = 0 since f(x) is 0
When x>100 x x x
F(x) = P(Xx) = f(x)dx = 100/x2 dx = 100 [-1/x] = 1 - 100/x
100
100 100
F(x) = 0 , x 100
= 1 - 100/x , x > 100
iii)The probability that a tube will not last for 150 hours = 1-1/3 = 2/3.
Probability that two tubes last for 150 hours = (1/3)2.Since there are 3C1 ways of
choosing 1 tube from the 3 tubes, the required probability that exactly after 150 hours
= 3C1(1/3)2(2/3) = 2/9.
v)Suppose there are n tubes. The probability that all the n tub4es are functioning after
150 hours is(2/3)n. Since this probability is given t be 0.1, we have (2/3)n = 0.1. if we
substitute values for n we can see that n = 5 is the maximum value of n that satisfies the
equation. Hence 5 tubes are to be inserted so that all of them are functioning after 150
hours.
Solution:
The points x = 0,1/2 and 3 are points of continuity
we know that if X is a continuous random variable
d F(x) = f(x)
dx
Although the points x = 1/2 , 3 are points of discontinuity for f(x), we may assume that
f(1/2) = 3/5 and f(3) = 0.
P(|X|1) = P(-1x1)
1 1
= f(x)dx = 2x dx + 6/25(3 - x)dx = 13/25
-1 0
If we use property of cdf
P(|X|1) = P(-1x1) = F(1) - F(-1) = 13/25
TRY YOURSELF !
X: 0 1 2 3 4
P(X): K 3K 5K 7K 9K
1.5.1 Expectation
If X is a discrete random variable, then the expected value or the mean value of g(x)
defined as
E (g(x)) = g(xi) pi
i
Where,
pi = P(X = xi) is the probability mass function of X.
= x f(x) dx if X is continuous
Rx
Var(x) = x2 = E ((x- x )2 )
= (xi - x) 2 pi , if X is a discrete
Note:
E(a) = a
E (aX) = a E (X)
E (aX + b) = a E (X) + b
Var (X) = E(X2) (E(X))2
Var (aX) = a2 Var(X)
Var (aX + bY) = a2 Var(X) + b2 Var(Y)
1.5.2 Moments:
If X is a discrete or continuous random variable the n th moment about the origin
is defined as the expected value of the n th power of X and is denoted by n'
n' = E (Xn)
= xi n pi
= x n f(x) dx
-
The n th central moment of a random variable X is its moment about its mean
value X and is defined as
E[(X - X)n]
= (xi X )n pi = n , if X is discrete
= (x- x )n f(x) dx = n , if X is continuous
-
Since the first and second moments about the origin are given by 1' = E (X) and
2' = E (X2) we have
mean = first moment about the origin
Var(X) = 2' (1')2
Note:
E(|X|n) and E ( |X - x|n ) are called absolute moments of X.
1' = d MX (t)
dt t=0
2' = d2 MX(t)
dt2 t=0
n' = dn MX(t)
dtn t=0
Theorem: McX(t) =MX(ct) where c is a constant.
Proof:
By definition of m g f
McX(t) = E(etcX) = E(ectX) = MX(ct)
Theorem: The m g f of the sum of independent random variables is equal to the product
of their respective m g fs. i.e If X1, X2, - - - Xn are n independent random variables then
the m g f of X1+ X2 + - - - + Xn is given by
M X1+ X2 + - - - + Xn (t) = M X1(t)M X2(t) - - -M Xn (t).
Proof:
By the definition of ,m g f
M X1+ X2 + - - - + Xn (t) = E [et (X1+ X2 + - - - + Xn )]
= E [etX1 et X2 - - - et Xn )]
= E [etX1][ et X2] - - - [et Xn )]
= M X1(t)M X2(t) - - -M Xn (t).
Proof:
MU(t) = E[etU] = E[et(X - a)/h]
= E[et(X/h e-at/h]
= e-at/h E[et(X/h]
= e-at/h MX(t/h)
Example1:
Find the m g f for f(x) = 2/3 at x = 1
1/3 at x = 2
0 otherwise
And also find 1' & 2'
Solution:
MX(t) = etx f(x) = et(2/3) + e2t(1/3)
Solution:
By definition
tX
MX(t) = E(e ) = etx f(x) dx ,
-
b
= etx (1/b-a) dx , = etb - eta
a t (b- a)
= d 1 + t (b + a) + t2(b2 + ab + a2) + - - -
dt 2! 3! t=0
= 0 + (b + a) + remaining terms will contain t in the numerator
2
= d2 1 + t (b + a) + t2(b2 + ab + a2) + - - -
dt2 2! 3! t=0
= 2 (b2 + ab + a2)
3!
= (b - a)2
12
Another method to find mean and variance:
Solution:
2 2
MX(t) = e f(x)dx = 1/3 e dx = 1/3 [e /t] = 1/3[e2t/t e-t/t] = e2t - e-t , t 0
tx tx tx
-1
- -1 3t
When t = 0 MX(0) = 0
0
Applying LHospital rule
Example 4:
Lt X be the random variable which assumes the value x with the probability
P(X = x) = qx-1p, x = 1,2,- --. Find the m g f of X and find its mean and variance.
Solution:
MX(t) = etx f(x) = etx qx-1p = p (qet)x = p (qet) (qet)x-1
x=1 q x =1 q x =1
= (1 - qet)2 [(1- qet) pet + pet(-qet) + 2pqe2t] {[(1- qet) pet + pqe2t)]2(1 - qet) (-qet)}
(1 - qet)4 t=0
Example 5:
Let X have the probability mass function
f(x) = 6 1 , k = 1, 2,- - -
2 k2
0 otherwise.
Find the m g f ?
Solution:
tx
MX(t) = e f(x) = etk 6 1 (as k , etk )
k=1
2 k2
MX(t) is infinity for any t>0
TRY YOURSELF !
1. Find the mgf for the distribution where
f(x) = 2/3 at x = 1
= 1/3 at x = 2
= 0, otherwise
Also find the mean and variance?
(Solution: et 2/3 + e2t 1/3, 4/3, 2)
Many random processes can be mapped into two outcomes. For example:
win or lose
success or failure
heads or tails
Suppose I am at a party, and I ask a girl to dance. There are two outcomes. She can agree
to dance with me, or she can turn me down.
At the party, I can ask any number of girls to dance with me. Each girl has the same two
choices: agree to dance with me, or turn me down.
Assume that each girl's decision is independent of one another, and that my probability of
success with each girl is the same. These are the iid assumptions that we described at the
end of the last lecture.
Let n be the number of girls I ask to dance. Let X be the number of girls who agree to
dance with me. We say that X is a binomial random variable, because it is the sum of
binary outcomes.
Suppose that when I ask a girl to dance with me, the probability that she will agree to do
so is .15. We call this p, the probability of success.
The distribution of the random variable X--my overall number of dance partners--will
depend on the number of girls I ask, n, and my probability of success, p.
Such a trial is know as a Bernoulli trial. Some examples of Bernoulli trials are
A random variable X which takes only 2 values 0 and 1 with probability q and p
respectively ie P ( X = 1 ) = p, P (x = 0 ) = q. Then q = 1 p is called a Bernoulli variable
and X is said to have Bernoulli distribution. P is called the parameter of Bernoulli
variate.
A Bernoulli random variable with parameter rp has probability mass function given by P
( X x ) = px q 1 x where x = 0,1 .
If X1 and X2 are 2 input binomial random variable with parameter ( p1, n1 ) and p1, n2 )
then X1 + X2 is a Binomial random variable with parameters ( p, n1 + n2)
E(X) = xipi
n
= x. nCxpxqn-x
x=0
n
= x. n! pxqn-x (1)
x = 0 x!(n - x)!
n
= np (n-1)! px-1q(n-1) (x 1)
x = 0 (x-1)!{(n - 1) (x - 1)}!
n
= np (n - 1)Cx 1. px-1q(n - 1) (x - 1)
x=1
= (q + p)n-1
= np. (2)
n
2 2
E(X ) = xi pi = x2px
0
n
= {x(x-1) + x} n! pxqn-x
x=0 x! (n - x)!
n n
= x(x-1) n! pxqn-x + x n! pxqn-r
x=0 x! (n - x)! x = 0 x! (n - x)!
n
= n(n - 1)p2 (n - 2)! px-2q(n - 2) (x - 2) + np
x = 2 (x - 2)!{(n-2) (x - 2)}!
n
= n(n - 1)p2 (n - 2)Cx-2px-2 qn-x + np
x=2
= n(n - 1)p2 + np
n
= n Cxr(pet) xqn-x
x=0
= (pet +q)n
1.7.1.5 Recurrence formula for the central Moments of the binomial distribution.
By definition , the kth order central moment k = E{X E(X)}k.
For the binomial distribution B(n,p)
n
k = (x - np)kn Cxpxqn-x (1)
x=0
= -nk-1 + 1 k+1
pq
using recurrence relation (2) we may compute moments of higher order, provided
moments of lower order are known
Putting k = 1 in (2), we get
2 = pq d1 + n0
dp
= pq d [np(1 - p)]
dp
= npq[1 2p] = npq(q p)
P(X = x) = nCxpxqn-x
P(X = x + 1) = nCx+1px+1qn-(x+1)
P(X = x + 1) = nCx+1px+1qn-(x+1)
P(X = x) nCxpxqn-x
Example 1:
For a binomial distribution with parameter n = 5 and p, the probability of success = 0.3,
find the probabilities of getting i) at least 3 success ii) at most 3 success iii)exactly 3
failures.
Solution:
Let X denote the success.
Probability distribution is given nCxpxqn-x
Given n = 5. p= 0.3, q = 1 p = 1 0.3 = 0.7
i)the probability of atleast 3 successes
= P(X = 3) + P(X = 4) + P(X = 5)
= 5C3(0.3)3(0.7)2 + 5C4(0.3)4(0.7)1 + 5C5(0.3)5(0.7)0
= 0 .1631
npq = 4
np 5
q = 4/5, p = 1 q = 1 4/5 = 1/5
We have np = 5, but p = 1/5
Therefore n = 25
Hence the Binomial distribution is nCxpxqn-x = 25Cx(1/5)x(4/5)n-x, x = 0,1,2, - - - -25
Example 3:
The mean and variance of a Binomial variate are 8 and 6.Find P(X2).
Solution:
Given np = 8 and npq = 6.
npq = 6 = 3
np 8 4
Therefore q = , hence p = 1 =
But np = 8 i.e, n = 8 => n = 32
The Binomial distribution is nCxpxqn-x = 32Cx()x( )32 x, x = 0,1,2, - - -32
P(X2) = 1 [p(0) + p(1)]
= 1 - ()31 +3 2/4
= 1 (35/4)( )31
Example 4:
If on an average one vessel in every ten is wrecked, find the probability that out of five
vessels expected to arrive, at least four will arrive safely.
Solution:
Let p denote the probability that a vessel will arrive safely
Then p = 9/10 and q = 1/ 10
= 5C4(9/10)4(1/10) + (9/10)5
= 0.9185
Example 5:
In a Binomial distribution consisting of 5 independent trials, the probabilities of 1 and 2
successes are 0.4096 and 0.2048 respectively. Find the parameter p of the distribution.
Solution:
The Binomial distribution is p(x) = nCxpxqn-x
p(1) = 5C1pq4 = 0.4096 (1)
p(2) = 5C2p2q3 = 0.2048 (2)
Example 6:
If X and Y are independent Binomial variables with parameters B1(5, )
And B2(7, ), find P(X + Y = 3)
Solution:
Since X and Y are independent binomial variables with parameters (5, ) and (7, ),
(X + Y) is a binomial variable with parameters (12, )
Therefore the probability distribution of the binomial variable X + Y is given by,
P[X + Y = x] = 12 Cx ()x()12-x
P[X + Y = 3] = 12 C3 ()3()9 = 55/ 704
Example 7:
Ten coins are tossed simultaneously, find the probability of getting i) at least seven heads
ii) exactly seven heads iii) at most seven heads.
Solution:
n = 10
p = probability of head in the toss of a coin =
q = 1 p = 1 =
p(x) = nCxpxqn-x
p(x) = 10Cxpxq10 x
p(x) = 10Cx()x()10-x
= 10Cx (1/210)
i)Probability of getting at least 7 heads
P(x7) = P(X = 7) + P(X = 8) + P(X = 9) + P(X = 10)
= ()10 [10C7 + 10C8 + 10C9 + 10C10 ]
Example 8:
If 10% of the screws produced by an automatic machine are defective, find the
probability that out of 20 screws selected at random, there are
i)exactly two defectives
ii)at most three defectives
iii)at least two defectives
iv)between one and three defectives(inclusive).
Find also mean and variance.
Solution:
p = probability that a screw is defective
= 0.1
q = 0.9
p(x) = 20Cx(1/10)x(9/10)20-x x = 0,1,2, - - - 20
i)probability exactly two are defectives
p(x) = 20C2(1/10)2(9/10)18
= 190 .918
1020
ii) probability that there are at most three defectives
= P(X = 0) + P(X = 1) + P(X = 2) + P(X = 3)
= (9/10)20 + 20C1(1/10)(9/10)19 + 20C2 (1/10)2(9/10)18 + 20C3(1/10)3(9/10)17
= (9/10)17 [(9/10)3 + 20(1/10)(9/10)2 + 190 (1/10)2(9/10) + 1140(1/10)3]
= 917/1020 [729 + 1620 + 1710 + 1140]
iii)Probability that there are atleast two defectives
P(X2) = 1 P(X<2)
= 1 [P(X = 0) + P(X = 1)]
= 1 [ (9/10)20 + 20C1(1/10)(9/10)19]
= 1 (919/1020)(9 +20)
= 1 (919/1020)(29)
iv)Probability that the number of defectives is between one and three (inclusive)
P(1X3) = P(X = 1) + P(X = 2) + P(X = 3)
= 20C1(1/10)(9/10)19 + 20C2 (1/10)2(9/10)18 + 20C3(1/10)3(9/10)17
= (917/1020)(4470)
v)Mean = np = 20 .(1/10) = 2
variance = npq = 20.(1/10)(9/10) = 9/5
Solution:
p(x) = nCxpxqn-x
i)Exactly 5 attempts will be required to get 3 successes, if 2 successes occur in the first
four attempts and third success in the fifth attempt.
Therefore P(exactly 5 attempts are required )
= P(2 successes in 4 attempts) x P(success in the single fifth attempt)
Now, P(2 successes in 4 attempts = 4C2p2q2 (since it follows binomial distribution)
P(success in the single fifth attempt) = p (given)
Therefore P(exactly 5 attempts are required ) = 4C2p2q2 x p = 6p3q2.
iii)Five attempts will be required to get 3 consecutive successes, if the first 2 attempts
result in failures and the last 3 attempts result in success.
Therefore required probability = q.q.p.p.p = q2p3
Three attempts will be required to get 3 consecutive success, if each attempt result in a
success.
Therefore required probability =p.p.p = p3
Four attempts will be required to get 3 consecutive success, if first attempt result in a
failure and the remaining attempts result in a success each.
Therefore required probability = q.p.p.p = qp3
Therefore, if the launching attempts are made until 3 consecutive successful launchings
occur, probability = q2p3 + p3 + qp3 = p3 (1 + q)
x: 0 1 2 3 4 5 6 total
f: 5 18 28 12 7 6 4 80
Solution:
To find the binomial distribution which fits the given data, we require N, n and p. We
assume N = total frequency = 80 and from the given data n = 6.
To find p :
We know mean = np. From the given data we will find the mean and then equate to np
so that we can find the value of p.
x: 0 1 2 3 4 5 6 total
f : 5 18 28 12 7 6 4 80
fx: 0 18 56 36 28 30 24 192
mean = fx = 192
f 80
= 2.4
i.e, np = 2.4, hence p = 0.4 and q = 0.6
Thus we have
x: 0 1 2 3 4 5 6
f: 3.73 14.93 24.88 22.12 11.06 2.95 0.33
Converting these values into whole numbers consistent with the condition that the total
frequency distribution is 80, the corresponding binomial frequency distribution is as
follows
x: 0 1 2 3 4 5 6
Theoretical f: 4 15 25 22 11 3 0
Example 11: Assume that half of the population is vegetarian so that the chance of an
individual being a vegetarian is . Assuming that 100 investigators take samples of 10
individuals each to see whether they are vegetarians, how many investigators would you
expect the reports that 3 people or less were vegetarians.
]
Solution
p = probability that an individual is a vegetarian =
q=1p=
N = Number of investigators
The expected number of investigators reporting x persons as vegetarians
= N. n Cx px q n x
= 100. 10 Cx ( ) x . ( ) 10 x
= 100. 10 Cx ( ) 10
= (100/1024) [ 1 + 10 + 45 +120 ]
Example 12: In a certain town, 20% samples of the population is literate and assume that
200 investigators take samples of ten individuals to see whether they are literate. How
many investigators would you expect to report that three people or less are literates in the
samples.
Solution
p = probability that an individual is literate = 20% = 0.2
q = 1 p = 0.8
n = 10
p (x) = 10Cx (0.2)x ( 0.8 ) 10 x
= 200 [ (0.8 )10 + 10C1 (0.2) (0.8)9 + 10C2 (0.2)2 (0.8)8 + 10C3 (0.2)3 (0.8)7 ]
= 176
Example 13An irregular six faced die is thrown and the probability that in 10 throws it
will give five even numbers is twice the probability that it will give four even numbers.
How many times in 10,000 sets of 10 throws would you expect it to give no even
numbers.
Solution
Let p = probability of getting an even number in a throw of a die.
10 -5 + 1 . p = 2q . nCr = nr+1
5 nCr-1 r
6p = 10 ( 1 p )
p= 5 , q= 3
8 8
= 10,000 ( 3 /8 )10 = 1
The Poisson distribution is used to model the number of events occurring within a given
time interval.
1.7.2.2 Properties:
The number of outcomes occurring during a time interval is independent of the number
that occurs in any other disjoint time interval. So it is memory less.
The probability that a single outcome will occur during a very short time interval is
proportional to the length of the time interval. It does not depend on the number of
outcomes that occur outside this time interval.
The probability that mere then one outcome will occur in such a short time interval is
negligible.
E(X) = xr pr
x
= x e- x (1)
x=0 x!
= e- x - 1
x = 0 (x - 1)!
= e- e = (2)
E(X2) = xx2 px
x
= {x(x - 1) + x} e- x
x=0 x!
= 2 e- x 2 +
x = 2 (x - 2)!
= 2e- e + = 2 +
= e- [ 1 + et + (et)2 + --]
1! 2!
t
- ( e )
= e e or e - exp(et) = exp((et -1))
P(X = x + 1) = .
P(X = x) (x +1)
If X and Y are two independent Poisson Random Variables with means 1 & 2
Respectively, then X + Y is also a Poisson Random Variable with mean 1 + 2.
Proof:
If X is a binomially distributed r v with parameters n and p then,
P(X = x ) = nCx pxqn-x, x = 0,1,2,- - -
Solution
To find the poisson distribution which fits the given data, we require N and . We assume
N = total frequency = 400
To find :
We know mean =
. From the given data we will find the mean and hence .
x: 0 1 2 3 4 5
f : 142 156 69 27 5 1
fx: : 0 156 138 81 20 5
mean = fx = 400 = 1
f 400
= 1
Using Poisson
ii) since p = 0.05 is very small and n = 20 is sufficiently large, binomial distribution
may be approximated by poisson distribution with parameter
= np = 1
therefore P(X = x) = e- x = e -1
x! 1!
Example 4: Assume that the number of planes crossing Indian border during war
between 5p.m and 6p.m is a Poisson random variable with parameter 3 and the number
between 6p.m and 7p.m is a Poisson random variable with parameter 4. If these two
random variables are independent what is the probability that more than 5 planes cross
the border between 5p.m and 7 p.m?
Solution:
Let X1 be the number of planes crossing the border between 5p.m and 6p.m
X2 be the number of planes crossing the border between 6p.m and 7p.m. Since X1 and X2
are independent Poisson random variables with parameters 3 and 4 respectively, X1 + X2
is a Poisson random variable with parameter 7.
P(X1 + X2 > 5) = 1 P(X1 + X2 5)
5
= 1- e-7 7x
x = 0 x!
= 1 0.3007 = 0.6993
Example 5: After correcting the proofs of the first 50 pages of a book, it is found that on
the average there are 3 errors per 5 pages. Use Poisson probabilities and estimate the
number of pages with 0,1,2,3 errors in the whole book of 1000 pages(e-6 = 0.5488)
Solution:
= mean = average no : of errors per page = 3/5 = 0.6
The probability that there are x errors per page is
p(x) = e- x = e-0.6 (0.6)x , x = 0,1,2 - - -- -
x! x!
Example 7: If X and Y are independent poisson random variables, show hat the
conditional distribution of X, given the value of X + Y, is a Binomial distribution.
Solution:
Let X and Y follow Poisson distributions with parameters 1& 2 respectively.
Now
P[X =x / (X + Y) = n] = P[X = x and(X + Y) = n] = P[X = x ;Y = n x]
P[ (X + Y) = n] P[ (X + Y) = n]
= nCr prqn-r
r n-r
Where p = 1 2
(1 + 2) q= (1 + 2)
Example 8: One-fifth of the blades produced by a blade manufacturing factory turn out
to be defective. The blades are supplied in packets of 10. Use Poisson distribution to
calculate the approximate number of packets containing no defective and two defective
blades in a consignment of 1,00,000 packets.
Solution:
p = (1/5)/100 = .002
n = 10 = np = 0.2
p(x) = e- x = e-0.2 (0.2)x
x! x!
Suppose that I am at a party and I start asking girls to dance. Let X be the number of girls
that I need to ask in order to find a partner. If the first girl accepts, then X=1. If the first
girl declines but the next girl accepts, then X=2. And so on.
When X=n, it means that I failed on the first n-1 tries and succeeded on the nth try. My
probability of failing on the first try is (1-p). My probabilty of failing on the first two tries
is (1-p)(1-p).
P(X = n) = (1 - p)n-1p
Now X= x, if and only if the first ( x-1) trails result in failure ( occurrence of ) and
the xth trial results in success ( occurrence of A ) Hence
P ( X = x ) = q x - 1 p; x = 1,2,3.,
Where P ( A ) = p and P () = q
If X is a discrete RV that can assume the values 1,2,3,.. such that its probability mass
function is given by
P ( X = x ) = q x - 1 p; x = 1,2,3., where p + q = 1
= p ( 1 + q + q2 + . )
p =1
1-q
E(X) = xx Px
x
= xq x 1 p
x=1
= p ( 1 + 2q + 3q2 + ..... + )
= p ( 1 q ) -2 = 1/p
E ( X2 ) = x2 px
x
= x2 q x 1 p
x=1
= p { x ( x + 1 ) x} q x - 1
x=1
= p [{1 X 2 + 3 X 3q + 3 X 4q2 + ..... + } { 1 + 2q + 3q2 + ...... + }]
= p [ 2 ( 1 q ) -3 - ( 1 q ) -2 ]
Var ( X ) = E ( X2 ) - { E (X) } 2
P ( X = x ) = qx p; x = 0,1,2,. where p + q = 1
Mx(t) = E(etx) = etx p qx-1
x=1
= pet (etq)x-1
= pet(1 - qet)-1
= pet
1 - qet
We have
P ( X = x ) = q x-1 p
P ( X = x + 1 ) = qx p
So P ( X = x + 1) = q
P(X=x)
P ( X = x + 1) = q P ( X = x )
Example:1 If the probability is 0.05 that a certain kind of measuring device will show
excessive drift, what is the probability that the sixth of these measuring devices tested
will be the first to show excessive drift?
Solution:
If the sixth device should show excessive drift then there should be 5 failures before the
sixth trial, which is geometric distribution.
P ( X = x ) = q x - 1 p; x = 1,2,3.,
P ( X = 6) = q 6 - 1 p
Given p = 0.05, q = 0.95
P ( X = 6) =(0.95)5 0.05 = 0.039
Example:2 A and B shoot independently until each has hit its own target. The
probabilities of their hitting the target at each shot are 3/5 and 5/7 respectively.
Find the probability that B will require more shots than A.
Solution:
Let X denote the number of trials required by A to get his first success.
p = 3/5 q = 2/5
Then X follows a geometric distribution given by
P ( X = x ) = q x - 1 p; x = 1,2,3.,
= 3/5. (2/5) x 1 ; x = 1,2,3.,
Let Y denote the number of trials required by B to get his first success.
p = 5/7 q = 2/7
Then X follows a geometric distribution given by
P ( Y = x ) = q x - 1 p; x = 1,2,3.,
= 5/7. (2/7) x 1 ; x = 1,2,3.,
Probability that B will require more shots than A. i.e, B requires more trials to get his first
success than A requires to get his first success . i.e, probability of A getting his success
in the xth trial and B getting his success in x + 1th or x + 2th or - - - trial.
= P[X = x and Y = x + 1th or x + 2th or - - - ]
x=1
= P[X = x ].P[Y = x + 1th or x + 2th or - - - ]
x=1
= (3/5)(2/5)x-1 . (5/7)(2/7)x + k -1
x=1 k=1
= (3/7) (2/5)x-1 (2/7) k(2/7)x-1
x=1 k=1
= (3/7) (2/5) (2/7) . (2/7) k
x-1 x-1
x=1 k=1
= (3/7) (4/35)x-1 (2/7) k
x=1 k=1
= (3/7) (4/35)x-1 (2/7)
x=1 1 2/7
= 6/35 (4/35)x-1 = (6/35) 1
x=1 1 - 4/35
= 6/31
Example:3 A die is thrown until 1 appears. Assuming that the throws are independent
and the probability of getting 1 is p, find the value of p so that the probability that an odd
number of throws is required is equal to 0.6. Can you find a value of p so that the
probability is 0.5 that an odd number of tosses is required?
Solution:
Let X denote the number of throws required to get the first success (getting 1). Then the
distribution of X is geometric
P ( X = x ) = q x - 1 p; x = 1,2,3.,
P(X = an odd number) = P( X = 1 or 3 or 5 - - - -)
= P(X = 2x - 1)
x=1
= pq2x-2
x=1
= (p/q ) q2k
2
x=1
= (p/q2)[q2 + q4 + - - ]
= (p/q2)(q2/1-q2) = p/(1 +q)
= pqk = qk.
1 q
P{X>m +n / X>m} = P{X>m +n and X>m}
P{X>m}
Let P(>1) = q
Now P{X = (x +1)} = P(X > x) - P(X > (x +1)) (1)
1. Derive the moment generating functions of all the distributions discussed above.
2. Derive the mean and variance of Poisson distribution.
3. State the additive property of Poisson distribution
Try yourself !
1. Determine the binomial distribution for which the mean is 4 and variance is 3.
(Solution: 16 Cx (1/4)x(3/4)16 x, x = 0,1,2,- - -)
2.A and B playas game in which their chance of winning is in the ratio 3:2. Find As
chance of winning atleast three games out of five games played.
(Solution: 0.68)
3.If X is a poisson variate such that P(x =2) = 9 P(X = 4) + 90 P( X = 6), find the
Variance. (Solution: 1 )
4.A manufacturer of cotter pins knows that 5% of the product is defective. If he sells
cotter pins the boxes of 100 and guarantees that not more than 4 pins will be defective.
What is the approximate probability that a box will fail to meet the guaranteed quality.
(Solution: P(X>4) = 0.5620)
6.If the probability that an applicant for a drivers license will pass the road test on any
given trial is 0.8, what is the probability that he will finally pass the test a) on the fourth
trial b) in fewer than 4 trials?
(Solution: 0.0064, 0.9984)
1.8 CONTINUOUS DISTRIBUTION
1.8.1 UNIFORM OR RECTANGULAR DISTRBUTION
For example, if X is uniformly distributed between 0 and 1, then the probability that X
will be between 0.3 and 0.4 is .1, because there are ten intervals of width .1 each. The
probability of X falling between 0.1 and 0.25 is .15.
If X is uniformly distributed between 0 and 2, then what is the probability that X will fall
between 0.3 and 0.4? Between 0.1 and 0.25?
Uniform distributions do not occur very often in nature. However, random number
generators often are built to simulate the uniform distribution.
We can use a uniform random number generator to determine the winner of a raffle.
Suppose that we have 247 entries, numbered one through 247. We can choose a random
number between 0 and 1, multiply it by 247, and then round it to the the nearest integer in
order to pick the winner.
Moments
b b
r
r' = E(X ) = 1 xr dx = 1 x r+1
= br+1 ar+1
(b a) a (b a) r +1 (r +1)(b a)
A
Example 1:
If X is uniformly distributed over (-3,3), find the probability that a)X<2
b)|x|<2 , c)|X - 2|<2 and find k such that P[X>k] = 1/3.
Solution:
f(x) = 1/(b - a) a < x < b
0, otherwise
f(x) = 1/6 -3 < x < 3
0, otherwise
2 2
a) P(X<2) = 1/6 dx = x/6 = 5/6
-3 -3
2 2
b) P(|X|<2) = P(-2 < x <2 ) = 1/6 dx = x/6 = 4/6 = 2/3
-2 -2
c) P(|X-2|<2) = P(-2 < x-2 <2 ) = P(0 < x < 4) = P(0 < x < 3) [since interval is (-3,3)]
3 3
1/6 dx = x/6 = 3/6 =
0 0
d) P(X>k) = 1/3
3 3
=> 1/6 dx = x/6 = (3 k)/6 = 1/3 (given)
k k
Example 2: A passenger arrives at a bus stop at 10am knowing that the bus will arrive at
some time uniformly distributed between 10am and 10.30am. What is the probability that
he will have to wait longer than 10min? If at 10.15am the bus has not arrived, what is the
probability that he will have to wait atleast 10 additional minutes?
Solution:
Then pdf of X is
= P( X > 10 )
30
= dx= 2
10 30 3
= P ( X > 25 / X > 15 )
= P ( X > 25 X > 15 )
P (X > 15 )
= P ( X > 25 )
P ( X > 15 )
30
= dx
25 30
30
dx
15 30
= 5 = 1
15 3
Example 3: Show that the mgf about origin for the rectangular distribution on ( -a, a ) is
1 sinh at. Also show that moments of even order are given by 2n = a2n and all
at (2n + 1)
moments of odd order vanish ( i.e 2n +1 = 0 )
Solution:
Mgf about origin is given by
a
MX (t) = E [ e ] = 1 etx dx
tX
2a -a
= 1 ( eat e at )
2at
= sin hat
at
= 1 + ( at )2 + (at )4 ..
3! 5!
Since there are no terms with odd powers of t in MX (t), all moments of odd order vanish
i.e 2n +1 = 0. in particular 1' = 0 . Thus r = r' . Hence 2n +1 = 0 . The moments of
even
order are given by 2n +1 = coefficient of t 2n = a2n .
(2n)! 2n + 1
x
Example 4: If RV has the density function f (x ) prove that y = f (x) = f (x) dx has a
-
rectangular distribution over ( 0 , 1 ). If
f (x) = x-1 ,1 x 3
2
0 , otherwise
Determine what interval for Y will correspond to the interval 1.1 X 2.9
Solution:
The RV Y is defined as Y = FX (x) the distribution function of Y is
Fy (y ) = P ( Y y )
= P ( FX (x) y )
= P ( X f -1x (y)
= F x [ F-1x (y ) ] ( since P ( X x ) = Fx ( x) )
= y
Fy (y) = d [ FY (y) ] = 1
dy
Since Y = Fx (x) , Y = 1 ( X 1 )2
4
Thus when 1.1 X 2.9,
1 ( 1.1 1 )2 Y 1 ( 2.9 1 )2
4 4
0.0025 Y 0.9025.
Example 5: Buses arrive at a specified stop at 15 min. intervals starting at 7 a m. i.e. they
arrive at 7, 7:15. 7:30,7:45and so on . If a passenger arrives at the stop between 7 and
7.30 am, find the probability that he waits
a)less than 5 min for a bus and
b)atleast 12 min for a bus.
Solution
Let X denotes the time in minutes past 7 a.m, when the passenger arrives at the stop.
Then X is uniformly distributed over (0,30)
f(x) = 1 0 < x < 30
30
= 0 otherwise
a) The passenger will have to wait less than 5 min. if he arrives at the stop between
7:10 and 7:15 or 7:15 and 7:30.
Therefore the required probability= P(10<x<15) + P(25<x<30)
15 30
= dx + dx = 1/3
10 30 25 30
b)The passenger will have to wait atleast 12 minutes. if he arrives between 7 and 7:03
or 7:15 and 7:18
Therefore the required probability= P(0<x< 3) + P(15<x<18)
3 18
= dx + dx = 1/5
0 30 15 30
Example 6: Let two independent random variables X and Y have the geometric
distribution. Show that the conditional distribution of X/(X + Y= k) is uniform.
Solution:
P(X = x) = P(Y = y) = pqx-1
P{X/(X + Y= k)} = P{X = x and (X + Y= k)}
1.8.2.1Exponential distribution
f(x) = e x x0
0, otherwise
Distribution function:
The distribution function F(x) is given by
x
F(x) = e x dx = 1 - e x
0
Therefore F(x) = 1 - e x x 0
0 otherwise
r
r' = E[x ] = (y/ ) r e x (dy/ )
0
= (1/ r) (r + 1) = r! / r
2' = E(X2) = 2 / 2
0
1
( - t)x
= e = = 1 t
-( - t) -t
Example 1:If X is exponentially distributed prove that the probability that X exceeds its
expected value is less than .
Solution:
Let X be exponentially distributed with parameter . Then
f(x) = e x x0 E(X) = 1/
P(X > 1/) = e x dx = e 1 = 0.3679 < =
1/
= 1/3000 e x/3000 dx
15000
= e 5
P(Stock insufficient for 2 days) = (e 5)2 e 10
Example 4: The mileage which car owners get with a certain kind of radial tire is a
random variable having an exponential distribution with mean 40,000 km. Find the
probabilities that one of these tires will last i) at least 20,000 km and ii) at most 30,000
km.
Solution:
Let X denotes the mileage obtained with the tire.
Mean = 1/ = 40,000 = > = 1/40000
The density function of X is given by
f(x) = e x x0
= 1/40000 e (1/40000)x
i)The probability that one of these tires will last atleast 20,000 km is given by
P(X20,000)
= 1/40000 e (1/40000)x dx
20,000
= e (1/40000)x = e0.5
40000 20,000
ii)The probability that one of those tires will last atmost 30,000 km is given by
P(X30,000) 30,000
= 1/40000 e (1/40000)x dx
0
30,000
= e (1/40000)x = e0.75 +1 = 0.5270
40000
0
Solution:
The density function of X is given by
f(x) = e x x0
If process I is used , given Expected value as 100, i.e. 1/ = 100 = > = 1/100
Then f(x) = (1/100)e (x /100) x0
P(X200)
= (1/100)e (x /100) dx = e 2
200
P(X<200) = 1 - P(X200) = 1 -e 2
If process II is used , given Expected value as 150, i.e. 1/ = 150 = > = 1/150
Then f(x) = (1/100)e (x /100) x0
P(X200)
= (1/150)e (x /150) dx = e 4/3
200
P(X<200) = 1 - P(X200) = 1 -e 4/3
Let C1 and C2 be the costs per fuse corresponding to process I and II respectively.
Then
C1 = 6 X200
46 X<200 (loss of Rs.40 + manufacturing cost Rs. 6)
Similarly,
C2 = 12 X200
52 X<200 (loss of Rs.40 + manufacturing cost Rs. 6)
The normal distribution is almost the opposite of the uniform distribution. The uniform
distribution is mathematically simple but occurs rarely in nature. The normal distribution
is mathematically complex but occurs frequently in nature.
= =
Mean =
E(X ) = x2 f(x) dx
2
-
= 1 x2 exp(- (x - )2/ 2 2) dx
2 -
= 1 ( + 2 t)2exp (-t2)dt (on putting t = x - )
- 2
= 1 exp (-t )dt + 22 t exp (-t )dt + 2 t2 exp (-t2)dt
2 2 2 2
- - -
= 2 + 0 + 2 2 2 t.t exp (-t2)dt ; (since t2exp(-t2) is an even function)
0
= 2 + 2 2 t exp (-t2) 2t dt
0
= 2 + 2 2 u1/2 exp (-u) du
0
2 2
= + 2 (3/2) (by definition of gamma function | n = xn-1e-xdx )
0
= 2+ 2 2 (1/2) | (1/2) ( | n = (n - 1) |(n -1) )
= 2 + 2 ( | (1/2) = )
= 2 + 2
Var (X) = E(X2) [ E(X)]2 = 2
1 exp(- (x - )2/ 2 2) dx + =
M 2
f(x) dx = 0
M
M=
Therefore median =
Note: Have you noticed anything while finding the mean, median and mode?
Yes! For normal distribution mean, median and mode are same
= 1 (x - )r exp(- (x - )2/ 2 2) dx
2 -
= 1 (2 t)r exp (-t2)dt
-
= 2r / 2 r tr exp (-t2)dt
-
Case i) r is an odd integer, that is r = 2n + 1
Therefore 2n+1 = 2(2n +1)/2 2n +1 t2n+1exp (-t2)dt
-
= 0, (since the integrand is an odd function)
2n = 2n2n t2nexp(-t2) dt
-
= 2 2 t2nexp(-t2) dt
n 2n
(since t2nexp(-t2) is an even function of t)
0
(put t2 = u; 2tdt = du)
n 2n
=2 u n- e-u du
0
= 2n 2n |( n+ ) (1)
= 2n 2n (2n -1 ) ( 2n - 1)
2 2
= 2n 2n (2n -1 ) ( 2n 3) ( 2n - 3)
2 2 2
= 2n 2n (2n -1 ) ( 2n 3) - - - - 1 |
2 2 2
( There are n terms in the series 1 . 3. 5.. . . . .(2n - 3)(2n - 1) )
For example consider 1.3.5.7 we have 4 terms i.e. 7 -1 + 1
i.e in general (n - 1)/2 + 1. 2
Therefore in the series 1 . 3. 5.. . . . .(2n - 3)(2n - 1) )
there are 2n 1 1 + 1 = n)
2
n 2n
= 2 1.3.5. ...... (2n - 1) ( | = )
2n
2n = 2n 1.3.5. ...... (2n - 1)
2n-2 = 2n - 1 2n - 2 |( n - ) (2)
From (1) and (2) we get
2n = 2 2 (n )
2n-2
2n = 2 2 (n ) 2n-2
which gives a recurrence relation for the even order central moments of the normal
distribution.
1.8.3.9 Mean deviation about the mean of the normal distribution
The central moment of the first order of a random variable X is called the mean deviation
(MD) about the mean X .i.e. E{| X E(X)| }
For the normal distribution N(, )
MD = |x - | 1 exp(-(x - )2/22) dx
- 2
= 1 | 2 t | exp(-t2) 2 dt
2 -
= 2 |t| exp(-t2) dt
-
= 22 |t| exp(-t2) dt (since |t| exp(-t2) is an even function. )
0
= 2 (exp(-t2))
0
= 2
Now Mx(t) = 1 + t ( + 2t / 2 ) + t 2 ( + 2t / 2 )2 + t 3 ( + 2t / 2 )3 + - - - - -
1! 2! 3!
E(X) = Coefficient of t / 1! =
E(X2) = Coefficient of t2 / 2! = 2 + 2
E(X3) = Coefficient of t3 / 3! = 3 2 + 3 and so on.
= etz (z)dz
-
= 1 exp(-z2/2) etz dz
- 2
= 1 [exp-(z2 2tz)/2] dz
- 2
= 1 [exp-[(z-t)2 t2)/2]] dz
- 2
Put u = z t ; du = dz/2
2
= 1 exp(t2/2)2 exp(-u2)du
2 -
= 1 exp(t2/2) | ( | = )
= exp(t2/2)
Note: If X has the distribution N(,) then Y = aX + b has the distribution N(a + b, a )
with mgf
= exp[( aii)t} exp( ai2i2t2/2 ) which is the mgf of normal random variable
with mean aii and variance ai2i2.
= - npt + nlog 1+ p t + p t2 + p t3 + - - - - -
npq npq 2npq 6 (npq)3/2
= - npt + n p t 1+ t + t2 + - - - - --
2 2 2
npq npq 2npq 6n p q
- 1 p2 t 2 1 + t + t2 + 2
+--
2 npq 2npq 6n2p2q2
which is the mgf of the standard normal distribution. Hence the limit of the standardized
binomial distribution, as n , is the standardized normal distribution.
Areas of application: Normal distribution is the most important continuous probability
distribution in the field of statistics. It describes many phenomena that occur in industry ,
in error calculations of experiments, statistical quality control, in nature like rainfall and
meteorological studies etc.
Example1: The marks obtained by a number of students in a certain subject are assumed
to be approximately normally distributed with mean 55 and SD(standard deviation) 5. If 5
students are taken at random from this set, what is the probability that 3 of them would
have scored marks above 60?
Solution:
If X represents the marks obtained by the students, X follows the distribution N(55, 5).
P(a student scores above 60)
Example 2: The mean and SD of a certain group of 1000 high school grades, that are
normally distributed are 78% and 11% respectively. Find how many grades were above
90%?
Solution:
Let X represents the no: of grades. Given = 78% = 0.78 & = 11% = 0.11
P(X>90%) =P(X>0.09) = P(0.09 < X < ])
= P[(0.09 0.78)/ 0.11 < (X )/ < ]
= P[1.090 < Z < ]
Example 3: The local authorities in a certain city install 10,000 electric lamps in the
streets of the city. If these lamps have an average life of 1000 burning hours with a SD of
200h, how many lamps might be expected to fail i) in the first 800 burning hours?ii)
between 800 and 1200 burning hours. After how many burning hours would you expect
iii)10% of the lamps to fail? iv) 10% of the lamps to be still burning? Assume that the
life of lamps is normally distributed.
Solution:
If X represents life of the electric lamps , X follows the distribution
N(1000, 200)
i)P(X < 800) =P(- <X < 800) = P(- < (X )/ < (800 1000)/200 )
= P(- < Z < -1)
=> P(0< Z < (1000 - x1)/200)) = 0.4 (search for the value nearer to 0.4 inside the table )
x1 = 744
x2 = 1256
Example 4: The marks obtained by the students in Mathematics, Physics and Chemistry
in an examination are normally distributed with the means 52, 50 and 48 and with
standard deviations 10,8 and 6 respectively. Find the probability that a student selected at
random has secured a total of i) 810 or above and ii) 135 or less.
Solution:
Let X,YZ Denote the marks obtained by students in mathematics, physics and chemistry
respectively.
X follows N(52,10), Y follows N(50,8) and Z follows N(48,6)
By the additive property of normal distribution U = X + Y+ Z follows the distribution
Example 5: In a normal population with mean 15 and SD 3.5, it is found that 647
observations exceeds 16.25. What is the total number of observations in the population?
Solution:
Let N be the no: of observation.
Substituting in (1)
N x 0.3594 = 647
N = 647 / 0.3594 = 1800
Example 6: In a normal distribution, 7% of the items arte under 35 and 89% are under
63. what are the mean and standard deviation of the distribution? What percentage of the
items are under 49?
Solution:
P(X < 35) = 0.07 P(X <63) = 0.89
P(- <X<35) = 0.07 P(- <X<63) = 0.89
P(- <(X )/ < (35- )/ ) = 0.07 P(- <(X )/ < (63 )/ ) = 0.89
P(- < Z < (35- )/ ) = 0.07 P(- < Z < (63 )/ ) = 0.89
{Let (35- )/ = a and (63 )/ = b.}
0.5 P(0 < Z < ( -35)/ ) = 0.07 0.5 + P((63- )/ < Z< 0 ) = 0.89
From 1 & 2
35 = -1.47
63 = 1.23
P (X <49) = 45.22%
Example 7: There are 400 students in the first year class of an engineering college. The
probability that any student requires a copy of a particular mathematics book from the
college library on any day is 0.1. How many copies of the book should be kept in t he
library so that the probability may be greater than 0.95 that one of the students requiring a
copy from the library has to come back disappointed?
(Use normal approximation to the binomial distribution)
Solution:
p = P(a student require book from) = 0.1
q = 0.9, n = 400
If X represents the number of students requiring the book, then X follows a binomial
distribution with mean = np = 400 x 0.1 = 40 and SD =npq = 6
Let x1be the required number of books, satisfying the given condition
i.e P(X < x1) >0.95
Example 8: If X and Y are independent random variables following N(8,2) and N(12,
43) respectively find the value of such that
P(2X Y 2) = P(X +2Y )
Solution:
Let A = 2X Y and B = X + 2Y
2X Y is N( 2x8 12, 4 x 4 + 1 x 48
Therefore = 8.67
Normal
Deviate .00 .01 .02 .03 .04 .05 .06 .07 .08 .09
z
0.0 .0000 .0040 .0080 .0120 .0160 .0199 .0239 .0279 .0319 .0359
0.1 .0398 .0438 .0478 .0517 .0557 .0596 .0636 .0675 .0714 .0753
0.2 .0793 .0832 .0871 .0910 .0948 .0987 .1026 .1064 .1103 .1141
0.3 .1179 .1217 .1255 .1293 .1331 .1368 .1406 .1443 .1480 .1517
0.4 .1554 .1591 .1628 .1664 .1700 .1736 .1772 .1808 .1844 .1879
0.5 .1915 .1950 .1985 .2019 .2054 .2088 .2123 .2157 .2190 .2224
0.6 .2257 .2291 .2324 .2357 .2389 .2422 .2454 .2486 .2518 .2549
0.7 .2580 .2612 .2642 .2673 .2704 .2734 .2764 .2794 .2823 .2852
0.8 .2881 .2910 .2939 .2967 .2995 .3023 .3051 .3078 .3106 .3133
0.9 .3159 .3186 .3212 .3238 .3264 .3289 .3315 .3340 .3365 .3389
1.0 .3413 .3438 .3461 .3485 .3508 .3531 .3554 .3577 .3599 .3621
1.1 .3643 .3665 .3686 .3708 .3729 .3749 .3770 .3790 .3810 .3830
1.2 .3849 .3869 .3888 .3907 .3925 .3944 .3962 .3980 .3997 .4015
1.3 .4032 .4049 .4066 .4082 .4099 .4115 .4131 .4147 .4162 .4177
1.4 .4192 .4207 .4222 .4236 .4251 .4265 .4279 .4292 .4306 .4319
1.5 .4332 .4345 .4357 .4370 .4382 .4394 .4406 .4418 .4429 .4441
1.6 .4452 .4463 .4474 .4484 .4495 .4505 .4515 .4525 .4535 .4545
1.7 .4554 .4564 .4573 .4582 .4591 .4599 .4608 .4616 .4625 .4633
1.8 .4641 .4649 .4656 .4664 .4671 .4678 .4686 .4693 .4699 .4706
1.9 .4713 .4719 .4726 .4732 .4738 .4744 .4750 .4756 .4761 .4767
2.0 .4772 .4778 .4783 .4788 .4793 .4798 .4803 .4808 .4812 .4817
2.1 .4821 .4826 .4830 .4834 .4838 .4842 .4846 .4850 .4854 .4857
2.2 .4861 .4864 .4868 .4871 .4875 .4878 .4881 .4884 .4887 .4890
2.3 .4893 .4896 .4898 .4901 .4904 .4906 .4909 .4911 .4913 .4916
2.4 .4918 .4920 .4922 .4925 .4927 .4929 .4931 .4932 .4934 .4936
2.5 .4938 .4940 .4941 .4943 .4945 .4946 .4948 .4949 .4851 .4952
2.6 .4953 .4955 .4956 .4957 .4959 .4960 .4961 .4962 .4963 .4964
2.7 .4965 .4966 .4967 .4968 .4969 .4970 .4971 .4982 .4973 .4984
2.8 .4974 .4975 .4976 .4977 .4977 .4978 .4979 .4979 .4980 .4981
2.9 .4981 .4982 .4982 .4983 .4984 .4984 .4985 .4985 .4986 .4986
3.0 .4865 .4987 .4987 .4988 .4988 .4989 .4989 .4989 .4990 .4990
4.0 .4997
Have you understood ?
(Answers:1.False,2.False,3.True)
Try yourself !
1.Electric trains on a certain line run every half hour between mid-night and six in
morning, What is the probability that a man entering that station at a random time
during this period will have to wait atleast twenty minutes.
(Solution: 1/3)
2. If X has uniform distribution in(-a, a), a>0 find a such that P(|X| < 1) = P(|X| >1)
(Solution: a = 2)
3. The length of the shower in a tropical island in rainy season has an exponential
distribution with parameter 2, time being measured in minutes. What is the probability
that it will last for atleast one more minute?
(Solution: 0.1353 )
4.The marks obtained by a number of students in a certain subject are approximately
normally distributed with mean 65 and standard deviation 5. If 3 students are selected at
random from this group, what is the probability that atleast 1 of them would have
scored above 75.
(Solution: 0.0667)
5.In an examination, a student is considered to have failed, secured second class, first
class and distinction, according as he scores less than 45%, between 45% and 60%,
between 60% and 75% and above 75% respectively. In a particular year 10% of the
students failed in examination and 5% of the students get distinction. Find the
percentage of students who have got first class and second class.
(Assume normal distribution of marks)
(Solution: 38, 47s)
1.9 FUNCTIONS OF RANDOM VARIABLES
1.9.1 Transformation of one dimensional random variable
Let X be a continous random variable with pdf fx ( X ). We determine the density
function of a new random variable Y = g (x ) where g is a function. Further let y = g (x)
be strictly monotonic function of x.
Solution:
Given Y = (X + |X| )
y = 0 when x < 0
y = x when x 0
Now, distribution function
FY(y) = P(Y y)
For y < 0, FY(y) = 0
For y 0 , FY(y) = P(Y y)
= P[(X y)/ X 0]
= P( 0 X y) = P( 0 X y)
P(X 0) 1 - P(X < 0)
= Fx(y) Fx(0)
1 - Fx(0)
Solution:
F Y(y) = P( Y y) =P(X2 y)
= P(y X y), if y 0
= Fx(y) Fx(y)
Differentiating with respect to y,
fY(y) = 1 .{fX(y) + fX(y) }, if y 0 (1)
2y
= 0, if y < 0.
It is given that X follows N(0,)
Therefore fX(x) = 1 . exp(-x2 / 22), <x<
2
using this value in (1) , we get
fY(y) = 1 . exp(-y / 22), y >0
2y
Solution:
fY(y) = (y +1) /4, 0<y<2
0, otherwise
REFERENCES:
1. T.Veerarajan, Probability, statistics and Random Process , Tata McGraw Hill, 2002.
2.P.Kandasamy, K. Thilagavathi and K. Gunavathi,Probability, Random Variables and
Random processors, S. Chand, 2003.