Problem Set 2
Problem Set 2
Problem Set 2
30 lakhs (as was targeted) or less than 30 lakhs. One person from the team collects a
sample of same size 10. A second person from the team selects another sample of the
same size. The first person rejects the null if her sample mean is less than K, and the
second person rejects the null if her sample mean is less thanL. The team leader finds
thatK > L.
a) Who is testing at a lower significance level? Why?
b) Who has a higher chance of committing type II error? Why?
c) What should the person, testing with a higher significance level, do to reduce her
significance level to that of the other keeping the decision rule same? Why?
d) Suppose α =0.05 . If type I error occurs, the company spends 5 lakhs per month on
advertisement for increasing the average sales. On the other hand, if type II error
occurs with maximum probability, the company loses ₹ 100,000. Which type of error
is more important for the company? Why?
1-a If it is known that the standard deviation of monthly income of the employees of
IIMN is Rs.5000, then what is the probability that a random sample of 25 employees will
yield an average monthly income, which is within Rs.500 around the actual average
monthly income?
a. If Rahul wishes to have a 2.5% chance of not establishing the portal when the actual
average revenue is Rs.3000, what sample size should be selected?
b. If Rahul decides that a random sample of 20 candidates should be surveyed, compute
the probability of establishing the portal when the actual average revenue is Rs.3000
per candidate.
c. Discuss the difference in your results in a) and b).
d. A sample of 25 candidates is actually surveyed yielding a sample average revenue of
Rs.2825 with a sample standard deviation of Rs.675. Construct a 95% confidence interval
for i. The population average revenue, ii. The population standard deviation of revenue.
7 Sketch the cumulative distribution function curve ofZ, the standard Normal
random variable.
10 The daily return of a stock varies uniformly from -2% to 3%.What is the
expected value of the squared return? Sketch the return density.
11 Daily return of a stock is normally distributed. The return takes the value of at
least 1.78% once in 5 days and of at least 1.65% in 7 out of 10 days.
a) Sketch the above information using the graph of the return density.
b) What are the mean and standard deviation of the return?
c) Find the probability that on a given day the return will be exactly 2.5%?
12 Suppose that the amount of time Anuja spends in a bank is exponentially
distributed with mean 10 minutes.
a) What is the probability that Anuja will spend more than 15 minutes in the
bank given that she has already spent 10 minutes in the bank?
b) What is the probability that Anuja will spend more than 20 minutes in the
bank given that she has already spent 15 minutes in the bank?
c) What is the probability that Anuja will spend more than 25 minutes in the
bank given that she has already spent 19 minutes in the bank?
d) Can you make some comment from the above three results?
e) If you pick 5 randomly selected Anuja’s visits to the bank, what is the
probability that she never waited for more than 10 minutes?
14 Let X be the time between two calls received at a health-care provider. It is given that the
mean time between calls is 12 minutes.
a) Find the probability that the first call will be within 5 to 10 minutes?
b) What is the probability that the time between tenth and eleventh call will exceed 2
minutes?
15 In a 5-star Hotel in Bangalore, on an average, 4 ISD calls are received every 2 hours.
It is given that on a particular day 30 ISD calls have been received.
a) What is the probability that a total of 6 ISD calls have been received between 9
p.m. to 11 p.m.?
16 A manufacturer produces iron rod of length L meters, where L is distributed uniformly over the
interval [15, 30]. After producing the iron rod he will send those rods to his supplier by pick-up trucks
each of which can carry exactly 20 iron rods.
Find the probability that each of 3 randomly selected pick-up trucks will contain at least 8 iron rods of
length more than 24 meters?
15. A group of students (consisting of PGPIs and PGPIIs) of one of the IIMs had
undertaken a study on Detergent Powder. The study was sponsored by Gamble and
Gamble, in a Marketing Fair, held in November, 1998. The main objective of the
study was to understand the “Brand Images” of a number of brands of Detergent
Powder. The following seven brands were included in the project:
Through several Focus Group Discussions (FGDs), the Study Team identified 19 image
parameters, which were reduced to set of eight image parameters (through a technique called
Factor Analysis). The eight image parameters thus identified are given below:
1. Innovative, 2. Indianness, 3. Gentle, 4. Young and Energetic
5. Trustworthy and Caring, 6. Versatile, 7. Powerful and efficient, and 8. Modern.
A game was designed to collect data from the female respondents only. Data on a large
number of demographic variables such as, age, education, occupation, marital status,
monthly income etc. were collected for each respondent. Each respondent was asked to
select three brands (two used by her, and one not used but she was aware of) from the set
of seven brands. Then she was asked to rate each brand on a 5-point scale (giving a rating of
l if she thought that the particular image parameter was least, and 5 if she perceived it to be
highest in the brand) on each of the eight image parameters.
Two brands, namely, Nirma and Wheel had to be discarded from analysis because of
inadequate responses on these two brands.
A measure of “Image” of a brand, as perceived by a respondent, was obtained by averaging
the ratings across all eight “Image Parameters” by the respondent for the brand. For each
respondent, one of the rated brands was then used for understanding “Image” of the brand.
The Study Group wanted to find out whether there was significant difference in “Image”
across the brands; in case there was, which brand had the “Best Image”, what score could
reasonably indicate the “Best Image” brand, and so forth. Unfortunately, the PGPIIs had
forgotten all that they had learnt in QM courses in the first year, while for PGPIs, the QM
courses had just started then. So, though, the team framed a set of questions but they
remained unanswered. Now that you have just finished QMM, can you answer following
questions with the help of the data given in Tables 1, 2 and 3?
The Project team was particularly concerned about the assumption of normality of the
ratings. Checking this assumption for each brand was considered to be time consuming and
unnecessary. The members of the team argued that if the assumption was true for one
brand, it must also be true for others. Therefore, it was decided to test the assumption for
only one of the brands.
a. Carry out a suitable test to test the normality of the Image scores using the data in Table 1
for Ariel Compact.
b. Assuming normality of the image scores for all brands, test if there is a significant difference
in the average “Image” scores among the five brands.
c. If the assumption of normality of image scores is valid, test if there is a significant
difference in the variance of “Image” scores between the first two brands.
d. If the assumption of normality is not valid, test if there is any significant difference
between the distributions of the last two brands.
e. Determine which brand has the “Best Image” on the basis of highest average image
score.
f. Calculate a 95% confidence interval for average image score for the “Best Image”
brand.
g. Assuming equality of variance, calculate a 90% confidence interval for the common
2 2
variance of image scores. ( χ 0 .95; 95 = 73.3050, χ 0 .05; 95 = 118.5232)
h. After the Marketing Fair, when the team sat down to do the data analysis, it figured
out that each respondent had given highest rating to exactly one of the three brands
she had rated. One of the members of the group argued that the choice of “Best
Image” brand must be related to the respondent’s exposure level to the
advertisement of the product. But as they did not collect any information on
respondents’ exposure to advertisement, another member suggested that the
selection of “Best Image” brand might be related to her income. Table 3 was thus
formed using the information of 212 responses. Test if this is true at 5% level of
significance.
i. Ariel Supersoaker was considered to be a competitor of Surf with Wash Booster. The
study team was thus interested in finding out if there was a significant difference in
the market share of these two brands. The first 100 respondents were asked if they
used Ariel Supersoaker; 27 of them responded in the affirmative. While out of the
next hundred, 32 respondents said they used Surf with Wash Booster. What can you
conclude (use p-value)?
16. Three researchers have collected household income data from three districts, A, B,
and C, independently and individually. They wish to investigate the income
distributions in these districts. The summary of their sample data are given below.
a) The first researcher takes a sample of size 9 from district A. The upper limit of
95% C.I. of the population variance of income is 29.10. The 95% C.I. of the
population mean income is (17.84, 22.16).
b) The second researcher takes a sample from district B such that the expected value
of (n2 -1) (S22/σ 22) is 7 and the midpoint of the 95% C.I. of the population mean is
22 and length of the C.I. is 7.80.
c) The third researcher takes a sample from district C such that the expected value of
(n3 -1) (S23/σ 32) is 10 and the midpoint of the 95% C.I. of the population mean is 23.
From the above information (and clearly stating the assumptions which are needed)
i. Verify whether the mean income in the above three districts are equal or
not.
ii. Verify if the income variabilities in the districts AandB are equal or not.
iii. Give a 95% C.I. for the difference of mean incomes in the districts Aand
C.
18 The probability of detecting a disease in X- ray examination of a person suffering from the
disease is 3/5. The probability of diagnosing a healthy person as suffering from the disease is
1/5. 30% of the whole population is suffering from the disease. Find the probability that a
person is healthy, if after examination the person is diagnosed as suffering from the disease.
19. After the correction of quiz 1 answer books, it is found that 50% of the PGP 1 students
got more than 60% marks in DS1 paper. A random sample of twenty students is selected.
(Exact numerical answer is not required. Only steps and mathematical formulation will be
enough)
a) What is the expected number of students in that group who scored more than 60% marks?
b) What is the probability that at least 10 students in the selected sample got more than 60%
marks?
c) Can you plot a rough probability distribution of marks of that group?
20 A business office orders paper supplies from one of three vendors,V 1 ,V 2, orV 3.
Orders are to be placed on two successive days, one order per day. Thus, (V 2 ,V 3)
might denote that vendor V 2 gets the order on the first day and vendor V 3 gets the
order on the second day.
a) List the sample points in this experiment of ordering paper on two successive
days.