Chapter 5 Olc
Chapter 5 Olc
Chapter 5 Olc
Moments are a very useful statistical tool as they enable us to describe a set of data for its
central tendency, dispersion, skewness and kurtosis. Before discussing their usefulness, let us
define moments and look into their computational aspects. A moment may be defined as the
mean of the deviations of a variate from some value raised to a certain power.
Calculation of Moments
To calculate moments, the following steps are involved:
(i) The mean of the data is calculated, if moments from mean are to be computed. Else, we
consider the given value, A, about which moments are required to be calculated.
(ii) The deviations of each of the values from the mean (or A, whichever relevant) are taken
(iii) The deviations are raised to a certain power, r.
(iv) The powered deviations are added up and the total is divided by n, the number of values
in the data.
The moment calculated by using a given power r is referred to as the rth moment. To
illustrate, when the deviations are raised to power 3, the calculated moment is termed as the
third moment, and when the power taken is 4, the corresponding moment is called the fourth
moment. It may be mentioned that although moments can be calculated by raising deviations
to any power, we are concerned with only the first four moments.
It may be noted that since A can be assigned any value, innumerable four-moment value sets
can be derived for a given set of data. However, a significant value that A may be assigned is
zero. When A = 0, the calculated moments are known as moments about origin (or moments
about zero). The first moment about zero (or origin) is equal to mean.
The calculation of moments is given below. The inputs are shown in Table 1.
The moments of a set of values are calculated by taking deviations from X or some
value A, raising them to powers 1, 2, 3 and 4 and then dividing their respective totals by
n. This gives four moments. Measuring deviations from X yields central moments while
others are called moments about (arbitrary value) A. If A = 0, the moments are termed
moments about origin. The two sets of moments are, expectedly, interrelated.
Thus, if X > A, then d is positive and if X < A then d is negative. Further, if A = 0, then the
moments so obtained would be the moments about origin.
Example 2 (a) Using the moments about 6 calculated in Example 1, obtain central moments.
Verify that the values are the same as obtained earlier.
(b) From the central moments in (a), compute moments about (i) 6, and (ii) 10.
(a) The first four moments about 6 as calculated earlier are 2, 13, 64.25 and 404.5. Using
these values, the central moments are calculated here:
μ1=ν 1−ν1 =2−2=0
2 2
μ2=ν 2−ν 1 =13−2 =9
3 3
μ3=ν 3−3 ν 2 ν 1 +2 ν 1 =64.25−3 ×13 ×2+2 ×2 =2.25
2 4 2 4
μ4 =ν 4−4 ν 3 ν 1 +6 ν 2 ν1 −3 ν 1 =404.5−4 ×64.25 × 2+ 6 ×13 ×2 −3 × 2 =154.5
(b) Moments about 0 are 0, 9, 2.25, and 154.5, while mean = 8.
(i) For A = 6, d = 8 – 6 = 2. Thus,
ν1 =μ 1+ d=0+ 2=2
2 2
ν 2=μ2+ d =9+2 =13
3 3
ν3 =μ3 +3 μ 2 d + d =2.25+3 ×9 × 2+ 2 =64.25
2 4 2 4
ν 4=μ 4 +4 μ 3 d +6 μ2 d +d =154.5+4 × 2.25 ×2+6 × 9× 2 + 2 =404.5
(ii) For A = 10, d = 8 – 6 = −2. Accordingly,
ν1 =μ 1+ d=0−2=−2
ν 2=μ2+ d 2=9+ (−2 )2=13
3 3
ν3 =μ3 +3 μ 2 d + d =2.25+3 ×9 × (−2 )+ (−2 ) =−59.75
2 4 2 4
ν 4=μ 4 +4 μ 3 d +6 μ2 d +d =154.5+4 × 2.25 × (−2 ) +6 × 9× (−2 ) + (−2 ) =368.5
Now,
'
Σf d −105
ν1 = ×C= × 10=−4.57
Σf 230
Σ f d'2 733
ν 2= ×C 2= ×10 2=318.70
Σf 230
'3
Σf d 3 −1,155 3
ν3 = ×C = × 10 =−5,021.74
Σf 230
Σ f d'4 4 6,145 4
ν4= ×C = ×10 =2,67,173.91
Σf 230
(b) Central moments are shown calculated here:
μ1=ν 1−ν1 =−4.57−(−4.57 )=0
μ2=ν 2−ν 12=318.70−(−4.57 )2=297.82
3 3
μ3=ν 3−3 ν 2 ν 1 +2 ν 1 =−5,021.74−3 × 318.70× (−4.57 )+2 × (−4.57 ) =−843.25
2 4
μ4 =ν 4−4 ν 3 ν 1 +6 ν 2 ν1 −3 ν 1
2 4
¿ 2,67,173.91−4 × (−5,021.74 )(−4.57 ) +6 × 318.70× (−4.57 ) −3 × (−4.57 ) =2,14,004.10
(c) Sheppard’s corrections applied to the second and fourth moments are shown here:
1 1
^μ2=μ 2− i 2=297.82− ×102=289.48
12 12
1 2 7 4 1 2 7 4
^μ4 =μ4 − μ2 i + i =2,14,004.10− ×297.82 ×10 + ×10 =1,99,405
2 240 2 240
Significance of Moments
The first four moments are very useful in a study of the different aspects of the formation of a
frequency distribution, namely central tendency, dispersion, skewness and kurtosis. As
already indicated, we have first moment about origin equal to arithmetic mean and the second
moment about mean equal to variance. These moments help us, thus, to have an idea about
the central tendency and the scatter of observations about the central tendency. The moments
are profitably employed to study also the shape of the distribution in terms of the skewness
and kurtosis. The skewness and kurtosis are studied with the help of beta and gamma
coefficients. The gamma coefficients are referred to as g-statistics.
It is essentially an index of skewness. Its value is equal to zero for a symmetrical distribution
and the greater its value the higher the extent of skewness in the distribution. It may be
observed that this coefficient cannot assume a negative value because neither the numerator
nor the denominator can be a negative quantity.
A measure involving the fourth and the second moments, for relative kurtosis in a distribution
is β2. We have,
μ
β 2= 42
μ2
For a mesokurtic distribution, β2 = 3, while for a leptokurtic distribution β2 > 3 and for a
platykurtic distribution β2 < 3. Obviously, the higher the value of β2, the more peaked is the
curve of the distribution and the smaller its value the more flat-topped is the curve.
Example 4 Using the central moments calculated in Example 3, calculate the beta and the
gamma coefficients. Comment on the results. For your calculations, use the corrected values of
the moments.
The four moments obtained earlier are 0, 289.48, −843.25 and 1,99,405. On the basis of
these, we have
Beta coefficients:
μ 32 (−843.25 )2
β 1= 3 = 3
=0.029
μ2 ( 289.48 )
μ4 1,99,405
β 2= 2 = =2.380
μ 2 ( 289.48 )2
Gamma coefficients:
μ −843.25
g1= 33 = =−0.171
σ √ ( 289.48 )3
μ4 1,99,405
g2= 2 −3= −2.380=−0.620
μ2 ( 289.48 )2
On the basis of the results obtained, it may be observed that the given distribution is slightly
negatively skewed and it is platykurtic since β2 < 3 (alternately, g2 < 0)
Example 5 For a distribution, the first four moments about 20 are 8; 212; 1,510 and 28,214
respectively. Calculate moments about 30 for this distribution.
This will be done by first converting the given moments into central moments and there from
to the required moments.
Step 1: Conversion into central moments
μ1=ν 1−ν1 = 8 – 8 = 0
Example 7 For a normal distribution, the first moment about 12 is 28, and the fourth
moment about 40 is equal to 1,875. Find the coefficient of variation of the distribution.
We know, X =A + ν 1
Thus, here X =28+12=40
Now, since fourth moment about 40 is given, it is the value of fourth central moment.
Further,
μ4
β 2= 2
μ2
With the value of β2 = 3 for a normal distribution, we have
1875
3= 2
μ2
2
Or, μ2 =1875 /3=625
Thus, µ2 or variance = 25, and standard deviation = 5.
σ 5
Co−efficient of variation= ×100= ×100=12.5 %
X 40
μ1=
∑ fx =0
∑f
μ2=
∑ fx2 = 1,500 =15
∑ f 100
μ3 =
∑ 3
fx 11,000
= =110
∑ f 100
μ4 =
∑ 4
fx 67,500
= =675
∑ f 100
These are central moments since the first moment is equal to zero. Now,
μ4 675
β 2= 2 = 2 =3
μ 2 15
The distribution is mesokurtic but is not symmetric because µ 3 ≠ 0. A normal distribution is
both symmetric and mesokurtic. Hence the given calculations are not in respect of a normal
distribution.
Example 9 The calculation of moments of a set of 50 values yielded the following results:
Mean = 16, standard deviation = 7, and β1 = 0.6196 with a positive µ3.
It was later on discovered that one observation 12 was recorded as 22. Find the corrected
values of the first three moments.
∴ μ 3=270
But, since one observation was recorded incorrectly, it follows that the moments calculated
are no longer central moments. Instead, they are moments about A = 16.
Let us first find corrected mean.
ΣX = 50 ×16 = 800
Corrected ΣX = 800 + 12 – 22 = 790
Corrected X =790/50=15.8
Now, before we calculate moments about 15.8, we first correct moments about 16 calculated
above for mistake made.
Incorrect ∑ x
2
Incorrect ν 2=
n
Incorrect ∑ x
2
49=
50
Incorrect Σx = 49 × 50 = 2,450
2
2430
Corrected ν 2= =48.6
50
Similarly,
Incorrect ∑ x
3
Incorrect ν 3=
n
Incorrect ∑ x
3
270=
50
Incorrect ∑x = 270 × 50 = 13,500
3
EXERCISES
1. Calculate (a) central moments, (b) moments about A = 6, and (c) moments about origin for the
following data:
9, 4, 6, 9, 11, 13, 8, 4
2. Calculate four central moments for the following distribution:
Class
10 – 20 20 – 30 30 – 40 40 – 50 50 – 60 60 – 70
Interval:
Frequency: 8 12 15 8 5 2
3. Given, n = 10, ∑(X – 10) = – 40, ∑(X – 10) = 2,870 and ∑(X – 10) = – 480. Calculate the
2 3
following:
(a) Arithmetic mean
(b) Variance
(c) Third moment about 10, and
(d) A measure of relative skewness based on moments
4. For the following distribution, calculate first four (i) moments about 45, and (ii) central
moments. Also, comment on the shape of the distribution using gamma statistics.
Class
10 – 20 20 – 30 30 – 40 40 – 50 50 – 60 60 – 70 70 – 80
Interval:
Frequency: 2 5 12 20 8 2 1
5. Given the following distribution:
Class
10 – 20 20 – 30 30 – 40 40 – 50 50 – 60 60 – 70 70 – 80
Interval:
Frequency: 2 6 12 20 12 6 2
(a) Calculate mean, median and mode and hence find the value of Karl Pearson’s coefficient of
skewness.
(b) Calculate the first four central moments and the beta coefficient of skewness.
(c) Comment on the results.
6. The first four moments of a distribution are 0, 5, 0, and 75 respectively. Comment on the shape
of the distribution.
7. The first four moments of a distribution are 2, 20, 40, and 500 respectively. Comment on the
shape of the distribution.
8. The first four moments of a distribution about 12 are: –8, 210, –42 and 5216 respectively. You
are required to
(a) Calculate moments about origin.
(b) Calculate the two gamma coefficients and comment upon the shape of the distribution.
9. The first four moments of a distribution are calculated as 6; 235; 1,248; and 24,680. You are
required to examine the skewness and kurtosis of the distribution.
10. From the following information about two distributions, find which is more skewed?
Distribution Second Moment Third Moment
A 16 –15.7
B 40 25.8
11. For a mesokurtic distribution, the fourth central moment is 768. Obtain its standard deviation.
12. The first four moments of a distribution about the value 4 of the variable are –1.5, 17, ––30 and
108. Its mean is given to be 2.5. You are required to
(a) Calculate the central moments and moments about origin.
(b) Determine the coefficient of variation and variance of the distribution.
(c) Examine the shape of the distribution.
13. A distribution has standard deviation equal to 5. For what values of the fourth central moment
will this distribution be (a) mesokurtic, (b) platykurtic, and (c) leptokurtic?
14. For a mesokurtic distribution, coefficient of variation = 40% and arithmetic mean = 40. Find the
value of its fourth central moment.
15. If variance = 42, then what values of µ4 would make a distribution (i) mesokurtic, (ii) platykurtic,
and (iii) leptokurtic?
16. For a set of 20 observations, mean and variance were calculated as 9 and 18 respectively. Later,
it was discovered that one observation 12 was misread as 21. Rectify the error and obtain the
corrected values of mean and variance. Also, calculate the values of the first and second
moments about origin.
17. The first two moments about 40 for a set of 25 values were calculated as equal to 65 and 2,985
respectively. Test if the calculations are consistent.
18. For a mesokurtic distribution, it is known that the first moment about 32 is 28 while the fourth
moment about 60 is 62,208. Determine the values of mean and coefficient of variation.
19. For a normal distribution, the first moment about 8 is 22 and the fourth moment about 30 is 243.
Find the coefficient of variation of the distribution.
20. The following are the calculations for a frequency distribution:
∑f = 520; ∑f(X – 56) = 0; ∑f(X – 56)2 = 143,000; ∑f(X – 56)3 = –28,750
∑f(X – 56) = 78,650,000
4
Class interval width = 10
You are required to calculate the four central moments and use Sheppard’s corrections.
ANSWERS TO EXERCISES
1. (a) 0, 9, 2.25, 154.5 (b) 2, 13, 64.25, 404.5 (c) 10. Distribution A
8, 73, 730.25, 7778.5 11. 4
2. 0; 179.4; 951; 80,354 12. (a) 0; 14.75; 39.75; 142.3125; 2.5; 21; 166;
3. (a) 6 (b) 271 (c) −48 (d) g1 = 0.733 1,132 (b) 183.3%; 21 (c) g1 = 0.702 g2 = −2.35
4. (i) −2.6; 150; −1,100; 75,000 (ii0 0; 143.24; 13. (a) 1,875 (b) <1,875 (c) >1,875
34.848; 69,506.9; g1 = 0.0203 g2 = 0.388 14. 1,96,608
5. (a) 45, 45, 45; 0 (b) 0; 180; 0; 90,000; 15. (i) 5,292 (ii) <5,292 (c) >5,292
β1 = 0 16. 8.55; 11.05; 8.55; 84.15
6. Symmetrical, mesokurtic 17. No, μ2 < 0
7. β1 = 1, β2 = 2.39 18. 60; 20%
8. (a) 4; 162; 5,790; 150,080 (b) g1 = 2.25 19. 10%
g2 = 0.388 20. 0; 275; −55.3; 1,51,250 Corrected: μ2 = 266.7
9. Low positive skewness, Platykurtic μ4 = 137,791.7
CHAPTER SUMMARY
Skewness and kurtosis are both related to shape of the frequency curve.
Skewness means lack of symmetry, which implies that the mean, median and mode
are unequal in such a case. Skewness is positive when its longer tail is to the right and
negative when it is on the left.
There are three measures of skewness, given by Karl Pearson which is based on
averages and standard deviation; by Bowley which uses median and quartiles; and by
Kelly, based on median and the tenth and ninetieth percentiles.
Kurtosis refers to relative height of the frequency curve. Distributions can be
mesokurtic, leptokurtic and platykurtic on this basis.
Five-point summary, comprising lowest and highest values, median and two quartiles
is often used to learn about a given distribution. This information is represented by a
box and whisker plot.
Moments provide a useful method to study various characteristics of a set of data.
Moments calculated about mean are called central moments. There can also be
moments about any given value A. When A = 0, moments calculated are called
moments about origin.
It is possible to convert moments about A in to central moments and vice-versa.
The first moment about zero is equal to mean and the first moment about mean is
equal to zero.
The second central moment is the variance of the distribution.
The beta and g-statistics are calculated with central moments and are used to learn
about skewness and kurtosis. β1 and g1 are measures of skewness while β2 and g2
measure kurtosis.
If the third central moment is positive, the skewness is positive while if it is negative,
the skewness is negative. For a symmetrical distribution, it is equal to zero. β1 is
always positive but greater its value, more the skewness.
If β2 = 3, the distribution is mesokurtic; if β 2 is less than 3, the distribution is
platykurtic while if it is greater than 3, the distribution in question is leptokurtic.
MULTIPLE CHOICE QUESTIONS
1. For a leptokurtic distribution, the standard deviation is equal to 5. What should be the
value of its fourth central moment?
(a) 1875
(b) greater than 1875
(c) no more than 1875
(d) at least 1875
9. What is the standard deviation of a distribution which has µ 4 = 12,288 and known to
be other than platykurtic or leptokurtic?
(a) It cannot be calculated.
(b) 3
(c) 192
(d) 8
16. If each value in a given set of data is divided by 3, how will it affect the Pearson’s
coefficient of skewness?
(a) It will reduce the value of the coefficient to one-thirds.
(b) It will increase the value of the coefficient to three times.
(c) It will not affect the coefficient of skewness value.
(d) Nothing can be said about the change.