Академический Документы
Профессиональный Документы
Культура Документы
Measures of Dispersion
2015-16
MEASURES OF DISPERSION 75
2015-16
76 STATISTICS FOR ECONOMICS
Activity
Calculate Range and Q.D. of the
following observations:
• Collect data about 52-week high/
20, 25, 29, 30, 35, 39, 41,
low of shares of 10 companies
from a newspaper. Calculate the
48, 51, 60 and 70
range of share prices. Which Range is clearly 70 – 20 = 50
company’s share is most volatile For Q.D., we need to calculate
and which is the most stable? values of Q3 and Q1.
2015-16
MEASURES OF DISPERSION 77
2015-16
78 STATISTICS FOR ECONOMICS
value; i.e., 30th value, which lies in to rich and poor, from the median of
class 40–60. Now using the formula the entire group.
for Q3, its value can be calculated as Quartile Deviation can generally be
follows: calculated for open-ended distribu-
3n tions and is not unduly affected by
- c.f. extreme values.
Q3 = L + 4 i
f
3. MEASURES OF D ISPERSION FROM
30 - 29 AVERAGE
Q3 = 40 + 20
7 Recall that dispersion was defined as
Q 3 = 42.87 the extent to which values differ from
their average. Range and Quartile
42.87 - 16.25 Deviation are not useful in measuring,
Q.D. = = 13.31
2 how far the values are, from their
In individual and discrete series, Q 1 average. Yet, by calculating the spread
of values, they do give a good idea
n +1 about the dispersion. Two measures
is the size of th value, but in a
4 which are based upon deviation of the
continuous distribution, it is the size values from their average are Mean
n Deviation and Standard Deviation.
of th value. Similarly, for Q3 and Since the average is a central
4
median also, n is used in place of
value, some deviations are positive
n+1. and some are negative. If these are
added as they are, the sum will not
reveal anything. In fact, the sum of
If the entire group is divided into
two equal halves and the median deviations from Arithmetic Mean is
calculated for each half, you will have always zero. Look at the following two
the median of better students and the sets of values.
median of weak students. These Set A : 5, 9, 16
medians differ from the median of the Set B : 1, 9, 20
entire group by 13.31 on an average.
Similarly, suppose you have data You can see that values in Set B
about incomes of people of a town. are farther from the average and hence
Median income of all people can be more dispersed than values in Set A.
calculated. Now if all people are Calculate the deviations fro m
divided into two equal groups of rich Arithmetic Mean and sum them up.
and poor, medians of both groups can What do you notice? Repeat the same
be calculated. Quartile Deviation will with Median. Can you comment upon
tell you the average difference between the quantum of variation from the
medians of these two groups belonging calculated values?
2015-16
MEASURES OF DISPERSION 79
2015-16
80 STATISTICS FOR ECONOMICS
2015-16
MEASURES OF DISPERSION 81
M. D. from Median is thus, (iii) Multiply each |d| value with its
corresponding frequency to get f|d|
Σ | d | 11
M.D.( median ) = = = 2 .2 values. Sum them up to get Σ f|d|.
n 5
(iv) Apply the following formula,
Short-cut method Σf | d |
M.D. ( x ) =
To calculate Mean Deviation by short Σf
cut method, a value (A) is used to Mean Deviation of the distribution
calculate the deviations and the in Table 6.2 can be calculated as
following formula is applied. follows:
M.D.(M edian )
Example 6
Σ |d| +(Median − A )(Σ fB − Σf A )
=
n C.I. f m.p. |d| f|d|
2015-16
82 STATISTICS FOR ECONOMICS
2015-16
MEASURES OF DISPERSION 83
Σx 2
Example 9 σ= − ( x)2
n
X d (x-Ax̄ ) d2
4150
5 –20 400 or σ = − (24 )2
10 –15 225 5
25 0 0
30 +5 25 or σ = 254 = 15 .937
50 +25 625
Standard Deviation is not affected
–5 1275 by the value of the constant fr om
which deviations are calculated. The
Formula for Standard Deviation value of the constant does not figure
2
in the standard deviation formula.
Σd 2 Σ d Thus, Standar d Deviation is
σ= −
n n Independent of Origin.
2
1275 −5 Step-deviation Method
σ= − = 254 = 15 .937
5 5 If the values are divisible by a common
factor, they can be so divided and
Note that the sum of deviations
from a value other than actual
standard deviation can be calculated
mean will not be equal to zero. from the resultant values as follows:
Example 11
Direct Method
Since all the five values are divisible
Standard Deviation can also be by a common factor 5, we divide and
calculated from the values directly, get the following values:
i.e., without taking deviations, as
x x' d (x'-x̄') d2
shown below:
5 1 –3.8 14.44
Example 10 10 2 –2.8 7.84
25 5 +0.2 0.04
2 30 6 +1.2 1.44
X X
50 10 +5.2 27.04
5 25
10 100 0 50.80
25 625 (Steps in the calculation are same
30 900
50 2500 as in actual mean method).
The following formula is used to
120 4150
calculate standard deviation:
2015-16
84 STATISTICS FOR ECONOMICS
2015-16
MEASURES OF DISPERSION 85
2015-16
86 STATISTICS FOR ECONOMICS
above example. The values so Suppose the values in Set A are the
obtained are ‘d'’ values (Col. 5). daily sales recorded by an ice-cream
3. Multiply ‘d'’ values with vendor, while Set B has the daily sales
corresponding ‘f'’ values (Col. 2) to of a big departmental store. Range for
obtain ‘fd'’ values (Col. 6). Set A is 500 whereas for Set B, it is
30,000. The value of Range is much
4. Multiply ‘fd'’ values with ‘d'’ values
higher in Set B. Can you say that the
to get ‘fd'2’ values (Col. 7)
variation in sales is higher for the
5. Sum up values in Col. 6 and Col. 7 departmental store? It can be easily
to get Σ fd' and Σ fd'2 values. observed that the highest value in Set
6. Apply the following formula. A is double the smallest value, whereas
2
for the Set B, it is only 30% higher.
Σfd ′2 Σfd ′
σ = − ×c Thus absolute measures may give
Σf Σf misleading ideas about the extent of
2
variation specially when the averages
472 4 differ significantly.
or σ = − ×5
40 40 Another weakness of absolute
or σ = 11.8 − 0.01 × 5 measures is that they give the answer
in the units in which original values
or σ = 11.79 × 5
are expressed. Consequently, if the
σ = 17.168 values are expressed in kilometers, the
dispersion will also be in kilometers.
Standard Deviation: Comments However, if the same values are
Standard Deviation, the most widely expressed in meters, an absolute
used measure of dispersion, is based measure will give the answer in meters
on all values. Therefore a change in and the value of dispersion will appear
even one value affects the value of to be 1000 times.
standard deviation. It is independent To over come these problems,
of origin but not of scale. It is also
relative measures of dispersion can be
useful in certain advanced statistical
problems.
used. Each absolute measure has a
relative counterpart. Thus, for Range,
there is Coefficient of Range which is
5. ABSOLUTE AND RELATIVE M EASURES calculated as follows:
OF DISPERSION
L −S
All the measures, described so far, are Coefficient of Range =
L +S
absolute measures of dispersion. They where L = Largest value
calculate a value which, at times, is S = Smallest value
difficult to interpret. For example,
consider the following two data sets: Similarly, for Quartile Deviation, it
Set A 500 700 1000
is Coefficient of Quartile Deviation
Set B 1,00,000 1,20,000 1,30,000 which can be calculated as follows:
2015-16
MEASURES OF DISPERSION 87
2015-16
88 STATISTICS FOR ECONOMICS
2015-16
MEASURES OF DISPERSION 89
Recap
• A measur e of dispersion improves our understanding about the
behaviour of an economic variable.
• Range and Quartile Deviation are based upon the spread of values.
• M.D. and S.D. ar e based upon deviations of values from the average.
• Measures of dispersion could be Absolute or Relative.
• Absolute measures give the answer in the units in which data are
expressed.
• Relative measures are free from these units, and consequently
can be used to compare different variables.
• A graphic method, which estimates the dispersion from shape
of a curve, is called Lorenz Curve.
EXERCISES
2015-16
90 STATISTICS FOR ECONOMICS
2015-16