Академический Документы
Профессиональный Документы
Культура Документы
MODULE IV
LECTURE - 13
STRATIFIED SAMPLING
DR. SHALABH
DEPARTMENT OF MATHEMATICS AND STATISTICS
INDIAN INSTITUTE OF TECHNOLOGY KANPUR
1 J
(Y )
=
Var
j ( tot )
L
1
(Yij ( tot ) Yj ( tot ) ) 2 .
L( L 1) i =1
=
Ytot
Ytot is
k
1 L k
=
Y
Yij (tot )
j ( tot )
k
=j 1
=i 1 =j 1
j =1
L k
1
(Yij ( tot ) Yj ( tot ) ) 2 .
L( L 1)=i 1 =j 1
(Y =
Var
tot ) = Var (Y j ( tot ) )
Post Stratifications
Sometimes the stratum to which a unit belongs may be known after the field survey only. For example, the
age of persons, their educational qualifications etc. can not be known in advance. In such cases, we adopt the
post stratification procedure to increase the precision of the estimates.
In post stratification,
draw a sample by simple random sampling from the population and carry out the survey.
After the completion of survey, stratify the sampling units to increase the precision of the estimates.
Assume that the stratum size Ni is fairly accurately known. Let
mi : number of sampling units from ith stratum, i = 1,2,...,k.
k
m
i =1
= n.
Note that mi is a random variable (and that is why we are not using the symbol ni as earlier).
Assume n is large enough or the stratification is such that the probability that some mi =0 is negligibly
small. In case, mi =0 for some strata, two or more strata can be combined to make the sample size nonzero before evaluating the final estimates.
N y .
i =1
y post
1
=
N
Now
1 k
E ( y post ) = E N i E ( yi / m1 , m2 ,..., mk )
N i =1
1 k
E N i yi
N i =1
=Y
Var ( y post ) E Var ( y post | m1 , m2 ,..., mk ) + Var E ( y post | m1 , m2 ,..., mk )
=
k
1
1 2
= E wi2
Si + Var (Y )
i =1 mi N i
=
w
i =1
1
To find E
mi
2
i
1 1 2
E Si (Var (Y ) = 0).
mi N i
1
, proceed as follows:
Ni
yj
Yj
y
Y
=
1
= j 1=
R=
R= = Nj
,
.
n
x
X
xj
Xj
=j 1 =j 1
We know that
N n RS X2 S XY
E ( R) R =
.
.
Nn
X2
Let
and
=
R
n
=
n
xj i
j =1
n
j =1
N
=
R
N
=
N
xj i
j =1
N
j =1
N i2
Ni 2
1 N 2
1
1
2
S=
x j Nx =
Ni
N i N 2=
N 1 j =1
N
N
N
N
1
1
2
x
Ni N
1 N
1
N
=
x j y j Nx y=
0.
N 1 j =1
N
N
1
S xy=
n
E ( R ) =
R E
ni
N N ( N n)( N N i )
.
=
2
N
nN
N
(
1)
i
i
Thus
1 1
N ( N n)( N N i ) 1
N
E
=
+
n 2 N i2 ( N 1)
Ni
ni N i nN i
=
N 1
( N n) N
.
1 +
n( N 1) N i N i n n
1 1
( N n) N
E =
mi N i n( N 1) N i
N 1
.
1 +
N
n
n
i
=
Var ( y post )
w
i =1
2
i
1
E
mi
1 2
Si
Ni
N n N
2 2
w
S
.
i i
N
n
Ni
(
1)
i =1
N n k 2 2 1
wi Si w
n( N 1) i =1
i
N 1
+
1
nN i n
1
1
1 +
nwi n
N n k
1
wi Si2 n 1 +
2
n ( N 1) i =1
wi
N n k
= 2
(nwi + 1 wi ) Si2
n ( N 1) i =1
N n k
N n k
2
w
S
+
(1 wi ) Si2 .
i i
2
n( N 1) i 1 =
n ( N 1) i 1
=
=
Assuming N 1 N .
N n n 2 2 N n n
wi Si + 2 (1 wi ) Si2
Nn i 1 =
n N i1
=
V ( y post=
)
N n n
V prop ( yst ) +
=
(1 wi ) Si2 .
2
Nn i =1
The second term is the contribution in the variance of
y post
distributed.
=
(1
)
(
1)
(
Since
i
i
w
Nn 2 i 1 =
Nn 2
i 1
k 1 N n 2
=
Sw
n Nn
=
k 1
Var ( yst ).
n
The increase in variance over Varprop ( yst ) is small if the average sample size n =
reasonably large.
n
per stratum is
2
Thus a post stratification with a large sample produces an estimator which is almost as precise as an