Lecture 13 NPTEL SAMPLING THEORY

Sampling Theory
MODULE IV
LECTURE - 13
STRATIFIED SAMPLING
DR. SHALABH
DEPARTMENT OF MATHEMATICS AND STATISTICS
INDIAN INSTITUTE OF TECHNOLOGY KANPUR
Implementation of interpenetrating subsamples in stratified sampling

Consider the set up of stratified sampling. Suppose that each stratum provides an independent
interpenetrating subsample. So based on each stratum, there are L independent interpenetrating subsamples
drawn according to same sampling scheme.
Let
Yij (tot ) be the unbiased estimator of total of
jth stratum based on the ith subsample, i = 1,2,...,L; j = 1,2,...,k.
An unbiased estimator of the jth stratum total is given by
1 J
Y j (tot ) = Yij (tot )

L i =1
and an unbiased estimator of the variance of
(Y )
=
Var
j ( tot )
Yij (tot ) is given by
L
1
(Yij ( tot ) Yj ( tot ) ) 2 .
L( L 1) i =1
Thus an unbiased estimator of population total
=
Ytot
Ytot is
k
1 L k
=
Y
Yij (tot )
j ( tot )
k
=j 1
=i 1 =j 1
and unbiased estimator of its variance is
j =1
L k
1
(Yij ( tot ) Yj ( tot ) ) 2 .
L( L 1)=i 1 =j 1
(Y =

Var
tot ) = Var (Y j ( tot ) )
Post Stratifications
Sometimes the stratum to which a unit belongs may be known after the field survey only. For example, the
age of persons, their educational qualifications etc. can not be known in advance. In such cases, we adopt the
post stratification procedure to increase the precision of the estimates.
In post stratification,
draw a sample by simple random sampling from the population and carry out the survey.
After the completion of survey, stratify the sampling units to increase the precision of the estimates.
Assume that the stratum size Ni is fairly accurately known. Let
mi : number of sampling units from ith stratum, i = 1,2,...,k.
k
m
i =1
= n.
Note that mi is a random variable (and that is why we are not using the symbol ni as earlier).
Assume n is large enough or the stratification is such that the probability that some mi =0 is negligibly
small. In case, mi =0 for some strata, two or more strata can be combined to make the sample size nonzero before evaluating the final estimates.
A post stratified estimator of population mean Y is

k
N y .
i =1
y post
1
=
N
Now
1 k
E ( y post ) = E N i E ( yi / m1 , m2 ,..., mk )
N i =1
1 k
E N i yi
N i =1
=Y
Var ( y post ) E Var ( y post | m1 , m2 ,..., mk ) + Var E ( y post | m1 , m2 ,..., mk )
=
k
1
1 2
= E wi2
Si + Var (Y )
i =1 mi N i
=
w
i =1
1
To find E
mi
2
i
1 1 2
E Si (Var (Y ) = 0).
mi N i
1
, proceed as follows:
Ni
Consider the estimate of ratio based on ratio method of estimation as

n
yj
Yj
y
Y
=
1
= j 1=
R=
R= = Nj
,
.
n
x
X
xj
Xj
=j 1 =j 1
We know that
N n RS X2 S XY
E ( R) R =
.
.
Nn
X2
Let
1 if j th unit belongs to i th stratum

xj =
0 otherwise
and
y j = 1 for all j = 1,2,...,N.
Then R, R and S x reduces to

2
=
R
n
=
n
xj i
j =1
n
j =1
N
=
R
N
=
N
xj i
j =1
N
j =1
N i2
Ni 2
1 N 2
1
1
2
S=
x j Nx =
Ni
N i N 2=
N 1 j =1
N
N
N
N
1
1
2
x
Ni N
1 N
1
N
=
x j y j Nx y=
0.
N 1 j =1
N
N
1
S xy=
Using these values in E ( R ) R, we have
n
E ( R ) =
R E
ni
N N ( N n)( N N i )
.
=
2
N
nN
N
(
1)
i
i
Thus
1 1
N ( N n)( N N i ) 1
N
E
=
+
n 2 N i2 ( N 1)
Ni
ni N i nN i
=
N 1
( N n) N
.
1 +
n( N 1) N i N i n n
Replacing mi in place of ni , we obtain
1 1
( N n) N
E =
mi N i n( N 1) N i
N 1
.
1 +
N
n
n
i
Now substitute this in the expression of Var ( y post ) as
=
Var ( y post )
w
i =1
2
i
1
E
mi
1 2
Si
Ni
N n N
2 2
w
S
.
i i
N
n
Ni
(
1)
i =1
N n k 2 2 1
wi Si w
n( N 1) i =1
i
N 1
+

1
nN i n
1
1

1 +
nwi n
N n k
1
wi Si2 n 1 +
2
n ( N 1) i =1
wi
N n k
= 2
(nwi + 1 wi ) Si2
n ( N 1) i =1
N n k
N n k
2
w
S
+
(1 wi ) Si2 .
i i
2
n( N 1) i 1 =
n ( N 1) i 1
=
=
Assuming N 1 N .
N n n 2 2 N n n
wi Si + 2 (1 wi ) Si2
Nn i 1 =
n N i1
=
V ( y post=
)
N n n
V prop ( yst ) +
=
(1 wi ) Si2 .
2
Nn i =1
The second term is the contribution in the variance of
y post
due to mi ' s not being proportionately
distributed.
If Si2 S w2 , say for all i, then the last term is

k
N n k
N n 2
2
w
S
S
k
wi 1)
=
(1
)
(
1)
(
Since
i
i
w
Nn 2 i 1 =
Nn 2
i 1
k 1 N n 2
=
Sw
n Nn
=
k 1
Var ( yst ).
n
The increase in variance over Varprop ( yst ) is small if the average sample size n =
reasonably large.
n
per stratum is
2
Thus a post stratification with a large sample produces an estimator which is almost as precise as an
estimator in stratified sampling with proportional allocation.

Lecture 13 NPTEL SAMPLING THEORY

Загружено:

Сведения о документе

Оригинальное название

Авторское право

Доступные форматы

Поделиться этим документом

Поделиться или встроить документ

Параметры публикации

Этот документ был вам полезен?

Это неприемлемый материал?

Авторское право:

Доступные форматы

Lecture 13 NPTEL SAMPLING THEORY

Загружено:

Авторское право:

Доступные форматы

Sampling Theory

Implementation of interpenetrating subsamples in stratified sampling

Yij (tot ) be the unbiased estimator of total of

jth stratum based on the ith subsample, i = 1,2,...,L; j = 1,2,...,k.

An unbiased estimator of the jth stratum total is given by

Y j (tot ) = Yij (tot )

Yij (tot ) is given by

Thus an unbiased estimator of population total

and unbiased estimator of its variance is

A post stratified estimator of population mean Y is

Consider the estimate of ratio based on ratio method of estimation as

1 if j th unit belongs to i th stratum

y j = 1 for all j = 1,2,...,N.

Then R, R and S x reduces to

Using these values in E ( R ) R, we have

Replacing mi in place of ni , we obtain

Now substitute this in the expression of Var ( y post ) as

due to mi ' s not being proportionately

If Si2 S w2 , say for all i, then the last term is

estimator in stratified sampling with proportional allocation.

Вам также может понравиться